Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models...
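A minimal sketch of the kind of LoRA setup this covers, using Hugging Face PEFT (the model name and hyperparameters below are illustrative):

    # LoRA adapters with Hugging Face PEFT (illustrative values).
    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, get_peft_model

    model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
    lora_config = LoraConfig(
        r=16,                                  # rank of the low-rank update
        lora_alpha=32,                         # scaling factor
        target_modules=["q_proj", "v_proj"],   # attention projections to adapt
        lora_dropout=0.05,
        task_type="CAUSAL_LM",
    )
    model = get_peft_model(model, lora_config)
    model.print_trainable_parameters()         # typically well under 1% trainable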
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models...
RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel),...
Expert guidance for fine-tuning LLMs with Axolotl - YAML configs, 100+ models, LoRA/QLoRA, DPO/KTO/ORPO/GRPO,...
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM...
Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training.
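A hedged sketch of what a GRPO run looks like in TRL (the dataset and reward function here are toy placeholders):

    # GRPO with TRL: a toy reward scored over sampled completions.
    from datasets import load_dataset
    from trl import GRPOConfig, GRPOTrainer

    def reward_len(completions, **kwargs):
        # Toy reward: prefer completions close to 200 characters.
        return [-abs(200 - len(c)) for c in completions]

    trainer = GRPOTrainer(
        model="Qwen/Qwen2-0.5B-Instruct",
        reward_funcs=reward_len,
        args=GRPOConfig(output_dir="grpo-out"),
        train_dataset=load_dataset("trl-lib/tldr", split="train"),  # has a "prompt" column
    )
    trainer.train()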
Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization.
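A hedged sketch of Unsloth's usual 4-bit QLoRA entry point (model name and ranks are illustrative); training then proceeds with a standard TRL trainer:

    # Unsloth 4-bit load plus LoRA adapters (illustrative values).
    from unsloth import FastLanguageModel

    model, tokenizer = FastLanguageModel.from_pretrained(
        model_name="unsloth/llama-3-8b-bnb-4bit",
        max_seq_length=2048,
        load_in_4bit=True,
    )
    model = FastLanguageModel.get_peft_model(
        model,
        r=16,
        lora_alpha=16,
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    )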
Implements and trains LLMs using Lightning AI's LitGPT with 20+ pretrained architectures (Llama, Gemma, Phi, Qwen,...
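A small sketch of LitGPT's Python API for loading and prompting one of those checkpoints (the checkpoint name is illustrative; fine-tuning and pretraining are typically driven through the litgpt CLI):

    # Load a supported checkpoint and generate with LitGPT's Python API.
    from litgpt import LLM

    llm = LLM.load("microsoft/phi-2")   # downloads the checkpoint on first use
    print(llm.generate("What is parameter-efficient fine-tuning?", max_new_tokens=64))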
Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4...
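The core objective is simple enough to sketch directly: the implicit reward is the length-normalized log-likelihood of a response under the policy (no reference model), with a target margin gamma between chosen and rejected responses. The beta and gamma values below are placeholders.

    # SimPO loss sketch: length-normalized log-likelihood as implicit reward,
    # no reference model, and a target reward margin gamma.
    import torch.nn.functional as F

    def simpo_loss(chosen_logps, chosen_lens, rejected_logps, rejected_lens,
                   beta=2.0, gamma=1.0):
        # *_logps: summed token log-probs per response; *_lens: response lengths in tokens.
        chosen_reward = beta * chosen_logps / chosen_lens
        rejected_reward = beta * rejected_logps / rejected_lens
        return -F.logsigmoid(chosen_reward - rejected_reward - gamma).mean()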
Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use...
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when...
Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms....
Language-independent tokenizer treating text as raw Unicode. Supports BPE and Unigram algorithms. Fast (50k...
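A minimal sketch of the typical SentencePiece workflow, training a BPE model on raw text and round-tripping a sentence (file names and vocab size are illustrative):

    # Train a BPE SentencePiece model on raw Unicode text, then encode/decode.
    import sentencepiece as spm

    spm.SentencePieceTrainer.train(
        input="corpus.txt",          # one sentence per line
        model_prefix="bpe_8k",
        vocab_size=8000,
        model_type="bpe",            # or "unigram"
    )
    sp = spm.SentencePieceProcessor(model_file="bpe_8k.model")
    pieces = sp.encode("Hello, world!", out_type=str)
    print(pieces, "->", sp.decode(pieces))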
Expert guidance for fine-tuning LLMs with LLaMA-Factory - WebUI no-code, 100+ models, 2/3/4/5/6/8-bit QLoRA,...
Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for...
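For orientation, a sketch of instantiating the model it defines (run from inside the repo, where model.py lives; the shape shown matches the GPT-2 124M defaults):

    # Build nanoGPT's GPT-2 (124M)-shaped model from its model.py.
    from model import GPT, GPTConfig

    config = GPTConfig(block_size=1024, vocab_size=50304,
                       n_layer=12, n_head=12, n_embd=768)
    model = GPT(config)
    print(sum(p.numel() for p in model.parameters()) / 1e6, "M parameters")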
Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training...
Write publication-ready ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Use when drafting papers from...
State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV...
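A hedged sketch of a single Mamba block from the mamba_ssm package (requires a CUDA GPU; dimensions are illustrative):

    # One Mamba block: input and output are (batch, seq_len, d_model).
    import torch
    from mamba_ssm import Mamba

    block = Mamba(d_model=512, d_state=16, d_conv=4, expand=2).to("cuda")
    x = torch.randn(2, 4096, 512, device="cuda")
    y = block(x)            # same shape; cost grows linearly with seq_len
    print(y.shape)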
Fast tokenizers optimized for research and production. Rust-based implementation tokenizes 1GB in <20 seconds....
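A minimal sketch of training a byte-level BPE with the tokenizers library (file name and vocab size are illustrative):

    # Train a byte-level BPE tokenizer from plain-text files, then encode.
    from tokenizers import Tokenizer, models, pre_tokenizers, trainers

    tokenizer = Tokenizer(models.BPE(unk_token="[UNK]"))
    tokenizer.pre_tokenizer = pre_tokenizers.ByteLevel()
    trainer = trainers.BpeTrainer(vocab_size=30000, special_tokens=["[UNK]", "[PAD]"])
    tokenizer.train(files=["corpus.txt"], trainer=trainer)
    print(tokenizer.encode("Fast tokenization in Rust.").tokens)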
Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment,...
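A hedged sketch of the two most common TRL entry points, SFT followed by DPO (model and dataset names are illustrative):

    # SFT for instruction tuning, then DPO on (prompt, chosen, rejected) pairs.
    from datasets import load_dataset
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from trl import SFTConfig, SFTTrainer, DPOConfig, DPOTrainer

    model_id = "Qwen/Qwen2-0.5B-Instruct"
    model = AutoModelForCausalLM.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)

    sft = SFTTrainer(
        model=model,
        args=SFTConfig(output_dir="sft-out"),
        train_dataset=load_dataset("trl-lib/Capybara", split="train"),
        processing_class=tokenizer,
    )
    sft.train()

    dpo = DPOTrainer(
        model=model,
        args=DPOConfig(output_dir="dpo-out"),
        train_dataset=load_dataset("trl-lib/ultrafeedback_binarized", split="train"),
        processing_class=tokenizer,
    )
    dpo.train()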