Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4...
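A minimal sketch of the length-normalized, reference-free objective SimPO describes (assuming the standard formulation; the `beta` and `gamma` values below are illustrative, not taken from this skill):

```python
import torch.nn.functional as F

def simpo_loss(chosen_logps, rejected_logps, chosen_len, rejected_len,
               beta=2.0, gamma=0.5):
    """SimPO-style loss: average log-prob as implicit reward, no reference model."""
    # Implicit reward = beta * (summed log-prob / response length).
    r_chosen = beta * chosen_logps / chosen_len
    r_rejected = beta * rejected_logps / rejected_len
    # Bradley-Terry preference loss with a target reward margin gamma.
    return -F.logsigmoid(r_chosen - r_rejected - gamma).mean()
```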
Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images....
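Assuming this refers to Ray Data, a rough usage sketch (paths and the transform are made up for illustration):

```python
import ray

# Lazily read a Parquet dataset; execution streams block-by-block across the cluster.
ds = ray.data.read_parquet("s3://my-bucket/raw/")         # illustrative path

def add_feature(batch):                                   # runs in parallel on workers
    batch["scaled"] = batch["value"] * 2.0
    return batch

ds = ds.map_batches(add_feature, batch_format="pandas")
ds.write_parquet("/tmp/processed")                        # illustrative output path
```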
Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use...
Meta's 7-8B specialized moderation model for LLM input/output filtering. 6 safety categories - violence/hate, sexual...
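A sketch of the usual transformers-based invocation for this kind of moderation model (checkpoint name and prompt are assumptions; the real model is gated on the Hugging Face Hub):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/LlamaGuard-7b"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")

chat = [{"role": "user", "content": "Tell me how to pick a lock."}]
input_ids = tokenizer.apply_chat_template(chat, return_tensors="pt").to(model.device)
out = model.generate(input_ids=input_ids, max_new_tokens=32)
# The completion starts with "safe" or "unsafe" plus the violated category code.
print(tokenizer.decode(out[0][input_ids.shape[-1]:], skip_special_tokens=True))
```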
Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms....
Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies....
Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for...
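Assuming this is Hugging Face Accelerate, the "4 lines" are roughly: import, create an `Accelerator`, `prepare()` your objects, and swap `loss.backward()` for `accelerator.backward(loss)` (toy model and data below are illustrative):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

model = torch.nn.Linear(10, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
loader = DataLoader(TensorDataset(torch.randn(64, 10), torch.randn(64, 1)), batch_size=8)

accelerator = Accelerator()                                               # create
model, optimizer, loader = accelerator.prepare(model, optimizer, loader)  # wrap objects
for x, y in loader:
    loss = torch.nn.functional.mse_loss(model(x), y)
    accelerator.backward(loss)                                            # replaces loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

The same script then runs on one GPU or many via `accelerate launch script.py`.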
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when...
High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks...
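Assuming this entry is PyTorch Lightning, a toy sketch of the `Trainer` pattern (model, data, and the `strategy` choice are illustrative):

```python
import torch
import lightning as L
from torch.utils.data import DataLoader, TensorDataset

class LitRegressor(L.LightningModule):
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Linear(10, 1)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = torch.nn.functional.mse_loss(self.net(x), y)
        self.log("train_loss", loss)
        return loss

    def configure_optimizers(self):
        return torch.optim.AdamW(self.parameters(), lr=1e-3)

loader = DataLoader(TensorDataset(torch.randn(64, 10), torch.randn(64, 1)), batch_size=8)
# strategy="ddp" / "fsdp" / "deepspeed" selects the distributed backend.
trainer = L.Trainer(max_epochs=1, accelerator="auto", devices="auto")
trainer.fit(LitRegressor(), loader)
```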
Language-independent tokenizer treating text as raw Unicode. Supports BPE and Unigram algorithms. Fast (50k...
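Assuming SentencePiece, a short train-then-encode sketch (file names and vocab size are illustrative):

```python
import sentencepiece as spm

# Train directly on raw text, one sentence per line; no language-specific pre-tokenization.
spm.SentencePieceTrainer.train(
    input="corpus.txt",
    model_prefix="spm_demo",
    vocab_size=8000,
    model_type="unigram",   # or "bpe"
)

sp = spm.SentencePieceProcessor(model_file="spm_demo.model")
print(sp.encode("Hello world", out_type=str))  # subword pieces
print(sp.encode("Hello world", out_type=int))  # token ids
```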
Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances...
Expert guidance for fine-tuning LLMs with LLaMA-Factory - no-code WebUI, 100+ models, 2/3/4/5/6/8-bit QLoRA,...
Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or...
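Assuming this is SkyPilot, a heavily simplified sketch of its Python API (task contents, accelerator string, and cluster name are assumptions; the YAML file plus `sky launch` CLI is the more common path):

```python
import sky

# Describe the job: setup, run command, and the GPUs it needs.
task = sky.Task(
    setup="pip install -r requirements.txt",
    run="python train.py",
)
task.set_resources(sky.Resources(accelerators="A100:8"))

# SkyPilot picks the cheapest cloud/region that satisfies the request.
sky.launch(task, cluster_name="train-cluster")
```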
Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for...
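For orientation, the GPT-2 small (124M) shape that nanoGPT reproduces, written as a config dataclass in the spirit of its `GPTConfig` (the fields here are a summary, not a copy of the repo):

```python
from dataclasses import dataclass

@dataclass
class GPTConfig:
    block_size: int = 1024   # context length
    vocab_size: int = 50257  # GPT-2 BPE vocabulary
    n_layer: int = 12        # transformer blocks
    n_head: int = 12         # attention heads per block
    n_embd: int = 768        # hidden size; these dims give ~124M parameters
```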
Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization
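A rough sketch of the usual Unsloth flow (load a 4-bit base model, then attach LoRA adapters); the model name and LoRA hyperparameters are illustrative:

```python
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",  # illustrative checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)

# Only the low-rank adapter weights are trained; the 4-bit base stays frozen.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```

The resulting model is typically handed to a standard trainer (e.g. TRL's SFTTrainer) for the actual fine-tune.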
Write publication-ready ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Use when drafting papers from...
State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV...
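A minimal sketch of a single selective state-space block via the `mamba_ssm` package (dimensions are illustrative; the kernels require a CUDA device):

```python
import torch
from mamba_ssm import Mamba

block = Mamba(
    d_model=256,  # model width
    d_state=16,   # SSM state size
    d_conv=4,     # local convolution width
    expand=2,     # inner expansion factor
).cuda()

x = torch.randn(2, 1024, 256, device="cuda")  # (batch, seq_len, d_model)
y = block(x)                                  # same shape; compute scales linearly in seq_len
print(y.shape)
```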
Fast tokenizers optimized for research and production. Rust-based implementation tokenizes 1GB in <20 seconds....
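Assuming Hugging Face `tokenizers`, a small train-from-scratch sketch following its quicktour pattern (corpus path and vocab size are illustrative):

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

tokenizer = Tokenizer(BPE(unk_token="[UNK]"))
tokenizer.pre_tokenizer = Whitespace()
trainer = BpeTrainer(vocab_size=30000, special_tokens=["[UNK]", "[PAD]", "[CLS]", "[SEP]"])
tokenizer.train(files=["corpus.txt"], trainer=trainer)

encoding = tokenizer.encode("Hello, tokenizers!")
print(encoding.tokens, encoding.ids)
```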
Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment,...
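A minimal SFT sketch with TRL (dataset and model ids follow TRL's docs but are assumptions here, and the exact argument set shifts between TRL releases; `DPOTrainer` follows the same pattern with a preference dataset):

```python
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

dataset = load_dataset("trl-lib/Capybara", split="train")   # illustrative chat dataset

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",                              # loaded for you from the Hub
    train_dataset=dataset,
    args=SFTConfig(output_dir="sft-demo", max_steps=100),
)
trainer.train()
```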
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism,...
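A bare-bones ZeRO stage-2 sketch (config values are illustrative; real runs go through the `deepspeed` launcher):

```python
import torch
import deepspeed

model = torch.nn.Linear(10, 1)  # stand-in for a real model

ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},   # partition optimizer state + gradients
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-3}},
}

# Returns a wrapped engine whose backward()/step() handle ZeRO partitioning.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
```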