Search: claude-4 | AgentSkillsRepo

model-pruning 0.00

zechenzhangAGI / ai-research-skills-model-pruning exact

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or...

★ 1,712 ai

ai ai-research claude claude-code

invoice-scanner 0.00

frank-syncmarket / skills-invoice-scanner exact

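Scans a directory to recognize all types of invoices (transportation, lodging, dining, etc.), extracts key information, and generates a categorized statistical report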

★ 1 ai

agent-skills anthropic anthropic-ai astro

llava 0.00

zechenzhangAGI / ai-research-skills-llava exact

Large Language and Vision Assistant. Enables visual instruction tuning and image-based conversations. Combines CLIP vision encoder with Vicuna/LLaMA language models. Supports multi-turn image...

★ 1,712 ai

ai ai-research claude claude-code

moe-training 0.00

zechenzhangAGI / ai-research-skills-moe-training exact

Train Mixture of Experts (MoE) models using DeepSpeed or HuggingFace. Use when training large-scale models with limited compute (5× cost reduction vs dense models), implementing sparse...

★ 1,712 ai

ai ai-research claude claude-code

obsidian-project-documentation-assistant 0.00

ali5ter / obsidian-project-assistant-skill-obsidian-project-documentation-assistant exact

Document technical projects in an Obsidian vault. Use when the user mentions "document this", "close out", "wrap up", "update notes", "track progress", "where are we at", "what is the status", or...

★ 2 ai

claude-code documentation note-taking obsidian

miles-rl-training 0.00

zechenzhangAGI / ai-research-skills-miles-rl-training exact

Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training large MoE models with FP8/INT4, needing train-inference alignment, or requiring...

★ 1,712 ai

ai ai-research claude claude-code

fine-tuning-with-trl 0.00

zechenzhangAGI / ai-research-skills-fine-tuning-with-trl exact

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when you need RLHF,...

★ 1,712 ai

ai ai-research claude claude-code

gguf-quantization 0.00

zechenzhangAGI / ai-research-skills-gguf-quantization exact

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without...

★ 1,712 ai

ai ai-research claude claude-code

figma-to-code 0.00

scoobynko / claude-code-design-skills-figma-to-code exact

Generate production-ready React/Next.js code from Figma designs. Instructs proper Figma MCP tool usage, component reuse strategies, and variant mapping. **CRITICAL: ALWAYS use this skill when user...

★ 24 development

claude-code claude-skill claude-skills design

sparse-autoencoder-training 0.00

zechenzhangAGI / ai-research-skills-sparse-autoencoder-training exact

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable...

★ 1,712 ai

ai ai-research claude claude-code

grpo-rl-training 0.00

zechenzhangAGI / ai-research-skills-grpo-rl-training exact

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

★ 1,712 ai

ai ai-research claude claude-code

pytorch-fsdp 0.00

zechenzhangAGI / ai-research-skills-pytorch-fsdp exact

Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP - parameter sharding, mixed precision, CPU offloading, FSDP2

★ 1,712 ai

ai ai-research claude claude-code

distributed-llm-pretraining-torchtitan 0.00

zechenzhangAGI / ai-research-skills-distributed-llm-pretraining-torchtitan exact

Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or custom models at scale from 8 to 512+...

★ 1,712 ai

ai ai-research claude claude-code

serving-llms-vllm 0.00

zechenzhangAGI / ai-research-skills-serving-llms-vllm exact

Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with...

★ 1,712 ai

ai ai-research claude claude-code

deepspeed 0.00

zechenzhangAGI / ai-research-skills-deepspeed exact

Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention

★ 1,712 ai

ai ai-research claude claude-code

ray-train 0.00

zechenzhangAGI / ai-research-skills-ray-train exact

Distributed training orchestration across clusters. Scales PyTorch/TensorFlow/HuggingFace from laptop to 1000s of nodes. Built-in hyperparameter tuning with Ray Tune, fault tolerance, elastic...

★ 1,712 ai

ai ai-research claude claude-code

peft-fine-tuning 0.00

zechenzhangAGI / ai-research-skills-peft-fine-tuning exact

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with...

★ 1,712 ai

ai ai-research claude claude-code

evaluating-code-models 0.00

zechenzhangAGI / ai-research-skills-evaluating-code-models exact

Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding abilities, testing multi-language...

★ 1,712 ai

ai ai-research claude claude-code

nemo-evaluator-sdk 0.00

zechenzhangAGI / ai-research-skills-nemo-evaluator-sdk exact

Evaluates LLMs across 100+ benchmarks from 18+ harnesses (MMLU, HumanEval, GSM8K, safety, VLM) with multi-backend execution. Use when needing scalable evaluation on local Docker, Slurm HPC, or...

★ 1,712 ai

ai ai-research claude claude-code

lambda-labs-gpu-cloud 0.00

zechenzhangAGI / ai-research-skills-lambda-labs-gpu-cloud exact

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node...

★ 1,712 ai

ai ai-research claude claude-code
