Search: claude-plugins | AgentSkillsRepo

grpo-rl-training 0.00

zechenzhangAGI / ai-research-skills-grpo-rl-training exact

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

ai ai-research claude claude-code

commit-messages 0.00

TheDecipherist / claude-code-mastery-commit-messages exact

Generate clear, conventional commit messages from git diffs. Use when writing commit messages, reviewing staged changes, or preparing releases.

★ 250 ai

ai-agents anthropic claude claude-code

evaluating-llms-harness 0.00

zechenzhangAGI / ai-research-skills-evaluating-llms-harness exact

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking...

★ 1,712 ai

ai ai-research claude claude-code

fine-tuning-with-trl 0.00

zechenzhangAGI / ai-research-skills-fine-tuning-with-trl exact

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when you need RLHF,...

★ 1,712 ai

ai ai-research claude claude-code

mamba-architecture 0.00

zechenzhangAGI / ai-research-skills-mamba-architecture exact

State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV cache. Selective SSM with hardware-aware design. Mamba-1 (d_state=16) and Mamba-2...

★ 1,712 ai

ai ai-research claude claude-code

tensorrt-llm 0.00

zechenzhangAGI / ai-research-skills-tensorrt-llm exact

Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than...

★ 1,712 ai

ai ai-research claude claude-code

llama-cpp 0.00

zechenzhangAGI / ai-research-skills-llama-cpp exact

Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization...

★ 1,712 ai

ai ai-research claude claude-code

model-pruning 0.00

zechenzhangAGI / ai-research-skills-model-pruning exact

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or...

★ 1,712 ai

ai ai-research claude claude-code

training-llms-megatron 0.00

zechenzhangAGI / ai-research-skills-training-llms-megatron exact

Trains large language models (2B-462B parameters) using NVIDIA Megatron-Core with advanced parallelism strategies. Use when training models >1B parameters, need maximum GPU efficiency (47% MFU on...

★ 1,712 ai

ai ai-research claude claude-code

phoenix-observability 0.00

zechenzhangAGI / ai-research-skills-phoenix-observability exact

Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running evaluations on datasets, or monitoring...

★ 1,712 ai

ai ai-research claude claude-code

tensorboard 0.00

zechenzhangAGI / ai-research-skills-tensorboard exact

Visualize training metrics, debug models with histograms, compare experiments, visualize model graphs, and profile performance with TensorBoard - Google's ML visualization toolkit

★ 1,712 ai

ai ai-research claude claude-code

distributed-llm-pretraining-torchtitan 0.00

zechenzhangAGI / ai-research-skills-distributed-llm-pretraining-torchtitan exact

Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or custom models at scale from 8 to 512+...

★ 1,712 ai

ai ai-research claude claude-code

gguf-quantization 0.00

zechenzhangAGI / ai-research-skills-gguf-quantization exact

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without...

★ 1,712 ai

ai ai-research claude claude-code

llama-factory 0.00

zechenzhangAGI / ai-research-skills-llama-factory exact

Expert guidance for fine-tuning LLMs with LLaMA-Factory - WebUI no-code, 100+ models, 2/3/4/5/6/8-bit QLoRA, multimodal support

★ 1,712 ai

ai ai-research claude claude-code

skills 0.00

stotihv / skills exact

🤖 Coordinate multi-agent tasks and enhance workflow with versatile skills for autonomous planning and execution.

★ 0 ai

anthropic-ai astro automate claude

invoice-scanner 0.00

frank-syncmarket / skills-invoice-scanner exact

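Scans a directory to identify all types of invoices (transportation, accommodation, dining, etc.), extracts key information, and generates a categorized statistical report.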

★ 1 ai

agent-skills anthropic anthropic-ai astro

awq-quantization 0.00

zechenzhangAGI / ai-research-skills-awq-quantization exact

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited GPU memory, when you need faster...

★ 1,712 ai

ai ai-research claude claude-code

plaid 0.00

raintree-technology / claude-starter-plaid exact

Plaid banking API expert for financial data integration. Covers Plaid Link, Auth (account/routing numbers), Transactions, Identity verification, Balance checking, and webhooks. Build fintech apps...

★ 38 ai

anthropic claude claude-ai claude-code

browser-tools 0.00

Brawl345 / browser-tools exact

Interact with a web browser. Can start a browser, connect to it, evaluate JavaScript, take screenshots, read console logs, and let the user select DOM elements. Use when interacting with unknown...

★ 0 ai

ai ai-agent claude claude-code

openrlhf-training 0.00

zechenzhangAGI / ai-research-skills-openrlhf-training exact

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2× faster than DeepSpeedChat with...

★ 1,712 ai

ai ai-research claude claude-code
