Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50% sparsity with minimal accuracy loss, or...
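As a quick illustration of the Wanda criterion (a sketch, not the reference implementation): each weight is scored by its magnitude times the L2 norm of the matching input activation, and the lowest-scoring half of each row is zeroed with no retraining. The helper name and tensor shapes below are assumptions.

```python
import torch

def wanda_prune_(weight: torch.Tensor, act_norm: torch.Tensor, sparsity: float = 0.5):
    # weight: (out_features, in_features); act_norm: (in_features,) holds the
    # L2 norm of each input feature measured on a small calibration set.
    scores = weight.abs() * act_norm                    # Wanda score: |W| * ||X||_2
    k = int(weight.shape[1] * sparsity)                 # weights to drop per row
    idx = torch.topk(scores, k, dim=1, largest=False).indices
    weight.scatter_(1, idx, 0.0)                        # prune in place, no retraining

W = torch.randn(8, 16)
wanda_prune_(W, act_norm=torch.rand(16))
assert (W == 0.0).float().mean() == 0.5                 # 50% unstructured sparsity
```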
Simple Preference Optimization (SimPO) for LLM alignment. Reference-free alternative to DPO with stronger performance (+6.4 points on AlpacaEval 2.0); dropping the reference model also makes it more memory- and compute-efficient than DPO. Use...
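The objective is simple enough to sketch in a few lines: length-normalized log-probabilities of the chosen and rejected responses are compared against a target margin, with no reference model anywhere in the formula. The beta/gamma values here are illustrative defaults, not tuned settings.

```python
import torch
import torch.nn.functional as F

def simpo_loss(logp_chosen, len_chosen, logp_rejected, len_rejected,
               beta=2.0, gamma=1.0):
    # Length-normalized implicit rewards; note: no reference model terms.
    r_chosen = beta * logp_chosen / len_chosen
    r_rejected = beta * logp_rejected / len_rejected
    # Bradley-Terry-style loss with a target reward margin gamma.
    return -F.logsigmoid(r_chosen - r_rejected - gamma).mean()

# Summed log-probs and token counts per response (stand-in values).
loss = simpo_loss(torch.tensor([-40.0]), torch.tensor([20.0]),
                  torch.tensor([-90.0]), torch.tensor([30.0]))
```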
Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+ tokens), extending pre-trained models...
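Position interpolation is the easiest of these to show concretely: position indices are rescaled by train_length/target_length before the rotary angles are computed, so an extended context still lands inside the position range the model was trained on. A minimal sketch assuming standard RoPE frequencies (base 10000); the function and argument names are made up for illustration.

```python
import torch

def rope_angles(positions, dim: int, train_len: int, target_len: int, base=10000.0):
    scale = train_len / target_len                 # e.g. 4096 / 16384 = 0.25
    inv_freq = base ** (-torch.arange(0, dim, 2).float() / dim)
    # Compressed positions keep angles within the trained distribution.
    return torch.outer(positions.float() * scale, inv_freq)  # (seq, dim/2)

angles = rope_angles(torch.arange(16384), dim=128, train_len=4096, target_len=16384)
```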
Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training
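A minimal GRPO loop with TRL looks roughly like the sketch below (API as of recent TRL releases; verify against your installed version). The checkpoint name and the length-based reward are placeholders; a real setup would plug in a task-specific verifier.

```python
from datasets import Dataset
from trl import GRPOConfig, GRPOTrainer

train_dataset = Dataset.from_dict({"prompt": ["Solve: 2 + 2 =", "Name a prime > 10:"]})

def reward_len(completions, **kwargs):
    # Toy reward preferring short completions; replace with answer checking.
    return [-float(len(c)) for c in completions]

trainer = GRPOTrainer(
    model="Qwen/Qwen2.5-0.5B-Instruct",   # assumed example checkpoint
    reward_funcs=reward_len,
    args=GRPOConfig(output_dir="grpo-demo", num_generations=4),
    train_dataset=train_dataset,
)
trainer.train()
```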
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with...
Analyze AI/ML technical content (papers, articles, blog posts) and extract actionable insights filtered through an enterprise AI engineering lens. Use when user provides URL/document for AI/ML...
Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory reduction with <2% perplexity...
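The core trade-off is easy to demonstrate with plain round-to-nearest quantization; production methods (GPTQ, AWQ, etc.) layer calibration and error compensation on top of this, so treat the sketch as a simplified illustration only.

```python
import numpy as np

def quantize_4bit(w: np.ndarray):
    # Symmetric int4: map weights onto the integer range [-7, 7].
    scale = np.abs(w).max() / 7.0
    q = np.clip(np.round(w / scale), -7, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.random.randn(4, 8).astype(np.float32)
q, s = quantize_4bit(w)
err = np.abs(w - dequantize(q, s)).mean()    # small reconstruction error
```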
Language-independent tokenizer treating text as raw Unicode. Supports BPE and Unigram algorithms. Fast (50k sentences/sec), lightweight (6MB memory), deterministic vocabulary. Used by T5, ALBERT,...
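A minimal round trip, assuming a raw text file corpus.txt; the vocabulary size and file names are arbitrary examples.

```python
import sentencepiece as spm

# Train a small BPE model directly on raw text (no pre-tokenization needed).
spm.SentencePieceTrainer.train(
    input="corpus.txt", model_prefix="m", vocab_size=8000, model_type="bpe"
)
sp = spm.SentencePieceProcessor(model_file="m.model")
pieces = sp.encode("New York", out_type=str)   # subword pieces, e.g. ['▁New', '▁York']
ids = sp.encode("New York", out_type=int)      # integer ids
text = sp.decode(ids)                          # lossless round trip
```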
Provides guidance for mechanistic interpretability research using TransformerLens to inspect and manipulate transformer internals via HookPoints and activation caching. Use when...
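In practice that usually starts with run_with_cache, which returns both logits and a cache keyed by hook names; a short sketch for GPT-2 small:

```python
from transformer_lens import HookedTransformer

model = HookedTransformer.from_pretrained("gpt2")
logits, cache = model.run_with_cache("The Eiffel Tower is in")
attn = cache["blocks.0.attn.hook_pattern"]   # attention patterns, layer 0
resid = cache["blocks.5.hook_resid_post"]    # residual stream after layer 5
```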
Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable...
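A hedged sketch of the loading-and-encoding workflow; the release/sae_id strings come from SAELens's pretrained registry, and the exact return signature may differ across versions.

```python
import torch
from sae_lens import SAE

sae, cfg_dict, sparsity = SAE.from_pretrained(
    release="gpt2-small-res-jb",
    sae_id="blocks.8.hook_resid_pre",
)
acts = torch.randn(1, 768)            # stand-in for cached GPT-2 activations
features = sae.encode(acts)           # sparse feature activations
recon = sae.decode(features)          # reconstruction of the original acts
```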
Runs LLM inference on CPU, Apple Silicon, and consumer GPUs without NVIDIA hardware. Use for edge deployment, M1/M2/M3 Macs, AMD/Intel GPUs, or when CUDA is unavailable. Supports GGUF quantization...
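Via the llama-cpp-python bindings, inference on a local GGUF file is a few lines; the model path below is a placeholder.

```python
from llama_cpp import Llama

llm = Llama(model_path="./models/llama-3-8b-q4_k_m.gguf", n_ctx=4096)
out = llm("Q: What is the capital of France? A:", max_tokens=32, stop=["Q:"])
print(out["choices"][0]["text"])
```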
Facebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU acceleration, and various index types (Flat, IVF, HNSW). Use for fast k-NN...
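The canonical minimal example is an exact (Flat) index; at billion scale you would swap in IVF or HNSW, but the add/search API stays the same.

```python
import numpy as np
import faiss

d = 128
xb = np.random.random((10000, d)).astype("float32")   # database vectors
xq = np.random.random((5, d)).astype("float32")       # query vectors

index = faiss.IndexFlatL2(d)    # exact L2 search
index.add(xb)
D, I = index.search(xq, 4)      # distances and ids of the 4 nearest neighbors
```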
Whop platform expert for digital products, memberships, and community monetization. Covers memberships API, payments, courses, forums, webhooks, OAuth apps, and checkout integration. Build SaaS,...
RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux Foundation AI project. Production at Windows,...
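Schematically, the constant-state recurrence behind this family looks like the sketch below; this is a generic linear-attention update for illustration, not RWKV's actual time-mixing equations.

```python
import torch

def recurrent_step(state, k, v, q, decay):
    # state: (d, d) fixed-size summary of the whole past (the "no KV cache" part)
    state = decay * state + torch.outer(k, v)   # fold the new token in
    out = q @ state                             # read out for this token
    return state, out

d = 64
state = torch.zeros(d, d)
for _ in range(1000):                           # cost and memory are flat per token
    k, v, q = torch.randn(d), torch.randn(d), torch.randn(d)
    state, out = recurrent_step(state, k, v, q, decay=0.99)
```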
Delivers a static Hello World HTML demo page with bookmarklet. Use when user requests the hello demo, hello world demo, or demo page.
LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building...
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
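A hedged sketch of wiring up ZeRO stage 2 with BF16 through a config dict; the model, batch size, and optimizer settings are placeholders.

```python
import deepspeed
import torch

model = torch.nn.Linear(1024, 1024)   # stand-in for a real model
ds_config = {
    "train_batch_size": 32,
    "bf16": {"enabled": True},
    "zero_optimization": {"stage": 2},  # shard optimizer state + gradients
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
```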
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM...
Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent filesystems, or high-performance multi-node...
Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than...
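With TensorRT-LLM's high-level LLM API (available in recent releases; import paths and arguments may vary by version), a hedged sketch looks like this; the checkpoint name is a placeholder and the engine is built on first use.

```python
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")
params = SamplingParams(max_tokens=64, temperature=0.8)
for out in llm.generate(["Explain KV caching in one sentence."], params):
    print(out.outputs[0].text)
```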