Generate and improve prompts using best practices for OpenAI GPT-5 and other LLMs. Apply advanced techniques like chain-of-thought, few-shot prompting, and progressive disclosure.
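A minimal sketch of few-shot prompting combined with a step-by-step (chain-of-thought) instruction, using the OpenAI Python SDK; the model name is a placeholder and the sentiment task is purely illustrative:

```python
from openai import OpenAI

client = OpenAI()
MODEL = "gpt-5"  # placeholder; substitute whichever model you target

# Few-shot prompting: show two worked examples before the real query,
# and ask the model to reason step by step before giving the final label.
messages = [
    {"role": "system", "content": "Classify the review's sentiment as positive or negative. Think step by step, then give the answer on the last line."},
    {"role": "user", "content": "Review: The battery died after two days."},
    {"role": "assistant", "content": "The review describes a product failure. Answer: negative"},
    {"role": "user", "content": "Review: Setup took five minutes and it just works."},
    {"role": "assistant", "content": "The review praises ease of setup. Answer: positive"},
    {"role": "user", "content": "Review: The screen is gorgeous but the speakers crackle."},
]
resp = client.chat.completions.create(model=MODEL, messages=messages)
print(resp.choices[0].message.content)
```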
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when implementing RLHF, GRPO, PPO, or other RL algorithms for LLM post-training at scale with...
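For orientation, the GRPO-specific piece is its group-relative advantage: each sampled completion's reward is normalized against the mean and standard deviation of the other completions sampled for the same prompt. A small sketch of just that formula (not verl's API; verl wires this into its distributed PPO-style trainer):

```python
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """Group-relative advantages as used by GRPO.

    rewards: shape (num_prompts, group_size), one scalar reward per sampled
    completion. Each completion is scored relative to its own group.
    """
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)

# Example: 2 prompts, 4 sampled completions each
adv = grpo_advantages(torch.tensor([[1.0, 0.0, 0.0, 1.0], [0.2, 0.4, 0.6, 0.8]]))
print(adv)
```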
Expert skill for AI model quantization and optimization. Covers 4-bit/8-bit quantization, GGUF conversion, memory optimization, and quality-performance tradeoffs for deploying LLMs in...
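As a hedged example of the deployment end of this, loading an already-converted GGUF file with llama-cpp-python; the model path and generation settings are illustrative:

```python
from llama_cpp import Llama

# Assumes a GGUF file already produced by llama.cpp's conversion/quantization tools.
llm = Llama(
    model_path="models/llama-3-8b-instruct.Q4_K_M.gguf",  # illustrative path
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to GPU if available
)
out = llm("Q: What does 4-bit quantization trade off? A:", max_tokens=64, stop=["Q:"])
print(out["choices"][0]["text"])
```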
Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate...
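A minimal MCP server sketch using the official Python SDK's FastMCP helper; the server name and tool are hypothetical stubs:

```python
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("weather")  # hypothetical server name

@mcp.tool()
def get_forecast(city: str) -> str:
    """Return a short forecast for a city (stubbed for illustration)."""
    return f"Forecast for {city}: sunny, 22°C"

if __name__ == "__main__":
    mcp.run()  # defaults to stdio transport, suitable for local MCP clients
```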
Fast structured generation and serving for LLMs with RadixAttention prefix caching. Use for JSON/regex outputs, constrained decoding, agentic workflows with tool calls, or when you need 5× faster...
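A short sketch of SGLang's frontend language with regex-constrained decoding; it assumes an SGLang server is already running on the default local port:

```python
import sglang as sgl

@sgl.function
def extract_age(s, bio):
    # Constrain the generated answer to 1-3 digits via a regex.
    s += "Text: " + bio + "\n"
    s += "Age (digits only): " + sgl.gen("age", regex=r"\d{1,3}")

# Assumes `python -m sglang.launch_server ...` is serving on localhost:30000.
sgl.set_default_backend(sgl.RuntimeEndpoint("http://localhost:30000"))
state = extract_age.run(bio="Ada Lovelace was born in 1815 and died at 36.")
print(state["age"])
```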
Evaluates LLMs across 100+ benchmarks from 18+ harnesses (MMLU, HumanEval, GSM8K, safety, VLM) with multi-backend execution. Use when you need scalable evaluation on local Docker, Slurm HPC, or...
Senior Agile Facilitator & Delivery Architect for 2026. Specialized in AI-enhanced Scrum orchestration, automated ticket management, and high-velocity sprint coordination. Expert in utilizing LLMs...
Quantizes LLMs to 8-bit or 4-bit for 50-75% memory reduction with minimal accuracy loss. Use when GPU memory is limited, need to fit larger models, or want faster inference. Supports INT8, NF4,...
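A typical load-time NF4 setup with Hugging Face Transformers and bitsandbytes; the model ID is an assumption and any causal LM works:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Llama-3.1-8B-Instruct"  # assumed model for illustration

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",            # NF4 4-bit data type
    bnb_4bit_use_double_quant=True,       # nested quantization for extra savings
    bnb_4bit_compute_dtype=torch.bfloat16 # compute in bf16 for quality
)

model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
```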
Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory reduction with <2% perplexity...
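A hedged sketch of post-training GPTQ quantization via the Transformers integration (requires optimum and auto-gptq installed); the model and calibration dataset are illustrative:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

model_id = "facebook/opt-1.3b"  # small model used only for illustration
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Calibrate 4-bit GPTQ quantization on samples from the C4 dataset.
gptq_config = GPTQConfig(bits=4, dataset="c4", tokenizer=tokenizer)

model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=gptq_config, device_map="auto"
)
model.save_pretrained("opt-1.3b-gptq-4bit")  # quantized weights for later loading
```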
Route AI coding queries to local LLMs in air-gapped networks. Integrates Serena MCP for semantic code understanding. Use when working offline, with local models (Ollama, LM Studio, Jan,...
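A minimal offline-friendly call against a local Ollama server's REST API; the model tag is an assumption:

```python
import requests

# Assumes an Ollama server on its default port with the model already pulled.
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "qwen2.5-coder:7b",  # assumed local coding model
        "prompt": "Write a Python function that reverses a string.",
        "stream": False,
    },
    timeout=120,
)
print(resp.json()["response"])
```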
Review code for quality, maintainability, and correctness. Use when reviewing pull requests, evaluating code changes, or providing feedback on implementations. Focuses on API design, patterns, and...
Debug LLM applications using the Phoenix CLI. Fetch traces, analyze errors, review experiments, and inspect datasets. Use when debugging AI/LLM applications, analyzing trace data, working with...
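A sketch using the Phoenix Python client rather than the CLI, assuming a Phoenix server at the default local endpoint; exact column names can vary by Phoenix version:

```python
import phoenix as px

client = px.Client(endpoint="http://localhost:6006")  # assumed local Phoenix server
spans = client.get_spans_dataframe()                  # recorded spans as a pandas DataFrame

# Filter to failed spans to start error analysis.
errors = spans[spans["status_code"] == "ERROR"]
print(errors[["name", "status_message"]].head())
```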
Build and run evaluators for AI/LLM applications using Phoenix.
OpenInference semantic conventions and instrumentation for Phoenix AI observability. Use when implementing LLM tracing, creating custom spans, or deploying to production.
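A typical instrumentation setup, assuming arize-phoenix and the OpenInference OpenAI instrumentor are installed and a Phoenix collector is reachable at its default endpoint:

```python
from phoenix.otel import register
from openinference.instrumentation.openai import OpenAIInstrumentor

# Register an OTLP tracer provider pointed at Phoenix; the project name is illustrative.
tracer_provider = register(project_name="my-llm-app")

# From here on, OpenAI SDK calls emit OpenInference-compliant spans to Phoenix.
OpenAIInstrumentor().instrument(tracer_provider=tracer_provider)
```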
Write and evaluate effective Python tests using pytest. Use when writing tests, reviewing test code, debugging test failures, or improving test coverage. Covers test design, fixtures,...
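A small self-contained example of the fixture and parametrize patterns the skill covers:

```python
import pytest

@pytest.fixture
def numbers():
    """Shared test data provided via a fixture."""
    return [3, 1, 2]

def test_sorted_copy(numbers):
    assert sorted(numbers) == [1, 2, 3]
    assert numbers == [3, 1, 2]  # sorted() must not mutate the fixture data

@pytest.mark.parametrize("value,expected", [(0, 0), (2, 4), (-3, 9)])
def test_square(value, expected):
    assert value ** 2 == expected
```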
Repository packaging for AI/LLM analysis. Capabilities: pack repos into single files, generate AI-friendly context, codebase snapshots, security audit prep, filter/exclude patterns, token...
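A minimal, tool-agnostic sketch of packing a repository into one AI-friendly file with exclude patterns; the exclude set and output name are illustrative, not any specific tool's defaults:

```python
from pathlib import Path

# Directories to skip when packing (illustrative defaults).
EXCLUDE = {".git", "node_modules", "__pycache__", ".venv"}

def pack_repo(root: str, out_file: str = "repo-pack.txt") -> None:
    """Concatenate all readable text files under `root` into a single file,
    with a header line marking each file's relative path."""
    root_path = Path(root)
    with open(out_file, "w", encoding="utf-8") as out:
        for path in sorted(root_path.rglob("*")):
            if path.is_dir() or any(part in EXCLUDE for part in path.parts):
                continue
            try:
                text = path.read_text(encoding="utf-8")
            except (UnicodeDecodeError, OSError):
                continue  # skip binary or unreadable files
            out.write(f"\n===== {path.relative_to(root_path)} =====\n{text}")

pack_repo(".")
```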