Help users define AI product strategy. Use when someone is building an AI product, deciding where to apply AI in their product, planning an AI roadmap, evaluating build vs buy for AI capabilities,...
Comprehensive prompt engineering framework for designing, optimizing, and iterating on LLM prompts. This skill should be used when users request prompt creation, optimization, or improvement for any...
Patterns for coordinating multiple LLM agents, including sequential, parallel, router, and hierarchical architectures, the AI equivalent of microservices. Use when the request mentions "multi-agent", "agent orchestration",...
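For orientation, a minimal sketch of the router pattern named above; `call_llm` is a hypothetical stand-in for whatever model client the skill actually wires up:

```python
# Minimal sketch of the router pattern: a classifier call picks a specialist agent.
# `call_llm` is a hypothetical helper, not part of the skill itself.
from typing import Callable, Dict

def call_llm(prompt: str) -> str:
    raise NotImplementedError("replace with your model client")

SPECIALISTS: Dict[str, Callable[[str], str]] = {
    "billing": lambda q: call_llm(f"You are a billing agent. Answer: {q}"),
    "technical": lambda q: call_llm(f"You are a support engineer. Answer: {q}"),
}

def route(query: str) -> str:
    # A lightweight classifier call decides which specialist handles the query.
    label = call_llm(
        "Classify this query as 'billing' or 'technical'. Reply with one word only.\n\n"
        f"Query: {query}"
    ).strip().lower()
    handler = SPECIALISTS.get(label, SPECIALISTS["technical"])
    return handler(query)
```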
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots,...
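A rough idea of the kind of automation involved, sketched with Playwright as an assumed backend (the skill may use a different driver; the URL and selectors are placeholders):

```python
# Illustrative only: navigation, form filling, and a screenshot via Playwright.
from playwright.sync_api import sync_playwright

with sync_playwright() as p:
    browser = p.chromium.launch(headless=True)
    page = browser.new_page()
    page.goto("https://example.com/search")            # navigate to a page
    page.fill("input[name='q']", "site reliability")   # fill a form field
    page.click("button[type='submit']")                # submit the form
    page.screenshot(path="results.png")                # capture the result
    browser.close()
```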
This skill should be used to identify, analyze, and mitigate security risks in Artificial Intelligence systems using the CoSAI (Coalition for Secure AI) Risk Map framework. Use when...
Expert in getting reliable, typed outputs from LLMs. Covers JSON mode, function calling, Instructor library, Outlines for constrained generation, Pydantic validation, and response format...
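A sketch of the Instructor + Pydantic pattern this entry refers to; the model name and schema fields here are placeholders, not the skill's defaults:

```python
# Typed LLM output: Instructor patches the OpenAI client to validate against a Pydantic model.
import instructor
from openai import OpenAI
from pydantic import BaseModel, Field

class Invoice(BaseModel):
    vendor: str
    total: float = Field(ge=0)
    currency: str

client = instructor.from_openai(OpenAI())

invoice = client.chat.completions.create(
    model="gpt-4o-mini",
    response_model=Invoice,  # Instructor validates (and retries) against this schema
    messages=[{"role": "user", "content": "Extract the invoice: ACME, $1,200 USD."}],
)
print(invoice.total, invoice.currency)
```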
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM...
The Foundation Skill. LLM Firewall + 2025 Security + Cross-Skill Coordination. Use for ALL code output: prevents hallucinations, enforces security, and ensures quality.
Generates llms.txt documentation optimized for LLMs. Use when the user says "create llms.txt", "document for AI", "create documentation for LLMs", "generate docs for models", or wants to make...
Create a Mastra project using create-mastra and smoke-test the studio in Chrome.
Create flexible annotation workflows for AI applications. Contains common tools to explore raw AI agent logs/transcripts, extract relevant evaluation data, and create LLM-as-a-judge evaluators.
Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom model...
Help users write effective PRDs. Use when someone is documenting product requirements, preparing specs for engineering, writing feature briefs, or defining what to build for their team.
Build production AI agents with Pydantic AI: type-safe tools, structured output, embeddings, MCP, 30+ model providers, evals, graphs, and observability.
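As a flavor of the type-safe pieces listed here, a minimal Pydantic AI sketch; exact names vary across releases (older versions use result_type/.data, newer ones output_type/.output), so treat the parameter names as assumptions:

```python
# Minimal Pydantic AI agent with structured output.
# NOTE: parameter/attribute names differ by version (result_type/.data vs output_type/.output).
from pydantic import BaseModel
from pydantic_ai import Agent

class CityFact(BaseModel):
    city: str
    country: str

agent = Agent("openai:gpt-4o-mini", output_type=CityFact)

result = agent.run_sync("Which city hosted the 2000 Summer Olympics?")
print(result.output)  # e.g. CityFact(city='Sydney', country='Australia')
```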
Expert in building comprehensive AI systems, integrating LLMs, RAG architectures, and autonomous agents into production applications. Use when building AI-powered features, implementing LLM...
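A bare-bones version of the retrieval step in a RAG pipeline, for context; `embed` is a hypothetical embedding call standing in for whatever model the system uses:

```python
# Retrieval step of a RAG pipeline: rank documents by cosine similarity to the query.
import numpy as np

def embed(text: str) -> np.ndarray:
    raise NotImplementedError("replace with your embedding model")

def top_k(query: str, docs: list[str], k: int = 3) -> list[str]:
    q = embed(query)
    scores = []
    for doc in docs:
        d = embed(doc)
        scores.append(float(q @ d / (np.linalg.norm(q) * np.linalg.norm(d))))  # cosine similarity
    ranked = sorted(zip(scores, docs), key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in ranked[:k]]

# The retrieved chunks are then placed into the prompt sent to the LLM.
```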
Master LLM-as-a-Judge evaluation techniques including direct scoring, pairwise comparison, rubric generation, and bias mitigation. Use when building evaluation systems, comparing model outputs, or...
This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise...
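To illustrate the pairwise-comparison and bias-mitigation ideas these entries mention, a minimal sketch; `call_llm` is a hypothetical judge-model call, not part of the skill:

```python
# Pairwise LLM-as-a-judge with a simple position-bias check: judge both orderings
# and only accept a verdict when they agree.
def call_llm(prompt: str) -> str:
    raise NotImplementedError("replace with your judge model client")

JUDGE_PROMPT = (
    "You are an impartial judge. Given a question and two answers, reply with "
    "'A' if Answer A is better or 'B' if Answer B is better.\n\n"
    "Question: {q}\n\nAnswer A: {a}\n\nAnswer B: {b}"
)

def pairwise_judge(question: str, ans_a: str, ans_b: str) -> str:
    first = call_llm(JUDGE_PROMPT.format(q=question, a=ans_a, b=ans_b)).strip()
    # Swap the answer order and ask again; disagreement signals position bias.
    second = call_llm(JUDGE_PROMPT.format(q=question, a=ans_b, b=ans_a)).strip()
    if first == "A" and second == "B":
        return "A"
    if first == "B" and second == "A":
        return "B"
    return "tie"  # inconsistent verdicts are treated as a tie
```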