Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.
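A minimal sketch of the kind of harness this skill builds, assuming a hypothetical `run_agent` callable and illustrative pass/fail checks; tracking the returned pass rate across runs is what makes improvements measurable over time:

```python
# Minimal evaluation-harness sketch. `run_agent` is a stand-in for whatever
# callable your agent system exposes; the cases and checks are illustrative.
from dataclasses import dataclass
from typing import Callable

@dataclass
class EvalCase:
    prompt: str
    check: Callable[[str], bool]  # pass/fail predicate over the agent's output

def run_eval(run_agent: Callable[[str], str], cases: list[EvalCase]) -> float:
    """Run every case and return the pass rate, comparable across runs."""
    passed = sum(1 for case in cases if case.check(run_agent(case.prompt)))
    return passed / len(cases)

if __name__ == "__main__":
    cases = [
        EvalCase("List three prime numbers.", lambda out: "2" in out and "3" in out),
        EvalCase("What is the capital of France?", lambda out: "Paris" in out),
    ]
    stub_agent = lambda prompt: "2, 3, 5"  # stand-in agent for demonstration
    print(f"pass rate: {run_eval(stub_agent, cases):.0%}")
```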
Create polished, intentional frontend interfaces. Use this skill when building any UI: dashboards, admin panels, landing pages, marketing sites, or web applications. Routes to specialized...
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...
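When the request mentions LLM-as-judge or rubrics, the usual shape is a scoring prompt plus structured output. A sketch, with `judge_model` as a stand-in for the actual LLM call and a three-dimension rubric that is illustrative rather than a fixed standard:

```python
# LLM-as-judge sketch. `judge_model` abstracts over the real LLM call so the
# example stays self-contained; the rubric dimensions are illustrative.
import json
from typing import Callable

RUBRIC_PROMPT = """Score the response on each dimension from 1-5 and reply as JSON:
{{"accuracy": n, "completeness": n, "clarity": n}}

Task: {task}
Response: {response}"""

def judge(judge_model: Callable[[str], str], task: str, response: str) -> dict:
    raw = judge_model(RUBRIC_PROMPT.format(task=task, response=response))
    return json.loads(raw)  # production code should validate and retry here

if __name__ == "__main__":
    fake_judge = lambda prompt: '{"accuracy": 4, "completeness": 3, "clarity": 5}'
    print(judge(fake_judge, "Summarize the report.", "The report covers Q3 revenue."))
```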
External research workflow for docs, web, APIs - NOT codebase exploration
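The fetch step of such a workflow might look like the stdlib-only sketch below; the URL and user agent are placeholders, and a real workflow would layer caching, rate limiting, and HTML-to-text extraction on top:

```python
# Sketch of the fetch step in an external-research workflow, standard
# library only. The URL below is a placeholder for a docs or API page.
import urllib.request

def fetch_page(url: str, timeout: float = 10.0) -> str:
    """Download a docs/web page and return its body as text."""
    req = urllib.request.Request(url, headers={"User-Agent": "research-bot/0.1"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")

if __name__ == "__main__":
    print(fetch_page("https://example.com")[:200])
```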
Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...
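Automated metrics in this style are often as simple as exact match plus token-overlap F1 averaged over a benchmark set. A self-contained sketch, with illustrative prediction/reference pairs in place of a real dataset:

```python
# Automated-metric sketch: exact match and token-overlap F1 over a small
# benchmark. Real benchmarks would load a dataset and log per-case results.
from collections import Counter

def token_f1(prediction: str, reference: str) -> float:
    pred, ref = prediction.lower().split(), reference.lower().split()
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision, recall = overlap / len(pred), overlap / len(ref)
    return 2 * precision * recall / (precision + recall)

def evaluate(pairs: list[tuple[str, str]]) -> dict:
    em = sum(p.strip() == r.strip() for p, r in pairs) / len(pairs)
    f1 = sum(token_f1(p, r) for p, r in pairs) / len(pairs)
    return {"exact_match": em, "token_f1": f1}

if __name__ == "__main__":
    pairs = [("Paris is the capital", "Paris is the capital of France"), ("42", "42")]
    print(evaluate(pairs))
```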
Design tools that agents can use effectively. Use when creating new tools for agents, debugging tool-related failures, or optimizing existing tool sets.
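Effective tool design mostly comes down to unambiguous names, descriptions, and parameter schemas. A sketch of a hypothetical `search_orders` tool in the common JSON-Schema parameter style; exact field names vary by provider, so treat the shape as illustrative:

```python
# Tool-definition sketch. `search_orders` and its fields are hypothetical;
# the JSON-Schema "properties"/"required" convention is widely used, but
# check your provider's exact schema.
search_tool = {
    "name": "search_orders",
    "description": (
        "Search customer orders by status and date range. "
        "Returns at most `limit` results, newest first."
    ),
    "input_schema": {
        "type": "object",
        "properties": {
            "status": {"type": "string", "enum": ["open", "shipped", "cancelled"]},
            "since": {"type": "string", "description": "ISO 8601 date, e.g. 2024-01-31"},
            "limit": {"type": "integer", "minimum": 1, "maximum": 50, "default": 10},
        },
        "required": ["status"],
    },
}
```

Tight enums, explicit defaults, and a description that states the return behavior give an agent enough to call the tool correctly without trial and error.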
When the user wants to edit, review, or improve existing marketing copy. Also use when the user mentions 'edit this copy,' 'review my copy,' 'copy feedback,' 'proofread,' 'polish this,' 'make this...