40 results (65.2ms) page 1 / 2
guanyang / antigravity-skills-evaluation exact

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...

Kalyanikhandare29 / agent-skills-for-context-engineering-evaluation exact

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...

muratcankoylan / agent-skills-for-context-engineering-evaluation exact

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...
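
For orientation, the evaluation skills above mention LLM-as-judge. A minimal sketch of that pattern in Python, assuming a generic model client; JUDGE_PROMPT and call_model are illustrative stand-ins, not part of any listed skill:

    JUDGE_PROMPT = """Rate the answer from 1 (poor) to 5 (excellent) for factual accuracy.
    Reply with a single integer.

    Question: {question}
    Answer: {answer}"""

    def call_model(prompt: str) -> str:
        """Hypothetical stand-in for a real LLM API call (OpenAI, Anthropic, etc.)."""
        raise NotImplementedError("wire up a real model client here")

    def judge(question: str, answer: str) -> int:
        """Ask a second model to grade an answer, then parse its 1-5 score."""
        reply = call_model(JUDGE_PROMPT.format(question=question, answer=answer))
        score = int(reply.strip())
        if not 1 <= score <= 5:
            raise ValueError(f"judge returned out-of-range score: {score}")
        return score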

Przemocny / strategic-frameworks-use-framework exact

Apply strategic frameworks through facilitated workshop dialogue. Use when the user selected a framework via choose-framework; explicitly requests a specific framework; knows which framework to apply; or...

Przemocny / strategic-frameworks-discover-framework exact

Research and add new strategic frameworks to the system (meta-skill). Use when the user wants to add a framework not in the library; discovered a new framework in their domain; asks "Can you add...

shipshitdev / library-evaluation exact

Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.

itsAR-VR / goatedskills-evaluation exact

Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.

mjunaidca / mjs-agent-skills-evaluation exact

Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.

jackspace / claudeskillz-scholar-evaluation exact

Systematic framework for evaluating scholarly and research work based on the ScholarEval methodology. This skill should be used when assessing research papers, evaluating literature reviews,...

ovachiever / droid-tings-scholar-evaluation exact

Systematic framework for evaluating scholarly and research work based on the ScholarEval methodology. This skill should be used when assessing research papers, evaluating literature reviews,...

Przemocny / strategic-frameworks-choose-framework exact

Select the right strategic framework for your situation through exploratory dialogue. Use when the user describes a problem, decision, or challenge; needs a structured thinking approach; mentions...

daymade / claude-code-skills-promptfoo-evaluation exact

Configures and runs LLM evaluation using the Promptfoo framework. Use when setting up prompt testing, creating evaluation configs (promptfooconfig.yaml), writing Python custom assertions, implementing...
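
For orientation, the entry above mentions Python custom assertions for Promptfoo. A minimal sketch of one, assuming Promptfoo's documented get_assert(output, context) hook; the length threshold and citation check are illustrative assumptions, not part of this skill:

    from typing import Any, Dict

    def get_assert(output: str, context: Dict[str, Any]) -> Dict[str, Any]:
        """Custom assertion: pass if the output is substantial and cites a source."""
        long_enough = len(output.strip()) >= 40    # hypothetical threshold
        cites_source = "http" in output            # crude, illustrative citation check
        passed = long_enough and cites_source
        return {
            "pass": passed,
            "score": 1.0 if passed else 0.0,
            "reason": "ok" if passed else "too short or missing a citation",
        }

Such a file is wired into promptfooconfig.yaml as an assertion of type python with a file:// value.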

halay08 / fullstack-agent-skills-llm-evaluation exact

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...

rmyndharis / antigravity-skills-llm-evaluation exact

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...

404kidwiz / agent-skills-backup-llm-evaluation exact

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...

shishiv / gsd-llm-evaluation exact

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...

ovachiever / droid-tings-llm-evaluation exact

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...
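
For orientation, the llm-evaluation entries above all describe automated metrics. A minimal sketch of one such metric, exact-match accuracy; the normalization and sample data are illustrative assumptions, not any skill's actual implementation:

    def normalize(text: str) -> str:
        """Lowercase and collapse whitespace so formatting noise doesn't count as error."""
        return " ".join(text.lower().split())

    def exact_match_accuracy(predictions: list[str], references: list[str]) -> float:
        """Fraction of predictions that match their reference after normalization."""
        if len(predictions) != len(references):
            raise ValueError("predictions and references must be the same length")
        if not references:
            return 0.0
        hits = sum(normalize(p) == normalize(r) for p, r in zip(predictions, references))
        return hits / len(references)

    # Example: "4" vs "four" misses, the other two match, so accuracy is 2/3.
    print(exact_match_accuracy(["Paris", "4", "blue whale"], ["paris", "four", "Blue  whale"]))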

lyndonkl / claude-evaluation-rubrics exact

Use when explicit quality criteria and scoring scales are needed to evaluate work consistently, compare alternatives objectively, set acceptance thresholds, or reduce subjective bias, or when the user mentions...
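
For orientation, a minimal sketch of the explicit criteria, scoring scales, and acceptance thresholds this rubrics skill describes; the criteria names, weights, and threshold are illustrative assumptions:

    RUBRIC_WEIGHTS = {"correctness": 0.5, "clarity": 0.3, "completeness": 0.2}
    ACCEPTANCE_THRESHOLD = 3.5  # weighted mean on a 1-5 scale

    def weighted_score(scores: dict[str, int]) -> float:
        """Weighted mean of per-criterion scores, each on a 1-5 scale."""
        return sum(RUBRIC_WEIGHTS[criterion] * score for criterion, score in scores.items())

    def accept(scores: dict[str, int]) -> bool:
        """Apply the acceptance threshold to the weighted score."""
        return weighted_score(scores) >= ACCEPTANCE_THRESHOLD

    # Example: 0.5*5 + 0.3*4 + 0.2*3 = 4.3 >= 3.5, so the work is accepted.
    print(accept({"correctness": 5, "clarity": 4, "completeness": 3}))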

K-Dense-AI / claude-scientific-skills-scholar-evaluation exact

Systematically evaluate scholarly work using the ScholarEval framework, providing structured assessment across research quality dimensions including problem formulation, methodology, analysis, and...