Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.
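
A minimal sketch of what such a harness can look like; `run_agent`, the test cases, and the substring check are all illustrative placeholders rather than part of any specific skill.

```python
# Minimal agent-evaluation sketch. `run_agent` and the cases are hypothetical
# placeholders; a real harness would plug in the actual agent under test.
from dataclasses import dataclass
from typing import Callable

@dataclass
class EvalCase:
    prompt: str
    expected_substring: str  # simple pass/fail criterion for illustration

def evaluate(run_agent: Callable[[str], str], cases: list[EvalCase]) -> float:
    """Run every case through the agent and return the pass rate."""
    passed = sum(case.expected_substring in run_agent(case.prompt) for case in cases)
    return passed / len(cases) if cases else 0.0

if __name__ == "__main__":
    cases = [EvalCase("What is 2 + 2?", "4")]
    print(evaluate(lambda prompt: "The answer is 4.", cases))
```
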
Merge multiple fine-tuned models using mergekit to combine capabilities without retraining. Use when creating specialized models by blending domain-specific expertise (math + coding + chat),...
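
As a rough sketch, a linear merge can be driven from a small config; the model ids, weights, and paths below are placeholders, and the YAML keys follow mergekit's documented config format but should be verified against the current mergekit docs.

```python
# Sketch of driving a mergekit linear merge from Python. The model ids,
# weights, and output path are placeholders; the YAML keys follow mergekit's
# documented config format but should be checked against the current docs.
import subprocess
from pathlib import Path

CONFIG = """\
merge_method: linear
dtype: float16
models:
  - model: org/math-finetune        # placeholder model id
    parameters:
      weight: 0.5
  - model: org/code-finetune        # placeholder model id
    parameters:
      weight: 0.5
"""

Path("merge-config.yml").write_text(CONFIG)
# mergekit-yaml is mergekit's CLI entry point: config file, then output directory.
subprocess.run(["mergekit-yaml", "merge-config.yml", "./merged-model"], check=True)
```
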
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, either for the current (most recent) model or as a full model breakdown. Trigger when asked for model-level...
Expert 3D modeling specialist with deep knowledge of topology, UV mapping, game-ready and film-ready pipelines, DCC tool workflows (Blender, Maya, ZBrush, 3ds Max, Houdini), retopology, LOD...
Threat modeling methodologies (STRIDE, DREAD, PASTA, attack trees) for secure architecture design. Use when planning new systems, reviewing architecture security, identifying threats, or assessing...
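
For illustration, per-element STRIDE analysis reduces to enumerating the six threat categories against each component; the components and the example finding below are hypothetical.

```python
# Tiny STRIDE-per-element sketch: enumerate the six STRIDE categories against
# each component of a hypothetical system and record candidate threats.
STRIDE = [
    "Spoofing", "Tampering", "Repudiation",
    "Information disclosure", "Denial of service", "Elevation of privilege",
]

components = ["API gateway", "Auth service", "Postgres database"]  # hypothetical

threat_matrix = {
    component: {category: [] for category in STRIDE} for component in components
}
# Example finding (hypothetical): unauthenticated callers could spoof the gateway.
threat_matrix["API gateway"]["Spoofing"].append("Missing mutual TLS between edge and gateway")

for component, categories in threat_matrix.items():
    for category, findings in categories.items():
        for finding in findings:
            print(f"{component} / {category}: {finding}")
```
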
Expert in business model design - the architecture of how a company creates, delivers, and captures value. Covers business model canvas, revenue model selection, value chain design, and business...
Systematic framework for evaluating scholarly and research work based on the ScholarEval methodology. This skill should be used when assessing research papers, evaluating literature reviews,...
Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...
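
A minimal sketch of the automated-metrics side, assuming a generic `generate` callable stands in for the application under test; real evaluations would layer human feedback and benchmark suites on top.

```python
# Automated-metric sketch for an LLM application. `generate` is a stand-in for
# the system under test; exact match and keyword recall are illustrative metrics.
from typing import Callable

def exact_match(prediction: str, reference: str) -> float:
    return float(prediction.strip().lower() == reference.strip().lower())

def keyword_recall(prediction: str, keywords: list[str]) -> float:
    hits = sum(kw.lower() in prediction.lower() for kw in keywords)
    return hits / len(keywords) if keywords else 0.0

def evaluate(generate: Callable[[str], str], dataset: list[dict]) -> dict:
    em, recall = [], []
    for row in dataset:
        output = generate(row["prompt"])
        em.append(exact_match(output, row["reference"]))
        recall.append(keyword_recall(output, row["keywords"]))
    return {"exact_match": sum(em) / len(em), "keyword_recall": sum(recall) / len(recall)}

if __name__ == "__main__":
    data = [{"prompt": "Capital of France?", "reference": "Paris", "keywords": ["Paris"]}]
    print(evaluate(lambda p: "Paris", data))
```
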
Use when you have implemented an equivariant model and need to verify it correctly respects the intended symmetries. Invoke when the user mentions testing model equivariance, debugging symmetry bugs,...
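
The core check is that f(R·x) matches R·f(x) within tolerance for random group elements; the toy scaling model below is only a stand-in for a real network.

```python
# Rotation-equivariance check sketch: for an equivariant map f, f(x @ R.T) should
# equal f(x) @ R.T within tolerance. The toy model is a stand-in for a real network.
import numpy as np

def random_rotation_2d(rng: np.random.Generator) -> np.ndarray:
    theta = rng.uniform(0, 2 * np.pi)
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def model(points: np.ndarray) -> np.ndarray:
    # Toy equivariant map: uniform scaling commutes with rotation.
    return 2.0 * points

def check_equivariance(f, points: np.ndarray, trials: int = 10, atol: float = 1e-6) -> bool:
    rng = np.random.default_rng(0)
    for _ in range(trials):
        R = random_rotation_2d(rng)
        if not np.allclose(f(points @ R.T), f(points) @ R.T, atol=atol):
            return False
    return True

points = np.random.default_rng(1).normal(size=(5, 2))
print(check_equivariance(model, points))  # True for the toy model
```
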
Create Pydantic models following the multi-model pattern with Base, Create, Update, Response, and InDB variants. Use when defining API request/response schemas, database models, or data validation...
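
A compact sketch of the pattern for a hypothetical `User` resource; the field names are illustrative.

```python
# Multi-model Pydantic sketch for a hypothetical User resource: shared fields in
# Base, write-only fields in Create, all-optional fields in Update, public fields
# in Response, and storage-only fields in InDB.
from typing import Optional
from pydantic import BaseModel

class UserBase(BaseModel):
    email: str
    full_name: str

class UserCreate(UserBase):
    password: str                      # accepted on input, never returned

class UserUpdate(BaseModel):
    email: Optional[str] = None        # every field optional for PATCH-style updates
    full_name: Optional[str] = None
    password: Optional[str] = None

class UserResponse(UserBase):
    id: int

class UserInDB(UserBase):
    id: int
    hashed_password: str               # stored, never exposed via Response
```
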
Fetch trending programming models from OpenRouter rankings. Use when selecting models for multi-model review, updating model recommendations, or researching current AI coding trends. Provides...
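
As a hedged sketch: OpenRouter's public `/api/v1/models` endpoint lists available models; deriving "trending" from the rankings page is outside this snippet, since a stable public rankings API is not assumed here.

```python
# Sketch of pulling the public OpenRouter model catalog. The /api/v1/models
# listing is a public endpoint; how "trending" is derived from the rankings
# page is not covered here and would need a separate source.
import requests

resp = requests.get("https://openrouter.ai/api/v1/models", timeout=30)
resp.raise_for_status()
models = resp.json().get("data", [])

for entry in models[:10]:
    # "id" is the routable model identifier, e.g. vendor/model-name.
    print(entry.get("id"), "-", entry.get("name"))
```
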
Use when designing database schemas, modeling domain entities and relationships clearly, building knowledge graphs or ontologies, creating API data models, defining system boundaries and...
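
A small illustration of naming entities, keys, and relationships before committing to a schema or ORM; the customer/order domain is hypothetical.

```python
# Entity/relationship modeling sketch with plain dataclasses. The customer/order
# domain is hypothetical; the point is naming entities, keys, and the relationship
# between them before committing to a schema or ORM.
from dataclasses import dataclass, field

@dataclass
class Customer:
    id: int
    name: str
    email: str

@dataclass
class Order:
    id: int
    customer_id: int          # many-to-one: each order belongs to one customer
    total_cents: int
    line_items: list[str] = field(default_factory=list)

alice = Customer(id=1, name="Alice", email="alice@example.com")
order = Order(id=100, customer_id=alice.id, total_cents=4599, line_items=["widget"])
print(order)
```
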
Apply Model-First Reasoning (MFR) to code generation tasks. Use when the user requests "model-first", "MFR", "formal modeling before coding", "model then implement", or when tasks involve complex...
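
A toy illustration of the model-first flow, assuming it means stating the domain model and its invariants before writing behavior; the bank-account example is hypothetical.

```python
# Model-first sketch: state the domain model and its invariant first, then write
# behavior that preserves it. The bank-account domain is a hypothetical example.
from dataclasses import dataclass

@dataclass
class Account:
    """Model: an account balance, with the invariant balance_cents >= 0."""
    balance_cents: int

    def check(self) -> None:
        assert self.balance_cents >= 0, "invariant violated: negative balance"

def withdraw(account: Account, amount_cents: int) -> Account:
    """Implementation written against the model: reject withdrawals that break the invariant."""
    if amount_cents > account.balance_cents:
        raise ValueError("insufficient funds")
    updated = Account(account.balance_cents - amount_cents)
    updated.check()
    return updated

print(withdraw(Account(1000), 250))  # Account(balance_cents=750)
```
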