mindrally / skills-meta-prompt exact

Meta-prompting framework for critiquing responses, analyzing solution trajectories, and evaluating AI-generated content quality

ValorVie / custom-skills-eval-harness exact

A formal evaluation framework for Claude Code sessions, implementing eval-driven development (EDD) principles.

secucon / cc-sys-eval-harness exact

Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles

UrlAudit / claude-toolbox-eval-harness exact

Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles

clawdbotborges / upskill-skill exact

Generate, evaluate, and iterate on agent skills using HuggingFace's Upskill tool. Transfer domain expertise from frontier models to smaller/local models.

daffy0208 / ai-dev-standards-quality-auditor exact

Comprehensive quality auditing and evaluation of tools, frameworks, and systems against industry best practices with detailed scoring across 12 critical dimensions

open-horizon-labs / skills-solution-space exact

Explore candidate solutions before committing. Use when you have a problem statement and need to evaluate approaches - band-aid, optimize, reframe, or redesign.

Tomlord1122 / tomtom-skill-deep-research exact

Deep research expert for comprehensive technical investigations. Use when conducting technology evaluations, comparing solutions, analyzing papers, or exploring technical trends.

proffesor-for-testing / agentic-qe-quality-metrics exact

Measure quality effectively with actionable metrics. Use when establishing quality dashboards, defining KPIs, or evaluating test effectiveness.

erichowens / some-claude-skills-research-analyst exact

Conducts thorough landscape research, competitive analysis, best practices evaluation, and evidence-based recommendations. Expert in market research and trend analysis.

synaptiai / agent-capability-standard-compare exact

Compare multiple alternatives using explicit criteria, weighted scoring, and tradeoff analysis. Use when choosing between options, evaluating alternatives, or making decisions.

eddiebe147 / claude-settings-financial-analyst exact

Analyze financial data, build models, evaluate investments, and provide data-driven financial recommendations

Jeffallan / claude-skills-prompt-engineer exact

Use when designing prompts for LLMs, optimizing model performance, building evaluation frameworks, or implementing advanced prompting techniques like chain-of-thought, few-shot learning, or...

jackspace / claudeskillz-peer-review exact

Systematic peer review toolkit. Evaluate methodology, statistics, design, reproducibility, ethics, figure integrity, and reporting standards for manuscript and grant review across disciplines.

ovachiever / droid-tings-peer-review exact

Systematic peer review toolkit. Evaluate methodology, statistics, design, reproducibility, ethics, figure integrity, and reporting standards for manuscript and grant review across disciplines.

synaptiai / agent-capability-standard-constrain exact

Enforce policies, guardrails, and permission boundaries; refuse unsafe actions and apply least privilege. Use when evaluating actions against policies, checking permissions, or reducing scope to...

ovachiever / droid-tings-scientific-critical-thinking exact

Evaluate research rigor. Assess methodology, experimental design, statistical validity, biases, confounding, and evidence quality (GRADE, Cochrane ROB) for critical analysis of scientific claims.

jackspace / claudeskillz-scientific-critical-thinking exact

Evaluate research rigor. Assess methodology, experimental design, statistical validity, biases, confounding, and evidence quality (GRADE, Cochrane ROB) for critical analysis of scientific claims.

dkyazzentwatwa / chatgpt-skills-classification-helper exact

Quick classifier training with automatic model selection, hyperparameter tuning, and comprehensive evaluation metrics.