Search: answers | AgentSkillsRepo

agent-evaluation 0.00

omer-metin / skills-for-antigravity-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 5 ai

ai-agents antigravity antigravity-ide skills

agent-evaluation 0.00

Ianfr13 / claude-code-plugins-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

sickn33 / antigravity-awesome-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 2,844 ai

agentic-skills ai-agents antigravity autonomous-coding

agent-evaluation 0.00

cleodin / antigravity-awesome-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 1 ai

agentic-skills ai-agents antigravity antigravity-ide

agent-evaluation 0.00

automindtechnologie-jpg / ultimate-skill-md-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

halay08 / fullstack-agent-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

solidity 0.00

mindrally / skills-solidity exact

Expert in Solidity smart contract development with security and gas optimization

★ 3 security

agent-evaluation 0.00

shishiv / gsd-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

ngxtm / devkit-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent ai automation claude

agent-evaluation 0.00

404kidwiz / agent-skills-backup-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

agent-evaluation 0.00

ramidamolis-alt / agent-skills-workflows-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

★ 0 ai

create-prd 0.00

richtabor / agent-skills-create-prd exact

Plan features interactively. Asks clarifying questions, then generates a detailed PRD document.

★ 37 ai

claude-code codex skills

research 0.00

sarfraznawaz2005 / agent-skills-collection-research exact

Deep research via Gemini CLI — runs in background sub-agent.

★ 1 ai

Lightweight Implementation Analysis Protocol 0.00

NTCoding / claude-skillz-lightweight-implementation-analysis-protocol exact

This skill should be used when fixing bugs, implementing features, debugging issues, or making code changes. Ensures understanding of code flow before implementation by: (1) Tracing execution path...

★ 138 development

coding 0.00

randerzander / skill-agent-coding exact

Write and execute Python code to process data, analyze scraped content, or perform computations

★ 0 development

feynman 0.00

neurofoo / agent-skills-feynman exact

Feynman Technique for deep learning—explain a concept simply, identify gaps, fill them, then refine. Use when learning something new, testing understanding, or preparing to teach.

★ 35 development

gtd 0.00

realYushi / my-gtd-buddy-gtd exact

GTD mentor for inbox processing, weekly reviews, and coaching. Triggers on "process inbox", "weekly review", "what should I do", "I'm stuck", or /gtd command.

★ 15 ai

agent gtd agents-skills claude-code

cynefin 0.00

neurofoo / agent-skills-cynefin exact

Cynefin sense-making framework categorizing problems as Simple, Complicated, Complex, Chaotic, or Confused to select the right approach. Use when unsure how to tackle a problem.

★ 35 tools

observability-sre 0.00

omer-metin / skills-for-antigravity-observability-sre exact

Site reliability specialist for Prometheus metrics, distributed tracing, alerting strategies, and SLO design. Use when "observability, monitoring, prometheus, grafana, alerting, slo, sli, metrics,...

★ 5 ai

ai-agents antigravity antigravity-ide skills

causal-scientist 0.00

omer-metin / skills-for-antigravity-causal-scientist exact

Causal inference specialist for causal discovery, counterfactual reasoning, and effect estimation. Use when "causal inference, causal discovery, counterfactual, intervention effect, confounder,...

★ 5 ai

ai-agents antigravity antigravity-ide skills
