404kidwiz / agent-skills-backup-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

halay08 / fullstack-agent-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

Ianfr13 / claude-code-plugins-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

automindtechnologie-jpg / ultimate-skill-md-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

WHQ25 / agent-canvas-agent-canvas exact

Draw diagrams, flowcharts, and visualizations on an Excalidraw canvas. Use when the user asks to draw, visualize, create diagrams, or sketch ideas.

ngxtm / devkit-parallel-agents exact

Multi-agent orchestration patterns. Use when multiple independent tasks can run with different domain expertise or when comprehensive analysis requires multiple perspectives.

muratcankoylan / agent-skills-for-context-engineering-hosted-agents exact

This skill should be used when the user asks to "build background agent", "create hosted coding agent", "set up sandboxed execution", "implement multiplayer agent", or mentions background agents,...

ngxtm / devkit-voice-agents exact

Voice agents represent the frontier of AI interaction - humans speaking naturally with AI systems. The challenge isn't just speech recognition and synthesis, it's achieving natural conversation...

parcadei / continuous-claude-v3-sub-agents exact

Create and configure Claude Code sub-agents with custom prompts, tools, and models

parcadei / continuous-claude-v3-planning-agent exact

Planning agent that creates implementation plans and handoffs from conversation context

parcadei / continuous-claude-v3-validate-agent exact

Validation agent that validates plan tech choices against current best practices

parcadei / continuous-claude-v3-research-agent exact

Research agent for external documentation, best practices, and library APIs via MCP tools

hoodini / ai-agents-skills-honest-agent exact

Configure AI coding agents to be honest, objective, and non-sycophantic. Use when the user wants to set up honest feedback, disable people-pleasing behavior, enable objective criticism, or...

unagi / agent-document-reviewer-agent-document-reviewer exact

Review and improve AI agent instruction documents (AGENTS.md, Claude.md, etc.) for quality, clarity, and effectiveness. Use when users request review of agent documentation, ask to evaluate...

erichowens / some-claude-skills-agent-creator exact

Meta-agent for creating new custom agents, skills, and MCP integrations. Expert in agent design, MCP development, skill architecture, and rapid prototyping. Activate on 'create agent', 'new...

ShirokumaLibrary / shirokuma-skills-managing-rules exact

Create, update, and organize Claude Code rules (.claude/rules/) following official best practices. Use when "create rule", "add rule", "ルール作成", "ルール追加", or configuring path-specific conventions.

claude-world / director-mode-lite-agents exact

List all available agents (core + expert)

arielperez82 / agents-and-skills-refactoring-agents exact

Guides ecosystem-level refactors of ap-* agents. Use when agents overlap, responsibilities are unclear, or you need to merge, split, rename, or re-scope agents and formalize collaboration contracts.