3551 results (27.1ms) page 45 / 178
shipshitdev / library-evaluation exact

Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.

itsAR-VR / goatedskills-evaluation exact

Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.

mjunaidca / mjs-agent-skills-evaluation exact

Build evaluation frameworks for agent systems. Use when testing agent performance, validating context engineering choices, or measuring improvements over time.

Kalyanikhandare29 / agent-skills-for-context-engineering-evaluation exact

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...

muratcankoylan / agent-skills-for-context-engineering-evaluation exact

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...

guanyang / antigravity-skills-evaluation exact

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...

richard-gyiko / which-llm-which-llm exact

Select optimal LLM(s) for a task based on skill requirements, budget, and constraints. Uses the `which-llm` CLI to query Artificial Analysis benchmarks enriched with capability data from models.dev.

julianobarbosa / claude-code-skills-dependency-track-skill exact

Comprehensive guide for Dependency-Track - Software Composition Analysis (SCA) and SBOM management platform. USE WHEN deploying Dependency-Track, integrating with CI/CD pipelines, configuring...

oaustegard / claude-skills-controlling-spotify exact

Control Spotify playback and manage playlists via MCP server. Use when user requests playing music, controlling Spotify, creating playlists, searching songs, or managing their Spotify library.

williamzujkowski / cognitive-toolworks-oscal-ssp-validator exact

Validates OSCAL System Security Plan documents against schemas, profiles, and cross-reference requirements with tiered validation depth.

williamzujkowski / cognitive-toolworks-redis-database-architect exact

Design Redis architectures with caching patterns, data structures, eviction policies, persistence (RDB/AOF), and high availability (Sentinel/Cluster).

eugenepyvovarov / mcpbundler-agent-skills-marketplace-solana-kit exact

Comprehensive guide for building Solana apps with @solana/kit (web3.js 2.0). Use when you need modern RPC/subscriptions, transaction building, signing, and program interactions in JavaScript/TypeScript.

williamzujkowski / cognitive-toolworks-frontend-performance-optimizer exact

Analyzes and optimizes frontend performance using Core Web Vitals, bundle analysis, lazy loading, image optimization, and caching strategies

namesreallyblank / clorch-agentica-sdk exact

Build Python agents with Agentica SDK - @agentic decorator, spawn(), persistence, MCP integration

parcadei / continuous-claude-v3-agentica-sdk exact

Build Python agents with Agentica SDK - @agentic decorator, spawn(), persistence, MCP integration

alinaqi / claude-bootstrap-team-coordination exact

Multi-person projects - shared state, todo claiming, handoffs

synaptiai / agent-capability-standard-search exact

Find relevant items under uncertainty across repositories, databases, web sources, or any searchable corpus. Use when exploring unknown territory, finding related information, or discovering...