This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge,...
Model Context Protocol (MCP) server development and tool management. Languages: Python, TypeScript. Capabilities: build MCP servers, integrate external APIs, discover/execute MCP tools, manage...
A formal evaluation framework for Claude Code sessions, implementing eval-driven development (EDD) principles.
Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles
Formal evaluation framework for Claude Code sessions implementing eval-driven development (EDD) principles
Generic multi-domain research framework with domain-aware scheduling. Model-agnostic and domain-agnostic - perform comprehensive research on any topic.
Deep research and market validation for app ideas. Use when starting a new project, validating an idea, or when the user says "research my idea", "validate my app", or "help me start a new project".
OODA loop decision framework (Observe, Orient, Decide, Act). Use for complex decisions, problem-solving, unclear situations, or when someone is jumping to solutions without analysis.
Guide for building AI agents in Go using adk-go framework. Use when creating agents with Gemini or OpenAI models, implementing tools, running agents with streaming, or managing conversation sessions.
Master AI-powered test automation with modern frameworks,
Master AI-powered test automation with modern frameworks,
Master AI-powered test automation with modern frameworks,
Master AI-powered test automation with modern frameworks,
Generate 30+ LinkedIn posts that attract your target prospects. Creates industry insights, thought leadership, engagement prompts, and comment strategies. Use when building personal brand to...
Build and execute multi-step prompt chains for complex tasks
AI-first application patterns, LLM testing, prompt management
Visual concepts for album artwork and AI art generation prompts
Make better decisions using structured frameworks and mental models
Analyze AI/ML technical content (papers, articles, blog posts) and extract actionable insights filtered through enterprise AI engineering lens. Use when user provides URL/document for AI/ML...
Refactor legacy codebases, migrate outdated frameworks, and