Expert in Langfuse - the open-source LLM observability platform. Covers tracing, prompt management, evaluation, datasets, and integration with LangChain, LlamaIndex, and OpenAI. Essential for...
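For orientation, a minimal sketch of manual tracing with the Langfuse TypeScript SDK; the trace name, user id, model string, and messages are illustrative, and the client reads its keys from the environment:

```ts
import { Langfuse } from "langfuse";

// Assumes LANGFUSE_PUBLIC_KEY and LANGFUSE_SECRET_KEY are set in the environment.
const langfuse = new Langfuse();

// One trace per user request; record the model call as a generation on it.
const trace = langfuse.trace({ name: "support-chat", userId: "user-123" });
const generation = trace.generation({
  name: "answer",
  model: "gpt-4o-mini",
  input: [{ role: "user", content: "How do I reset my password?" }],
});
// ... call the model here ...
generation.end({ output: "Use the reset link on the sign-in page." });
await langfuse.flushAsync(); // make sure events are sent before exit
```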
Technical leadership guidance for engineering teams, architecture decisions, and technology strategy. Includes tech debt analyzer, team scaling calculator, engineering metrics frameworks,...
Create and run evaluations on your LLM outputs. Use when testing prompts, measuring quality, comparing models, or creating evaluation datasets.
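As a sketch of what such an evaluation might look like, a tiny exact-match runner over a hypothetical dataset; `generate` stands in for whatever model call is under test:

```ts
type EvalCase = { input: string; expected: string };

const dataset: EvalCase[] = [
  { input: "2 + 2 =", expected: "4" },
  { input: "Capital of France?", expected: "Paris" },
];

async function runEval(generate: (input: string) => Promise<string>): Promise<void> {
  let passed = 0;
  for (const c of dataset) {
    const output = await generate(c.input);
    if (output.trim() === c.expected) passed++;
  }
  console.log(`exact-match: ${passed}/${dataset.length}`);
}
```

Real evaluations would swap exact match for model-graded or statistical scoring, but the dataset-plus-scorer shape stays the same.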
Remove AI-generated code slop from the current branch. Use after writing code to clean up unnecessary comments, defensive checks, and inconsistent style.
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities....
Sentry error monitoring and performance tracing patterns for Next.js applications.
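A minimal client-side init, roughly what such patterns build on; the DSN is a placeholder and the sample rate is illustrative:

```ts
// sentry.client.config.ts
import * as Sentry from "@sentry/nextjs";

Sentry.init({
  dsn: "https://examplePublicKey@o0.ingest.sentry.io/0",
  // Capture a fraction of transactions for performance tracing.
  tracesSampleRate: 0.1,
});
```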
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots,...
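A sketch of the kind of flow this covers, assuming Playwright as the driver; the URL and selectors are illustrative:

```ts
import { chromium } from "playwright";

const browser = await chromium.launch();
const page = await browser.newPage();
await page.goto("https://example.com/login");
await page.fill("#email", "user@example.com");      // fill a form field
await page.click("button[type=submit]");            // submit the form
await page.screenshot({ path: "after-login.png" }); // capture the result
await browser.close();
```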
Use Bun instead of Node.js, npm, pnpm, or vite. Provides command mappings, Bun-specific APIs, and development patterns.
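A few of the documented equivalents, plus a Bun-specific API in action:

```ts
// Command mappings:
//   npm install       -> bun install
//   npm run <script>  -> bun run <script>
//   npx <tool>        -> bunx <tool>
//   node app.js       -> bun app.js
//   npm test          -> bun test

// Bun-specific API: a minimal HTTP server via Bun.serve.
const server = Bun.serve({
  port: 3000,
  fetch(req) {
    return new Response("Hello from Bun");
  },
});
console.log(`Listening on http://localhost:${server.port}`);
```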
Generate a complete set of favicons from a source image and update HTML. Use when setting up favicons for a web project.
Run accessibility and visual design review on components. Use when reviewing UI code for WCAG compliance and design issues.
Refactor CLAUDE.md files to follow progressive disclosure principles. Use when CLAUDE.md is too long or disorganized.
Run knip to find and remove unused files, dependencies, and exports. Use for cleaning up dead code and unused dependencies.
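For context, knip reads an optional config; a minimal TypeScript variant might look like the following, with entry and project globs that depend on the repository layout (treat the exact shape as an assumption against your knip version):

```ts
// knip.ts
import type { KnipConfig } from "knip";

const config: KnipConfig = {
  entry: ["src/index.ts"],
  project: ["src/**/*.ts"],
};

export default config;
```

Running `npx knip` then reports unused files, dependencies, and exports against those globs.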
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge,...
Simplify and refine recently modified code for clarity and consistency. Use after writing code to improve readability without changing functionality.
LLM observability platform for tracing, evaluation, and monitoring. Use when debugging LLM applications, evaluating model outputs against datasets, monitoring production systems, or building...
Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or...
Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running evaluations on datasets, or monitoring...
Evaluates agent skills against Anthropic's best practices. Use when asked to review, evaluate, assess, or audit a skill for quality. Analyzes SKILL.md structure, naming conventions, description...