Search: llm-eval | AgentSkillsRepo

advanced-evaluation 0.00

Kalyanikhandare29 / agent-skills-for-context-engineering-advanced-evaluation exact

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise...

★ 0 ai

accept-language agentic-context-enginnering ai ai-agents

langchain 0.00

Ianfr13 / claude-code-plugins-langchain exact

Framework for building LLM-powered applications with agents, chains, and RAG. Supports multiple providers (OpenAI, Anthropic, Google), 500+ integrations, ReAct agents, tool calling, memory...

★ 0 ai

langchain 0.00

ovachiever / droid-tings-langchain exact

Framework for building LLM-powered applications with agents, chains, and RAG. Supports multiple providers (OpenAI, Anthropic, Google), 500+ integrations, ReAct agents, tool calling, memory...

★ 19 ai

langchain 0.00

zechenzhangAGI / ai-research-skills-langchain exact

Framework for building LLM-powered applications with agents, chains, and RAG. Supports multiple providers (OpenAI, Anthropic, Google), 500+ integrations, ReAct agents, tool calling, memory...

★ 1,712 ai

ai ai-research claude claude-code

biomni 0.00

ovachiever / droid-tings-biomni exact

Autonomous biomedical AI agent framework for executing complex research tasks across genomics, drug discovery, molecular biology, and clinical analysis. Use this skill when conducting multi-step...

★ 19 ai

biomni 0.00

jackspace / claudeskillz-biomni exact

Autonomous biomedical AI agent framework for executing complex research tasks across genomics, drug discovery, molecular biology, and clinical analysis. Use this skill when conducting multi-step...

★ 8 ai

agentic-coding ai-skills automation bioinformatics

dspy-evaluation-suite 0.00

OmidZamani / dspy-skills-dspy-evaluation-suite exact

This skill should be used when the user asks to "evaluate a DSPy program", "test my DSPy module", "measure performance", "create evaluation metrics", "use answer_exact_match or SemanticF1",...

★ 20 ai

agent-skills claude-code claude-skills dspy

model-evaluation 0.00

cosmix / loom-model-evaluation exact

Evaluates machine learning models for performance, fairness, and reliability using appropriate metrics and validation techniques. Covers training debugging, hyperparameter tuning, and production...

★ 6 ai

agentic-coding agents claude claude-code

guidance 0.00

ovachiever / droid-tings-guidance exact

Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance - Microsoft Research's constrained...

★ 19 ai

guidance 0.00

zechenzhangAGI / ai-research-skills-guidance exact

Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance - Microsoft Research's constrained...

★ 1,712 ai

ai ai-research claude claude-code

deepfake-detection 0.00

dirnbauer / webconsulting-skills-deepfake-detection exact

Multimodal media authentication and deepfake forensics. PRNU analysis, IGH classification, DQ detection, semantic forensics, and LLM-augmented sensemaking for the post-empirical era.

★ 3 ai

ai-engineer 0.00

halay08 / fullstack-agent-skills-ai-engineer exact

Build production-ready LLM applications, advanced RAG systems, and

★ 0 ai

ai-engineer 0.00

rmyndharis / antigravity-skills-ai-engineer exact

Build production-ready LLM applications, advanced RAG systems, and

★ 187 ai

ai-engineer 0.00

404kidwiz / agent-skills-backup-ai-engineer exact

Build production-ready LLM applications, advanced RAG systems, and

★ 0 ai

dspy-miprov2-optimizer 0.00

OmidZamani / dspy-skills-dspy-miprov2-optimizer exact

This skill should be used when the user asks to "optimize a DSPy program", "use MIPROv2", "tune instructions and demos", "get best DSPy performance", "run Bayesian optimization", mentions...

★ 20 ai

agent-skills claude-code claude-skills dspy

skill-permissions 0.00

guo-yu / skills-skill-permissions exact

Skill permission analysis, one-time authorization, analyze skill permissions, batch authorization

★ 217 ai

claude-code llm skills

dspy-output-refinement-constraints 0.00

OmidZamani / dspy-skills-dspy-output-refinement-constraints exact

This skill should be used when the user asks to "refine DSPy outputs", "enforce constraints", "use dspy.Refine", "select best output", "use dspy.BestOfN", mentions "output validation", "constraint...

★ 20 ai

agent-skills claude-code claude-skills dspy

dspy-simba-optimizer 0.00

OmidZamani / dspy-skills-dspy-simba-optimizer exact

This skill should be used when the user asks to "optimize with SIMBA", "use Bayesian optimization", "optimize agents with custom feedback", mentions "SIMBA optimizer", "mini-batch optimization",...

★ 20 ai

agent-skills claude-code claude-skills dspy

claude-md-writer 0.00

testacode / llm-toolkit-claude-md-writer exact

Escribe y mejora CLAUDE.md siguiendo best practices. Usa cuando el usuario diga "crear CLAUDE.md", "mejorar CLAUDE.md", "actualizar CLAUDE.md", "revisar CLAUDE.md", "escribir instrucciones del...

★ 0 ai

agents ai-tools claude-code llm

local-skills-mcp-guide 0.00

kdpa-llc / local-skills-mcp-local-skills-mcp-guide exact

Expert guide for understanding the Local Skills MCP server repository - its structure, architecture, and implementation. Use when exploring this MCP server's codebase, understanding how Local...

★ 12 ai

agent ai ai-agent anthropic

Confirm

Submit a Skill