Search: llm-eval | AgentSkillsRepo

fastapi-code-review 0.00

existential-birds / beagle-fastapi-code-review exact

Reviews FastAPI code for routing patterns, dependency injection, validation, and async handlers. Use when reviewing FastAPI apps, checking APIRouter setup, Depends() usage, or response models.

★ 15 ai

ai-agents bubbletea claude-code claude-code-plugin

shadcn-code-review 0.00

existential-birds / beagle-shadcn-code-review exact

Reviews shadcn/ui components for CVA patterns, composition with asChild, accessibility states, and data-slot usage. Use when reviewing React components using shadcn/ui, Radix primitives, or...

★ 15 ai

ai-agents bubbletea claude-code claude-code-plugin

ai-elements 0.00

existential-birds / beagle-ai-elements exact

Vercel AI Elements for workflow UI components. Use when building chat interfaces, displaying tool execution, showing reasoning/thinking, or creating job queues. Triggers on ai-elements, Queue,...

★ 15 ai

ai-agents bubbletea claude-code claude-code-plugin

domain-hunter 0.00

ReScienceLab / opc-skills-domain-hunter exact

Search domains, compare prices, find promo codes, get purchase recommendations. Use when user wants to buy a domain, check prices, or find domain deals.

★ 32 ai

agent-skills opc ai-tools claude-code

python-code-review 0.00

existential-birds / beagle-python-code-review exact

Reviews Python code for type safety, async patterns, error handling, and common mistakes. Use when reviewing .py files, checking type hints, async/await usage, or exception handling.

★ 15 ai

ai-agents bubbletea claude-code claude-code-plugin

go-code-review 0.00

existential-birds / beagle-go-code-review exact

Reviews Go code for idiomatic patterns, error handling, concurrency safety, and common mistakes. Use when reviewing .go files, checking error handling, goroutine usage, or interface design.

★ 15 ai

ai-agents bubbletea claude-code claude-code-plugin

requesthunt 0.00

ReScienceLab / opc-skills-requesthunt exact

Generate user demand research reports from real user feedback. Scrape and analyze feature requests, complaints, and questions from Reddit, X, and GitHub.

★ 32 ai

agent-skills opc ai-tools claude-code

adr-writing 0.00

existential-birds / beagle-adr-writing exact

Writes Architectural Decision Records following the MADR template. Applies Definition of Done criteria and marks gaps for later completion. Use when generating ADR documents from extracted decisions.

★ 15 ai

ai-agents bubbletea claude-code claude-code-plugin

paper-writer 0.00

grahama1970 / agent-skills-paper-writer exact

★ 0 tools

blip-2-vision-language 0.00

zechenzhangAGI / ai-research-skills-blip-2-vision-language exact

Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text retrieval, or multimodal chat with...

★ 1,712 ai

ai ai-research claude claude-code

codex-readiness-integration-test 0.00

openai / skills-codex-readiness-integration-test exact

Run the Codex Readiness integration test. Use when you need an end-to-end agentic loop with build/test scoring.

★ 1,908 development

deepeval 0.00

Hisham-Hussein / claude-forge-deepeval exact

Use when discussing or working with DeepEval (the Python AI evaluation framework).

★ 0 ai

agentic-development 0.00

alinaqi / claude-bootstrap-agentic-development exact

Build AI agents with Pydantic AI (Python) and Claude SDK (Node.js).

★ 457 ai

ai-coding claude claude-code developer-tools

cohere-langgraph 0.00

RSHVR / unofficial-cohere-best-practices-cohere-langgraph exact

Cohere LangGraph agents reference for building ReAct agents, multi-tool workflows, agents with memory, and human-in-the-loop patterns. Covers both prebuilt and custom agent architectures.

★ 1 ai

agent claude-code cohere cohere-ai

go 0.00

maragudk / skills-go exact

Guide for how to develop Go apps and modules/libraries. Always use this skill when reading or writing Go code.

★ 29 development

evaluating-llms-harness 0.00

ovachiever / droid-tings-evaluating-llms-harness exact

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking...

★ 19 ai

evaluating-llms-harness 0.00

zechenzhangAGI / ai-research-skills-evaluating-llms-harness exact

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting academic results, or tracking...

★ 1,712 ai

ai ai-research claude claude-code

code-review 0.00

llama-farm / llamafarm-code-review exact

Comprehensive code review for diffs. Analyzes changed code for security vulnerabilities, anti-patterns, and quality issues. Auto-detects domain (frontend/backend) from file paths.

★ 810 ai

ai edge edge-computing llama3

nemo-evaluator-sdk 0.00

zechenzhangAGI / ai-research-skills-nemo-evaluator-sdk exact

Evaluates LLMs across 100+ benchmarks from 18+ harnesses (MMLU, HumanEval, GSM8K, safety, VLM) with multi-backend execution. Use when needing scalable evaluation on local Docker, Slurm HPC, or...

★ 1,712 ai

ai ai-research claude claude-code

langfuse-observability 0.00

phrazzld / claude-config-langfuse-observability exact

★ 2 tools
