automindtechnologie-jpg / ultimate-skill-md-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

ngxtm / devkit-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

Ianfr13 / claude-code-plugins-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

404kidwiz / agent-skills-backup-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

ramidamolis-alt / agent-skills-workflows-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

sickn33 / antigravity-awesome-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

shishiv / gsd-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

onewave-ai / claude-skills-skill-composer-studio exact

CHAIN multiple existing skills into custom multi-step workflows. Programmable skill combinations with automatic handoffs. Create composite skills from building blocks with conditional logic.

omer-metin / skills-for-antigravity-privacy-guardian exact

Security and privacy specialist for differential privacy, encryption, and compliance. Use when "privacy, encryption, differential privacy, PII, GDPR, CCPA, access control, audit trail, data...

omer-metin / skills-for-antigravity-vector-specialist exact

Embedding and vector retrieval expert for semantic search. Use when "vector search, embeddings, semantic search, qdrant, pgvector, similarity search, reranking, hybrid retrieval, embeddings,...

rmyndharis / antigravity-skills-monorepo-architect exact

Expert in monorepo architecture, build systems, and dependency management at scale. Masters Nx, Turborepo, Bazel, and Lerna for efficient multi-project development. Use PROACTIVELY for monorepo setup,

halay08 / fullstack-agent-skills-monorepo-architect exact

Expert in monorepo architecture, build systems, and dependency management at scale. Masters Nx, Turborepo, Bazel, and Lerna for efficient multi-project development. Use PROACTIVELY for monorepo setup,

404kidwiz / agent-skills-backup-monorepo-architect exact

Expert in monorepo architecture, build systems, and dependency management at scale. Masters Nx, Turborepo, Bazel, and Lerna for efficient multi-project development. Use PROACTIVELY for monorepo setup,

shishiv / gsd-monorepo-architect exact

Expert in monorepo architecture, build systems, and dependency management at scale. Masters Nx, Turborepo, Bazel, and Lerna for efficient multi-project development. Use PROACTIVELY for monorepo setup,

omer-metin / skills-for-antigravity-ml-memory exact

Memory systems specialist for hierarchical memory, consolidation, and outcome-based learning. Use when "memory system, memory hierarchy, memory consolidation, forgetting strategy, salience learning,...

RSHVR / unofficial-cohere-best-practices-cohere-cookbooks exact

Official Cohere cookbooks and tutorials for production patterns. Links to RAG implementations, agent workflows, enterprise integrations, and real-world use cases from the Cohere developer...

omer-metin / skills-for-antigravity-causal-scientist exact

Causal inference specialist for causal discovery, counterfactual reasoning, and effect estimation. Use when "causal inference, causal discovery, counterfactual, intervention effect, confounder,...

omer-metin / skills-for-antigravity-graph-engineer exact

Knowledge graph specialist for entity and causal relationship modeling. Use when "knowledge graph, graph database, falkordb, neo4j, cypher query, entity resolution, causal relationships, graph...

aktsmm / agent-skills-book-writing-workspace exact

Set up a complete book writing workspace with AI agents, instructions, prompts, and scripts. Use when users want to create a new book/technical writing project with Markdown + Re:VIEW + PDF output...

halay08 / fullstack-agent-skills-autonomous-agents exact

Autonomous agents are AI systems that can independently decompose goals, plan actions, execute tools, and self-correct without constant human guidance. The challenge isn't making them capable -...