ramidamolis-alt / agent-skills-workflows-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

automindtechnologie-jpg / ultimate-skill-md-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

ngxtm / devkit-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

sickn33 / antigravity-awesome-skills-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

shishiv / gsd-agent-evaluation exact

Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...

hardw00t / ai-security-arsenal-llm-security exact

LLM and AI application security testing skill for prompt injection, jailbreaking, and AI system vulnerabilities. This skill should be used when testing AI/ML applications for security issues,...

Charon-Fan / agent-playbook-qa-expert exact

Quality assurance expert for testing strategies and quality gates. Use when planning test coverage, setting up QA processes, or improving quality standards.

proffesor-for-testing / agentic-qe-swarm-advanced exact

Advanced swarm orchestration patterns for research, development, testing, and complex distributed workflows

erichowens / some-claude-skills-github-actions-pipeline-builder exact

Build production CI/CD pipelines with GitHub Actions. Implements matrix builds, caching, deployments, testing, security scanning. Use for automated testing, deployments, release workflows....

hardw00t / ai-security-arsenal-dast-automation exact

Automated Dynamic Application Security Testing (DAST) using Playwright MCP for browser-based security scanning. Performs blackbox/greybox security testing on single or multiple domains with...

daffy0208 / ai-dev-standards-quality-assurance exact

Comprehensive quality assurance combining testing strategy, code quality enforcement, and validation gates. Consolidated from testing-strategist, code-quality-enforcer, and validation-gate-checker.

ravnhq / ai-toolkit-tech-vitest exact

Vitest-specific testing utilities, mocking, and assertion patterns. Extends platform-testing with Vitest rules. Use when writing tests with Vitest.

itsAR-VR / goatedskills-qa-regression exact

Automate QA regression testing with reusable test skills. Create login flows, dashboard checks, user creation, and other common test scenarios that run consistently.

skillcreatorai / ai-agent-skills-qa-regression exact

Automate QA regression testing with reusable test skills. Create login flows, dashboard checks, user creation, and other common test scenarios that run consistently.

ngxtm / devkit-qa-regression exact

Automate QA regression testing with reusable test skills. Create login flows, dashboard checks, user creation, and other common test scenarios that run consistently.

omer-metin / skills-for-antigravity-chaos-engineer exact

Resilience testing specialist for failure injection, game day planning, and building confidence in system reliability. Use when "chaos engineering, resilience testing, failure injection, game day,...

ci-cd 0.00
cosmix / loom-ci-cd exact

Designs and implements CI/CD pipelines for automated testing, building, deployment, and security scanning across multiple platforms. Covers pipeline optimization, test integration, artifact...

antfu / skills-vitest exact

Vitest fast unit testing framework powered by Vite with Jest-compatible API. Use when writing tests, mocking, configuring coverage, or working with test filtering and fixtures.

omer-metin / skills-for-antigravity-ai-ad-creative exact

The intersection of AI generation and performance marketing. This skill covers creating ad creatives at scale using AI tools—from static images to video ads to dynamic creative optimization—while...

proffesor-for-testing / agentic-qe-github-release-management exact

Comprehensive GitHub release orchestration with AI swarm coordination for automated versioning, testing, deployment, and rollback management