Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...
Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world...
Test PydanticAI agents using TestModel, FunctionModel, VCR cassettes, and inline snapshots. Use when writing unit tests, mocking LLM responses, or recording API interactions.
Assist with core product management activities including writing PRDs, analyzing features, synthesizing user research, planning roadmaps, and communicating product decisions. Use when you need...
A set of resources to help me write all kinds of internal communications, using the formats that my company likes to use. Claude should use this skill whenever asked to write some sort of internal...
>
Manage Kapso workflows, AI agents, functions, and databases. Edit workflow graphs, configure triggers, manage executions, create functions, and perform database CRUD. Use when building...
Build agents that generate creative content including music, memes, podcasts, and multimedia. Covers generative models, content synthesis, style transfer, and creative control. Use when building...
Vite build tool and dev server for fast modern web development.
Webpack module bundler configuration and optimization.
Modern JavaScript conventions and ES2020+ features.
Internationalization and localization practices for multilingual applications.
MUI X Charts for data visualization in React applications.
When the user wants to create, optimize, or analyze a referral program, affiliate program, or word-of-mouth strategy. Also use when the user mentions 'referral,' 'affiliate,' 'ambassador,' 'word...
Semantic HTML best practices for web development.
Clear and effective technical communication for documentation, code comments, and team collaboration.
Schema validation for JavaScript objects with expressive API.
RTK Query for data fetching and caching in Redux applications.
React Native mobile development patterns and best practices.
TypeScript-first schema validation with static type inference.