This skill analyzes code for design quality improvements across 8 dimensions: Naming, Object Calisthenics, Coupling & Cohesion, Immutability, Domain Integrity, Type System, Simplicity, and...
Use when making high-stakes decisions under uncertainty that require stakeholder buy-in. Invoke when evaluating strategic options (build vs buy, market entry, resource allocation), quantifying...
Expert in designing effective prompts for LLM-powered applications. Masters prompt structure, context management, output formatting, and prompt evaluation. Use when "prompt engineering, system...
Evaluate routine and class design quality using Code Complete checklists (43 items). Use when designing routines or classes, reviewing class interfaces, choosing between inheritance and...
Apply Nassim Taleb's Skin in the Game principles for evaluating trust, designing incentives, and making ethical decisions. Use when assessing advisors, structuring partnerships, evaluating...
Assess a codebase's readiness for autonomous agent development and provide tailored recommendations. Use when asked to evaluate how well a project supports unattended agent execution, assess...
Conducts comprehensive frontend design reviews covering UI/UX design quality, design system validation, accessibility compliance, responsive design patterns, component library architecture, and...
AI-powered testability assessment using 10 principles of intrinsic testability with Playwright and optional Vibium integration. Evaluates web applications against Observability, Controllability,...
Visual design intelligence and UI aesthetics. Integrates: chrome-devtools, ai-multimodal, media-processing. Capabilities: design analysis, visual hierarchy, color theory, typography,...
Implement comprehensive observability for LLM applications including tracing (Langfuse/Helicone), cost tracking, token optimization, RAG evaluation metrics (RAGAS), hallucination detection, and...
Code review practices with technical rigor and verification gates. Practices: receiving feedback, requesting reviews, verification gates. Capabilities: technical evaluation, evidence-based claims,...
Technical decision-making frameworks - trade-off evaluation, reversibility analysis, and second-order thinking for better engineering choicesUse when "should we, which is better, trade-off,...
Find and evaluate influencers for marketing partnerships
Build and run evaluators for AI/LLM applications using Phoenix.
Design and evaluate compression strategies for long-running sessions
Use when discussing or working with DeepEval (the python AI evaluation framework)
Property analysis, investment evaluation, and market research for real estate professionals and investors
Best practices for scikit-learn machine learning, model development, evaluation, and deployment in Python
Insurance policy analysis, claims evaluation, coverage assessment, and risk management for individuals and businesses
Richard Rumelt's framework for crafting and evaluating strategy through diagnosis, guiding policy, and coherent actions.