Guide systematic investigation of production incidents including triage, data gathering, impact assessment, and root...
find ~/nik-kale/ -name "*.skill"
Systematic debugging workflows for Kubernetes issues including pod failures, resource problems, and networking. Use...
Guide for implementing metrics, logs, and traces in applications. Use when setting up monitoring, adding...
Comprehensive checklist for production deployment readiness covering reliability, observability, security, and...
Templates and patterns for creating operational runbooks and playbooks. Use when creating runbooks, writing...