error-detector

by @404kidwiz in Development

# Install this skill:

npx skills add 404kidwiz/claude-supercode-skills --skill "error-detector"

Install specific skill from multi-skill repository

# Description

Advanced error analysis and pattern detection specialist for identifying, analyzing, and preventing software errors

# SKILL.md

name: error-detector
description: Advanced error analysis and pattern detection specialist for identifying, analyzing, and preventing software errors

Error Detector Skill

Purpose

Provides error analysis and pattern detection expertise specializing in proactive identification of software defects, code analysis, and system behavior monitoring. Identifies, analyzes, and helps prevent software errors through static and dynamic analysis techniques.

When to Use

Performing static code analysis and anti-pattern detection
Analyzing runtime errors and exception patterns
Detecting memory leaks and performance bottlenecks
Monitoring and analyzing error logs
Identifying security vulnerabilities through code patterns
Conducting proactive error prevention analysis

Overview

Specialized in error analysis, pattern detection, and proactive identification of software defects through code analysis, log monitoring, and system behavior analysis.

Error Detection Methodologies

Static Analysis

Code pattern recognition
Anti-pattern identification
Complexity analysis
Security vulnerability detection
Performance bottleneck identification

Dynamic Analysis

Runtime error monitoring
Exception pattern analysis
Memory leak detection
Performance profiling
Resource utilization tracking

Log-Based Analysis

# Example patterns for error detection
grep -r "ERROR\|FATAL\|CRITICAL" logs/ --include="*.log" --include="*.txt"
grep -r "exception\|error\|failed" src/ --include="*.js" --include="*.py" --include="*.java"
grep -r "TODO\|FIXME\|HACK" src/ --include="*.*" --exclude-dir=node_modules

Error Categories & Patterns

Common Programming Errors

Null pointer exceptions
Array index out of bounds
Type conversion errors
Resource leak issues
Concurrency problems

Logic Errors

Off-by-one errors
Incorrect conditionals
Loop termination issues
State management problems
Data validation failures

Performance Errors

Inefficient algorithms
Memory optimization issues
Database query problems
Network timeout handling
Resource contention

Advanced Detection Techniques

Machine Learning-Based Detection

Anomaly detection in system behavior
Pattern recognition in error logs
Predictive failure modeling
Classification of error types
Automated root cause analysis

Statistical Analysis

Error frequency distribution
Time series analysis of failures
Correlation analysis between components
Regression testing failure patterns
Performance degradation detection

Code Complexity Metrics

Cyclomatic complexity analysis
Cognitive complexity assessment
Maintainability index calculation
Technical debt quantification
Code duplication detection

Error Analysis Frameworks

Root Cause Analysis (RCA)

Five Whys methodology
Fishbone diagram analysis
Pareto analysis for prioritization
Fault tree analysis
Change impact assessment

Error Classification Systems

Severity categorization
Priority assignment frameworks
Impact assessment matrices
Frequency-based prioritization
Business risk evaluation

Pattern Recognition

Repetitive error identification
Error clustering algorithms
Sequence pattern analysis
Correlation detection
Temporal pattern analysis

Monitoring & Alerting

Real-Time Monitoring

System health dashboards
Error rate monitoring
Performance threshold alerts
Log aggregation and analysis
Automated incident response

Predictive Analysis

Failure prediction models
Early warning systems
Trend analysis and forecasting
Capacity planning alerts
Proactive maintenance scheduling

Logging Best Practices

Structured logging implementation
Log level optimization
Sensitive data protection
Log rotation policies
Centralized log management

Error Prevention Strategies

Code Quality Improvement

Peer review processes
Automated testing coverage
Static analysis tools integration
Code style enforcement
Documentation standards

Development Process Optimization

Test-driven development (TDD)
Continuous integration practices
Automated deployment pipelines
Rollback procedures
Feature flag implementation

System Design Patterns

Circuit breaker patterns
Retry mechanisms
Graceful degradation
Fallback systems
Redundancy implementation

Error Detection Tools & Integration

Static Analysis Tools

ESLint for JavaScript/TypeScript
Pylint for Python
SonarQube for multi-language analysis
Checkstyle for Java
FxCop for C#

Dynamic Monitoring Tools

Application Performance Monitoring (APM)
Error tracking services (Sentry, Bugsnag)
Log management systems (ELK stack)
Distributed tracing tools
Infrastructure monitoring

Custom Detection Scripts

Error pattern matching
Anomaly detection algorithms
Automated regression testing
Performance benchmarking
Data validation checks

Error Response & Resolution

Incident Management

Error triage procedures
Escalation protocols
Communication templates
Resolution tracking
Post-incident reviews

Automated Recovery

Self-healing mechanisms
Automatic restart procedures
Failover systems
Data recovery processes
Service restoration workflows

Knowledge Management

Error documentation databases
Solution repositories
Best practice libraries
Training materials
Lessons learned archives

Specific Domain Expertise

Web Application Errors

HTTP error code analysis
JavaScript runtime errors
API failure patterns
Database connection issues
Frontend performance problems

Mobile Application Errors

Device-specific issues
Network connectivity problems
App store rejection patterns
Battery usage optimization
Memory management issues

Backend System Errors

Database transaction failures
Message queue processing errors
Authentication and authorization issues
Microservices communication problems
Resource exhaustion scenarios

Reporting & Analytics

Error Metrics

Mean Time To Detection (MTTD)
Mean Time To Resolution (MTTR)
Error frequency trends
Resolution effectiveness
Preventive action impact

Quality Dashboards

Real-time error monitoring
Historical trend analysis
Team performance metrics
System health indicators
Compliance status tracking

Deliverables

Analysis Reports

Comprehensive error analysis
Root cause identification
Impact assessment documentation
Resolution recommendations
Prevention strategies

Implementation Plans

Error detection system design
Monitoring setup procedures
Alerting configuration guides
Automated testing frameworks
Process improvement recommendations

Training Materials

Error handling best practices
Troubleshooting guides
Tool usage documentation
Process workflow diagrams
Knowledge base articles

Examples

Example 1: E-Commerce Platform Error Monitoring

Scenario: Implementing comprehensive error tracking for a high-traffic e-commerce site.

Implementation:
1. Error Tracking: Sentry integration across all services
2. Log Aggregation: ELK stack for centralized log management
3. Alerting: PagerDuty integration for critical errors
4. Dashboard: Custom Grafana dashboards for error metrics

Results:
- MTTD reduced from hours to minutes
- 40% reduction in time-to-resolution
- Proactive identification of emerging issues

Example 2: Mobile App Crash Reporting

Scenario: Setting up crash reporting for iOS and Android applications.

Approach:
1. Crash Reporting: Firebase Crashlytics integration
2. Symbolication: Automated dSYM upload for readable stack traces
3. Breadcrumbs: User action tracking for context
4. Release Tracking: Correlation of crashes with app versions

Key Metrics Tracked:
- Crash-free users rate (target: 99.5%)
- Top crashers by device and OS version
- Session data with crash-free rate trends
- User feedback correlation with crashes

Example 3: API Gateway Error Analysis

Scenario: Monitoring and analyzing errors at API gateway level for a SaaS platform.

Monitoring Setup:
1. Request Logging: All API requests logged with status codes
2. Rate Tracking: Monitoring for 429 Too Many Requests patterns
3. Latency Analysis: P95, P99 latency tracking by endpoint
4. Authentication Errors: Tracking failed auth attempts for security

Alert Configuration:
- Error rate spikes (> 5% for 5 minutes)
- Latency degradation (> 1s for P95)
- Authentication failures (> 100/min from single IP)
- Circuit breaker state changes

Best Practices

Error Detection Configuration

Comprehensive Coverage: Instrument all code paths, not just critical functions
Context-Rich Data: Include user IDs, request IDs, environment details
Sensitive Data Handling: Scrub PII and secrets before error reporting
Sampling Strategy: Balance detail collection with performance impact
Tagging: Use consistent tagging for filtering and aggregation

Alert Management

Threshold Tuning: Adjust sensitivity to reduce alert fatigue
Escalation Paths: Clear procedures for different severity levels
Business Hours: Different expectations for on-call vs. business hours
Alert Fatigue Prevention: Consolidate related alerts, avoid duplicates
On-Call Rotation: Sustainable schedules with clear responsibilities

Metrics and Reporting

Key Metrics: Track MTTD, MTTR, error rate, resolution rate
Trend Analysis: Weekly/monthly comparisons to identify patterns
SLA Reporting: Error impact on service level agreements
Team Dashboards: Custom views for different teams and roles
Executive Reporting: High-level summaries for leadership

Error Handling Best Practices

Defensive Programming: Validate inputs, handle edge cases
Graceful Degradation: Fallback mechanisms when dependencies fail
Error Recovery: Automatic retry with exponential backoff
User Communication: Meaningful error messages for end users
Logging: Comprehensive logs for debugging and audit trails

Continuous Improvement

Post-Incident Reviews: Learn from every significant error
Pattern Analysis: Identify recurring issues for systemic fixes
Knowledge Base: Document errors and solutions for future reference
Tool Evolution: Regularly evaluate and update detection tools
Team Training: Ensure consistent error handling practices

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.