Manage Apple Reminders via the `remindctl` CLI on macOS (list, add, edit, complete, delete)....
npx skills add coffeefuelbump/csv-data-summarizer-claude-skill
Or install specific skill: npx add-skill https://github.com/coffeefuelbump/csv-data-summarizer-claude-skill
# Description
Analyzes CSV files, generates summary stats, and plots quick visualizations using Python and pandas.
# SKILL.md
name: csv-data-summarizer
description: Analyzes CSV files, generates summary stats, and plots quick visualizations using Python and pandas.
metadata:
version: 2.1.0
dependencies: python>=3.8, pandas>=2.0.0, matplotlib>=3.7.0, seaborn>=0.12.0
CSV Data Summarizer
This Skill analyzes CSV files and provides comprehensive summaries with statistical insights and visualizations.
When to Use This Skill
Claude should use this Skill whenever the user:
- Uploads or references a CSV file
- Asks to summarize, analyze, or visualize tabular data
- Requests insights from CSV data
- Wants to understand data structure and quality
How It Works
⚠️ CRITICAL BEHAVIOR REQUIREMENT ⚠️
DO NOT ASK THE USER WHAT THEY WANT TO DO WITH THE DATA.
DO NOT OFFER OPTIONS OR CHOICES.
DO NOT SAY "What would you like me to help you with?"
DO NOT LIST POSSIBLE ANALYSES.
IMMEDIATELY AND AUTOMATICALLY:
1. Run the comprehensive analysis
2. Generate ALL relevant visualizations
3. Present complete results
4. NO questions, NO options, NO waiting for user input
THE USER WANTS A FULL ANALYSIS RIGHT AWAY - JUST DO IT.
Automatic Analysis Steps:
The skill intelligently adapts to different data types and industries by inspecting the data first, then determining what analyses are most relevant.
- Load and inspect the CSV file into pandas DataFrame
- Identify data structure - column types, date columns, numeric columns, categories
- Determine relevant analyses based on what's actually in the data:
- Sales/E-commerce data (order dates, revenue, products): Time-series trends, revenue analysis, product performance
- Customer data (demographics, segments, regions): Distribution analysis, segmentation, geographic patterns
- Financial data (transactions, amounts, dates): Trend analysis, statistical summaries, correlations
- Operational data (timestamps, metrics, status): Time-series, performance metrics, distributions
- Survey data (categorical responses, ratings): Frequency analysis, cross-tabulations, distributions
-
Generic tabular data: Adapts based on column types found
-
Only create visualizations that make sense for the specific dataset:
- Time-series plots ONLY if date/timestamp columns exist
- Correlation heatmaps ONLY if multiple numeric columns exist
- Category distributions ONLY if categorical columns exist
-
Histograms for numeric distributions when relevant
-
Generate comprehensive output automatically including:
- Data overview (rows, columns, types)
- Key statistics and metrics relevant to the data type
- Missing data analysis
- Multiple relevant visualizations (only those that apply)
-
Actionable insights based on patterns found in THIS specific dataset
-
Present everything in one complete analysis - no follow-up questions
Example adaptations:
- Healthcare data with patient IDs → Focus on demographics, treatment patterns, temporal trends
- Inventory data with stock levels → Focus on quantity distributions, reorder patterns, SKU analysis
- Web analytics with timestamps → Focus on traffic patterns, conversion metrics, time-of-day analysis
- Survey responses → Focus on response distributions, demographic breakdowns, sentiment patterns
Behavior Guidelines
✅ CORRECT APPROACH - SAY THIS:
- "I'll analyze this data comprehensively right now."
- "Here's the complete analysis with visualizations:"
- "I've identified this as [type] data and generated relevant insights:"
- Then IMMEDIATELY show the full analysis
✅ DO:
- Immediately run the analysis script
- Generate ALL relevant charts automatically
- Provide complete insights without being asked
- Be thorough and complete in first response
- Act decisively without asking permission
❌ NEVER SAY THESE PHRASES:
- "What would you like to do with this data?"
- "What would you like me to help you with?"
- "Here are some common options:"
- "Let me know what you'd like help with"
- "I can create a comprehensive analysis if you'd like!"
- Any sentence ending with "?" asking for user direction
- Any list of options or choices
- Any conditional "I can do X if you want"
❌ FORBIDDEN BEHAVIORS:
- Asking what the user wants
- Listing options for the user to choose from
- Waiting for user direction before analyzing
- Providing partial analysis that requires follow-up
- Describing what you COULD do instead of DOING it
Usage
The Skill provides a Python function summarize_csv(file_path) that:
- Accepts a path to a CSV file
- Returns a comprehensive text summary with statistics
- Generates multiple visualizations automatically based on data structure
Example Prompts
"Here's
sales_data.csv. Can you summarize this file?""Analyze this customer data CSV and show me trends."
"What insights can you find in
orders.csv?"
Example Output
Dataset Overview
- 5,000 rows × 8 columns
- 3 numeric columns, 1 date column
Summary Statistics
- Average order value: $58.2
- Standard deviation: $12.4
- Missing values: 2% (100 cells)
Insights
- Sales show upward trend over time
- Peak activity in Q4
(Attached: trend plot)
Files
analyze.py- Core analysis logicrequirements.txt- Python dependenciesresources/sample.csv- Example dataset for testingresources/README.md- Additional documentation
Notes
- Automatically detects date columns (columns containing 'date' in name)
- Handles missing data gracefully
- Generates visualizations only when date columns are present
- All numeric columns are included in statistical summary
# README.md
📊 CSV Data Summarizer - Claude Skill
A powerful Claude Skill that automatically analyzes CSV files and generates comprehensive insights with visualizations. Upload any CSV and get instant, intelligent analysis without being asked what you want!
🚀 Features
- 🤖 Intelligent & Adaptive - Automatically detects data type (sales, customer, financial, survey, etc.) and applies relevant analysis
- 📈 Comprehensive Analysis - Generates statistics, correlations, distributions, and trends
- 🎨 Auto Visualizations - Creates multiple charts based on what's in your data:
- Time-series plots for date-based data
- Correlation heatmaps for numeric relationships
- Distribution histograms
- Categorical breakdowns
- ⚡ Proactive - No questions asked! Just upload CSV and get complete analysis immediately
- 🔍 Data Quality Checks - Automatically detects and reports missing values
- 📊 Multi-Industry Support - Adapts to e-commerce, healthcare, finance, operations, surveys, and more
📥 Quick Download
📦 What's Included
csv-data-summarizer-claude-skill/
├── SKILL.md # Claude Skill definition
├── analyze.py # Comprehensive analysis engine
├── requirements.txt # Python dependencies
├── examples/
│ └── showcase_financial_pl_data.csv # Demo P&L financial dataset (15 months, 25 metrics)
└── resources/
├── sample.csv # Example dataset
└── README.md # Usage documentation
🎯 How It Works
- Upload any CSV file to Claude.ai
- Skill activates automatically when CSV is detected
- Analysis runs immediately - inspects data structure and adapts
- Results delivered - Complete analysis with multiple visualizations
No prompting needed. No options to choose. Just instant, comprehensive insights!
📥 Installation
For Claude.ai Users
- Download the latest release:
csv-data-summarizer.zip - Go to Claude.ai → Settings → Capabilities → Skills
- Upload the zip file
- Enable the skill
- Done! Upload any CSV and watch it work ✨
For Developers
git clone [email protected]:coffeefuelbump/csv-data-summarizer-claude-skill.git
cd csv-data-summarizer-claude-skill
pip install -r requirements.txt
📊 Sample Dataset Highlights
The included demo CSV contains 15 months of P&L data with:
- 3 product lines (SaaS, Enterprise, Services)
- 25 financial metrics including revenue, expenses, margins, CAC, LTV
- Quarterly trends showing business growth
- Perfect for showcasing time-series analysis, correlations, and financial insights
🎨 Example Use Cases
- 📊 Sales Data → Revenue trends, product performance, regional analysis
- 👥 Customer Data → Demographics, segmentation, geographic patterns
- 💰 Financial Data → Transaction analysis, trend detection, correlations
- ⚙️ Operational Data → Performance metrics, time-series analysis
- 📋 Survey Data → Response distributions, cross-tabulations
🛠️ Technical Details
Dependencies:
- Python 3.8+
- pandas 2.0+
- matplotlib 3.7+
- seaborn 0.12+
Visualizations Generated:
- Time-series trend plots
- Correlation heatmaps
- Distribution histograms
- Categorical bar charts
📝 Example Output
============================================================
📊 DATA OVERVIEW
============================================================
Rows: 100 | Columns: 15
📋 DATA TYPES:
• order_date: object
• total_revenue: float64
• customer_segment: object
...
🔍 DATA QUALITY:
✓ No missing values - dataset is complete!
📈 NUMERICAL ANALYSIS:
[Summary statistics for all numeric columns]
🔗 CORRELATIONS:
[Correlation matrix showing relationships]
📅 TIME SERIES ANALYSIS:
Date range: 2024-01-05 to 2024-04-11
Span: 97 days
📊 VISUALIZATIONS CREATED:
✓ correlation_heatmap.png
✓ time_series_analysis.png
✓ distributions.png
✓ categorical_distributions.png
🌟 Connect & Learn More
🤝 Contributing
Contributions are welcome! Feel free to:
- Report bugs
- Suggest new features
- Submit pull requests
- Share your use cases
📄 License
MIT License - feel free to use this skill for personal or commercial projects!
🙏 Acknowledgments
Built for the Claude Skills platform by Anthropic.
# Supported AI Coding Agents
This skill is compatible with the SKILL.md standard and works with all major AI coding agents:
Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.