40 results (4.2ms) page 1 / 2
cocoindex-io / cocoindex-claude-cocoindex exact

Comprehensive toolkit for developing with the CocoIndex library. Use when users need to create data transformation pipelines (flows), write custom functions, or operate flows via CLI or API....

cocoindex-io / cocoindex-claude-cocoindex-v1 exact

This skill should be used when building data processing pipelines with CocoIndex v1, a Python library for incremental data transformation. Use when the task involves processing files/data into...

omer-metin / skills-for-antigravity-data-engineer exact

Data pipeline specialist for ETL design, data quality, CDC patterns, and batch/stream processingUse when "data pipeline, etl, cdc, data quality, batch processing, stream processing, data...

rand / cc-polymath-discover-data exact

Automatically discover data pipeline and ETL skills when working with ETL. Activates for data development tasks.

erichowens / some-claude-skills-data-pipeline-engineer exact

Expert data engineer for ETL/ELT pipelines, streaming, data warehousing. Activate on: data pipeline, ETL, ELT, data warehouse, Spark, Kafka, Airflow, dbt, data modeling, star schema, streaming...

cosmix / loom-database-design exact

Designs database schemas, data models, relationships, indexes, and migrations for relational, NoSQL, time-series, and warehouse databases. Covers normalization, denormalization, ETL optimization,...

vsvale / skills-databricks-assistent-agent-lakeflow-sdp exact

Develop and operate Lakeflow Spark Declarative Pipelines (SDP) for batch and streaming ETL on Databricks. Covers pipelines, flows, streaming tables, materialized views, expectations, Auto Loader,...

404kidwiz / claude-supercode-skills-data-engineer exact

Use when user needs scalable data pipeline development, ETL/ELT implementation, or data infrastructure design.

DonggangChen / antigravity-agentic-skills-data-transform exact

Transform raw data into analytical assets using ETL/ELT patterns, SQL (dbt), Python (pandas/polars/PySpark), and orchestration (Airflow). Use when building data pipelines, implementing incremental...

vuralserhat86 / antigravity-agentic-skills-data-transform exact

Transform raw data into analytical assets using ETL/ELT patterns, SQL (dbt), Python (pandas/polars/PySpark), and orchestration (Airflow). Use when building data pipelines, implementing incremental...

zechenzhangAGI / ai-research-skills-ray-data exact

Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s...

ovachiever / droid-tings-ray-data exact

Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s...

cosmix / loom-background-jobs exact

Background job processing patterns including job queues, scheduled jobs, worker pools, and retry strategies. Use when implementing async processing, job queues, workers, task queues, async tasks,...

dkyazzentwatwa / chatgpt-skills-data-quality-auditor exact

Assess data quality with checks for missing values, duplicates, type issues, and inconsistencies. Use for data validation, ETL pipelines, or dataset documentation.

dkyazzentwatwa / chatgpt-skills-dataset-comparer exact

Compare two datasets to find differences, added/removed rows, changed values. Use for data validation, ETL verification, or tracking changes.

silvainfm / claude-skills-polars exact

Lightning-fast DataFrame library written in Rust for high-performance data manipulation and analysis. Use when user wants blazing fast data transformations, working with large datasets, lazy...

K-Dense-AI / claude-scientific-skills-polars exact

Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for...

ovachiever / droid-tings-senior-data-engineer exact

World-class data engineering skill for building scalable data pipelines, ETL/ELT systems, and data infrastructure. Expertise in Python, SQL, Spark, Airflow, dbt, Kafka, and modern data stack....