📊

Data & Analytics

Data processing, analysis, visualization, and ML pipelines

702 Skills

Fast DataFrame library (Apache Arrow). Select, filter, group_by, joins, lazy evaluation, CSV/Parquet I/O, expression...

Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense +...

Access RCSB PDB for 3D protein/nucleic acid structures. Search by text/sequence/structure, download coordinates...

Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown...

Access Human Metabolome Database (220K+ metabolites). Search by name/ID/structure, retrieve chemical properties,...

Query NHGRI-EBI GWAS Catalog for SNP-trait associations. Search variants by rs ID, disease/trait, gene, retrieve...

CLI/Python toolkit for rapid bioinformatics queries. Preferred for quick BLAST searches. Access to 20+ databases:...

Access NCBI GEO for gene expression/genomics data. Search/download microarray and RNA-seq datasets (GSE, GSM, GPL),...

Query NCBI Gene via E-utilities/Datasets API. Search by symbol/ID, retrieve gene info (RefSeqs, GO, locations,...

Query openFDA API for drugs, devices, adverse events, recalls, regulatory submissions (510k, PMA), substance...

Pythonic wrapper around RDKit with simplified interface and sensible defaults. Preferred for standard drug...

Parallel/distributed computing. Scale pandas/NumPy beyond memory, parallel DataFrames/Arrays, multi-file processing,...