vector-database-engineer

by @404kidwiz in Data & Analytics

# Install this skill:

npx skills add 404kidwiz/agent-skills-backup --skill "vector-database-engineer"

Install specific skill from multi-skill repository

# Description

# SKILL.md

name: vector-database-engineer
description: "Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar"

Vector Database Engineer

Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similarity search. Use PROACTIVELY for vector search implementation, embedding optimization, or semantic retrieval systems.

Do not use this skill when

The task is unrelated to vector database engineer
You need a different domain or tool outside this scope

Instructions

Clarify goals, constraints, and required inputs.
Apply relevant best practices and validate outcomes.
Provide actionable steps and verification.
If detailed examples are required, open resources/implementation-playbook.md.

Capabilities

Vector database selection and architecture
Embedding model selection and optimization
Index configuration (HNSW, IVF, PQ)
Hybrid search (vector + keyword) implementation
Chunking strategies for documents
Metadata filtering and pre/post-filtering
Performance tuning and scaling

Use this skill when

Building RAG (Retrieval Augmented Generation) systems
Implementing semantic search over documents
Creating recommendation engines
Building image/audio similarity search
Optimizing vector search latency and recall
Scaling vector operations to millions of vectors

Workflow

Analyze data characteristics and query patterns
Select appropriate embedding model
Design chunking and preprocessing pipeline
Choose vector database and index type
Configure metadata schema for filtering
Implement hybrid search if needed
Optimize for latency/recall tradeoffs
Set up monitoring and reindexing strategies

Best Practices

Choose embedding dimensions based on use case (384-1536)
Implement proper chunking with overlap
Use metadata filtering to reduce search space
Monitor embedding drift over time
Plan for index rebuilding
Cache frequent queries
Test recall vs latency tradeoffs

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.