Search: audio-classification

yt-dlp 0.00

lwmxiaobei / yt-dlp-skill exact

Download videos and extract audio from various platforms using yt-dlp. Use when user provides a video URL, asks to download a video, or when conversation contains video links from YouTube,...

★ 4 ai

omnicaptions-transcribe 0.00

lattifai / omni-captions-skills-omnicaptions-transcribe exact

Use when transcribing audio/video to text with timestamps, speaker labels, and chapters. Supports YouTube URLs and local files. Produces structured markdown output.

★ 21 development

gemini-live-api 0.00

Hildegaardchiasmal966 / claude-skills-gemini-live-api exact

Expert developer skill for implementing real-time voice and video interactions using the Google Gemini Live API. This skill should be used when implementing bidirectional audio streaming, voice...

★ 1 development

agentic-ai ai anthropic-ai anthropic-skills

assemblyai-streaming 0.00

ratacat / claude-skills-assemblyai-streaming exact

This skill should be used when working with AssemblyAI’s Speech-to-Text and LLM Gateway APIs, especially for streaming/live transcription, meeting notetakers, and voice agents that need...

★ 16 ai

whisper 0.00

0xbeedao / agentic-tools-whisper exact

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M...

★ 0 ai

whisper 0.00

ovachiever / droid-tings-whisper exact

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M...

★ 19 ai

whisper 0.00

zechenzhangAGI / ai-research-skills-whisper exact

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M...

★ 1,712 ai

ai ai-research claude claude-code

voice-ai-integration 0.00

qodex-ai / ai-agent-skills-voice-ai-integration exact

Build voice-enabled AI applications with speech recognition, text-to-speech, and voice-based interactions. Supports multiple voice providers and real-time processing. Use when creating voice...

★ 1 tools

realitykit-visionos-developer 0.00

tomkrikorian / visionosagents-realitykit-visionos-developer exact

Build, debug, and optimize RealityKit scenes for visionOS, including entity/component setup, rendering, animation, physics, audio, input, attachments, and custom systems. Use when implementing...

★ 31 development

youtube-downloader 0.00

frank-syncmarket / skills-youtube-downloader exact

Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, save, or grab YouTube videos. Supports various quality settings (best, 1080p,...

★ 1 ai

agent-skills anthropic anthropic-ai astro

speakturbo-tts 0.00

emzod / speak-turbo exact

Give your agent the ability to speak to you real-time. Talk to your Claude! Ultra-fast TTS, text-to-speech, voice synthesis, audio output with ~90ms latency. 8 built-in voices for instant voice...

★ 5 ai

ai-agents local-first python rust

elevenlabs-remotion 0.00

maartenlouis / elevenlabs-remotion-skill exact

Generate professional voiceovers using ElevenLabs AI. Use when the user needs to create voiceovers for videos, audio narration, or text-to-speech content. Supports multiple voices with character...

★ 1 ai

immersive-visuals-router 0.00

Bbeierle12 / skill-mcp-claude-immersive-visuals-router exact

Master router for immersive visual experiences combining React Three Fiber, shaders, particles, post-processing, GSAP animation, and audio. Use when building 3D web experiences, visualizers,...

★ 4 ai

markitdown 0.00

K-Dense-AI / claude-scientific-skills-markitdown exact

Convert files and office documents to Markdown. Supports PDF, DOCX, PPTX, XLSX, images (with OCR), audio (with transcription), HTML, CSV, JSON, XML, ZIP, YouTube URLs, EPubs and more.

★ 6,907 ai

ai-scientist bioinformatics chemoinformatics claude

speak-tts 0.00

emzod / speak exact

Give your agent the ability to speak to you real-time. Talk to your Claude! Local TTS, text-to-speech, voice synthesis, audio generation with voice cloning on Apple Silicon. Use for reading...

★ 4 ai

ai apple-silicon chatterbox cli

scikit-learn 0.00

jackspace / claudeskillz-scikit-learn exact

Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model...

★ 8 ai

agentic-coding ai-skills automation bioinformatics

scikit-learn 0.00

ovachiever / droid-tings-scikit-learn exact

Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model...

★ 19 ai

scikit-learn 0.00

K-Dense-AI / claude-scientific-skills-scikit-learn exact

Machine learning in Python with scikit-learn. Use when working with supervised learning (classification, regression), unsupervised learning (clustering, dimensionality reduction), model...

★ 6,907 ai

ai-scientist bioinformatics chemoinformatics claude

video-downloader 0.00

isjiamu / jiamu-skills-video-downloader exact

Download videos from 1000+ websites (YouTube, Bilibili, Twitter/X, TikTok, etc.) using yt-dlp. Use this skill when users provide video URLs and want to download videos, extract audio, or need help...

★ 35 web

electron-dev 0.00

jamditis / claude-skills-journalism-electron-dev exact

Electron desktop application development with React, TypeScript, and Vite. Use when building desktop apps, implementing IPC communication, managing windows/tray, handling PTY terminals,...

★ 3 development

Confirm

Submit a Skill