Search: audio | AgentSkillsRepo

esphome-box3-builder 0.00

nodnarbnitram / claude-code-extensions-esphome-box3-builder exact

This skill should be used when the user asks to "configure esp32-s3-box-3", "set up box-3", "create box-3 voice assistant", "display lambda on box-3", "configure ili9xxx display", "set up gt911...

★ 3 development

ai-multimodal 0.00

samhvw8 / dot-claude-ai-multimodal exact

Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection,...

★ 5 data

ai-multimodal 0.00

binhmuc / autobot-review-ai-multimodal exact

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech...

★ 21 ai

ai-multimodal 0.00

binjuhor / shadcn-lar-ai-multimodal exact

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech...

★ 59 ai

admin admin-dashboard admin-panel laravel

good-TTvideo2text 0.00

ImGoodBai / goodable-good-ttvideo2text exact

Extract audio from short videos (Douyin/TikTok) and transcribe to text with timestamps. Use when user provides video URL and needs audio transcription.

★ 84 ai

base44 claudecode codeagent lovable

text-to-speech 0.00

martinholovsky / claude-skills-generator-text-to-speech exact

Expert skill for implementing text-to-speech with Kokoro TTS. Covers voice synthesis, audio generation, performance optimization, and secure handling of generated audio for JARVIS voice assistant.

★ 20 tools

elevenlabs 0.00

digitalsamba / claude-code-video-toolkit-elevenlabs exact

Generate AI voiceovers, sound effects, and music using ElevenLabs APIs. Use when creating audio content for videos, podcasts, or games. Triggers include generating voiceovers, narration, dialogue,...

★ 23 ai

ai-video-generator claude-code developer-tools elevenlabs

video-transcript-downloader 0.00

steipete / agent-scripts-video-transcript-downloader exact

Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported site. Use when asked to “download this video”, “save this clip”, “rip audio”,...

★ 1,503 ai

ai-agents

video-transcript-downloader 0.00

devskale / skale-skills-video-transcript-downloader exact

Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported site. Use when asked to “download this video”, “save this clip”, “rip audio”,...

★ 0 development

Video Processor 0.00

iamzhihuix / happy-claude-skills-video-processor exact

Download and process videos from YouTube and other platforms. Supports video download, audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions YouTube...

★ 241 development

video-transcript-downloader 0.00

lancenunes / codex-skills-video-transcript-downloader exact

Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported site. Use when asked to “download this video”, “save this clip”, “rip audio”,...

★ 1 ai

agent-skills agents ai-development ai-tools

yt-dlp-downloader 0.00

MapleShaw / yt-dlp-downloader-skill exact

Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a video URL and wants to download it, extract audio (MP3), download...

★ 141 development

md-download 0.00

chaye7417 / claude-skill-md-download exact

下载 Markdown 文件中的音视频和图片附件并本地化嵌入 Obsidian。支持 YouTube 视频、Patreon 视频、SoundCloud 音频、网络图片，可选下载字幕。Download SoundCloud audio, YouTube videos, Patreon videos, and web images from markdown files and embed...

★ 0 development

azure-ai-voicelive 0.00

ngxtm / devkit-azure-ai-voicelive exact

Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication...

★ 0 ai

agent ai automation claude

youtube-downloader 0.00

jackspace / claudeskillz-youtube-downloader exact

Download videos, audio, playlists, and channels from YouTube and 1000+ websites using yt-dlp. Supports quality selection, format conversion, subtitle download, playlist filtering, metadata...

★ 8 ai

agentic-coding ai-skills automation bioinformatics

media-processing 0.00

jackspace / claudeskillz-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

★ 8 development

agentic-coding ai-skills automation bioinformatics

media-processing 0.00

mrgoonie / claudekit-skills-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

★ 1,504 development

media-processing 0.00

ngxtm / devkit-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

★ 0 ai

agent ai automation claude

media-processing 0.00

zircote / claude-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

★ 9 development

media-processing 0.00

binhmuc / autobot-review-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects,...

★ 21 development

Confirm

Submit a Skill