Search: text-to-speech | AgentSkillsRepo

ready ~/ agentskillsrepo

login

4697 results (63.7ms) page 2 / 235

voice-agents 0.00

omer-metin / skills-for-antigravity-voice-agents exact

Voice agents represent the frontier of AI interaction - humans speaking naturally with AI systems. The challenge isn't just speech recognition and synthesis, it's achieving natural conversation...

★ 5 ai

ai-agents antigravity antigravity-ide skills

ai-music-audio 0.00

omer-metin / skills-for-antigravity-ai-music-audio exact

Comprehensive patterns for AI-powered audio generation including text-to-music, voice synthesis, text-to-speech, sound effects, and audio manipulation using MusicGen, Bark, ElevenLabs, and more....

★ 5 ai

ai-agents antigravity antigravity-ide skills

pdf-text-extractor 0.00

yueweilu / ai-agent-skills-pdf-text-extractor exact

Extract text content from local PDF files for the AI to process.

★ 0 tools

voice-ai-development 0.00

sickn33 / antigravity-awesome-skills-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

★ 2,844 ai

agentic-skills ai-agents antigravity autonomous-coding

voice-ai-development 0.00

cleodin / antigravity-awesome-skills-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

★ 1 ai

agentic-skills ai-agents antigravity antigravity-ide

voice-ai-development 0.00

404kidwiz / agent-skills-backup-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

★ 0 ai

voice-ai-development 0.00

ngxtm / devkit-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

★ 0 ai

agent ai automation claude

voice-ai-development 0.00

automindtechnologie-jpg / ultimate-skill-md-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

★ 0 ai

voice-ai-development 0.00

halay08 / fullstack-agent-skills-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

★ 0 ai

voice-audio-engineer 0.00

erichowens / some-claude-skills-voice-audio-engineer exact

Expert in voice synthesis, TTS, voice cloning, podcast production, speech processing, and voice UI design via ElevenLabs integration. Specializes in vocal clarity, loudness standards (LUFS),...

★ 20 development

voice-interface-builder 0.00

daffy0208 / ai-dev-standards-voice-interface-builder exact

Expert in building voice interfaces, speech recognition, and text-to-speech systems

★ 7 tools

ai-multimodal 0.00

samhvw8 / dot-claude-ai-multimodal exact

Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection,...

★ 5 data

ai-multimodal 0.00

binjuhor / shadcn-lar-ai-multimodal exact

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech...

★ 59 ai

admin admin-dashboard admin-panel laravel

ai-multimodal 0.00

binhmuc / autobot-review-ai-multimodal exact

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech...

★ 21 ai

voice-ai-integration 0.00

qodex-ai / ai-agent-skills-voice-ai-integration exact

Build voice-enabled AI applications with speech recognition, text-to-speech, and voice-based interactions. Supports multiple voice providers and real-time processing. Use when creating voice...

★ 1 tools

kinetic-video-creator 0.00

aviz85 / claude-skills-library-kinetic-video-creator exact

Create professional kinetic typography videos from scratch. Includes speech writing, TTS with emotional dynamics, music generation, and animated text. Use for: promo videos, explainers, social...

★ 11 ai

whisper 0.00

0xbeedao / agentic-tools-whisper exact

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M...

★ 0 ai

whisper 0.00

ovachiever / droid-tings-whisper exact

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M...

★ 19 ai

whisper 0.00

zechenzhangAGI / ai-research-skills-whisper exact

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M...

★ 1,712 ai

ai ai-research claude claude-code

ai-multimodal 0.00

jackspace / claudeskillz-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

★ 8 ai

agentic-coding ai-skills automation bioinformatics