4697 results (34.0ms) page 3 / 235
mrgoonie / claudekit-skills-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

zircote / claude-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

onewave-ai / claude-skills-sms-text-optimizer exact

Condense messages to 160 characters without losing meaning. Remove unnecessary words while keeping tone.

itechmeat / llm-code-inworld exact

Inworld TTS API. Covers voice cloning, audio markups, timestamps. Keywords: text-to-speech, visemes.

ratacat / claude-skills-assemblyai-streaming exact

This skill should be used when working with AssemblyAI’s Speech-to-Text and LLM Gateway APIs, especially for streaming/live transcription, meeting notetakers, and voice agents that need...

ThePlasmak / faster-whisper exact

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. Supports standard and distilled models...

maartenlouis / elevenlabs-remotion-skill exact

Generate professional voiceovers using ElevenLabs AI. Use when the user needs to create voiceovers for videos, audio narration, or text-to-speech content. Supports multiple voices with character...

Mucho-G / pi-skills-transcribe exact

Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

badlogic / pi-skills-transcribe exact

Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.

daymade / claude-code-skills-transcript-fixer exact

Corrects speech-to-text transcription errors in meeting notes, lectures, and interviews using dictionary rules and AI. Learns patterns to build personalized correction databases. Use when working...

moltbot / moltbot-openai-whisper exact

Local speech-to-text with the Whisper CLI (no API key).

sag 0.00
moltbot / moltbot-sag exact

ElevenLabs text-to-speech with mac-style say UX.

sag 0.00
yueweilu / ai-agent-skills-sag exact

ElevenLabs text-to-speech with mac-style say UX.

jackspace / claudeskillz-transformers exact

This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification,...

ovachiever / droid-tings-transformers exact

This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification,...

K-Dense-AI / claude-scientific-skills-transformers exact

This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification,...

moltbot / moltbot-sherpa-onnx-tts exact

Local text-to-speech via sherpa-onnx (offline, no cloud)

ngxtm / devkit-convert-pcm-to-wav-see-scripts-pcm-to-wav-py exact

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast...