40 results (52.7ms) page 2 / 2
frank-syncmarket / skills-text-to-speech exact

Text-to-speech tool - supports script parsing, emotion tagging, and post-processing; based on Edge TTS

eddiebe147 / claude-settings-embedding-generator exact

Generate and manage text embeddings for semantic search, clustering, and similarity tasks

onewave-ai / claude-skills-sms-text-optimizer exact

Condense messages to 160 characters without losing meaning. Remove unnecessary words while keeping tone.

BenedictKing / exa-search-exa-search exact

Semantic search skill using Exa API for embeddings-based search, similar content discovery, and structured research. Use when you need semantic search, find similar pages, or category-specific...

intellectronica / agent-skills-lorem-ipsum exact

Generate lorem ipsum placeholder text. This skill should be used when users ask to generate lorem ipsum content, placeholder text, dummy text, or filler text. Supports various structures including...

404kidwiz / claude-supercode-skills-nlp-engineer exact

Expert in Natural Language Processing, designing systems for text classification, NER, translation, and LLM integration using Hugging Face, spaCy, and LangChain. Use when building NLP pipelines,...

akrindev / google-studio-skills-gemini-tts exact

Generate speech from text using Google Gemini TTS models via scripts/. Use for text-to-speech, audio generation, voice synthesis, multi-speaker conversations, and creating audio content. Supports...

Johnny-xuan / smart-voice-chat exact

Voice conversation: transcribes voice input and replies in the same format (voice-to-voice, text-to-text) by default. The user can override this with voice or text commands.

YPares / agent-skills-read-bin-docs exact

Straightforward text extraction from document files (text-based PDF only for now, no OCR or docx). Use when you just need to read/extract text from binary documents.

omer-metin / skills-for-antigravity-ai-music-audio exact

Comprehensive patterns for AI-powered audio generation including text-to-music, voice synthesis, text-to-speech, sound effects, and audio manipulation using MusicGen, Bark, ElevenLabs, and more....

zechenzhangAGI / ai-research-skills-clip exact

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search,...

ovachiever / droid-tings-clip exact

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search,...

zechenzhangAGI / ai-research-skills-audiocraft-audio-generation exact

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform...

omer-metin / skills-for-antigravity-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

xdrshjr / jr-agent-skills-doubao-open-tts exact

Text-to-Speech using Doubao (Volcano Engine) API. Use when converting text to natural-sounding speech, generating audio files from text, listing available TTS voices, or synthesizing speech with...

emzod / speak-turbo exact

Give your agent the ability to speak to you in real time. Talk to your Claude! Ultra-fast TTS, text-to-speech, voice synthesis, audio output with ~90ms latency. 8 built-in voices for instant voice...

ahmedibrahim085 / claude-multi-agent-research-system-skill-semantic-search exact

Semantic search for finding code by meaning using natural language queries. Orchestrates semantic-search-reader (search/find-similar/list-projects) and semantic-search-indexer...

omer-metin / skills-for-antigravity-ai-image-generation exact

Mastery of AI image generation across the full spectrum: Midjourney for aesthetic perfection, Flux for prompt adherence, DALL-E 3 for concept clarity, Stable Diffusion for control, and Imagen 3...

emzod / speak exact

Give your agent the ability to speak to you in real time. Talk to your Claude! Local TTS, text-to-speech, voice synthesis, audio generation with voice cloning on Apple Silicon. Use for reading...

AnswerZhao / agent-skills-glm-skills-asr exact

Implement speech-to-text (ASR/automatic speech recognition) capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to transcribe audio files, convert speech to text, build...