80 results (3.5ms) page 3 / 4
jezweb / claude-skills-openai-api exact

qodex-ai / ai-agent-skills-voice-ai-integration exact

Build voice-enabled AI applications with speech recognition, text-to-speech, and voice-based interactions. Supports multiple voice providers and real-time processing. Use when creating voice...

alinaqi / claude-bootstrap-ai-models exact

Latest AI models reference - Claude, OpenAI, Gemini, Eleven Labs, Replicate

resemble-ai / remotion-resemble-skill-remotion-resemble-ai exact

Create professional AI-narrated videos with Remotion and Resemble.ai - from educational tutorials to product launches

jackspace / claudeskillz-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

samhvw8 / dot-claude-ai-multimodal exact

Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection,...

mrgoonie / claudekit-skills-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

zircote / claude-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

YuniorGlez / gemini-elite-core-voice-ux-pro exact

Master of Voice-First Interfaces, specialized in sub-300ms Latency, Spatial Hearing AI, and Multimodal Voice-Haptic feedback.

akrindev / google-studio-skills-gemini-text exact

Generate text content using Google Gemini models via scripts/. Use for text generation, multimodal prompts with images, thinking mode for complex reasoning, JSON-formatted outputs, and Google...