Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects,...
Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...
Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...
Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...
World-class voiceover expertise combining the narrative craft of documentary producers, the commercial precision of advertising agencies, and the accessibility of modern AI voice technology....
The craft of writing for the screen—from 6-second ads to 10-minute brand films. This skill bridges storytelling with the unique constraints and opportunities of video format: visual thinking,...
Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...
Best practices for Remotion - Video creation in React
Voice agents represent the frontier of AI interaction - humans speaking naturally with AI systems. The challenge isn't just speech recognition and synthesis, it's achieving natural conversation...
Game development orchestrator. Routes to platform-specific skills based on project needs.
Convert audio to sheet music, create songbooks
Place album art in correct audio and content locations
Generate promo videos for social media from mastered audio
Audio mastering guidance, loudness optimization, platform delivery specs
Build communication features with Twilio: SMS messaging, voice calls, WhatsApp Business API, and user verification (2FA). Covers the full spectrum from simple notifications to complex IVR systems...
This skill should be used when the user asks to "test for XSS vulnerabilities", "perform cross-site scripting attacks", "identify HTML injection flaws", "exploit client-side injection...
Expert in live streaming, WebRTC, and real-time video/audio
Enables Claude to create and interact with NotebookLM for document analysis, audio overviews, and knowledge synthesis
Use when downloading videos, audio, or captions from YouTube and other video platforms. Supports quality selection.
Enables Claude to manage Clubhouse rooms and audio social interactions