chaye7417 / claude-skill-md-download exact

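Download the audio, video, and image attachments referenced in Markdown files and embed them locally in Obsidian. Supports YouTube videos, Patreon videos, SoundCloud audio, and web images, with optional subtitle download. Download SoundCloud audio, YouTube videos, Patreon videos, and web images from markdown files and embed...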

ngxtm / devkit-azure-ai-voicelive exact

Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication...

jackspace / claudeskillz-youtube-downloader exact

Download videos, audio, playlists, and channels from YouTube and 1000+ websites using yt-dlp. Supports quality selection, format conversion, subtitle download, playlist filtering, metadata...

jackspace / claudeskillz-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

ngxtm / devkit-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

mrgoonie / claudekit-skills-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

zircote / claude-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration) and ImageMagick (image manipulation, format conversion, batch processing,...

binhmuc / autobot-review-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects,...

binjuhor / shadcn-lar-media-processing exact

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects,...

jackspace / claudeskillz-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

zircote / claude-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

mrgoonie / claudekit-skills-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

omer-metin / skills-for-antigravity-voiceover exact

World-class voiceover expertise combining the narrative craft of documentary producers, the commercial precision of advertising agencies, and the accessibility of modern AI voice technology....

omer-metin / skills-for-antigravity-video-scriptwriting exact

The craft of writing for the screen—from 6-second ads to 10-minute brand films. This skill bridges storytelling with the unique constraints and opportunities of video format: visual thinking,...

dirnbauer / webconsulting-skills-deepfake-detection exact

Multimodal media authentication and deepfake forensics. PRNU analysis, IGH classification, DQ detection, semantic forensics, and LLM-augmented sensemaking for the post-empirical era.

cleodin / antigravity-awesome-skills-voice-ai-development exact

Expert in building voice AI applications - from real-time voice agents to voice-enabled apps. Covers OpenAI Realtime API, Vapi for voice agents, Deepgram for transcription, ElevenLabs for...

cleodin / antigravity-awesome-skills-remotion-best-practices exact

Best practices for Remotion - Video creation in React

cleodin / antigravity-awesome-skills-cross-site-scripting-and-html-injection-testing exact

This skill should be used when the user asks to "test for XSS vulnerabilities", "perform cross-site scripting attacks", "identify HTML injection flaws", "exploit client-side injection...

ngxtm / devkit-ai-multimodal exact

Analyze images/audio/video with Gemini API (better vision than Claude). Generate images (Imagen 4), videos (Veo 3). Use for vision analysis, transcription, OCR, design extraction, multimodal AI.

Xsir0 / xsir-skills-google-gemini-media exact

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".