binhmuc

media-processing

21
3
# Install this skill:
npx skills add binhmuc/autobot-review --skill "media-processing"

Install specific skill from multi-skill repository

# Description

Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.

# SKILL.md


name: media-processing
description: Process multimedia files with FFmpeg (video/audio encoding, conversion, streaming, filtering, hardware acceleration), ImageMagick (image manipulation, format conversion, batch processing, effects, composition), and RMBG (AI-powered background removal). Use when converting media formats, encoding videos with specific codecs (H.264, H.265, VP9), resizing/cropping images, removing backgrounds from images, extracting audio from video, applying filters and effects, optimizing file sizes, creating streaming manifests (HLS/DASH), generating thumbnails, batch processing images, creating composite images, or implementing media processing pipelines. Supports 100+ formats, hardware acceleration (NVENC, QSV), and complex filtergraphs.
license: MIT


Media Processing Skill

Process video, audio, and images using FFmpeg, ImageMagick, and RMBG CLI tools.

Tool Selection

Task Tool Reason
Video encoding/conversion FFmpeg Native codec support, streaming
Audio extraction/conversion FFmpeg Direct stream manipulation
Image resize/effects ImageMagick Optimized for still images
Background removal RMBG AI-powered, local processing
Batch images ImageMagick mogrify for in-place edits
Video thumbnails FFmpeg Frame extraction built-in
GIF creation FFmpeg/ImageMagick FFmpeg for video, ImageMagick for images

Installation

# macOS
brew install ffmpeg imagemagick
npm install -g rmbg-cli

# Ubuntu/Debian
sudo apt-get install ffmpeg imagemagick
npm install -g rmbg-cli

# Verify
ffmpeg -version && magick -version && rmbg --version

Essential Commands

# Video: Convert/re-encode
ffmpeg -i input.mkv -c copy output.mp4
ffmpeg -i input.avi -c:v libx264 -crf 22 -c:a aac output.mp4

# Video: Extract audio
ffmpeg -i video.mp4 -vn -c:a copy audio.m4a

# Image: Convert/resize
magick input.png output.jpg
magick input.jpg -resize 800x600 output.jpg

# Image: Batch resize
mogrify -resize 800x -quality 85 *.jpg

# Background removal
rmbg input.jpg                          # Basic (modnet)
rmbg input.jpg -m briaai -o output.png  # High quality
rmbg input.jpg -m u2netp -o output.png  # Fast

Key Parameters

FFmpeg:
- -c:v libx264 - H.264 codec
- -crf 22 - Quality (0-51, lower=better)
- -preset slow - Speed/compression balance
- -c:a aac - Audio codec

ImageMagick:
- 800x600 - Fit within (maintains aspect)
- 800x600^ - Fill (may crop)
- -quality 85 - JPEG quality
- -strip - Remove metadata

RMBG:
- -m briaai - High quality model
- -m u2netp - Fast model
- -r 4096 - Max resolution

References

Detailed guides in references/:
- ffmpeg-encoding.md - Codecs, quality, hardware acceleration
- ffmpeg-streaming.md - HLS/DASH, live streaming
- ffmpeg-filters.md - Filters, complex filtergraphs
- imagemagick-editing.md - Effects, transformations
- imagemagick-batch.md - Batch processing, parallel ops
- rmbg-background-removal.md - AI models, CLI usage
- common-workflows.md - Video optimization, responsive images, GIF creation
- troubleshooting.md - Error fixes, performance tips
- format-compatibility.md - Format support, codec recommendations

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.