783 results (20.9ms) page 8 / 40
samhvw8 / dot-claude-aesthetic exact

Visual design intelligence and UI aesthetics. Integrates: chrome-devtools, ai-multimodal, media-processing. Capabilities: design analysis, visual hierarchy, color theory, typography,...

sheikh-mohammad / project-a1-extract-your-human-job-into-skills-browsing-with-playwright exact

Browser automation using Playwright MCP. Navigate websites, fill forms, click elements, take screenshots, and extract data. Use for web browsing, form submission, web scraping, or UI testing. NOT...

remorses / playwriter-playwriter exact

Control the user own Chrome browser via Playwriter extension with Playwright code snippets in a stateful local js sandbox via playwriter cli. Automate web interactions, take screenshots, inspect...

ngxtm / devkit-github-issue-creator exact

Convert raw notes, error logs, voice dictation, or screenshots into crisp GitHub-flavored markdown issue reports. Use when the user pastes bug info, error messages, or informal descriptions and...

Charon-Fan / agent-playbook-figma-designer exact

Analyzes Figma designs and generates implementation-ready PRDs with detailed visual specifications. Use when user provides Figma link or uploads design screenshots. Requires Figma MCP server connection.

aktsmm / agent-skills-ocr-super-surya exact

GPU-optimized OCR using Surya. Use when: (1) Extracting text from images/screenshots, (2) Processing PDFs with embedded images, (3) Multi-language document OCR, (4) Layout analysis and table...

binjuhor / shadcn-lar-ai-multimodal exact

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech...

binhmuc / autobot-review-ai-multimodal exact

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech...

Brawl345 / browser-tools exact

Interact with a web browser. Can start a browser, connect to it, evaluate JavaScript, make screenshots, read console logs and let the user select DOM elements. Use when interacting with unknown...

jackspace / claudeskillz-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

mrgoonie / claudekit-skills-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

zircote / claude-ai-multimodal exact

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis...

samhvw8 / dot-claude-ai-multimodal exact

Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection,...

omer-metin / skills-for-antigravity-blog-writing exact

Legendary blog writing that makes readers forget they're reading. This skill combines the narrative mastery of Paul Graham's essays, the technical accessibility of Julia Evans, the conversational...

qodex-ai / ai-agent-skills-visual-quality-improver exact

Enhance and improve image quality and visual content. Applies enhancement techniques, color correction, and optimization transformations.

imsus / pi-extension-minimax-coding-plan-mcp-minimax-image-understanding exact

Analyze images using AI with the understand_image tool

iamzifei / xiaohongshu-images-skill exact

Transform markdown/HTML into styled 3:4 ratio images for Xiaohongshu

felix-huber / appbuilder-skill-agent-browser exact

Browser automation for E2E testing. Use when testing user journeys, verifying UI behavior, or running end-to-end tests.