npx skills add terry-li-hm/skills --skill "content-fetch"
Install specific skill from multi-skill repository
# Description
Reference for URL fetching patterns and fallbacks. Consult when extracting content from web URLs.
# SKILL.md
name: content-fetch
description: Reference for URL fetching patterns and fallbacks. Consult when extracting content from web URLs.
user_invocable: false
## Content Fetch
Patterns for fetching and extracting content from URLs.
### Tool Selection
| URL Type | Primary Tool | Fallback |
|---|---|---|
| General web | WebFetch | tavily_extract |
| WeChat articles | wechat-article script | Manual copy |
| YouTube | youtube-transcript | yt-dlp |
| PDFs | pdf-extract (LlamaParse) | Local OCR |
| Login-required | Browser automation | None |
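The table above can be read as a lookup from URL type to a (primary, fallback) pair. A minimal sketch, assuming the URL type is guessed from the host and path; `classify_url`, `select_tools`, and the host and suffix checks are hypothetical helpers for illustration and not part of the skill:

```python
from urllib.parse import urlparse

# (primary tool, fallback) per URL type, copied from the table above.
TOOLS = {
    "general": ("WebFetch", "tavily_extract"),
    "wechat": ("wechat-article script", "Manual copy"),
    "youtube": ("youtube-transcript", "yt-dlp"),
    "pdf": ("pdf-extract (LlamaParse)", "Local OCR"),
    "login": ("Browser automation", None),
}


def classify_url(url: str) -> str:
    """Guess the URL type. The host and suffix checks are assumptions."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    path = parsed.path.lower()
    if host == "weixin.qq.com" or host.endswith(".weixin.qq.com"):
        return "wechat"
    if host == "youtu.be" or host == "youtube.com" or host.endswith(".youtube.com"):
        return "youtube"
    if path.endswith(".pdf"):
        return "pdf"
    return "general"


def select_tools(url: str, login_required: bool = False):
    """Return (primary, fallback) for a URL; the caller decides login_required."""
    kind = "login" if login_required else classify_url(url)
    return TOOLS[kind]
```

Whether a page needs a login cannot be read off the URL alone, so the sketch takes it as an explicit flag.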
### WebFetch Patterns
WebFetch(url, prompt="Extract the main content")
Handles:
- HTML → Markdown conversion
- Redirect following (returns redirect URL if different host)
- 15-minute cache
Gotchas:
- Fails on authenticated pages (Google Docs, Confluence, Jira)
- Use ToolSearch first to find specialized MCP tools
### Redirect Handling
When WebFetch returns a redirect message:
1. Extract the redirect URL from the response
2. Make a new WebFetch request with that URL
3. Don't assume the original URL worked
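The three steps above can be sketched as a small loop. This assumes a hypothetical `fetch` callable that returns a `FetchResult` with either `content` or a `redirect_url`; the real WebFetch response shape is not specified here, and the hop limit is an added safeguard:

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class FetchResult:
    """Hypothetical response shape: content on success, redirect_url otherwise."""
    content: Optional[str] = None
    redirect_url: Optional[str] = None


def fetch_following_redirects(
    fetch: Callable[[str], FetchResult], url: str, max_hops: int = 3
) -> Tuple[str, Optional[str]]:
    """Re-issue the request for each reported redirect; return (final_url, content)."""
    seen = set()
    for _ in range(max_hops + 1):
        if url in seen:
            raise RuntimeError(f"redirect loop at {url}")
        seen.add(url)
        result = fetch(url)
        if result.redirect_url is None:
            # No redirect reported, so this URL is the one that actually worked.
            return url, result.content
        url = result.redirect_url
    raise RuntimeError("too many redirects")
```

Returning the final URL alongside the content makes it explicit which address was actually fetched.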
### WeChat URL Patterns
| Pattern | Type | Handling |
|---|---|---|
| `mp.weixin.qq.com/s/...` | Short URL | Fetch directly |
| `mp.weixin.qq.com/s?__biz=...` | Long URL | Fetch directly |
| `weixin.qq.com/r/...` | QR redirect | Follow redirect first |
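A minimal sketch of how these three patterns might be told apart with the standard library. `classify_wechat_url` is a hypothetical helper, and the exact matching rules (for example, requiring a non-empty `__biz` query parameter for the long form) are assumptions:

```python
from typing import Optional, Tuple
from urllib.parse import parse_qs, urlparse


def classify_wechat_url(url: str) -> Optional[Tuple[str, str]]:
    """Return (type, handling) for a WeChat URL, or None if no pattern matches."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    path = parsed.path
    if host == "mp.weixin.qq.com":
        if path.startswith("/s/"):
            return ("Short URL", "Fetch directly")
        if path == "/s" and "__biz" in parse_qs(parsed.query):
            return ("Long URL", "Fetch directly")
    if host == "weixin.qq.com" and path.startswith("/r/"):
        return ("QR redirect", "Follow redirect first")
    return None
```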
### Error Handling
| Error | Meaning | Action |
|---|---|---|
| `INVALID_URL` | Malformed URL | Check URL format |
| `POOR_CONTENT_QUALITY` | Extraction failed | Try tavily_extract |
| 404 | Page not found | URL may have expired |
| 429 | Rate limited | Wait and retry |
| Login wall | Requires auth | Use browser automation |
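The table can be encoded as a lookup, with the 429 row handled by a retry loop. A sketch under stated assumptions: `call` is a hypothetical zero-argument function returning `(status, content)`, the `"login_wall"` key is an illustrative label, and exponential backoff is one possible reading of "wait and retry", not something the table specifies:

```python
import time
from typing import Callable, Optional, Tuple

# Error -> action, copied from the table above ("login_wall" is an illustrative key).
ACTIONS = {
    "INVALID_URL": "Check URL format",
    "POOR_CONTENT_QUALITY": "Try tavily_extract",
    "404": "URL may have expired",
    "429": "Wait and retry",
    "login_wall": "Use browser automation",
}


def next_action(error) -> str:
    """Look up the suggested action for an error code or HTTP status."""
    return ACTIONS.get(str(error), "Unknown error")


def retry_on_rate_limit(
    call: Callable[[], Tuple[int, Optional[str]]],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, Optional[str]]:
    """Retry while the status is 429, doubling the delay between attempts."""
    for attempt in range(max_attempts):
        status, content = call()
        if status != 429:
            return status, content
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)
    # Still rate limited after the last attempt; hand the 429 back to the caller.
    return status, content
```

Passing `sleep` as a parameter keeps the loop testable without real waiting.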
### Fallback Hierarchy
1. WebFetch (fast, cached)
↓ fails
2. tavily_extract (better at complex pages)
↓ fails
3. Browser automation (for login-required)
↓ fails
4. Ask user for copy/paste
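The hierarchy above amounts to trying each fetcher in order and stopping at the first one that yields content. A minimal sketch, assuming each fetcher is a plain callable that returns text, returns an empty value, or raises on failure; `fetch_with_fallbacks` and that calling convention are hypothetical:

```python
from typing import Callable, List, Optional, Tuple

Fetcher = Callable[[str], Optional[str]]


def fetch_with_fallbacks(
    url: str, fetchers: List[Tuple[str, Fetcher]]
) -> Tuple[Optional[str], Optional[str], List[Tuple[str, str]]]:
    """Try fetchers in order; return (tool_name, content, errors_so_far).

    tool_name is None when every fetcher failed, which corresponds to the
    last step of the hierarchy: ask the user to copy and paste the content.
    """
    errors: List[Tuple[str, str]] = []
    for name, fetcher in fetchers:
        try:
            content = fetcher(url)
        except Exception as exc:  # any failure moves on to the next tool
            errors.append((name, str(exc)))
            continue
        if content:
            return name, content, errors
        errors.append((name, "empty content"))
    return None, None, errors
```

Keeping the error list lets the caller tell the user which tools were tried before asking for a manual paste.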
### Login-Required Sites
These always need browser automation:
- LinkedIn (job pages, profiles)
- X/Twitter
- WhatsApp Web
- Most banking/corporate sites
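A host check along these lines could skip straight to browser automation for the named sites. The domain set below is an assumption that covers only the services listed above; banking and corporate sites would have to be added case by case, and `needs_browser_automation` is a hypothetical helper:

```python
from urllib.parse import urlparse

# Assumed domains for the services named above; extend as needed.
LOGIN_REQUIRED_DOMAINS = {"linkedin.com", "x.com", "twitter.com", "web.whatsapp.com"}


def needs_browser_automation(url: str) -> bool:
    """True if the host is a listed domain or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in LOGIN_REQUIRED_DOMAINS)
```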
### Content Extraction Prompts
For articles:
"Extract the main article content, including title, author, date, and body text"
For job postings:
"Extract job title, company, location, requirements, responsibilities, and salary if disclosed"
For documentation:
"Extract the technical documentation, including code examples"
### Related Skills
- evaluate-article: Article evaluation workflow
- wechat-article: WeChat-specific extraction
- youtube-transcript: YouTube transcript extraction
- pdf-extract: PDF text extraction
- chrome-automation: Browser fallback patterns
# Supported AI Coding Agents
This skill is compatible with the SKILL.md standard and works with all major AI coding agents.