# Thomas-TyTech/browser-tools

# Install this skill:
npx skills add Thomas-TyTech/browser-skill

Or install this specific skill: npx add-skill https://github.com/Thomas-TyTech/browser-skill

# Description

Minimal browser automation toolkit for web scraping, testing, and site exploration using Chrome DevTools Protocol. Use when tasks require browser interaction, JavaScript execution, element selection, or visual inspection of web pages. More efficient than browser MCP servers, using only ~225 tokens vs 13,000-18,000 tokens.

# SKILL.md


---
name: browser-tools
description: Minimal browser automation toolkit for web scraping, testing, and site exploration using Chrome DevTools Protocol. Use when tasks require browser interaction, JavaScript execution, element selection, or visual inspection of web pages. More efficient than browser MCP servers, using only ~225 tokens vs 13,000-18,000 tokens.
license: MIT
---

Browser Tools

Minimal CDP-based browser automation for collaborative site exploration, web scraping, and frontend development tasks. Provides token-efficient browser control without the overhead of full MCP servers.

All scripts are globally available with the browser-tools- prefix and can be executed from any directory.

When to Use This Skill

Use this skill when tasks involve:
- Web scraping and data extraction
- Frontend development and testing
- Visual inspection of web pages
- Interactive element selection for DOM analysis
- JavaScript execution in browser context
- Site exploration requiring browser state (cookies, sessions)
- Building deterministic scrapers with authenticated sessions

Core Browser Tools

Start Chrome Browser

browser-tools-start              # Fresh profile
browser-tools-start --profile    # Copy user profile (cookies, logins)

Start Chrome on :9222 with remote debugging enabled. Use --profile to copy your default Chrome profile for authenticated sessions. Kills any existing Chrome instances first.

Navigate to URLs

browser-tools-nav https://example.com       # Navigate current tab
browser-tools-nav https://example.com --new # Open new tab

Navigate to URLs in existing tab or create new tabs. Waits for DOM content to load before returning.

Execute JavaScript

browser-tools-eval 'document.title'
browser-tools-eval 'document.querySelectorAll("a").length'
browser-tools-eval 'Array.from(document.querySelectorAll("h1")).map(h => h.textContent)'
browser-tools-eval 'document.querySelector("form").submit()'

Execute JavaScript in active page context with full DOM access and async support. Results are automatically formatted for readability - arrays and objects are displayed with key-value pairs.
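
For instance, structured data can be pulled out in one call and saved to a file; the .article and .byline selectors and field names below are illustrative, not part of the toolkit:

browser-tools-eval 'JSON.stringify(Array.from(document.querySelectorAll(".article")).map(a => ({
  headline: a.querySelector("h2")?.textContent?.trim(),
  author: a.querySelector(".byline")?.textContent?.trim()
})), null, 2)' > articles.json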

Take Screenshots

browser-tools-screenshot

Capture viewport screenshot of active tab. Returns temporary file path with timestamp for visual analysis. Use with Read tool to analyze images.

Interactive Element Selection

browser-tools-pick "Select the navigation menu items"
browser-tools-pick "Click the form fields to extract"
browser-tools-pick "Choose the product cards"

Launch interactive element picker overlay on the active page. Instructions:
- Move mouse to highlight elements
- Click to select single element
- Cmd/Ctrl+Click for multi-select
- Enter to finish with selected elements
- ESC to cancel

Returns detailed element information including tag, id, class, text content, HTML snippet, and parent hierarchy for building selectors.

Extract Cookies

browser-tools-cookies                    # Extract all cookies
browser-tools-cookies --domain google.com # Extract cookies for specific domain

Extract HTTP-only cookies for authenticated scraping. Useful for building deterministic scrapers that need to maintain session state.
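
For example, a session can be captured once and reused in a later run (the domain and filename here are placeholders):

browser-tools-cookies --domain target-site.com > session-cookies.json
browser-tools-cookies | grep -i session     # quick check that a session cookie exists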

Installation & Setup

The skill requires puppeteer-core to be installed:

cd ~/agent-tools/browser-tools
npm install

Make scripts executable:

chmod +x scripts/*.js

Workflow Patterns

Web Scraping Setup

  1. Start browser with profile: browser-tools-start --profile
  2. Navigate to target: browser-tools-nav https://target-site.com
  3. Use pick tool to identify elements interactively: browser-tools-pick "Select data elements"
  4. Execute JavaScript to extract data: browser-tools-eval 'extractionCode'
  5. Save results using standard file operations or pipe to files (a worked example follows)
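
A minimal end-to-end pass, using a hypothetical listings page and a .listing selector discovered with the pick tool, might look like:

browser-tools-start --profile
browser-tools-nav https://target-site.com/listings
browser-tools-pick "Select one listing card"
browser-tools-eval 'JSON.stringify(Array.from(document.querySelectorAll(".listing")).map(el => el.textContent.trim()))' > listings.json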

Frontend Development

  1. Start fresh browser: browser-tools-start
  2. Navigate to dev server: browser-tools-nav http://localhost:3000
  3. Execute JavaScript for debugging: browser-tools-eval 'debugCode'
  4. Take screenshots for visual regression testing: browser-tools-screenshot (a sample run follows)
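
For instance, a quick check of a local dev server (the port and header selector are illustrative) can be chained as:

browser-tools-start
browser-tools-nav http://localhost:3000
browser-tools-eval 'getComputedStyle(document.querySelector("header")).backgroundColor'
browser-tools-screenshot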

Multi-page Workflows

Chain navigation and extraction commands:

browser-tools-nav https://site.com/page1 && browser-tools-eval 'extractData()' > page1.json
browser-tools-nav https://site.com/page2 && browser-tools-eval 'extractData()' > page2.json

Building Scrapers with Pick Tool

  1. Use pick tool to interactively identify elements: browser-tools-pick "Select product info"
  2. Copy returned selectors and element info
  3. Build extraction JavaScript using the selectors
  4. Test with eval tool: browser-tools-eval 'document.querySelectorAll("selector")'
  5. Create a deterministic scraper script (see the example below)
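
For example, if the picker reports a card class such as .product-card (hypothetical), the selector can be sanity-checked and then turned into an extraction one-liner:

browser-tools-eval 'document.querySelectorAll(".product-card").length'   # sanity-check the selector
browser-tools-eval 'JSON.stringify(Array.from(document.querySelectorAll(".product-card")).map(c => c.querySelector("a")?.href))' > product-links.json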

Token Efficiency

This skill uses ~225 tokens compared to 13,000-18,000 tokens for equivalent MCP servers. Efficiency comes from:
- Leveraging existing model knowledge of Bash and JavaScript
- Focused tool descriptions rather than comprehensive API documentation
- Minimal context overhead while maintaining full functionality

Composability Benefits

Unlike MCP servers, tool outputs are fully composable:
- Pipe results to files for batch processing: browser-tools-eval 'data()' > output.json
- Chain multiple commands: browser-tools-nav url && browser-tools-eval 'code'
- Process outputs with standard Unix tools: browser-tools-cookies | grep domain
- Save intermediate results without consuming context space
- Build complex workflows combining multiple tools (an example batch loop follows)
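
For example, a batch job over several pages can be scripted entirely in the shell; the URLs and the extractData() snippet are placeholders, as above:

for url in https://site.com/page1 https://site.com/page2 https://site.com/page3; do
  browser-tools-nav "$url" && browser-tools-eval 'JSON.stringify(extractData())' >> all-pages.jsonl
done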

Implementation Notes

  • All scripts use Puppeteer Core connecting to Chrome on localhost:9222
  • JavaScript execution happens in async page context with full DOM access
  • Element picker provides detailed selector and context information for scraper building
  • Screenshots saved to temporary directory with timestamp filenames
  • Cookie extraction includes HTTP-only cookies inaccessible to page JavaScript
  • Scripts handle connection failures gracefully and provide clear error messages

Extension Pattern

Add new tools by creating focused scripts that:
1. Connect to Chrome DevTools on :9222
2. Perform specific operation
3. Output structured, parseable results
4. Handle errors gracefully with clear messages
5. Disconnect cleanly

Keep each tool focused on single responsibility for maximum reusability and token efficiency. Follow the same argument parsing and output formatting patterns as existing tools.
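
As a sketch of that pattern, a hypothetical browser-tools-title script (not part of the toolkit) could report the active tab's URL and title; it assumes puppeteer-core is installed and Chrome is already running on :9222:

#!/usr/bin/env node
// Hypothetical browser-tools-title: reports the active tab's URL and title.
const puppeteer = require('puppeteer-core');

(async () => {
  try {
    // Connect to the already-running Chrome instance on :9222 (step 1)
    const browser = await puppeteer.connect({ browserURL: 'http://localhost:9222' });
    // Perform one focused operation on the most recently opened tab (step 2)
    const pages = await browser.pages();
    const page = pages[pages.length - 1];
    // Output a structured, parseable result (step 3)
    console.log(JSON.stringify({ url: page.url(), title: await page.title() }));
    // Disconnect cleanly without closing the browser (step 5)
    await browser.disconnect();
  } catch (err) {
    // Fail with a clear message instead of a stack trace (step 4)
    console.error(`Could not reach Chrome on :9222 - is browser-tools-start running? (${err.message})`);
    process.exit(1);
  }
})();

Saved as scripts/title.js and made executable, a tool like this would slot in alongside the existing scripts.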

# README.md

Browser Tools - Global Agent Setup

This directory contains browser automation tools designed to work globally across all Claude agents.

Quick Setup

  1. Install dependencies:
    cd ~/agent-tools/browser-tools
    npm install

  2. Add to shell PATH (add to your ~/.zshrc or ~/.bashrc):
    export PATH="$PATH:$HOME/agent-tools/browser-tools/scripts"

  3. Or create alias for Claude Code:
    alias cl="PATH=$PATH:$HOME/agent-tools/browser-tools/scripts claude --dangerously-skip-permissions"

  4. Add to Claude Code working directories:
    # In Claude Code, run:
    /add-dir ~/agent-tools/browser-tools
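
Once these steps are done, a quick smoke test confirms the scripts are on PATH (example.com is just a neutral target):

browser-tools-start
browser-tools-nav https://example.com
browser-tools-eval 'document.title'        # should print something like "Example Domain"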

Global Usage

Once set up, all tools are available globally:

  • browser-tools-start - Start Chrome with debugging
  • browser-tools-nav - Navigate to URLs
  • browser-tools-eval - Execute JavaScript
  • browser-tools-screenshot - Capture screenshots
  • browser-tools-pick - Interactive element picker
  • browser-tools-cookies - Extract cookies

Referencing the Skill

In Claude Code, reference the skill with:

@SKILL.md

This loads the complete browser tools documentation into context (only ~225 tokens vs 13,000+ for MCP servers).

Platform Notes

macOS: Chrome path is hardcoded to /Applications/Google Chrome.app/Contents/MacOS/Google Chrome

Linux/Windows: Update the Chrome path in scripts/start.js for your platform (a lookup sketch is shown below):
- Linux: /usr/bin/google-chrome or /opt/google/chrome/chrome
- Windows: "C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe"
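
A minimal platform-aware lookup could replace the hardcoded value; this is only a sketch of the idea, not the actual contents of scripts/start.js:

// Sketch: pick a Chrome executable path based on the current platform
const CHROME_PATHS = {
  darwin: '/Applications/Google Chrome.app/Contents/MacOS/Google Chrome',
  linux: '/usr/bin/google-chrome',
  win32: 'C:\\Program Files\\Google\\Chrome\\Application\\chrome.exe',
};
const chromePath = CHROME_PATHS[process.platform] || '/usr/bin/google-chrome';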

Benefits Over MCP Servers

  • Token Efficient: ~225 tokens vs 13,000-18,000 for browser MCP servers
  • Composable: Pipe outputs, chain commands, process with Unix tools
  • Extensible: Easy to add new focused tools
  • Fast: No MCP server overhead or context switching
  • Universal: Works with any agent that can run Bash commands

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents.

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.