stagehand-expert

Name: stagehand-expert
Rating: 5 (3 reviews)
Author: YuniorGlez

by @YuniorGlez in DevOps & Cloud

# Install this skill:

npx skills add YuniorGlez/gemini-elite-core --skill "stagehand-expert"

Install specific skill from multi-skill repository

# Description

Master Architect in Stagehand V3. Expert in Direct CDP Automation, Decision Caching, and Agentic Web Orchestration for 2026.

# SKILL.md

name: stagehand-expert
id: stagehand-expert
version: 4.1.0
description: "Master Architect in Stagehand V3. Expert in Direct CDP Automation, Decision Caching, and Agentic Web Orchestration for 2026."

🎭 Skill: Stagehand Expert (v4.1.0)

Executive Summary

The stagehand-expert is the elite specialist in browser automation and high-precision agent orchestration. In 2026, web automation has shifted from brittle selectors to Natural Language Primitives and Direct CDP Communication. This skill focuses on mastering Stagehand V3, leveraging Decision Caching for zero-cost CI/CD, and navigating complex Shadow DOM/iframe structures with 44% more velocity.

🔍 Proactive Investigation Protocol

Before writing a single test, the expert MUST perform a Deep Discovery:
1. Route Mapping: identify the user flow from page.tsx or router configs.
2. UI Component Audit: Read source code to find IDs, labels, and loading states.
3. Vibe Check: Measure layout stability using the CDP "Vibe Score."
4. Schema Inference: Analyze existing backend/DB types to create 100% compatible extract() Zod schemas.

🚫 The "Do Not" List (Anti-Patterns)

Anti-Pattern	Why it fails in 2026	Modern Alternative
Manual Frame Switching	Fragile and slow.	Use DeepLocator (>>) & CDP.
Hardcoded Wait(2000)	Unreliable and causes jank.	Use `domSettleTimeout`.
Missing finally { close() }	Leaves zombie processes.	Mandatory `try...finally`.
LLM Calls in CI	Slow and expensive.	Use Persistent Decision Caches.
Ignoring CSS Animations	Interactions fail during transitions.	Use Reanimated-aware Waiters.

⚡ Core Primitives Mastery

Act: Precise natural language instructions with mapped variables.
Observe: Single-turn identification of all page elements for 70% cost reduction.
Extract: Structured, Zod-validated data pulling with semantic flattening.

💾 Advanced Decision Caching

Transform E2E tests into a deterministic asset:
- Develop Locally: Live LLM generates the cache.
- Commit Cache: Store DOM snapshots and results in Git.
- Zero-Cost CI: Run tests in "Cached-Only" mode.

See References: Agent Caching for details.

🤖 Autonomous Agents & CUA

For the most complex UIs (Cross-origin iframes, dynamic canvas):
- Computer Use Agent (CUA): Pure visual reasoning for impossible-to-parse elements.
- Safety Callbacks: Mandatory human-in-the-loop for financial or destructive actions.

📖 Reference Library

Detailed deep-dives into Stagehand Excellence:

Direct CDP Communication: Velocity and deep access.
Agent Caching: Determinism and cost savings.
Shadow DOM Mastery: Jumping документ boundaries.
Installation & Setup: The Bun/Playwright stack.

Updated: January 22, 2026 - 21:20

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.