runhuman-testing

by @volter-ai in DevOps & Cloud

# Install this skill:

npx skills add volter-ai/runhuman-skills

Or install specific skill: npx add-skill https://github.com/volter-ai/runhuman-skills

# Description

Create and manage human-powered QA tests using Runhuman CLI. Use this skill when you need to test web applications with real human testers, get UX feedback, validate user flows, check mobile responsiveness, or find bugs that automated tests miss. Covers CLI commands, writing effective test descriptions, common workflows (pre-deployment, CI/CD, mobile testing), best practices, and troubleshooting. Ideal for testing signup flows, checkout processes, visual layouts, and exploratory testing.

# SKILL.md

name: runhuman-testing
description: Create and manage human-powered QA tests using Runhuman CLI. Use this skill when you need to test web applications with real human testers, get UX feedback, validate user flows, check mobile responsiveness, or find bugs that automated tests miss. Covers CLI commands, writing effective test descriptions, common workflows (pre-deployment, CI/CD, mobile testing), best practices, and troubleshooting. Ideal for testing signup flows, checkout processes, visual layouts, and exploratory testing.

Runhuman Testing Skills

Version: 1.0.0
Skills Format: Agent Skills v1
Last Updated: 2026-01-30

Overview

This skill helps AI agents create and manage human-powered QA tests using Runhuman. Runhuman connects AI coding tools to on-demand professional human testers who provide real user feedback.

Prerequisites

Node.js 18+ or Bun 1.0.3+
Runhuman API key (get one at https://runhuman.com)
Basic understanding of QA testing concepts

Core Concepts

What is Runhuman?

Runhuman is a human QA testing API. Instead of writing automated tests, you describe what you want tested in natural language, and real human testers perform the test and provide structured feedback.

Key Benefits:
- Catch issues automated tests miss (UX problems, visual bugs, mobile responsiveness)
- Get results in 1-2 minutes average
- No test maintenance overhead
- Real human perspective on user experience

When to Use Runhuman

✅ Good Use Cases:
- User flows (signup, checkout, onboarding)
- Visual testing (layout, responsive design, cross-browser)
- Mobile testing (touch interactions, scrolling, gestures)
- UX feedback (confusing UI, accessibility issues)
- Exploratory testing (finding edge cases)

❌ Not Ideal For:
- Unit tests (use Jest/Vitest)
- Performance testing (use Lighthouse)
- Security testing (use dedicated security tools)
- Tests that need to run 100+ times per day (cost prohibitive)

CLI Usage (Primary Method)

Installation

# Install CLI globally
npm install -g @runhuman/cli

# Or use npx (no installation)
npx @runhuman/cli test <url>

Quick Start

Simplest possible test:

npx @runhuman/cli test https://example.com

This sends your site to a human tester with default instructions: "Explore the site and report any issues."

Basic CLI Patterns

1. Test with Custom Description

npx @runhuman/cli test https://example.com \
  --description "Test the checkout flow: add item to cart, proceed to checkout, fill in shipping info. Check for bugs and UX issues."

Best Practice: Be specific about:
- What pages/flows to test
- What actions to perform
- What to look for (bugs, UX issues, mobile responsiveness)

2. Wait for Results Synchronously

# Block until test completes (useful in CI/CD)
npx @runhuman/cli test https://example.com \
  --description "Test signup form" \
  --wait

When to use:
- CI/CD pipelines (block deployment if test fails)
- Critical flows before merging PR
- One-off manual testing

3. Configure Timeouts

# Set custom timeout (default: 5 minutes)
npx @runhuman/cli test https://example.com \
  --timeout 300000 \  # 5 minutes in milliseconds
  --description "Full e-commerce flow"

4. Check Test Status

# Get status of a specific job
npx @runhuman/cli status <job-id>

# List all recent tests
npx @runhuman/cli list

Common CLI Workflows

Workflow 1: Pre-Deployment Check

#!/bin/bash
# Test critical flows before deploying

echo "Testing staging site before deployment..."

npx @runhuman/cli test https://staging.myapp.com \
  --description "Test user signup: click 'Sign Up', fill form, verify email sent" \
  --wait

if [ $? -eq 0 ]; then
  echo "✅ Tests passed - proceeding with deployment"
  ./deploy.sh
else
  echo "❌ Tests failed - deployment aborted"
  exit 1
fi

Workflow 2: Mobile Responsiveness Test

# Test on mobile viewport
npx @runhuman/cli test https://myapp.com \
  --description "Test on mobile device: check responsive layout, touch interactions, and navigation menu. Report any layout issues or hard-to-tap buttons."

Workflow 3: Cross-Browser Testing

# Multiple tests for different browsers
npx @runhuman/cli test https://myapp.com \
  --description "Test in Chrome: full site navigation and checkout flow"

npx @runhuman/cli test https://myapp.com \
  --description "Test in Safari: same as Chrome test, note any browser-specific issues"

Advanced CLI Usage

Environment Variables

# Set API key via environment variable
export RUNHUMAN_API_KEY=rh_live_abc123

# Use staging API
export RUNHUMAN_API_URL=https://staging.runhuman.com/api

# Then run tests without --api-key flag
npx @runhuman/cli test https://example.com

Configuration File

Create .runhumanrc.json in your project root:

{
  "apiKey": "rh_live_abc123",
  "defaultTimeout": 300000,
  "defaultDescription": "Test for bugs and UX issues"
}

Output Formats

# JSON output (for scripting)
npx @runhuman/cli test https://example.com --format json

# Pretty console output (default)
npx @runhuman/cli test https://example.com --format pretty

# CI/CD output (minimal, exit codes only)
npx @runhuman/cli test https://example.com --format ci

Writing Effective Test Descriptions

✅ Good Descriptions

Example 1: Specific Flow

Test the checkout process:
1. Add product to cart
2. Click "Checkout"
3. Fill in shipping address
4. Select payment method
5. Review order
6. Complete purchase

Report: bugs, UX confusions, mobile layout issues

Example 2: Exploratory Testing

Explore the dashboard as a new user. Try to:
- Create a project
- Add team members
- Navigate all menu items

Report: anything confusing, broken links, visual bugs

Example 3: Visual Testing

Check the homepage on mobile:
- Does text fit without wrapping awkwardly?
- Are images properly sized?
- Can you tap all buttons easily?
- Does the navigation menu work?

Report: layout issues, hard-to-read text, broken images

❌ Bad Descriptions

Too Vague:

"Test the site" ❌
Better: "Test the signup flow: click sign up, fill form, check confirmation email"

Too Technical:

"Verify JWT token expiration after 15 minutes of inactivity" ❌
Better: "Log in, wait 15 minutes without activity, then try to use the app. Check if you're logged out."

Missing Context:

"Click the button" ❌
Better: "Click the 'Get Started' button in the hero section"

Integration Patterns

Pattern 1: Pre-Merge PR Testing

# In .github/workflows/pr-test.yml
name: Pre-Merge QA Test

on:
  pull_request:
    types: [opened, synchronize]

jobs:
  human-qa:
    runs-on: ubuntu-latest
    steps:
      - name: Deploy PR preview
        run: ./deploy-preview.sh ${{ github.event.pull_request.number }}

      - name: Run human QA test
        run: |
          npx @runhuman/cli test https://pr-${{ github.event.pull_request.number }}.preview.app \
            --description "Test the changes in this PR: ${{ github.event.pull_request.title }}" \
            --wait \
            --api-key ${{ secrets.RUNHUMAN_API_KEY }}

Pattern 2: Nightly Full Regression

# In .github/workflows/nightly-qa.yml
name: Nightly Full Regression

on:
  schedule:
    - cron: '0 2 * * *'  # 2 AM daily

jobs:
  regression:
    runs-on: ubuntu-latest
    steps:
      - name: Test critical user flows
        run: |
          npx @runhuman/cli test https://production.app \
            --description "Full regression: signup, login, checkout, account settings" \
            --wait \
            --timeout 600000 \  # 10 minutes
            --api-key ${{ secrets.RUNHUMAN_API_KEY }}

Pattern 3: Manual Testing Script

#!/bin/bash
# test-suite.sh - Run multiple tests in sequence

echo "🧪 Running Runhuman Test Suite"

# Test 1: Homepage
npx @runhuman/cli test https://myapp.com \
  --description "Test homepage: hero section, navigation, footer links"

# Test 2: Signup
npx @runhuman/cli test https://myapp.com/signup \
  --description "Test signup: form validation, error messages, success flow"

# Test 3: Mobile
npx @runhuman/cli test https://myapp.com \
  --description "Test on mobile: responsive layout, touch interactions"

echo "✅ All tests submitted. Check dashboard for results."

Best Practices

1. Test Description Guidelines

Be specific: Include exact steps or flows
Include context: What page, what button, what form
Define success: What should happen vs. what to look for
List concerns: Bugs? UX? Mobile? Accessibility?

2. When to Use --wait Flag

Use --wait when:
- Running in CI/CD pipelines
- Need to block deployment
- Testing critical flows pre-release

Don't use --wait when:
- Running many tests in parallel
- Testing non-blocking features
- Doing exploratory testing

3. Frequency Recommendations

Per Commit: Only critical flows (signup, checkout)
Per PR: Moderate (flows changed in PR)
Nightly: Comprehensive (full regression suite)
Pre-Release: Exhaustive (everything)

4. Cost Optimization

Tests cost ~$1-3 per test. Optimize by:
- Batch related tests: "Test signup AND login" (1 test) vs. two separate tests
- Use --wait judiciously: Only when necessary
- Test staging, not production: Cheaper to catch bugs early
- Prioritize critical flows: Not every page needs daily testing

Troubleshooting

Error: "API key not found"

Solution:

# Set API key
export RUNHUMAN_API_KEY=rh_live_abc123

# Or pass inline
npx @runhuman/cli test https://example.com --api-key rh_live_abc123

Error: "Timeout exceeded"

Solution:

# Increase timeout for complex flows
npx @runhuman/cli test https://example.com \
  --timeout 600000 \  # 10 minutes
  --description "Complex multi-step flow"

Error: "Invalid URL"

Solution:
- Ensure URL includes protocol: https://example.com not example.com
- Ensure site is publicly accessible (not localhost)
- For localhost testing, use a tunnel (ngrok, etc.)

Test Results Show "No Issues Found"

Possible reasons:
- Test description too vague
- Site genuinely has no issues (rare!)
- Tester didn't understand instructions

Solution: Make description more specific with exact steps.

Quick Reference

Most Common Commands

# Basic test
npx @runhuman/cli test <url>

# Test with description
npx @runhuman/cli test <url> --description "..."

# Synchronous test (wait for result)
npx @runhuman/cli test <url> --wait

# Check status
npx @runhuman/cli status <job-id>

# List recent tests
npx @runhuman/cli list

Environment Variables

RUNHUMAN_API_KEY=rh_live_...     # Your API key
RUNHUMAN_API_URL=https://...     # API endpoint (optional)

Exit Codes

0 - Test passed (no critical issues found)
1 - Test failed (critical issues found)
2 - Test timeout
3 - API error

Additional Resources

Dashboard: https://runhuman.com/dashboard
API Docs: https://runhuman.com/docs/api
GitHub Actions: https://github.com/marketplace/actions/runhuman-qa-test
Support: https://runhuman.com/discord

End of Skill

# README.md

Runhuman Agent Skills

Agent Skills for Runhuman - Best practices and procedural knowledge for AI agents working with human-powered QA testing.

What are Agent Skills?

Agent Skills are folders of instructions, scripts, and resources that AI agents can discover and use to do things more accurately and efficiently. This package contains Runhuman-specific knowledge in the Agent Skills format.

Installation

npx skills add volter-ai/runhuman-skills

Supported AI Tools

These skills work with any Agent Skills-compatible tool:
- Claude Code (Anthropic)
- Cursor
- Codex
- Cline
- Any AI agent supporting the Agent Skills format

What's Included

This skills package teaches AI agents how to:

CLI Usage (Primary Focus)

Run QA tests using @runhuman/cli
Write effective test descriptions
Configure timeouts and options
Integrate with CI/CD pipelines
Optimize costs and test frequency

Core Concepts

When to use human QA vs. automated testing
Best practices for test descriptions
Common workflow patterns
Troubleshooting guide

Integration Patterns

Pre-merge PR testing
Nightly regression testing
Manual test scripts
GitHub Actions integration

Usage with Claude Code

Install the skills:
bash npx skills add volter-ai/runhuman-skills
Start Claude Code in your project:
bash claude
Prompt Claude to create tests:
"Create a QA test for my checkout flow using Runhuman CLI"

Claude will now use the skills to generate correct commands and best practices automatically!

Example Prompts

Once skills are installed, try these prompts with Claude Code:

Basic Testing

"Test my signup flow at https://staging.myapp.com using Runhuman"

CI/CD Integration

"Add a Runhuman test to my GitHub Actions workflow for PR testing"

Complex Workflows

"Create a test suite script that tests signup, login, and checkout flows"

Mobile Testing

"Test mobile responsiveness of my landing page with Runhuman"

SKILL.md - Main skill file with comprehensive CLI usage guide (460+ lines)
README.md - This file

Repository

https://github.com/volter-ai/runhuman-skills

License

MIT

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.

runhuman-testing

# Description

# SKILL.md

Runhuman Testing Skills

Overview

Prerequisites

Core Concepts

What is Runhuman?

When to Use Runhuman

CLI Usage (Primary Method)

Installation

Quick Start

Basic CLI Patterns

1. Test with Custom Description

2. Wait for Results Synchronously

3. Configure Timeouts

4. Check Test Status

Common CLI Workflows

Workflow 1: Pre-Deployment Check

Workflow 2: Mobile Responsiveness Test

Workflow 3: Cross-Browser Testing

Advanced CLI Usage

Environment Variables

Configuration File

Output Formats

Writing Effective Test Descriptions

✅ Good Descriptions

❌ Bad Descriptions

Integration Patterns

Pattern 1: Pre-Merge PR Testing

Pattern 2: Nightly Full Regression

Pattern 3: Manual Testing Script

Best Practices

1. Test Description Guidelines

2. When to Use --wait Flag

3. Frequency Recommendations

4. Cost Optimization

Troubleshooting

Error: "API key not found"

Error: "Timeout exceeded"

Error: "Invalid URL"

Test Results Show "No Issues Found"

Quick Reference

Most Common Commands

Environment Variables

Exit Codes

Additional Resources

# README.md

Runhuman Agent Skills

What are Agent Skills?

Installation

Supported AI Tools

What's Included

CLI Usage (Primary Focus)

Core Concepts

Integration Patterns

Usage with Claude Code

Example Prompts

Basic Testing

CI/CD Integration

Complex Workflows

Mobile Testing

Contents

Repository

License

Links

# Related Skills

# Supported AI Coding Agents

Confirm

Submit a Skill