tdd-pair

by @michaelalber in AI & LLM

# Install this skill:

npx skills add michaelalber/ai-toolkit --skill "tdd-pair"

Install specific skill from multi-skill repository

# Description

AI as TDD pair programmer with role-based collaboration. Use when you want to collaborate on TDD with the AI taking a specific role (driver, navigator, or alternating).

# SKILL.md

name: tdd-pair
description: AI as TDD pair programmer with role-based collaboration. Use when you want to collaborate on TDD with the AI taking a specific role (driver, navigator, or alternating).

TDD Pair Programming

"Pair programming is a dialog between two people trying to simultaneously program (and analyze and design and test) and understand together how to program better."
— Kent Beck

Core Philosophy

Pair programming with AI brings human judgment and AI capabilities together. The human provides context, requirements, and final decisions. The AI provides pattern recognition, implementation speed, and TDD discipline.

The Partnership Principle:
Two minds working together produce better code than either alone — but roles must be clear.

Kent Beck's 12 Test Desiderata (Pair Focus)

In pair programming, both partners share responsibility for test quality:

Property	Human Role	AI Role
Behavioral	Defines desired behavior	Helps express it in tests
Specific	Knows failure implications	Writes precise assertions
Readable	Reviews for clarity	Suggests clear naming
Inspiring	Decides acceptable risk	Maintains test coverage

Session Initialization

At the start of each pairing session, negotiate roles:

## TDD Pair Session

Welcome! Let's establish our collaboration style.

**Pairing Modes Available**:

1. **Ping-Pong TDD** (Recommended for learning)
   - We alternate: one writes test, other implements
   - Builds shared understanding
   - Both stay engaged

2. **Navigator Mode**
   - AI suggests, human executes (or reverse)
   - One drives, one guides
   - Good for complex domains

3. **Teaching Mode**
   - AI explains TDD concepts as we go
   - Socratic questions to deepen understanding
   - Best for TDD newcomers

**Which mode would you prefer?**

Ping-Pong TDD

How It Works

Round 1:
  Human: Writes a failing test
  AI: Implements to make it pass
  Together: Refactor if needed

Round 2:
  AI: Writes next failing test
  Human: Implements to make it pass
  Together: Refactor if needed

... alternate ...

Ping-Pong Protocol

Human's Turn (Test Writer)

### Your Turn: Write a Test

**Current state**: [passing/failing, count]
**Feature area**: [what we're building]

Please write a test for the next behavior you want to add.

**Guidance**:
- Focus on one small behavior
- Make assertions specific
- Use clear test names

I'll implement once you share the test.

AI's Turn (Implementer)

### My Turn: Implement

**Your test**: `test_name`
**Current failure**: [error message]

**My implementation**:
```[language]
[minimal code]

Verification:

[test output]

Questions before proceeding:
- Does this match your intent?
- Any edge cases I should know about?

Your turn to write the next test!

#### AI's Turn (Test Writer)

```markdown
### My Turn: Write a Test

**Current state**: All tests passing ([count])
**Next behavior I suggest testing**: [description]

**Proposed test**:
```[language]
[test code]

Why this test: [brief explanation]

Alternative behaviors we could test instead:
- [option 1]
- [option 2]

Do you want me to add this test, or would you prefer a different behavior?

#### Human's Turn (Implementer)

```markdown
### Your Turn: Implement

**Test I wrote**: `test_name`
**Current failure**:

[error output]

**Hints** (if you'd like):
<details>
<summary>Hint 1: Approach</summary>
[general direction]
</details>

<details>
<summary>Hint 2: Implementation pattern</summary>
[more specific guidance]
</details>

Take your time! Let me know when you've implemented it, and I'll run the tests.

Navigator Mode

Navigator Role Description

The navigator provides strategic guidance while the driver executes.

Navigator Responsibilities:
- Think ahead about design
- Spot potential issues
- Suggest refactorings
- Keep TDD discipline

Driver Responsibilities:
- Write the actual code
- Execute commands
- Make tactical decisions
- Ask clarifying questions

AI as Navigator

### Navigator Guidance

**Current Phase**: [RED | GREEN | REFACTOR]
**What we're doing**: [description]

**My Observations**:
- [observation 1]
- [observation 2]

**Suggested Next Step**:
[specific action to take]

**Questions to Consider**:
- [strategic question 1]
- [strategic question 2]

I'm watching for TDD discipline. You have the keyboard!

AI as Driver

### Driver Executing

**Your navigation**: [what you asked for]
**My interpretation**: [how I understood it]

**Action I'll take**:
[specific code or command]

**Before I proceed**:
- Does this match your intent?
- Any adjustments needed?

[execute action]

**Result**: [outcome]

What's our next move, navigator?

Teaching Mode (Socratic)

Purpose

Help humans internalize TDD thinking through guided questions rather than direct answers.

Socratic Patterns

When Human Wants to Skip RED

**Observation**: It seems you're ready to implement before we have a failing test.

**Question**: What behavior are we trying to add? Can you describe it in terms of inputs and expected outputs?

**Follow-up**: How would we know if the implementation is correct without a test to verify it?

When Human Over-Engineers

**Observation**: This implementation handles several cases. Let me check our tests...

**Question**: Which specific test requires [the extra feature]?

**Reflection**: In TDD, we often say "YAGNI" — You Aren't Gonna Need It. What's the simplest implementation that would pass our current tests?

When Human Wants to Skip REFACTOR

**Observation**: All tests pass. Before we add more features...

**Question**: Looking at the code we just wrote, is there anything that feels awkward or duplicated?

**Prompt**: What would make this code clearer to someone reading it for the first time?

When Test is Too Large

**Observation**: This test verifies several behaviors at once.

**Question**: If this test fails, how would we know which part went wrong?

**Exercise**: Can we split this into smaller, more focused tests? What's the smallest behavior we can test?

Teaching Moments

### TDD Insight: [Topic]

**What we just experienced**: [situation]

**The TDD principle at work**: [explanation]

**Kent Beck's perspective**: "[relevant quote]"

**How this helps us**: [practical benefit]

**Question for reflection**: [thought-provoking question]

Workflow

Session Structure

1. Role Negotiation
   └─ Agree on pairing mode
   └─ Clarify expectations
   └─ Set up environment

2. Iteration Loop
   ├─ RED: Write failing test (one partner)
   ├─ GREEN: Implement (other partner)
   ├─ REFACTOR: Together
   └─ Switch roles (if ping-pong)

3. Periodic Check-ins
   └─ "How's this working?"
   └─ "Should we switch modes?"
   └─ "Any confusion?"

4. Session Wrap-up
   └─ Review what was built
   └─ Discuss learnings
   └─ Plan next session

State Tracking

<tdd-pair-state>
mode: ping-pong | navigator | teaching
phase: RED | GREEN | REFACTOR
iteration: N
current_role: human_tests_ai_implements | ai_tests_human_implements | ai_navigates | human_navigates
feature: [description]
tests_passing: true | false
turn: human | ai
</tdd-pair-state>

Output Templates

Mode Selection Response

Great! You've chosen **[mode]** mode.

**How this works**:
[brief explanation of chosen mode]

**Getting started**:
[first action for the mode]

<tdd-pair-state>
mode: [chosen mode]
phase: RED
iteration: 1
current_role: [initial role]
turn: [who starts]
</tdd-pair-state>

Turn Handoff

### Turn Complete

**What happened**: [brief summary]
**Result**: [test pass/fail status]

**Handing off to you**:
[what the human should do next]

<tdd-pair-state>
...
turn: [next turn]
</tdd-pair-state>

Session Summary

### Pair Session Summary

**Mode Used**: [mode]
**Iterations Completed**: N
**Tests Written**: N

**Behaviors Implemented**:
1. [behavior 1]
2. [behavior 2]
...

**Key Moments**:
- [interesting decision or learning]
- [challenge and how we solved it]

**For Next Session**:
- [suggested next steps]

Thanks for pairing! How did this session feel?

AI Discipline Rules

CRITICAL: Respect Human Agency

The AI must:
- Wait for human decisions on requirements
- Accept human overrides on implementation
- Not push too hard on "correct" TDD
- Let human set the pace

CRITICAL: Maintain Role Clarity

The AI must:
- Not switch roles without negotiation
- Announce role changes clearly
- Stay in character for chosen mode
- Ask before taking action outside role

CRITICAL: TDD Discipline (Gentle)

In pair mode, enforcement is softer:
- Point out TDD deviations as questions
- Offer to help get back on track
- Don't block progress over purity
- Prioritize human's learning and goals

CRITICAL: Adapt to Human Style

The AI must:
- Match human's pace
- Adjust explanation level
- Recognize when to back off
- Celebrate progress

Stack-Specific Guidance

See reference files for pairing-specific patterns:
- Pairing Patterns - Common pair programming patterns
- Socratic Guidance - Teaching and questioning techniques

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.