# judge

A skill by terry-li-hm.
# Install this skill

```
npx skills add terry-li-hm/skills --skill "judge"
```

Installs this specific skill from its multi-skill repository.

# Description

Review drafts and analyses against quality criteria. Use after generating output that benefits from a second pass.

# SKILL.md


```yaml
---
name: judge
description: Review drafts and analyses against quality criteria. Use after generating output that benefits from a second pass.
user_invocable: false
---
```


## Judge Skill

Reusable quality review for skill outputs. Call this after generating drafts, analyses, or any output that benefits from a second pass.

### Trigger

Called by other skills, not directly by the user. Skills invoke this at the end of their workflow:

Execute main task → Call /judge → Iterate if needed → Output
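
As a sketch, that loop might look like the following (every name here is a hypothetical stand-in for illustration, not the skill's actual API):

```python
# Hypothetical sketch of the calling pattern; all functions are stand-ins.

def run_main_task() -> str:
    return "Hi Jane, I enjoyed your recent post about..."  # placeholder draft

def judge(output: str, goal: str) -> dict:
    # Stand-in for invoking /judge; returns the structured result
    # described under Workflow below.
    return {"verdict": "pass", "issues": []}

def revise(draft: str, issues: list) -> str:
    return draft  # placeholder revision step

MAX_ITERATIONS = 2

draft = run_main_task()                      # execute main task
for _ in range(MAX_ITERATIONS):
    result = judge(output=draft, goal="personalized networking message")
    if result["verdict"] == "pass":          # call /judge; stop once it passes
        break
    draft = revise(draft, result["issues"])  # iterate if needed
print(draft)                                 # output
```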

### Inputs

- `output` (required): the content to judge
- `goal` (required): what the output was supposed to achieve
- `domain` (optional): which criteria set to use: `outreach`, `job-eval`, `technical`, `article`, or `default`; auto-detected from the goal if not specified
- `max_iterations` (optional): how many revision cycles before giving up (default: 2)
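
For example, an invoking skill might pass inputs like these (the field names come from the list above; the values are illustrative):

```yaml
output: "Hi Jane, I enjoyed your recent post about..."  # the draft to judge
goal: "personalized networking message"
domain: outreach        # optional; auto-detected from the goal if omitted
max_iterations: 2       # optional; 2 is the default
```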

### Workflow

1. Load criteria from `criteria/{domain}.yaml` (or auto-detect the domain from the goal).
2. Evaluate the output against each check:
   - For each criterion, assess pass/fail
   - Weight by importance (high/medium/low)
   - Note specific issues found
3. Decide the verdict (a sketch of this rule follows this list):
   - All high-weight checks pass + majority of medium → pass
   - Any high-weight check fails → needs_work
   - Only low-weight issues → pass with notes
4. Return a structured result:

   ```yaml
   verdict: pass | needs_work
   score: 0-100
   issues:
     - check: personalization
       severity: high
       problem: "No specific reference to recipient's work"
       suggestion: "Mention their recent post about X"
   summary: "One-line overall assessment"
   ```

5. If called with iteration context:
   - Compare against the previous version
   - Note what improved and what's still missing
   - If the same issues persist after max_iterations, return pass with caveats
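
A minimal sketch of the verdict rule in step 3, assuming each check has already been evaluated to pass/fail (a hypothetical helper for illustration; the skill itself performs this judgment in prose, not code):

```python
def decide_verdict(results: list[dict]) -> str:
    """Apply the weighting rules above.

    Each result: {"check": str, "weight": "high"|"medium"|"low", "passed": bool}.
    """
    high = [r for r in results if r["weight"] == "high"]
    medium = [r for r in results if r["weight"] == "medium"]

    # Any high-weight failure is a blocker.
    if any(not r["passed"] for r in high):
        return "needs_work"

    # All high-weight checks passed; also require a majority of medium.
    # (Whether a tie counts as a majority is unspecified; this treats it as failing.)
    if medium and sum(r["passed"] for r in medium) * 2 <= len(medium):
        return "needs_work"

    # Only low-weight issues remain, if any: pass, with the issues
    # still reported as notes in the structured result.
    return "pass"
```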

### Auto-Detection Logic

If the domain is not specified, infer it from keywords in the goal:

| Keywords in goal | Domain |
| --- | --- |
| outreach, message, linkedin, email, networking | outreach |
| job, role, position, application, evaluate | job-eval |
| code, technical, implementation, architecture | technical |
| article, blog, essay, writing | article |
| (none matched) | default |
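
In code, that rule might look like this (a hypothetical sketch: the keyword lists mirror the table, and which domain wins when several match is an assumption):

```python
# Keyword lists mirror the table above.
DOMAIN_KEYWORDS = {
    "outreach": ["outreach", "message", "linkedin", "email", "networking"],
    "job-eval": ["job", "role", "position", "application", "evaluate"],
    "technical": ["code", "technical", "implementation", "architecture"],
    "article": ["article", "blog", "essay", "writing"],
}

def detect_domain(goal: str) -> str:
    goal_lower = goal.lower()
    for domain, keywords in DOMAIN_KEYWORDS.items():
        if any(kw in goal_lower for kw in keywords):
            return domain  # first matching domain wins (an assumption)
    return "default"       # (none matched)

# e.g. detect_domain("evaluate this job posting") -> "job-eval"
```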

### Integration Example

Other skills call judge like this:

```markdown
## Workflow (in /outreach)

1. Gather context about recipient
2. Draft personalized message
3. **Review with judge:**
   - Call judge with output=draft, goal="personalized networking message", domain="outreach"
   - If needs_work: revise based on feedback
   - Max 2 iterations
4. Output final draft to user
```

### Criteria Files

Located in the `criteria/` subdirectory. Each file defines domain-specific checks.

Format:

```yaml
name: Domain Name
description: What this domain covers
checks:
  - name: check_name
    question: "Yes/no question to evaluate"
    weight: high | medium | low
    examples:
      good: "Example of passing"
      bad: "Example of failing"
```

### Output Modes

- For skill integration (default): return structured YAML for programmatic handling.
- For user review (if called directly): return a readable summary with specific feedback.
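
For illustration, the structured result shown under Workflow might render for user review like this (the exact wording is illustrative, not prescribed by the skill):

```
Verdict: needs work
- [high] personalization: no specific reference to the recipient's work.
  Suggestion: mention their recent post about X.
Summary: one-line overall assessment.
```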

### Notes

- Be constructive, not just critical: always include how to fix.
- High-weight failures are blockers; low-weight issues are suggestions.
- Don't over-iterate: returns diminish after 2 passes.
- Trust the criteria files; don't invent new checks on the fly.

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents.

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.