Change persistent state with checkpoint and rollback support. Use when modifying files, updating databases, changing...
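
As a rough illustration of the checkpoint-and-rollback idea in the entry above, here is a minimal in-memory sketch. The class `CheckpointedState` and its methods are hypothetical names chosen for this example and are not taken from the skill itself. A real implementation would presumably target files or databases rather than a Python dict.

```python
import copy


class CheckpointedState:
    """Hypothetical in-memory store; illustrates checkpoint/rollback only."""

    def __init__(self, initial):
        self._state = copy.deepcopy(initial)
        self._checkpoints = []

    def checkpoint(self):
        # Save a deep copy so later mutations cannot alter the saved snapshot.
        self._checkpoints.append(copy.deepcopy(self._state))
        return len(self._checkpoints) - 1

    def mutate(self, key, value):
        self._state[key] = value

    def rollback(self, checkpoint_id):
        # Restore the saved snapshot and discard any later checkpoints.
        self._state = copy.deepcopy(self._checkpoints[checkpoint_id])
        del self._checkpoints[checkpoint_id + 1:]

    def snapshot(self):
        return copy.deepcopy(self._state)


store = CheckpointedState({"version": 1})
cp = store.checkpoint()
store.mutate("version", 2)
store.rollback(cp)  # state is back to what it was at the checkpoint
```

Deep copies keep the sketch simple at the cost of memory. A diff-based or journal-based log would be the likelier choice for large state.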

Quantify values with uncertainty bounds. Use when estimating metrics, calculating risk scores, assessing magnitude,...
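
One common way to attach uncertainty bounds to an estimate is a normal-approximation confidence interval around a sample mean. The sketch below assumes that approach. The entry above does not say which method the skill uses, and the function name `estimate_with_bounds` is hypothetical.

```python
import math
import statistics


def estimate_with_bounds(samples, z=1.96):
    """Return (mean, lower, upper) using a normal-approximation interval.

    z=1.96 corresponds to roughly 95% coverage under the normal assumption.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples to estimate spread")
    mean = statistics.fmean(samples)
    # Standard error of the mean from the sample standard deviation.
    stderr = statistics.stdev(samples) / math.sqrt(len(samples))
    return mean, mean - z * stderr, mean + z * stderr


mean, lower, upper = estimate_with_bounds([10, 12, 11, 13, 9])
```

For small samples a t-distribution multiplier would give wider and more honest bounds than the fixed `z` used here.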

Execute a composed workflow by name. Use when running predefined workflows, orchestrating multi-step processes, or...

Combine heterogeneous data sources into a unified model with conflict resolution, schema alignment, and provenance...

Request clarification when input is ambiguous. Use when a user request has missing parameters, conflicting...

Anchor claims to evidence from authoritative sources. Use when validating assertions, establishing provenance,...

Create a new artifact (text, code, plan, data) under specified constraints. Use when producing content, writing...

Produce clear reasoning with assumptions, causal chains, and evidence. Use when clarifying decisions, teaching...

Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4...

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models...

Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training...

Split work across subagents with explicit contracts, interfaces, and merge strategies. Use when parallelizing tasks,...

Break a goal into subgoals, constraints, and acceptance criteria. Use when planning complex work, creating work...

Execute the Debug Code Change workflow end-to-end with safety gates. Use when debugging code changes, investigating...

Find failure modes, edge cases, ambiguities, and exploit paths in plans, code, or designs. Use when reviewing...

Enforce policies, guardrails, and permission boundaries; refuse unsafe actions and apply least privilege. Use when...

Compare multiple alternatives using explicit criteria, weighted scoring, and tradeoff analysis. Use when choosing...
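
The weighted-scoring comparison named in the entry above can be sketched as a normalized weighted sum per alternative. This is an assumption about the general technique, not the skill's actual procedure. The function names, criteria, and scores are all hypothetical.

```python
def weighted_scores(alternatives, weights):
    """Map each alternative to its weight-normalized score.

    alternatives: {name: {criterion: score}}
    weights:      {criterion: weight}
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return {
        name: sum(weights[c] * scores[c] for c in weights) / total
        for name, scores in alternatives.items()
    }


def rank(alternatives, weights):
    """Return alternative names ordered from best to worst score."""
    scores = weighted_scores(alternatives, weights)
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical example data: higher scores are better on every criterion.
options = {
    "A": {"cost": 4, "speed": 2},
    "B": {"cost": 2, "speed": 5},
}
criteria_weights = {"cost": 1, "speed": 3}
ranking = rank(options, criteria_weights)
```

Making the weights explicit keeps the tradeoff visible: changing `criteria_weights` to favor `cost` can flip the ranking, which is the kind of sensitivity a tradeoff analysis would want to surface.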

Assign labels or categories to items based on characteristics. Use when categorizing entities, tagging content,...

Create a safety checkpoint marker before mutation or execution steps. Use when about to modify files, execute plans,...