Change persistent state with checkpoint and rollback support. Use when modifying files, updating databases, changing...
cat ~/新着
マーケットプレイスに追加された最新のスキルを閲覧
Quantify values with uncertainty bounds. Use when estimating metrics, calculating risk scores, assessing magnitude,...
Execute a composed workflow by name. Use when running predefined workflows, orchestrating multi-step processes, or...
Combine heterogeneous data sources into a unified model with conflict resolution, schema alignment, and provenance...
Request clarification when input is ambiguous. Use when user request has missing parameters, conflicting...
Anchor claims to evidence from authoritative sources. Use when validating assertions, establishing provenance,...
Create a new artifact (text, code, plan, data) under specified constraints. Use when producing content, writing...
Produce clear reasoning with assumptions, causal chains, and evidence. Use when clarifying decisions, teaching...
Provides guidance for training LLMs with reinforcement learning using verl (Volcano Engine RL). Use when...
Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment,...
Provides guidance for PyTorch-native agentic RL using torchforge, Meta's library separating infra from algorithms....
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM...
Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4...
High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models...
Provides guidance for enterprise-grade RL training using miles, a production-ready fork of slime. Use when training...
Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training
Split work across subagents with explicit contracts, interfaces, and merge strategies. Use when parallelizing tasks,...
Break a goal into subgoals, constraints, and acceptance criteria. Use when planning complex work, creating work...
Execute the Debug Code Change workflow end-to-end with safety gates. Use when debugging code changes, investigating...
Find failure modes, edge cases, ambiguities, and exploit paths in plans, code, or designs. Use when reviewing...
Enforce policies, guardrails, and permission boundaries; refuse unsafe actions and apply least privilege. Use when...
Compare multiple alternatives using explicit criteria, weighted scoring, and tradeoff analysis. Use when choosing...
Assign labels or categories to items based on characteristics. Use when categorizing entities, tagging content,...
Create a safety checkpoint marker before mutation or execution steps. Use when about to modify files, execute plans,...