context-engineering-collection

Name: context-engineering-collection
Rating: 5 (7653 reviews)
Author: muratcankoylan

by @muratcankoylan in AI & LLM

7,653

606

# Install this skill:

npx skills add muratcankoylan/Agent-Skills-for-Context-Engineering

Or install specific skill: npx add-skill https://github.com/muratcankoylan/Agent-Skills-for-Context-Engineering

⭐ Awesome Collections

This skill is featured in:

⭐ Agent Skills for Context Engineering

# Description

A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require effective context management.

# SKILL.md

name: context-engineering-collection
description: A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems that require effective context management.

Agent Skills for Context Engineering

This collection provides structured guidance for building production-grade AI agent systems through effective context engineering.

When to Activate

Activate these skills when:
- Building new agent systems from scratch
- Optimizing existing agent performance
- Debugging context-related failures
- Designing multi-agent architectures
- Creating or evaluating tools for agents
- Implementing memory and persistence layers

Skill Map

Foundational Context Engineering

Understanding Context Fundamentals
Context is not just prompt text—it is the complete state available to the language model at inference time, including system instructions, tool definitions, retrieved documents, message history, and tool outputs. Effective context engineering means understanding what information truly matters for the task at hand and curating that information for maximum signal-to-noise ratio.

Recognizing Context Degradation
Language models exhibit predictable degradation patterns as context grows: the "lost-in-middle" phenomenon where information in the center of context receives less attention; U-shaped attention curves that prioritize beginning and end; context poisoning when errors compound; and context distraction when irrelevant information overwhelms relevant content.

Architectural Patterns

Multi-Agent Coordination
Production multi-agent systems converge on three dominant patterns: supervisor/orchestrator architectures with centralized control, peer-to-peer swarm architectures for flexible handoffs, and hierarchical structures for complex task decomposition. The critical insight is that sub-agents exist primarily to isolate context rather than to simulate organizational roles.

Memory System Design
Memory architectures range from simple scratchpads to sophisticated temporal knowledge graphs. Vector RAG provides semantic retrieval but loses relationship information. Knowledge graphs preserve structure but require more engineering investment. The file-system-as-memory pattern enables just-in-time context loading without stuffing context windows.

Filesystem-Based Context
The filesystem provides a single interface for storing, retrieving, and updating effectively unlimited context. Key patterns include scratch pads for tool output offloading, plan persistence for long-horizon tasks, sub-agent communication via shared files, and dynamic skill loading. Agents use ls, glob, grep, and read_file for targeted context discovery, often outperforming semantic search for structural queries.

Hosted Agent Infrastructure
Background coding agents run in remote sandboxed environments rather than on local machines. Key patterns include pre-built environment images refreshed on regular cadence, warm sandbox pools for instant session starts, filesystem snapshots for session persistence, and multiplayer support for collaborative agent sessions. Critical optimizations include allowing file reads before git sync completes (blocking only writes), predictive sandbox warming when users start typing, and self-spawning agents for parallel task execution.

Tool Design Principles
Tools are contracts between deterministic systems and non-deterministic agents. Effective tool design follows the consolidation principle (prefer single comprehensive tools over multiple narrow ones), returns contextual information in errors, supports response format options for token efficiency, and uses clear namespacing.

Operational Excellence

Context Compression
When agent sessions exhaust memory, compression becomes mandatory. The correct optimization target is tokens-per-task, not tokens-per-request. Structured summarization with explicit sections for files, decisions, and next steps preserves more useful information than aggressive compression. Artifact trail integrity remains the weakest dimension across all compression methods.

Context Optimization
Techniques include compaction (summarizing context near limits), observation masking (replacing verbose tool outputs with references), prefix caching (reusing KV blocks across requests), and strategic context partitioning (splitting work across sub-agents with isolated contexts).

Evaluation Frameworks
Production agent evaluation requires multi-dimensional rubrics covering factual accuracy, completeness, tool efficiency, and process quality. Effective patterns include LLM-as-judge for scalability, human evaluation for edge cases, and end-state evaluation for agents that mutate persistent state.

Development Methodology

Project Development
Effective LLM project development begins with task-model fit analysis: validating through manual prototyping that a task is well-suited for LLM processing before building automation. Production pipelines follow staged, idempotent architectures (acquire, prepare, process, parse, render) with file system state management for debugging and caching. Structured output design with explicit format specifications enables reliable parsing. Start with minimal architecture and add complexity only when proven necessary.

Core Concepts

The collection is organized around three core themes. First, context fundamentals establish what context is, how attention mechanisms work, and why context quality matters more than quantity. Second, architectural patterns cover the structures and coordination mechanisms that enable effective agent systems. Third, operational excellence addresses the ongoing work of optimizing and evaluating production systems.

Practical Guidance

Each skill can be used independently or in combination. Start with fundamentals to establish context management mental models. Branch into architectural patterns based on your system requirements. Reference operational skills when optimizing production systems.

The skills are platform-agnostic and work with Claude Code, Cursor, or any agent framework that supports custom instructions or skill-like constructs.

Integration

This collection integrates with itself—skills reference each other and build on shared concepts. The fundamentals skill provides context for all other skills. Architectural skills (multi-agent, memory, tools) can be combined for complex systems. Operational skills (optimization, evaluation) apply to any system built using the foundational and architectural skills.

References

Internal skills in this collection:
- context-fundamentals
- context-degradation
- context-compression
- multi-agent-patterns
- memory-systems
- tool-design
- filesystem-context
- hosted-agents
- context-optimization
- evaluation
- project-development

External resources on context engineering:
- Research on attention mechanisms and context window limitations
- Production experience from leading AI labs on agent system design
- Framework documentation for LangGraph, AutoGen, and CrewAI

Skill Metadata

Created: 2025-12-20
Last Updated: 2025-12-25
Author: Agent Skills for Context Engineering Contributors
Version: 1.2.0

# README.md

Agent Skills for Context Engineering

A comprehensive, open collection of Agent Skills focused on context engineering principles for building production-grade AI agent systems. These skills teach the art and science of curating context to maximize agent effectiveness across any agent platform.

What is Context Engineering?

Context engineering is the discipline of managing the language model's context window. Unlike prompt engineering, which focuses on crafting effective instructions, context engineering addresses the holistic curation of all information that enters the model's limited attention budget: system prompts, tool definitions, retrieved documents, message history, and tool outputs.

The fundamental challenge is that context windows are constrained not by raw token capacity but by attention mechanics. As context length increases, models exhibit predictable degradation patterns: the "lost-in-the-middle" phenomenon, U-shaped attention curves, and attention scarcity. Effective context engineering means finding the smallest possible set of high-signal tokens that maximize the likelihood of desired outcomes.

Skills Overview

Foundational Skills

These skills establish the foundational understanding required for all subsequent context engineering work.

Skill	Description
context-fundamentals	Understand what context is, why it matters, and the anatomy of context in agent systems
context-degradation	Recognize patterns of context failure: lost-in-middle, poisoning, distraction, and clash
context-compression	Design and evaluate compression strategies for long-running sessions

Architectural Skills

These skills cover the patterns and structures for building effective agent systems.

Skill	Description
multi-agent-patterns	Master orchestrator, peer-to-peer, and hierarchical multi-agent architectures
memory-systems	Design short-term, long-term, and graph-based memory architectures
tool-design	Build tools that agents can use effectively
filesystem-context	Use filesystems for dynamic context discovery, tool output offloading, and plan persistence
hosted-agents	NEW Build background coding agents with sandboxed VMs, pre-built images, multiplayer support, and multi-client interfaces

Operational Skills

These skills address the ongoing operation and optimization of agent systems.

Skill	Description
context-optimization	Apply compaction, masking, and caching strategies
evaluation	Build evaluation frameworks for agent systems
advanced-evaluation	Master LLM-as-a-Judge techniques: direct scoring, pairwise comparison, rubric generation, and bias mitigation

Development Methodology

These skills cover the meta-level practices for building LLM-powered projects.

Skill	Description
project-development	Design and build LLM projects from ideation through deployment, including task-model fit analysis, pipeline architecture, and structured output design

Cognitive Architecture Skills

These skills cover formal cognitive modeling for rational agent systems.

Skill	Description
bdi-mental-states	NEW Transform external RDF context into agent mental states (beliefs, desires, intentions) using formal BDI ontology patterns for deliberative reasoning and explainability

Design Philosophy

Progressive Disclosure

Each skill is structured for efficient context use. At startup, agents load only skill names and descriptions. Full content loads only when a skill is activated for relevant tasks.

Platform Agnosticism

These skills focus on transferable principles rather than vendor-specific implementations. The patterns work across Claude Code, Cursor, and any agent platform that supports skills or allows custom instructions.

Conceptual Foundation with Practical Examples

Scripts and examples demonstrate concepts using Python pseudocode that works across environments without requiring specific dependency installations.

Usage

Usage with Claude Code

This repository is a Claude Code Plugin Marketplace containing context engineering skills that Claude automatically discovers and activates based on your task context.

Installation

Step 1: Add the Marketplace

Run this command in Claude Code to register this repository as a plugin source:

/plugin marketplace add muratcankoylan/Agent-Skills-for-Context-Engineering

Step 2: Browse and Install

Option A - Browse available plugins:
1. Select Browse and install plugins
2. Select context-engineering-marketplace
3. Choose a plugin (e.g., context-engineering-fundamentals, agent-architecture)
4. Select Install now

Option B - Direct install via command:

/plugin install context-engineering-fundamentals@context-engineering-marketplace
/plugin install agent-architecture@context-engineering-marketplace
/plugin install agent-evaluation@context-engineering-marketplace
/plugin install agent-development@context-engineering-marketplace
/plugin install cognitive-architecture@context-engineering-marketplace

Available Plugins

Plugin	Skills Included
`context-engineering-fundamentals`	context-fundamentals, context-degradation, context-compression, context-optimization
`agent-architecture`	multi-agent-patterns, memory-systems, tool-design, filesystem-context, hosted-agents
`agent-evaluation`	evaluation, advanced-evaluation
`agent-development`	project-development
`cognitive-architecture`	bdi-mental-states

Skill Triggers

Skill	Triggers On
`context-fundamentals`	"understand context", "explain context windows", "design agent architecture"
`context-degradation`	"diagnose context problems", "fix lost-in-middle", "debug agent failures"
`context-compression`	"compress context", "summarize conversation", "reduce token usage"
`context-optimization`	"optimize context", "reduce token costs", "implement KV-cache"
`multi-agent-patterns`	"design multi-agent system", "implement supervisor pattern"
`memory-systems`	"implement agent memory", "build knowledge graph", "track entities"
`tool-design`	"design agent tools", "reduce tool complexity", "implement MCP tools"
`filesystem-context`	"offload context to files", "dynamic context discovery", "agent scratch pad", "file-based context"
`hosted-agents`	"build background agent", "create hosted coding agent", "sandboxed execution", "multiplayer agent", "Modal sandboxes"
`evaluation`	"evaluate agent performance", "build test framework", "measure quality"
`advanced-evaluation`	"implement LLM-as-judge", "compare model outputs", "mitigate bias"
`project-development`	"start LLM project", "design batch pipeline", "evaluate task-model fit"
`bdi-mental-states`	"model agent mental states", "implement BDI architecture", "transform RDF to beliefs", "build cognitive agent"

Screenshot 2025-12-26 at 12 34 47 PM

For Cursor & Codex & IDE

Copy skill content into .rules or create project-specific Skills folders. The skills provide the context and guidelines that agent needs for effective context engineering and agent design.

For Custom Implementations

Extract the principles and patterns from any skill and implement them in your agent framework. The skills are deliberately platform-agnostic.

Examples

The examples folder contains complete system designs that demonstrate how multiple skills work together in practice.

Example	Description	Skills Applied
digital-brain-skill	NEW Personal operating system for founders and creators. Complete Claude Code skill with 6 modules, 4 automation scripts	context-fundamentals, context-optimization, memory-systems, tool-design, multi-agent-patterns, evaluation, project-development
x-to-book-system	Multi-agent system that monitors X accounts and generates daily synthesized books	multi-agent-patterns, memory-systems, context-optimization, tool-design, evaluation
llm-as-judge-skills	Production-ready LLM evaluation tools with TypeScript implementation, 19 passing tests	advanced-evaluation, tool-design, context-fundamentals, evaluation
book-sft-pipeline	Train models to write in any author's style. Includes Gertrude Stein case study with 70% human score on Pangram, $2 total cost	project-development, context-compression, multi-agent-patterns, evaluation

Each example includes:
- Complete PRD with architecture decisions
- Skills mapping showing which concepts informed each decision
- Implementation guidance

Digital Brain Skill Example

The digital-brain-skill example is a complete personal operating system demonstrating comprehensive skills application:

Progressive Disclosure: 3-level loading (SKILL.md → MODULE.md → data files)
Module Isolation: 6 independent modules (identity, content, knowledge, network, operations, agents)
Append-Only Memory: JSONL files with schema-first lines for agent-friendly parsing
Automation Scripts: 4 consolidated tools (weekly_review, content_ideas, stale_contacts, idea_to_draft)

Includes detailed traceability in HOW-SKILLS-BUILT-THIS.md mapping every architectural decision to specific skill principles.

LLM-as-Judge Skills Example

The llm-as-judge-skills example is a complete TypeScript implementation demonstrating:

Direct Scoring: Evaluate responses against weighted criteria with rubric support
Pairwise Comparison: Compare responses with position bias mitigation
Rubric Generation: Create domain-specific evaluation standards
EvaluatorAgent: High-level agent combining all evaluation capabilities

Book SFT Pipeline Example

The book-sft-pipeline example demonstrates training small models (8B) to write in any author's style:

Intelligent Segmentation: Two-tier chunking with overlap for maximum training examples
Prompt Diversity: 15+ templates to prevent memorization and force style learning
Tinker Integration: Complete LoRA training workflow with $2 total cost
Validation Methodology: Modern scenario testing proves style transfer vs content memorization

Integrates with context engineering skills: project-development, context-compression, multi-agent-patterns, evaluation.

Star History

star-history-2026113

Structure

Each skill follows the Agent Skills specification:

skill-name/
├── SKILL.md              # Required: instructions + metadata
├── scripts/              # Optional: executable code demonstrating concepts
└── references/           # Optional: additional documentation and resources

See the template folder for the canonical skill structure.

Contributing

This repository follows the Agent Skills open development model. Contributions are welcome from the broader ecosystem. When contributing:

Follow the skill template structure
Provide clear, actionable instructions
Include working examples where appropriate
Document trade-offs and potential issues
Keep SKILL.md under 500 lines for optimal performance

Feel free to contact Muratcan Koylan for collaboration opportunities or any inquiries.

License

MIT License - see LICENSE file for details.

References

The principles in these skills are derived from research and production experience at leading AI labs and framework developers. Each skill includes references to the underlying research and case studies that inform its recommendations.

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

⚡ Amp 🚀 Antigravity 🤖 Claude Code 🦀 Clawdbot 📝 Codex ▶️ Cursor 🤖 Droid 💎 Gemini CLI 🐙 GitHub Copilot 🪿 Goose 📊 Kilo Code 🔧 Kiro CLI 💻 OpenCode 🦘 Roo Code 🌲 Trae 🏄 Windsurf

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.

context-engineering-collection

⭐ Awesome Collections

# Description

# SKILL.md

Agent Skills for Context Engineering

When to Activate

Skill Map

Foundational Context Engineering

Architectural Patterns

Operational Excellence

Development Methodology

Core Concepts

Practical Guidance

Integration

References

Skill Metadata

# README.md

Agent Skills for Context Engineering

What is Context Engineering?

Skills Overview

Foundational Skills

Architectural Skills

Operational Skills

Development Methodology

Cognitive Architecture Skills

Design Philosophy

Progressive Disclosure

Platform Agnosticism

Conceptual Foundation with Practical Examples

Usage

Usage with Claude Code

Installation

Available Plugins

Skill Triggers

For Cursor & Codex & IDE

For Custom Implementations

Examples

Digital Brain Skill Example

LLM-as-Judge Skills Example

Book SFT Pipeline Example

Star History

Structure

Contributing

License

References

# Related Skills

# Supported AI Coding Agents

Confirm

Submit a Skill