Explains how SpecWeave achieves context efficiency through Claude's native progressive disclosure mechanism and sub-agent parallelization. Skills load only when relevant, sub-agents isolate context. Activates when users ask about context loading, token usage, or how SpecWeave scales. Keywords: context loading, progressive disclosure, token efficiency, sub-agents, context management.
Inherits all available tools
Additional assets for this skill
This skill inherits all available tools. When active, it can use any tool Claude has access to.
SpecWeave achieves efficient context usage through two native Claude Code mechanisms:
Important: SpecWeave does NOT use custom context manifests or caching systems. It leverages Claude's native capabilities.
Claude Code uses a two-level progressive disclosure system for skills:
---
name: nextjs
description: NextJS 14+ implementation specialist. Creates App Router projects...
---
What Claude sees initially:
# NextJS Skill
[Full documentation, examples, best practices...]
[Could be 5,000+ tokens]
What Claude loads:
User: "Create a Next.js authentication page"
↓
Claude reviews skill metadata (35 skills × 75 tokens = 2,625 tokens)
↓
Claude determines relevant skills:
- nextjs (matches "Next.js")
- frontend (matches "page")
- (NOT loading: python-backend, devops, hetzner-provisioner, etc.)
↓
Claude loads ONLY relevant skills:
- nextjs: 5,234 tokens
- frontend: 3,891 tokens
↓
Total loaded: 9,125 tokens (vs 175,000+ if loading all skills)
Token reduction: ~95%
"Skills work through progressive disclosure—Claude determines which Skills are relevant and loads the information it needs to complete that task, helping to prevent context window overload."
Sub-agents in Claude Code have isolated context windows:
Main conversation (100K tokens used)
↓
Launches 3 sub-agents in parallel
↓
├─ Sub-agent 1: Fresh context (0K tokens used)
├─ Sub-agent 2: Fresh context (0K tokens used)
└─ Sub-agent 3: Fresh context (0K tokens used)
Benefits:
Context Isolation
Parallelization
Token Multiplication
User: "Build a full-stack Next.js app with auth, payments, and admin"
↓
Main conversation launches 3 sub-agents in parallel:
↓
├─ Sub-agent 1 (Frontend)
│ - Loads: nextjs, frontend skills
│ - Context: 12K tokens
│ - Implements: Auth UI, payment forms
│
├─ Sub-agent 2 (Backend)
│ - Loads: nodejs-backend, security skills
│ - Context: 15K tokens
│ - Implements: API routes, auth logic
│
└─ Sub-agent 3 (DevOps)
- Loads: devops, hetzner-provisioner skills
- Context: 8K tokens
- Implements: Deployment configs
↓
All 3 work in parallel with isolated contexts
↓
Results merged back to main conversation
↓
Total effective context: 35K tokens across 3 agents
(vs 175K+ if loaded all skills in main conversation)
Scenario: User asks about Next.js
Without progressive disclosure:
Load all 35 skills: ~175,000 tokens
Context bloat: Massive
With progressive disclosure:
Metadata (all skills): ~2,625 tokens
Load relevant (2 skills): ~9,000 tokens
Total: ~11,625 tokens
Reduction: ~93%
Scenario: Complex multi-domain task
Single agent approach:
Load all relevant skills: ~50,000 tokens
Main conversation history: ~80,000 tokens
Total context used: ~130,000 tokens
Risk: Approaching context limit
Sub-agent approach:
Main conversation: ~5,000 tokens (coordination only)
Sub-agent 1: ~15,000 tokens (isolated)
Sub-agent 2: ~18,000 tokens (isolated)
Sub-agent 3: ~12,000 tokens (isolated)
Total: ~50,000 tokens across 4 contexts
Reduction: ~62% (130K → 50K)
Note: Exact percentages vary by task complexity. These are approximate based on typical usage patterns.
SpecWeave organizes 35+ skills with clear, focused descriptions:
# Good: Focused description
---
name: nextjs
description: NextJS 14+ App Router specialist. Server Components, SSR, routing.
---
# Bad: Vague description
---
name: frontend
description: Does frontend stuff
---
Why this matters:
SpecWeave's role-orchestrator skill automatically:
Example:
User: "/sw:inc 'Full-stack SaaS with Stripe payments'"
↓
role-orchestrator activates
↓
Launches sub-agents in parallel:
├─ PM agent (requirements)
├─ Architect agent (system design)
├─ Security agent (threat model)
└─ DevOps agent (deployment)
↓
Each loads only relevant skills in isolated context
↓
Results merged into increment spec
Reality: No. SpecWeave uses Claude's native progressive disclosure. Skills load based on Claude's relevance detection, not custom YAML manifests.
Reality: No custom caching. Claude Code handles caching internally (if applicable). SpecWeave doesn't implement additional caching layers.
Reality: Token savings vary by task:
Reality: Progressive disclosure and sub-agents are automatic. You don't configure them. Claude handles skill loading, sub-agent context isolation happens automatically when agents are launched.
Do:
Don't:
When to use sub-agents:
When NOT to use sub-agents:
When Claude mentions using a skill:
User: "Create a Next.js page"
Claude: "🎨 Using nextjs skill..."
This means:
When Claude mentions launching agents:
Claude: "🤖 Launching 3 specialized agents in parallel..."
This means:
SpecWeave achieves context efficiency through:
Progressive Disclosure (Native Claude)
Sub-Agent Parallelization (Native Claude Code)
No custom manifests. No custom caching. Just smart use of Claude's native capabilities.