name: deep-research-orchestrator
Inherits all available tools
Additional assets for this skill
This skill inherits all available tools. When active, it can use any tool Claude has access to.
deep-research-orchestrator-process.dotname: deep-research-orchestrator description: Meta-orchestrator for complete Deep Research SOP lifecycle managing 3 phases, 9 pipelines (A-I), and 3 quality gates. Use when starting new research projects, conducting systematic ML research, or ensuring rigorous scientific methodology from literature review through production deployment. Coordinates all SOP skills and agents for end-to-end research execution. version: 1.0.0 category: research tags:
Master orchestration skill for the complete Deep Research Standard Operating Procedure (SOP), managing the entire research lifecycle from ideation through production deployment with rigorous quality gates.
Purpose: Orchestrate complete research lifecycle following Deep Research SOP methodology
When to Use:
Quality Gates: Manages ALL 3 quality gates (Data & Methods, Model & Evaluation, Production & Artifacts)
Prerequisites:
Outputs:
Time Estimate: 2-6 months (full research lifecycle)
Skills Orchestrated: baseline-replication, method-development, holistic-evaluation, literature-synthesis, reproducibility-audit, deployment-readiness, research-publication, gate-validation
Agents Used: ALL 4 P0 agents (data-steward, ethics-agent, archivist, evaluator) + system-architect, coder, tester, reviewer, researcher
# Initialize Deep Research SOP project
npx claude-flow@alpha hooks pre-task \
--description "Deep Research SOP: [Project Name]"
# Create project structure
mkdir -p deep-research-project/{phase1-foundations,phase2-development,phase3-production,gates,docs}
# Store research question in memory
npx claude-flow@alpha memory store \
--key "sop/project/research-question" \
--value "How does multi-scale attention improve long-range dependency modeling in vision transformers?"
# Literature synthesis
claude-code invoke-skill literature-synthesis
# Dataset validation
npx claude-flow@alpha sparc run data-steward "/init-datasheet"
# Baseline replication
claude-code invoke-skill baseline-replication
# Ethics review (initial)
npx claude-flow@alpha sparc run ethics-agent "/assess-risks --component dataset --gate 1"
# Gate 1 validation
claude-code invoke-skill gate-validation --gate 1
# Method development
claude-code invoke-skill method-development
# Holistic evaluation
claude-code invoke-skill holistic-evaluation
# Ethics review (model)
npx claude-flow@alpha sparc run ethics-agent "/assess-risks --component model --gate 2"
# Gate 2 validation
claude-code invoke-skill gate-validation --gate 2
# Reproducibility audit
claude-code invoke-skill reproducibility-audit
# Deployment readiness
claude-code invoke-skill deployment-readiness
# Archival
npx claude-flow@alpha sparc run archivist "/init-model-card"
# Gate 3 validation
claude-code invoke-skill gate-validation --gate 3
# Research publication
claude-code invoke-skill research-publication
Objective: Systematic literature review identifying SOTA methods, gaps, opportunities
Execution:
claude-code invoke-skill literature-synthesis \
--query "vision transformers attention mechanisms" \
--databases "arxiv,semantic-scholar,papers-with-code" \
--output phase1-foundations/literature/
Deliverables:
Agent: researcher
Objective: Dataset validation, bias audit, ethics clearance
Execution:
# Data steward: Create datasheet
npx claude-flow@alpha sparc run data-steward \
"/init-datasheet --dataset ImageNet --output phase1-foundations/datasheet.md"
# Data steward: Bias audit
npx claude-flow@alpha sparc run data-steward \
"Run bias audit on ImageNet dataset following Gebru et al. 2021"
# Ethics agent: Risk assessment
npx claude-flow@alpha sparc run ethics-agent \
"/assess-risks --component dataset --gate 1"
Deliverables:
Agents: data-steward, ethics-agent
Objective: PRISMA-compliant systematic literature review
Execution:
npx claude-flow@alpha sparc run researcher \
"/prisma-init --topic 'attention mechanisms in vision transformers'"
Deliverables:
Agent: researcher
Objective: Reproduce published baseline with ±1% tolerance
Execution:
claude-code invoke-skill baseline-replication \
--paper "Attention is All You Need" \
--dataset ImageNet \
--tolerance 0.01
Deliverables:
Agents: researcher, data-steward, coder, tester, archivist, evaluator
Objective: GO/NO-GO decision for method development
Execution:
claude-code invoke-skill gate-validation --gate 1
Gate 1 Requirements:
Decision:
Agent: evaluator
Objective: Develop novel method with ablation studies
Execution:
claude-code invoke-skill method-development \
--baseline-checkpoint phase1-foundations/baseline/checkpoint.pth \
--novel-components "multi-scale-attention,prenorm-residual"
Deliverables:
Agents: system-architect, coder, tester, reviewer
Objective: Comprehensive evaluation across 6+ dimensions
Execution:
claude-code invoke-skill holistic-evaluation \
--model phase2-development/novel-method/checkpoint.pth \
--dimensions "accuracy,fairness,robustness,efficiency,interpretability,safety"
Deliverables:
Agents: tester, ethics-agent
Objective: Ethics validation for model deployment
Execution:
npx claude-flow@alpha sparc run ethics-agent \
"/assess-risks --component model --gate 2"
npx claude-flow@alpha sparc run ethics-agent \
"/safety-eval --model phase2-development/novel-method/checkpoint.pth"
Deliverables:
Agent: ethics-agent
Objective: GO/NO-GO decision for production deployment
Execution:
claude-code invoke-skill gate-validation --gate 2
Gate 2 Requirements:
Decision:
Agent: evaluator
Objective: Create production-ready reproducibility package
Execution:
# Reproducibility audit
claude-code invoke-skill reproducibility-audit \
--package phase2-development/novel-method/
# Archival
npx claude-flow@alpha sparc run archivist \
"/init-model-card --method novel-method --include-metrics"
npx claude-flow@alpha sparc run archivist \
"Create reproducibility package with Docker, assign DOIs (Zenodo)"
Deliverables:
Agent: archivist
Objective: Production deployment validation
Execution:
claude-code invoke-skill deployment-readiness \
--model phase3-production/final-checkpoint.pth \
--environment production
Deliverables:
Agents: tester, archivist
Objective: Academic paper with reproducibility artifacts
Execution:
claude-code invoke-skill research-publication \
--results phase1-foundations/ phase2-development/ phase3-production/ \
--venue "NeurIPS" \
--artifact-track true
Deliverables:
Agents: researcher, archivist
Objective: Final GO/NO-GO for production deployment and publication
Execution:
claude-code invoke-skill gate-validation --gate 3
Gate 3 Requirements:
Decision:
Agent: evaluator
Phase 1: FOUNDATIONS (2-4 weeks)
├── Literature Synthesis (Pipeline A)
├── Data & Ethics Foundation (Pipeline B)
├── PRISMA Protocol (Pipeline C, optional)
├── Baseline Replication (Pipeline D)
└── Quality Gate 1 → GO/NO-GO
Phase 2: DEVELOPMENT (6-12 weeks)
├── Method Development (Pipeline D continued)
├── Holistic Evaluation (Pipeline E)
├── Ethics & Safety Review (Pipeline F)
└── Quality Gate 2 → GO/NO-GO
Phase 3: PRODUCTION (2-4 weeks)
├── Reproducibility & Archival (Pipeline G)
├── Deployment Readiness (Pipeline H)
├── Publication (Pipeline I)
└── Quality Gate 3 → GO/NO-GO → DEPLOY
| Phase | Pipeline | Lead Agent | Supporting Agents |
|---|---|---|---|
| 1 | A (Literature) | researcher | - |
| 1 | B (Data & Ethics) | data-steward | ethics-agent |
| 1 | C (PRISMA) | researcher | - |
| 1 | D (Baseline) | coder | researcher, tester, archivist |
| 1 | Gate 1 | evaluator | ALL agents review |
| 2 | D (Method Dev) | system-architect | coder, tester, reviewer |
| 2 | E (Holistic Eval) | tester | ethics-agent |
| 2 | F (Ethics) | ethics-agent | - |
| 2 | Gate 2 | evaluator | ethics-agent reviews |
| 3 | G (Archival) | archivist | - |
| 3 | H (Deployment) | tester | archivist |
| 3 | I (Publication) | researcher | archivist |
| 3 | Gate 3 | evaluator | archivist reviews |
All project state stored in Memory MCP for cross-session coordination:
# Store phase progress
npx claude-flow@alpha memory store \
--key "sop/project/phase1/status" \
--value "COMPLETE" \
--metadata '{"gate1": "APPROVED", "date": "2025-11-01"}'
# Retrieve previous work
npx claude-flow@alpha memory retrieve \
--key "sop/project/phase1/baseline-results"
# Agent coordination via memory
npx claude-flow@alpha memory store \
--key "sop/coordination/ethics-agent/status" \
--value "Awaiting Gate 2 validation" \
--metadata '{"blocking": ["evaluator"]}'
# data-steward stores bias audit results
npx claude-flow@alpha memory store \
--key "sop/gate1/bias-audit" \
--value "$(cat phase1-foundations/bias-audit.json)"
# ethics-agent retrieves for risk assessment
npx claude-flow@alpha memory retrieve \
--key "sop/gate1/bias-audit"
Symptoms: evaluator returns REJECTED status for Gate 1 Common Causes:
Solutions:
# Check Gate 1 requirements
claude-code invoke-skill gate-validation --gate 1 --verbose
# Re-run baseline replication with debugging
claude-code invoke-skill baseline-replication --debug
# Complete datasheet gaps
npx claude-flow@alpha sparc run data-steward \
"/init-datasheet --fill-missing-sections"
Symptoms: Novel method fails holistic evaluation or ethics review Solutions:
# Review holistic evaluation failures
claude-code invoke-skill holistic-evaluation --dimensions "fairness,safety" --verbose
# Address ethics concerns
npx claude-flow@alpha sparc run ethics-agent \
"/assess-risks --component model --gate 2 --mitigation-plan"
# Re-run method development with improvements
claude-code invoke-skill method-development --incorporate-feedback
Symptoms: Reproducibility package fails or deployment validation issues Solutions:
# Audit reproducibility package
claude-code invoke-skill reproducibility-audit --strict
# Fix deployment issues
claude-code invoke-skill deployment-readiness --fix-issues
# Complete model card
npx claude-flow@alpha sparc run archivist \
"/init-model-card --complete-missing-sections"
Symptoms: Cannot proceed to next phase due to pending validations Solutions:
# Check all gate requirements
npx claude-flow@alpha memory retrieve --key "sop/gates/status"
# Identify blocking agents
npx claude-flow@alpha memory retrieve --key "sop/coordination/*/status"
# Resolve blocking tasks
# (Address specific agent requirements)
This orchestrator implements the complete Deep Research SOP as specified in:
literature-synthesis - Systematic literature reviewbaseline-replication - Reproduce published baselinesmethod-development - Develop novel methodsholistic-evaluation - Comprehensive evaluationreproducibility-audit - Audit reproducibilitydeployment-readiness - Production deployment validationresearch-publication - Academic publicationgate-validation - Quality gate validation (all 3 gates)/init-datasheet - Create dataset documentation/prisma-init - Initialize systematic review/assess-risks - Ethics and safety assessment/init-model-card - Create model card/validate-gate-{1,2,3} - Gate validationWeek 1-2: Phase 1 Start
- Literature synthesis (50+ papers)
- Datasheet creation
- Bias audit
Week 3-4: Phase 1 Complete
- Baseline replication
- Ethics review (Gate 1)
- Gate 1 validation → APPROVED
Week 5-8: Phase 2 Development
- Novel method implementation
- Ablation studies
- Hyperparameter optimization
Week 9-12: Phase 2 Evaluation
- Holistic evaluation (6 dimensions)
- Ethics review (Gate 2)
- Gate 2 validation → APPROVED
Week 13-14: Phase 3 Archival
- Reproducibility package creation
- Model card, DOI assignment
- Registry publishing
Week 15-16: Phase 3 Deployment & Publication
- Deployment readiness validation
- Paper writing
- Gate 3 validation → APPROVED → DEPLOY
Total: 16 weeks (4 months) for complete research lifecycle
| Gate | APPROVED | CONDITIONAL | REJECTED |
|---|---|---|---|
| Gate 1 | All requirements met, proceed to Phase 2 | Minor datasheet gaps, proceed with restrictions | Baseline >±1% or critical ethics issues |
| Gate 2 | All requirements met, proceed to Phase 3 | Mitigation plan for fairness/robustness gaps | Performance regression or critical safety risks |
| Gate 3 | All requirements met, DEPLOY to production | Minor documentation fixes required | Reproducibility failures or ethics violations |