Skill

agent-codex-gate

Spawns subagents that must get Codex approval before their work is accepted. Use for parallel agents needing independent review gates.

automation

developer-tools

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/llm-gateway:agent-codex-gate

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Spawn subagents to do work, then each agent submits its work to Codex for review via the LLM gateway. Agents iterate on Codex feedback until they get unconditional approval. Work is not accepted until Codex approves.

SKILL.md

149 lines · ~2.2k tokens

Stats

LanguageTypeScript

Stars10

MaintenanceExcellent

Last CommitJun 21, 2026

Actions

View Source View Plugin View on GitHub View README

Agent + Codex Gate Pattern

Dispatch Defaults

Apply these on every dispatch unless the caller has explicitly overridden a rule in the current turn:

Omit model — let the gateway use its configured default per CLI. Nominating a model risks deprecated IDs (o3, o3-pro, gpt-4o, …) and capability mismatches.
approvalStrategy:"mcp_managed" is the skill dispatch default (the gateway schema default is "legacy"). For Codex, also pass fullAuto:true; this gives sandboxed autonomy while keeping the gateway approval gate in front of execution.
No wallclock timeout; poll every 60 s — idleTimeoutMs is a separate no-output safeguard.
Iterate until unconditional APPROVED (review dispatches only) — every review prompt must end with "End with APPROVED or NOT APPROVED with findings." Loop: dispatch → poll → parse verdict → on NOT APPROVED or conditional approval, dispatch fixes + re-review → repeat. Escalate after 3 rounds. This rule does not apply to pure implementation or non-review analysis dispatches.

When to Use

Dispatching multiple parallel agents to implement different tasks
Any workflow where "spawn agents, let them work, gate on Codex approval" is the pattern
When you want autonomous agents with quality gates
Implementation plans with independent tasks

Protocol

For the Orchestrator (You)

Spawn subagents for each independent task
Include these instructions in each agent's prompt:

After completing your implementation:
1. Build and test to verify your changes work
2. Submit your work for Codex review via the llm MCP gateway:
   codex_request({
     prompt: "Review [description of what was done] in [paths]. End with APPROVED or NOT APPROVED with findings.",
     fullAuto: true,
     approvalStrategy: "mcp_managed"
   })
3. If the response contains status:"deferred", poll llm_job_status every 60 seconds until completed, then fetch with llm_job_result
4. If NOT APPROVED or conditional: fix every issue Codex identified, then re-submit
5. Iterate until you get unconditional APPROVED from Codex (max 3 rounds, then escalate)
6. Report back with: what you did, Codex's final verdict, and the approval details

Review each agent's report — verify Codex approved, check the work makes sense
Only accept work that has Codex's unconditional approval

For Each Subagent

The subagent follows this loop:

implement → build → test → submit to Codex →
  if APPROVED (unconditional): done, report back
  if NOT APPROVED or conditional: fix issues → rebuild → retest → resubmit to Codex
  if deferred: poll every 60s → get result → parse verdict

Handling Deferred Reviews

Codex reviews often exceed 45s. Subagents must handle deferral:

// Submit review
result = codex_request({
  prompt: "Review... End with APPROVED or NOT APPROVED with findings.",
  fullAuto: true,
  approvalStrategy: "mcp_managed"
})

// Check if deferred
if result contains "status":"deferred":
  jobId = result.jobId
  // Poll every 60 seconds (no wallclock timeout; cancel only on explicit instruction or hard failure)
  loop:
    yield_until_next_poll(60 seconds)   // see "Wait mechanism" below
    status = llm_job_status({jobId})
    if status.job.status in ["completed", "failed", "canceled"]: break
  // Get the review
  review = llm_job_result({jobId})
  // Parse APPROVED or NOT APPROVED from review.result.stdout

Wait mechanism (orchestrator-specific)

yield_until_next_poll(60 seconds) above is an abstraction: yield control for ~60 s, then poll once. Standalone sleep 60 is blocked in some orchestrators (e.g. the Claude Code harness). Use:

Claude Code harness: Bash({command: "sleep 60 && echo done", run_in_background: true}) — returns a task ID, emits a completion notification after 60s. Monitor is for streaming progress, not one-shot waits. Do not chain short sleeps.
ScheduleWakeup (if available in your orchestrator): schedule a wakeup with delaySeconds: 60 and a prompt that resumes the polling loop.
Other orchestrators: use the native non-blocking wait primitive. Never a synchronous blocking sleep that freezes the agent loop.

Permissions — The Most Common Mistake

If Codex says "cannot verify" or shows bwrap sandbox errors, fullAuto: true was not passed. Without it, Codex cannot read files, run commands, or use MCP tools. Always include fullAuto: true and approvalStrategy: "mcp_managed" in every codex_request for reviews. The gateway's mcp_managed gate scores the request first; fullAuto:true gives Codex sandboxed file/shell access.

In the rare case Codex genuinely cannot access something (needs credentials it doesn't have), provide the evidence inline:

Paste build output, test results, or file contents
Re-submit with this evidence alongside fullAuto: true

Example: Parallel Implementation with Gates

// Orchestrator dispatches 3 agents in parallel:

Agent 1: "Implement Task A in src/feature-a.ts. [full task spec]
After completing, get Codex review. Iterate until unconditional approval."

Agent 2: "Implement Task B in src/feature-b.ts. [full task spec]
After completing, get Codex review. Iterate until unconditional approval."

Agent 3: "Implement Task C in src/feature-c.ts. [full task spec]
After completing, get Codex review. Iterate until unconditional approval."

// Each agent works independently, gets own Codex review
// Orchestrator collects results only after all three have Codex approval

Escalation

Agent can't get Codex approval after 3 rounds → escalate to orchestrator
Codex consistently unreachable → report the error, don't skip the gate
Codex findings are wrong → provide evidence and re-submit, don't ignore

Quality Checklist

Before accepting an agent's work:

Agent reports Codex gave unconditional APPROVED
Agent addressed all findings from earlier rounds (if any)
Build passes
Tests pass
Changes match the original task specification

Tips

Always include fullAuto: true and approvalStrategy: "mcp_managed" for Codex reviews
Omit model — let the gateway default apply
Use correlationId per agent per round: "agent1-review-r1", "agent1-review-r2"
For large tasks, expect 2-3 review rounds
Don't let agents skip the Codex gate because "it's a small change"
If an agent reports "Codex approved with residual notes" — that counts as approved if the notes are informational only
For round 2+, agents can pass resumeLatest:true to Codex to carry the prior review's context (or sessionId:<UUID> for a specific Codex session). Note: --full-auto is silently dropped on resume; the original session's approval policy is inherited. Gateway-generated gw-* IDs are rejected for Codex.
Deferred jobs are durable (default 30-day retention, LLM_GATEWAY_JOB_RETENTION_DAYS). If a subagent crashes between polls, it can re-issue the identical review call — auto-dedup snaps back onto the live Codex job. Or fetch by jobId after the fact. Use forceRefresh:true only when the underlying changes have shifted.
For high-stakes work, optionally add a Grok diversity gate alongside Codex: grok_request_async({prompt:"Independent review of agent's work in [paths]... End with APPROVED or NOT APPROVED with findings.",approvalStrategy:"mcp_managed",correlationId:"agent1-review-r1-grok"}) — accept only when both reviewers return APPROVED.
For maximum diversity, add a Mistral Vibe reviewer alongside Grok: mistral_request_async({prompt:"...End with APPROVED or NOT APPROVED with findings.",approvalStrategy:"mcp_managed",correlationId:"agent1-review-r1-mistral"}) — Vibe defaults to --agent auto-approve; pick permissionMode:"plan" if you want a stricter mode.
For agents that loop on the same review brief across multiple rounds, prefer the structured promptParts field over prompt: keep the review-criteria in system, the file paths under review in context, and let the round-specific question be the task. prompt and promptParts are mutually exclusive. Stable system/context across rounds keeps the prefix bytes identical, raising the provider's implicit cache hit rate across the gate loop. Confirm via cache-state://prefix/{hash} (tokens/hashes only, no prompt text).

agent-codex-gate

Popularity

Invocation

Context Preview

SKILL.md

agent-codex-gate

Popularity

Invocation

Context Preview

SKILL.md

Agent + Codex Gate Pattern

Dispatch Defaults

When to Use

Protocol

For the Orchestrator (You)

For Each Subagent

Handling Deferred Reviews

Wait mechanism (orchestrator-specific)

Permissions — The Most Common Mistake

Example: Parallel Implementation with Gates

Escalation

Quality Checklist

Tips

Similar Skills

Agent + Codex Gate Pattern

Dispatch Defaults

When to Use

Protocol

For the Orchestrator (You)

For Each Subagent

Handling Deferred Reviews

Wait mechanism (orchestrator-specific)

Permissions — The Most Common Mistake

Example: Parallel Implementation with Gates

Escalation

Quality Checklist

Tips

Similar Skills