By WatchAndyTW
Routes Claude Code tasks to local CLI backends (Gemini, OpenAI Codex, etc.) based on task size/type with fallback to native Claude. Includes a decompose-dispatch-verify-fix-synthesize pipeline, multi-model reasoning for consensus answers, and optional statusline HUD. Configuration via editable roster reduces costs and leverages specialized backends.
Set up multi-model-team — create your personal roster at ~/.claude/mmt-roster.json (seeded from the shipped default) so your tuning survives plugin updates, and print the next steps (roster precedence + statusline HUD).
Run a question through the multi-model Fusion pipeline — fan it out to a configurable panel of models in parallel (default panel is the user-configurable set from `reasoning.panel` in roster.json, defaulting to Opus+Sonnet+Gemini; override per-invocation with a spec like "2:gemini,opus,codex"), have a judge compare their answers into structured analysis (consensus / contradictions / unique insights / blind spots), then synthesize one unified answer that is better than any single model's.
Dry-run the multi-model-team router on a task and print the decision JSON + score breakdown. No model call — pure routing logic, for tuning.
Run a task through the multi-model team pipeline — decompose into backend-assigned subtasks (commodity → parallel agy/Gemini, judgment/hard-line → native Claude), dispatch dependency-aware, verify each result, fix failures in a bounded loop, then synthesize. Optional caps like "5:gemini,2:claude". Add --writable to give each agent its own git worktree and merge into an integration branch.
Dispatcher for the agy (Gemini) CLI backend — a configurable, equal tool, not a fixed task bucket. WHERE work routes is decided by config/roster.json (routes + tags.txt) and per-subtask /team assignments, not hardcoded here. By the shipped routing config, agy is the default lane for fast, commodity, easily-verifiable work and Gemini's edges: new UI/React components, CSS, SVG/animation, scaffolding, CRUD, REST endpoints, scripts, CLI tools, glue code, SQL, regex, configs, Dockerfiles, unit tests, fixtures, data transforms, web-search/doc summarization, bulk text, and audio/video (Gemini-exclusive). The shipped rules keep reverse-engineering, IL2CPP/protobuf-RE, disasm, FFI/unsafe, injection/hooking, concurrency, protocol/KCP design and other systems-hard work on native Opus — but that is roster policy you can retune, not a property of this agent. Spawning this agent is an explicit choice to run on agy: it relays the task to the CLI and returns the result verbatim; it does not re-route or refuse based on content.
Dispatcher for the codex (OpenAI Codex CLI) backend — a configurable, equal tool, not a fixed task bucket. WHERE work routes is decided by config/roster.json (routes + tags.txt) and per-subtask /team assignments, not hardcoded here. By the shipped routing config, codex is the default lane for code review, test-writing, and verification — review a diff/file for correctness, bugs, edge cases and regressions; write or extend unit/integration/e2e test suites; verify an implementation meets its spec — and it serves as the default /team verifier. It runs full-auto in its own git worktree under /team --writable, so it can also make real code changes there. The shipped rules keep reverse-engineering, IL2CPP/protobuf-RE, injection and other systems-hard work on native Opus, but that is roster policy you can retune, not a property of this agent. Spawning this agent is an explicit choice to run on codex: it relays the task to the CLI and returns the result verbatim; it does not re-route or refuse based on content.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Let Claude Code delegate the grunt work to Gemini & Codex — and keep the hard thinking for itself.
Multi-model orchestration for Claude Code. Route by task, fan out in parallel, fall back gracefully.
A Claude Code plugin that offloads token-heavy, self-contained tasks to local pre-authed CLI
backends — agy (Gemini) and codex (OpenAI Codex CLI) — picking the backend and model by
task size and type, with credit-exhaustion fallback through the chain to native Claude, and a
glanceable statusline HUD.
The core idea:
Offload commodity work (UI/components, scaffolding, CRUD, scripts, SQL, configs, unit tests, web research, bulk summarization) to a fast/cheap CLI — keep judgment-heavy and systems-hard work (reverse-engineering, FFI/unsafe, injection, concurrency, protocol design) on Claude. Every routing decision is config-driven; tune it without touching code.
agy, codex, and native Claude are equal, configurable tools. /team decomposes a task and
assigns each subtask to its best-fit backend; /reasoning fans one question across a panel of all
three and fuses the answers.
1 · Install the backends (one-time, pre-auth each)
npm install -g node-pty # the one native dep — gives agy a pseudo-terminal (see note below)
npm install -g @openai/codex # then: codex login
# Windows Powershell
irm https://antigravity.google/cli/install.ps1 | iex # then: agy login
# macOS / Linux
curl -fsSL https://antigravity.google/cli/install.sh | bash # then: agy login
2 · Add the plugin. This repo is the plugin — point Claude Code at it as a local plugin (local
marketplace or --plugin-dir). On enable, Claude Code auto-discovers commands/, agents/, and
hooks/hooks.json. Nothing else to wire up.
3 · (Optional) Turn on the HUD. Add a statusLine to your own ~/.claude/settings.json
(the plugin can't register one for you) — see Statusline HUD.
4 · Use it.
/reasoning 2:gemini,opus,codex What's the best caching strategy for a read-heavy API?
/team 3:gemini,1:codex Build a REST CRUD service with tests
/route-test Write a SQL query to list users by signup date ← dry-run, no call
…or just work normally and let Claude reach for the agy / codex agents on its own.
| Command | What it does |
|---|---|
/reasoning [panel] <question> | Fusion pipeline. Fan one question across a panel of models in parallel → a judge compares them (consensus / contradictions / unique insights / blind spots) → synthesize one unified answer better than any single model's. |
/team [--writable] [N:gemini,M:claude,X:codex] <task> | Team pipeline. Decompose → dispatch each subtask to its best-fit backend (dependency-aware waves) → verify each result → bounded fix loop → synthesize. Add --writable to let agents actually edit code in isolated git worktrees (see below). |
/route-test <task> | Dry-run the router: prints {backend, model, tier}, detected types, matched rule. No backend call — a tuning tool. |
Both /team and /reasoning have two engines: an Ultracode deterministic Workflow path
(preferred, when the Workflow tool is available) and a parallel Task-agent fallback. Either way the
work runs across parallel agents — never one inline session.
/team modes — read-only (default) vs --writable/team runs in one of two modes:
npx claudepluginhub watchandytw/multi-model-team --plugin multi-model-teamMulti-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Unified status line for Claude Code with multi-CLI (Claude, Codex, Gemini, z.ai) usage monitoring, context, rate limits, and cost tracking
Curate auto-memory, promote learnings to CLAUDE.md and rules, extract proven patterns into reusable skills.