By verivus-oss
Route prompts to multiple LLMs (Claude, Codex, Gemini, Grok, Mistral) from a single interface, with session management, async jobs, and configurable approval strategies for parallel reviews, code review, design review, and security analysis.
Manage LLM gateway sessions (list, create, delete, switch)
Delegate a task to Gemini via the LLM gateway
Send a prompt to Claude Code via the LLM gateway with session tracking
Delegate a task to Codex via the LLM gateway
Pattern for spawning subagents that must get Codex approval before their work is accepted. Use when dispatching parallel agents that each need independent Codex review gates.
Submit work to Codex for review and iterate until unconditional approval. Use after completing implementation tasks, before merging, or when a development process requires Codex sign-off.
Structured design document review via LLM gateway — submit plans, specs, or designs for peer review from Codex, Gemini, Grok, or Mistral, iterate on feedback, track review rounds. Use before implementing complex features. Mistral Vibe defaults to `--agent auto-approve`.
Choose the right LLM and model for each task based on proven patterns. Use when deciding whether to delegate to Claude, Codex, Gemini, Grok, or Mistral, or when selecting model variants. Mistral Vibe selects model via `VIBE_ACTIVE_MODEL` env var (no `--model` flag).
Run a task through multiple LLMs (Claude, Codex, Gemini, Grok, Mistral) independently and require agreement before proceeding. Use for high-stakes generation, conflict resolution, or final quality gates requiring unanimous approval.
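The unanimous-approval gate described in the last entry can be sketched in a few lines. This is only an illustration of the idea, not the gateway's implementation: the `Reviewer` callable, the `APPROVE` verdict string, and the stand-in reviewers are all assumptions made for the example. A real setup would call the gateway's tools in place of the lambdas.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical reviewer signature: takes the task text, returns a verdict string.
Reviewer = Callable[[str], str]


@dataclass
class ConsensusResult:
    approved: bool
    verdicts: Dict[str, str]
    dissenters: List[str]


def unanimous_gate(task: str, reviewers: Dict[str, Reviewer]) -> ConsensusResult:
    """Ask every reviewer independently; approve only if all answer APPROVE."""
    if not reviewers:
        raise ValueError("at least one reviewer is required")
    # Normalize verdicts so "approve" and "APPROVE" are treated alike.
    verdicts = {name: review(task).strip().upper() for name, review in reviewers.items()}
    dissenters = sorted(name for name, verdict in verdicts.items() if verdict != "APPROVE")
    return ConsensusResult(approved=not dissenters, verdicts=verdicts, dissenters=dissenters)


# Stand-in reviewers (assumed for illustration only).
reviewers = {
    "claude": lambda task: "approve",
    "codex": lambda task: "APPROVE",
    "gemini": lambda task: "revise: missing tests",
}

result = unanimous_gate("Refactor the session store", reviewers)
print(result.approved, result.dissenters)  # False ['gemini']
```

Returning the dissenters alongside the boolean makes it easy to feed their feedback into another review round instead of simply failing.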
Admin access level
Server config contains admin-level keywords
"Without consultation, plans are frustrated, but with many counselors they succeed." — Proverbs 15:22 (LSB)
A Model Context Protocol (MCP) gateway for running Claude Code, Codex, Gemini, Grok, and Mistral (Vibe) CLIs from one MCP endpoint, with durable async jobs, session continuity, cache-aware prompting, observability, and personal-appliance setup tooling.
Why developers try it: one local MCP endpoint for cross-LLM validation, multi-agent coding workflows, and repeatable assistant-led setup across five provider CLIs.
Current signals: CI and security workflows pass on main, OpenSSF Scorecard is published, OpenSSF Best Practices is passing, releases use Sigstore signing, and the package is MIT licensed.
npm install -g llm-cli-gateway
Or use directly with npx from an MCP client:
    {
      "mcpServers": {
        "llm-gateway": {
          "command": "npx",
          "args": ["-y", "llm-cli-gateway"]
        }
      }
    }
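If an MCP client already has other servers configured, the entry above has to be merged rather than pasted over the existing file. A minimal sketch of that merge follows; the `add_gateway` helper and the `other-server` entry are hypothetical, and where the configuration file lives depends on the client, so file I/O is left out.

```python
import json
from copy import deepcopy

# The gateway entry exactly as shown in the configuration above.
GATEWAY_ENTRY = {"command": "npx", "args": ["-y", "llm-cli-gateway"]}


def add_gateway(config: dict, name: str = "llm-gateway") -> dict:
    """Return a copy of config with the gateway registered under mcpServers."""
    merged = deepcopy(config)
    # Create the mcpServers table if the client config does not have one yet.
    merged.setdefault("mcpServers", {})[name] = deepcopy(GATEWAY_ENTRY)
    return merged


# Hypothetical existing client configuration with one unrelated server.
existing = {"mcpServers": {"other-server": {"command": "node", "args": ["server.js"]}}}

updated = add_gateway(existing)
print(json.dumps(updated, indent=2))
```

Working on a copy keeps the original configuration intact until the merged result has been inspected.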
llm-cli-gateway is a single-user MCP gateway for cross-LLM validation and multi-agent coding workflows. It is more than a thin CLI wrapper:
- Cache-aware prompting through promptParts, including explicit Claude cache_control when opted in.
- Setup tooling: doctor --json, setup UI artifacts, provider setup snippets, Docker fallback, and checked release bundles.
- Workspaces: every HTTP/tunnel request must use a registered alias, session workspace, or [workspaces].default before provider execution. Local unrestricted filesystem access is the stdio transport.

The repo ships agent-ready workflow skills under .agents/skills for async orchestration, session continuity, multi-LLM review, implement-review-fix loops, and secure approval-gated dispatch. Machine-readable DAG-TOML plans live under docs/plans and setup/install-plan.dag.toml for workflows that need deterministic sequencing and verification gates.
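The workspace rule for HTTP/tunnel requests (registered alias, session workspace, or [workspaces].default) can be pictured as a small resolution function. The sketch below is an assumption-heavy illustration, not the gateway's code: the precedence order, the `resolve_workspace` name, the treatment of the default as an already-resolved path, and the example paths are all invented for the example.

```python
from typing import Dict, Optional


class WorkspaceError(Exception):
    """Raised when a request cannot be tied to an allowed workspace."""


def resolve_workspace(
    requested_alias: Optional[str],
    session_workspace: Optional[str],
    registered: Dict[str, str],
    default_workspace: Optional[str],
) -> str:
    """Pick a workspace path before any provider runs.

    Assumed precedence: explicit alias, then session workspace, then default.
    """
    if requested_alias is not None:
        # An explicit alias must be registered; unknown aliases are rejected.
        if requested_alias not in registered:
            raise WorkspaceError(f"unregistered workspace alias: {requested_alias}")
        return registered[requested_alias]
    if session_workspace is not None:
        return session_workspace
    if default_workspace is not None:
        return default_workspace
    raise WorkspaceError("no alias, session workspace, or default configured")


# Hypothetical alias table for illustration.
registered = {"api": "/srv/projects/api"}

print(resolve_workspace("api", None, registered, None))
print(resolve_workspace(None, "/srv/projects/web", registered, "/srv/projects/default"))
```

Failing closed when nothing matches mirrors the requirement that a workspace be settled before provider execution.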
The next documentation focus is provider-specific skill and DAG-TOML pairs for each outbound CLI: Claude, Codex, Gemini, Grok, and Mistral Vibe. The implementation plan is tracked in docs/plans/provider-workflow-assets.dag.toml, with each provider asset expected to cover install/login checks, session behavior, approval modes, cache/telemetry surfaces, failure modes, and a smoke-test gate.
npx claudepluginhub verivus-oss/llm-cli-gateway --plugin llm-gateway