Drive a single AI-generated video project end-to-end: creative brief, model selection, character sheets, script (script-writer agent), storyboard, generation pipelines (text-to-image, image-to-video, voice, lip-sync), GPU-aware concatenation/normalisation/aspect conversion, ComfyUI import-export, budget estimation, and final export. State-aware lifecycle plugin designed to operate inside a per-project workspace. Ships with fal.ai, Replicate, and MiniMax MCP servers plus fal-js and WaveSpeed Python SDK runners, all wired to a persistent ~/.config/ai-video-producer/.env.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Propose an ffmpeg/edit plan to concatenate clips/edited/ into a rough cut in output/drafts/.
Add a recurring character or subject with appearance and voice notes. Saves to characters/.
Produce a script draft from the brief. Saves to scripts/drafts/.
Finalise the locked draft — encode to spec, embed subtitles, write to output/final/.
Scaffold the AI video production workspace (folder skeleton + project CLAUDE.md) into the current directory.
Use this agent to upscale and/or convert aspect ratio for video clips — e.g. taking a 720p 16:9 generation and producing a 1080p 9:16 vertical for Reels/Shorts/TikTok, or upscaling SD-era footage for inclusion alongside HD generations. Picks between resize-only, letterbox, blur-fill, smart-crop, and AI upscaling depending on the source-vs-target relationship and the user's quality bar. GPU-aware: uses NVENC/AMF/QSV when available, falls back to CPU. For AI upscaling defers to the existing `upscale-and-interpolate` skill or external tools (Topaz, Real-ESRGAN) and does not reinvent them. <example> user: "Convert all my 16:9 clips to 9:16 for Shorts, upscale to 1080×1920" assistant: "Launching the aspect-converter agent." </example> <example> user: "This 720p clip needs to sit in a 4K timeline — upscale it" assistant: "I'll use the aspect-converter agent." </example>
Use this agent when the user has produced video elements in parts (clips, segments, scenes, intro/outro, B-roll inserts) and is ready to join and render them into a single deliverable. Detects the host's GPU (NVIDIA / AMD / Intel / CPU-only) and picks the right ffmpeg encoder (NVENC / AMF / QSV / libx264 / libx265). Handles concat-demuxer for same-codec same-resolution sources and re-encode-and-concat for mixed sources. Respects the workspace's `clips/selected/` ordering and writes to `output/`. <example> user: "I'm done picking takes — concat them and render the rough cut" assistant: "Launching the concatenator agent to detect GPU and assemble." </example> <example> user: "Stitch the intro, the three scene clips, and the outro into one MP4" assistant: "I'll use the concatenator agent — it'll pick the right encoder for your GPU." </example>
Use this agent to normalise a set of video clips so they can be safely concatenated or compared — equalises loudness (EBU R128 / -16 LUFS by default), aligns sample rate / channel layout, conforms framerate and pixel format, and optionally applies basic colour-level normalisation (full↔limited range, BT.601↔BT.709). Run before `concatenator` when sources are mixed (different generation models, screen recordings, ElevenLabs voice tracks, etc.). Writes normalised copies to `clips/normalised/` — never overwrites originals. <example> user: "These clips were generated with different models and the audio levels are all over the place — fix them" assistant: "Launching the normalizer agent." </example>
Use this agent to turn an approved pipeline SPEC.md (produced by pipeline-scaffolder) into an executable pipeline — runnable scripts/configs that `/run-pipeline` can invoke. Generates per-stage runner files, parameter templates, an entry-point script, and a pipeline README. Wires up calls to fal/replicate (or other providers) using the project's MCP servers and respects the workspace's path conventions. Use after the user approves a SPEC.md. <example> user: "Build out the pipeline I just scaffolded — talking-head v1" assistant: "Launching the pipeline-builder agent to generate the runners from SPEC.md." </example>
Use this agent to set up a private GitHub repository where the user versions reusable AI video pipelines across projects. Creates the repo via `gh`, scaffolds an opinionated structure (one folder per pipeline, shared model/cost reference, CHANGELOG, README), seeds it from any pipelines already present in the current AI-Video-Producer workspace, and wires the workspace to consume pipelines from there. Use the first time the user wants pipelines to live outside a single video project, or when migrating an existing pipeline collection into version control. <example> user: "Set me up a private repo for my pipelines" assistant: "I'll launch the pipeline-repo-setup agent to create and seed it." </example>
Estimate API costs for a planned pipeline or for an entire script before any generation runs. Reads pipeline SPEC.md(s), the storyboard, and `brief/tools-and-models.md`, then projects cost per shot and total — with a low / typical / high range that accounts for take iteration. Writes a budget report to `budgets/estimate-<YYYY-MM-DD>.md`. Use before `/run-pipeline` on any non-trivial project.
Export an AI-Video-Producer pipeline to ComfyUI workflow JSON — produce a workflow the user can load in ComfyUI to reproduce (or approximate) a hosted-API pipeline locally. Best-effort: maps fal/replicate model calls to equivalent ComfyUI checkpoints/LoRAs/samplers and flags stages that have no native ComfyUI equivalent. Use when the user wants to take a pipeline that's been working via APIs and bring it onto local hardware (cost reduction, offline work, fine-grained control).
Import a ComfyUI workflow JSON (the API-format export from ComfyUI's "Save (API Format)" option, or the UI-format graph JSON) and turn it into an AI-Video-Producer pipeline SPEC plus stage runners. Maps Comfy nodes to pipeline stages, surfaces the model checkpoints and LoRAs needed, and flags nodes that have no clean equivalent in a hosted-API pipeline. Use when the user has a ComfyUI workflow that works locally and wants to either run it inside a project workspace or document it as a versioned pipeline.
Research what fal.ai and Replicate (and optionally other providers — Runway, Kling, Pika, Hedra, ElevenLabs, OpenAI) currently offer for a given workload — text-to-image, text-to-video, image-to-video, lip-sync, voice, upscaling, interpolation — and recommend the best-fit model for the project's brief. Compares quality reputation, price, max duration/resolution, aspect ratio support, and known failure modes. Updates `brief/tools-and-models.md` with the recommendation and rationale on user approval.
Intelligent, preference-driven model recommendation across fal.ai, Replicate, WaveSpeedAI, and MiniMax (Hailuo). Asks the user a short set of preference questions (workload, priority — quality vs speed vs cost, max budget per output, resolution/aspect/duration, NSFW tolerance, must-have features like lip-sync or audio), queries each available provider, and returns a ranked shortlist (3–5 options) with approximate per-output costs, quality/speed notes, and a recommendation. Differs from `model-researcher`: live-API backed, cost-first, and conversational about preferences.
External network access
Connects to servers outside your machine
Requires secrets
Needs API keys or credentials to function
Uses power tools
Uses Bash, Write, or Edit tools
A Claude Code plugin that drives the end-to-end lifecycle of a single AI-generated video project: creative brief → model selection → character sheets → script → storyboard → generation pipelines → clip assembly → post-production → final export.
This is a state-aware lifecycle plugin — it operates inside a per-project workspace it scaffolds itself. Distinct from utility-belt plugins that fire stateless one-off operations on media files.
/install-video-workspace — scaffold the per-project folder structure + project CLAUDE.md into the current directory./setup-credentials — one-time setup for fal.ai, Replicate, WaveSpeed, and MiniMax API keys, plus runner toolchain bootstrap (npm + pip). Writes ~/.config/ai-video-producer/.env and wires it into your shell profile so the bundled MCP servers always have keys./onboard — capture the creative brief and tools-and-models selection./define-character — add a recurring character/subject sheet./draft-script — produce a script draft from the brief./storyboard — break the locked script into a numbered shot list with per-shot briefs./run-pipeline <name> <shot#> — execute a predefined pipeline for a shot./log-take <path> — record a generated artefact's prompt, model, parameters, and cost./promote-take <path> — copy a raw take into clips/selected/ with rationale./assemble — propose an ffmpeg edit plan to concatenate clips into a rough cut./export-final — encode the locked draft to spec, embed subtitles, write to output/final/.Spawned via the Task tool when a job warrants a dedicated subagent.
script-writer — drafts and revises scripts from the brief; iterates with the user; promotes a chosen draft to scripts/final/script.md.pipeline-scaffolder — designs a new pipeline as a structured SPEC.md (planning only; no code).pipeline-builder — converts an approved SPEC into runnable per-stage runners.pipeline-repo-setup — provisions a private GitHub repo for versioning reusable pipelines across projects.concatenator — joins clip elements into a single render. GPU-aware (NVENC / AMF / VAAPI / QSV / CPU). Picks fast-path (concat-demuxer) vs safe-path (re-encode).normalizer — equalises loudness (EBU R128, two-pass), conforms framerate / pixel format / colour tag, writes to clips/normalised/.aspect-converter — aspect ratio conversion (16:9 ↔ 9:16, blur-fill / letterbox / smart-crop) and resolution upscaling. GPU-aware. Defers to AI-upscale skills when ≥2× uplift is needed.Pipeline recipes:
text-to-image-to-video — Flux/SDXL/Imagen → Kling/Runway/Hailuo.voice-to-lip-sync — voice generation → lip-sync.text-to-video-direct — single-step text-to-video.upscale-and-interpolate — resolution + frame-rate uplift.video-workspace-conventions — reference for the workspace layout.Production utilities:
script-to-tts — reformat a finished script for a specific TTS model (ElevenLabs, OpenAI, Google, Azure, Hume, Chatterbox), with model-appropriate SSML / pacing / chunking.model-researcher — research current fal/Replicate (and direct-provider) offerings for a workload; recommend a best-fit model with rationale; update brief/tools-and-models.md.budget-estimator — project API cost (low / typical / high) for a pipeline or whole script before any generation runs.comfyui-import — turn a ComfyUI workflow JSON into a pipeline SPEC + stage runners (with custom-node + checkpoint manifest).comfyui-export — turn a pipeline SPEC into a ComfyUI API-format workflow JSON for local execution; flag stages that have no clean local equivalent.The plugin ships with three MCP servers preconfigured (see .mcp.json):
https://mcp.fal.ai/mcp (HTTP transport, requires FAL_KEY). Catalogue browsing only; actual generation goes through runners/fal_run.mjs.npx -y replicate-mcp (requires REPLICATE_API_TOKEN). Used for both browsing and execution.uvx minimax-mcp (requires MINIMAX_API_KEY + MINIMAX_API_HOST). Provides generate_video (Hailuo), text_to_image, text_to_audio, voice_clone, voice_design, music_generation.Credentials live in ~/.config/ai-video-producer/.env. The /setup-credentials command creates that file and adds a loader line to your shell profile so every shell that launches Claude Code has the values in its environment. See .env.example for the full template.
For execution-heavy providers, the plugin ships SDK wrappers under runners/ instead of relying on MCP catalogue servers:
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub danielrosehill/claude-code-plugins --plugin ai-video-producerClaude Code plugin for image editing, batch operations, format conversion, and filesystem organisation of image libraries — bucket by resolution, aspect ratio, orientation, format, EXIF capture time, camera, and dedupe duplicates / near-duplicates.
First-pass data analysis toolkit: correlations, PII flagging, anomalies, hypothesis tests, data dictionaries, and trend analysis on a dataset in a folder.
Manage a Proxmox VE host via SSH and the Proxmox API — guided first-run onboarding, VM/CT lifecycle, storage and ZFS inspection, log review, and update workflows. Per-host config is stored outside the plugin so the same install works across multiple Proxmox environments.
Claude Code plugin: ideation and planning workflow — capture, evaluate, rank, simulate, and plan ideas, with ideation/single-idea-eval/multi-idea-ranking/feature-ideas/simulation/idea-capture variants.
Personal budgeting workflow primitives — transaction logging, categorization, budget creation, spending analysis, goal tracking, and forecasting — plus a provisioning skill for a fresh budgeting workspace.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.
Multi-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Write feature specs, plan roadmaps, and synthesize user research faster. Keep stakeholders updated and stay ahead of the competitive landscape.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Memory compression system for Claude Code - persist context across sessions