By gitroomhq
Generate AI-powered UGC videos: photorealistic portraits, talking-head clips with lip-sync, product showcase videos, and TikTok/Hormozi-style captions, then post directly to social platforms via a single API key.
Generate a magazine-style character sheet from a portrait. Provide EITHER portrait_url (must be R2-hosted) OR portrait_image_base64 (PNG/JPEG, ≤10 MB; the API will upload it to R2 first). Optional ≤10-word description for name/age/vibe hints.
Bring your own audio: lip-sync a face (an R2-hosted image / character sheet, OR an existing clip) to a provided audio track. No text-to-speech or voice cloning — the character speaks your uploaded recording. Output is a 9:16 talking-head video.
Publish a generated agent-media video to the user's connected TikTok, Instagram, or X. Connect channels (OAuth) and post or schedule via the REST API. Use after producing a video with make_ugc_video / make_simple_selfie.
Playbook for orchestrating an end-to-end UGC video on the agent-media vNext runtime. Read this before deciding whether to call the one-shot make_ugc_video skill or to chain the four primitives (make_portrait → make_character_sheet → make_simple_selfie → make_subtitles) manually.
Up-to-30s vertical talking-head video: the actor speaks full-frame while a user-supplied b-roll video is overlaid on the lower half (face stays clear up top). REVIEW STYLE — put a line containing only `---` in the script to split an INTRO (actor speaks to camera, no b-roll yet — e.g. "Today we are going to review the game of X vs Y") from the MOVES narration after it; the b-roll appears when the moves begin and loop-fills through to the end while the actor narrates, and the moves take continues seamlessly from the intro's last frame and the same voice (guaranteed continuity, not a jump-cut). Without `---` it is a plain talking head (<=15s single take; 16-30s two takes) with the b-roll placed via broll_start_time. Provide actor_image_url (any https image) + broll_video_url (any https video — both re-hosted to R2 automatically) and EITHER script (Seedance voice) OR audio_url (your own audio, single clip <=15s). Optional: subtitles; broll_width_rate (0.1-1.0, e.g. 0.8 = b-roll 80% width centered with black margins; omit for full width); broll_start_time (override when the b-roll appears); broll_fade_out (dissolve the b-roll at its end).
Requires secrets
Needs API keys or credentials to function
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Agents: read this whole page. It is everything you need to create UGC videos with agent-media — no other docs required.
agent-media turns a short description (or a photo) + a script into a finished, captioned, lip-synced vertical UGC video. Works in Claude Code, Cursor, or any MCP / HTTP agent. One Bearer token authenticates everything.
npx skills add gitroomhq/agent-media — installs all of agent-media's skills into your agent (Claude Code, Cursor, etc.)./plugin marketplace add gitroomhq/agent-media then /plugin install agent-media@agent-media.npx -y -p @agentmedia/mcp-server@latest agent-media-mcp with env AGENT_MEDIA_API_KEY=ma_.... All skills self-describe via tools/list.Get a Bearer token: npm i -g agent-media-cli && agent-media login (stores it at ~/.agent-media/credentials.json), or grab the ma_* token from the dashboard. Every call uses Authorization: Bearer ma_.... You need credits on the account (buy at agent-media.ai).
make_ugc_video runs the whole pipeline — portrait → character sheet → lip-synced talking head → captions — in a single request.
curl -X POST https://api.agent-media.ai/v1/skills/make_ugc_video/run \
-H "Authorization: Bearer ma_..." -H "Content-Type: application/json" \
-d '{ "description": "a friendly 28-year-old woman, soft daylight",
"script": "Okay, this changed my whole morning routine — you have to try it.",
"duration": 10, "subtitles": true }'
# -> 202 { "skill_run_id": "..." } then poll:
curl https://api.agent-media.ai/v1/skills/runs/<skill_run_id> -H "Authorization: Bearer ma_..."
# when status == "succeeded", final_output.video_url is your MP4.
In Claude/Cursor you just say it in words: "Make a 10s UGC video of a friendly woman saying '…' with TikTok captions." — the agent picks the skill.
POST https://api.agent-media.ai/v1/skills/<slug>/run (Bearer auth, JSON body) → 202 with a run_id (or skill_run_id for make_ugc_video).GET /v1/skills/runs/<skill_run_id>; single primitive → GET /v1/primitives/runs/<run_id>. Output is final_output.video_url / artifacts[].url.GET https://api.agent-media.ai/v1/public/skills or MCP tools/list. Trust that over any hand-written list.Social media automation CLI for scheduling posts, managing integrations, uploading media, and tracking analytics across 28+ platforms including X, LinkedIn, Reddit, YouTube, TikTok, Instagram, and more
npx claudepluginhub gitroomhq/agent-media --plugin agent-mediaCreate HeyGen avatar videos, personalized video messages, and translated / dubbed videos. Build a persistent digital identity from a photo, generate presenter-led videos with your digital twin, and localize existing videos into 175+ languages with voice cloning and lip-sync.
AI video generation — describe what you want, Pexo picks the best model across 10+ engines (Seedance, Kling, Veo, Sora) and returns a finished, multi-shot video with music, subtitles, and transitions. Includes the Pexo agent plus image, audio, director, and model-prompting skills.
Generate professional Kling AI video prompts. Supports VIDEO 3.0, VIDEO 3.0 Omni, Avatar 2.0 talking avatars, Multi-Shot Storyboards, Element Reference, Voice Control, Motion Control.
Full video production pipeline for Remotion — gives Claude eyes (video analysis), voice (TTS/voiceover), ears (music/SFX), stock footage, AI image/video generation, TikTok captions, 3D content, and more. By Dojo Coding Labs.
Pika Creative Suite — give any AI agent a face, a voice, and a full creative studio. 9 curated /pika:* skills backed by 58 atomic media tools at mcp.pika.me.
Video generation at scale. Generate videos, images, and audio with Runway's API — batch ad campaigns, product videos, multishot stories, and creative iteration. Supports seedance2, gen4.5, veo3, Nano, Banana Pro, and more.