From ai-brain-starter
Generates, edits, and composes images using Google's Gemini 3 Pro Image model. Supports text-to-image, multi-image composition, aspect ratio control, and search-grounded generation for data visuals.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-brain-starter:nano-bananaThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Image generation skill powered by Google's Gemini 3 Pro Image model. Enables text-to-image generation, image editing, multi-image composition, and real-time data visualization.
Image generation skill powered by Google's Gemini 3 Pro Image model. Enables text-to-image generation, image editing, multi-image composition, and real-time data visualization.
GEMINI_API_KEY environment variable setgoogle-genai, PillowInstall dependencies:
pip install -r requirements.txt
python scripts/generate_image.py "A cat wearing a wizard hat" cat.png
python scripts/edit_image.py photo.png "Add a sunset to the background" edited.png
python scripts/compose_images.py "Create a group photo in an office" team.png person1.png person2.png
| Script | Purpose |
|---|---|
generate_image.py | Text-to-image generation |
edit_image.py | Edit/modify existing images |
compose_images.py | Combine up to 14 reference images |
chat_image.py | Interactive multi-turn refinement |
search_grounded_image.py | Generate images with real-time search data |
1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
1K (1024px), 2K, 4K
python scripts/generate_image.py "Futuristic motorcycle on Mars" mars.png --aspect 16:9 --size 4K
python scripts/generate_image.py "Clean black-and-white logo with text 'Daily Grind', sans-serif font, coffee bean icon, minimalist style" logo.png --aspect 1:1
python scripts/generate_image.py "Studio-lit product photo on polished concrete, 3-point softbox, 45-degree angle, professional e-commerce style" product.png --aspect 4:3 --size 4K
python scripts/generate_image.py "A photorealistic close-up portrait, shot on 85mm lens, golden hour lighting, shallow depth of field, cinematic" portrait.png --size 4K
python scripts/generate_image.py "A kawaii red panda sticker, bold outlines, cel-shading, white background, cute expression" sticker.png
Use the chat script for back-and-forth refinement:
python scripts/chat_image.py
Then interact:
> Create a logo for 'Acme Corp'
[Image generated]
> Make the text bolder and add a blue gradient
[Refined image]
> save acme_logo.png
Generate infographics with current data:
python scripts/search_grounded_image.py "Visualize today's weather in Tokyo as an infographic" tokyo_weather.png --aspect 9:16
Use cases:
Combine reference images:
python scripts/compose_images.py "Create a product comparison shot with these items side by side, professional lighting" comparison.png item1.png item2.png item3.png --aspect 16:9
Use cases:
For integration in larger scripts:
import os
from google import genai
from google.genai import types
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
response = client.models.generate_content(
model="gemini-3-pro-image-preview",
contents=["A serene mountain landscape at dawn"],
config=types.GenerateContentConfig(
response_modalities=['TEXT', 'IMAGE'],
image_config=types.ImageConfig(
aspect_ratio="16:9",
image_size="2K"
)
)
)
for part in response.parts:
if part.inline_data:
image = part.as_image()
image.save("landscape.png")
from PIL import Image
from google import genai
from google.genai import types
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
img = Image.open("input.png")
response = client.models.generate_content(
model="gemini-3-pro-image-preview",
contents=["Add dramatic clouds to the sky", img],
config=types.GenerateContentConfig(
response_modalities=['TEXT', 'IMAGE']
)
)
Photorealistic: Include camera settings, lighting, lens details
"Shot on 85mm lens, golden hour lighting, shallow depth of field"
Logos: Specify style, colors, typography
"Clean minimalist logo, sans-serif font, monochrome, vector style"
Product shots: Describe studio setup
"Studio-lit, 3-point softbox, polished surface, 45-degree angle"
Stylized art: Name the style explicitly
"Anime style, cel-shading, bold outlines, vibrant colors"
npx claudepluginhub mycelium-hq/ai-brain-starter --plugin ai-brain-starterGenerates images from text, edits existing images, applies style transfers, composes from multiple references, and supports multi-turn refinement using Google's Gemini API via Python scripts. For logos, stickers, mockups.
Generates and edits images using Google's Gemini Nano Banana Pro model (gemini-3-pro-image-preview). Requires GEMINI_API_KEY. Supports prompt-based image creation and editing user-provided images.
Generates AI images from text prompts, edits images, and composes from multiple references using Gemini models. Supports t2i, i2i, product mockups, and stickers.