Generate, edit, and compose images using Google's Gemini 3 Pro Image model (Nano Banana Pro). Use this skill when the user asks to create images, generate visuals, edit photos, compose multiple images, create logos, thumbnails, infographics, product shots, or any image generation task. Supports text-to-image, image editing, multi-image composition (up to 14 images), iterative refinement, aspect ratio control, and Google Search-grounded image generation for real-time data visualization.
This skill inherits all available tools. When active, it can use any tool Claude has access to.
requirements.txtscripts/chat_image.pyscripts/compose_images.pyscripts/edit_image.pyscripts/generate_image.pyscripts/search_grounded_image.pyImage generation skill powered by Google's Gemini 3 Pro Image model. Enables text-to-image generation, image editing, multi-image composition, and real-time data visualization.
GEMINI_API_KEY environment variable setgoogle-genai, PillowInstall dependencies:
pip install -r requirements.txt
python scripts/generate_image.py "A cat wearing a wizard hat" cat.png
python scripts/edit_image.py photo.png "Add a sunset to the background" edited.png
python scripts/compose_images.py "Create a group photo in an office" team.png person1.png person2.png
| Script | Purpose |
|---|---|
generate_image.py | Text-to-image generation |
edit_image.py | Edit/modify existing images |
compose_images.py | Combine up to 14 reference images |
chat_image.py | Interactive multi-turn refinement |
search_grounded_image.py | Generate images with real-time search data |
1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
1K (1024px), 2K, 4K
python scripts/generate_image.py "Futuristic motorcycle on Mars" mars.png --aspect 16:9 --size 4K
python scripts/generate_image.py "Clean black-and-white logo with text 'Daily Grind', sans-serif font, coffee bean icon, minimalist style" logo.png --aspect 1:1
python scripts/generate_image.py "Studio-lit product photo on polished concrete, 3-point softbox, 45-degree angle, professional e-commerce style" product.png --aspect 4:3 --size 4K
python scripts/generate_image.py "A photorealistic close-up portrait, shot on 85mm lens, golden hour lighting, shallow depth of field, cinematic" portrait.png --size 4K
python scripts/generate_image.py "A kawaii red panda sticker, bold outlines, cel-shading, white background, cute expression" sticker.png
Use the chat script for back-and-forth refinement:
python scripts/chat_image.py
Then interact:
> Create a logo for 'Acme Corp'
[Image generated]
> Make the text bolder and add a blue gradient
[Refined image]
> save acme_logo.png
Generate infographics with current data:
python scripts/search_grounded_image.py "Visualize today's weather in Tokyo as an infographic" tokyo_weather.png --aspect 9:16
Use cases:
Combine reference images:
python scripts/compose_images.py "Create a product comparison shot with these items side by side, professional lighting" comparison.png item1.png item2.png item3.png --aspect 16:9
Use cases:
For integration in larger scripts:
import os
from google import genai
from google.genai import types
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
response = client.models.generate_content(
model="gemini-3-pro-image-preview",
contents=["A serene mountain landscape at dawn"],
config=types.GenerateContentConfig(
response_modalities=['TEXT', 'IMAGE'],
image_config=types.ImageConfig(
aspect_ratio="16:9",
image_size="2K"
)
)
)
for part in response.parts:
if part.inline_data:
image = part.as_image()
image.save("landscape.png")
from PIL import Image
from google import genai
from google.genai import types
client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
img = Image.open("input.png")
response = client.models.generate_content(
model="gemini-3-pro-image-preview",
contents=["Add dramatic clouds to the sky", img],
config=types.GenerateContentConfig(
response_modalities=['TEXT', 'IMAGE']
)
)
Photorealistic: Include camera settings, lighting, lens details
"Shot on 85mm lens, golden hour lighting, shallow depth of field"
Logos: Specify style, colors, typography
"Clean minimalist logo, sans-serif font, monochrome, vector style"
Product shots: Describe studio setup
"Studio-lit, 3-point softbox, polished surface, 45-degree angle"
Stylized art: Name the style explicitly
"Anime style, cel-shading, bold outlines, vibrant colors"