image-generation - SKILL.md Agent Skill

name: image-generation description: Generate images using Google's Gemini image generation model. Use this skill when users request image creation, artwork generation, visual content creation, or when you need to generate images for any purpose including diagrams, illustrations, concept art, or visual explanations.

Image Generation Skill

Generate images using Google's Gemini Imagen model. This skill enables you to create images from text descriptions.

Prerequisites

Google AI API Key: Set the GOOGLE_API_KEY or GEMINI_API_KEY environment variable
Python Dependencies: The script will check and install required packages automatically

Quick Start

To generate an image, run:

python {baseDir}/scripts/generate_image.py --prompt "your image description" --output "output_path.png"

Available Models

List all available models:

python {baseDir}/scripts/generate_image.py --list-models

Available models:

Model	Description
`gemini-2.0-flash-exp-image-generation`	Fast experimental model for quick image generation
`gemini-2.5-flash-image-preview`	Flash preview model with good balance of speed and quality
`gemini-2.5-flash-image`	Stable flash model for image generation
`gemini-3-pro-image-preview`	Pro model with high quality output (default, recommended)
`imagen-4.0-generate-preview-06-06`	Imagen 4.0 preview model
`imagen-4.0-ultra-generate-preview-06-06`	Imagen 4.0 ultra preview for highest quality
`imagen-4.0-generate-001`	Imagen 4.0 stable model
`imagen-4.0-ultra-generate-001`	Imagen 4.0 ultra stable for highest quality
`imagen-4.0-fast-generate-001`	Imagen 4.0 fast model for quick generation

Command Options

Option	Required	Default	Description
`--prompt`	Yes	-	Text description of the image to generate
`--output`	No	`generated_image.png`	Output file path (PNG format)
`--model`	No	`gemini-3-pro-image-preview`	Model to use for generation
`--aspect-ratio`	No	`1:1`	Aspect ratio: `1:1`, `16:9`, `9:16`, `4:3`, `3:4`
`--negative-prompt`	No	-	What to avoid in the generated image
`--list-models`	No	-	List all available models and exit

Workflows

Basic Image Generation

Generate a simple image from a text prompt:

python {baseDir}/scripts/generate_image.py --prompt "A serene mountain landscape at sunset with a calm lake in the foreground"

Select a Specific Model

Choose a model based on your needs:

# Use the fast model for quick iterations
python {baseDir}/scripts/generate_image.py --prompt "A sunset" --model "imagen-4.0-fast-generate-001"

# Use the ultra model for highest quality
python {baseDir}/scripts/generate_image.py --prompt "A detailed portrait" --model "imagen-4.0-ultra-generate-001"

# Use the default pro model (recommended)
python {baseDir}/scripts/generate_image.py --prompt "A landscape" --model "gemini-3-pro-image-preview"

Custom Output Path

Save the image to a specific location:

python {baseDir}/scripts/generate_image.py --prompt "A futuristic city skyline" --output "/path/to/my_image.png"

Specific Aspect Ratio

Generate images with different aspect ratios for various use cases:

# Widescreen (16:9) - good for headers, banners
python {baseDir}/scripts/generate_image.py --prompt "Abstract geometric patterns" --aspect-ratio "16:9"

# Portrait (9:16) - good for mobile, stories
python {baseDir}/scripts/generate_image.py --prompt "A tall lighthouse at night" --aspect-ratio "9:16"

# Standard photo (4:3)
python {baseDir}/scripts/generate_image.py --prompt "A cozy coffee shop interior" --aspect-ratio "4:3"

Using Negative Prompts

Exclude unwanted elements from the generated image:

python {baseDir}/scripts/generate_image.py --prompt "A beautiful garden with flowers" --negative-prompt "people, animals, buildings"

Best Practices for Prompts

Be Specific and Descriptive

Bad: "a cat"
Good: "A fluffy orange tabby cat sitting on a windowsill, soft morning light, photorealistic style"

Include Style Information

Specify art style: "digital art", "oil painting", "watercolor", "photorealistic", "anime style"
Mention lighting: "soft diffused light", "dramatic shadows", "golden hour"
Add mood/atmosphere: "serene", "mysterious", "vibrant", "moody"

Composition Details

Describe viewpoint: "bird's eye view", "close-up", "wide angle"
Mention subjects and their arrangement
Include background details

Example Use Cases

Technical Diagrams

python {baseDir}/scripts/generate_image.py --prompt "A clean, minimal flowchart showing data processing pipeline, white background, professional technical illustration style" --aspect-ratio "16:9"

Concept Art

python {baseDir}/scripts/generate_image.py --prompt "Concept art of a steampunk airship, intricate brass machinery, billowing steam, cloudy sky background, detailed illustration"

Icons and Graphics

python {baseDir}/scripts/generate_image.py --prompt "A minimal flat design icon of a rocket ship, single color on transparent background, vector style" --aspect-ratio "1:1"

Photographs

python {baseDir}/scripts/generate_image.py --prompt "Professional product photography of a sleek smartphone on a marble surface, soft studio lighting, high-end commercial style"

Output

The script outputs:

The generated image saved to the specified path
A JSON response with generation details:
- success: Whether generation succeeded
- output_path: Absolute path to the saved image
- prompt: The prompt used
- model: The model used for generation
- aspect_ratio: The aspect ratio used
- message: Status message or error details

Error Handling

Common errors and solutions:

Error	Solution
"API key not found"	Set `GOOGLE_API_KEY` or `GEMINI_API_KEY` environment variable
"Unknown model"	Use `--list-models` to see available models
"Invalid aspect ratio"	Use one of: `1:1`, `16:9`, `9:16`, `4:3`, `3:4`
"Content blocked"	Modify prompt to avoid restricted content
"Rate limit exceeded"	Wait and retry, or check API quota
"Model not available"	Try a different model from the available list

Notes

Generated images are in PNG format
Default model is gemini-3-pro-image-preview (recommended for quality)
Use imagen-4.0-fast-generate-001 for quick iterations
Use imagen-4.0-ultra-generate-001 for highest quality output
For detailed API information, see the reference documentation in references/gemini-api.md