gemini-image-generator

name: gemini-image-generator
description: Use when generating professional posed product images for e-commerce using Gemini AI with optimized prompts
persona: |
  name: "Ansel Adams"
  title: "Master of Visual Composition"
  expertise: ["lighting mastery", "golden hour timing", "composition mathematics", "technical precision"]
  philosophy: "You don't take a photograph, you make it."
  credentials:
    - "Developed the Zone System for precise exposure control"
    - "Yosemite National Park's most celebrated photographer"
    - "Presidential Medal of Freedom recipient from President Jimmy Carter"
    - "Co-founded Group f/64 defining purist photography"
  principles:
    - "Lighting is everything - approach light as a sculptor approaches clay"
    - "Composition is mathematics - apply the rule of thirds and golden ratio"
    - "Timing is non-negotiable - wait for the perfect golden hour"
    - "Technical precision first - master the ISO, aperture, and shutter speed trio"
    - "Every image tells a story - don't just document, narrate visually"
    - "Simplicity creates impact - remove distracting elements ruthlessly"
    - "Technical limitations define creativity - work within constraints to innovate"
domain: content

Gemini Image Generator Skill

Overview

Generate professional posed product images using Gemini AI with optimized prompts for e-commerce and content creation.

When to Use

When you need to create product images with model poses for e-commerce
When preparing TikTok/social media content featuring products
When you have product images and need them "modeled" without hiring photographers
When batch-generating multiple product variations

When NOT to Use

When you need photorealistic human faces (AI limitations)
When legal compliance requires real photography
When the product requires accurate color representation (AI may shift colors)
When generating trademarked brand items

Quick Reference

# Generate prompt
python prompt_optimizer.py --category fashion --product "white dress"

# Full workflow
python workflow_runner.py --pose pose.jpg --product dress.jpg --category fashion

Common Mistakes

Using low-resolution input images (results will be blurry)
Not specifying lighting setup in prompts (inconsistent results)
Skipping the ChatGPT video step (misses viral content optimization)
Ignoring TikTok AI disclosure requirements

Workflow

Input: Pose Model + Product Image
    ↓
Gemini Link (c7150b8213a4) → AI Image Generation
    ↓
Output: Multiple Posed Product Images
    ↓
ChatGPT → Video Scene Generation
    ↓
TikTok Content (compliant)

Quick Start

Get started with gemini-image-generator in three steps.

Install dependencies: pip install -r requirements.txt
Configure settings in config.yaml
Run: python main.py --mode gemini-image-generator

Verify setup:

python main.py --check-config
python main.py --run

1. Prepare Images

Pose Model: Reference pose image (person/model)
Product: Product image to integrate
Format: JPG/PNG, any size (9:16 aspect ratio recommended)
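
A quick pre-flight check can catch unsupported formats or off-ratio images before upload. The sketch below is illustrative only: the helper names `is_supported_format` and `crop_box_9_16` are hypothetical and not part of this skill's scripts, and treating `.jpeg` as a JPG alias is an assumption. It computes a centered crop box for the recommended 9:16 ratio using integer arithmetic.

```python
from pathlib import Path

# Extensions accepted by the "Format: JPG/PNG" rule (".jpeg" is an assumed alias).
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def is_supported_format(filename: str) -> bool:
    """Return True if the file extension is JPG or PNG (case-insensitive)."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def crop_box_9_16(width: int, height: int) -> tuple[int, int, int, int]:
    """Return a centered (left, top, right, bottom) box with a 9:16 ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    if width * 16 > height * 9:
        # Too wide: keep the full height and trim the sides equally.
        new_width = height * 9 // 16
        left = (width - new_width) // 2
        return (left, 0, left + new_width, height)
    # Too tall, or already 9:16: keep the full width and trim top and bottom.
    new_height = width * 16 // 9
    top = (height - new_height) // 2
    return (0, top, width, top + new_height)
```

The returned tuple uses the (left, upper, right, lower) order that image libraries such as Pillow accept for cropping, so it can be passed on unchanged if you crop locally before uploading.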

2. Generate Instruction

Use the prompt optimizer:

python prompt_optimizer.py --category fashion --style minimal --product "white dress"

3. Run in Gemini

Open: https://gemini.google.com/share/c7150b8213a4
Upload pose model image
Upload product image
Paste generated instruction
Generate images

4. Download Results

Gemini generates 4-8 variations
Download all to output folder
Next: Send to ChatGPT for video scenes
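
Before moving on to the video step, it can help to confirm that the download actually produced the expected number of files. The following is a minimal sketch, assuming the images were saved into a single local folder; `collect_variations` and `variation_count_ok` are hypothetical helper names, not functions shipped with this skill.

```python
from pathlib import Path

IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}
# Range taken from "Gemini generates 4-8 variations".
EXPECTED_MIN, EXPECTED_MAX = 4, 8


def collect_variations(output_dir: str) -> list[Path]:
    """Return the image files found directly in output_dir, sorted by name."""
    folder = Path(output_dir)
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )


def variation_count_ok(files: list[Path]) -> bool:
    """Return True when the file count falls inside the expected 4-8 range."""
    return EXPECTED_MIN <= len(files) <= EXPECTED_MAX
```

A count outside the range is only a hint that a download was missed or that the folder holds images from more than one run, so it is better treated as a warning than as a hard failure.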

Usage

Configure the gemini, categories, and defaults settings in config.yaml before first use
Review output quality and adjust parameters
Monitor performance metrics during execution
Document custom configurations for team reference
Schedule regular runs for consistent results

Command Line

# Generate instruction only
python prompt_optimizer.py --category fashion --product "summer dress"

# Full workflow (with browser automation)
python workflow_runner.py --pose pose.jpg --product dress.jpg --category fashion
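
When driving the full workflow from another script, for example in a batch over many products, passing an argument list avoids quoting problems with file names that contain spaces. This is a sketch under the assumption that `workflow_runner.py` sits in the current directory and accepts only the three flags shown above; `build_workflow_command` is a hypothetical helper.

```python
import sys


def build_workflow_command(pose: str, product: str, category: str) -> list[str]:
    """Build the argv list for workflow_runner.py from the documented flags."""
    return [
        sys.executable,  # run with the same interpreter as the caller
        "workflow_runner.py",
        "--pose", pose,
        "--product", product,
        "--category", category,
    ]
```

The list can be handed to `subprocess.run(cmd, check=True)`, which raises an error if the runner exits with a non-zero status.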

Python API

from prompt_optimizer import generate_instruction

instruction = generate_instruction(
    category="fashion",
    product_name="summer dress",
    style="minimal",
    lighting="soft"
)
print(instruction)

Output Format

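REFINER PRODUKSI ("production refiner") template. The Indonesian labels are kept exactly as written: Rasio is the aspect ratio, and Intruksi tambahan means additional instructions.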

REFINER PRODUKSI: [Environment] | Rasio: 9:16 | Lighting: [Setup] | Intruksi tambahan: [Detail]
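
To make the template concrete, here is one way the placeholders could be filled in. This is a minimal sketch, not the actual implementation inside `prompt_optimizer.py`; the function name `build_refiner_instruction` and its parameters are assumptions made for illustration.

```python
def build_refiner_instruction(
    environment: str,
    lighting: str,
    detail: str,
    ratio: str = "9:16",
) -> str:
    """Fill the REFINER PRODUKSI template with concrete values."""
    fields = (("environment", environment), ("lighting", lighting), ("detail", detail))
    for label, value in fields:
        # An empty placeholder would produce an instruction with a blank segment.
        if not value.strip():
            raise ValueError(f"{label} must not be empty")
    return (
        f"REFINER PRODUKSI: {environment} | Rasio: {ratio} | "
        f"Lighting: {lighting} | Intruksi tambahan: {detail}"
    )
```

For example, `build_refiner_instruction("clean white studio", "soft diffused", "model holds the dress")` yields a single line with the four segments separated by `|`, ready to paste into Gemini.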

Configuration

Edit config.yaml:

gemini:
  shared_link: "https://gemini.google.com/share/c7150b8213a4"
  output_ratio: "9:16"
  
categories:
  fashion:
    environments: ["clean white studio", "minimal gray", "premium lifestyle"]
    lighting: ["soft diffused", "studio lighting", "natural daylight"]
    
defaults:
  ratio: "9:16"
  style: "premium"
  lighting: "soft diffused"
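
The category lists lend themselves to batch generation: each environment can be paired with each lighting setup. The sketch below assumes the configuration has already been loaded into a dictionary (the `CONFIG` literal mirrors the YAML above, as a YAML loader would return it). The helper name `prompt_combinations` and the fallback to the default lighting are assumptions, not documented behavior of this skill.

```python
from itertools import product

# Mirrors the config.yaml example; in practice this would come from a YAML loader.
CONFIG = {
    "gemini": {
        "shared_link": "https://gemini.google.com/share/c7150b8213a4",
        "output_ratio": "9:16",
    },
    "categories": {
        "fashion": {
            "environments": ["clean white studio", "minimal gray", "premium lifestyle"],
            "lighting": ["soft diffused", "studio lighting", "natural daylight"],
        },
    },
    "defaults": {"ratio": "9:16", "style": "premium", "lighting": "soft diffused"},
}


def prompt_combinations(config: dict, category: str) -> list[tuple[str, str]]:
    """Return every (environment, lighting) pair configured for a category."""
    try:
        settings = config["categories"][category]
    except KeyError:
        raise ValueError(f"unknown category: {category!r}") from None
    # Assumed behavior: use the default lighting when a category lists none.
    lighting = settings.get("lighting") or [config["defaults"]["lighting"]]
    return list(product(settings["environments"], lighting))
```

With the example configuration, the fashion category expands to nine pairs (three environments times three lighting setups), each of which can feed one generated instruction.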

Next Step: ChatGPT Video Generation

Send Gemini output to ChatGPT with:

Generate video scene descriptions from these images. 
Style: TikTok viral, 15-30 seconds, hook in first 3 seconds.
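
If the prompt is assembled programmatically, the timing values can be kept as parameters. This is a sketch only: `build_video_prompt` is a hypothetical helper, and the numbered "Images:" listing it appends is an assumption added for illustration, not part of the original prompt.

```python
def build_video_prompt(
    image_names: list[str],
    min_seconds: int = 15,
    max_seconds: int = 30,
    hook_seconds: int = 3,
) -> str:
    """Build the ChatGPT prompt text for a set of generated images."""
    if not image_names:
        raise ValueError("at least one image is required")
    # Number the images so scene descriptions can refer back to them.
    listing = "\n".join(
        f"{index}. {name}" for index, name in enumerate(image_names, start=1)
    )
    return (
        "Generate video scene descriptions from these images.\n"
        f"Style: TikTok viral, {min_seconds}-{max_seconds} seconds, "
        f"hook in first {hook_seconds} seconds.\n"
        f"Images:\n{listing}"
    )
```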

TikTok Compliance

Disclosure: "AI-generated content"
Label videos appropriately
Follow community guidelines
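
A small guard can make sure the disclosure text is never dropped from a caption. The helper below is a hypothetical sketch, with `ensure_disclosure` as an assumed name. It only handles the caption text, so any labeling done through the platform itself still has to be applied separately.

```python
DISCLOSURE = "AI-generated content"


def ensure_disclosure(caption: str, disclosure: str = DISCLOSURE) -> str:
    """Append the disclosure text unless the caption already contains it."""
    # Case-insensitive check so "ai-generated content" also counts.
    if disclosure.lower() in caption.lower():
        return caption
    caption = caption.rstrip()
    return f"{caption}\n\n{disclosure}" if caption else disclosure
```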

Files

SKILL.md - This documentation
prompt_optimizer.py - Generate optimal prompts
browser_helper.py - Browser automation
workflow_runner.py - Full pipeline
config.yaml - Configuration
templates/ - Category templates

License

MIT - Free for personal and commercial use.

Common Rationalizations

Rationalization	Reality
"I'll do this later"	Deferred steps such as the AI disclosure label are easy to forget once content is posted
"This is simple, skip steps"	Even simple tasks benefit from process

Red Flags

Generated images have visible artifacts or distorted text
Agent does not verify images are appropriate for the intended context
Workflow steps are skipped or shortcut

Verification

After completing this skill, confirm:

Generated images are free of visible artifacts and distortions
Images are appropriate for the intended use context
All required outputs generated
Success criteria met