gemini-image-generator

star 3

Use when generating professional posed product images for e-commerce using Gemini AI with optimized prompts

oyi77 By oyi77 schedule Updated 6/8/2026

name: gemini-image-generator description: Use when generating professional posed product images for e-commerce using Gemini AI with optimized prompts persona: "|\n name: "Ansel Adams"\n title: "Master of Visual Composition"\n
\ expertise: ["lighting mastery", "golden hour timing", "composition mathematics"
, "technical precision"]\n philosophy: "You don't take a photograph, you make
\ it."\n credentials:\n - "Developed the Zone System for precise exposure
\ control"\n - "Yosemite National Park's most celebrated photographer"\n
\ - "National Medal of Arts recipient from President Jimmy Carter"\n
\ - "Co-founded Group f/64 defining purist photography"\n principles:\n
\ - "Lighting is everything - approach light as a sculptor approaches clay"
\n - "Composition is mathematics - apply the rule of thirds and golden ratio"
\n - "Timing is non-negotiable - wait for the perfect golden hour"\n
\ - "Technical precision first - master ISO, aperture, shutter speed trios"\n
\ - "Every image tells a story - don't just document, narrate visually"\n
\ - "Simplicity creates impact - remove distracting elements ruthlessly"\n
\ - "Technical limitations define creativity - work within constraints to
\ innovate"\n" domain: content

Gemini Image Generator Skill

Overview

Generate professional posed product images using Gemini AI with optimized prompts for e-commerce and content creation.

When to Use

  • When you need to create product images with model poses for e-commerce
  • When preparing TikTok/social media content featuring products
  • When you have product images and need them "modeled" without hiring photographers
  • When batch-generating multiple product variations

When NOT to Use

  • When you need photorealistic human faces (AI limitations)
  • When legal compliance requires real photography
  • When the product requires accurate color representation (AI may shift colors)
  • When generating trademarked brand items

Quick Reference

# Generate prompt
python prompt_optimizer.py --category fashion --product "white dress"

# Full workflow
python workflow_runner.py --pose pose.jpg --product dress.jpg --category fashion

Common Mistakes

  • Using low-resolution input images (results will be blurry)
  • Not specifying lighting setup in prompts (inconsistent results)
  • Skipping the ChatGPT video step (misses viral content optimization)
  • Ignoring TikTok AI disclosure requirements

Workflow

Input: Pose Model + Product Image
    ↓
Gemini Link (c7150b8213a4) → AI Image Generation
    ↓
Output: Multiple Posed Product Images
    ↓
ChatGPT → Video Scene Generation
    ↓
TikTok Content (compliant)

Quick Start

Get started with gemini-image-generator in three steps.

  1. Install dependencies: pip install -r requirements.txt
  2. Configure settings in config.yaml
  3. Run: python main.py --mode gemini-image-generator

Verify setup:

python main.py --check-config
python main.py --run

Getting Started

  1. Install dependencies: pip install -r requirements.txt
  2. Configure settings in config.yaml
  3. Run: python main.py --mode gemini-image-generator

First Run

# Verify setup
python main.py --check-config
# Execute
python main.py --run

1. Prepare Images

  • Pose Model: Reference pose image (person/model)
  • Product: Product image to integrate
  • Format: JPG/PNG, any size (Rasio 9:16 recommended)

2. Generate Instruction

Use the prompt optimizer:

python prompt_optimizer.py --category fashion --style minimal --product "white dress"

3. Run in Gemini

  1. Open: https://gemini.google.com/share/c7150b8213a4
  2. Upload pose model image
  3. Upload product image
  4. Paste generated instruction
  5. Generate images

4. Download Results

  • Gemini generates 4-8 variations
  • Download all to output folder
  • Next: Send to ChatGPT for video scenes

Usage

  • Configure commerce, gemini, generating, generator, image settings before first use
  • Review output quality and adjust parameters
  • Monitor performance metrics during execution
  • Document custom configurations for team reference
  • Schedule regular runs for consistent results

Command Line

# Generate instruction only
python prompt_optimizer.py --category fashion --product "summer dress"

# Full workflow (with browser automation)
python workflow_runner.py --pose pose.jpg --product dress.jpg --category fashion

Python API

from prompt_optimizer import generate_instruction

instruction = generate_instruction(
    category="fashion",
    product_name="summer dress",
    style="minimal",
    lighting="soft"
)
print(instruction)

Categories

Available templates:

  • fashion - Clothing, accessories
  • electronics - Gadgets, devices
  • food - Culinary, beverages
  • beauty - Cosmetics, skincare
  • home - Furniture, decor

Output Format

REFINER PRODUKSI Template:

REFINER PRODUKSI: [Environment] | Rasio: 9:16 | Lighting: [Setup] | Intruksi tambahan: [Detail]

Configuration

Edit config.yaml:

gemini:
  shared_link: "https://gemini.google.com/share/c7150b8213a4"
  output_ratio: "9:16"
  
categories:
  fashion:
    environments: ["clean white studio", "minimal gray", "premium lifestyle"]
    lighting: ["soft diffused", "studio lighting", "natural daylight"]
    
defaults:
  ratio: "9:16"
  style: "premium"
  lighting: "soft diffused"

Integration

  • Configure commerce, gemini, generating, generator, image settings before first use
  • Review output quality and adjust parameters
  • Monitor performance metrics during execution
  • Document custom configurations for team reference
  • Schedule regular runs for consistent results

Next Step: ChatGPT Video Generation

Send Gemini output to ChatGPT with:

Generate video scene descriptions from these images. 
Style: TikTok viral, 15-30 seconds, hook in first 3 seconds.

TikTok Compliance

  • Disclosure: "AI-generated content"
  • Label videos appropriately
  • Follow community guidelines

Files

  • SKILL.md - This documentation
  • prompt_optimizer.py - Generate optimal prompts
  • browser_helper.py - Browser automation
  • workflow_runner.py - Full pipeline
  • config.yaml - Configuration
  • templates/ - Category templates

License

MIT - Free for personal and commercial use.

Common Rationalizations

Rationalization Reality
"I'll do this later" Explain why this excuse is wrong for this skill
"This is simple, skip steps" Even simple tasks benefit from process

Red Flags

  • Generated images have visible artifacts or distorted text
  • Agent does not verify images are appropriate for the intended context
  • Watch for shortcuts and skipped steps

Verification

After completing this skill, confirm:

  • Generated images are free of visible artifacts and distortions
  • Images are appropriate for the intended use context
  • All required outputs generated
  • Success criteria met
Install via CLI
npx skills add https://github.com/oyi77/1ai-skills --skill gemini-image-generator
Repository Details
star Stars 3
call_split Forks 0
navigation Branch main
article Path SKILL.md
More from Creator