media-content-image-prompting - SKILL.md Agent Skill

name: media-content/image-prompting
description: >
  Craft optimized image generation prompts for photorealistic, editorial, cinematic, and commercial photography. Use when generating images, photos, portraits, product shots, mood boards, or any visual asset. Supports physical format specifications (banners, cans, posters, apparel) and context-aware camera/lens selection. Works with nano-banana-pro (Gemini 3 Pro) for execution.

Image Prompting

Core Workflow

Analyze intent — what, who, where, why, mood, format
Select prompt formula — photorealistic, character-focused, cinematic, enhanced lighting, etc.
Choose equipment — camera + lens matched to scenario (read references/visual-assets.md for full tables)
Layer detail progressively — subject → environment → lighting → technical
Apply physical format if user specifies dimensions (read references/dimensions-formats.md)
Execute via nano-banana-pro skill
Offer 2-3 alternative directions with different angles/styles

Prompt Structure

Order matters: image models tend to weight elements near the start of the prompt more heavily than those near the end.

[Subject + Action], [Environment/Setting], [Medium (camera/lens/film stock)],
[Depth of Field], [Composition], [Lighting/Time of Day], [Color/Mood], [Textures]

Element Priority

Priority	Element	Example
1	Subject + Action	`satisfied Brazilian woman sitting, exhaling smoke toward camera`
2	Environment	`cozy favela bedroom, house plants`
3	Medium	`editorial photography, Leica M10-R, 50mm f/2`
4	Depth of Field	`shallow depth of field`
5	Composition	`asymmetrical composition, leading lines`
6	Lighting	`cinematic lighting, golden hour, dramatic shadows`
7	Color/Mood	`muted 1990s color grading, warm tones`
8	Textures	`smoke blending with light rays, weathered skin`

Strategy: Place whatever needs emphasis FIRST.
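The ordering rule can be sketched as a small helper. This is a minimal illustration, not part of the skill's scripts: the element keys, the `build_prompt` name, and the `emphasize` parameter are all hypothetical, and the order simply mirrors the priority table.

```python
from typing import Dict, Optional

# Element order copied from the priority table; the key names are hypothetical.
PRIORITY = [
    "subject", "environment", "medium", "depth_of_field",
    "composition", "lighting", "color_mood", "textures",
]

def build_prompt(elements: Dict[str, str], emphasize: Optional[str] = None) -> str:
    """Join the supplied elements with commas, in priority order.

    If `emphasize` names a supplied element, that element is moved to the front.
    """
    unknown = set(elements) - set(PRIORITY)
    if unknown:
        raise KeyError(f"unknown elements: {sorted(unknown)}")
    order = [key for key in PRIORITY if key in elements]
    if emphasize in order:
        order.remove(emphasize)
        order.insert(0, emphasize)
    return ", ".join(elements[key] for key in order)

# Elements may be supplied in any order; the helper sorts them.
prompt = build_prompt({
    "lighting": "cinematic lighting, golden hour, dramatic shadows",
    "subject": "satisfied Brazilian woman sitting, exhaling smoke toward camera",
    "environment": "cozy favela bedroom, house plants",
    "medium": "editorial photography, Leica M10-R, 50mm f/2",
})
print(prompt)
```

Passing `emphasize="lighting"` would move the lighting keywords to the front, which is one way to apply the "emphasis first" strategy without rewriting the rest of the prompt.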

Prompt Length

Length	Effect	Best For
Short (10-30 words)	Max creative liberty, varied results	Exploration
Medium (30-80 words)	Sweet spot — precision + AI creativity	Production
Long (80-150 words)	Maximum control, but later elements may be dropped	Complex scenes
Too Long (150+)	Model ignores later elements	Avoid

Use strong keywords, not full sentences. Separate with commas. One subject per prompt.

Power words: prefer the stronger term, e.g. luminous over bright, jubilant over happy, enormous over big, ethereal over dreamy.
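A rough way to check which length band a prompt falls into is to count whitespace-separated words. The sketch below is an assumption-laden illustration: the table's bands overlap at 30, 80, and 150 words, so each boundary value is assigned to the higher band here, and anything under 30 words (including prompts shorter than the table's 10-word minimum) is reported as short. The function name is hypothetical.

```python
def classify_length(prompt: str) -> str:
    """Map a prompt to a length band by whitespace-separated word count.

    Boundary values (30, 80, 150) are assigned to the higher band; that
    tie-break is an assumption, since the table's ranges overlap.
    """
    words = len(prompt.split())
    if words < 30:
        return "short"
    if words < 80:
        return "medium"
    if words < 150:
        return "long"
    return "too long"

print(classify_length("cozy favela bedroom, house plants"))  # 5 words, so "short"
```

A model's tokenizer does not count words the way `str.split` does, so treat the result as a quick sanity check rather than a hard limit.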

Prompt Formulas

Photorealistic (General)

[Shot Type], [Photo Style], [Subject + Action], [Environment],
[Camera/Lens/Film Stock], [Depth of Field], [Color Scheme],
[Textures], [Composition], [Lighting]

Character-Focused

[Shot Type], [Subject + Action + Environment], [Expression],
[Wardrobe], [Body Position], [Unique Features], [Props],
[Camera/Film Stock]

Cinematic Still

Cinematic still, [Subject + Action], [Environment],
[Camera/Film Stock], [Cinematic Lighting], [Details/Modifiers]

Enhanced Lighting

[Lighting Conditions], [Subject + Action], [Environment],
[Camera/Film Stock], [Composition], [Details]

Enhanced Motion

[Motion Type], [Subject + Action], [Environment],
[Camera/Film Stock], [Composition], [Lighting], [Details]

Super Control (Maximum Detail)

[Shot Type], [Photo Type], [Subject + Action], [Subject Position],
[Wardrobe], [Props], [Subject Focus], [Environment],
[Environment Focus], [Environment Details], [Colors & Tone],
[Lighting], [Time of Day], [Textures], [Camera/Technical]
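The formulas above are slot templates, so they can be filled mechanically. The sketch below encodes two of them and reports which slots were left empty; the `FORMULAS` and `PREFIXES` tables, the `fill_formula` name, and the sample values are hypothetical and only illustrate the idea.

```python
from typing import Dict, List, Tuple

# Slot names copied from two of the formulas; the dictionary layout is hypothetical.
FORMULAS: Dict[str, List[str]] = {
    "cinematic_still": [
        "Subject + Action", "Environment", "Camera/Film Stock",
        "Cinematic Lighting", "Details/Modifiers",
    ],
    "enhanced_lighting": [
        "Lighting Conditions", "Subject + Action", "Environment",
        "Camera/Film Stock", "Composition", "Details",
    ],
}
# Literal text that a formula places before its first slot.
PREFIXES = {"cinematic_still": "Cinematic still"}

def fill_formula(name: str, values: Dict[str, str]) -> Tuple[str, List[str]]:
    """Return (prompt, missing_slots) for the named formula."""
    slots = FORMULAS[name]
    parts = [PREFIXES[name]] if name in PREFIXES else []
    parts += [values[slot] for slot in slots if slot in values]
    missing = [slot for slot in slots if slot not in values]
    return ", ".join(parts), missing

prompt, missing = fill_formula("cinematic_still", {
    "Subject + Action": "detective lighting a cigarette",  # illustrative value
    "Environment": "rain-soaked alley",                    # illustrative value
})
print(prompt)
print("still to fill:", missing)
```

Returning the missing slots rather than raising an error keeps short exploratory prompts possible while still showing what a production prompt would need.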

Progressive Detail Layering

Start simple, add incrementally. Each layer gives more control:

Subject: satisfied Brazilian woman sitting, looking at camera
+ Framing: center of image, medium shot, editorial
+ Focus: sharp focus, mid-ground, slightly blurred background
+ Wardrobe: wearing muted blue oversized t-shirt, headband, red lipstick
+ Environment: cozy favela bedroom, house plants, rundown and liberated
+ Lighting: low light, light from window, dramatic shadows, early morning
+ Aesthetic: smoke blending with light rays, muted 1990s color grading
+ Technical: Bergger Pancro 400 film
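The layering above can be reproduced as a running accumulation, which yields one prompt per layer so each step can be generated and compared. This is a minimal sketch; `layered_prompts` is a hypothetical helper and only the first four layers are included for brevity.

```python
from itertools import accumulate
from typing import List, Tuple

# Layer labels and keywords taken from the example above (first four layers).
LAYERS: List[Tuple[str, str]] = [
    ("Subject", "satisfied Brazilian woman sitting, looking at camera"),
    ("Framing", "center of image, medium shot, editorial"),
    ("Focus", "sharp focus, mid-ground, slightly blurred background"),
    ("Wardrobe", "wearing muted blue oversized t-shirt, headband, red lipstick"),
]

def layered_prompts(layers: List[Tuple[str, str]]) -> List[str]:
    """Return one prompt per layer, each extending the previous one."""
    texts = [text for _, text in layers]
    return list(accumulate(texts, lambda so_far, nxt: f"{so_far}, {nxt}"))

for (label, _), prompt in zip(LAYERS, layered_prompts(LAYERS)):
    print(f"up to {label}: {prompt}")
```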

Context-Aware Equipment

Match camera/lens to scenario:

Scenario	Camera	Lens	Why
Portraits	Leica M10-R	50mm f/2 ASPH	Sharp, dreamy rendering
Wildlife	Sony A9 II	200mm telephoto	Fast AF, compressed BG
Street	Fujifilm X-T4	35mm	Candid, organic feel
Landscape	Hasselblad H6D	Wide angle	Ultra-resolution
Action/Extreme	GoPro HERO	Ultra-wide	Immersive POV
Vintage/Art	Polaroid SX-70	Fixed	Nostalgic, soft-toned
Fine Art	Fujifilm GFX 100	110mm f/2	Medium format detail
Cinematic	ARRI Alexa Mini	70mm	Film-standard color

For full camera, lens, film stock, lighting, and composition tables: read references/visual-assets.md.
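One way to apply the table is a plain lookup that appends the camera and lens to an existing prompt. The sketch below covers only four of the rows, and the `EQUIPMENT` mapping and `with_equipment` name are hypothetical; the full tables live in references/visual-assets.md.

```python
from typing import Dict, Tuple

# A subset of the scenario table; keys are lowercase scenario names.
EQUIPMENT: Dict[str, Tuple[str, str]] = {
    "portraits": ("Leica M10-R", "50mm f/2 ASPH"),
    "wildlife": ("Sony A9 II", "200mm telephoto"),
    "street": ("Fujifilm X-T4", "35mm"),
    "cinematic": ("ARRI Alexa Mini", "70mm"),
}

def with_equipment(prompt: str, scenario: str) -> str:
    """Append the camera and lens matched to `scenario` to the prompt."""
    key = scenario.lower()
    if key not in EQUIPMENT:
        raise ValueError(f"unknown scenario {scenario!r}; known: {sorted(EQUIPMENT)}")
    camera, lens = EQUIPMENT[key]
    return f"{prompt}, {camera}, {lens}"

print(with_equipment("red fox leaping through snow", "Wildlife"))
```

Raising on an unknown scenario, rather than silently returning the prompt unchanged, is a design choice that keeps the "always specify camera" rule from being skipped by accident.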

Film Stock Quick Reference

Aesthetic	Film Stock
Natural/Balanced	Fujifilm Provia
Warm Portraits	Kodak Portra
High Contrast B&W	Kodak Tri-X 400
Cinematic Night	Cinestill 800T
Vivid Landscape	Fuji Velvia 50
Vintage	AgfaPhoto Vista
Surreal	LomoChrome Purple
Urban Gritty	Lomography LomoChrome Metropolis

Physical Dimensions

Gemini 3 Pro understands real-world format specs. State dimensions early:

Design a product label for a 2.75 inch aluminum energy drink can...
Create a horizontal vinyl banner, 6 feet wide by 2 feet tall...
Poster at 24x36 inches for a music festival...

Include material (vinyl, aluminum, canvas) and viewing distance if relevant.

For full format table and examples: read references/dimensions-formats.md.
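As a sketch of "state dimensions early", the helper below builds a format lead-in and places it in front of the creative description. The `PhysicalFormat` class, its fields, and the "viewed from" wording are assumptions made for illustration, not a format the model is known to require.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class PhysicalFormat:
    item: str                               # e.g. "banner"
    width: float
    height: float
    unit: str                               # e.g. "feet" or "inches"
    material: Optional[str] = None          # e.g. "vinyl"
    viewing_distance: Optional[str] = None  # free text, e.g. "10 feet"

    def lead(self) -> str:
        """Describe material, item, and size in one leading phrase."""
        material = f"{self.material} " if self.material else ""
        text = (f"{material}{self.item}, {self.width:g} {self.unit} wide "
                f"by {self.height:g} {self.unit} tall")
        if self.viewing_distance:
            text += f", viewed from {self.viewing_distance}"
        return text

def prompt_with_format(fmt: PhysicalFormat, description: str) -> str:
    """Put the physical format before the rest of the prompt."""
    return f"{fmt.lead()}, {description}"

banner = PhysicalFormat("banner", 6, 2, "feet", material="vinyl")
print(prompt_with_format(banner, "bold music festival artwork"))
```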

Digital Asset Dimensions (Web / Blog / Social)

Always match dimensions to the target surface. Mismatched aspect ratios (e.g. square images for blog posts) waste generation cycles and require manual re-generation.

Target	Dimensions	Aspect Ratio	When to use
Blog hero / OG image	1200x630	1.91:1	Blog cover images, OpenGraph, Twitter Cards
Blog inline graphic	1200x675	16:9	In-article images between sections
Instagram / Threads	1080x1080	1:1	Social feed posts, carousels
Instagram Story / TikTok	1080x1920	9:16	Vertical video, stories
X/Twitter image	1200x675	16:9	Tweet images
Instagram portrait	1080x1350	4:5	Max vertical feed real estate

Include pixel dimensions in every prompt. Do not default to square unless the target is explicitly a social feed post.
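The table can be kept as a lookup so that pixel dimensions are never forgotten. The sketch below also reduces each size to its simplest ratio as a sanity check against the table; the target keys and function names are hypothetical, and appending the dimensions at the end of the prompt is an assumption rather than a documented requirement.

```python
from math import gcd
from typing import Dict, Tuple

# Pixel sizes from the table above; the key names are hypothetical.
TARGETS: Dict[str, Tuple[int, int]] = {
    "blog_hero": (1200, 630),
    "blog_inline": (1200, 675),
    "instagram_feed": (1080, 1080),
    "instagram_story": (1080, 1920),
    "x_image": (1200, 675),
    "instagram_portrait": (1080, 1350),
}

def reduced_ratio(width: int, height: int) -> Tuple[int, int]:
    """Reduce width:height to lowest terms."""
    divisor = gcd(width, height)
    return width // divisor, height // divisor

def with_dimensions(prompt: str, target: str) -> str:
    """Append the target's pixel dimensions to the prompt."""
    width, height = TARGETS[target]
    return f"{prompt}, {width}x{height} pixels"

print(with_dimensions("flat-lay of coffee beans on slate", "blog_hero"))
print(reduced_ratio(*TARGETS["blog_inline"]))  # (16, 9)
```

Note that 1200x630 reduces to 40:21, which is roughly 1.905:1 and is conventionally rounded to the 1.91:1 shown in the table.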

Photography Elements That Create Realism

Naming a specific camera and lens in the prompt often pushes the output toward a photorealistic look:

❌ African wild dogs, wilderness, hunt → looks artificial
✅ African wild dogs, wilderness, intense hunt, Sony A9 II, 85mm f/1.4 GM lens → photorealistic

Execution

Generate images via nano-banana-pro:

uv run {baseDir}/scripts/generate_image.py --prompt "your prompt" --filename "output.png" --resolution 1K

Resolutions: 1K (default), 2K, 4K
Timestamps in filenames: yyyy-mm-dd-hh-mm-ss-name.png
Multi-image composition: add multiple -i flags
Do not read the image back; report the saved path only
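The options above can be assembled into an argument list before calling the script. This is a sketch under assumptions: it uses only the flags shown in this section (`--prompt`, `--filename`, `--resolution`, `-i`), the `build_command` helper and its parameters are hypothetical, and `base_dir` stands in for the `{baseDir}` placeholder. It builds the command without running it.

```python
from datetime import datetime
from typing import List, Optional, Sequence

RESOLUTIONS = ("1K", "2K", "4K")

def build_command(base_dir: str, prompt: str, name: str,
                  resolution: str = "1K",
                  inputs: Sequence[str] = (),
                  now: Optional[datetime] = None) -> List[str]:
    """Return the argv list for generate_image.py with a timestamped filename."""
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")
    # Filename pattern from this section: yyyy-mm-dd-hh-mm-ss-name.png
    stamp = (now or datetime.now()).strftime("%Y-%m-%d-%H-%M-%S")
    cmd = [
        "uv", "run", f"{base_dir}/scripts/generate_image.py",
        "--prompt", prompt,
        "--filename", f"{stamp}-{name}.png",
        "--resolution", resolution,
    ]
    for path in inputs:  # one -i flag per input image
        cmd += ["-i", path]
    return cmd

cmd = build_command("{baseDir}", "red fox leaping through snow", "fox", resolution="2K")
print(cmd)
```

The resulting list can be handed to `subprocess.run(cmd, check=True)`; passing a list rather than a shell string avoids quoting problems when the prompt contains commas or quotes.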

Image-to-Video Seeding

Generated images can seed video generation for maximum first-frame control:

# 1. Generate still image
uv run .../nano-banana-pro/scripts/generate_image.py --prompt "..." --filename "frame.png" --resolution 2K
# 2. Animate with Veo 3.1
uv run .../veo3-gen/scripts/generate_video.py --prompt "motion description" --filename "out.mp4" -i frame.png
# 3. Or animate with Sora 2
uv run .../sora2-gen/scripts/generate_video.py --prompt "motion description" --filename "out.mp4" -i frame.png

See media-content/video-prompting/SKILL.md § Image-to-Video Workflow for full details.

Key Rules

Specificity over generality — always specify camera, lighting, palette
One subject per prompt — multiple subjects confuse the model
Place emphasis first — whatever matters most goes at the beginning
Iterate, don't rewrite — change one element at a time
Always offer alternatives — provide 2-3 directions per request
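The "change one element at a time" rule can be sketched as a helper that produces variants differing from a base prompt in exactly one element, which also gives a simple way to offer alternative directions. The `single_change_variants` name, the element keys, and the sample lighting options are hypothetical.

```python
from typing import Dict, List

def single_change_variants(base: Dict[str, str], element: str,
                           options: List[str]) -> List[Dict[str, str]]:
    """Return copies of `base` that differ from it only in `element`."""
    if element not in base:
        raise KeyError(f"{element!r} is not part of the base prompt")
    # Skip any option identical to the base value so every variant is a real change.
    return [{**base, element: option} for option in options if option != base[element]]

base = {
    "subject": "African wild dogs, intense hunt",
    "environment": "wilderness",
    "lighting": "golden hour",
}
for variant in single_change_variants(base, "lighting", ["overcast", "blue hour", "golden hour"]):
    print(", ".join(variant.values()))
```

Because dictionaries preserve insertion order, each variant keeps the base prompt's element order, so the emphasis stays where it was placed.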