gflow-cli - SKILL.md Agent Skill

name: gflow-cli
version: "1.2"
skillopt_epoch: 0
description: Use when the user wants to drive Google Flow (Veo image-to-video, Veo text-to-video, Imagen / Nano Banana image generation) from the terminal or a script, including text-to-video, image-to-video, image-to-image, batch video pipelines, or burning Flow Ultra/Pro credits programmatically. The CLI is `gflow` (or `flow`); install with `uv tool install gflow-cli` or run ad-hoc with `uvx --from gflow-cli gflow ...`. Bypasses the web UI entirely after a one-time browser sign-in.
optimization_notes: |
  Known weak spots for the SkillOpt training loop (targets for epoch 1+):
  - Wrong subcommand: agents emit 'gflow video generate' / 'gflow video create' instead of 'gflow video t2v' or 'gflow video i2v'
  - Wrong output flag: '--output PATH' instead of '-o PATH'
  - Prerequisite gap: Playwright Chromium install step skipped on fresh machines
  - Profile parallelism anti-pattern: two generations launched on the same profile in parallel (crashes Chromium)
  - reCAPTCHA direction inverted: agents suggest GFLOW_CLI_HEADLESS=true to fix detection; correct fix is =false
  - UUID reuse: agents call 'gflow image upload' again for an already-uploaded UUID instead of passing it directly to --ref
  - Model alias confusion: '--model imagen' / '--model quality' instead of '--model image4' / '--model nano2'
  - Auth recovery: agents suggest 'gflow auth refresh' / 'gflow auth renew' which do not exist; correct command is 'gflow auth login'

gflow-cli skill

gflow-cli is an unofficial Python CLI that drives Google Flow — Veo (T2V/I2V) and Imagen / Nano Banana — from the terminal, bypassing the web UI. Source: https://github.com/ffroliva/gflow-cli. Canonical command reference: docs/USAGE.md.

When to invoke this skill

The user wants to:

Generate one or many Veo videos from text prompts (T2V) or from an initial frame plus a motion prompt (I2V)
Generate one or many Imagen / Nano Banana images from text (T2I) or from prompt + reference images (I2I)
Build a batch pipeline for video generations
Create a reusable, project-scoped Flow Character (a named subject with reference images, optional voice + personality) for consistent subjects across generations (gflow character)
Compose ordered clips into a scene and optionally render a credit-free server-side extended video (gflow scene)
Stitch a multi-clip story where each clip is seeded by the previous clip's last frame (gflow video chain)
Use their Google AI Ultra or Pro Flow credits via script instead of clicking through the UI
Automate Flow inside a content pipeline, AI video production stack, or research project

Do NOT use this skill when:

The user wants production-grade reliability with SLAs — recommend the official Gen AI SDK instead.
The user asks about audio, music, or anything outside Flow's video/image surface — wrong tool.

Prerequisites

Before any gflow-cli invocation, verify:

Python 3.11+ is available (python --version).
uv is installed (uv --version). If not, install: curl -LsSf https://astral.sh/uv/install.sh | sh (or Windows equivalent from https://docs.astral.sh/uv/).
gflow-cli is installed OR available via uvx:
- Quick: uvx --from gflow-cli gflow --help (no install)
- Persistent: uv tool install gflow-cli && gflow --help
Playwright Chromium has been downloaded once: uvx --from gflow-cli playwright install chromium (~150 MB).
A signed-in profile exists: gflow auth status should show Profile 'default' is configured. If not, run gflow auth login and walk the user through the one-time browser sign-in.
The user has Flow access: a Google AI Ultra or Pro subscription with Flow rolled out. A 403 from gflow image upload usually means this access is missing.
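
The first items of this checklist can be scripted as a preflight. The following is a minimal sketch, not part of gflow-cli: the names `REQUIRED_TOOLS` and `missing_tools` are hypothetical, and the check only confirms that the executables are on PATH. It does not verify the Chromium download, the sign-in, or Flow access, and a missing `gflow` is not fatal when the CLI is run through `uvx`.

```python
import shutil
from typing import Callable, Optional

# Executables named in the prerequisites list. On some systems the
# interpreter is called "python3" instead; adjust as needed.
REQUIRED_TOOLS = ("python", "uv", "gflow")


def missing_tools(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the required executables that cannot be found on PATH."""
    return [tool for tool in REQUIRED_TOOLS if which(tool) is None]
```

If `missing_tools()` returns an empty list, the next manual step is `gflow auth status`.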

Core commands

# Auth (one-time)
gflow auth login                                          # opens Chromium, user signs in
gflow auth status                                         # confirms session
gflow auth                                                # bare: list profiles or trigger first login
gflow auth logout                                         # delete a saved session

# Image generation (Imagen / Nano Banana)
gflow image upload <path>                                 # → asset UUID + dimensions
gflow image t2i "<prompt>" [--model {nano2|nano-pro|image4}] \
                            [--aspect {9:16|16:9|1:1|4:3|3:4}] \
                            [-n 1..4] [--seed N] [--out DIR]
gflow image i2i "<prompt>" --ref PATH_OR_UUID [--ref ...] [...same as t2i]

# Video generation (Veo 3.1)
gflow video t2v "<prompt>" [--out-dir DIR] [--aspect ...] [--seed N]
gflow video i2v --initial-frame <image> "<prompt>" [--out-dir DIR] [...same as t2v]
gflow video batch <manifest.tsv> [--out-dir DIR]
gflow video chain <manifest.jsonl> [--out-dir DIR] [--dry-run] \
                  [--max-links N] [--resume-from N]   # last-frame I2V chaining; veo models only

# Characters (reusable, project-scoped subjects)
gflow character create --project <id> --name "<name>" --face-prompt "<prompt>" \
                       [--body-prompt "<prompt>"] [--voice <id>] [--personality "<text>"] \
                       [--model {nano2|nano-pro}]
gflow character list --project <id>
gflow character show <character-id> --project <id>
gflow character rm --project <id> (--id <character-id> | --name "<name>") [--yes]   # delete (FREE)
gflow character voices                                    # list the Gemini voice catalog

# Scenes (Add Clip / compose ordered clips)
gflow scene create --project <id> <clip-id> [<clip-id> ...] \
                   [-o extended.mp4]                       # --output = credit-free server-side concat
gflow scene show <scene-id> --project <id>

Every subcommand accepts --profile <name> (per-subcommand, not global) to drive multiple Google accounts side-by-side.
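
When driving the CLI from a script, the synopsis above maps directly onto an argument list. As a minimal sketch (the helper `t2i_argv` is hypothetical and not part of gflow-cli), the following builds a `gflow image t2i` invocation using only the flags and choices listed above. It appends `--profile` after the subcommand because the flag is per-subcommand rather than global.

```python
from typing import Optional

# Choices copied from the command synopsis above.
ASPECTS = {"9:16", "16:9", "1:1", "4:3", "3:4"}
IMAGE_MODELS = {"nano2", "nano-pro", "image4"}


def t2i_argv(
    prompt: str,
    *,
    model: Optional[str] = None,
    aspect: Optional[str] = None,
    n: Optional[int] = None,
    seed: Optional[int] = None,
    out: Optional[str] = None,
    profile: Optional[str] = None,
) -> list[str]:
    """Build an argument list for `gflow image t2i`, rejecting unknown choices."""
    argv = ["gflow", "image", "t2i", prompt]
    if model is not None:
        if model not in IMAGE_MODELS:
            raise ValueError(f"unknown model: {model}")
        argv += ["--model", model]
    if aspect is not None:
        if aspect not in ASPECTS:
            raise ValueError(f"unknown aspect: {aspect}")
        argv += ["--aspect", aspect]
    if n is not None:
        if not 1 <= n <= 4:
            raise ValueError("n must be between 1 and 4")
        argv += ["-n", str(n)]
    if seed is not None:
        argv += ["--seed", str(seed)]
    if out is not None:
        argv += ["--out", out]
    if profile is not None:
        # Per-subcommand flag, so it goes after `image t2i`.
        argv += ["--profile", profile]
    return argv
```

The resulting list can be passed to `subprocess.run(argv, check=True)`. Building a list instead of a shell string avoids quoting problems in prompts.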

Recipes

Single image (most common)

gflow image t2i "a hot air balloon over Tokyo at sunrise" --aspect 16:9

Image fan-out (4 variants in parallel)

gflow image t2i "variations of a minimalist fox logo" -n 4 --aspect 1:1 --out ./logos/

Image-to-image with a local reference

gflow image i2i "make it cinematic, golden hour" --ref hero.png

Image-to-image with an already-uploaded asset UUID (no re-upload)

UUID=$(gflow image upload hero.png | awk '/Asset UUID:/ {print $3}')
gflow image i2i "stylize this asset" --ref "$UUID"
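
The same reuse pattern can be written in Python. This is a sketch under one assumption taken from the awk pattern above: the upload command prints a line of the form `Asset UUID: <uuid>`. The helpers `parse_asset_uuid` and `ref_for` are hypothetical names, and `upload` stands for any callable that runs `gflow image upload <path>` and returns its text output.

```python
import re
from typing import Callable

# Assumed output shape, mirroring the awk recipe: "Asset UUID: <uuid>".
_UUID_LINE = re.compile(r"Asset UUID:\s*(\S+)")


def parse_asset_uuid(upload_output: str) -> str:
    """Extract the asset UUID from the text printed by the upload command."""
    match = _UUID_LINE.search(upload_output)
    if match is None:
        raise ValueError("no 'Asset UUID:' line in upload output")
    return match.group(1)


def ref_for(path: str, cache: dict[str, str], upload: Callable[[str], str]) -> str:
    """Return the UUID for `path`, uploading only on the first request."""
    if path not in cache:
        cache[path] = parse_asset_uuid(upload(path))
    return cache[path]
```

Keeping the cache in a dict means each local file is uploaded at most once per run; later `--ref` arguments receive the stored UUID.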

Single clip from initial frame

gflow video i2v --initial-frame ./input.png "Slow cinematic push-in, soft golden light at sunset" --out-dir outputs

Batch from a directory of inputs (bash)

mkdir -p out
for img in ./inputs/*.png; do
  [ -e "$img" ] || continue            # skip when the glob matches nothing
  name=$(basename "$img" .png)
  echo "Generating clip for $name"
  gflow video i2v --initial-frame "$img" "Cinematic push-in" --out-dir out
done

Batch via a TSV manifest

# manifest.tsv columns: initial_frame \t prompt \t end_frame? \t aspect? \t output_path?
# Empty initial_frame = T2V; lines starting with `# ` are comments.
gflow video batch ./manifest.tsv --out-dir ./out/
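
A manifest in this column order can be generated from Python. The sketch below uses hypothetical helper names (`manifest_row`, `write_manifest`) and makes two assumptions that the source does not confirm: no header row is written, and unused optional columns are emitted as empty fields.

```python
from pathlib import Path


def manifest_row(
    prompt: str,
    initial_frame: str = "",
    end_frame: str = "",
    aspect: str = "",
    output_path: str = "",
) -> str:
    """Build one TSV line: initial_frame, prompt, end_frame, aspect, output_path."""
    fields = [initial_frame, prompt, end_frame, aspect, output_path]
    for field in fields:
        if "\t" in field or "\n" in field:
            raise ValueError("fields must not contain tabs or newlines")
    return "\t".join(fields)


def write_manifest(path: str, rows: list[str]) -> None:
    """Write the rows to `path`, one manifest line per row."""
    Path(path).write_text("\n".join(rows) + "\n", encoding="utf-8")
```

Leaving `initial_frame` empty produces a T2V row, matching the comment above.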

Create a reusable Character for consistent subjects

# A Character is a named, project-scoped subject reused across generations.
gflow character create --project "$PROJECT_ID" --name "Joaquim" \
  --face-prompt "weathered fisherman, grey beard, kind eyes" \
  --body-prompt "tall, broad-shouldered, wearing a navy wool sweater" \
  --voice <voice-id> --model nano2
gflow character voices            # discover valid --voice ids first
gflow character list --project "$PROJECT_ID"

See docs/CHARACTER.md for the full domain model, wire protocol, and the crash-recoverable persist-before-spend saga.

Compose clips into an extended video (credit-free)

# Concatenate ordered clips server-side via runVideoFxConcatenation — no local ffmpeg, no credits.
gflow scene create --project "$PROJECT_ID" "$CLIP_A" "$CLIP_B" -o extended.mp4

Chain clips by last frame (story stitching)

# manifest.jsonl: one JSON object per line. Link 0 = t2v; later links = i2v seeded by the
# previous clip's last frame. One credit per link. veo models only.
gflow video chain ./story.jsonl --out-dir ./out/ --dry-run   # preview the plan first
gflow video chain ./story.jsonl --out-dir ./out/             # then run for real
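
Because each link costs one credit, it can help to count links before running the chain. The following sketch (the function name `count_links` is hypothetical) only checks what the comment above states: one JSON object per line. It does not validate field names, which are not documented here, and it assumes blank lines are ignored.

```python
import json


def count_links(manifest_text: str) -> int:
    """Count the links in a JSONL chain manifest (one credit per link)."""
    links = 0
    for lineno, line in enumerate(manifest_text.splitlines(), start=1):
        if not line.strip():
            continue  # assumption: blank lines are not links
        obj = json.loads(line)  # raises json.JSONDecodeError on malformed JSON
        if not isinstance(obj, dict):
            raise ValueError(f"line {lineno}: expected a JSON object")
        links += 1
    return links
```

The returned count is the number of credits the chain would spend if every link runs.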

Use as a Python library

import asyncio
from pathlib import Path
from gflow_cli.api.client import FlowApiClient
from gflow_cli.paths import profile_dir

async def make_clip(image: Path, prompt: str, out: Path) -> None:
    async with FlowApiClient(profile_dir=profile_dir("default")) as client:
        project = await client.create_project(title="gflow-cli demo")
        asset = await client.upload_image(image, project.project_id)
        op = await client.generate_video(
            project_id=project.project_id,
            prompt=prompt,
            start_asset=asset,
            aspect="9:16",
        )
        # Poll op.workflow_id with client.poll_video_status(...), then save the
        # result to `out` with client.download_video(...) once the status reaches succeeded.

asyncio.run(make_clip(Path("in.png"), "Push-in", Path("out.mp4")))

Common errors and fixes

Error	Cause	Fix
`No session for profile 'default'`	First run, no auth	`gflow auth login`
`403 Forbidden` from upload / generate	Account doesn't have Flow access	Verify in labs.google/fx/tools/flow
reCAPTCHA refuses to mint a token (headless detected)	Google bot-detection	Set `GFLOW_CLI_HEADLESS=false` and re-run; the visible window passes detection
`Playwright Executable doesn't exist`	Chromium not downloaded	`uvx --from gflow-cli playwright install chromium`
Generations all fail with the same UUID	Stale Flow session	`gflow auth login` again to refresh cookies
Quota exceeded	Burned through monthly credits	Wait for reset, or upgrade subscription
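
A wrapper script can apply this table automatically. The sketch below is illustrative only: `KNOWN_FIXES` and `suggest_fix` are hypothetical names, and the substrings are taken from the Error column on the assumption that the real messages contain them.

```python
from typing import Optional

# (lower-cased substring of the error text, suggested fix) pairs from the table.
KNOWN_FIXES = [
    ("no session for profile", "gflow auth login"),
    ("403 forbidden", "verify Flow access at labs.google/fx/tools/flow"),
    ("recaptcha", "set GFLOW_CLI_HEADLESS=false and re-run"),
    ("executable doesn't exist", "uvx --from gflow-cli playwright install chromium"),
    ("quota exceeded", "wait for the reset, or upgrade the subscription"),
]


def suggest_fix(output: str) -> Optional[str]:
    """Return the first matching fix for the given CLI output, or None."""
    lowered = output.lower()
    for needle, fix in KNOWN_FIXES:
        if needle in lowered:
            return fix
    return None
```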

Important constraints

Costs real money / credits. Each gflow video t2v|i2v and gflow image t2i|i2i call burns credits from the user's Google AI Ultra/Pro subscription. Confirm before running batches.
Not for production-grade SLAs. gflow-cli reverse-engineers a private Google API. It can break without notice. For production, use the official Gen AI SDK.
Don't share auth profiles. The Playwright profile dir lives at the per-OS user-data location (Windows: %LOCALAPPDATA%\gflow-cli\profile_*; macOS: ~/Library/Application Support/gflow-cli/profile_*; Linux: ~/.local/share/gflow-cli/profile_*) and contains Google session cookies — treat as secrets.
Same profile can't run in parallel. Chromium refuses two persistent contexts on the same profile dir; use different --profile names for parallel work.
Respect Google's Generative AI Prohibited Use Policy. Don't generate content that would get the user's Google account banned.
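
The profile constraint above suggests a simple scheduling rule: run jobs for the same profile one after another, and run different profiles side by side. The following is a sketch of that rule, not gflow-cli code. `run_by_profile` is a hypothetical helper, and `run_one` is any callable that executes one argument list, for example `lambda argv: subprocess.run(argv).returncode`.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable

Job = tuple[str, list[str]]  # (profile name, gflow argv without --profile)


def run_by_profile(
    jobs: Iterable[Job],
    run_one: Callable[[list[str]], int],
) -> dict[str, list[int]]:
    """Run jobs sequentially within a profile and in parallel across profiles."""
    queues: dict[str, list[list[str]]] = defaultdict(list)
    for profile, argv in jobs:
        queues[profile].append(argv + ["--profile", profile])

    def drain(profile: str) -> list[int]:
        # One worker per profile, so a profile never runs two jobs at once.
        return [run_one(argv) for argv in queues[profile]]

    with ThreadPoolExecutor(max_workers=max(1, len(queues))) as pool:
        futures = {profile: pool.submit(drain, profile) for profile in queues}
        return {profile: future.result() for profile, future in futures.items()}
```

Since every job spends credits, a real script would confirm the job list with the user before calling this function.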

Known agent failure modes

Documented errors agents commonly make — negative examples for the SkillOpt training loop:

Mistake	Correct behaviour
`gflow video generate` or `gflow video create`	`gflow video t2v` (text→video) or `gflow video i2v` (image→video)
`--output PATH` on any video/image command	`-o PATH` (short flag), `--out DIR` (image output dir), or `--out-dir DIR` (video output dir)
`gflow auth` bare or `gflow login` to sign in	`gflow auth login`; bare `gflow auth` lists profiles and triggers a login only on first run
`gflow auth refresh` / `gflow auth renew` (don't exist)	`gflow auth login` to refresh a stale or expired session
`playwright install` or `playwright install --all`	`uvx --from gflow-cli playwright install chromium` (Chromium only, ~150 MB)
Running two generations on the same `--profile` in parallel	Use different `--profile` names — Chromium refuses two persistent contexts on the same dir
`GFLOW_CLI_HEADLESS=true` to fix reCAPTCHA failures	`GFLOW_CLI_HEADLESS=false`; headless mode triggers bot detection rather than avoiding it
Calling `gflow image upload` again for an already-uploaded UUID	Pass the UUID directly to `--ref UUID` — no re-upload needed
`--model imagen` / `--model quality` / `--model high`	`--model image4` (Imagen 3.5), `--model nano-pro` (Gem Pix 2), `--model nano2` (Narwhal)
Python: `client = FlowApiClient(...)` then method calls	Must use `async with FlowApiClient(...) as client:` — it's an async context manager
Python: `from gflow_cli import FlowApiClient`	`from gflow_cli.api.client import FlowApiClient`
Piping `gflow video t2v` output into a shell loop for batch	Use `gflow video batch manifest.tsv` — the CLI has a native batch runner
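
Several rows of this table can be checked mechanically before a command is executed. As a rough sketch (`lint_gflow_argv` and its tables are hypothetical, and cover only a few of the rows above), a pre-run lint might look like this:

```python
# Subcommand pairs that do not exist, with the replacement from the table.
BAD_SUBCOMMANDS = {
    ("video", "generate"): "use 'gflow video t2v' or 'gflow video i2v'",
    ("video", "create"): "use 'gflow video t2v' or 'gflow video i2v'",
    ("auth", "refresh"): "use 'gflow auth login'",
    ("auth", "renew"): "use 'gflow auth login'",
}
BAD_MODELS = {"imagen", "quality", "high"}


def lint_gflow_argv(argv: list[str]) -> list[str]:
    """Return warnings for known mistakes in a proposed gflow command line."""
    warnings = []
    hint = BAD_SUBCOMMANDS.get(tuple(argv[1:3]))
    if hint:
        warnings.append(hint)
    if argv[1:2] in (["video"], ["image"]) and "--output" in argv:
        warnings.append("use '-o PATH' or '--out DIR' instead of '--output'")
    for flag, value in zip(argv, argv[1:]):
        if flag == "--model" and value in BAD_MODELS:
            warnings.append("use '--model image4', 'nano-pro', or 'nano2'")
    return warnings
```

An empty list means none of the covered mistakes were found; it does not prove the command is valid.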

Disclaimer

gflow-cli is not affiliated with Google. Reverse-engineered, unofficial; may break when Google changes Flow's private API. Read the DISCLAIMER before deploying in any sensitive setting.