name: antigravity-gemini-image description: Generate or edit images using the Antigravity-hosted Gemini image model via the local gateway. Use when the user asks to create an image, generate an avatar, or edit/transform an existing image with text instructions. Supports text-to-image and image-to-image editing.
Antigravity Gemini Image
Overview
Use the Gemini REST endpoint (via the local gateway base URL) to create or edit images. This skill wraps the correct request format for the gemini-3.1-flash-image model, which does not work reliably with /v1/responses + image_generation.
Quick start
Text → Image
mkdir -p ~/.openclaw/workspace/tmp
/usr/bin/python3 {baseDir}/scripts/generate_gemini_image.py \
--prompt "A cute yellow chick mascot, vector style" \
--out ~/.openclaw/workspace/tmp/chick.jpg
Image → Image (edit)
mkdir -p ~/.openclaw/workspace/tmp
/usr/bin/python3 {baseDir}/scripts/generate_gemini_image.py \
--prompt "Add a tiny red scarf, keep the style" \
--image /path/to/input.jpg \
--out ~/.openclaw/workspace/tmp/chick_edit.jpg
Optional (explicit .env path):
/usr/bin/python3 {baseDir}/scripts/generate_gemini_image.py \
--env-file ~/.openclaw/workspace/.env \
--prompt "A cute yellow chick mascot, vector style" \
--out ~/.openclaw/workspace/tmp/chick.jpg
macOS Keychain (optional): service openclaw-local-160, accounts base-url and api-key.
Workflow (always follow)
- Confirm the prompt (and optional input image) with the user.
- Run the script in
scripts/generate_gemini_image.py. - Return the generated image file to the user.
Parameters
--prompt(required): text instruction.--image(optional): input image for edits.--out(required): output path (jpg/png).--env-file(optional): load a specific.envfile.--base-url(optional): defaults toLOCAL_160_BASE_URL.--api-key(optional): defaults toLOCAL_160_API_KEY.--model(optional): defaults togemini-3.1-flash-image.--temperature(optional): default 0.3.--size(optional): default 4K (set viaGEMINI_IMAGE_SIZE).--ratio(optional): default 16:9 (set viaGEMINI_IMAGE_RATIO).--timeout(optional): HTTP 超时(秒),默认 1200(也可用GEMINI_IMAGE_TIMEOUT或LOCAL_160_TIMEOUT)。
Notes
- The gateway endpoint is:
POST {BASE_URL}/v1beta/models/gemini-3.1-flash-image:generateContent - Auth header uses
x-goog-api-key. - Image generation can be slow; 10–20 minutes is possible. Keep timeouts high (default 1200s or override via
--timeout/ env). - Output image is returned as base64 under:
candidates[0].content.parts[*].inlineData.data. - 4K 输出必须写在
generationConfig.imageConfig(不要直接写image_size):{"imageConfig": {"imageSize": "4K", "aspectRatio": "16:9"}}
References
- See
references/API.mdfor request/response shape.