name: gpt-image-2-api description: Generate and edit images with gpt-image-2 through a third-party OpenAI-compatible API using .env-configured OPENAI_API_KEY and OPENAI_BASE_URL values. Use when the user asks to create images, edit reference images, create visual assets, illustrations, or image variations with a configurable API endpoint. license: MIT compatibility: Designed for Claude Code; requires Node.js and access to an OpenAI-compatible image API. metadata: author: BenedictKing version: "1.3.0" user-invocable: "true" allowed-tools: Bash Read Write
gpt-image-2-api
Purpose
Generate and edit images with gpt-image-2 through a third-party OpenAI-compatible API using .env-configured OPENAI_API_KEY and OPENAI_BASE_URL values. Use when the user asks to create images, edit reference images, create visual assets, illustrations, or image variations with a configurable API endpoint.
Environment Variables & API Key
Two ways to configure API settings (priority: environment variable > .env):
- Environment variables:
OPENAI_API_KEY,OPENAI_BASE_URL, and optionalOPENAI_IMAGE_*overrides .envfile: Place in.env, can copy from.env.example
You can manage the .env file manually or through skill-master env set.
Configuration
OPENAI_API_KEY: API key for the third-party endpointOPENAI_BASE_URL: OpenAI-compatible base URL, for examplehttp://127.0.0.1:3688/v1OPENAI_IMAGE_MODEL: image model name, defaults togpt-image-2OPENAI_IMAGE_PROTOCOL:openai_imagesoropenai_chat, defaults toopenai_imagesOPENAI_IMAGE_SIZE: image size, defaults to1024x1024OPENAI_IMAGE_QUALITY: optional image qualityOPENAI_IMAGE_N: number of images (1-10), defaults to1OPENAI_IMAGE_OUTPUT_FORMAT: optional output format (png,jpeg,webp)OPENAI_IMAGE_FORMAT: legacy alias forOPENAI_IMAGE_OUTPUT_FORMATOPENAI_IMAGE_RESPONSE_FORMAT: optional response format (url,b64_json)OPENAI_IMAGE_BACKGROUND: optional background (transparent,opaque,auto)OPENAI_IMAGE_MODERATION: optional moderation level (low,auto)OPENAI_IMAGE_OUTPUT_COMPRESSION: optional compression level0–100forjpeg/webpOPENAI_IMAGE_PARTIAL_IMAGES: optional streaming partial image count0–3OPENAI_IMAGE_INPUT_FIDELITY: optional edit fidelity (high,low)OPENAI_IMAGE_STYLE: optional generation style fordall-e-3(vivid,natural)OPENAI_IMAGE_STREAM: optionalopenai_imagesSSE toggle (true,false)OPENAI_IMAGE_USER: optional end-user identifierOPENAI_IMAGE_EXTRA_JSON: optional JSON object merged into the request body
Workflow
- Use
scripts/gpt-image-2-api.mjsfor text-to-image and image-edit requests. - If required
OPENAI_*settings are not available from the current environment or this skill's.env, stop and tell the user which variables need to be configured. openai_imagesgeneration uses/images/generationswith JSON;openai_imagesediting uses/images/editswithmultipart/form-data.- Use
OPENAI_IMAGE_PROTOCOL=openai_chatonly when the endpoint lacks/images/*support; it does not support--mask,--background,--output-format, or streaming image events. - Add one or more
--image <path>arguments to edit/reference an input image. - Use
--mask <path>or--mask autoonly withopenai_imagesedits. - Save generated files under the user requested output directory, or the current working directory when unspecified.
- Report saved file paths and any API errors exactly.
Available Scripts
scripts/gpt-image-2-api.mjs— Sends a text prompt, optionally with input images, to the configured OpenAI-compatible endpoint and saves returned images.
Generate Image
node scripts/gpt-image-2-api.mjs --prompt "a concise image prompt" --output ./generated-images --size 1024x1024 --n 1 --output-format png
Use --base-url, --model, --protocol, --size, --quality, --n, --output-format, --background, --moderation, --response-format, and --extra-json only when the user or endpoint requires overrides.
Edit Image
node scripts/gpt-image-2-api.mjs --prompt "make the dog wear a red scarf" --image ./input.png --output ./edited-images
Multiple input images are supported by repeating --image. --input is accepted as an alias for --image.
Mask example for openai_images:
node scripts/gpt-image-2-api.mjs --protocol openai_images --prompt "replace the background" --image ./input.png --mask ./mask.png --output ./edited-images
Auto-mask example for openai_images:
node scripts/gpt-image-2-api.mjs --protocol openai_images --prompt "remove the background" --image ./input.png --mask auto --background transparent --output-format png --output ./edited-images
Fallback example when the endpoint only supports chat completions:
OPENAI_IMAGE_PROTOCOL=openai_chat node scripts/gpt-image-2-api.mjs --prompt "a concise image prompt" --output ./generated-images
Parameters
| Parameter | Required | Type | Default | Description |
|---|---|---|---|---|
prompt |
yes | string | — | Text description for generation or edit |
model |
no | string | gpt-image-2 |
Image model name |
n |
no | integer | 1 |
Number of images, range 1–10 |
size |
no | string | 1024x1024 |
Output size passed through to the upstream endpoint |
quality |
no | string | endpoint default | low, medium, high, auto |
output-format |
no | string | endpoint default | png, jpeg, webp |
response-format |
no | string | endpoint default | url, b64_json; mainly for DALL·E-compatible responses |
background |
no | string | endpoint default | transparent, opaque, auto |
moderation |
no | string | endpoint default | low, auto |
output-compression |
no | integer | endpoint default | Compression 0–100 for jpeg/webp |
partial-images |
no | integer | endpoint default | Streaming partial image count 0–3 |
input-fidelity |
no | string | endpoint default | Edit fidelity: high, low |
style |
no | string | endpoint default | Generation style for dall-e-3: vivid, natural |
user |
no | string | — | End-user identifier |
format |
no | string | — | Legacy alias for output-format |
Size Behavior
The script does not hard-validate size; values are passed through so OpenAI-compatible upstream endpoints can decide what they support.
For the Yunwu-compatible docs currently checked in this project:
/images/generationsdocumentsauto,1024x1024,1536x1024,1024x1536,2048x2048,2048x1152,3840x2160, and2160x3840/images/editscurrently documents the classic GPT image set:auto,1024x1024,1536x1024,1024x1536dall-e-2documents256x256,512x512,1024x1024dall-e-3documents1024x1024,1792x1024,1024x1792- Chat-compatible fallbacks remain endpoint-specific
Because provider behavior may differ by endpoint and documentation may lag behind the implementation, this skill forwards size unchanged and lets the upstream API accept or reject it.
Size Constraints
- Max side length ≤ 3840px
- Both sides must be multiples of 16px
- Aspect ratio ≤ 3:1 (long/short)
- Total pixels: min 655360, max 8294400