name: illustrious-prompting
disable-model-invocation: true
description:
Prompting rules for the OnomaAI Illustrious-XL SDXL anime / illustration model. Use when the user asks to generate
/ draw / render an anime / illustration image on an Illustrious-XL workflow (v0.1, v1.0, or a near-vanilla finetune).
Illustrious takes plain Danbooru tags + natural language - no score_N quality tags and no @-prefixed artist tags.
Do not use for photorealism.
Illustrious-XL Prompting
Illustrious-XL is OnomaAI's SDXL-based anime /
illustration model, trained on the Danbooru2023 dataset. Write the prompt and negative you pass to the
generate_image tool per the rules below to drive Illustrious-XL well on the first call. The rules here are specific to
Illustrious-XL v0.1 / v1.0 and near-vanilla finetunes.
Illustrious supports Danbooru-style tags, natural-language captions, and mixes of the two. Per the v1.0 model card, it "combines advanced natural language processing with concise Danbooru tag-based prompts" - all three styles work.
Safe default when unsure: lead the prompt with masterpiece, best quality,, then describe the subject in roughly
8-15 lowercase Danbooru tags (or 2+ sentences of prose) covering subject, outfit, scene, lighting, and exactly one
framing tag; always pass the recommended negative from the negative recipe. Illustrious has
no style baked in, so terse prompts come out bland - spend the detail. Everything below is how to do better than this
default when the request calls for it.
The generate_image call
| Arg | For Illustrious-XL |
|---|---|
prompt |
Required. Lead with quality tags; then subject; then descriptive tags. No score_N, no @artist. |
negative |
Always pass one. Start from the recommended negative below. |
workflow |
Use the Illustrious workflow when configured; omit only if it is defaultWorkflow. |
width / height |
Native 1536x1536. Safe defaults 1024x1024 or 832x1216 portrait. Range 512x512-1536x1536; 1248x1824 is fine. |
steps |
20-28. Leave the workflow default unless quality is lacking. |
cfg |
5-7.5. Push to 7 only for tag-heavy prompts; lower for natural-language ones. |
seed |
Omit for a fresh random image; pass a prior seed to reproduce or vary. |
Do not pass inputImage unless the workflow is an Illustrious img2img variant.
Positive prompt recipe
- Lead with quality tags:
masterpiece, best quality,(optional:highres, absurdres,for higher fidelity bias). - Then the subject as tags, natural language, or a mix. Lowercase, comma-separated.
- Spaces over underscores in general tags (
brown hair, notbrown_hair). Multi-word proper nouns from Danbooru may keep their underscores (hatsune_miku) and both forms usually work - prefer spaces when unsure. - Escape literal parentheses with backslash:
arlecchino \(genshin impact\). - Name then describe characters. With multiple characters, describe each one's hair / eyes / outfit or the model conflates them.
- Be specific. Illustrious has no default style baked in - short or vague prompts give bland output. Name the scene, lighting, palette, framing.
- Pick one framing tag, not three.
cowboy shot, close-up, full bodyconflict; the model card warns against overusing them.
Tag order
[quality / meta tags] [1girl / 1boy / 1other] [character] [series] [general tags: pose, outfit, scene, lighting, palette]
Order matters between sections; within a section tags are free-order. Tag dropout during training means you do not need every relevant tag.
Artist styles
Illustrious accepts Danbooru artist tags directly (e.g. wlop, as109) - no @ or artist: prefix; write the bare
tag. Artist effect varies by how well the artist is represented in Danbooru2023.
Negative prompt recipe
Start from the model card's recommendation and add situational terms:
worst quality, comic, multiple views, bad quality, low quality, lowres, displeasing, very displeasing,
bad anatomy, bad hands, scan artifacts, monochrome, greyscale, signature, twitter username,
jpeg artifacts, 2koma, 4koma, guro, extra digits, fewer digits
- Add
nsfw, explicitfor SFW work (Illustrious has nosafepositive-prompt convention; control with the negative). - Add the specific thing you don't want (
text, watermark, multiple views, deformed). - Drop
monochrome, greyscaleif you actually want monochrome.
Tag reference
| Group | Values |
|---|---|
| Quality | masterpiece, best quality, good quality, normal quality, low quality, worst quality |
| Meta | highres, absurdres, official art, scan, jpeg artifacts (usually negative) |
| Framing | portrait, upper body, cowboy shot, full body, close-up - pick ONE |
| Lighting | rim lighting, backlighting, cinematic lighting, soft lighting, dramatic lighting |
| Palette / mood | muted colors, vibrant, low contrast, high contrast, black theme, pastel colors |
Do not use score_9, score_8, score_7_up, source_anime - those score tags have no documented effect on stock
Illustrious; use masterpiece, best quality instead. Do not use an @ prefix on artist tags - write the bare
Danbooru tag.
Generation settings
| Setting | Value |
|---|---|
| Resolution | 512x512 to 1536x1536. Defaults: 1024x1024 square, 832x1216 portrait, 1216x832 landscape. |
| Steps | 20-28. |
| CFG | 5-7.5. Tag-heavy prompt -> push toward 7; natural-language prompt -> stay near 5. |
| Sampler | Euler a is the model card's recommendation. DPM++ 2M Karras is a common workflow swap for detail. |
Sampler/scheduler live in the workflow file, not the tool args - only mention a sampler when asking the user to retune the workflow.
Prompt enhancement (enhance)
The generate_image tool has an opt-in enhance option that routes your prompt through a separate model before the
render to rewrite it into Illustrious's native protocol - the quality-tag-led Danbooru tag list (or tags-plus-prose)
this guide describes.
- It is opt-in. Pass
enhance: trueon the call (or a workflow/config default turns it on). When off, yourpromptis sent as-is. - Use it when the incoming prompt is thin (a few words) and you want a one-shot upgrade to a full tagged prompt.
When you have already built a careful tagged prompt, leave
enhanceoff - it only adds latency and risks drifting from your intent. - Scene continuity. Pass the
contextarg alongsideenhanceto hand the enhancer background to honour (character facts, ongoing scene, wardrobe) without depicting it literally. It is ignored whenenhanceis off. - Negative. The enhancer builds on whatever baseline
negativeyou pass and returns a refined one - it should keep the recommended cleanup terms (see the negative recipe).
If you are enhancing a prompt rather than rendering
You may be reading this not to call generate_image yourself, but as guidance handed to the prompt-enhancement step:
you were given a rough positive prompt, a baseline negative, and the target workflow's protocol, and you must return a
single JSON object {"prompt", "negative"} and nothing else. In that role:
- Act only on the prompt-writing rules above - lead with
masterpiece, best quality,, follow the positive recipe and tag order, and build the negative from the negative recipe. The generation-settings and tool-arg tables do not apply - you do not pick a workflow or call any tool. - Translate and enrich, do not reinvent. Keep the incoming subject and intent; lead with the quality tags and turn
loose phrasing into lowercase Danbooru tags (or a tags-plus-prose mix). Escape literal parentheses (
\(\)) and use the bare artist tag, no@. - Always add detail - even to an already-tagged prompt. Illustrious has no baked-in style, so push toward roughly
8-15 substantive items: infer subject count (
1girl/1boy/1other,solo), then layer in appearance, exactly one framing tag, scene, lighting, and palette that the prompt leaves unstated, staying faithful to the stated subject. - Mine any background context you were given (a scene/continuity note, recent conversation) for that extra detail and fold it in as depictable tags. Treat it as source material to pick from, not a checklist to dump and not a subject that overrides the explicit prompt; ignore chatter with no visual bearing.
- Build on the baseline negative (start from the recommended negative), and add
nsfw, explicitfor SFW work since Illustrious controls safety through the negative.
Worked examples
These illustrate the shape of a finished prompt - adapt them to the actual request, do not reuse them verbatim. Pass
the prompt (and negative) value as a single string.
Tag-style:
prompt: masterpiece, best quality, highres, 1girl, solo, long silver hair, green eyes, witch hat, deep blue robe,
standing, holding spellbook, glowing magic, sunlit library, dust motes, window backlight, soft lighting,
looking at viewer, detailed background
negative: worst quality, low quality, lowres, bad anatomy, bad hands, jpeg artifacts, signature, watermark,
extra digits, multiple views, nsfw
Natural-language:
prompt: masterpiece, best quality. A young anime witch with short silver hair and green eyes stands in a sunlit
library, wearing a wide-brimmed black hat and a deep blue robe. She holds an open spellbook that glows
faintly, and dust motes drift in the light from a tall window behind her. Soft warm palette, cinematic
lighting.
negative: worst quality, low quality, lowres, bad anatomy, bad hands, blurry, jpeg artifacts, watermark, nsfw
The wrapped lines above are a single comma/space-joined string each - pass them as one line in the tool args.
Anti-patterns
- Photorealism prompts. Illustrious is anime/illustration; redirect rather than fight it. For realism, suggest a different checkpoint.
- Three-word prompts.
a cat girlunderuses the model - Illustrious has no default style, so you get bland output. Add quality tags + 6-12 descriptors covering pose, outfit, scene, lighting. - Score tags.
score_9, score_8_up, source_animedo nothing reliable on stock Illustrious; usemasterpiece, best qualityinstead. @-prefixed artists. Write the bare Danbooru tagwlop, not@wlop.- Multiple framing tags.
cowboy shot, close-up, full bodyconfuses composition - pick ONE per the model card. - Dropping the negative. Always pass one; it is the primary cleanup lever for hands, artifacts, and unwanted styles.
- Pushing CFG past 8. Burns colors and crisps edges into noise. Stay 5-7.5.
- Square-bracket weighting. ComfyUI weights with parentheses -
(tag:1.3)to strengthen,(tag:0.8)to weaken. A1111-style square brackets do not de-emphasize here:[tag]parses as([tag]:1)(literal brackets at weight 1). Use(tag:0.8)to weaken instead.