higgsfield-models

star 115

Use when the user asks which model to use, wants to compare models, or needs guidance on selecting between Kling, Sora 2, Wan, Seedance, Veo 3, Minimax Hailuo, Soul, Nano Banana, or other Higgsfield engines.

dsm5e By dsm5e schedule Updated 5/7/2026

name: higgsfield-models description: > Use when the user asks which model to use, wants to compare models, or needs guidance on selecting between Kling, Sora 2, Wan, Seedance, Veo 3, Minimax Hailuo, Soul, Nano Banana, or other Higgsfield engines. user-invocable: true metadata: references: - MODELS-DEEP-REFERENCE.md tags: [higgsfield, models, Kling, Sora, Wan, Seedance, Veo, Soul, NanoBanana] version: 3.0.1 updated: 2026-05-04 parent: higgsfield


Higgsfield Model Selection Guide

Choosing the right model is the single biggest factor in output quality after the prompt. This file handles most selection questions. For deep per-model documentation (prompting specifics, parameters, edge cases, API details) → read MODELS-DEEP-REFERENCE.md.


Quick Decision Flowchart

Fast lookup — for detailed comparisons see the full tables below.

Need Recommended Model Tier
Top-tier cinematic video + audio Kling 3.0 Premium
Epic scale / spectacle Sora 2 Premium
Nature / landscapes + ref images Veo 3.1 Premium
Artistic / stylized video Wan 2.6 Mid
Fast video iteration Seedance 2.0 Pro Mid
VFX / fluid motion Minimax Hailuo 2.3 Mid
Budget-friendly video Kling 2.5 Turbo / Higgsfield DoP Lite Free–Low
Fashion / aesthetic images Soul 2.0 Free
Photorealistic sharp images Nano Banana Pro Low
AI actor generation Soul Cast Low
Native 4K images Kling Image 3.0 Mid
Photo style transformation Photodump (29 presets) Low

Pricing tiers: Free (Soul 2.0, DoP Lite) · Low (0.1–2 credits) · Mid (2–10 credits) · Premium (10+ credits). See the Credit Cost Reference below for exact per-model costs.


Video Models — Comparison

Model Realism Character Motion Style Duration Audio Best for
Kling 3.0 ★★★★★ ★★★★★ ★★★★★ ★★★★☆ 3–15s Cinematic, long, audio, multi-shot
Kling 3.0 Omni ★★★★★ ★★★★★ ★★★★★ ★★★★☆ 3–15s Video clone, storyboard control
Kling 3.0 Omni Edit ★★★★★ ★★★★★ ★★★★☆ 3–10s Edit footage at 3.0 quality
Kling O1 Video (legacy) ★★★★★ ★★★★★ ★★★★☆ ★★★☆☆ 5–10s Multi-ref (7), start/end frame
Kling O1 Video Edit (legacy) ★★★★☆ ★★★★★ ★★★★★ 3–10s Relight, restyle, swap, remove
Kling 3.0 Motion Control ★★★★★ ★★★★☆ ★★★★★ ★★★☆☆ 3–30s Optional Motion transfer from reference video
Kling 2.6 (legacy) ★★★★★ ★★★★★ ★★★★☆ ★★★☆☆ 5–10s Character drama, realism (no audio)
Kling 2.5 Turbo ★★★★☆ ★★★★☆ ★★★★☆ ★★★☆☆ 5–10s Fast Kling iteration
Sora 2 ★★★★☆ ★★★☆☆ ★★★★★ ★★★★☆ Epic scale, physics, action
Wan 2.7 ★★★★★ ★★★★☆ ★★★★★ ★★★★★ 2–15s 60fps, T2V/I2V/R2V/edit, first+last frame
Wan 2.6 ★★★★☆ ★★★☆☆ ★★★★☆ ★★★★★ 5–15s Artistic, stylized, improved physics
Wan 2.5 ★★★★☆ ★★★☆☆ ★★★★☆ ★★★★★ 5–10s Native audio, artistic, fantasy
Seedance 2.0 ★★★★★ ★★★★★ ★★★★★ ★★★★☆ 10s 12-asset multimodal, complex motion
Seedance 1.5 Pro ★★★★☆ ★★★★☆ ★★★★☆ ★★★★☆ 10s Best lip-sync, multilingual audio
Seedance Pro ★★★☆☆ ★★★☆☆ ★★★☆☆ ★★★☆☆ 10s Fast iteration, no audio needed
Veo 3.1 ★★★★★ ★★★★☆ ★★★★☆ ★★★★☆ 4–8s Ref images, first/last frame, 4K
Veo 3.1 Lite ★★★★☆ ★★★★☆ ★★★★☆ ★★★★☆ 4–8s Budget 3.1 quality, 1080p, I2V, volume
Veo 3 ★★★★☆ ★★★☆☆ ★★★★☆ ★★★☆☆ 4–8s Nature, environment, stable model
Grok Imagine Video ★★★★☆ ★★★☆☆ ★★★★☆ ★★★★☆ 1–15s Video editing, animate images
Minimax Hailuo 2.3 ★★★★★ ★★★★☆ ★★★★★ ★★★★☆ 6–10s VFX, fluid motion, anime, physics
Minimax Hailuo 02 ★★★★☆ ★★★☆☆ ★★★★★ ★★★☆☆ 6–10s Dance, sports, fluid motion
Higgsfield DoP (Lite/Standard/Turbo) ★★★☆☆ ★★★☆☆ ★★★★☆ ★★★☆☆ 3–5s I2V specialist, 50+ presets, optical physics

Decision Flowchart

Is this image or video?
├── IMAGE
│   ├── Person / portrait? → Soul 2.0
│   ├── Cinematic keyframe for I2V pipeline? → Soul Cinema Preview
│   ├── Native 4K / image series / storyboarding? → Kling Image 3.0
│   ├── Maximum sharpness / 4K? → Nano Banana Pro
│   ├── Fast pro-quality / text rendering? → Nano Banana 2
│   ├── Reference consistency or dense text? → Seedream 4.5
│   ├── Complex layout / multi-panel? → Seedream 5.0 Lite
│   ├── Text/logo in image? → GPT Image 1.5
│   └── Edit an existing image? → Flux Kontext
│
└── VIDEO
    ├── EDIT existing footage?
    │   ├── Relight, restyle, swap, remove → Kling O1 Video Edit
    │   └── Higher quality 3.0 edit → Kling 3.0 Omni Edit
    │
    ├── Is a human character the focus?
    │   ├── Need audio, long clip (15s), multi-shot → Kling 3.0
    │   ├── Need to clone from reference video → Kling 3.0 Omni
    │   ├── Best lip-sync + multilingual → Seedance 1.5 Pro
    │   ├── No audio needed, great character → Kling 2.6
    │   └── Fast iteration → Kling 2.5 Turbo
    │
    ├── Need motion transfer from reference video?
    │   └── → Kling 3.0 Motion Control
    │
    ├── Animate a still image with cinematic camera?
    │   └── → Higgsfield DoP (Lite/Standard/Turbo)
    │
    ├── Is the environment/phenomenon the hero?
    │   ├── Nature, documentary, stable → Veo 3
    │   ├── Need ref image consistency → Veo 3.1
    │   ├── Budget Veo 3.1 quality / volume → Veo 3.1 Lite
    │   ├── 60fps, first+last frame, ref images → Wan 2.7
    │   └── Artistic, painterly, fantasy → Wan 2.5/2.6
    │
    ├── Is it action/spectacle?
    │   ├── Epic scale, crowds, physics → Sora 2
    │   ├── VFX, anime, fluid motion → Minimax Hailuo 2.3
    │   └── Dance, sports, budget motion → Minimax Hailuo 02
    │
    ├── Need maximum reference control?
    │   ├── Up to 12 assets (images+video+audio) → Seedance 2.0
    │   ├── Up to 7 image refs → Kling O1 Video
    │   └── Up to 3 asset refs → Veo 3.1
    │
    └── Speed/cost priority?
        ├── Fastest Kling → Kling 2.5 Turbo
        ├── Fastest Seedance → Seedance Pro
        └── Fastest Veo → Veo 3.1 Fast or Veo 3 Fast

Image Models — Quick Selection

Need Model Credits
Fashion / cultural portrait Soul 2.0 Free
Cinematic keyframe for I2V Soul Cinema Preview Low
Cheapest generation Z-Image 0.15
Low-cost portrait Higgsfield Soul 0.5
Low-cost 2K square Kling O1 0.5
Native 4K / image series Kling Image 3.0
4K + advanced editing Kling Image 3.0 Omni
Fast versatile 2K Seedream 5.0 Lite 1
4K versatile Seedream 4.5 1
Sketch-to-image (Draw) Nano Banana 1
Artistic / stylized Wan 2.2 1
Blend multiple references Multi Reference 1.5
Fast pro-quality + text rendering Nano Banana 2 1.5
Complex prompts / text in image GPT Image 1.5 2
Max fidelity / Thinking mode / 14 refs Nano Banana Pro 2
Image editing / inpainting Flux Kontext varies
Photo style transformation (29 cartoon/illustration presets) Photodump Low

Full image model specs + UI controls → ../../image-models.md Full Photodump preset library (29 named styles) → ../../photodump-presets.md


Budget Tiers

Image models — by credit cost:

  • Free / near-free: Soul 2.0 (5K gens) · Z-Image (0.15) · Face Swap (2 free)
  • Budget (0.5–1): Higgsfield Soul · Kling O1 · Seedream family · Nano Banana · Wan 2.2 · Reve
  • Mid (1.5–2): Nano Banana 2 · Multi Reference · FLUX.2 Pro · Flux Kontext Max · GPT Image · NB Pro · Character Swap
  • Premium (5–6): FLUX.2 Flex · FLUX.2 Max

General pricing tiers (video + image, approximate):

  • Free: Soul 2.0 · DoP Lite (limited)
  • Low: 0.1–2 credits per generation
  • Mid: 2–10 credits per generation
  • Premium: 10+ credits per generation

For exact per-model video costs see the Credit Cost Reference in ../../model-guide.md.


Unique Feature Matrix

Feature Available on
Native audio (dialogue, SFX, ambient) Kling 3.0/Omni · Seedance 1.5 Pro/2.0 · Veo 3/3.1 · Grok Video
Soul ID character slot Soul 2.0 · GPT Image · Higgsfield Soul
@ Elements syntax Seedream 4.5/5.0 Lite · Nano Banana Pro · Cinema Studio
Draw (sketch-to-image) Nano Banana · Nano Banana Pro
Video editing (relight/restyle/swap) Kling O1 Video Edit · Kling 3.0 Omni Edit · Grok Video
Multi-image reference blend Multi Reference · Nano Banana Pro (14 refs) · Kling O1 Video (7 refs)
Start/end frame control Kling O1 Video · Veo 3.1 · Wan 2.7
Video extension (up to 148s) Veo 3.1
Performance cloning from video Kling 3.0 Omni
Up to 30s camera/motion transfer Kling 3.0 Motion Control
Soul Cast AI actors Cinema Studio 2.5
Soul Cast AI actors (General 2K / Character 4K / Location 4K) Cinema Studio 3.0 (Business/Team)
Built-in color grading Cinema Studio 2.5 (full grading suite) · Cinema Studio 3.5 (Color Palette axis in Style Settings — 8 named palettes)
Native dual-channel stereo audio Cinema Studio 3.0 (Business/Team) · Kling 3.0/Omni · Seedance 2.0/1.5 Pro · Veo 3/3.1 · Wan 2.5/2.7
Soul HEX color matching Soul 2.0 · Soul Cinema Preview · Cinema Studio 2.5
Native 4K image series Kling Image 3.0
Style presets + Color Transfer Soul 2.0
Google Search grounding Nano Banana Pro
Negative prompts supported Veo 3/3.1 only
Smart auto-camera planning Cinema Studio 3.0 (Business/Team)

Key Model Notes

Kling 3.0 vs 2.6: 3.0 is the current top model — longer clips (15s vs 10s), native audio, multi-shot AI direction, physics engine, 4K HDR, stylized output engine. 2.6 is now legacy — use 3.0 for all new work unless cost is the primary constraint.

Kling V3 vs O3: Use V3 for prompt-driven cinematic work (text-to-video, image-to-video). Use O3 when you have reference media (video or image+audio) to anchor character identity — O3's reference-based consistency is its defining advantage.

Kling 3.0 Motion Control: Upload a 3–30s reference clip to transfer full-body motion, hand gestures, facial expressions. Image Orientation for camera/talking head; Video Orientation for complex motions (dancing, action, full-body movement).

Seedance 2.0: Rule of 12 (up to 12 assets per generation). Real person face uploads blocked — use synthetic character references. Best practices for Seedance 2.0 prompting are integrated into the sub-skills (see higgsfield-prompt, higgsfield-camera, higgsfield-motion).

Veo 3.1 vs 3.1 Lite vs 3: 3.1 adds reference images (up to 3), first/last frame, video extension, 4K. 3.1 Lite is budget-priced 3.1 quality at 1080p — supports T2V and I2V, costs less than half of 3.1 Fast. 3 is stable and proven. Use 3.1 for subject consistency, 3.1 Lite for volume, 3 for pure environment/nature.

Wan 2.7: Major upgrade — native 60fps (vs 24fps in 2.6), up to 15s duration, first+last frame anchoring, up to 5 reference images, 4-model suite (T2V/I2V/R2V/video edit), Flow-Matching architecture. 40% better physics consistency over 2.6.

Wan 2.5: First Wan version with native audio — joint text/audio/video generation. Supports audio-driven video (upload audio to drive visuals). 1080p, 5–10s.

Minimax Hailuo 2.3 vs 02: 2.3 is a major upgrade — improved physics, anime/illustration styles, facial micro-expressions, better prompt adherence. Fast variant now at 1080p (02 Fast was 512p). 02 remains available for budget motion work.

Grok Imagine: Aurora architecture (autoregressive, not diffusion) — excels at text/logo rendering and multi-image compositing. Image editing supports multi-turn iterative chains.

For deep documentation on any specific model → read MODELS-DEEP-REFERENCE.md


Cinema Studio 3.0 (Business/Team Plan)

Cinema Studio 3.0 is a separate generation engine available on Business and Team plans. Version toggle in the upper-right corner of the Cinema Studio UI switches between 2.5 and 3.0.

Feature Cinema Studio 2.5 Cinema Studio 3.0 (Business/Team) Cinema Studio 3.5
Video Resolution Up to 1080p Up to 720p (may increase) 480p / 720p / 1080p (three-tier)
Image Resolution Up to 4K Up to 4K (Character/Location) · Up to 2K (General) 1.5K / 2K (Soul Cinema, default image model) · 1K / 2K / 4K (Cinematic Cameras image model)
Max Duration 12s 15s 15s
Aspect Ratios 6 options 7 options (+ 21:9 ultrawide) Video: 7 options (Auto, 16:9, 9:16, 4:3, 3:4, 1:1, 21:9) · Image: 8 options (1:1, 3:4, 2:3, 9:16, 3:2, 4:3, 16:9, 21:9)
Audio On/Off On/Off (native dual-channel stereo) On/Off (generated alongside video)
Shot Control Manual multi-shot Smart (auto) + Custom multi-shot Video: 3-pill main UI (Genre / Style / Camera) · Image: Cinematic models picker (Soul Cinema default + Cinematic Characters / Locations / Cameras) — see higgsfield-cinema
Generation Cost Varies 48 credits Varies — see Higgsfield plan documentation

For full Cinema Studio 3.0 documentation → see higgsfield-cinema

Cinema Studio 3.5 sits alongside 2.5 and 3.0 in the model selector — all three coexist on the platform, version is user-selected, and there is no auto-routing between them. 3.5 reframes the surface: the main UI collapses creative control into three pills (Genre / Style / Camera), each defaulting to Auto with manual override available. Optical physics is restored via a four-axis Camera Settings panel (Camera Body / Lens / Focal Length / Aperture, with 75mm added as a new focal length vs 2.5's 8/14/35/50mm set — vocabulary differs from 2.5; do not mix). The Style Settings panel exposes three preset axes (Color Palette / Lighting / Camera Moveset Style) plus a free-form Manual Style mode for natural-language style direction. An AI director toggle is visible in the bottom toolbar; function not yet documented. 3.5 supports both video and image generation; the image-mode picker exposes four Cinematic models (Soul Cinema default, plus Cinematic Characters, Cinematic Locations, and Cinematic Cameras with 2.5 vocabulary) — see higgsfield-cinema for the image-mode surface.

For full Cinema Studio 3.5 documentation → see higgsfield-cinema


Related skills

  • higgsfield-prompt — MCSLA formula, prompt structure
  • higgsfield-cinema — Cinema Studio model selection
  • higgsfield-assist — Credit optimization and plan selection
  • higgsfield-audio — Audio-capable model details
  • templates/ — Annotated templates with per-genre model recommendations
Install via CLI
npx skills add https://github.com/dsm5e/aso-tracker --skill higgsfield-models
Repository Details
star Stars 115
call_split Forks 14
navigation Branch main
article Path SKILL.md
More from Creator