name: content-engine-cinema category: broomva version: "1.0.0" kind: python description: "Cinematic Generation Layer — the taste and technique library for premium AI content. Codifies start-frame doctrine, camera style vocabulary (Anderson/Fincher/Nolan/Villeneuve/Kubrick), node pipeline architecture, scene generation formulas, and 25+ design styles with exact prompts. Tier-1 generation via Higgsfield CLI (30+ models — Soul V2, Nano Banana 2, Veo 3.1, Kling 3.0, Seedance 2.0, Flux 2, Marketing Studio). Triggers on: 'generate content', 'cinematic', 'create scene', 'start frame', 'style library', 'generate campaign', 'AI filmmaking', 'higgsfield generate', 'soul cinema', 'marketing studio'."
Cinematic Generation Layer
The taste and technique library that turns AI generation tools into a coherent filmmaking system. This skill codifies the principles, workflows, and prompt engineering patterns that separate generic AI output from intentional, cinematic content.
Core Doctrine: Start-Frame First
The single most important principle in AI video generation: video quality is only as good as your initial image. This comes from ohneis652's production experience and is validated across every major video model (Kling, Wan, Sora, Veo, Seedance).
The workflow is never "generate video from text." It is always:
Concept → Start Frame (image) → Motion (video) → Post-Production
A mediocre prompt with a perfect start frame produces better output than a perfect prompt with no image guidance. The start frame locks composition, lighting, color palette, character appearance, and camera position. The video model's job is reduced to adding motion to an already-excellent frame.
See references/cinematic-prompting.md for the full start-frame doctrine.
Camera Style Vocabulary
Every director has a visual signature. Encoding that signature into prompts produces results that feel intentional rather than random. The vocabulary below maps director aesthetics to prompt-level decisions.
| Director | Signature Look | Camera | Lighting | Color | Prompt Keywords |
|---|---|---|---|---|---|
| Wes Anderson | Symmetrical compositions, pastel palettes, flat staging | Static, centered, planimetric | Soft, even, warm tungsten | Pastel yellows, pinks, teals, cream | symmetrical composition, centered framing, pastel color palette, whimsical, flat staging, warm tungsten lighting |
| David Fincher | Dark precision, clinical framing, desaturated | Slow dolly, locked tripod, overhead | Low-key, practical sources, sodium vapor | Desaturated, green-amber, cold | desaturated color grade, dark shadows, clinical precision, cold blue-green tones, low-key lighting, overhead shot |
| Christopher Nolan | IMAX scale, practical effects, blue-orange grade | Sweeping crane, handheld verité, IMAX wide | Natural + practical, golden hour | Blue-orange complementary, deep blacks | IMAX wide angle, sweeping crane shot, blue-orange color grade, epic scale, natural lighting, deep blacks |
| Denis Villeneuve | Vast negative space, geometric architecture, silence | Slow push-in, wide establishing, drone | Diffused overcast, hazy atmosphere | Muted earth tones, amber, grey | vast negative space, geometric architecture, diffused atmospheric haze, muted earth tones, slow push-in, contemplative |
| Stanley Kubrick | One-point perspective, symmetry, uncanny | Steadicam tracking, one-point corridors | Harsh top-light, candlelight | High contrast, selective saturation | one-point perspective, symmetrical corridor, steadicam tracking shot, harsh overhead lighting, unsettling, high contrast |
| Wong Kar-wai | Motion blur, neon reflections, step-printed | Handheld, canted angles, slow-motion | Neon, fluorescent, saturated practicals | Deep reds, greens, blues, warm skin | neon reflections, motion blur, step-printed slow motion, saturated colors, handheld camera, intimate framing |
| Ridley Scott | Smoke, volumetric light, tactile textures | Slow dolly, low angle, aerial | Volumetric beams, industrial haze | Cool steel, warm amber accents | volumetric light beams, atmospheric smoke, industrial textures, slow dolly, low angle, cinematic anamorphic |
| Terrence Malick | Magic hour, natural world, ethereal | Steadicam floating, wide nature | Golden hour exclusively, backlit | Warm golden, green, earth | magic hour golden light, backlit silhouette, steadicam floating through nature, ethereal, natural world, wide angle |
Using the Vocabulary
Combine director keywords with subject and scene description. The director style replaces vague aesthetic instructions:
# Weak prompt
"A person walking down a hallway, cinematic"
# Strong prompt (Kubrick vocabulary)
"A person walking down a long symmetrical corridor, one-point perspective,
steadicam tracking shot following from behind, harsh overhead fluorescent lighting,
high contrast, institutional green walls, unsettling atmosphere, 35mm film grain"
Node Pipeline Architecture
The pipeline is a chain of specialized tools, each handling one aspect of the generation process. This mirrors how a professional post-production pipeline works: each node does one thing well.
[SD/Flux Base] → [LoRA Style Lock] → [Multi-Angle/Pose] → [Motion/Video] → [Upscale] → [Color Grade]
↓ ↓ ↓ ↓ ↓ ↓
Raw image Style-locked Character turnaround Animated 4K/8K res Final look
from prompt to brand/era or scene angles sequence output film-ready
Each node is a discrete design decision. Removing or reordering nodes changes the output character. See references/node-pipelines.md for the full ComfyUI implementation.
Key nodes:
- Base generation (SD 3.5 / Flux / Nano Banana) -- raw creative output from prompt
- LoRA style lock -- constrains output to a specific era, brand, or visual language
- Multi-angle expansion (Nano Banana multi-view) -- generates turnarounds and alternate poses
- Motion synthesis (Wan / Kling / Seedance) -- adds video motion from the locked frame
- Upscaling (Topaz Gigapixel / Real-ESRGAN) -- resolution enhancement without artifact introduction
- Color grading (ffmpeg LUT / Lightroom) -- final color science and look
Scene Generation Formula
The viznfr method for photorealistic scene generation follows a strict four-step formula that ensures character consistency across scenes. See references/realistic-scenes.md for the full workflow.
Persona Definition → Character Sheet → Scene Description → Face Swap → Generate
- Persona: demographic, personality, wardrobe style, environment
- Character sheet: multi-angle reference with consistent features (generated via LoRA or multi-view)
- Scene description: environment, action, lighting, camera angle, mood
- Face swap: locks facial identity from character sheet onto scene output
- Generate: final render with all constraints applied
Tool Priority Matrix
Three tiers based on quality, speed, and cost. Always prefer the highest available tier.
| Priority | Image Gen | Video Gen | Upscaling | Color Grade |
|---|---|---|---|---|
| Tier 1: Higgsfield CLI (default) | higgsfield-generate --model soul_v2 / nano_banana_2 / flux_2 |
higgsfield-generate --model veo_3 / kling_3 / seedance_2 |
Topaz Gigapixel CLI | Lightroom (preset batch) |
| Tier 1 (alternates) | Nano Banana 2 (@google/genai) |
Veo 3.1 (@google/genai) |
— | — |
| Tier 2: Browser-automated | Midjourney (via browser) | Kling 3 Pro (via fal.ai) | Topaz web UI | DaVinci Resolve |
| Tier 3: Local pipeline | ComfyUI + SD/Flux | ComfyUI + Wan/AnimateDiff | Real-ESRGAN CLI | ffmpeg + LUT files |
Why Higgsfield CLI is the default Tier 1: a single CLI auths once and exposes 30+ models (Soul V2, Nano Banana 2, Veo 3.1, Kling 3.0, Seedance 2.0, Flux 2, GPT Image 2, Marketing Studio). One credit pool, one auth flow, scriptable from Claude Code. Higgsfield's own guidance: "If you are using Claude Code or Codex, it's better to use the CLI" — confirmed by the bundled higgsfield-generate, higgsfield-product-photoshoot, higgsfield-soul-id skill wrappers.
Generation Modes
Mode 1: Higgsfield-first (default — recommended for agent workflows):
- One CLI, 30+ models. Auth once via
higgsfield auth login. - Image generation:
- Cinematic editorial / lifestyle:
higgsfield-generate --model soul_v2 --prompt "..." - Character consistency / multi-angle:
higgsfield-generate --model nano_banana_2 --prompt "..." - General photoreal:
higgsfield-generate --model flux_2 --prompt "..." - Product photoshoot: invoke
higgsfield-product-photoshootskill (mode-specific brand-quality enhancement)
- Cinematic editorial / lifestyle:
- Video generation (start-frame first):
- Premium:
higgsfield-generate --model veo_3 --image <upload_id> --prompt "..." - Cinematic motion:
higgsfield-generate --model kling_3 --image <upload_id> - Multi-shot storytelling:
higgsfield-generate --model seedance_2 --image <upload_id>
- Premium:
- Soul Character training for face/identity locking across a series: invoke
higgsfield-soul-idskill. - Marketing Studio for branded ads with avatar + product:
higgsfield-generate --model marketing_studio_video. - Best for: 90% of cinematic generation work. Skip the other modes unless you specifically need a model Higgsfield doesn't expose.
Mode 2: Direct provider APIs (fallback when Higgsfield is missing a model):
@google/genaiSDK for Nano Banana, Veo 3.1, Imagen@fal-ai/clientfor Kling, Sora, alternative Flux variants- Use when: a specific provider exposes a model Higgsfield hasn't bundled, or when you want provider-direct billing
Mode 3: Higgsfield MCP (for Claude Desktop / web Claude):
- Add
https://mcp.higgsfield.aias a custom connector in Claude settings → Connectors - Same 30+ models, same auth, GUI-driven
- Use when: not running Claude Code or Codex; the CLI path is preferred for those
Mode 4: Browser-automated (rare, for tools without API/MCP):
- Use
/agent-browserto drive Midjourney, Weavy, Artlist, or other web-only tools - Best for: music selection, premium one-offs in tools without programmatic access
- Note: Higgsfield Studio is now CLI-accessible; do not use the browser path for Higgsfield models anymore
Mode 5: Local pipeline (maximum control):
- ComfyUI with custom node graphs for full pipeline control
- LoRA training and application for brand-locked output
- Best for: brand content where you need bit-exact reproducibility, R&D on custom styles
Cinema → Higgsfield workflow integration
The compounded workflow when content-engine-cinema dispatches a scene:
Concept (cinema scene brief)
↓
Camera-style vocabulary picked (Fincher / Nolan / Villeneuve / etc.)
↓
Style fragment + scene description merged into final prompt
↓
START FRAME: invoke higgsfield-generate --model soul_v2 (or nano_banana_2)
→ upload_id captured from output
↓
[Optional: invoke higgsfield-soul-id if face/identity needs to be locked]
↓
MOTION: invoke higgsfield-generate --model veo_3 / kling_3 --image <upload_id>
→ wait for job, capture video_url
↓
POST-PRODUCE: Topaz Gigapixel for upscale, Lightroom or ffmpeg LUT for grade
↓
Output to /content-engine/output/{campaign-slug}/raw/
↓
manifest.json updated with: prompt, model, upload_id, job_id, timestamp, identity_refs
Every node above is a single CLI invocation; no manual browser steps.
Design Style Library
25+ codified design styles with exact prompt fragments. Each style is a reusable aesthetic recipe. See references/style-library.md for the complete library with example prompts and use cases.
Selected styles:
| Style | Visual Signature | Best For |
|---|---|---|
| Flat Illustration | Bold shapes, minimal shadows, geometric | Explainer content, social cards |
| Neo 3D | Glossy materials, soft shadows, floating objects | Product showcases, app marketing |
| Brutalism | Raw concrete, exposed structure, monospace | Developer tools, infrastructure |
| Y2K | Chrome, gradients, bubble letters, translucent | Gen-Z targeting, nostalgia |
| Romantasy | Soft light, floral, painterly, warm | Lifestyle, wellness, editorial |
| Cybercore | Neon grids, holographic, dark backgrounds | Tech products, AI content |
| Vintage 80s | VHS grain, neon, synthwave, chrome text | Retro campaigns, music |
| Bauhaus | Primary colors, geometric, grid, sans-serif | Design tools, education |
Motion and Animation
See references/motion-animation.md for the full motion technique guide.
Key capabilities:
- Kling motion transfer: use a reference video as a motion driver -- zero rigging, the model extracts motion and applies it to your character
- Wan image-to-video: animate a start frame with natural motion, best for subtle camera moves and environmental animation
- Seedance 2.0: multi-shot storytelling with character consistency across cuts
- Camera control: encode specific camera movements (dolly, pan, crane) into generation prompts
Post-Production
See references/upscaling-grading.md for the upscaling and color grading pipeline.
Output organization:
assets/
raw/ # Direct model output (never delete)
upscaled/ # After Topaz or Real-ESRGAN
graded/ # After LUT or Lightroom
final/ # Delivery-ready files
Reference Files
- references/cinematic-prompting.md -- start-frame doctrine, camera vocabulary, prompt engineering
- references/realistic-scenes.md -- photorealistic scene generation formula, character consistency
- references/node-pipelines.md -- ComfyUI node architecture, pipeline design
- references/motion-animation.md -- Kling motion transfer, Wan animation, Seedance multi-shot
- references/style-library.md -- 25+ design styles with prompt patterns
- references/upscaling-grading.md -- Topaz, Real-ESRGAN, ffmpeg LUT grading, output organization