name: videoclaw-workflow
description: >
Use this skill whenever the user wants to "make a drama with videoclaw",
"produce a short video drama", "用 videoclaw 做短剧", "build a TikTok
drama", "run the videoclaw pipeline", or invokes any claw drama …
command. Always-active entrypoint into the videoclaw drama lifecycle —
covers drama new / drama import (setup), plan / script,
design-characters / design-scenes / design-cover /
assign-voices / refresh-urls, preview-prompts, run, audit /
audit-regen, export. Loads related skills
(/videoclaw-drama-setup, /videoclaw-models,
/videoclaw-checkpoint, /videoclaw-troubleshoot) when sub-phases
apply.
metadata:
author: VideoClaw Contributors
license: Modified-MIT
version: 0.1.4
requires:
bins:
- claw
install: "uvx --from https://github.com/AIGC-Hackers/videoclaw-cli/releases/download/v0.1.4/videoclaw-0.1.4-py3-none-any.whl videoclaw setup"
VideoClaw Drama Production Workflow
STOP — Do NOT generate prompts or videos yet. If no series exists, start with
/videoclaw-drama-setupto create one viaclaw drama neworclaw drama import. Runclaw --json drama listto check first. Skipping setup leads to orphaned assets, broken checkpoint references, and silent re-billing.
This skill is the always-active entrypoint for producing a TikTok-format Western live-action AI drama with videoclaw — input a script, output a 50–90s episode through the setup → plan → design → preview → generate → audit → export lifecycle.
Requires: videoclaw ≥ 0.1.0 with Evolink key configured for LLM and default
gpt-image-2image assets. Runclaw --json doctorfirst; if it returns exit 3, load/videoclaw-troubleshootto fix the auth path.
Phase 0 — Understand
Before scaffolding anything, get the user's intent in writing. Ask these and wait for answers — do not assume:
- Synopsis or finished script? — concept-driven (LLM authors) vs imported finalized script (locked, decomposition only).
- Language? —
zh(default fordrama new) oren(default fordrama import). - Episodes × duration? — default 5 episodes × 70s. TikTok cap is 90s; below 50s is too short for narrative arc.
- Genre / style? —
drama/cinematicare the defaults; other genres (thriller,romance,comedy) acceptable. - Video model preference? — default
seedance-2.0. If the user has cost / region constraints, load/videoclaw-models. - Image asset provider? — default Evolink
gpt-image-2(1K,medium). BytePlusseedream-5.0-liteis an optional explicit fallback, not the default. - Any constraints on faces? — Seedance Privacy Information filter rejects realistic women's faces; turnarounds must be stylized.
Once answered, persist intent in a working note (e.g. project memo) before scaffolding. Do not skip. Wrong assumptions here cost hours.
Phase 1 — Setup (load /videoclaw-drama-setup for details)
Pick one of three entry modes:
| User has | Command | Mode |
|---|---|---|
| A synopsis only | claw drama new "<synopsis>" --title "<t>" --lang zh --episodes 5 |
LLM authors script (creative) |
| A finalized .pdf / .docx / .txt script | claw drama import script.pdf --title "<t>" --lang en |
Locked, decomposition only |
| Existing series, want to re-plan | claw drama script <series_id> --episode N |
Re-author specific episode |
After setup, capture the series_id from the JSON envelope — every
later command takes it as an argument.
Phase 2 — Plan
claw drama plan <series_id> # episodes outline + scene list (LLM)
claw drama script <series_id> --episode 1 # full scene-by-scene script for ep 1
Outputs land under {VIDEOCLAW_PROJECTS_DIR}/dramas/<series_id>/. Open
the generated script and verify scene blocks have location /
time_of_day / characters_present / emotion / scene_group
populated.
Phase 3 — Design assets
Order matters — characters before scenes (scenes reference characters):
claw drama design-characters <series_id> # Universal Reference turnaround sheets
claw drama design-scenes <series_id> # location reference images
claw drama design-cover <series_id> --episode 1 # TikTok thumbnail
claw drama assign-voices <series_id> # only for non-native-audio video models
For the default seedance-2.0 model, skip assign-voices unless the
user explicitly opts into external TTS. Seedance 2.0 co-generates
dialogue, subtitles, SFX, and ambient audio inside each clip; adding
TTS, BGM, or subtitle overlays later degrades short-drama quality.
Default image asset generation uses Evolink gpt-image-2 at
resolution=1K and quality=medium for character turnaround sheets,
scene/location references, props, and cover frames. For direct image
checks, use:
claw image "location reference, cinematic motel exterior" \
--provider evolink --model gpt-image-2 --size 16:9 \
--resolution 1K --quality medium
Only use BytePlus as an explicit fallback when requested or when Evolink image access is unavailable:
claw image "character turnaround sheet" \
--provider byteplus --model seedream-5.0-lite --size 3:4
If reference image URLs become stale (Seedance refuses base64; uses HTTPS only), refresh:
claw drama refresh-urls <series_id>
Load /videoclaw-models for the HTTPS-only rule and the
stylized faces Privacy Information rule before designing
characters.
Phase 4 — Pre-flight
claw drama preview-prompts <series_id> --episode 1
Reads the script + assets and prints the enhanced Seedance 2.0 prompts
for every shot — review before spending API credits. If a prompt
looks wrong, return to Phase 2 (drama script) or Phase 3
(design-characters) before running. Never skip preview on a fresh
series.
Phase 5 — Generate
Always test with the first 3 shots before the full run:
# Test run
claw drama run <series_id> --episode 1 --max-shots 3
# Full episode after the test passes
claw drama run <series_id> --episode 1
Useful flags:
--max-shots N— limit to first N (test budget control)--shot-breakpoint— pause after each shot for manual review--dry-run— wire-only validation, no model calls--start N/--end M— generate episode range
Each stage of run writes a checkpoint snapshot. If run fails
mid-way, don't start over — load /videoclaw-checkpoint and
resume from the last good checkpoint.
Phase 6 — Audit
claw drama audit <series_id> --episode 1 # Vision QA via Claude
claw drama audit-regen <series_id> --episode 1 # audit → regen failing shots → re-audit loop
Audit checks character consistency, dialogue alignment, scene
continuity, and prompt-vs-output match. Failed shots are listed in
the checkpoint; audit-regen loops automatically up to
VIDEOCLAW_MAX_RETRIES (default 3).
For single-shot fixes (audit failed on shot 7 of 12):
claw drama regen-shot <series_id> --episode 1 --shot 7
claw drama edit-shot <series_id> --episode 1 --shot 7 # opens prompt in $EDITOR
Phase 7 — Export
claw drama export <series_id> --episode 1
Writes deliverables under {VIDEOCLAW_DELIVERABLES_DIR}/<drama-name>/:
final mp4, scene-by-scene review directory (semantic filenames, no
UUIDs), audit report, character sheet. Ready to publish to TikTok.
For multi-episode series:
claw drama series-view <series_id> # rebuild series-level review (idempotent)
Quick decision matrix
| Symptom | Action | Skill |
|---|---|---|
claw doctor returns 3 |
API key missing / expired | /videoclaw-troubleshoot |
| Need to pick a non-default video model | Load model selection | /videoclaw-models |
| Generate failed mid-episode | Resume from checkpoint | /videoclaw-checkpoint |
| Audit flagged shots 3, 7, 9 | drama audit-regen (auto) or regen-shot (manual) |
this skill, Phase 6 |
| Reference image URLs expired | claw drama refresh-urls |
this skill, Phase 3 |
| Privacy filter rejecting faces | Switch turnaround to stylized illustration | /videoclaw-models |
Universal rules (the videoclaw constitution)
- Zero hardcoded drama data. Drama-specific info flows through CLI
flags / config / assets — never edit
src/videoclaw/**to embed series-specific values. - Semantic filenames in review directories — no UUID / hash leak
in
docs/deliverables/<drama>/review/. - Subtitle is rendered by Seedance inside the video, not by FFmpeg external overlay. Don't fight this.
- Seedance 2.0 native audio is authoritative. Do not add downstream TTS, BGM, or subtitle overlay nodes for default Seedance drama runs.
- Reference images are HTTPS URLs only (Seedance proxy rejects base64 data URIs).
- Faces in turnaround sheets are stylized / illustrated, not realistic — the Privacy Information filter rejects realistic women.
- TikTok format is locked at 9:16 / 720p / 50-90s / Seedance 2.0 (4-15s per clip).
Reference
Long-form internals (DAG, checkpoint layout, cost accounting):
references/pipeline-internals.md.