loom

name: loom
type: interface
description: Generic DAG task-execution library. Use when a skill needs to drive a DAG of tool / agent / human-gate tasks with mandatory output schemas, static validation, built-in Jinja prompt rendering, and crash-resumable workdir state. Do NOT invoke directly from user prompts; skills import loom and call loom.init / loom.extend / loom.resume.

A passive, single-process DAG task-execution library. Skills declare a static plan; loom validates it, lowers it to plan.yaml, schedules tasks, executes internal tool tasks inline, renders prompts for external agent and human tasks, validates outputs against per-task JSON Schema, and persists every state transition atomically.

Loom never imports skill code, never auto-triggers anything on completion, and never owns the workdir root path.

Workdir layout

Loom owns everything inside the workdir, but the workdir root path is the caller's choice. Skills decide where to put their workdirs and what to name them; loom only decides the layout underneath.

<workdir>/                       ← caller chooses this path
├── plan.yaml                    ← loom: lowered DAG with statuses
├── global/                      ← loom: cross-task shared bag
└── tasks/                       ← loom: per-task state
    └── <NN-task_id>/            ← loom: NN = 1-based plan order
        ├── prompt.md            ← rendered for agent/human tasks
        ├── output.yaml          ← task output (schema-validated)
        └── stderr.log           ← tool subprocess stderr

Two-digit prefix NN comes from _numbered_name(index, task_id) in loom.engine.store. Index is the task's 1-based position in the plan's declaration order, NOT the topological order.
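As a rough illustration of the naming rule, the sketch below rebuilds the directory names from declaration order. `numbered_name` here is a hypothetical stand-in, not the real `_numbered_name` in loom.engine.store, and the zero-padded `NN-id` format is an assumption read off the layout above.

```python
def numbered_name(index: int, task_id: str) -> str:
    """Hypothetical stand-in: two-digit, zero-padded prefix plus the task id."""
    return f"{index:02d}-{task_id}"


# Declaration order drives the prefix, regardless of how the DAG is scheduled.
declared = ["fetch", "classify", "extract-paper"]
dirs = [numbered_name(i, task_id) for i, task_id in enumerate(declared, start=1)]
print(dirs)
```

Because the index follows declaration order, reordering tasks in the plan changes their directory names even if the dependency graph is unchanged.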

A skill that wraps loom (e.g. dojo, curator) typically defines its own WORKDIR_ROOT constant and constructs the workdir from a name plus optional timestamp:

from pathlib import Path
import loom

WORKDIR_ROOT = Path("/tmp/dojo")
wd = WORKDIR_ROOT / name        # → /tmp/dojo/<name>/
runtime = loom.init(workdir=wd, plan=...)

Loom does NOT provide a default workdir, an environment variable, or a CLI flag for choosing the root. That is each wrapping skill's responsibility.

Intermediate files a tool or agent task produces (temp JSON, vars files, generated artifacts) belong inside that task's <workdir>/tasks/<NN-id>/ sub-directory — see the task.workdir template variable. Avoid scattering them across /tmp/; co-locating with the task makes failures inspectable.

Task primitive

A task has fields (id, kind, output_schema, depends_on_all, depends_on_any, when, latch, kind-specific fields):

id — unique within the plan; used for refs and paths.
kind — tool, agent, or human.
output_schema — path to YAML JSON Schema. Required for tool/agent; optional for human (defaults to {type: object}).
depends_on_all — list of upstream ids. Optional.
depends_on_any — list of upstream ids. Optional.
when — predicate string. Optional; defaults to true.
latch — loop block. Optional; when set the task is a loop latch (see Loops).
cmd — argv for tool.
template — Jinja template path for agent and human.

Kinds

Kind	Body
`tool`	Engine runs `cmd` as a subprocess inside `next()`. Stdout → `output.yaml`, stderr → `stderr.log`. Output validated against schema.
`agent`	Engine renders the Jinja template to `prompt.md` and yields the task. Caller dispatches LLM, writes `output.yaml`, then calls `runtime.complete(id)`. Output validated at completion.
`human`	Same as `agent`, but typically used for gates or approval flows.

Semantics

Statuses: pending → ready → running → done | failed | skipped. Every transition is an atomic plan.yaml write.

next() resolves a task once every id in depends_on_all and depends_on_any is in a terminal status (done, failed, or skipped). The semantics are logical:

done ≡ True
skipped ≡ False
depends_on_all ≡ AND
depends_on_any ≡ OR

Resolution applies these checks in order:

1. Cascade-skip. Mark skipped and write skip-reason.log if either:
   - any dep in depends_on_all has status skipped (one False makes the AND False);
   - every dep in a non-empty depends_on_any has status skipped (all Falses make the OR False).
2. Predicate. Evaluate the when: predicate. If it returns false, mark skipped and write skip-reason.log.
3. Otherwise, mark ready and dispatch.
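The resolution order above can be sketched as a small pure function. This is an illustrative model only, not loom's scheduler: `resolve` and its arguments are hypothetical names, the `when` predicate is passed in already evaluated, and failed dependencies are ignored because a failure aborts the run before resolution matters.

```python
TERMINAL = {"done", "failed", "skipped"}


def resolve(statuses, depends_on_all=(), depends_on_any=(), when=True):
    """Return the next status for a task: 'pending', 'skipped', or 'ready'."""
    deps = [*depends_on_all, *depends_on_any]
    # Wait until every upstream task has reached a terminal status.
    if any(statuses[d] not in TERMINAL for d in deps):
        return "pending"
    # Cascade-skip: one skipped dep makes the AND False.
    if any(statuses[d] == "skipped" for d in depends_on_all):
        return "skipped"
    # Cascade-skip: every dep skipped makes a non-empty OR False.
    if depends_on_any and all(statuses[d] == "skipped" for d in depends_on_any):
        return "skipped"
    # Predicate: a false `when` also skips the task.
    if not when:
        return "skipped"
    return "ready"


statuses = {"extract-paper": "done", "extract-video": "skipped"}
print(resolve(statuses, depends_on_any=["extract-paper", "extract-video"]))
print(resolve(statuses, depends_on_all=["extract-paper", "extract-video"]))
```

The two calls show why a fan-in over optional branches wants depends_on_any: the same pair of upstream statuses is ready under OR but skipped under AND.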

Failure is exceptional, not logical. A single failed task halts the entire run: the next call to runtime.next() raises RunAborted carrying the failed task ids. In-flight tasks finish naturally (their outputs are persisted) but no new tasks are dispatched. Orchestrators surface the abort to the user.

Body failures transition the task to failed:

tool subprocess exited non-zero.
agent output.yaml failed schema validation.
Jinja render error (ready → failed).

Empty dependency lists

A task may omit both depends_on_all and depends_on_any to be a root task. When either field is supplied, it must be non-empty — empty lists are rejected by the factory functions in loom.plan with a clear error.
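A minimal sketch of that rule, assuming a helper shaped like the one below. `check_dep_list` is a hypothetical name and `ValueError` is a stand-in; the actual check and exception type in loom.plan are not shown in this document.

```python
from typing import List, Optional, Sequence


def check_dep_list(field: str, value: Optional[Sequence[str]]) -> List[str]:
    """Accept an omitted field (None) but reject an explicitly empty list."""
    if value is None:
        return []  # field omitted: contributes no dependencies
    if len(value) == 0:
        raise ValueError(
            f"{field} must be non-empty when supplied; omit it for a root task"
        )
    return list(value)


print(check_dep_list("depends_on_all", None))
print(check_dep_list("depends_on_all", ["fetch"]))
```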

Public API

import loom
from loom import tool, agent, human, make_plan, latch

plan = make_plan(
    tool("fetch", cmd=[...], output_schema="/abs/fetch.yaml"),
    agent(
        "classify",
        template="/abs/classify.j2",
        output_schema="/abs/classify.yaml",
        depends_on_all=["fetch"],
    ),
    agent(
        "extract-paper",
        template="/abs/extract.j2",
        output_schema="/abs/extract.yaml",
        depends_on_all=["classify"],
        when="${task:classify:quintet.form == 'paper'}",
    ),
)

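# init validates the plan and creates the workdir state
runtime = loom.init(workdir="/abs/run", plan=plan)
# extend re-validates the merged plan, then appends (more_tasks: further tasks built elsewhere)
loom.extend(runtime, more_tasks)
# resume reloads a run from an existing workdir, e.g. after a crash
runtime = loom.resume("/abs/run")

while True:
    action = runtime.next()
    if action is None:
        break
    for task in action.tasks:
        # the rendered prompt is already on disk; dispatch + write output.yaml
        ...
    runtime.commit_running([t["id"] for t in action.tasks])
    for task in action.tasks:
        runtime.complete(task["id"])  # validates output.yaml against the schema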

Worked example

A plan with two source-shape branches and a fan-in.

plan = make_plan(
    tool("fetch", cmd=["curl", URL], output_schema=fetch_schema),
    agent(
        "classify",
        template=classify_j2,
        output_schema=quintet_schema,
        depends_on_all=["fetch"],
    ),
    # Paper-only branch
    agent(
        "extract-paper",
        template=paper_j2,
        output_schema=extract_schema,
        depends_on_all=["classify"],
        when="${task:classify:quintet.form == 'paper'}",
    ),
    # Video-only branch
    agent(
        "extract-video",
        template=video_j2,
        output_schema=extract_schema,
        depends_on_all=["classify"],
        when="${task:classify:quintet.form == 'video'}",
    ),
    # Fan-in across the optional branches. Use depends_on_any
    # because exactly one of the two extracts will run; the
    # other is skipped. AND would propagate False; OR is True
    # as long as at least one branch produced output.
    tool(
        "aggregate",
        cmd=["python", AGG_SCRIPT, "--workdir", "${workdir}"],
        output_schema=agg_schema,
        depends_on_any=["extract-paper", "extract-video"],
    ),
)

Trace for a paper source:

fetch → done.
classify → done with {quintet: {form: 'paper'}}.
extract-paper: when: true → done.
extract-video: when: false → skipped.
aggregate: depends_on_any=[done, skipped] → OR is True → done.

Trace if extract-paper fails:

The next call to runtime.next() raises RunAborted carrying failed_task_ids=['extract-paper']. The orchestrator surfaces this as a run-level error; aggregate is never scheduled.

Loops

A latch block turns a task into a loop latch. The back-edge is latch -> header; the loop body is the natural loop of that back-edge — derived from the graph, never hand-declared. Build the block with the latch() helper and pass it as latch=:

from loom import tool, agent, make_plan, latch

plan = make_plan(
    tool('fetch', cmd=[...], output_schema=fetch_schema),
    agent('fix', template=fix_j2, output_schema=fix_schema,
          depends_on_all=['fetch']),
    # review loops back to fix until approved or fuel runs out:
    agent('review', template=review_j2, output_schema=review_schema,
          depends_on_all=['fix'],
          latch=latch('fix', fuel=5,
                      while_="${task:review:verdict != 'approved'}")),
    tool('publish', cmd=[...], output_schema=pub_schema,
         depends_on_all=['review']),
)

Shapes

Self-loop — header equals the task's own id. Body is just that task. Use for retry / refine: latch=latch('refine', fuel=3).
Natural loop — header is an upstream task (e.g. fix). Body is every node on the paths header .. latch (here {fix, review}).
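For readers unfamiliar with the term, the natural loop of a back-edge can be computed with the textbook backward walk from the latch that stops at the header. The sketch below is that standard algorithm, not a copy of loom.engine.loops; `natural_loop` and the `preds` mapping (task id to its upstream ids) are illustrative names.

```python
def natural_loop(preds, header, latch):
    """Nodes on paths header .. latch: walk predecessors back from the latch."""
    body = {header}      # seeding the header stops the walk there
    stack = [latch]
    while stack:
        node = stack.pop()
        if node not in body:
            body.add(node)
            stack.extend(preds.get(node, ()))
    return body


# Forward edges of the plan above, expressed as predecessor lists.
preds = {"fix": ["fetch"], "review": ["fix"], "publish": ["review"]}
print(sorted(natural_loop(preds, header="fix", latch="review")))
print(sorted(natural_loop(preds, header="refine", latch="refine")))
```

The first call covers the natural-loop shape and the second the self-loop shape, where the body collapses to the single task.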

Exit controls (`fuel` / `while`)

fuel and while are independent exit controls; declare at least one (either alone, or both). After each round the latch continues iff (fuel absent or fuel-1 > 0) and (while absent or while is true); it stops as soon as either control fires. fuel is a positive integer countdown, decremented each round and persisted on the latch.

A bound is not a real termination guarantee — fuel: 1_000_000_000 "terminates" but never ends in practice. Keep fuel sane, prefer a while convergence test, and rely on an operational wall-clock / cost budget in the orchestrator for true runaway protection.
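The continuation rule in this section can be modelled in a few lines. This is a sketch under stated assumptions, not the engine's code: `latch_step` and `count_rounds` are hypothetical helpers, `None` stands for an absent control, and the `while` predicate is supplied already evaluated.

```python
from typing import Iterable, Optional, Tuple


def latch_step(fuel: Optional[int], while_result: Optional[bool]) -> Tuple[bool, Optional[int]]:
    """After one completed round, return (continue?, remaining fuel)."""
    remaining = None if fuel is None else fuel - 1
    fuel_ok = remaining is None or remaining > 0
    while_ok = while_result is None or while_result
    return fuel_ok and while_ok, remaining


def count_rounds(fuel: Optional[int], while_results: Iterable[Optional[bool]]) -> int:
    """Run rounds until the latch stops; one while result is consumed per round."""
    rounds = 0
    for result in while_results:
        rounds += 1
        keep_going, fuel = latch_step(fuel, result)
        if not keep_going:
            break
    return rounds


print(count_rounds(5, [True] * 10))                 # fuel runs out first
print(count_rounds(None, [True, True, False]))      # while turns false first
```

Under this reading, fuel=N allows at most N rounds of the loop body in total, and a false `while` ends the loop early.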

Per-iteration outputs

Body tasks write each round under tasks/<NN-id>/iter-NN/output.yaml (non-loop tasks stay flat). References resolve as:

${task:id} / ${task:id:path} — the latest completed round.
${task:id@k} — round k (absolute index).
${task:id@prev} — the round before the latest completed (use in a while to detect convergence, e.g. while_="${task:id:val} != ${task:id@prev:val}").

A consumer downstream of the loop fires once, after the final round, and reads the last round's output.
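To make the latest / `@prev` distinction concrete, here is a small model that treats a body task's rounds as a list of parsed outputs in completion order. The helper names are hypothetical, and two behaviours are assumptions rather than documented facts: `@prev` is treated as unavailable until two rounds exist, and `@k` is left out because this document does not state its indexing base.

```python
from typing import Any, Dict, List

Round = Dict[str, Any]


def latest(rounds: List[Round]) -> Round:
    """Model of ${task:id}: the latest completed round."""
    return rounds[-1]


def prev(rounds: List[Round]) -> Round:
    """Model of ${task:id@prev}: the round before the latest completed one."""
    if len(rounds) < 2:
        raise LookupError("no previous round yet")  # assumption, see lead-in
    return rounds[-2]


def changed(rounds: List[Round], key: str) -> bool:
    """Convergence test: keep looping while the value still moves between rounds."""
    if len(rounds) < 2:
        return True
    return latest(rounds)[key] != prev(rounds)[key]


history = [{"val": 1}, {"val": 2}, {"val": 2}]
print(changed(history[:2], "val"), changed(history, "val"))
```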

Constraints

Loops must be reducible and form a hammock (single entry through the header, single exit through the latch). loom.init / loom.extend reject:

a loop with no exit control (NoExitConditionError);
a header that does not dominate the latch, or two latches sharing a header (IrreducibleLoopError);
an edge crossing the region boundary other than into the header or out of the latch (LoopEscapeError);
overlapping or nested regions — not yet supported; bodies must be pairwise disjoint (LoopNestingError).

loom visualise shows a latch as ↻ loop → <header> · fuel N · while ….

Reference grammar

Placeholder	Resolves to
`${workdir}`	absolute workdir path
`${task_workdir}`	absolute path to current task dir
`${task:<id>}`	upstream task's full output
`${task:<id>:<jmespath>}`	JMESPath query result
`${task:<id>@<k>}`	loop body: round `k`'s output
`${task:<id>@prev}`	loop body: round before the latest
`${task_path:<id>}`	absolute path to upstream output
`${global}` / `${global:<rel>}`	absolute path to `<workdir>/global`, or to `<rel>` inside it

Inside a when: predicate the syntax is identical; the engine desugars ${task:<id>:<expr>} to JMESPath task."<id>".<expr> and evaluates against a virtual document of all task outputs.

$${...} produces a literal ${...} (escape).
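The desugaring and the escape rule can be illustrated with a regex-based sketch. It is a simplified model, not the code in loom/engine/resolve.py: `desugar` and `unescape` are hypothetical names, only the `${task:<id>:<expr>}` form is handled, and the result is left as a JMESPath string rather than evaluated.

```python
import re

# ${task:<id>:<expr>}, unless preceded by an extra "$" (which marks an escape).
_TASK_REF = re.compile(r"(?<!\$)\$\{task:([^:@}]+):([^}]+)\}")


def desugar(predicate: str) -> str:
    """Rewrite ${task:<id>:<expr>} to the JMESPath form task."<id>".<expr>."""
    return _TASK_REF.sub(lambda m: f'task."{m.group(1)}".{m.group(2)}', predicate)


def unescape(text: str) -> str:
    """Turn the escape $${...} into a literal ${...}."""
    return text.replace("$${", "${")


print(desugar("${task:classify:quintet.form == 'paper'}"))
print(unescape(desugar("$${task:classify:quintet.form}")))
```

Quoting the id in the JMESPath output matters because ids such as extract-paper contain a hyphen, which is not valid in an unquoted JMESPath identifier.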

Workdir layout

<workdir>/
├── plan.yaml             # engine-owned DAG + statuses
├── tasks/<NN-id>/        # per-task scratch
│   ├── output.yaml       # the only file engine reads from a task
│   ├── prompt.md         # rendered prompt (agent/human only)
│   ├── stderr.log        # tool subprocess stderr
│   ├── render-error.log  # jinja error, if any
│   ├── schema-error.log  # output_schema mismatch, if any
│   └── skip-reason.log   # cascade-skip or when:-false reason
└── global/               # cross-task shared state, skill-owned

Output schemas

Every tool and agent MUST declare output_schema pointing to a YAML JSON Schema file. Loom loads, meta-validates, and caches schemas at init/extend. At runtime, complete() validates output.yaml against the schema; mismatch → failed and OutputSchemaError.

Output writer CLI

Lets agent tasks write output.yaml through schema-validated shell calls instead of raw fs_write:

loom output init <workdir> --task <id>
loom output add  <workdir> --task <id> --set path=value [--set ...]

init resolves the task's schema from plan.yaml, seeds top-level array / object containers, and writes tasks/<NN-id>/output.yaml.
add applies dotted path=value assignments (numeric segments are array indices), coerces values per the schema, validates the full file against the schema, and writes back atomically.

The wrapper at scripts/loom.sh runs python -m loom under uv:

$loom_sh output init "$WORKDIR" --task extract-keywords
$loom_sh output add  "$WORKDIR" --task extract-keywords \
    --set keywords.0.name='RAG' \
    --set keywords.0.definition='retrieval augmentation pattern'
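The effect of the `--set` assignments above can be approximated in plain Python. This sketch covers only the dotted-path step (numeric segments as array indices); it skips the schema-driven coercion, validation, and atomic write that `output add` performs, and `set_path` is a hypothetical name rather than a function from loom/builders.py.

```python
from typing import Any, Dict


def _slot(container, key, default):
    """Return container[key], creating it (and padding lists) when missing."""
    if isinstance(key, int):
        while len(container) <= key:
            container.append(None)
        if container[key] is None:
            container[key] = default
        return container[key]
    return container.setdefault(key, default)


def set_path(doc: Dict[str, Any], dotted: str, value: Any) -> Dict[str, Any]:
    """Assign value at a dotted path; numeric segments index into lists."""
    keys = [int(seg) if seg.isdigit() else seg for seg in dotted.split(".")]
    node = doc
    for key, nxt in zip(keys, keys[1:]):
        # The next segment decides which container type to create here.
        node = _slot(node, key, [] if isinstance(nxt, int) else {})
    last = keys[-1]
    if isinstance(last, int):
        while len(node) <= last:
            node.append(None)
    node[last] = value
    return doc


doc = {"keywords": []}  # as if seeded by `output init`
set_path(doc, "keywords.0.name", "RAG")
set_path(doc, "keywords.0.definition", "retrieval augmentation pattern")
print(doc)
```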

Plan extension

loom.extend(runtime, more_tasks) re-runs static validation against the merged plan, then appends. New tasks may reference existing ids in depends_on_* / ${task:...} / when:.

Pattern for branched plans: a tool task generates a plan dict; the orchestrator reads its output and feeds it to loom.extend:

runtime.complete("build-stage2-plan")
plan_dict = runtime.task_output("build-stage2-plan")
loom.extend(runtime, loom.LoomPlan.from_dict(plan_dict))

Static validation (init / extend)

Run before any disk write:

DAG integrity — cycles, missing deps, duplicate ids.
Kind-field consistency — tool has cmd; agent has template; etc.
Mandatory output_schema on every tool and agent.
Schema files exist, parse as YAML, are valid JSON Schema.
Every ${task:<id>:...} reference targets an existing task.
JMESPath dot-paths and array indices resolve against the referenced task's output_schema.
Comparator literals are type-compatible with declared field types.

Any failure raises a LoomPlanError subclass; no disk state is created.
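The DAG integrity check (the first item in the list above) can be sketched with the standard library. This is an illustration only: `check_dag` is a hypothetical helper, `ValueError` stands in for loom's `DAGError`, and the input is simplified to (id, upstream ids) pairs.

```python
from graphlib import CycleError, TopologicalSorter
from typing import List, Sequence, Tuple


def check_dag(tasks: Sequence[Tuple[str, Sequence[str]]]) -> List[str]:
    """Reject duplicate ids, missing deps, and cycles; return one valid order."""
    ids = [task_id for task_id, _ in tasks]
    dupes = sorted({t for t in ids if ids.count(t) > 1})
    if dupes:
        raise ValueError(f"duplicate ids: {dupes}")
    known = set(ids)
    for task_id, deps in tasks:
        missing = [d for d in deps if d not in known]
        if missing:
            raise ValueError(f"{task_id}: missing deps {missing}")
    try:
        # TopologicalSorter takes a mapping of node -> predecessors.
        return list(TopologicalSorter(dict(tasks)).static_order())
    except CycleError as exc:
        raise ValueError(f"cycle: {exc.args[1]}") from exc


print(check_dag([("fetch", []), ("classify", ["fetch"]), ("aggregate", ["classify"])]))
```

Everything here runs on in-memory data, which mirrors the rule that validation happens before any disk write.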

Errors

Class	Raised by
`LoomPlanError`	base for plan-time validation failures
`DAGError`	cycle / duplicate id / missing dep
`SchemaError`	schema file missing or invalid JSON Schema
`ReferenceError`	bad `${task:id:...}` reference
`TypeMismatchError`	comparator literal vs. declared field type
`NoExitConditionError`	loop latch with neither `fuel` nor `while`
`IrreducibleLoopError`	header not dominating latch / shared header
`LoopEscapeError`	edge crosses a loop region boundary illegally
`LoopNestingError`	loop regions overlap or nest (not yet supported; bodies must be pairwise disjoint)
`WorkdirExistsError`	`loom.init` on workdir with `plan.yaml`
`WorkdirNotEmptyError`	`loom.init` on dirty workdir
`RunAborted`	`runtime.next()` after any task has failed
`RunFailed`	tool subprocess exited non-zero
`OutputSchemaError`	output.yaml fails schema validation
`RenderFailed`	Jinja render error

Template inheritance

For Jinja {% extends %} / {% include %}, set template_search_paths on the task to include any directories the included paths resolve against. Loom's FileSystemLoader uses [template_path.parent, *template_search_paths].

Legacy `depends_on` (deprecated)

depends_on= is silently migrated to depends_on_all= on construction and YAML load. Factories emit FutureWarning. Mixing depends_on= with depends_on_all= raises immediately. Task.depends_on remains populated as the order-preserving union of the two canonical lists for callers that just want "every upstream id".
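The migration behaviour described above might look roughly like the following. It is a sketch, not the code in loom.plan: `normalize_deps` is a hypothetical helper, and `ValueError` is a stand-in because the exception raised on mixing is not named in this document.

```python
import warnings
from typing import List, Optional, Sequence, Tuple


def normalize_deps(
    depends_on: Optional[Sequence[str]] = None,
    depends_on_all: Optional[Sequence[str]] = None,
    depends_on_any: Optional[Sequence[str]] = None,
) -> Tuple[List[str], List[str], List[str]]:
    """Return (depends_on_all, depends_on_any, order-preserving union)."""
    if depends_on is not None:
        if depends_on_all is not None:
            raise ValueError("use depends_on_all; do not mix it with depends_on")
        warnings.warn(
            "depends_on is deprecated; use depends_on_all",
            FutureWarning,
            stacklevel=2,
        )
        depends_on_all = depends_on  # migrate the legacy field
    all_ids = list(depends_on_all or [])
    any_ids = list(depends_on_any or [])
    # dict.fromkeys de-duplicates while keeping first-seen order.
    union = list(dict.fromkeys([*all_ids, *any_ids]))
    return all_ids, any_ids, union


print(normalize_deps(depends_on_all=["fetch"], depends_on_any=["fetch", "classify"]))
```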

Implementation

loom/__init__.py — public surface
loom/_lifecycle.py — init, extend, resume
loom/plan.py — tool, agent, human, make_plan factories
loom/engine/models.py — Task, LoomPlan, ActionSpec
loom/engine/store.py — plan.yaml and output.yaml IO
loom/engine/algorithm.py — ready-set, predicates, transitions
loom/engine/runner.py — LoomRuntime execution methods
loom/engine/resolve.py — placeholder substitution
loom/render/jinja.py — Jinja rendering with default context bags
loom/validate/dag.py — DAG + kind-field checks
loom/validate/schemas.py — SchemaCache
loom/validate/references.py — reference + JMESPath + type checks
loom/validate/loops.py — loop admission (reducibility + hammock + nesting)
loom/engine/loops.py — dominators, natural-loop / region computation
loom/builders.py — output init / output add write path
loom/__main__.py — argparse CLI
scripts/loom.sh — uv wrapper for the CLI
loom/errors.py — exception hierarchy

Tests at scripts/loom/tests/ — 346 cases.
