hpc-wrap-entry-point

name: hpc-wrap-entry-point description: "Onboard a repo for hpc-agent submission, autonomously. Given a partial `InterviewSpec` (goal + task_generator are required from the caller; everything else is detected from the repo), the skill: (a) detects an existing entry point or scaffolds one via `build-template --shape {script,notebook}` for greenfield repos, (b) prefers `@register_run` direct decoration on the user's existing function and falls back to materializing a wrapper at `.hpc/wrappers/<run_name>.py` only when direct decoration is structurally blocked (non-Python entry point, `@hydra.main` signature rewrite, vendor code), (c) detects frozen YAML configs by convention, (d) walks the data-axis decision tree, (e) invokes the `interview` primitive to persist `tasks.py` + `interview.json`. No `[Y/n]` prompts — every choice point has a deterministic resolution. Human-driven callers (the `/wrap-entry-point-hpc` slash) gather intent from the user first and pass a fully-resolved spec; the skill records what it was given." allowed-tools: Bash Read Edit Write Glob execution: inline category: agent-autonomous

Agent-facing composition over the interview primitive. Autonomous mode fills a partial InterviewSpec from repo inspection; the slash consumer (/wrap-entry-point-hpc) passes a fully-resolved spec and the skill just records.

The skill persists, in either pathway:

A tasks.py (from the supplied task_generator) whose kwargs include <stem>_sha for every frozen YAML, so cmd_sha distinguishes exp_42.yaml from exp_43.yaml and catches in-place edits.
An interview.json recording the entry-point shape (register_run pointer for 3a; shell_command block with the wrapper for 3b).
Only in the fallback path: a @register_run wrapper at <experiment>/.hpc/wrappers/<run_name>.py whose body subprocess.check_calls the user's entry point with kwargs substituted. Downstream primitives (classify-axis, validate-executor-signatures) introspect the wrapper's typed signature; the underlying entry point stays untouched.

Execution style

Batch independent tool calls into one assistant message. "Parallel" means multiple Bash / Read / Grep / Glob tool-call blocks in a single message — the harness runs them concurrently. NOT shell-level concurrency inside one Bash call (cmd1 & cmd2 & wait, parallel, xargs -P) — that trips the permission classifier as a compound command.
Chain sequential hpc-agent calls with && in one Bash block when the next call does NOT branch on prior structured output (e.g. hpc-agent install-commands && hpc-agent load-context --experiment-dir .). Do NOT chain past a call whose envelope the next call's args depend on — read the envelope first, then issue the dependent call as its own block. (The && block on the dispatched hpc-worker subagent does NOT apply here.)
Be terse. Lead with the action or result; skip filler ("Let me…", "I'll go ahead and…") and trailing restatements of what tool output already shows.
Return via the emit-skill-return file primitive — never via chat. The Skill tool result is no longer the return mechanism; the parent (hpc-submit, hpc-campaign, …) reads your return envelope from <experiment_dir>/.hpc/_returns/hpc-wrap-entry-point.json. The final step of this skill (Step 8 below) writes that envelope and invokes hpc-agent emit-skill-return as the LAST tool call — no closing chat message of any kind. A non-tool-call closing message fires the harness's end-of-turn signal, the parent never resumes, and the user has to type "keep going". The schema for the envelope lives at hpc_agent/schemas/skill_returns/hpc-wrap-entry-point.json and is enforced by the emit verb.

Inputs

Caller-supplied (the skill refuses with spec_invalid if these are absent):

Field	Why the caller has to supply it
`goal`	One-line free-text intent — the skill cannot invent it.
`task_generator`	The shape of the scale-up axis (`items_x_seeds`, `cartesian_product`, `enumerated`, `numeric_linspace`/`logspace`) plus its params. Cannot be inferred from the repo.

Caller may pre-resolve to skip detection:

Field	Skill's default if absent
`entry_point.kind` (`register_run` \| `shell_command`)	Detected by Step 1 + decided by Step 2
`entry_point.path` / `run_name`	Highest-likelihood candidate from Step 1's probes
`shape` (`script` \| `notebook`, greenfield only)	`script` (the dominant case at scale-up time)
`argv` template + `signature` (wrapper path only)	Derived from the entry point's CLI surface
`frozen_configs`	All `configs/.yaml` / `configs/.yml` / `conf/*.yaml` detected by Step 4
`data_axis_hint`	Walked from the decision tree (Step 6); ambiguous → omitted (the framework re-asks at submit). Valid only on `entry_point.kind: shell_command` — omit it on `register_run` (#260)

When to run

The user's repo has any non-notebook entry point — main.py, train.py, run_experiment.py, python -m pkg.cli, ./simulator, etc. — and no @register_run decoration anywhere.
The repo is greenfield — no entry point yet — and the caller wants a seed scaffold.
A fresh /submit-hpc escalated with mature_repo_needs_interview.

Steps

0. Detect or scaffold an entry point (greenfield branch)

Run the entry-point detection verb once — it collapses the entry-point probes (conventional main.py/train.py/run.py/experiment.py at the root and under src/, package __main__.py python -m targets, [project.scripts] console scripts, run.sh/launch.sh/binary entry points, and @register_run decoration on disk) into one deterministic scan:

hpc-agent detect-entry-point --experiment-dir <experiment_dir>

Branch on data.kind:

detected — at least one entry-point candidate exists (data.candidates is non-empty) or a @register_run is already on disk (data.decoration_found is non-empty). Skip to Step 1, reusing this same data block (no need to re-scan).

greenfield — no candidate of any kind and no decoration. This is a greenfield repo. Use the caller-supplied shape (or default to script) and scaffold:

hpc-agent build-template --repo-dir . --shape script    # or --shape notebook

The primitive injects the chosen seed file (train.py at repo root or notebooks/experiment.ipynb) alongside the framework-owned .hpc/ assets. Then re-run detect-entry-point --experiment-dir <experiment_dir> against the freshly scaffolded file and proceed through Step 1 onwards.

1. Detect the entry point

Use the data block from the Step 0 detect-entry-point call (re-run it if you scaffolded a greenfield seed in Step 0):

hpc-agent detect-entry-point --experiment-dir <experiment_dir>

data.candidates lists every entry-point candidate in by-likelihood probe order (used only for stable diagnostic output — not as a tie-break, since ties refuse). Each candidate carries its classified argv_kind — the CLI surface read off the file's imports + decorators: argparse (argparse.ArgumentParser / import argparse), click (@click.command / @click.group), typer (@app.command), fire (fire.Fire(...)), hydra (@hydra.main), or __main__ for a bare if __name__ == "__main__": block or a package __main__.py (invoked as python3 -m <pkg>). A [project.scripts] console script is console_script (its path is the registered command name); a run.sh/launch.sh/binary is shell.

Autonomous resolution:

If the caller supplied entry_point.path, use it (overrides detection).
Else if exactly one candidate matched (len(data.candidates) == 1), use it.
Else (multiple entry points, no caller pick) return spec_invalid with error_code: ambiguous_entry_point listing the candidates. The skill does not silently pick across main.py / train.py / run.py when more than one exists — the wrong choice is non-recoverable without the user noticing.

Record the picked entry point — the path (or -m invocation), and which CLI library it uses (the candidate's argv_kind).

2. Decide the pathway: direct decoration (default) vs. wrapper materialization (fallback)

The default pathway is @register_run direct decoration on the user's existing function. Fall through to wrapper materialization only when direct decoration is structurally blocked.

Deterministic decision table:

Condition	Pathway
Python-importable function the caller can edit — even one that currently parses `sys.argv` via argparse	Step 3a (direct decoration)
Non-Python entry point (shell script, compiled binary)	Step 3b (wrapper fallback)
`@hydra.main` on the entry point (rewrites the signature; `@register_run` cannot see through it)	Step 3b (wrapper fallback)
`@click.command` / `@app.command` that consumes the function (typer/click decorator forms that swap the callable)	Step 3b (wrapper fallback)
Caller's spec sets `entry_point.kind = "shell_command"` explicitly	Step 3b (wrapper fallback)

@click.command and Typer commands are auto-detected by reading the decorator stack. When the decorator stack is [@register_run, @click.command]-compatible (click leaves the underlying callable intact for some shapes), 3a is still safe; only the consuming forms force 3b.

3a. `@register_run` direct decoration (the default path)

Apply the two-line edit autonomously. The decorator goes on the function the framework should treat as the entry point — not on the if __name__ == "__main__": block, but on the function it ultimately calls.

Common shape: argparse-driven main() reading sys.argv. Factor out an inner function and decorate it; keep the argparse block so the CLI still works:

# train.py
from hpc_agent import register_run

@register_run
def run(config: str, seed: int) -> None:
    # ... the body that used to live below argparse.parse_args() ...
    ...

if __name__ == "__main__":
    import argparse
    ap = argparse.ArgumentParser()
    ap.add_argument("--config", required=True)
    ap.add_argument("--seed", type=int, required=True)
    args = ap.parse_args()
    run(config=args.config, seed=args.seed)

Shape: function already takes kwargs. Just add the import and the decorator — no refactor.

Apply the edit via the Edit tool (autonomous; no confirmation). Record the picked run_name (the function name) and proceed to Step 4. Step 7's interview carries entry_point.kind = "register_run".

3b. Wrapper materialization (fallback path)

The rescue boat. The existing wrapper machinery (entry_point.kind = "shell_command") materializes a thin @register_run wrapper at .hpc/wrappers/<run_name>.py whose body subprocess.check_calls the entry-point argv with kwargs substituted in.

3b.i: Derive the argv template + signature autonomously. From the detected entry point's CLI surface:

argv template — the shell command with {placeholder} for each kwarg. First element by entry-point shape:
- File-based Python script → ["python3", "train.py", ...]
- Package module → ["python3", "-m", "mypkg.cli", ...]
- Installed console script → ["mytool", ...]
- Shell script / binary → ["./run.sh", ...] or ["./simulator", ...]
signature — {placeholder: type} derived from argparse type=int, click IntType, typer annotations, etc. Default to str when the source declares no type.

When the caller supplied argv / signature in the spec, use those instead of deriving (this is how the slash command passes through user-confirmed overrides).

The wrapper's typed signature is what downstream validate-executor-signatures checks; the underlying entry point stays opaque to the framework. The submit workflow's Step 0b picks up _materialized.entry_point (with kind == "shell_command") and uses executor_cmd as the job's EXECUTOR.

4. Detect frozen YAML configs

Scan for YAMLs by convention:

ls configs/*.yaml configs/*.yml conf/*.yaml 2>/dev/null

Every match is added to frozen_configs. The convention is one YAML = one frozen experiment — the framework hashes each file's bytes and threads <stem>_sha into every task's kwargs so cmd_sha covers the YAML's content. Two submits of the same YAML dedup; an in-place edit makes cmd_sha differ.

When the caller supplied frozen_configs in the spec, use those instead — slash callers can drop a config the user said is not the experiment's identity.

Constraint: frozen_configs requires a task_generator (caller-supplied; Step 5). If the experimenter wants a hand-written tasks.py, they have to include the shas themselves; frozen_configs is rejected at intake otherwise.

5. `task_generator` is caller-supplied (no autonomous derivation)

The entry point handles one task. The task_generator enumerates the N tasks to fan out. Common shapes:

Shape	When to use	Params
`items_x_seeds`	One frozen config × N seeds	`items=[{config: "exp_42.yaml"}], seeds=[0..99]`
`cartesian_product`	Cross a few axes	`axes={seed: [0..9], shard: [0..3]}`
`enumerated`	Hand-supplied list of N task dicts	`items=[{...}, {...}, ...]`
`numeric_linspace` / `numeric_logspace`	Sweep one numeric hyperparameter	`param="lr", low, high, n`

The skill does not invent a task_generator — refuse with spec_invalid if absent. (The slash command elicits this from the user; MARs supplies it explicitly.)

5b. Cover non-axis required params (fixed_params)

The entry point's signature may require params the task_generator does NOT vary — e.g. monte_carlo_pi(seed, samples) where only seed is swept. If nothing supplies samples, the executor crashes on every task (#195).

Partition the signature params:

Axis params — names the task_generator produces. Handled per-task; leave them out of fixed_params.
Covered-by-default params — params with a default in the entry point's CLI surface (argparse default=, Python default). Safe to omit; you MAY pin one for reproducibility.
Uncovered required params — required (no default) AND not an axis. MUST be resolved or every task fails. For each, set a constant in entry_point.fixed_params:
- Use the entry point's argparse/CLI default if pinning one.
- Else the caller-supplied value (the slash's uncovered_param dialog).
- Never invent silently — no default + no caller value = ambiguity to surface.

fixed_params is baked into every resolve(i) dict; a swept axis of the same name wins. Requires task_generator. Submit-time validate-executor-signatures refuses uncovered_required_param — covering it here keeps that gate green.

6. Pre-declare the DataAxis hint (autonomous tree walk)

In the direct-decoration path (3a), classify-axis can introspect the decorated function directly later — pre-declaring is optional, since the framework will infer the axis from the function body at submit time. The skill can still pre-fill the hint to short-circuit one round-trip.

In the wrapper path (3b), the wrapper body is subprocess.check_call, so classify-axis cannot introspect it later. The hint here is load-bearing.

Apply the same decision tree as hpc-classify-axis Step 4 (autonomous; no confirmation):

Does each row's result depend on rows computed before it? No → independent (DOALL).
Is the carried state a fixed-size summary combinable in any order? Yes → associative (pick sum / moments).
Is the dependence a bounded look-back (e.g. trailing N rows)? Yes → bounded_halo with halo.expr over parameter names.
Otherwise / ambiguous → sequential (fail-safe default; serial is slow, not wrong).

When the tree resolves to ambiguous, omit data_axis_hint from the spec — classify-axis will surface the boundary at submit time and the caller can resolve it then. (Sequential as a default is correct for hpc-classify-axis recording; here we leave the field absent so the framework still has a chance to interview.)

data_axis_hint is valid only on entry_point.kind: shell_command (#260). When the entry_point is register_run, omit it unconditionally — the schema (interview.input.json) only accepts the field on the shell_command shape (a register_run carries its classification through the decorated function's @register_run arguments / type hints), so emitting it on a register_run spec fails schema validation and costs an avoidable validate-fail / retry round-trip.

7. Build the spec and invoke the `interview` primitive

Assemble the InterviewSpec JSON. The entry_point block differs by pathway:

Direct-decoration path (3a):

{
  "goal": "<caller-supplied>",
  "task_count": <N from the resolved task_generator>,
  "produced_by": {"kind": "human", "operator": "<git user.name>"},
  "task_generator": { "kind": "...", "params": { ... } },
  "entry_point": {
    "kind": "register_run",
    "run_name": "<the function name decorated in Step 3a>",
    "fixed_params": { "<uncovered required param>": <value from Step 5b>, ... }
  }
}

(fixed_params omitted when every signature param is an axis or has a default — Step 5b.)

Wrapper path (3b):

{
  "goal": "<caller-supplied>",
  "task_count": <N from the resolved task_generator>,
  "produced_by": {"kind": "human", "operator": "<git user.name>"},
  "task_generator": { "kind": "...", "params": { ... } },
  "entry_point": {
    "kind": "shell_command",
    "run_name": "<chosen, valid Python identifier — e.g. 'forecast' or 'train'>",
    "argv": [ ... from Step 3b.i ... ],
    "signature": { ... from Step 3b.i ... },
    "frozen_configs": [ ... from Step 4 ... ],
    "fixed_params": { ... uncovered required params from Step 5b, else omit ... },
    "data_axis_hint": { ... from Step 6 if resolved, else omit — shell_command ONLY; never emit on a register_run entry_point (#260) ... }
  }
}

Write to /tmp/interview_spec.json and invoke:

hpc-agent interview --spec /tmp/interview_spec.json --campaign-dir .

On ok=True: the envelope reports the materialized artifacts (tasks.py, interview.json, plus .hpc/wrappers/<run_name>.py only on the wrapper path), total_tasks, and cmd_sha. On error_code=spec_invalid: surface the message to the caller — most often a typo (argv placeholder not in signature) or a missing frozen config. The skill does not loop on its own; the caller (slash or MARs) decides whether to re-supply.

8. Emit the return envelope (final tool call)

The parent skill reads the return envelope from <experiment_dir>/.hpc/_returns/hpc-wrap-entry-point.json. Stage it, then emit:

Use the Write tool to write the envelope to <experiment_dir>/.hpc/_returns/hpc-wrap-entry-point.staged.json. Required fields on the Success branch: ok: true, skill: "hpc-wrap-entry-point", entry_point_kind ("register_run" or "shell_command"), run_name, tasks_py_path, interview_json_path, total_tasks (from interview's envelope), cmd_sha (from interview's envelope). Optional: wrapper_path (set on the 3b wrapper path; null/omit on the 3a direct-decoration path), files_edited (the list of source files Step 3a edited; empty [] on the 3b path). On a fatal error, write the standard ErrorEnvelope shape.
Invoke as your FINAL tool call:
```
hpc-agent emit-skill-return --skill hpc-wrap-entry-point --experiment-dir <experiment_dir>
```
The verb validates against hpc_agent/schemas/skill_returns/hpc-wrap-entry-point.json and atomically renames .staged.json → .json. Then hand control back to the parent without ending your turn — emit no summary or closing message. The parent's next action is hpc-agent fetch-skill-return --skill hpc-wrap-entry-point.

The submit workflow's Step 0b picks up _materialized.entry_point and threads executor_cmd into the submit-flow spec (wrapper path) or runs its normal @register_run discovery (direct-decoration path) — no further setup needed.

Notes

Two on-ramps, one contract. Greenfield repos scaffold an entry point via build-template --shape {script,notebook} (Step 0); mature repos onboard the existing one (Steps 1+). Both paths end in the same place: a @register_run-decorated function on disk plus a materialized tasks.py + interview.json. The canonical description of the contract is docs/internals/experiment-contract.md.
Direct decoration is the default; the wrapper is a rescue boat. A two-line code edit beats a subprocess shim whenever it's possible. The wrapper is for non-Python entry points, decorator conflicts, and read-only vendor code.
Idempotent. Re-running with the same intent overwrites interview.json (and, on the wrapper path, the wrapper file) byte-equivalently (modulo _materialized.at). Editing the underlying entry point's flags requires re-running this skill.
Signature drift safety (wrapper path). The wrapper's typed signature is what validate-executor-signatures checks at submit time. If the entry point's actual flags drift from the declared signature, the canary catches the argparse / CLI error (one task, not a hundred).
The wrapper IS the contract (wrapper path). The framework reads the wrapper's signature, not the entry point's. Keep the wrapper in sync.
One frozen experiment per YAML. Each configs/exp_NN.yaml is its own experiment with its own cmd_sha. To run a different frozen experiment, re-run this skill against the new YAML.
Ambiguity escalates, never auto-resolves silently. Multiple Python entry points without a caller pick → ambiguous_entry_point. The skill refuses to guess across main.py / train.py / run.py when more than one matched.