harness-evals-export - SKILL.md Agent Skill

name: harness-evals-export description: Run AutoForge harness datasets and export OpenAI-friendly eval bundles from datasets or completed harness runs.

Use this skill when the user wants benchmarking, dataset runs, or eval handoff artifacts.

AutoForge supports:

Preferred commands:

Run a dataset:
- autoforgeai harness run <dataset.jsonl>
Prewarm referenced images:
- autoforgeai harness prewarm <dataset.jsonl>
Export a dataset or run to an eval bundle:
- autoforgeai harness openai-export <dataset-or-run-path>

Expected exported artifacts:

When a harness run has already completed, expect AutoForge to emit:

Always surface: