name: "Research Cycle" description: "Run end-to-end research workflows across hypothesis framing, experiment design/execution, decision synthesis, and diagnostics. Use when a task involves research planning, running/evaluating experiments, comparing alternatives, or debugging unclear results. Set mode=hypothesis|experiment|decision|diagnostics to produce stage-specific artifacts with consistent contracts and explicit assumptions, evidence, and next actions."
Codex GRD Skill: Research Cycle
Only run this loop when missing information would materially change:
- recommendation quality,
- artifact shape or path, or
- execution safety.
Protocol:
- Ask at most 1 high-leverage question per response.
- Prefer direct questions; include 2-4 options only when they reduce ambiguity or user effort.
- When options are provided, include an explicit open-ended path: "If none fit, describe your own direction."
- Recap "Captured so far" only after multi-turn clarification or when alignment appears uncertain.
- Stop questioning immediately once next actions are clear.
- If safe to proceed, continue with explicit assumptions instead of asking extra questions.
Do not force users into provided options; options are scaffolding, not constraints.
- Provide exact file paths, commands, and expected outputs.
- Use numbered steps and execute smallest-valid slice first.
- State assumptions and unknowns explicitly; do not silently guess.
- Define done criteria and verification commands before execution.
- If blocked, report the blocker and the next minimal unblocked action.
If it sounds like corporate PM theater, delete it.
- If user explicitly asks for a saved deliverable: write or update artifact files.
- Otherwise: provide a proposed diff outline (files plus key edits) and verification steps.
- Assumptions (bullet list; call out unknowns)
- Plan (numbered; smallest-first)
- Proposed changes / artifacts
- If user did NOT ask to write files: provide a proposed diff outline plus filenames
- If user DID ask to write files: write or update artifact files named in
- Verification steps (how to check it worked)
- Risks and failure modes (brief; include data leakage and confounds when relevant)
If the profile adds extra numbered items, keep their order after item 5.
If the skill defines additional required sections (for example, evidence taxonomy or artifact tables), include them after the last numbered item in this profile.
For Markdown output containing math, use $...$ inline and $$...$$ for display math.
Keep language execution-centric: concrete file paths, exact commands, and explicit done criteria.
6) Execution record
- exact files/paths touched
- command-level verification performed (or planned)
- rollback note for the smallest reversible unit
If the skill defines additional required sections, include them after item 6.
Risk tiers:
- LOW: summarize, plan, draft text, propose diffs, read-only inspection.
- MED: modify code or configs, run tests or training scripts, change evaluation protocol.
- HIGH: delete or overwrite data, touch secrets or credentials, publish externally, deploy, spend money or credits.
Execution confirmation rule:
- Ask for explicit approval only when executing MED/HIGH actions; planning and proposal alone do not require an execution pause.
Contract:
- List Proposed Actions (files, commands, external calls).
- Label each action LOW, MED, or HIGH plus rollback plan.
- Require explicit user approval before executing MED/HIGH actions. Execution emphasis:
- Before running mutating commands, restate the immediate goal and the minimal rollback path.
- Pre-mutation checklist:
- what will change
- where it will change
- how success will be verified
- how to revert minimally