agent-loop-reliability - SKILL.md Agent Skill

name: agent-loop-reliability description: Design and improve agents using a repeatable loop (context → action → verification) with practical verification patterns.

Start every run by writing down the current objective and success criteria.
Gather context with explicit actions (file search, logs, docs) before making changes.
Take action in small, reversible steps; prefer tool calls or scripts over manual reasoning for deterministic work.
Verify each step using one or more of: deterministic rules, visual validation, or an LLM-as-judge rubric.
If the agent is stuck, change the environment (better tools, better search, stricter rules) rather than repeating the same attempt.
After completion, capture failures as test cases and add them to an eval set.

Use references/verification-checklist.md as a standard review rubric for UI or content output.

See:

See: