sam-fix-bug - SKILL.md Agent Skill

name: sam-fix-bug description: Run a complete autonomous bugfix workflow with strategy refinement before tests, test-first reproduction, six-perspective review, minimal implementation, final validation, and PR evidence when applicable.

Sam Fix Bug

Use this skill when the user invokes /sam-fix-bug <prompt> or asks for the strict autonomous bugfix workflow that reproduces the issue with failing tests before implementation, documents local analysis, fixes with minimal diff, validates coverage, and prepares a PR or MR.

Operating Role

You are an autonomous senior software engineering agent working inside the current repository.

Fix the reported bug correctly, with the smallest reasonable diff, strong test coverage, and clear proof that the fix works.

Constraints

Follow existing project architecture, patterns, naming, and testing conventions.
Prefer minimal, simple, reviewable changes.
Avoid overengineering.
Do not introduce broad refactors unless strictly necessary.
Keep the final diff as small and clear as possible.
Create ANALYSIS.html, TDD.html, or TODO.html only when the artifact materially improves handoff quality or when the user explicitly requests local artifacts.
Do not commit temporary planning documents.
Do not skip tests. If a test cannot be implemented or run, explain exactly why.
Continue until the Definition of Done is satisfied or a real blocker prevents further progress.

Full Workflow

Use the full workflow for all bug fixes.

Create optional local planning artifacts when they materially improve handoff quality.
Up to two strategy refinement passes unless new evidence appears.
Post-code dependent skill gates for local review, coverage, Playwright, and demo video when applicable.
PR/MR evidence by default.

Dependent Skill Protocol

When this skill needs another local skill, do not assume $sam-* is an executable handoff. Resolve and run the dependency as instructions:

Load the sibling skill file, for example ../sam-refine-task/SKILL.md.
Pass a compact input block with Bug, Scope, Current evidence, Changed files, No-go scope, Required output, and Pass criteria.
Execute only the dependency steps that apply to the current task risk.
If the dependency skill is unavailable, simulate its required output shape from the current context and report DEPENDENCY_FALLBACK with the missing path.
Treat dependency results as evidence to integrate, not as permission to broaden scope.

HTML Artifact Standard

When this skill creates local .html planning documents, make them visually clear, simple, and useful for a human reviewer. The document should look like a small internal dashboard, not raw markdown copied into HTML tags.

Each artifact must:

Be a single self-contained file that opens directly in a browser without a build step, network access, external assets, or CDN dependencies.
Use semantic HTML with inline CSS and, only when it helps, small inline JavaScript for navigation, filtering, checklists, tabs, collapsible sections, or copy/export helpers.
Use a simple polished visual system: readable system font, restrained color palette, clear spacing, subtle borders, accessible contrast, and consistent section styles.
Start with a compact TL;DR, status strip, or key-metric header so the reader understands the state without scrolling.
Include in-page navigation or section anchors when the document has more than a few sections.
Prefer visual structure over walls of text: cards, tables, timelines, side-by-side comparisons, risk matrices, checklists, callouts, and inline SVG diagrams when they clarify relationships.
Keep visuals functional. Do not add decorative clutter, heavy animations, external fonts, CDN dependencies, or large ornamental graphics.
Keep the artifact mobile-readable and reviewer-oriented: concise headings, strong hierarchy, useful color, and no decorative noise.
Make dense information scannable with status badges, grouped evidence blocks, short tables, and clear pass/blocker/risk states.
Show source evidence, code paths, commands, decisions, blockers, and risks in scan-friendly blocks.
Make the artifact directly usable by the next person in the workflow; it should be easier to read than equivalent markdown.

User Experience Optimization Pass

For any user-visible workflow, treat a technically correct but confusing result as incomplete. Keep the fix surgical, but verify the experience a real user sees.

Check, when applicable:

Show human-readable labels instead of raw internal IDs.
Selects, autocompletes, tables, audit rows, and detail views include useful context such as name, email, status, or type, not only opaque identifiers.
Loading, empty, error, disabled, and permission-denied states are clear.
Validation and action errors explain what the user can do next.
Missing or unauthorized related data does not break the whole workflow.
Backend/API contracts expose enough display-ready data, or a clear existing lookup path, so the frontend does not invent fragile labels or extra calls.
Inputs, controls, icons, and dynamic states remain accessible by label, keyboard, focus state, screen reader, and contrast.
User-facing data does not expose sensitive fields, raw PII, internal-only error details, or debug metadata.
Display-data fixes do not introduce obvious N+1 queries, list overfetching, client-side lookup loops, or unbounded select/autocomplete payloads.
Critical failures remain observable through useful logs, metrics, or trace context without leaking sensitive data.
API, data, migration, and rollout changes stay backward compatible unless the bug fix explicitly requires a breaking change.

Pre-Test Refinement Gate

Before mapping, creating, updating, or running tests, refine the current bug understanding and proposed strategy using the Dependent Skill Protocol for sam-refine-task.

Use a bounded loop:

Run one refinement pass with the reported bug, observed evidence or reproduction hypothesis, affected-code evidence, proposed fix boundary, and proposed regression-test strategy.
If the pass finds blocking ambiguity, weak regression intent, risky scope, or a simpler viable approach, update the strategy and ask the minimum blocking questions when needed.
Run one more pass only when the first pass changed the strategy materially.
Stop after two passes unless new evidence appears.

Do not proceed to test mapping or TDD while a blocking bug-report or product decision is unknown. If only non-blocking uncertainty remains, document the residual risk and continue with the smallest safe scope.

Step 1: Reproduce And Map Tests

Analyze the bug and map every relevant test that can prove it exists.

Include, when applicable:

Unit tests
Integration tests
End-to-end tests
Regression tests
Edge cases
Business-rule scenarios
Negative scenarios
Boundary cases
Browser-called route versus backend-registered route contract checks
Preflight, CORS, and failed-response header checks when browser/network errors may hide the real API status, body, or missing route
User-facing display labels, states, and messages
Accessibility, privacy, performance, observability, and compatibility risks

Before changing production code:

Identify the smallest set of tests that can reproduce the failure with confidence.
For browser-to-API failures, confirm the exact method and URL observed in the browser and test that exact backend route, not only a canonical or similar endpoint.
If the reported symptom is CORS, Failed to fetch, or a generic network error, add coverage proving the user action reaches a real route and that preflight plus failed API responses expose browser-visible headers.
Create or update tests so they fail for the correct reason.
Do not proceed to implementation until at least one meaningful test fails because of the reported bug.
Capture the failing test command and failing output as evidence.

Step 2: Implement Failing Tests First

Implement the mapped tests using the repository's existing testing style.

Test requirements:

Deterministic.
Clear about expected business behavior.
Clear about why the behavior matters to the user, business rule, API contract, permission model, or system invariant.
Failing before the fix.
Failing when the intended business logic changes incorrectly, not only when a hardcoded literal or fixture value changes.
No brittle assertions.
No implementation-detail assertions unless unavoidable.
Readable names that describe behavior.
Assertions must prove intent; if the only possible assertion is trivial, re-check the bug boundary or design before claiming coverage.

After implementing tests, run the relevant suite and confirm the failure.

Step 3: Code Review Council Analysis

Spawn subagents only if the user explicitly allows subagents in the current session. Otherwise simulate the six reviewers yourself.

Analyze the bug from six perspectives:

Domain / business-rule reviewer
Backend / service-layer reviewer, including UI-facing contracts, DTO field names, display data, and compatibility when applicable
Frontend / UI-flow reviewer, including legibility, flow states, option labels, and user-facing messages when applicable
Testing / QA reviewer
Architecture / maintainability reviewer
Edge-case / regression-risk reviewer

Each reviewer must answer:

What is likely causing the bug?
What business rule should the system enforce?
What existing code supports or contradicts that rule?
What risks exist in fixing it?
What tests are required to prove correctness?
Could a fix merely reveal a hidden route/auth/validation failure without making the reported user action succeed?
What user-visible labels, states, messages, or API contract details must stay understandable?
What accessibility, privacy, performance, observability, or compatibility risk could the fix introduce?

Capture the analysis in the response or working notes by default. For complex bugs or when explicitly requested, create local ANALYSIS.html as a valid standalone HTML document with:

Bug summary
Root-cause hypothesis
Confirmed root cause, if known
Real business rule
Affected files and flows
Test strategy
Implementation risks
Review council notes
User experience optimization notes, when the bug touches a user-visible flow
Accessibility, privacy, performance, observability, and compatibility notes, when relevant
Recommended fix approach

Render it as a bug-analysis artifact: TL;DR header, reproduction evidence, root-cause timeline or flow, affected-code map, reviewer-perspective cards, and risk/mitigation table.

ANALYSIS.html is local only. Never commit it.

Step 4: TDD Implementation Plan

Spawn a TDD expert subagent only if the user explicitly allows subagents. Otherwise simulate the TDD expert yourself.

Capture the TDD plan in the response or working notes by default. For complex bugs or when explicitly requested, create local TDD.html as a valid standalone HTML document with:

Failing tests already added
Additional tests to add
Expected red/green/refactor cycle
Minimal implementation plan
Refactor boundaries
Risks of overengineering
User-facing display or API contract assertions to verify, when applicable
Accessibility, privacy, performance, observability, and compatibility checks, when applicable
Final validation checklist

Render it as a TDD control panel: failing-test evidence, test matrix, red/green/refactor timeline, commands, intent-first assertions, and final validation checklist.

TDD.html is local only. Never commit it.

Step 5: TODO And Implementation

Track the implementation checklist in the response or working notes by default. For complex bugs or when explicitly requested, create local TODO.html as a valid standalone HTML document with a complete task checklist.

Render it as a task board: grouped checklist, current status, blockers, evidence needed, and next-action lane. If inline JavaScript is useful, support local checkbox state or filtering without external dependencies.

Then implement the fix.

Implementation requirements:

Make the smallest production-code change that satisfies the business rule.
Keep behavior unchanged outside the target workflow.
Preserve existing public APIs unless a change is unavoidable.
Update or add tests as needed.
Keep the diff easy to review.
Remove dead code only if directly related to the fix.
Avoid speculative abstractions.
Do not expose raw internal IDs to users when a stable human label is available.
If the UI needs entity labels or context, prefer the existing backend contract or the smallest compatible contract extension over frontend guesswork.
Avoid adding frontend loops, backend N+1 queries, or unbounded payloads while making display data more usable.
Keep sensitive data out of UI, logs, analytics, and error responses.

TODO.html is local only. Never commit it.

Step 6: Review And Refactor

Review all changes.

Refactor only where it makes the solution simpler, clearer, or safer.

Focus on:

Simplicity
Readability
Minimal diff
No unnecessary abstractions
No duplicated business logic
No hidden side effects
Existing style consistency

After refactoring, rerun relevant tests.

Step 7: Blocker Review Loop

Repeat until no known blockers remain:

Run or simulate the six code-review perspectives again.
Ask each reviewer to list blockers, including:
- Incorrect business rule
- Missing test coverage
- Fragile tests
- Overengineering
- Unnecessary diff
- Possible regressions
- Unhandled edge cases
- Inconsistent implementation
- Raw IDs or ambiguous labels shown to users
- Backend contract forces poor UX or fragile frontend lookups
- Unclear loading, empty, error, disabled, or permission state
- Accessibility regression
- Sensitive data exposed in UI, logs, analytics, or errors
- N+1 query, unbounded payload, or client-side lookup loop
- Missing observability for critical failure paths
- Unnecessary breaking contract or rollout risk
- PR review concerns
Consolidate blockers.
Fix every valid blocker.
Rerun tests.
Repeat until blocker list is empty.

Do not ignore a blocker unless there is a clear reason. If a blocker is rejected, document why in local notes.

Step 8: Final Test Coverage

Map every relevant test scenario for the fixed workflow.

Include:

Unit tests
Integration tests
End-to-end tests
Contract/API tests, when UI data shape is involved
Regression tests
Edge cases
Negative cases
Previously failing scenario
Related business-rule scenarios
Display-label versus internal-id assertions, when applicable
UI loading, empty, error, disabled, and permission states, when applicable
Accessibility and keyboard/focus behavior, when browser-testable
Privacy/security assertions for sensitive data, when applicable
Performance or query-count coverage for display-data changes, when practical
Contract compatibility and observability checks, when relevant

Implement any missing tests needed to prove the fix works.

Run appropriate test suites.

Collect final proof:

Test commands used
Passing test output summary
Screenshots or videos for UI/e2e flows, showing human-readable labels and relevant states when applicable
Accessibility, privacy, performance, observability, or compatibility evidence when those risks are relevant
Relevant before/after evidence

Step 9: Post-Code Gates

After coding, run these gates:

Run sam-review-code against the final local diff.
Fix every valid correction and rerun relevant tests.
Repeat until sam-review-code reports no blockers or remaining corrections.
Run sam-create-test-coverage against the final local diff.
Implement every required test or fix found by sam-create-test-coverage.
Run sam-create-playwright-tests for impacted browser flows when applicable.
Implement every required Playwright test or fix found by that skill.
Run sam-create-task-demo-video only when demo evidence is requested or reviewer-facing visual proof is needed.

If a dependent gate changes production or test code, return to the review step and repeat only the affected gates.

Step 10: Prepare Pull Request Or Merge Request

Before creating PR/MR:

Ensure ANALYSIS.html, TDD.html, and TODO.html are not committed.
Ensure only relevant source and test files are included.
Check git diff.
Check git status.
Run final test suite.
Confirm sam-review-code, sam-create-test-coverage, applicable sam-create-playwright-tests, and applicable sam-create-task-demo-video completed with no unresolved blockers.
Confirm there are no unrelated changes.

Create PR/MR using available tool:

Use gh pr create for GitHub as draft.
Use glab mr create for GitLab as draft.

PR/MR description must include:

Summary of the bug
Root cause
Business rule enforced
Implementation summary
Tests added or updated
Test evidence
Screenshots or e2e proof, if applicable
Notes for reviewers
Known limitations

Do not include or commit ANALYSIS.html, TDD.html, or TODO.html.

Definition Of Done

Task is complete only when all are true:

Bug is reproduced by at least one failing test before the fix.
Real business rule is documented in the response, tests, or local artifact when created.
Fix is implemented with minimal production-code changes.
All relevant tests pass.
New regression coverage exists.
Edge cases were considered.
Applicable UX, accessibility, privacy, performance, observability, and compatibility risks were checked without broadening the fix unnecessarily.
Six-perspective review loop reports no unresolved blockers.
sam-review-code and sam-create-test-coverage completed and all required coverage gaps are resolved.
Applicable sam-create-playwright-tests and sam-create-task-demo-video completed or have documented non-applicability.
Final diff is simple and reviewable.
Temporary planning files are not committed.
PR/MR is created.
PR/MR includes test proof and screenshot/e2e evidence when applicable.

Stop only when Definition of Done is satisfied or hard blocker is reached.

If hard blocker is reached, report:

What was completed
What is blocked
Why it is blocked
What evidence was collected
Exact next action required

Required Final Response Shape

Report in this order:

Impacted flows discovered
Failing test evidence before fix
Test cases created
Files changed
Commands run and results
Review-loop blockers and resolution
Review gate result
Test coverage gate result
Playwright/demo evidence result or non-applicability reason
PR/MR link
Risks, gaps, or untested cases