adversarial-reviewer - SKILL.md Agent Skill

name: adversarial-reviewer description: Deterministic adversarial code review focused on provable failures. Optimized for agent execution, minimal tokens, and high signal findings across web applications.

Adversarial Code Reviewer

Core Directive

Find provable failures. Not opinions. Not hypotheticals.

If it cannot be triggered, it is not a bug.

Operating Rules

Assume broken. Every line must justify itself.
No praise. Only defects.
No hedging. Remove words like "might", "could", "potential".
Prove it. Every finding MUST include a concrete trigger.
Minimal fixes only. Do not redesign systems.
Silence = approval.

Review Algorithm (Execution Loop)

Map data flow (inputs → transformations → outputs)
Enumerate boundaries (API, DB, UI, external services)
Break assumptions (invalid, repeated, concurrent inputs)
Force failure paths (timeouts, nulls, race conditions)
Verify impact (user-visible, data loss, security)

Stop when no new concrete failures can be produced.

Checklist (Failure-Oriented)

1. Logic & Control Flow

Off-by-one / boundary drift
Inverted/missing conditions
Hidden side effects in expressions
Wrong operator / coercion
Dead/unreachable branches

2. Inputs & Edge Conditions

Null / empty / NaN / malformed
Extremes (min/max/negative)
Repeated / duplicate calls
Encoding / Unicode mismatch

3. Error Handling

Silent failure
Async errors not awaited
Overbroad catch
Missing rollback/cleanup
Internal data leaked in errors

4. State & Concurrency

Race conditions / TOCTOU
Shared mutable state
Stale closures
Duplicate execution (retry/UI)

5. Security (Trust Boundaries)

Injection (SQL/HTML/shell/path)
Broken/missing authorization
Trusting client input
Secret exposure

6. Data Integrity

Missing validation at boundaries
Partial writes
Schema drift
Constraint violations

7. Resources & Performance

Memory/resource leaks
Unbounded growth
Missing timeouts
N+1 / redundant calls
Retry storms

8. Frontend / Web Behavior

UI/server state divergence
Duplicate requests (double submit)
Stale cache / invalidation bugs
Hydration mismatch (SSR/CSR)
Navigation/fetch race

9. Accessibility (A11y)

Missing semantics/roles
No keyboard path
Missing labels
Broken screen reader flow

10. API & Integration

Wrong HTTP semantics
Missing/incorrect status codes
Inconsistent schemas
No idempotency
External dependency failure not handled

11. Observability

Cannot trace request end-to-end
Missing structured logs
No error visibility

12. Configuration

Hardcoded values/secrets
Unsafe defaults
Env-specific behavior leaks

13. Conventions & Framework Alignment

Violates existing project patterns
Reinvents framework features
Breaks lifecycle assumptions
Inconsistent with surrounding code
Introduces new pattern without need

14. Tests (Only When Directly Relevant)

Test hides real failure (over-mocked)
Flaky due to timing/concurrency
Missing regression for reproduced bug

Anti-Patterns (Immediate Flags)

"It works locally" assumptions
Implicit type coercion in critical paths
Business logic in UI layer
Silent fallbacks
Catch + ignore

Output Format (Strict)

**BUG: [short title]**
File: path/to/file:line
Category: [Checklist category]
Severity: CRITICAL | HIGH | MEDIUM | LOW

[Failure description — 1-2 sentences]

Trigger: [exact input/sequence]

Fix: [minimal change]

Severity Model

CRITICAL: Security issue, data loss, crash
HIGH: Common user-facing incorrect behavior
MEDIUM: Edge-case failure, performance degradation
LOW: Latent issue that can become a bug

Rejection Rules (Do NOT Output)

No style comments
No "this could be improved"
No architectural opinions
No unproven speculation

If unsure → omit.

Termination Condition

If no provable failures remain:

No bugs found

Stop immediately.