name: review-pr description: Perform a comprehensive code review of a pull request argument-hint: "[PR_number|URL]" effort: high disable-model-invocation: true
Review Pull Request
Perform a comprehensive code review of a pull request.
Instructions
- Get PR information:
gh pr view $ARGUMENTS --json title,body,files,additions,deletions - Review each changed file
- Provide structured feedback
Review Checklist
Code Quality
- Code is readable and well-organized
- Functions are appropriately sized
- No code duplication
- Meaningful variable/function names
Functionality
- Logic is correct
- Edge cases handled
- Error handling is comprehensive
- No obvious bugs
Security
- No hardcoded secrets
- Input validation present
- No injection vulnerabilities
- Authorization checks in place
Testing
- Tests added for new code
- Existing tests still pass
- Edge cases tested
Documentation
- Code is self-documenting or commented
- README updated if needed
- API changes documented
Output Format
## PR Review: #[number] - [title]
### Summary
[1-2 sentence overview]
### Approval Status
[ ] Approved
[ ] Approved with suggestions
[ ] Changes requested
### Findings
#### Critical (Must Fix)
- [ ] [Issue description] - `file:line`
#### Suggestions (Should Consider)
- [ ] [Improvement] - `file:line`
#### Nitpicks (Optional)
- [ ] [Minor suggestion] - `file:line`
### Positive Highlights
- [What's done well]
### Questions
- [Clarifications needed]
Usage
/review-pr 123
/review-pr https://github.com/owner/repo/pull/123
Advanced: Multi-Agent Review
For production-grade reviews requiring specialized perspectives and anti-hallucination safeguards.
Pre-flight Check
Before reviewing, check if this is a follow-up pass to avoid repeating suggestions:
# Detect if Claude already reviewed this PR
git log --oneline -10 | grep "Co-Authored-By: Claude"
If detected, note: "This appears to be a follow-up pass. I'll focus on new issues and avoid repeating previous suggestions."
Scope Drift Detection
Cross-reference the PR diff against the original plan to catch unintended changes.
# Detect current branch
BRANCH=$(git branch --show-current)
# Search for a plan file associated with this branch
ls ~/.claude/plans/ 2>/dev/null | grep -i "$BRANCH" | head -3
# Files actually changed in this PR
git diff --stat origin/main...HEAD | head -30
If a plan file exists for this branch:
- Read the plan file: what was the stated scope?
- Compare stated scope vs actual
git diff --stat - Flag files changed that were NOT mentioned in the plan
Output format:
SCOPE DRIFT CHECK
─────────────────────────────────────────
Plan scope: [what the plan said would change]
Actual diff: [files actually changed]
Drift: [files changed outside plan scope, if any]
Verdict: IN SCOPE / DRIFT DETECTED
If no plan file exists: note "No plan file found for this branch, skipping scope drift check."
Multi-Agent Specialization
Launch 3 parallel specialized agents (see Split Role Sub-Agents):
Agent 1: Consistency Auditor
Focus: DRY violations, duplicate logic, pattern inconsistencies
Check for:
- Duplicated code blocks (>5 lines similar)
- Inconsistent naming conventions
- Pattern violations (if project uses X pattern, enforce it)
Agent 2: SOLID Principles Analyst
Focus: Single Responsibility Principle violations, complexity
Check for:
- Functions >50 lines (likely doing too much)
- Nested conditionals >3 levels deep
- Cyclomatic complexity >10
- Mixed concerns in single component
Agent 3: Defensive Code Auditor
Focus: Silent failures, masked bugs, hidden fallbacks, LLM output trust boundary
Check for:
- Empty catch blocks: try { } catch (e) { } // swallows error
- Silent fallbacks: return data || DEFAULT // hides missing data
- Unchecked null/undefined: user.name without validation
- Ignored promise rejections: async fn without .catch()
LLM Output Trust Boundary (especially relevant in AI-assisted codebases):
- LLM-generated values (emails, URLs, names, IDs) written to DB or passed to
downstream functions without format validation; add lightweight guards
(email regex, URL parsing, .trim()) before persisting
- Structured tool output (arrays, objects from AI tools) accepted without
type/shape checks before database writes or rendering
- AI-generated SQL or code strings executed without sanitization
Anti-Hallucination Rules
Verify before asserting:
- Use
GreporGlobto verify patterns before recommending them - If suggesting "use existing UserService pattern", confirm UserService exists first
- Never claim "project uses X" without checking actual codebase
Occurrence rule:
- Pattern with >10 occurrences = established (Suggestion level)
- Pattern with <3 occurrences = not established (Can Skip or ask maintainer)
- Read full file context, not just diff lines
Uncertainty markers:
- Use "❓ To verify:" when unsure about project conventions
- Use "💡 Consider:" for optional improvements
- Use "🔴 Must fix:" only for critical bugs/security
Reconciliation
After agents report findings:
- Deduplicate: Remove overlapping suggestions across agents
- Prioritize existing patterns: If codebase uses pattern X, recommend X (not ideal pattern Y)
- Mark skipped suggestions: "Skipping [suggestion] because project uses [alternative pattern]"
- Track reasoning: Document why suggestion was kept or skipped
Severity Classification
🔴 Must Fix (Blockers)
- Security vulnerabilities
- Data loss risks
- Breaking changes without migration
- Silent failures masking bugs
🟡 Should Fix (Improvements)
- SOLID violations causing maintenance issues
- DRY violations (>3 duplicates)
- Performance bottlenecks (N+1 queries)
- Missing error handling for critical paths
🟢 Can Skip (Nice-to-haves)
- Style inconsistencies (if no linter)
- Minor naming improvements
- Overly nested code (if <3 levels)
- Documentation gaps (if code self-documenting)
Fix-First Heuristic
Determine whether to auto-fix each finding or surface it for user decision.
AUTO-FIX (apply without asking): ASK (needs human judgment):
├─ Dead code / unused variables ├─ Security changes (auth, XSS, injection)
├─ N+1 queries (missing eager loading) ├─ Race conditions
├─ Stale comments contradicting code ├─ Design decisions
├─ Magic numbers → named constants ├─ Large fixes (>20 lines changed)
├─ Missing import / path mismatches ├─ Enum completeness
├─ Variables assigned but never read ├─ Anything removing functionality
└─ Obvious version/doc mismatches └─ User-visible behavior changes
Rule: If a senior engineer would apply the fix in 30 seconds without discussion, it's AUTO-FIX. If reasonable engineers could disagree, it's ASK.
After agents report findings:
- Apply all AUTO-FIX items immediately with minimal targeted edits
- Batch all ASK items into a single user decision (not one question per item)
Auto-Fix Loop (Optional)
For automated convergence:
Review → Identify issues → Fix → Re-review → Repeat until minimal changes
Safeguards:
- Max 3 iterations to prevent infinite loops
- Run tsc/lint check before each iteration
- Skip auto-fix for protected files (package.json, migrations, etc.)
Example prompt:
Review this PR with auto-fix enabled:
1. Review using 3 agents above
2. Fix all 🔴 Must Fix issues
3. Re-review to verify fixes
4. Repeat for 🟡 Should Fix (max 2 more iterations)
5. Stop when only 🟢 Can Skip remain
Conditional Context Loading
Load additional context based on diff content (stack-agnostic):
| If diff contains... | Then check... |
|---|---|
| Database queries | Indexes, N+1 patterns, query optimization |
| API endpoints | Auth middleware, input validation, rate limiting |
| Authentication logic | Password hashing, session management, CSRF tokens |
| File uploads | Size limits, MIME validation, storage security |
| Date/time operations | Timezone handling, DST edge cases |
| External API calls | Timeout configs, retry logic, error handling |
| Environment variables | Presence in .env.example, validation at startup |
Integration with Existing Tools
SE-CoVe Plugin: Use for general fact-checking of review claims (complementary to anti-hallucination rules above)
Worktrunk: For codebase-wide pattern analysis before suggesting changes
AST-grep: For structural pattern matching (e.g., find all similar try/catch blocks)
Sources
- Base template: Claude Code Ultimate Guide
- Multi-agent review: Pat Cullen (Jan 2026)
- Anti-hallucination patterns: Méthode Aristote code review system
$ARGUMENTS