academic-rebuttal - SKILL.md Agent Skill

name: academic-rebuttal description: "Write conference paper rebuttals and author responses that effectively counter incorrect or unreasonable reviewer comments. Targets systems venues (OSDI, NSDI, SIGCOMM, MOBICOM, SOSP, ASPLOS, EuroSys, USENIX Security, CCS, PLDI) on HotCRP and ML venues (NeurIPS, ICML, ICLR, AAAI) on OpenReview. Core focus: identify and resolve reviewer false impressions with evidence-backed, firm-but-professional corrections. Covers rebuttal triage, false impression taxonomy (8 types with resolution playbooks), venue-specific constraints, response structure patterns, and ready-to-use phrase templates. Use when asked to write, draft, review, or improve a conference paper rebuttal, author response, or response to reviewer comments."

Conference Paper Rebuttal Writing

Core Purpose

Write rebuttals that reject wrong reviewer comments with evidence while maintaining professional tone. The primary function is correcting false impressions that would otherwise sink a paper at the AC/PC discussion stage.

Guiding principle: rebuttals win when they remove decision-critical false impressions quickly, politely, and with concrete evidence. Everything else — style, verbosity, rhetoric — is secondary.

Rebuttal Workflow

Stage 1: Triage Reviews (first 3 hours)

Extract every distinct concern from all reviews into a flat list.
Classify each concern:

Label	Meaning	Rebuttal Priority
FI (False Impression)	Reviewer is factually wrong or misunderstood the paper	Highest — correct with evidence
VW (Valid Weakness)	Reviewer identified a real limitation	High — acknowledge, bound impact, commit revision
DQ (Direct Question)	Reviewer asks a specific question	Medium — answer directly
MC (Minor Clarification)	Wording, typo, or presentation concern	Low — one-liner if budget allows

Rank by decision impact: what would flip the AC's assessment from reject to accept?
Check venue constraints before drafting. Load references/venue_policies.md for limits, format, and what is allowed/forbidden.

Stage 2: Draft Response

Select response structure based on venue (see Response Patterns).
For each FI concern, apply the matching False Impression Playbook.
For each VW concern, acknowledge → bound impact → commit concrete revision.
For DQ, answer directly with evidence pointer.
Cross-check: no contradictions across per-reviewer responses.

Stage 3: Compress and Verify

Cut to 80% of limit, then selectively expand highest-impact items.
Front-load: the first 25% must be self-sufficient for a scanning AC.
Policy compliance check:
- Within word/character limit
- No forbidden links/files/attachments
- Anonymity preserved (double-blind venues)
- No new experiments if venue disallows
Coauthor review pass before submission.

Self-Awareness Check

Before classifying a concern as FI (False Impression), verify:

Did you re-read the relevant paper section from the reviewer's perspective?
Could a reasonable reader reach the reviewer's interpretation?
Is the concern actually a VW (Valid Weakness) that you are reluctant to admit?

If in doubt, treat it as VW. Wrongly dismissing a valid concern destroys credibility with the AC.

False Impression Taxonomy

Eight types of false impressions with resolution strategies. Each includes: what happened, why, and how to correct it.

FI-1: Scope Misread

Reviewer believes the paper claims broader scope than it supports. Typical trigger: overgeneralized abstract/introduction.

Resolution:

Narrow claim explicitly in first sentence.
State what IS and IS NOT claimed.
Anchor to exact experiment/theorem boundary.

"Thank you — this is a helpful point. Our claim is limited to [specific regime], not [broader regime]. We will revise Abstract/Intro wording. Evidence for the supported scope is in [Table/Section]."

FI-2: Method Mechanics Misunderstanding

Reviewer believes the method uses/assumes X, but it does not. Typical trigger: notation density, unclear pseudocode.

Resolution:

Start with direct yes/no.
Give method in 3–4 steps.
Explain where confusion arose, commit to textual fix.

"Short answer: No, our method does not use X at test time. Pipeline: (1)… (2)… (3)… (4)… We agree this was insufficiently explicit and will add this summary to Sec. [N]."

FI-3: Baseline Fairness Dispute

Reviewer claims comparison is unfair or baseline is weakly tuned. Typical trigger: missing hyperparameter/tuning details.

Resolution:

State tuning protocol and compute parity.
Report numbers under controlled conditions.
Add/offer missing baseline if already run; if not, explain why and bound likely effect.

"We used identical tuning budget across methods: [details]. For baseline B, we followed [reference/config]. Results: [numbers]. We will add the tuning table."

FI-4: Novelty Conflation

Reviewer conflates contribution with prior work ("already known"). Typical trigger: under-specified related work deltas.

Resolution:

Acknowledge overlap plainly.
Enumerate exact differentiators (D1, D2, D3).
Tie each to concrete section/result.

"We agree our work builds on [prior]. The novelty is: D1: [diff] (Sec X), D2: [diff] (Table Y), D3: [new finding] (Experiment Z). We will revise Related Work."

FI-5: Statistical Evidence Doubt

Reviewer questions significance or reports "could be noise." Typical trigger: point estimates without variance/testing.

Resolution:

Provide seed count, variance, statistical test.
Distinguish significance from practical effect size.

"We report mean±std over [n] seeds with [test]. Improvement: [x]% with [CI/p]. We will add effect-size framing to results discussion."

FI-6: Scalability/Practicality Assumption

Reviewer assumes method is impractical at realistic scale. Typical trigger: no complexity/runtime/memory table.

Resolution:

Provide asymptotic + measured runtime/memory.
Compare against baseline under matched hardware.
Clarify deployment envelope.

"Complexity: O(…). Measured at scale N: [values]. Compared to baseline: [ratio]. Deployment target: [range]. We will add this to Sec [N]."

FI-7: Wrong Evaluation Criteria

Reviewer judges paper by criteria for a different subfield. Typical trigger: cross-disciplinary PC assignment.

Resolution:

Respectfully restate the paper's evaluation criteria and venue norms.
Show how the evaluation matches the stated contributions.
Point to venue-accepted precedent if possible.

"We respectfully note that [venue] systems papers typically evaluate [metrics], as exemplified by [accepted paper]. Our evaluation follows this convention (Sec X). The [different metric] R2 suggests is standard in [other field] but not typically expected here."

FI-8: Scoped-Out Experiment Request

Reviewer demands an experiment the paper intentionally excluded. Typical trigger: different assumptions about evaluation scope.

Resolution:

Acknowledge the experiment would be interesting.
Explain why it was scoped out (compute, data access, orthogonal concern).
Provide proxy analysis if available.
If venue allows and feasible, offer to add it.

"This is a valuable suggestion. We scoped [experiment] out because [reason]. As a proxy, [analysis/ablation] in Sec X suggests [bounded effect]. We will [add in revision / clarify scope statement]."

Universal 5-Step Correction Sequence

Use for any false impression when in doubt about structure:

Acknowledge — "This is an important point."
Direct correction — Yes/no + corrected factual statement.
Evidence — Table, section, figure, or result pointer.
Responsibility — "We see how the current wording caused confusion."
Concrete revision — Exactly what text/section will change.

This sequence defuses tone risk while maximizing decision usefulness for the AC.

Response Patterns

Pattern A: Per-Review (NeurIPS/ICML)

System enforces per-review fields. Inside each:

One-line appreciation
Direct answer to primary concern
Evidence bullets
Short "we will clarify X in revision" line

Pattern B: Single Global (AAAI, tight cap)

One response covering all reviews:

2–3 line global summary
Top 2 concerns subsection
Additional clarifications as one-liners
Close with revision commitments

Pattern C: Grouped by Concern (systems venues, reviewer overlap)

Group by concern, tag reviewers per item:

C1: Novelty clarification (R1, R3)
C2: Evaluation fairness (R2, R3)
C3: Runtime concern (R1)

Saves budget, prevents contradictions across reviewers.

Pattern D: Front-Loaded / Prefix-Optimal

Put highest-impact corrections first. First 25% is self-sufficient. Use when AC expected to scan quickly or at strict limits.

For full templates with examples and budgeting strategy by cap size, load references/rebuttal_templates.md.

Phrase Templates

Polite Factual Correction

"Thank you for raising this. There appears to be a misunderstanding: we do not [incorrect interpretation]. We instead [correct statement], as shown in [Section/Table]. We will revise [location] to prevent this confusion."

Already-in-Paper but Missed

"The result is in [Sec X, lines Y–Z], but we agree it is easy to miss. We will move this to [more prominent location] and restate it clearly."

Providing Requested Experiment (Allowed)

"We ran the requested comparison: [setting]. Result: [number]. Consistent with our claim [claim]. We will include this in the revision."

Requested Experiment Not Feasible

"This is a valuable suggestion. [Venue rule or constraint] prevents adding experiment X now. To address the concern, we provide [proxy analysis] and clarify scope accordingly."

Novelty Dispute

"We agree our work is related to [prior]. Key differences: (1) …, (2) …, (3) … . We will revise Related Work to make these distinctions explicit."

Strong but Professional Disagreement

"We respectfully disagree with the premise that [premise]. Under [assumption], evidence in [table/theorem] indicates [result]. We will clarify the assumption boundary."

Acknowledging Valid Weakness

"This limitation is real. Our method does not address [limitation]. We will state this explicitly and narrow the claim to [supported scope]."

Closing

"We appreciate the reviewers' input and believe these clarifications address the key concerns. We will incorporate the specified revisions in the camera-ready manuscript."

What Authors Can Do

Correct factual errors and misunderstandings with evidence.
Clarify ambiguous definitions, assumptions, and scope.
Point to exact paper locations (section/table/line) and restate.
Provide additional experimental results when venue permits.
Show revised text snippets for camera-ready.
Acknowledge valid limitations and bound their impact.
Consolidate repeated concerns across reviewers.
Prioritize by decision impact, not emotional salience.

What Authors Must Not Do

Exceed strict word/character limits.
Include forbidden links/files when disallowed.
Upload revised paper when venue forbids it during feedback.
Reveal identity in double-blind venues.
Use hostile, sarcastic, or dismissive tone.
Claim reviewer is wrong without concrete evidence.
Introduce major new contributions not in the submission.
Promise changes that fundamentally alter the paper's scope.
Ignore negative reviews or cherry-pick which concerns to address.

References

references/venue_policies.md: Per-venue rebuttal constraints (limits, platform, format, what is allowed/forbidden) for 14 venues across OpenReview and HotCRP
references/platform_mechanics.md: OpenReview vs HotCRP platform mechanics (configuration, formatting, visibility models, author actions)
references/rebuttal_templates.md: 4 full response templates, budgeting strategy by cap size, end-to-end drafting workflow, and common failure modes

Load these references as needed when working on specific aspects of rebuttal writing.