name: ajps-data-analysis description: Use when executing and reporting the analysis for an American Journal of Political Science (AJPS) manuscript. AJPS will have a third-party verifier re-run your exact code against the numerical results in the main text before publication, so analyze reproducibly from the first line. Guides analysis and reporting norms; it does not fabricate results.
Data Analysis (ajps-data-analysis)
AJPS reviewers are methodologically sophisticated, and after acceptance an independent third-party
verifier re-runs your code against the numbers in the main text before the article is published (see
ajps-replication-and-verification). Analyze as if both facts are true — because they are. This skill
covers execution and reporting; design choices live in ajps-research-design.
When to trigger
- Running main and supporting analyses; building the results section
- A reviewer asked for robustness, heterogeneity, or alternative specifications
- Reconciling preregistered vs. exploratory analyses
- Making the analysis reproducible so the verifier's re-run will match
Analysis norms AJPS expects
- Report uncertainty and magnitude. Confidence/credible intervals and substantive effect sizes, not just significance stars — say what the estimate means.
- Robustness that probes, not decorates. Show specifications that could break the result (alternative measures, samples, estimators, fixed effects) and say what you learn.
- Heterogeneity with discipline. Pre-specify subgroups where possible; adjust for multiple comparisons; do not mine for a significant interaction and theorize it post hoc.
- Right inference. Cluster at the assignment/sampling level; randomization inference for experiments; wild-cluster bootstrap when clusters are few.
- Preregistration discipline. Separate registered from exploratory analyses; justify any deviation from the plan.
- Measurement. Validate constructs; report reliability; show results are not an artifact of a single coding/scaling choice.
Computational / text-as-data specifics
- Document model/version, hyperparameters, seeds, and validation against human-labeled samples.
- For topic models/embeddings/LLM pipelines: report stability and a validation step; do not treat raw outputs as ground truth.
Reproducibility while you work (so the verifier's re-run matches)
- One master script regenerates every table and figure from the (raw or constructed) data, in order, setting the working directory once.
- Set and report seeds for every stochastic step (bootstrap, randomization inference, simulation, jittered plots) — the verifier needs identical draws.
- Record exact software versions (e.g., "R 4.3.2", "Stata/MP 18.0") and pin packages
(
renv.lock/requirements.txt/ loggedssc/netinstalls). - Keep the manuscript's table/figure numbers matched to script outputs — the verifier checks the numerical results in the main text line by line.
Analysis-decision checklist a quantitative AJPS referee runs
| Question the referee asks | Pass condition | Fix if it fails |
|---|---|---|
| Does the estimator recover the stated estimand? | Estimand named; estimator matches | Name the target quantity before the table |
| Is inference at the right level? | Clustered at assignment/sampling level | Re-cluster; wild-cluster bootstrap if few clusters |
| Will the verifier's re-run match the printed numbers? | Master script regenerates every exhibit | Script everything; set seeds; pin versions |
Worked micro-example (illustrative numbers)
A survey experiment tests whether a co-partisan endorsement raises policy support. The pre-analysis plan names the estimand (ITT on a 0-100 scale), the primary contrast, and one moderator (political knowledge). Result: +7.4 points (95% CI 3.1-11.7), randomization-inference p = 0.004 (illustrative). A knowledge interaction that was not pre-specified as confirmatory goes to an exploratory subsection, flagged, with a multiple-comparison note. Every number is emitted by one seeded master script, so the AJPS verifier's re-run reproduces the main-text figures exactly.
Referee-pushback patterns and the venue-specific fix
- "Identification leans on selection-on-observables." -> Report an unobserved-confounder sensitivity bound (how strong a confounder must be to overturn the estimate); soften causal language if fragile.
- "This interaction looks mined." -> Show it was pre-registered, or relabel it exploratory and adjust for multiple comparisons; never HARK it into a hypothesis.
- "I cannot reproduce Table 2 from your code." -> Fatal at AJPS verification; rebuild the master script so every number regenerates with fixed seeds and pinned versions.
Calibration anchor: AJPS's independent verifier re-runs deposited code against the main-text numbers before publication, so "it works on my machine" is not enough — confirm the live verification wording against the journal's current guidelines.
Execution bridge (StatsPAI / Stata MCP)
Run the battery, don't just enumerate it. Full map:
execution-with-mcp. AJPS prizes credible identification across American / comparative / IR subfields; DiD/IV/RDD for observational claims, randomization inference for experiments.
- Many outcomes / specifications:
romano_wolf(step-down FWER) orbenjamini_hochberg— report the adjusted threshold. - OVB sensitivity:
oster_delta/sensemakr. - Inference:
wild_cluster_bootstrap(few clusters),twoway_cluster/conley; multilevel data → cluster at the right level. - Re-fit off one handle:
audit_result(result_id)lists the missing checks and the exactsuggest_functionfor each. - Exhibits:
etable/did_summary_to_latexfrom the handle — no retyped numbers.
Keep the decisive checks in the body and the exhaustive battery in the supplement. See the executed chain in the JF execution walkthrough.
Anti-patterns
- Stars-only tables with no effect sizes or intervals
- "Robustness" that only reruns near-identical specs to manufacture stability
- p-hacking / fishing for a significant interaction; HARKing exploratory results into hypotheses
- Clustering at the wrong level or ignoring few-cluster problems
- Hand-edited numbers in the manuscript that the deposited code cannot regenerate (verification fails)
Output format
【Main estimate】magnitude + interval + substantive meaning
【Identification check】(per research-design) result
【Robustness】specs that could break it -> what held
【Heterogeneity】pre-specified? MHT-adjusted?
【Registered vs exploratory】clearly separated?
【Reproducible】master script + seeds + pinned versions, numbers match? [Y/N]
【Next】ajps-tables-figures
Supplementary resources
../../resources/external_tools.md— estimation, inference, and text-as-data packages../../resources/official-source-map.md— third-party verification of numerical results