sitrep-validation

star 1

Update and validate against the INRB-UMIE Ebola DRC 2026 reference dataset. Use when: updating the INRB-UMIE submodule before rendering; the cross-source validation section is missing or stale; git submodule errors; understanding which INRB-UMIE files feed the report; interpreting discrepancies between automated extraction and the INRB-UMIE manually-coded data.

bquilty25 By bquilty25 schedule Updated 5/30/2026

name: sitrep-validation description: "Update and validate against the INRB-UMIE Ebola DRC 2026 reference dataset. Use when: updating the INRB-UMIE submodule before rendering; the cross-source validation section is missing or stale; git submodule errors; understanding which INRB-UMIE files feed the report; interpreting discrepancies between automated extraction and the INRB-UMIE manually-coded data."

INRB-UMIE Validation Data

The Ebola_DRC_2026 git submodule (kraemer-lab/Ebola_DRC_2026) provides a manually-coded reference dataset that the report compares against automated extraction outputs in the "Cross-source validation" section.

When to Use

  • Before rendering — to pull the latest INRB-UMIE data so the validation section reflects current numbers
  • Diagnosing why the cross-source validation plots are stale or empty
  • Understanding a discrepancy flagged in the rendered report

Update Command

cd /Users/billyquilty/Documents/Work/bvd_sitrep_extractor
git submodule update --remote Ebola_DRC_2026

This pulls the latest commit from https://github.com/kraemer-lab/Ebola_DRC_2026 into the pinned Ebola_DRC_2026/ directory.

After updating, confirm new data arrived:

git submodule status Ebola_DRC_2026
# Shows: <new_commit_hash> Ebola_DRC_2026 (build-YYYY-MM-DD-...)
git -C Ebola_DRC_2026 log --oneline -3

Files Used by the Report

All under Ebola_DRC_2026/build/long/:

File Used for
insp_sitrep__cumulative_confirmed_cases.csv Confirmed case counts by zone
insp_sitrep__cumulative_suspected_cases.csv Suspected case counts by zone
insp_sitrep__cumulative_suspected_deaths.csv Suspected deaths by zone
insp_sitrep__cumulative_confirmed_deaths.csv Confirmed deaths by zone
insp_sitrep__hospitalised.csv Hospitalised patients
insp_sitrep__in_bed_previous_day.csv Response indicators
insp_sitrep__new_hosp_admissions.csv New admissions
insp_sitrep__new_hosp_detainees.csv New detainees
insp_sitrep__new_hosp_other.csv Other new admissions
insp_sitrep__cumulative_contacts_traced.csv Contact tracing totals

Zone name differences are harmonised in the report via KRAEMER_NAME_MAP:

  • "Mongbalu""Mongbwalu"
  • "Nyakunde""Nyankunde"

Interpreting the Cross-Source Comparison

The validation section overlays this extraction's values against INRB-UMIE values on the same chart. Expected behaviour:

Observation Meaning
Lines overlap closely Extraction and INRB-UMIE agree — good
INRB-UMIE lags by 1–2 sitreps INRB-UMIE dataset not yet updated; re-run submodule update later
Systematic offset for one zone Possible zone name mismatch or aggregation difference; check KRAEMER_NAME_MAP
Extraction higher than INRB-UMIE Extraction may be capturing aggregate rows that INRB-UMIE excludes; check is_aggregate column in master CSV
Large isolated spike Likely a table parsing error in one sitrep; inspect data/raw/<name>/raw_extraction.json for the outlier date

Committing the Submodule Pin

After updating, the new commit hash should be committed with the rest of the pipeline output:

git add Ebola_DRC_2026
git commit -m "Update INRB-UMIE submodule to YYYY-MM-DD build"
# This can be combined with the report commit:
# git commit -m "Add SitRep NNN, update INRB-UMIE submodule, rebuild report"

Error Table

Error Cause Fix
fatal: no submodule mapping found in .gitmodules Submodule not initialised Run git submodule init && git submodule update
fatal: repository 'https://github.com/kraemer-lab/Ebola_DRC_2026' not found Network or GitHub auth issue Check internet; re-authenticate with gh auth login if using HTTPS
Error in read_csv(...) : cannot open the connection CSV file missing from submodule Submodule may be partially initialised; run git submodule update --init --recursive
Cross-source section renders blank INRB-UMIE files exist but all rows filtered out Check that zone values in INRB-UMIE data match ZONE_LEVELS defined in sitrep_report.qmd
Submodule is already up to date INRB-UMIE has not pushed a new build Check https://github.com/kraemer-lab/Ebola_DRC_2026/commits for the latest build date
Install via CLI
npx skills add https://github.com/bquilty25/bvd_sitrep_extractor --skill sitrep-validation
Repository Details
star Stars 1
call_split Forks 0
navigation Branch main
article Path SKILL.md
More from Creator