name: ameriflux-surgo-grid-extract description: Extract dominant-component soil profile variables for EcoSIM grid or template inputs from gSSURGO, with FAO HWSD v2.0 fallback when gSSURGO data are unavailable or incomplete. Use when deriving soil depth, bulk density, field capacity, wilting point, hydraulic conductivity, texture, rock fraction, pH, CEC, or soil organic carbon.
AmeriFlux gSSURGO/FAO Grid Extractor
Use When
- You need soil profile values for an AmeriFlux or EcoSIM site from a local gSSURGO geodatabase.
- You need FAO HWSD v2.0 fallback values when gSSURGO is outside coverage, unavailable, or has missing layer values.
- You need EcoSIM template variables such as
CDPTH,BKDSI,FC,WP,SCNV,SCNH,CSAND,CSILT,ROCK,PH,CEC, orCORGC. - You need depth-weighted interpolation from source horizons or HWSD2 layers to EcoSIM soil layers.
Constraints
- NEVER use it extract climate data.
Workflow
- Locate
gSSURGO_CONUS.gdbunderdata/first, then elsewhere in the repo if needed. - Identify the map unit (
MUKEY) by spatial query onMUPOLYGON. - Select the dominant component by
comppct_r. - Retrieve horizons and rock fragments from
chorizonandchfrags. - Convert units and interpolate vertically to the target template layers.
- If gSSURGO fails, generate the profile from FAO HWSD2 under
data/FAO_HWSD2. - If gSSURGO succeeds but some layer values are invalid/fill values, replace only those missing values from FAO HWSD2.
- Write the derived soil profile JSON under
result/<SITE_ID>/.
Overview
This document summarizes which variables in the provided template can be
extracted from the CONUS gSSURGO database (directory gSSURGO_CONUS.gdb) and
from FAO HWSD v2.0, along with methods for extraction and vertical
interpolation.
Extractable Variables from gSSURGO
The following template variables can be derived from standard gSSURGO tables:
Template Variable gSSURGO Source Notes
CDPTH chorizon.hzdepb_r / 100 Depth to bottom of horizon (m)
BKDSI chorizon.dbovendry_r Bulk density (g/cm3)
FC (wthirdbar_r / 100) * Field capacity dbthirdbar_r (volumetric)
WP (wfifteenbar_r / 100) * Wilting point dbfifteenbar_r
SCNV, SCNH ksat_r * 3.6 Convert um/s to mm/h
CSAND sandtotal_r * 10 kg/Mg
CSILT silttotal_r * 10 kg/Mg
ROCK sum(chfrags.fragvol_r) / 100 Volume fraction
PH ph1to1h2o_r Soil pH
CEC cec7_r Cation exchange capacity
CORGC om_r * 0.58 * 10 Soil organic carbon
Fallback Variables from FAO HWSD v2.0
The fallback reads HWSD2_RASTER/HWSD2.bil to identify HWSD2_SMU_ID, then
uses the HWSD2_LAYERS attribute table from HWSD2.mdb or an exported
HWSD2_LAYERS.csv/SQLite table. Use the dominant soil sequence (SEQUENCE=1)
unless a different sequence is explicitly requested.
Template Variable FAO HWSD2 Source Notes
CDPTH template CDPTH Target EcoSIM depths remain template-driven
BKDSI BULK, fallback REF_BULK g/cm3 equals Mg m-3
FC, WP SAND/SILT/CLAY + AWC Texture-class defaults, adjusted by plausible AWC when available
SCNV, SCNH SAND/SILT/CLAY Texture-class Ksat estimate
CSAND SAND * 10 kg/Mg
CSILT SILT * 10 kg/Mg
ROCK COARSE / 100 Volume fraction
PH PH_WATER Soil pH
CEC CEC_SOIL cmolc/kg
CORGC ORG_CARBON g/kg equals kg C/Mg soil
FAO HWSD2 negative miscellaneous/no-data codes (-1 through -9) must be treated
as invalid. The fallback should be recorded in the output JSON under
sources_by_variable and fallbacks.
Non-Extractable Variables
The following variables are not directly available from gSSURGO and require external datasets or modeling assumptions:
- Climate variables (e.g., ATCAG)
- Topography (e.g., ASPX, slope)
- Hydrologic boundary conditions
- Nutrient pools (NH4, NO3, PO4)
- Organic nitrogen/phosphorus pools
- Exchange coefficients (GKC*)
- Water table dynamics
Vertical Interpolation
- Use overlap-weighted averaging for most variables.
- For soil organic matter (
CORGC), apply logarithmic interpolation. - Optionally extend deepest horizon or FAO layer downward with
--extend-last.
Script
A Python script (extract_gssurgo_profile.py) automates spatial lookup, horizon
extraction, variable conversion, vertical interpolation, and FAO HWSD2 fallback.
Example Usage
python .agents/skills/ameriflux-surgo-grid-extract/extract_gssurgo_profile.py \
--gdb /path/to/gSSURGO_CONUS.gdb \
--lon -121.85 \
--lat 39.0 \
--template template.nc \
--out result/<SITE_ID>/profile_<SITE_ID>.json \
--extend-last
The FAO fallback is enabled by default and looks under data/FAO_HWSD2. Use
--fao-hwsd2-dir /path/to/FAO_HWSD2 to override the location, or
--no-fao-fallback for a strict gSSURGO-only run.
Notes
- Bulk density and water retention conversions assume standard pedotransfer interpretations.
- Ensure gSSURGO geodatabase includes required tables:
MUPOLYGON,component,chorizon,chfrags. - FAO HWSD2 provides seven layers (
D1-D7, 0-200 cm). When--extend-lastis used, the deepest FAO layer is extended downward to fill deeper EcoSIM template layers; this should be treated as a gap-fill assumption, not direct observation. - Reading
HWSD2.mdbrequires GDAL/pyogrio with MDB support. If that driver is unavailable, export the Access table asHWSD2_LAYERS.csvin the sameFAO_HWSD2directory.