Geos Core Data Product¶
The Geos CDP defines the geographic boundaries used by all other CDPs. It provides region definitions, county boundaries, and spatial reference data.
Regions¶
Replica organizes data into hierarchical geographic regions:
| Type | Examples | Purpose |
|---|---|---|
| Nationwide | usa |
Full US coverage |
| Megaregion | california_nevada, mid_atlantic, great_lakes |
Regional builds for Places pipeline |
| Test region | mini_kc, mini_nor_cal |
Fast iteration and CI testing |
Key Tables¶
model-159019.geos.region¶
The primary region boundary table. Each row is a region with:
- id — unique region identifier (format: replica:region:{name}:{hash}:region)
- geom — boundary geometry (POLYGON/MULTIPOLYGON)
Used extensively for spatial filtering across all CDPs:
SELECT data.*
FROM `model-159019.{dataset}.{table}` AS data
JOIN `model-159019.geos.region` AS region
ON region.id = '{region_id}'
AND ST_COVERS(region.geom, data.geom)
Publishing¶
Published to GCS: gs://core_data_products/geos/{version}/{region}.yaml
Built and published using arti commands. All code must be merged to master and tests passed before publishing.