Skip to content

Geos Core Data Product

The Geos CDP defines the geographic boundaries used by all other CDPs. It provides region definitions, county boundaries, and spatial reference data.

Regions

Replica organizes data into hierarchical geographic regions:

Type Examples Purpose
Nationwide usa Full US coverage
Megaregion california_nevada, mid_atlantic, great_lakes Regional builds for Places pipeline
Test region mini_kc, mini_nor_cal Fast iteration and CI testing

Key Tables

model-159019.geos.region

The primary region boundary table. Each row is a region with: - id — unique region identifier (format: replica:region:{name}:{hash}:region) - geom — boundary geometry (POLYGON/MULTIPOLYGON)

Used extensively for spatial filtering across all CDPs:

SELECT data.*
FROM `model-159019.{dataset}.{table}` AS data
JOIN `model-159019.geos.region` AS region
  ON region.id = '{region_id}'
  AND ST_COVERS(region.geom, data.geom)

Publishing

Published to GCS: gs://core_data_products/geos/{version}/{region}.yaml

Built and published using arti commands. All code must be merged to master and tests passed before publishing.