core_epa__assn_eia_epacamd
Association table providing connections between EPA units and EIA plants, boilers, and generators.
- Most-recent data: 2024
- Processing: Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source: EPA -- Mix of multiple EPA sources
- Primary key: This table has no primary key.
Additional Details
This crosswalk table comes from the PUDL fork of the EPA camd-eia-crosswalk GitHub repo: https://github.com/catalyst-cooperative/camd-eia-crosswalk-latest.
The camd-eia-crosswalk README and our data source documentation page at ../data_sources/epacems describe the complicated relationship between EIA and EPA data, specifically the differences between EPA and EIA "units" and the level of granularity at which the two sources can be connected.
The original EPA crosswalk runs on 2018 EIA data. We adapted the crosswalk code to run on each new year of EIA data, capturing changes in plant information over time.
Our version of the crosswalk clarifies some of the column names and removes unmatched rows. The docstring of the pudl.dagster.assets.core.glue.core_epa__assn_eia_epacamd function explains what changes are made relative to the EPA's version.
Columns
Four-digit year in which the data was reported.
The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.
Emissions (smokestack) unit monitored by EPA CEMS.
Generator ID used by the EPA.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
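
The advice to treat generator IDs as strings can be illustrated with a minimal sketch. It uses only the Python standard library and a few made-up rows. The column names (report_year, plant_id_epa, emissions_unit_id_epa, generator_id_epa, plant_id_eia, boiler_id, generator_id) are assumptions inferred from the descriptions above, and the helper functions load_crosswalk and eia_generators_for_unit are hypothetical and not part of PUDL.

```python
import csv
import io

# Made-up sample rows for illustration only; the column names are assumed.
SAMPLE_CSV = """report_year,plant_id_epa,emissions_unit_id_epa,generator_id_epa,plant_id_eia,boiler_id,generator_id
2024,3,1,1,3,1,1
2024,3,1,01,3,1,01
2024,3,2,GT2A,3,2,GT2A
"""


def load_crosswalk(text):
    """Read crosswalk rows, converting only the year and plant IDs to integers."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        row["report_year"] = int(row["report_year"])
        row["plant_id_epa"] = int(row["plant_id_epa"])
        row["plant_id_eia"] = int(row["plant_id_eia"])
        # Unit, boiler, and generator IDs stay as strings, so "01" and "1"
        # remain distinct and IDs such as "GT2A" are not rejected.
        rows.append(row)
    return rows


def eia_generators_for_unit(rows, plant_id_epa, emissions_unit_id_epa):
    """Return the sorted, distinct (plant_id_eia, generator_id) pairs for one EPA unit."""
    return sorted(
        {
            (r["plant_id_eia"], r["generator_id"])
            for r in rows
            if r["plant_id_epa"] == plant_id_epa
            and r["emissions_unit_id_epa"] == emissions_unit_id_epa
        }
    )


crosswalk = load_crosswalk(SAMPLE_CSV)
matches = eia_generators_for_unit(crosswalk, 3, "1")
```

In this sample, EPA unit "1" at plant 3 maps to two different generators, "01" and "1". Casting the ID to an integer would merge them, and would fail outright on an ID like "GT2A". Because the table has no primary key, a single EPA unit can map to several EIA generators, so a lookup like this returns a list of matches, not a single row.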