out_eia__yearly_plant_parts
Annual time series of all EIA plant parts, for use in matching to FERC 1.
- Most-recent data:
2026
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
record_id_eia
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.
Contains information from multiple raw inputs.
Additional Details
Practically speaking, a plant is a collection of generator(s). There are many attributes of generators (i.e. prime mover, primary fuel source, technology type). We can use these generator attributes to group generator records into larger aggregate records which we call "plant parts". A plant part is a record which corresponds to a particular collection of generators that all share an identical attribute and utility owner. E.g. all of the generators with unit_id=2, or all of the generators with coal as their primary fuel source.
The EIA data about power plants (from EIA 923 and 860) is reported in tables with records that correspond to mostly generators and plants. Other datasets (cough cough FERC1) are less well organized and include plants, generators and other plant parts all in the same table without any clear labels. This plant part table is an attempt to create records corresponding to many different plant parts in order to connect specific slices of EIA plants to other datasets.
Because generators are often owned by multiple utilities, another dimension of this plant part table involves generating two records for each owner: one for the portion of the plant part they own and one for the plant part as a whole. The portion records are labeled in the ownership_record_type column as owned and the total records are labeled as total.
This table includes A LOT of duplicative information about EIA plants. It is primarily meant for use as an input into the record linkage between FERC1 plants and EIA.
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.
Columns
Identifier for EIA plant parts analysis records.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
The part of the plant a record corresponds to.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Code for the type of prime mover (e.g. CT, CG)
The code representing the most predominant type of energy that fuels the generator.
High level description of the technology used by the generator to produce electricity.
Name of FERC account, derived from technology description and prime mover code.
The EIA Utility Identification number.
Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.
Plant part of the associated true granularity record.
EIA record ID of the associated true granularity record.
ID dynamically assigned by PUDL to EIA records with multiple matches to a single FERC ID in the FERC-EIA manual matching process. The ID is manually assigned and has not been updated since 2020, but only affects a couple hundred records total across all years.
Total end of year installed (nameplate) capacity for a plant part, in megawatts.
Fraction of potential generation that was actually reported for a plant part. Energy generated over time period / nameplate capacity * time period (hours/years/etc.).
Total installed (nameplate) capacity, in megawatts.
Year the plant's oldest still operational unit was built.
Proportion of generator ownership attributable to this utility.
Average fuel cost per MMBTU of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Simplified fuel type code used in PUDL
Date of the scheduled or effected retirement of the generator.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Year a generator went into service.
The operating status of the asset. For generators this is based on which tab the generator was listed in in EIA 860.
The operating status of the asset using PUDL categories.
Whether each generator record is for one owner or represents a total of all ownerships.
Whether a plant part record has a duplicate record with different ownership status.
Planned effective date of the scheduled retirement of the generator.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.
Contains EIA plant ID, plant part, ownership, and EIA utility id
Number of distinct generator IDs that participated in the aggregation for a plant part list record.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Four-digit year in which the data was reported.
PUDL plant ID and report year of the record.
out_eia__yearly_assn_plant_parts_plant_gen
Annual Association table providing connections between plant parts (identified as record_id_eia) and 'plant_gen' records (identified as record_id_eia_plant_gen) from the out_eia__yearly_plant_parts table.
- Most-recent data:
2026
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
record_id_eia, record_id_eia_plant_gen
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Contains information from multiple raw inputs.
Additional Details
The purpose of this table is to help determine what generator records are associated with every plant part record. The plant part records have a one to many relationship to the associated 'plant_gen' records. The 'plant_gen' records are also included in the plant part records, so for these records there is a one to one relationship and the record_id_eia and the record_id_eia_plant_gen are the same.
All of the columns in this table that have a suffix of '_plant_gen' are attributes of the record_id_eia_plant_gen record.
Columns
Identifier for EIA plant parts analysis records.
Identifier for EIA plant parts analysis records which is at the plant_part level of plant_gen - meaning each record pertains to one generator.
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
The EIA Utility Identification number.
Whether each generator record is for one owner or represents a total of all ownerships.
Generator ID of the record_id_eia_plant_gen record. This is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Code representing the most predominant type of energy that fuels the record_id_eia_plant_gen's generator.
Code for the type of prime mover (e.g. CT, CG) associated with the record_id_eia_plant_gen.
Dynamically assigned PUDL unit id of the record_id_eia_plant_gen. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
High level description of the technology used by the record_id_eia_plant_gen's generator to produce electricity.
Name of FERC account, derived from technology description and prime mover code. This name is associated with the record_id_eia_plant_gen record.
ID dynamically assigned by PUDL to EIA records with multiple matches to a single FERC ID in the FERC-EIA manual matching process. This ID is associated with the record_id_eia_plant_gen record. It depends on ferc1_generator_agg_id, which has not been updated since 2020, but only affects a couple hundred records total across all years.
The year an associated plant_gen's generator went into service.
The operating status of the asset using PUDL categories of the record_id_eia_plant_gen record .
The number of generators associated with each record_id_eia.
out_pudl__yearly_assn_eia_ferc1_plant_parts
Annual Association table providing connections between power plant data reported in FERC Form 1 and any available EIA data related to the plant parts covered by each FERC Form 1 record.
- Most-recent data:
2025
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
record_id_ferc1
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table answers the question "What EIA data reported about plants or generators should be associated with a given plant record found in the FERC Form 1?"
Each record in this table corresponds to a single FERC Form 1 record reported in one of several tables describing power plants (large steam, hydro, small, etc.). These FERC records can correspond to an entire plant, individual generators within a plant, all generators in a plant with the same prime mover type, or just the respondent's ownership share of any of those categories (or other categories). Furthermore, the same utility may report the same plant in different ways in different years.
The EIA data associated with each FERC plant record comes from our Plant Parts EIA table, out_eia__yearly_plant_parts. The EIA data in each record represents an aggregation of several slices of an EIA plant, across both physical characteristics and utility ownership.
This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Identifier for EIA plant parts analysis records.
Indicates the source and validation of the match between EIA and FERC. Match types include matches was generated from the model, verified by the training data, overridden by the training data, etc.
Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.
The part of the plant a record corresponds to.
Four-digit year in which the data was reported.
Date reported.
Whether each generator record is for one owner or represents a total of all ownerships.
Plant name.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Code for the type of prime mover (e.g. CT, CG)
The code representing the most predominant type of energy that fuels the generator.
High level description of the technology used by the generator to produce electricity.
Name of FERC account, derived from technology description and prime mover code.
Year a generator went into service.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.
Plant part of the associated true granularity record.
EIA record ID of the associated true granularity record.
Number of distinct generator IDs that participated in the aggregation for a plant part list record.
Proportion of generator ownership attributable to this utility.
Whether a plant part record has a duplicate record with different ownership status.
The operating status of the asset. For generators this is based on which tab the generator was listed in in EIA 860.
The operating status of the asset using PUDL categories.
A manually assigned PUDL plant ID. May not be constant over time.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Average fuel cost per MMBTU of heat content in nominal USD.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total installed (nameplate) capacity, in megawatts.
Fraction of potential generation that was actually reported for a plant part. Energy generated over time period / nameplate capacity * time period (hours/years/etc.).
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Simplified fuel type code used in PUDL
Year the plant's most recently built unit was installed.
Contains EIA plant ID, plant part, ownership, and EIA utility id
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Asset retirement cost (USD).
The average number of employees assigned to each plant.
Fraction of potential generation that was actually reported for a plant part. Energy generated over time period / nameplate capacity * time period (hours/years/etc.).
Total installed (nameplate) capacity, in megawatts.
Annual capital addition into capex_total.
Year-to-date capital addition into capex_total.
Annual capital addition into capex_total per kw.
Annual capital addition into capex_total per MW.
Year-to-date capital addition into capex_total per MW.
Annual capital addition into capex_total per MWh.
Year-to-date capital addition into capex_total per MWh.
Cost of plant: equipment (USD).
Cost of plant: land and land rights (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Cost of plant: structures and improvements (USD).
Total cost of plant (USD).
Total cost of plant (USD) without retirements.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Plant capacity in MW when not limited by condenser water.
Allowances.
Maintenance of boiler (or reactor) plant.
Cost of coolants and water (nuclear plants only)
Production expenses: electric expenses (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: fuel (USD).
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Miscellaneous steam (or nuclear) expenses.
Maintenance of miscellaneous steam (or nuclear) plant.
Investments in non-fuel production expenses per Mwh.
Production expenses: operations, supervision, and engineering (USD).
Total production expenses (USD per MWh generated).
Production expenses: maintenance of electric plant (USD).
Total operating expenses.
Production expenses: rents (USD).
Steam expenses.
Steam from other sources.
Production expenses: maintenance of structures (USD).
Total production expenses, excluding fuel (USD).
Steam transferred (Credit).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Net plant capability in megawatts.
Hours the plant was connected to load while generating in the report year.
Type of plant.
Plant capacity in MW when limited by condenser water.
Average fuel cost per MMBTU of heat content in nominal USD.
Type of fuel.
FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.
Production expenses: Maintenance (USD).
Total production expenses, excluding fuel (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: roads, railroads, and bridges (USD).
Net plant capability under the least favorable operating conditions, in megawatts.
Net plant capability under the most favorable operating conditions, in megawatts.
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: hydraulic expenses (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Production expenses: water for power (USD).
The FERC license ID of a project.
Cost of plant: accessory electric equipment (USD).
Cost of plant: miscellaneous power plant equipment (USD).
Cost of plant: water wheels, turbines, and generators (USD).
Energy used for pumping, in megawatt-hours.
Net output for load (net generation - energy used for pumping) in megawatt-hours.
Total production expenses before pumping (USD).
Production expenses: pumped storage (USD).
Production expenses: We are here to PUMP YOU UP! (USD).
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Simplified fuel type code used in PUDL
Fuel content per unit of electricity generated. Calculated from FERC reported fuel consumption and net generation.