out_eia__yearly_generators

package: pudl

Annual time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.

Most-recent data:

2026

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • Due to coverage problems in other tables, some generators have heat rate estimates but not fuel cost estimates.

  • Not all columns are originally reported in or calculable from the input tables. Expect nulls.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

  • Contains information from multiple raw inputs.

Additional Details

This table includes all yearly attributes for all generators reported to EIA-860 and EIA-923.

To provide a complete picture of generator data, this table compiles data from many different EIA tables, including directly reported, imputed and calculated columns. We suggest using this table if you want to explore yearly attributes about generators and would rather use a more complete and denormalized table. If you are more interested in the originally reported values, we recommend searching for core_eia with the column you are most interested in.

The calculations of capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh are based on net generation that is reported by plant, prime mover and energy source and then allocated to individual generators. The allocated values come from out_eia923__yearly_generation_fuel_by_generator.

Yearly heat rates for each generator are estimated using the allocated per-generator net generation and fuel consumption, together with the core_eia923__monthly_boiler_fuel table, which reports fuel consumed by boiler. Heat rates are necessary to estimate the amount of fuel consumed by a generation unit, and thus the fuel cost per MWh generated.

Plant-specific fuel prices are taken from the core_eia923__fuel_receipts_costs table, which has only ~70% coverage, so some generators that have heat rate estimates still lack fuel cost estimates.
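As a rough illustration of how these quantities relate, the sketch below derives a heat rate from fuel consumed and net generation, then multiplies it by a fuel price to get a fuel cost per MWh. The function names and sample numbers are hypothetical, and PUDL's actual calculation may treat edge cases differently.

```python
from typing import Optional


def heat_rate_mmbtu_per_mwh(
    fuel_mmbtu: Optional[float], net_generation_mwh: Optional[float]
) -> Optional[float]:
    """Heat content of fuel consumed divided by net electricity generated."""
    if fuel_mmbtu is None or net_generation_mwh is None or net_generation_mwh <= 0:
        return None  # undefined without positive generation
    return fuel_mmbtu / net_generation_mwh


def fuel_cost_per_mwh(
    heat_rate: Optional[float], fuel_cost_per_mmbtu: Optional[float]
) -> Optional[float]:
    """(MMBtu / MWh) * (USD / MMBtu) = USD / MWh."""
    if heat_rate is None or fuel_cost_per_mmbtu is None:
        return None  # a missing fuel price leaves the cost null
    return heat_rate * fuel_cost_per_mmbtu


# Hypothetical generator: 1,000,000 MMBtu burned to produce 100,000 MWh.
hr = heat_rate_mmbtu_per_mwh(1_000_000.0, 100_000.0)
cost = fuel_cost_per_mwh(hr, 2.5)
# Same generator, but no fuel price was available for its plant.
no_price = fuel_cost_per_mwh(hr, None)
```

The last call mirrors the coverage gap described above: the heat rate exists, but without a fuel price the fuel cost stays null.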

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.
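The 70% rule can be sketched in a few lines. This is a minimal illustration, not PUDL's implementation: the function name is hypothetical, and excluding nulls from the denominator is an assumption made here for simplicity.

```python
from collections import Counter
from typing import Hashable, Iterable, Optional


def canonical_value(
    values: Iterable[Optional[Hashable]], threshold: float = 0.7
) -> Optional[Hashable]:
    """Return the most common value if it meets the threshold, else None."""
    # Assumption: null reports are ignored when computing the share.
    reported = [v for v in values if v is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    return value if count / len(reported) >= threshold else None


# 8 of 10 reports agree, so the majority spelling is kept.
consistent = canonical_value(["Springfield"] * 8 + ["Springfeild"] * 2)
# Only half of the reports agree, so the attribute becomes null.
inconsistent = canonical_value(["A", "A", "B", "B"])
```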

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
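A short sketch of why this matters, using made-up IDs: casting to a number silently merges IDs that differ only by leading zeros, and fails outright on IDs that contain letters.

```python
import csv
import io

# Hypothetical extract; the IDs are invented for illustration.
raw = "plant_id_eia,generator_id\n3,0001\n3,1\n3,GT1\n"

# csv.DictReader returns every field as a string, which is what we want here.
rows = list(csv.DictReader(io.StringIO(raw)))
string_ids = [row["generator_id"] for row in rows]

# Converting the all-digit IDs to integers collapses "0001" and "1" into one
# value, and "GT1" cannot be converted at all.
numeric_ids = [int(g) for g in string_ids if g.isdigit()]
```

Kept as strings, the three generators stay distinct; as integers, two of them collide and the third is dropped.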

report_date

Date reported.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

technology_description

High level description of the technology used by the generator to produce electricity.

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

operational_status

The operating status of the asset. For generators this is based on which tab of EIA 860 the generator was listed in.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

capacity_factor

Fraction of potential generation that was actually reported for a plant part. Calculated as energy generated over the time period / (nameplate capacity * length of the time period in hours).
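A worked example of the ratio, as a minimal sketch. The function names are hypothetical, and using calendar hours for a full reporting year is an assumption; a generator that operated for only part of the year would need a different denominator.

```python
import calendar
from typing import Optional


def hours_in_year(year: int) -> int:
    """Calendar hours in a year: 8784 in leap years, otherwise 8760."""
    return 8784 if calendar.isleap(year) else 8760


def capacity_factor(
    net_generation_mwh: Optional[float], capacity_mw: Optional[float], hours: int
) -> Optional[float]:
    """Generated energy divided by the maximum possible energy (MW * hours)."""
    if net_generation_mwh is None or capacity_mw is None:
        return None
    if capacity_mw <= 0 or hours <= 0:
        return None  # potential generation is undefined
    return net_generation_mwh / (capacity_mw * hours)


# Hypothetical 100 MW generator producing 438,000 MWh in 2023.
cf = capacity_factor(438_000.0, 100.0, hours_in_year(2023))
```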

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or was aggregated from records with multiple sources (mixed).
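One way to picture the mixed label is the sketch below, which assigns a source to an aggregated record from the sources of its inputs. The function name and the exact rule are assumptions for illustration and may not match how PUDL assigns the label.

```python
from typing import Iterable, Optional


def aggregated_source(sources: Iterable[Optional[str]]) -> Optional[str]:
    """Label an aggregated record by the sources of the records it combines."""
    distinct = {s for s in sources if s is not None}
    if not distinct:
        return None  # nothing to report a source for
    if len(distinct) == 1:
        return next(iter(distinct))  # every input shares one source
    return "mixed"  # inputs came from more than one source


single = aggregated_source(["original", "original"])
several = aggregated_source(["original", "eiaapi", "rolling_avg"])
```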

fuel_cost_per_mmbtu

Average fuel cost per MMBTU of heat content in nominal USD.

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of electricity generated, in nominal USD.

unit_heat_rate_mmbtu_per_mwh

Heat content of fuel consumed per unit of electricity generated, in MMBtu per MWh. Derived from the MCOE calculation.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

associated_combined_heat_power

Indicates whether the generator is associated with a combined heat and power system

bga_source

The source from which the unit_id_pudl is compiled. The unit_id_pudl comes directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is used to find additional boiler-generator connections).

bypass_heat_recovery

Can this generator operate while bypassing the heat recovery steam generator?

carbon_capture

Indicates whether the generator uses carbon capture technology.

city

Name of the city.

can_cofire_fuels

Whether the generator can co-fire fuels.

county

County name.

current_planned_generator_operating_date

The most recently updated effective date on which the generator is scheduled to start operation

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

deliver_power_transgrid

Indicates whether the generator can deliver power to the transmission grid.

distributed_generation

Whether the generator is considered distributed generation

duct_burners

Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas

energy_source_1_transport_1

Primary mode of transport for energy source 1.

energy_source_1_transport_2

Secondary mode of transport for energy source 1.

energy_source_1_transport_3

Tertiary mode of transport for energy source 1.

energy_source_2_transport_1

Primary mode of transport for energy source 2.

energy_source_2_transport_2

Secondary mode of transport for energy source 2.

energy_source_2_transport_3

Tertiary mode of transport for energy source 2.

energy_source_code_2

The code representing the second most predominant type of energy that fuels the generator

energy_source_code_3

The code representing the third most predominant type of energy that fuels the generator

energy_source_code_4

The code representing the fourth most predominant type of energy that fuels the generator

energy_source_code_5

The code representing the fifth most predominant type of energy that fuels the generator

energy_source_code_6

The code representing the sixth most predominant type of energy that fuels the generator

energy_storage_capacity_mwh

Energy storage capacity in MWh (e.g. for batteries).

ferc_qualifying_facility

Indicates whether or not a generator is a qualifying FERC cogeneration facility.

fluidized_bed_tech

Indicates whether the generator uses fluidized bed technology

fuel_type_count

A count of how many different simple energy sources there are associated with a generator.

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

minimum_load_mw

The minimum load at which the generator can operate continuously.

can_burn_multiple_fuels

Whether the generator can burn multiple fuels.

nameplate_power_factor

The nameplate power factor of the generator.

net_capacity_mwdc

Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.

can_switch_when_operating

Indicates whether a fuel switching generator can switch fuels while operating.

operational_status_code

The operating status of the asset.

original_planned_generator_operating_date

The date the generator was originally scheduled to be operational

other_combustion_tech

Indicates whether the generator uses other combustion technologies

other_modifications_date

Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.

other_planned_modifications

Indicates whether there are other modifications planned for the generator.

owned_by_non_utility

Whether any part of the generator is owned by a non-utility.

ownership_code

Identifies the ownership for each generator.

planned_derate_date

Planned effective month that the generator is scheduled to enter operation after the derate modification.

planned_energy_source_code_1

New energy source code for the planned repowered generator.

planned_modifications

Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.

planned_net_summer_capacity_derate_mw

Decrease in summer capacity expected to be realized from the derate modification to the equipment.

planned_net_summer_capacity_uprate_mw

Increase in summer capacity expected to be realized from the modification to the equipment.

planned_net_winter_capacity_derate_mw

Decrease in winter capacity expected to be realized from the derate modification to the equipment.

planned_net_winter_capacity_uprate_mw

Increase in winter capacity expected to be realized from the uprate modification to the equipment.

planned_new_capacity_mw

The expected new nameplate capacity for the generator.

planned_new_prime_mover_code

New prime mover for the planned repowered generator.

planned_repower_date

Planned effective date that the generator is scheduled to enter operation after the repowering is complete.

planned_uprate_date

Planned effective date that the generator is scheduled to enter operation after the uprate modification.

previously_canceled

Indicates whether the generator was previously reported as indefinitely postponed or canceled

pulverized_coal_tech

Indicates whether the generator uses pulverized coal technology

reactive_power_output_mvar

Reactive Power Output (MVAr)

rto_iso_lmp_node_id

The designation used to identify the price node in RTO/ISO Locational Marginal Price reports

rto_iso_location_wholesale_reporting_id

The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report

solid_fuel_gasification

Indicates whether the generator is part of a solid fuel gasification system

startup_source_code_1

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_2

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_3

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_4

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

state

Two letter US state abbreviation.

stoker_tech

Indicates whether the generator uses stoker technology

street_address

Physical street address.

subcritical_tech

Indicates whether the generator uses subcritical technology

summer_capacity_estimate

Whether the summer capacity value was an estimate

summer_capacity_mw

The net summer capacity.

summer_estimated_capability_mw

EIA estimated summer capacity (in MW).

supercritical_tech

Indicates whether the generator uses supercritical technology

can_switch_oil_gas

Whether the generator can switch between oil and natural gas.

synchronized_transmission_grid

Indicates whether standby generators (SB status) can be synchronized to the grid.

time_cold_shutdown_full_load_code

The minimum amount of time required to bring the unit to full load from shutdown.

timezone

IANA timezone name

topping_bottoming_code

If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle

turbines_inverters_hydrokinetics

Number of wind turbines, or hydrokinetic buoys.

turbines_num

Number of wind turbines, or hydrokinetic buoys.

ultrasupercritical_tech

Indicates whether the generator uses ultra-supercritical technology

uprate_derate_completed_date

The date when the uprate or derate was completed.

uprate_derate_during_year

Was an uprate or derate completed on this generator during the reporting year?

winter_capacity_estimate

Whether the winter capacity value was an estimate

winter_capacity_mw

The net winter capacity.

winter_estimated_capability_mw

EIA estimated winter capacity (in MW).

zip_code

Five digit US Zip Code.

out_eia__yearly_generators_by_ownership

package: pudl

Annual time series of all EIA generators with ownership integrated.

Most-recent data:

2026

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

This table has no primary key. This table would have a primary key of the following columns, except there are ~200 records without utility IDs: ['plant_id_eia', 'generator_id', 'report_date', 'utility_id_eia', 'ownership_record_type'].
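Before relying on those columns as a key, it can help to count the rows that break it. The sketch below is a hypothetical helper over plain dictionaries with invented sample rows; it is not part of PUDL.

```python
from typing import Dict, Iterable, Mapping, Sequence

KEY_COLUMNS = (
    "plant_id_eia",
    "generator_id",
    "report_date",
    "utility_id_eia",
    "ownership_record_type",
)


def key_problems(
    rows: Iterable[Mapping], key_columns: Sequence[str] = KEY_COLUMNS
) -> Dict[str, int]:
    """Count rows with a null key column and rows that repeat an earlier key."""
    seen = set()
    null_key_rows = 0
    duplicate_key_rows = 0
    for row in rows:
        key = tuple(row.get(col) for col in key_columns)
        if any(value is None for value in key):
            null_key_rows += 1  # cannot take part in a primary key
            continue
        if key in seen:
            duplicate_key_rows += 1
        seen.add(key)
    return {"null_key_rows": null_key_rows, "duplicate_key_rows": duplicate_key_rows}


base = {"plant_id_eia": 1, "generator_id": "1", "report_date": "2020-01-01"}
rows = [
    {**base, "utility_id_eia": 101, "ownership_record_type": "owned"},
    {**base, "utility_id_eia": 101, "ownership_record_type": "total"},
    {**base, "utility_id_eia": None, "ownership_record_type": "owned"},
]
problems = key_problems(rows)
```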

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

  • Contains information from multiple raw inputs.

Additional Details

Generators that report to EIA can have multiple utility owners. EIA reports the ownership fraction for each of the owners of generators (see core_eia860__scd_ownership). This table includes two records for each generator's owner: one record represents a total ownership stake and the other record represents a proportional ownership stake (the ownership_record_type column labeled as "total" and "owned" respectively). All of the values reported for these generator-owner records are scaled to the ownership stake (represented by fraction_owned).
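The two record types can be sketched as follows. This is an illustrative reading of the description above, not PUDL's code: the function name, the choice of scaled columns, and the use of fraction_owned = 1.0 on "total" records are all assumptions.

```python
from typing import Dict, List

# Assumption: these are the columns scaled by the ownership stake.
SCALED_COLUMNS = ("capacity_mw", "net_generation_mwh")


def ownership_slices(generator: dict, owner_fractions: Dict[int, float]) -> List[dict]:
    """Build an 'owned' and a 'total' record for every owner of a generator."""
    records = []
    for utility_id, fraction in owner_fractions.items():
        for record_type, stake in (("owned", fraction), ("total", 1.0)):
            record = dict(generator)
            record["utility_id_eia"] = utility_id
            record["ownership_record_type"] = record_type
            record["fraction_owned"] = stake
            for column in SCALED_COLUMNS:
                record[column] = generator[column] * stake
            records.append(record)
    return records


# Hypothetical 100 MW generator split 75/25 between two owners.
gen = {
    "plant_id_eia": 1,
    "generator_id": "1",
    "capacity_mw": 100.0,
    "net_generation_mwh": 400_000.0,
}
slices = ownership_slices(gen, {101: 0.75, 202: 0.25})
owned_mw = sum(r["capacity_mw"] for r in slices if r["ownership_record_type"] == "owned")
all_mw = sum(r["capacity_mw"] for r in slices)
```

Summing only the "owned" records recovers the generator's 100 MW, while summing every record gives 300 MW. This is why the subtotal warning above asks for caution when aggregating.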

This table was created to represent every possible version of ownership slices of the EIA generators. It is primarily used as an input to out_eia__yearly_plant_parts.

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

report_date

Date reported.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

technology_description

High level description of the technology used by the generator to produce electricity.

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

operational_status

The operating status of the asset. For generators this is based on which tab of EIA 860 the generator was listed in.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

capacity_factor

Fraction of potential generation that was actually reported for a plant part. Calculated as energy generated over the time period / (nameplate capacity * length of the time period in hours).

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or was aggregated from records with multiple sources (mixed).

fuel_cost_per_mmbtu

Average fuel cost per MMBTU of heat content in nominal USD.

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of electricity generated, in nominal USD.

unit_heat_rate_mmbtu_per_mwh

Heat content of fuel consumed per unit of electricity generated, in MMBtu per MWh. Derived from the MCOE calculation.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

ferc_acct_name

Name of FERC account, derived from technology description and prime mover code.

generator_operating_year

Year a generator went into service.

operational_status_pudl

The operating status of the asset using PUDL categories.

capacity_eoy_mw

Total end of year installed (nameplate) capacity for a plant part, in megawatts.

fraction_owned

Proportion of generator ownership attributable to this utility.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

out_eia923__yearly_generation_fuel_by_generator

package: pudl

Annual time series of estimated net generation and fuel consumption by generator.

Most-recent data:

2024

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

  • Contains information from multiple raw inputs.

Additional Details

Based on allocating net electricity generation and fuel consumption reported in the EIA-923 generation and generation_fuel tables to individual generators.

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL aggregates the net generation and fuel consumption that has been allocated to the generator_id/energy_source_code/prime_mover_code level in out_eia923__yearly_generation_fuel_by_generator_energy_source up to the generator level. This process does not distinguish between primary and secondary energy sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes, the core_eia923__monthly_generation_fuel records will be associated similarly between these generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
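The aggregation step amounts to a grouped sum over the generator key. The sketch below uses plain dictionaries with invented values; the function name is hypothetical and it only illustrates the grouping, not PUDL's pipeline.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

VALUE_COLUMNS = ("net_generation_mwh", "fuel_consumed_mmbtu")


def aggregate_to_generator(records: Iterable[dict]) -> Dict[Tuple, Dict[str, float]]:
    """Sum energy-source-level records up to (report_date, plant, generator)."""
    totals = defaultdict(lambda: {column: 0.0 for column in VALUE_COLUMNS})
    for record in records:
        key = (record["report_date"], record["plant_id_eia"], record["generator_id"])
        for column in VALUE_COLUMNS:
            totals[key][column] += record[column]
    return dict(totals)


# Hypothetical records: generator "1" burns two fuels, generator "2" burns one.
records = [
    {"report_date": "2023-01-01", "plant_id_eia": 1, "generator_id": "1",
     "energy_source_code": "NG", "net_generation_mwh": 900.0, "fuel_consumed_mmbtu": 9000.0},
    {"report_date": "2023-01-01", "plant_id_eia": 1, "generator_id": "1",
     "energy_source_code": "DFO", "net_generation_mwh": 100.0, "fuel_consumed_mmbtu": 1100.0},
    {"report_date": "2023-01-01", "plant_id_eia": 1, "generator_id": "2",
     "energy_source_code": "NG", "net_generation_mwh": 500.0, "fuel_consumed_mmbtu": 5000.0},
]
by_generator = aggregate_to_generator(records)
```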

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.

fuel_consumed_mmbtu

Total consumption of fuel, in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

out_eia923__yearly_generation_fuel_by_generator_energy_source

package: pudl

Annual time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, and prime mover.

Most-recent data:

2024

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has allocated the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel to the generator_id/energy_source_code/prime_mover_code level.

The allocation process entails generating a fraction for each record based on the net generation in the core_eia923__monthly_generation table and the capacity from the core_eia860__scd_generators table. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
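A heavily simplified sketch of the fraction described above: within a group of generators that share a reported total, weight each generator by its reported net generation when every generator has one, and fall back to capacity otherwise. The function names, the all-or-nothing fallback, and the sample values are assumptions; PUDL's actual allocation handles many more cases.

```python
from typing import Dict, List


def allocation_fractions(generators: List[dict]) -> Dict[str, float]:
    """Share of a group total assigned to each generator in the group."""
    reported = [g["reported_net_generation_mwh"] for g in generators]
    if all(value is not None for value in reported):
        # Every generator reported its own generation: weight by that.
        weights = {g["generator_id"]: g["reported_net_generation_mwh"] for g in generators}
    else:
        # Assumption: otherwise weight the whole group by nameplate capacity.
        weights = {g["generator_id"]: g["capacity_mw"] for g in generators}
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return {gid: weight / total for gid, weight in weights.items()}


def allocate(total_value: float, fractions: Dict[str, float]) -> Dict[str, float]:
    """Split a reported total across generators using their fractions."""
    return {gid: total_value * fraction for gid, fraction in fractions.items()}


# Hypothetical group with no generator-level generation reported.
group = [
    {"generator_id": "GT1", "capacity_mw": 300.0, "reported_net_generation_mwh": None},
    {"generator_id": "GT2", "capacity_mw": 100.0, "reported_net_generation_mwh": None},
]
fractions = allocation_fractions(group)
allocated = allocate(1000.0, fractions)
```

The fractions sum to one, so the allocated values add back up to the reported total for the group.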

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel, in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.

out_eia923__yearly_generation_fuel_by_generator_energy_source_owner

package: pudl

Annual time series of estimated net generation and fuel consumption for each generator, associated with each combination of generator, energy source, prime mover, and owner.

Most-recent data:

2024

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code, utility_id_eia, ownership_record_type

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has scaled the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel by ownership from out_eia860__yearly_ownership. Note that the utility_id_eia in this table refers to the OWNER of the generator, not the operator.

This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.

This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

utility_id_eia

The EIA Utility Identification number.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

fraction_owned

Proportion of generator ownership attributable to this utility.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel, in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.