out_eia923__yearly_generation

package: pudl

Annual time series of net electricity generated by power plant generators.

Most-recent data:

2024

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, generator_id, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Not all IDs are present.

  • Table has known low coverage - either geographic or temporal or otherwise.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

Additional Details

This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel table appears in this table due to the different reporting requirements imposed on different types and sizes of generators.

Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.
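The 70% rule described above can be illustrated with a minimal sketch. This is not PUDL's harvesting code: the function name canonical_value, the choice to ignore nulls before counting, and the sample plant names are all assumptions made for illustration.

```python
from collections import Counter


def canonical_value(values, threshold=0.7):
    """Return the most common value if it meets the threshold, else None.

    Hypothetical helper: nulls are dropped before counting, which is an
    assumption and may not match how PUDL treats missing reports.
    """
    reported = [v for v in values if v is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    # Keep the value only if it accounts for at least `threshold` of reports.
    return value if count / len(reported) >= threshold else None


# Made-up reports of one plant's name across several source tables.
consistent = canonical_value(["Plant A"] * 8 + ["Plant A Station"] * 2)
inconsistent = canonical_value(["Plant A"] * 6 + ["Plant A Station"] * 4)
```

In the first case the leading value covers 80% of the reports and is kept. In the second it covers only 60%, so the attribute would appear as null.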

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
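The advice to treat generator_id as a string can be shown with a short sketch. The CSV content below is made up for illustration, and the lossy helper is hypothetical. It mimics what happens when a reader tries to coerce the column to integers.

```python
import csv
import io

# Made-up rows: "0001" and "1" are distinct generator IDs at the same plant.
raw = (
    "plant_id_eia,generator_id,net_generation_mwh\n"
    "3,0001,120.5\n"
    "3,1,98.0\n"
    "3,GT1A,45.2\n"
)
rows = list(csv.DictReader(io.StringIO(raw)))

# csv.DictReader keeps every field as a string, so all three IDs survive.
ids_as_str = {row["generator_id"] for row in rows}


def lossy(generator_id):
    """Hypothetical numeric coercion: strips leading zeros where it can."""
    try:
        return int(generator_id)
    except ValueError:
        return generator_id


# After coercion, "0001" and "1" collapse into the same key.
ids_coerced = {lossy(row["generator_id"]) for row in rows}
```

With string IDs the three generators stay distinct. After numeric coercion only two keys remain, which would silently break joins on the primary key.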

out_eia923__yearly_generation_fuel_by_generator

package: pudl

Annual time series of estimated net generation and fuel consumption by generator.

Most-recent data:

2024

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

  • Contains information from multiple raw inputs.

Additional Details

Based on allocating net electricity generation and fuel consumption reported in the EIA-923 generation and generation_fuel tables to individual generators.

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL aggregates the net generation and fuel consumption that has been allocated to the generator_id/energy_source_code/prime_mover_code level in the out_eia923__yearly_generation_fuel_by_generator_energy_source table up to the generator level. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
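The aggregation step described above amounts to summing the per-energy-source records for each generator. The following is a minimal sketch under that assumption. The sample records are made up, and the real PUDL implementation may differ in detail.

```python
from collections import defaultdict

# Made-up records at the generator / prime mover / energy source level.
records = [
    {"report_date": "2024-01-01", "plant_id_eia": 3, "generator_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "NG",
     "net_generation_mwh": 1000.0, "fuel_consumed_mmbtu": 9000.0},
    {"report_date": "2024-01-01", "plant_id_eia": 3, "generator_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "DFO",
     "net_generation_mwh": 50.0, "fuel_consumed_mmbtu": 500.0},
    {"report_date": "2024-01-01", "plant_id_eia": 3, "generator_id": "2",
     "prime_mover_code": "ST", "energy_source_code": "NG",
     "net_generation_mwh": 400.0, "fuel_consumed_mmbtu": 4000.0},
]

value_columns = ("net_generation_mwh", "fuel_consumed_mmbtu")
by_generator = defaultdict(lambda: dict.fromkeys(value_columns, 0.0))

for record in records:
    # Group on this table's primary key and sum over energy sources.
    key = (record["report_date"], record["plant_id_eia"], record["generator_id"])
    for column in value_columns:
        by_generator[key][column] += record[column]
```

Generator "1" ends up with a single row that combines its NG and DFO records, so the energy source detail is no longer visible at this level.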

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.

fuel_consumed_mmbtu

Total consumption of fuel in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

out_eia923__yearly_generation_fuel_combined

package: pudl

Annual time series of generation and fuel consumption of all generation units.

Most-recent data:

2024

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, prime_mover_code, energy_source_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

Additional Details

Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.

This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator along with their capacities to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables in order to calculate capacity factors, heat rates, and the marginal cost of electricity.

The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
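To illustrate why the distinction matters, here is a minimal sketch that derives a heat rate and an electricity share of fuel from one record. The record values are made up, and the helper names heat_rate_mmbtu_per_mwh and electricity_fuel_share are hypothetical. The heat rate uses the conventional definition of fuel burned for electricity divided by net generation, which is an assumption about how a reader might use these columns rather than a statement of PUDL's own calculation.

```python
def heat_rate_mmbtu_per_mwh(record):
    """Fuel burned for electricity per MWh of net generation, or None."""
    if not record["net_generation_mwh"]:
        return None  # avoid dividing by zero or missing generation
    return record["fuel_consumed_for_electricity_mmbtu"] / record["net_generation_mwh"]


def electricity_fuel_share(record):
    """Fraction of all fuel consumed that went to electricity, or None."""
    if not record["fuel_consumed_mmbtu"]:
        return None
    return record["fuel_consumed_for_electricity_mmbtu"] / record["fuel_consumed_mmbtu"]


# Made-up CHP record: 20% of the fuel went to process heat.
chp_record = {
    "fuel_consumed_mmbtu": 10000.0,
    "fuel_consumed_for_electricity_mmbtu": 8000.0,
    "net_generation_mwh": 1000.0,
}

heat_rate = heat_rate_mmbtu_per_mwh(chp_record)
share = electricity_fuel_share(chp_record)
```

Using fuel_consumed_mmbtu in the numerator instead would overstate the heat rate of this made-up unit (10.0 rather than 8.0 MMBtu/MWh), because it would charge the process heat fuel to electricity.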

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG).

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_units

Consumption for electric generation of the fuel type in physical unit.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
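The column definitions above suggest that the physical-unit and MMBtu columns are linked through fuel_mmbtu_per_unit. The sketch below checks that relationship for one made-up record. Treat the relationship as an assumption: reported values may not satisfy it exactly, which is why the check uses a tolerance. The helper name implied_mmbtu is hypothetical.

```python
import math


def implied_mmbtu(units, mmbtu_per_unit):
    """Heat content implied by a physical quantity and a per-unit heat content."""
    return units * mmbtu_per_unit


# Made-up record in which the fuel has 20 MMBtu per physical unit.
record = {
    "fuel_consumed_units": 1000.0,
    "fuel_consumed_for_electricity_units": 800.0,
    "fuel_mmbtu_per_unit": 20.0,
    "fuel_consumed_mmbtu": 20000.0,
    "fuel_consumed_for_electricity_mmbtu": 16000.0,
}

total_ok = math.isclose(
    implied_mmbtu(record["fuel_consumed_units"], record["fuel_mmbtu_per_unit"]),
    record["fuel_consumed_mmbtu"],
    rel_tol=0.01,  # allow 1% for rounding in the reported values
)
electricity_ok = math.isclose(
    implied_mmbtu(record["fuel_consumed_for_electricity_units"], record["fuel_mmbtu_per_unit"]),
    record["fuel_consumed_for_electricity_mmbtu"],
    rel_tol=0.01,
)
```

A check like this can help flag records where the physical units and heat content columns disagree before they are used in further calculations.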

out_eia923__yearly_generation_fuel_by_generator_energy_source

package: pudl

Annual time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, and prime mover.

Most-recent data:

2024

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has allocated the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel to the generator_id/energy_source_code/prime_mover_code level.

The allocation process entails generating a fraction for each record based on the net generation in the core_eia923__monthly_generation table and the capacity from the core_eia860__scd_generators table. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
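The idea of an allocation fraction can be sketched as follows. This is a heavily simplified illustration and not PUDL's allocation algorithm: the function name allocation_fractions, the rule of falling back to capacity whenever any generator lacks reported net generation, and all sample values are assumptions.

```python
def allocation_fractions(generators):
    """Return {generator_id: fraction} for one plant / prime mover / energy source.

    Weights are reported net generation when every generator has it and the
    total is positive; otherwise nameplate capacity is used (an assumed rule).
    """
    net_generation = [g.get("net_generation_mwh") for g in generators]
    if all(v is not None for v in net_generation) and sum(net_generation) > 0:
        weights = net_generation
    else:
        weights = [g["capacity_mw"] for g in generators]
    total = sum(weights)
    return {g["generator_id"]: w / total for g, w in zip(generators, weights)}


# Plant-level total, as it might appear in the generation_fuel data (made up).
plant_total_mwh = 1000.0

with_reports = [
    {"generator_id": "A", "net_generation_mwh": 600.0, "capacity_mw": 100.0},
    {"generator_id": "B", "net_generation_mwh": 200.0, "capacity_mw": 100.0},
]
without_reports = [
    {"generator_id": "A", "net_generation_mwh": None, "capacity_mw": 100.0},
    {"generator_id": "B", "net_generation_mwh": None, "capacity_mw": 300.0},
]

fractions_reported = allocation_fractions(with_reports)
fractions_capacity = allocation_fractions(without_reports)
allocated = {gid: plant_total_mwh * f for gid, f in fractions_reported.items()}
```

In the first case the plant total is split 750 / 250 MWh in proportion to reported generation. In the second, with no generator-level reports, the split follows capacity instead.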

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG).

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.

out_eia923__yearly_generation_fuel_by_generator_energy_source_owner

package: pudl

Annual time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, prime mover, and owner.

Most-recent data:

2024

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code, utility_id_eia, ownership_record_type

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details, and see Entity Resolution Methodology for a fuller conceptual overview.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has scaled the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel by ownership from out_eia860__yearly_ownership. Note that the utility_id_eia in this table refers to the OWNER of the generator, not the operator.
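Scaling by ownership can be pictured as multiplying each generator-level value by the owner's fraction_owned. The sketch below assumes exactly that. The utility IDs, ownership fractions, and the helper name scale_by_ownership are made up for illustration, and the real table also carries an ownership_record_type column that this sketch leaves out.

```python
# Made-up generator-level values before ownership scaling.
generator_record = {
    "plant_id_eia": 3,
    "generator_id": "1",
    "net_generation_mwh": 1000.0,
    "fuel_consumed_mmbtu": 9000.0,
}

# Made-up ownership slices. utility_id_eia identifies the OWNER here.
ownership = [
    {"utility_id_eia": 100, "fraction_owned": 0.75},
    {"utility_id_eia": 200, "fraction_owned": 0.25},
]


def scale_by_ownership(record, owners):
    """Return one scaled copy of the record per owner (hypothetical helper)."""
    scaled = []
    for owner in owners:
        row = dict(record, **owner)
        for column in ("net_generation_mwh", "fuel_consumed_mmbtu"):
            row[column] = record[column] * owner["fraction_owned"]
        scaled.append(row)
    return scaled


owner_rows = scale_by_ownership(generator_record, ownership)
owned_generation = {row["utility_id_eia"]: row["net_generation_mwh"] for row in owner_rows}
```

Because the fractions sum to one in this example, the owner-level rows add back up to the generator total. That is a useful sanity check when working with owner-scaled data.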

This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.

This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. See /methodology/entity_resolution for a conceptual overview of this process.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG).

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

utility_id_eia

The EIA Utility Identification number.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

fraction_owned

Proportion of generator ownership attributable to this utility.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel in MMBtu, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in MMBtu, year to date.