_core_phmsagas__yearly_distribution_by_material

package: pudl

Annual time series of miles of mains and the number of services in operation at the end of the year by material for each gas distribution operator.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 1 - General)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state and material. However, there are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

material

The material of the gas distribution pipe. The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

mains_miles

The miles of mains distribution pipeline.

services

Number of services in system at end of year.

_core_phmsagas__yearly_distribution_filings

package: pudl

Annual time series of filings (aka submissions) from gas distribution system operators.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report

Primary key:

report_id, report_date, operator_id_phmsa

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

Additional Details

This table contains information about the filer and filing type. This includes information about who filed but also whether this was an original filing or a correction.

Columns
report_id

Report number of the PHMSA Gas utility submission.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

report_date

Date reported.

filing_date

Date on which the filing was submitted.

initial_filing_date

Initial date when filing was originally submitted.

filing_correction_date

Date when a correction filing was submitted.

report_filing_type

Type of report submitted, either Initial or Supplemental.

data_date

When the data source was last updated.

form_revision_id

PHMSA form revision identifier.

preparer_name

Name of representative who filed report.

preparer_title

Title of representative who filed report.

preparer_phone

Phone number of representative who filed report.

preparer_fax

Fax number of representative who filed report.

preparer_email

Email address of representative who filed report.

_core_phmsagas__yearly_distribution_leaks

package: pudl

Annual time series of total and hazardous leaks eliminated or repaired during the report year.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part C)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, leak_severity and leak_source. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

leak_severity

Whether or not the leak described in this record are all leaks or hazardous leaks.

leak_source

The cause of the leaks.

mains

The number of mains distribution pipeline.

services

Number of services in system at end of year.

_core_phmsagas__yearly_distribution_misc

package: pudl

Annual time series of miscellaneous distribution information.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B & C)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, and operating_state. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_date

Date reported.

report_id

Report number of the PHMSA Gas utility submission.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

all_known_leaks_scheduled_for_repair

The number of known system leaks at the end of the report year scheduled for repair.

all_known_leaks_scheduled_for_repair_main

The number of known leaks on main at the end of the report year scheduled for repair.

hazardous_leaks_mechanical_joint_failure

The total number of hazardous leaks caused by a mechanical joint failure.

federal_land_leaks_repaired_or_scheduled

Total number of leaks repaired, eliminated, or scheduled for repair on federal land during the reporting year.

average_service_length_feet

The average system service length in feet.

services_efv_in_system

Estimated number of services with Excess Flow Valve in the system at end of reported year related to natural gas distribution.

services_efv_installed

Total number of services with Excess Flow Valve installed during reported year related to natural gas distribution.

services_shutoff_valve_in_system

Estimated number of services with manual service line shut-off valves installed in the system at end of report year related to natural gas distribution.

services_shutoff_valve_installed

Total number of manual service line shut-off valves installed during reported year related to natural gas distribution.

unaccounted_for_gas_fraction

Unaccounted for gas as a fraction of total consumption for the 12 months ending June 30 of the reporting year. Calculated as follows: Take the sum of: (purchased gas + produced gas) minus (customer use + company use + appropriate adjustments). Then divide by the sum of (customer use + company use + appropriate adjustments). Prior to 2017, this field was calculated with a different deonominator (purchased gas + produced gas). The time period between 2010-2017 having this different calculation method ensured that there was no records that had a negative fraction. For all the other reporting years there are known and expected negative values in this column.

excavation_tickets

Number of Excavation Tickets received by the operator during the year, (i.e., receipt of information by the operator from the notification center).

core_phmsagas__yearly_distribution_operators

package: pudl

Annual time series of distribution operator information.

Most-recent data:

2024

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part A)

Primary key:

report_id, report_date, operator_id_phmsa

Additional Details

This table contains operator-level information including office and headquarter location.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

operator_name_phmsa

PHMSA operator name.

office_street_address

Street address of an operator's office.

office_city

City where an operator's office is located.

office_county

County where an operator's office is located.

office_zip

Zipcode where an operator's office is located.

office_state

State where an operator's office is located.

headquarters_street_address

Street address for an operator's headquarters.

headquarters_city

City where an operator's headquarters are located.

headquarters_county

County where an operator's headquarters are located.

headquarters_state

State where an operator's headquarters are located.

headquarters_zip

Zipcode where an operator's headquarters are located.

additional_information

Any additional information which will assist in clarifying or classifying the reported data.

_core_phmsagas__yearly_distribution_by_install_decade

package: pudl

Annual time series of miles of mains and the number of services in operation at the end of the year by install decade.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 4)

Primary key:

report_id, report_date, operator_id_phmsa, operating_state, install_decade

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

Additional Details

The records with an install decade of total_decade are a total - beware of aggregating these values.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

install_decade

The decade the distribution pipeline was installed.

mains_miles

The miles of mains distribution pipeline.

services

Number of services in system at end of year.

_core_phmsagas__yearly_distribution_by_material_and_size

package: pudl

Annual time series of miles of mains and the number of services in operation at the end of the year by material and size of pipe.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 3)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, main_size and material. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • The size ranges in main_size have changed slightly over the years (ex: before 1984 they reported 0.5_in_or_less whereas after they reported 1_in_or_less)

  • The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_date

Date reported.

report_id

Report number of the PHMSA Gas utility submission.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

main_size

Size range of mains. The size ranges have changed slightly over the years (ex: before 1984 they reported 0.5_in_or_less whereas after they reported 1_in_or_less).

material

The material of the gas distribution pipe. The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

mains_miles

The miles of mains distribution pipeline.

services

Number of services in system at end of year.

main_other_material_detail

A free-form text field containing notes about the other material type. This column should only contain values in it for rows with other as the material type listed.

_core_phmsagas__yearly_distribution_excavation_damages

package: pudl

Annual time series of excavation damages from various sources.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part D - Excavation Damage)

Primary key:

report_id, damage_type, damage_sub_type

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

damage_type

A high level category of excavation damage causes.

damage_sub_type

A sub-category of damage_type of excavation damage causes.

damages

Number of instances of excavation damage.