_core_phmsagas__yearly_distribution_leaks

package: pudl

Annual time series of total and hazardous leaks eliminated or repaired during the report year.

Most-recent data:

2024

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part C)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, leak_severity and leak_source. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Except some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

leak_severity

Whether or not the leak described in this record are all leaks or hazardous leaks.

leak_source

The cause of the leaks.

mains

The number of mains distribution pipeline.

services

Number of services in system at end of year.