Synthetic data generation for machine learning model training for energy theft scenarios using cosimulation
- National Rural Electric Cooperative Association Arlington Virginia USA
- Pacific Northwest National Laboratory Richland Washington USA
Abstract Technical and non‐technical losses in distribution circuits result in significant economic costs to power utilities. One type of non‐technical loss is energy theft by various means including illegal tapping of feeders, bypassing the meter, and billing fraud. These losses are usually hard to detect, and can remain undetected for long periods of time. Machine learning models have been proven effective in detecting these conditions, but rely on the availability of large, good‐quality training data sets. The problem is exacerbated by the imbalanced nature of data related to these conditions—energy theft, though costly, is very rare. The available data sets generally have very few samples of theft with most of the data pertaining to normal operation. Such data sets are generally not suitable to train machine learning models. In this paper, an overview of energy theft detection techniques, the challenges with their data needs, and the limitations of current techniques to bridge such data limitations is presented. A co‐simulation framework is proposed to generate reliable training data for machine learning algorithms for theft detection. An example scenario is presented and a machine learning model is built to detect certain kinds of energy theft.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- Grant/Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1889757
- Report Number(s):
- PNNL-SA--168747
- Journal Information:
- IET Generation, Transmission, & Distribution, Journal Name: IET Generation, Transmission, & Distribution Journal Issue: 5 Vol. 17; ISSN 1751-8687
- Publisher:
- Institution of Engineering and Technology (IET)Copyright Statement
- Country of Publication:
- United Kingdom
- Language:
- English
Similar Records
Applying Machine‐Learning Methods to Laser Acceleration of Protons: Lessons Learned From Synthetic Data
Theft of service poses major problems