Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Resampling and data augmentation for short-term PV output prediction based on an imbalanced sky images dataset using convolutional neural networks

Journal Article · · Solar Energy
Integrating photovoltaics (PV) into electricity grids is challenged by potentially large fluctuations in power generation. In recent years, sky image-based PV output prediction using convolutional neural networks (CNNs) has emerged as a promising approach to forecasting fluctuations. A key challenge is imbalanced sky image datasets: because of the geography of solar PV system installations, sky image datasets are often rich in sunny condition data but deficient in cloudy condition data. This imbalance contrasts with the fact that model errors are dominated by cloudy condition performance. In this study, we attempt to remedy this by exploring the enrichment and augmentation of an imbalanced sky images dataset for two PV output prediction tasks: nowcasting (predicting concurrent PV output) and forecasting (predicting 15-minute-ahead future PV output). We empirically examine the efficacy of using different resampling and data augmentation approaches to create a rebalanced dataset for model development. A three-stage greedy search is used to determine the optimal resampling approach, data augmentation techniques and over-sampling rate. The results show that for the nowcast problem, resampling and data augmentation can effectively enhance the model performance, reducing overall root mean squared error (RMSE) by an average of 4%, or a 15 std. (standard deviation) of improvement compared to the variability of the baseline model. In contrast, the treatment RMSE for the forecast problem nearly always overlaps the baseline performance at the ± 2 std. level. The optimal resampling approach expands on the original dataset by over-sampling the minority cloudy data, with the best results from large over-sampling rate (e.g., 4 ~ 6 times over-sampling of cloudy images).
Research Organization:
National Renewable Energy Laboratory (NREL), Golden, CO (United States)
Sponsoring Organization:
USDOE National Renewable Energy Laboratory (NREL), Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC36-08GO28308
OSTI ID:
1805186
Report Number(s):
NREL/JA--5D00-80184; MainId:42387; UUID:2c9c3c5c-19b9-4736-8c87-457bb9442e2a; MainAdminID:25654
Journal Information:
Solar Energy, Journal Name: Solar Energy Vol. 224; ISSN 0038-092X
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (27)

Empirical Risk Minimization book January 2017
GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification journal December 2018
A systematic study of the class imbalance problem in convolutional neural networks journal October 2018
Real-time prediction intervals for intra-hour DNI forecasts journal November 2015
Hybrid intra-hour DNI forecasts with sky image processing enhanced by stochastic learning journal December 2013
Short-term reforecasting of power output from a 48 MWe solar PV plant journal February 2015
Deep photovoltaic nowcasting journal December 2018
Short-term solar power forecast with deep learning: Exploring optimal input and output configuration journal August 2019
Data augmentation using generative adversarial networks (CycleGAN) to improve generalizability in CT segmentation tasks journal November 2019
Solar PV output prediction from video streams using convolutional neural networks journal January 2018
A guideline to solar forecasting research practice: Reproducible, operational, probabilistic or physically-based, ensemble, and skill (ROPES) journal March 2019
PV power output prediction from sky images using convolutional neural network: The comparison of sky-condition-specific sub-models and an end-to-end model journal July 2020
Solar Irradiance Capturing in Cloudy Sky Days–A Convolutional Neural Network Based Image Regression Approach journal January 2020
Image Style Transfer Using Convolutional Neural Networks conference June 2016
Learning Deep Representation for Imbalanced Classification conference June 2016
Image-to-Image Translation with Conditional Adversarial Networks conference July 2017
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs conference June 2018
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks conference October 2017
Plankton classification on imbalanced large scale database via convolutional neural networks with transfer learning conference September 2016
Training cost-sensitive Deep Belief Networks on imbalance data problems conference July 2016
Training deep neural networks on imbalanced data sets conference July 2016
Domain randomization for transferring deep neural networks from simulation to the real world conference September 2017
Predicting Hospital Readmission via Cost-Sensitive Deep Learning journal November 2018
Imbalanced Deep Learning by Minority Class Incremental Rectification journal June 2019
Resampling strategies for regression journal August 2014
Survey on deep learning with class imbalance journal March 2019
A survey on Image Data Augmentation for Deep Learning journal July 2019

Similar Records

Benchmarking of Solar Irradiance Nowcast Performance Derived from All-Sky Imagers
Journal Article · Sat Sep 03 00:00:00 EDT 2022 · Renewable Energy · OSTI ID:1890144

Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited
Journal Article · Thu Jan 31 19:00:00 EST 2019 · PLoS Computational Biology (Online) · OSTI ID:1495153

Spatio-Temporal Denoising Graph Autoencoders with Data Augmentation for Missing Photovoltaic Data Imputation
Conference · Fri Jun 23 00:00:00 EDT 2023 · OSTI ID:1959292