DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Open data sets for assessing photovoltaic system reliability

Journal Article · · Applied Energy
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [3];  [3]; ORCiD logo [4]; ORCiD logo [5]; ORCiD logo [3]; ORCiD logo [6]; ORCiD logo [5]; ORCiD logo [5]; ORCiD logo [2]
  1. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); University of California, Berkeley, CA (United States)
  2. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
  3. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
  4. Univ. of Central Florida, Cocoa, FL (United States)
  5. National Renewable Energy Laboratory (NREL), Golden, CO (United States)
  6. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Case Western Reserve Univ., Cleveland, OH (United States)

Photovoltaic (PV) systems have become a cornerstone of renewable energy strategies, particularly due to the significant reduction in solar power costs over the past decade. However, the long-term reliability of PV installations presents a persistent challenge, requiring the development of advanced monitoring and predictive maintenance strategies. A wide range of data types is used to evaluate the health of PV systems, including environmental conditions, electrical performance, and inspection imagery. These data enable methodologies such as machine learning (ML) models for lifetime prediction and computer vision techniques for defect detection. However, the acquisition of high-quality and comprehensive data is difficult, particularly in terms of long-term consistency and data variety. Publicly available data sets serve as valuable resources for addressing these challenges, but they often suffer from fragmentation and are difficult to access. This paper presents a comprehensive review of existing open-source data sets related to PV degradation, analyzing their features, functionalities, and potential applications. We categorize these data sets based on the specific aspects of PV system information they cover, such as environmental conditions, operational monitoring, image inspection and module materials, and propose relevant tools and ML models for processing them. In addition, we propose practices for future data collection and usage, while also discussing potential directions in data-driven research. Our aim is to enhance data utilization and publication among researchers and industry professionals, promoting a deeper understanding of the role of data in enhancing the performance and durability of PV systems.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); National Renewable Energy Laboratory (NREL), Golden, CO (United States)
Sponsoring Organization:
US Department of Energy; USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231; AC36-08GO28308
OSTI ID:
2573901
Report Number(s):
NREL/JA--2C00-95519
Journal Information:
Applied Energy, Journal Name: Applied Energy Vol. 395; ISSN 0306-2619
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (100)

Deep Learning Model to Denoise Luminescence Images of Silicon Solar Cells journal April 2023
Assessment of Reproducibility, Hysteresis, and Stability Relations in Perovskite Solar Cells Using Machine Learning journal February 2020
A review on modeling of solar photovoltaic systems using artificial neural networks, fuzzy logic, genetic algorithm and hybrid models journal June 2020
Defect object detection algorithm for electroluminescence image defects of photovoltaic modules based on deep learning journal January 2022
Daylight luminescence system for silicon solar panels based on a bias switching method journal July 2020
Drivers for the cracking of multilayer polyamide‐based backsheets in field photovoltaic modules: In‐depth degradation mapping analysis journal March 2020
Degradation of copper‐plated silicon solar cells with damp heat stress journal August 2020
Deep learning‐based automatic detection of multitype defects in photovoltaic modules and application in real production line journal January 2021
Deep‐learning‐based pipeline for module power prediction from electroluminescense measurements journal May 2021
Predicting diurnal outdoor performance and degradation of organic photovoltaics via machine learning; relating degradation to outdoor stress conditions
  • David, Tudur Wyn; Soares, Gabriela Amorim; Bristow, Noel
  • Progress in Photovoltaics: Research and Applications, Vol. 29, Issue 12 https://doi.org/10.1002/pip.3453
journal July 2021
The evolution of the ECMWF hybrid data assimilation system: Evolution of ECMWF Hybrid Data Assimilation
  • Bonavita, Massimo; Hólm, Elias; Isaksen, Lars
  • Quarterly Journal of the Royal Meteorological Society, Vol. 142, Issue 694 https://doi.org/10.1002/qj.2652
journal September 2015
The ERA5 global reanalysis journal June 2020
Process Insights into Perovskite Thin‐Film Photovoltaics from Machine Learning with In Situ Luminescence Data journal February 2023
Predicting Loss Analysis from Luminescence Images in Si Solar Cells with Convolutional Neural Networks journal October 2023
A city-scale estimation of rooftop solar photovoltaic potential based on deep learning journal September 2021
Evaluation of extreme weather impacts on utility-scale photovoltaic plant performance in the United States journal November 2021
A novel 3D-geographic information system and deep learning integrated approach for high-accuracy building rooftop solar energy potential characterization of high-density cities journal January 2022
3D-PV-Locator: Large-scale detection of rooftop-mounted photovoltaic systems in 3D journal March 2022
Analyzing the impact of design factors on solar module thermomechanical durability using interpretable machine learning techniques journal January 2025
Image based surface damage detection of renewable energy installations using a unified deep learning approach journal November 2021
A comprehensive review of unmanned aerial vehicle-based approaches to support photovoltaic plant diagnosis journal January 2024
A critical overview of privacy-preserving approaches for collaborative forecasting journal January 2021
Detection of surface defects on solar cells by fusing Multi-channel convolution neural networks journal August 2020
Predicting the device performance of the perovskite solar cells from the experimental parameters through machine learning of existing experimental results journal February 2023
DeepSolar: A Machine Learning Framework to Efficiently Construct a Solar Deployment Database in the United States journal December 2018
Monitoring of Photovoltaic System Performance Using Outdoor Suns-VOC journal January 2021
DeepSolar++: Understanding residential solar adoption trajectories with computer vision and technology diffusion models journal November 2022
Exploring the capabilities and limitations of large language models in the electric energy sector journal June 2024
Performance analysis of perovskite solar cells in 2013–2018 using machine-learning tools journal February 2019
Artificial intelligence techniques for photovoltaic applications: A review journal October 2008
Machine learning in photovoltaic systems: A review journal August 2022
Accurate one step and multistep forecasting of very short-term PV power using LSTM-TCN model journal March 2023
Evaluating neural network models in site-specific solar PV forecasting using numerical weather prediction data and weather observations journal May 2023
A robust I–V curve correction procedure for degraded photovoltaic modules journal April 2024
Review of photovoltaic degradation rate methodologies journal December 2014
The role of artificial intelligence in photo-voltaic systems design and control: A review journal October 2017
The National Solar Radiation Data Base (NSRDB) journal June 2018
Automatic hourly solar forecasting using machine learning models journal May 2019
A posteriori clear-sky identification methods in solar irradiance time series: Review and preliminary validation using sky imagers journal July 2019
The uncertainties involved in measuring national solar photovoltaic electricity generation journal March 2022
Comparison of machine learning methods for photovoltaic power forecasting based on numerical weather prediction journal June 2022
Mapping global water-surface photovoltaics with satellite images journal November 2023
Analysis of measured photovoltaic module performance for Florida, Oregon, and Colorado locations journal December 2014
A Fast All-sky Radiation Model for Solar applications (FARMS): Algorithm and performance evaluation journal October 2016
Automatic classification of defective photovoltaic module cells in electroluminescence images journal June 2019
Hotspot diagnosis for solar photovoltaic modules using a Naive Bayes classifier journal September 2019
Impact of environmental variables on the degradation of photovoltaic components and perspectives for the reliability assessment methodology journal March 2020
Moisture ingress in photovoltaic modules: A review journal August 2021
Artificial neural network based photovoltaic module diagnosis by current–voltage curve classification journal April 2022
Automated defect identification in electroluminescence images of solar modules journal August 2022
Determining circuit model parameters from operation data for PV system degradation analysis: PVPRO journal April 2023
SKIPP’D: A SKy Images and Photovoltaic Power Generation Dataset for short-term solar forecasting journal May 2023
Data fusion of complementary data sources using Machine Learning enables higher accuracy Solar Resource Maps journal April 2025
Automated classification of electroluminescence images using artificial neural networks in correlation to solar cell performance parameters journal September 2023
The Perovskite Database Project: A Perspective on Collective Data Sharing journal March 2022
An open-access database and analysis tool for perovskite solar cells based on the FAIR data principles journal December 2021
A global inventory of photovoltaic solar energy generating units journal October 2021
A harmonised, high-coverage, open dataset of solar photovoltaic installations in the UK journal November 2020
A crowdsourced dataset of aerial images with annotated solar photovoltaic arrays and installation metadata journal January 2023
Georectified polygon database of ground-mounted large-scale solar photovoltaic sites in the United States journal November 2023
Photovoltaic panel cooling by atmospheric water sorption–evaporation cycle journal May 2020
The FAIR Guiding Principles for scientific data management and stewardship journal March 2016
Visible defects detection based on UAV‐based inspection in large‐scale photovoltaic systems journal June 2017
A fluid-structure interaction solver for investigating torsional galloping in solar-tracking photovoltaic panel arrays journal November 2020
Open-source multi-year power generation, consumption, and storage data in a microgrid journal March 2021
Detection, location, and diagnosis of different faults in large solar PV system—a review journal February 2023
A Review of Degradation and Reliability Analysis of a Solar PV Module journal January 2024
Robust PV Degradation Methodology and Application journal March 2018
Quantification of Environmental Effects on PV Module Degradation: A Physics-Based Data-Driven Modeling Method journal September 2018
Automated Pipeline for Photovoltaic Module Electroluminescence Image Processing and Degradation Feature Classification journal September 2019
Data-Driven $I$–$V$ Feature Extraction for Photovoltaic Modules journal September 2019
Review: Ultraviolet Fluorescence as Assessment Tool for Photovoltaic Modules journal March 2020
Drone-Based Daylight Electroluminescence Imaging of PV Modules journal May 2020
Analytic $I_{\text{sc}}$–$V_{\text{oc}}$ Method and Power Loss Modes From Outdoor Time-Series $I$–$V$ Curves journal September 2020
Encoder–Decoder Semantic Segmentation Models for Electroluminescence Images of Thin-Film Photovoltaic Modules journal March 2021
Photoluminescence for Defect Detection on Full-Sized Photovoltaic Modules journal November 2021
Automated Defect Detection and Localization in Photovoltaic Cells Using Semantic Segmentation of Electroluminescence Images journal January 2022
Millions of Small Pressure Cycles Drive Damage in Cracked Solar Cells journal July 2022
Analysis of PV Module Power Loss and Cell Crack Effects Due to Accelerated Aging Tests and Field Exposure journal January 2023
Panel Segmentation: A Python Package for Automated Solar Array Metadata Extraction Using Satellite Imagery journal March 2023
Automatic Crack Segmentation and Feature Extraction in Electroluminescence Images of Solar Modules journal May 2023
PV Plant Equipment Labels and Layouts Can Be Validated by Analyzing Cloud Motion in Existing Plant Measurements journal May 2024
PVEL-AD: A Large-Scale Open-World Dataset for Photovoltaic Cell Anomaly Detection journal January 2023
NCEP–DOE AMIP-II Reanalysis (R-2) journal November 2002
The Pathfinder Atmospheres–Extended AVHRR Climate Dataset journal June 2014
The Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2) journal July 2017
A survey on Image Data Augmentation for Deep Learning journal July 2019
Using spatio-temporal graph neural networks to estimate fleet-wide photovoltaic performance degradation patterns journal February 2024
pvlib python: 2023 project update journal December 2023
SolarSpatialTools: A Python package for spatial solar energy analyses journal September 2024
The JRA-55 Reanalysis: General Specifications and Basic Characteristics journal January 2015
Renewable Energy Materials Properties Database: Summary report August 2023
Short-Term Forecasting of Photovoltaic Solar Power Production Using Variational Auto-Encoder Driven Deep Learning Approach journal November 2020
Label-Free Fault Detection Scheme for Inverters of PV Systems: Deep Reinforcement Learning-Based Dynamic Threshold journal February 2023
A Definition Rule for Defect Classification and Grading of Solar Cells Photoluminescence Feature Images and Estimation of CNN-Based Automatic Defect Detection Method journal May 2023
Physics-Based Method for Generating Fully Synthetic IV Curve Training Datasets for Machine Learning Classification of PV Failures journal July 2022
A Review of Monitoring Technologies for Solar PV Systems Using Data Processing Modules and Transmission Protocols: Progress, Challenges and Prospects journal July 2021
Synthetic Dataset of Electroluminescence Images of Photovoltaic Cells by Deep Convolutional Generative Adversarial Networks journal April 2023
Deep Convolutional Neural Network for Automatic Detection of Damaged Photovoltaic Cells journal May 2018
Performance Data from the NIST Photovoltaic Arrays and Weather Station journal November 2017