Utilization of Synthetic Near-Infrared Spectra via Generative Adversarial Network to Improve Wood Stiffness Prediction
- USDA Forest Service, Madison, WI (United States); University of Wisconsin, Madison, WI (United States)
- University of Georgia, Athens, GA (United States)
- Oregon State University, Corvallis, OR (United States)
- USDA Forest Service, Madison, WI (United States)
- University of Wisconsin, Madison, WI (United States)
Near-infrared (NIR) spectroscopy is widely used as a nondestructive evaluation (NDE) tool for predicting wood properties. A key challenge in deploying NIR models is ensuring representative training data; large datasets can mitigate this, but often at significant cost. Machine learning and deep learning NIR models are at an even greater disadvantage because they typically require larger sample sizes for training. In this study, NIR spectra were collected to predict the modulus of elasticity (MOE) of southern pine lumber (training set = 573 samples, testing set = 145 samples). To compensate for the limited size of the training data, a generative adversarial network (GAN) was trained on the training dataset and used to generate 313, 573, and 1000 synthetic spectra. The original and enhanced datasets were used to train artificial neural networks (ANNs), convolutional neural networks (CNNs), and light gradient boosting machines (LGBMs) for MOE prediction. Overall, data augmentation using the GAN improved the coefficient of determination (R²) by up to 7.02% and reduced prediction error by up to 4.29%. ANNs and CNNs benefited more from synthetic spectra than LGBMs, which showed only slight improvement. All models performed best when 313 synthetic spectra were added to the original training data. Further additions did not improve model performance because the quality of the datapoints generated by the GAN beyond a certain threshold is poor; one of the main reasons may be the size of the initial training data fed into the GAN. LGBMs outperformed ANNs and CNNs on both the original and enhanced training datasets, which highlights the importance of selecting an appropriate machine learning or deep learning model for NIR spectral-data analysis.
The results highlighted the positive impact of GAN on the predictive performance of models utilizing NIR spectroscopy as an NDE technique and monitoring tool for wood mechanical-property evaluation. Further studies should investigate the impact of the initial size of training data, the optimal number of generated synthetic spectra, and machine learning or deep learning models that could benefit more from data augmentation using GANs.
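The evaluation quantities mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' code. The function names are hypothetical, and it makes two assumptions the abstract does not confirm: that "error of predictions" can be illustrated with root-mean-square error (RMSE), and that the reported percentages are changes relative to the model trained on the original data only. The enhanced training-set sizes follow directly from the counts given above.

```python
import numpy as np


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)


def rmse(y_true, y_pred):
    """Root-mean-square error (assumed here as the error metric)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def relative_change_pct(baseline, augmented):
    """Percent change of a metric relative to the baseline model (assumed convention)."""
    return 100.0 * (augmented - baseline) / abs(baseline)


# Enhanced training-set sizes implied by the abstract:
# 573 original spectra plus 313, 573, or 1000 synthetic spectra.
N_ORIGINAL = 573
enhanced_sizes = {n_syn: N_ORIGINAL + n_syn for n_syn in (313, 573, 1000)}
```

Given test-set predictions from a baseline model and from a model trained on an enhanced dataset, `relative_change_pct(r2_score(y, pred_base), r2_score(y, pred_aug))` would give the kind of percentage improvement the abstract reports, under the stated assumptions.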
- Research Organization:
- University of Georgia, Athens, GA (United States)
- Sponsoring Organization:
- USDA; USDOE Office of Energy Efficiency and Renewable Energy (EERE)
- Grant/Contract Number:
- EE0008911
- OSTI ID:
- 2472267
- Journal Information:
- Sensors, Vol. 24, Issue 6; ISSN 1424-8220
- Publisher:
- MDPI AG
- Country of Publication:
- United States
- Language:
- English