skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Accounting for Training Data Error in Machine Learning Applied to Earth Observations

Journal Article · · Remote Sensing
DOI:https://doi.org/10.3390/rs12061034· OSTI ID:1608215
ORCiD logo [1]; ORCiD logo [2];  [3];  [3];  [4];  [5];  [6];  [7];  [8]; ORCiD logo [9]; ORCiD logo [10]; ORCiD logo [11]; ORCiD logo [4];  [12];  [4];  [4];  [12];  [4];  [13]; ORCiD logo [4]
  1. Clark Univ., Worcester, MA (United States); Univ. of Massachusetts, Boston (United States)
  2. Radiant Earth Foundation, San Francisco, CA (United States)
  3. Univ. of California, Santa Barbara, CA (United States)
  4. Clark Univ., Worcester, MA (United States)
  5. Azavea, Inc., Philadelphia, PA (United States)
  6. Boston Univ., MA (United States)
  7. Univ. of Michigan, Ann Arbor, MI (United States)
  8. Univ. of Twente, Enschede (The Netherlands)
  9. International Inst. for Applied Systems Analysis (IIASA), Laxenburg (Austria)
  10. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
  11. Miami Univ., Oxford, OH (United States)
  12. City Univ. of New York (CUNY), NY (United States). Advanced Science Research Center; Hunter College, New York, NY (United States)
  13. Development Seed,Washington, DC (United States)

Remote sensing, or Earth Observation (EO), is increasingly used to understand Earth system dynamics and create continuous and categorical maps of biophysical properties and land cover, especially based on recent advances in machine learning (ML). ML models typically require large, spatially explicit training datasets to make accurate predictions. Training data (TD) are typically generated by digitizing polygons on high spatial-resolution imagery, by collecting in situ data, or by using pre-existing datasets. TD are often assumed to accurately represent the truth, but in practice almost always have error, stemming from (1) sample design, and (2) sample collection errors. The latter is particularly relevant for image-interpreted TD, an increasingly commonly used method due to its practicality and the increasing training sample size requirements of modern ML algorithms. TD errors can cause substantial errors in the maps created using ML algorithms, which may impact map use and interpretation. Despite these potential errors and their real-world consequences for map-based decisions, TD error is often not accounted for or reported in EO research. Here we review the current practices for collecting and handling TD. We identify the sources of TD error, and illustrate their impacts using several case studies representing different EO applications (infrastructure mapping, global surface flux estimates, and agricultural monitoring), and provide guidelines for minimizing and accounting for TD errors. To harmonize terminology, we distinguish TD from three other classes of data that should be used to create and assess ML models: training reference data, used to assess the quality of TD during data generation; validation data, used to iteratively improve models; and map reference data, used only for final accuracy assessment. We focus primarily on TD, but our advice is generally applicable to all four classes, and we ground our review in established best practices for map accuracy assessment literature. EO researchers should start by determining the tolerable levels of map error and appropriate error metrics. Next, TD error should be minimized during sample design by choosing a representative spatio-temporal collection strategy, by using spatially and temporally relevant imagery and ancillary data sources during TD creation, and by selecting a set of legend definitions supported by the data. Furthermore, TD error can be minimized during the collection of individual samples by using consensus-based collection strategies, by directly comparing interpreted training observations against expert-generated training reference data to derive TD error metrics, and by providing image interpreters with thorough application-specific training. We strongly advise that TD error is incorporated in model outputs, either directly in bias and variance estimates or, at a minimum, by documenting the sources and implications of error. TD should be fully documented and made available via an open TD repository, allowing others to replicate and assess its use. To guide researchers in this process, we propose three tiers of TD error accounting standards. Finally, we advise researchers to clearly communicate the magnitude and impacts of TD error on map outputs, with specific consideration given to the likely map audience.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; Omidyar Network’s Property Rights Initiative, now PlaceFund; National Aeronautics and Space Administration (NASA); National Science Foundation (NSF); National Institute of Standards and Technology (NIST); New York State Department of Environmental Conservation
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1608215
Journal Information:
Remote Sensing, Vol. 12, Issue 6; Conference: Quantifying Error in Training Data for Mapping and Monitoring the Earth System, Worcester, MA (United States), 8-9 Jan 2019; ISSN 2072-4292
Publisher:
MDPICopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 25 works
Citation information provided by
Web of Science

References (192)

Influence of carbon mapping and land change modelling on the prediction of carbon emissions from deforestation journal June 2012
Orthorectification of VHR optical satellite data exploiting the geometric accuracy of TerraSAR-X data journal January 2011
Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification journal October 2004
Global land cover classifications at 8 km spatial resolution: The use of training data derived from Landsat imagery in decision tree classifiers journal January 1998
The impact of imperfect ground reference data on the accuracy of land cover change estimation journal June 2009
Mapping smallholder and large-scale cropland dynamics with a flexible classification system and pixel-based composites in an emerging frontier of Mozambique journal March 2020
Citizens as sensors: the world of volunteered geography journal November 2007
Object based image analysis for remote sensing journal January 2010
A Pixel-Based Landsat Compositing Algorithm for Large Area Land Cover Mapping journal October 2013
Exploring the Use of Google Earth Imagery and Object-Based Methods in Land Use/Cover Mapping journal November 2013
Artificial neural network response to mixed pixels in coarse-resolution satellite data journal December 1996
On the nature of models in remote sensing journal October 1986
Mapping land-cover modifications over large areas: A comparison of machine learning algorithms journal May 2008
Deep learning in remote sensing applications: A meta-analysis and review journal June 2019
Status of land cover classification accuracy assessment journal April 2002
Water, Energy, and Carbon with Artificial Neural Networks (WECANN): a statistically based estimate of global surface turbulent fluxes and gross primary productivity using solar-induced fluorescence journal January 2017
Satellite-based assessment of yield variation and its determinants in smallholder African systems journal February 2017
Land-Use Mapping in a Mixed Urban-Agricultural Arid Landscape Using Object-Based Image Analysis: A Case Study from Maricopa, Arizona journal June 2014
Strategies for Incorporating High-Resolution Google Earth Databases to Guide and Validate Classifications: Understanding Deforestation in Borneo journal June 2011
A survey of image classification methods and techniques for improving classification performance journal March 2007
Exploring diversity in ensemble classification: Applications in large area land cover mapping journal July 2017
Recommendations for using the relative operating characteristic (ROC) journal January 2014
New refinements and validation of the MODIS Land-Surface Temperature/Emissivity products journal January 2008
A Novel Context-Sensitive Semisupervised SVM Classifier Robust to Mislabeled Training Samples journal July 2009
High-Resolution Global Maps of 21st-Century Forest Cover Change journal November 2013
Mapping global cropland and field size journal January 2015
Automatic identification of building types based on topographic databases – a comparison of different data sources journal January 2015
Land cover mapping of wetland areas in an agricultural landscape using SAR and Landsat imagery journal May 2009
Harshness in image classification accuracy assessment journal May 2008
Large-scale land cover mapping with the integration of multi-source information based on the Dempster–Shafer theory journal January 2012
Classification in the Presence of Label Noise: A Survey journal May 2014
First operational BRDF, albedo nadir reflectance products from MODIS journal November 2002
Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points journal June 2012
The Sensitivity of Mapping Methods to Reference Data Quality: Training Supervised Image Classifications with Imperfect Reference Data journal November 2016
A neural network approach using multi-scale textural metrics from very high-resolution panchromatic imagery for urban land-use classification journal June 2009
Assessing geometric accuracy of the orthorectification process from GeoEye-1 and WorldView-2 panchromatic images journal April 2013
Effect of errors in ground truth on classification accuracy journal August 2009
Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery journal December 2017
Spatiotemporal characteristics, patterns, and causes of land-use changes in China since the late 1980s journal January 2014
Good practices for estimating area and assessing accuracy of land change journal May 2014
A review of accuracy assessment for object-based image analysis: From per-pixel to per-polygon approaches journal July 2018
Remote sensing of impervious surfaces in the urban areas: Requirements, methods, and trends journal February 2012
A scalable satellite-based crop yield mapper journal July 2015
Conflation of expert and crowd reference data to validate global binary thematic maps journal February 2019
Towards a set of agrosystem-specific cropland mapping methods to address the global cropland diversity journal June 2016
Making better use of accuracy data in land change studies: Estimating accuracy and area and quantifying uncertainty using stratified estimation journal February 2013
Utilizing Temporally Invariant Calibration Sites to Classify Multiple Dates and Types of Satellite Imagery journal February 2011
Object-based Vegetation Mapping in the Kissimmee River Watershed Using HyMap Data and Machine Learning Techniques journal January 2013
Sub-pixel land-cover mapping with improved fraction images upon multiple-point simulation journal June 2013
On the use of dimensioned measures of error to evaluate the performance of spatial interpolators journal January 2006
Using active learning to adapt remote sensing image classifiers journal September 2011
Carbon consequences of land cover change and expansion of urban lands: A case study in the Seattle metropolitan region journal October 2011
Incorporating Land Use Mapping and Participation in Jordan: An Approach to Sustainable Management of Two Mountainous Areas journal February 2008
Comparison of methods for land-use classification incorporating remote sensing and GIS inputs journal April 2011
A fuzzy classification of sub-urban land cover from remotely sensed imagery journal January 1998
The modifiable areal unit problem and implications for landscape ecology journal June 1996
Criteria to Confirm Models that Simulate Deforestation and Carbon Disturbance journal September 2018
Comment on “Tropical forests are a net carbon source based on aboveground measurements of gain and loss” journal January 2019
Sampling designs for accuracy assessment of land cover journal September 2009
A platform for crowdsourcing the creation of representative, accurate landcover maps journal June 2016
A framework for selecting appropriate remotely sensed data dimensions for environmental monitoring and management journal November 1998
Capturing rapid land surface dynamics with Collection V006 MODIS BRDF/NBAR/Albedo (MCD43) products journal March 2018
A review of assessing the accuracy of classifications of remotely sensed data journal July 1991
Spatio-temporal dynamics and evolution of land use change and landscape pattern in response to rapid urbanization journal September 2009
Challenges in using land use and land cover data for global change studies: LAND USE AND LAND COVER DATA FOR GLOBAL CHANGE STUDIES journal January 2011
Accuracy of forest inventory mapping: Some implications for boreal forest management journal November 2007
Development of a global land cover characteristics database and IGBP DISCover from 1 km AVHRR data journal January 2000
Land Surface Temperature Retrieval Methods From Landsat-8 Thermal Infrared Sensor Data journal October 2014
An Enhanced TIMESAT Algorithm for Estimating Vegetation Phenology Metrics From MODIS Data journal June 2011
A global map of rainfed cropland areas (GMRCA) at the end of last millennium using remote sensing journal April 2009
Land cover change assessment using decision trees, support vector machines and maximum likelihood classification algorithms journal February 2010
Smallholder maize area and yield mapping at national scales with Google Earth Engine journal July 2019
Cartography: uncertainty, interventions, and dynamic display journal June 2006
Assessing the accuracy of land cover change with imperfect ground reference data journal October 2010
A Coefficient of Agreement for Nominal Scales journal April 1960
A global reference database of crowdsourced cropland data collected using the Geo-Wiki platform journal September 2017
Modeling Percent Tree Canopy Cover: A Pilot Study journal July 2012
MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets journal January 2010
Edge effects enhance carbon uptake and its vulnerability to climate change in temperate broadleaf forests journal December 2016
Global characterization and monitoring of forest cover using Landsat data: opportunities and challenges journal August 2012
Soil Moisture Remote Sensing: State-of-the-Science journal January 2017
A generalized cross‐tabulation matrix to compare soft‐classified maps at multiple resolutions journal January 2006
A review of methods for the assessment of prediction errors in conservation presence/absence models journal March 1997
From Guesstimates to GPStimates: Land Area Measurement and Implications for Agricultural Analysis journal May 2015
Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance journal January 2005
Characterizing the Spatial and Temporal Availability of Very High Resolution Satellite Imagery in Google Earth and Microsoft Bing Maps as a Source of Reference Data journal October 2018
Combining satellite imagery and machine learning to predict poverty journal August 2016
Range of Categorical Associations for Comparison of Maps with Mixed Pixels journal August 2009
Crowdsourcing In-Situ Data on Land Cover and Land Use Using Gamification and Mobile Technology journal November 2016
Quality control and assessment of interpreter consistency of annual land cover reference data in an operational national monitoring program journal March 2020
Hybrid object-based approach for land use/land cover mapping using high spatial resolution imagery journal June 2011
Recent Advances in the Measurement Error Literature journal October 2016
A scalable approach to mapping annual land cover at 250 m using MODIS time series data: A case study in the Dry Chaco ecoregion of South America journal November 2010
Global land cover mapping at 30m resolution: A POK-based operational approach journal May 2015
Detecting trends in forest disturbance and recovery using yearly Landsat time series: 1. LandTrendr — Temporal segmentation algorithms journal December 2010
Long-term land cover dynamics by multi-temporal classification across the Landsat-5 record journal January 2013
An ontology of slums for image-based classification journal March 2012
Exploring issues of training data imbalance and mislabelling on random forest performance for large area land cover classification using the ensemble margin journal July 2015
Comparison of Office and Field Techniques for Validating Landscape Change Classification in Pacific Northwest National Parks journal December 2018
Towards a Reproducible LULC Hierarchical Class Legend for Use in the Southwest of Pará State, Brazil: A Comparison with Remote Sensing Data-Driven Hierarchies journal May 2018
Component intensities to relate difference by category with difference overall journal May 2019
An Evaluation of Bagging, Boosting, and Random Forests for Land-Cover Classification in Cape Cod, Massachusetts, USA journal September 2012
Improved land cover mapping using high resolution multiangle 8-band WorldView-2 satellite remote sensing data journal January 2013
On the Use of Unmanned Aerial Systems for Environmental Monitoring journal April 2018
Evaluating effectiveness of down-sampling for stratified designs and unbalanced prevalence in Random Forest models of tree species distributions in Nevada journal May 2012
While Boolean sets non-gently rip: A theoretical framework on fuzzy sets for mapping landscape patterns journal March 2010
Supervised methods of image segmentation accuracy assessment in land cover mapping journal February 2018
Extended triple collocation: Estimating errors and correlation coefficients with respect to an unknown target: EXTENDED TRIPLE COLLOCATION journal September 2014
An assessment of the effectiveness of a random forest classifier for land-cover classification journal January 2012
Mapping vegetation in a heterogeneous mountain rangeland using landsat data: an alternative method to define and classify land-cover units journal July 2004
Implementation of machine-learning classification in remote sensing: an applied review journal January 2018
Multiscale analysis and validation of the MODIS LAI productI. Uncertainty assessment journal December 2002
Sub-pixel confusion–uncertainty matrix for assessing soft classifications journal March 2008
The factor of scale in remote sensing journal April 1987
Ground reference data error and the mis-estimation of the area of land cover change as a function of its abundance journal August 2013
Radiative forcing and temperature response to changes in urban albedos and associated CO 2 offsets journal January 2010
An assessment of support vector machines for land cover classification journal January 2002
Land cover mapping using time series HJ-1/CCD data journal May 2014
A Transdisciplinary Review of Deep Learning Research and Its Relevance for Water Resources Scientists journal November 2018
ImageNet Large Scale Visual Recognition Challenge journal April 2015
Components of information for multiple resolution comparison between maps that share a real variable journal October 2007
Statistics notes: Measurement error journal June 1996
Land change for all municipalities in Latin America and the Caribbean assessed from 250-m MODIS imagery (2001–2010) journal November 2012
Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS): Assessing the accuracy of distribution models journal September 2006
Tree Species Abundance Predictions in a Tropical Agricultural Landscape with a Supervised Classification Model and Imbalanced Data journal February 2016
Nominal 30-m Cropland Extent Map of Continental Africa by Integrating Pixel-Based and Object-Based Algorithms Using Sentinel-2 and Landsat-8 Data on Google Earth Engine journal October 2017
Land cover mapping of the tropical savanna region in Brazil journal June 2009
A generalized computer vision approach to mapping crop fields in heterogeneous agricultural landscapes journal June 2016
A global dataset of crowdsourced land cover and land use reference data journal June 2017
Importance of Matrix Construction for Multiple-Resolution Categorical Map Comparison journal July 2008
Accounting for urban biogenic fluxes in regional carbon budgets journal August 2017
Estimating the global distribution of field size using crowdsourcing journal November 2018
The total operating characteristic to measure diagnostic ability for multiple thresholds journal November 2013
Optimizing Remotely Sensed Solutions for Monitoring, Modeling, and Managing Coastal Environments journal August 2000
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification journal July 2019
What is Land Cover? journal April 2005
Uncovering Ecological Patterns with Convolutional Neural Networks journal August 2019
High-resolution mapping of global surface water and its long-term changes journal December 2016
Learning Multiscale Deep Features for High-Resolution Satellite Image Scene Classification journal January 2018
Mapping and interpreting historical land cover/land use changes in a Natura 2000 site using earth observational data: The case of Nestos delta, Greece journal January 2011
Mapping Urban Land Use by Using Landsat Images and Open Social Data journal February 2016
Ambiguities inherent in sums-of-squares-based error statistics journal January 2009
Fully fuzzy supervised classification of land cover from remotely sensed imagery with an artificial neural network journal December 1997
Global land change from 1982 to 2016 journal August 2018
Key issues in rigorous accuracy assessment of land cover products journal September 2019
Learning Aerial Image Segmentation From Online Maps journal November 2017
Geo-Wiki: An online platform for improving global land cover journal May 2012
Global forecasts of urban expansion to 2030 and direct impacts on biodiversity and carbon pools journal September 2012
Lessons learned implementing an operational continuous United States national land change monitoring capability: The Land Change Monitoring, Assessment, and Projection (LCMAP) approach journal March 2020
Validating MODIS Terrestrial Ecology Products journal October 1999
Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data journal December 2012
Explaining the unsuitability of the kappa coefficient in the assessment and comparison of the accuracy of thematic maps obtained by image classification journal March 2020
Variability of operator performance in remote-sensing image interpretation: the importance of human and external factors journal January 2014
Visualizing Uncertain Information journal June 1992
Design and Interpretation of Intensity Analysis Illustrated by Land Change in Central Kalimantan, Indonesia journal July 2013
Assessing the global warming potential of human settlement expansion in a mesic temperate landscape from 2005 to 2050 journal March 2016
Land Use and Land Cover Mapping in the Brazilian Amazon Using Polarimetric Airborne P-Band SAR Data journal October 2008
Using volunteered geographic information (VGI) in design-based statistical inference for area estimation and accuracy assessment of land cover journal June 2018
The effects of imperfect reference data on remote sensing-assisted estimators of land cover class proportions journal August 2018
Review article Synergy in remote sensing-what's in a pixel? journal January 1998
Uncertainty analysis for image interpretations of urban slums journal November 2016
A large-area, spatially continuous assessment of land cover map error and its impact on downstream analyses journal October 2017
Collect Earth: Land Use and Land Cover Assessment through Augmented Visual Interpretation journal September 2016
The dimensions of global urban expansion: Estimates and projections for all countries, 2000–2050 journal February 2011
The spatial and temporal domains of modern ecology journal April 2018
Hierarchical mapping of annual global land cover 2001 to present: The MODIS Collection 6 Land Cover product journal March 2019
Intensity analysis to unify measurements of size and stationarity of land changes by interval, category, and transition journal May 2012
A critical synthesis of remotely sensed optical image change detection techniques journal April 2015
Death to Kappa: birth of quantity disagreement and allocation disagreement for accuracy assessment journal August 2011
Evaluation of the VIIRS BRDF, Albedo and NBAR products suite and an assessment of continuity with the long term MODIS record journal November 2017
Land use and land cover change in Greater Dhaka, Bangladesh: Using remote sensing to promote sustainable urbanization journal July 2009
Evaluation of land surface phenology from VIIRS data using time series of PhenoCam imagery journal June 2018
Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources journal December 2017
Uncertainty in ecosystem mapping by remote sensing journal January 2013
Integrating OpenStreetMap crowdsourced data and Landsat time-series imagery for rapid land use/land cover (LULC) mapping: Case study of the Laguna de Bay area of the Philippines journal February 2016
Global vegetation phenology from Moderate Resolution Imaging Spectroradiometer (MODIS): Evaluation of global patterns and comparison with in situ measurements: GLOBAL PHENOLOGY FROM MODIS journal December 2006
How good is good enough? Data requirements for reliable crop yield simulations and yield-gap analysis journal June 2015
Identifying Mislabeled Training Data journal July 1999
Smallholder crop area mapped with wall-to-wall WorldView sub-meter panchromatic image texture: A test case for Tigray, Ethiopia journal June 2018
Fuzzy set theory and thematic maps: accuracy assessment and area estimation journal March 2000
Random Forests journal January 2001
Machine learning in geosciences and remote sensing journal January 2016
Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information journal February 2014
Comparing the Quality of Crowdsourced Data Contributed by Expert and Non-Experts journal July 2013
Sources of error in accuracy assessment of thematic land-cover maps in the Brazilian Amazon journal March 2004
Assessing different remote sensing techniques to detect land use/cover changes in the eastern Mediterranean journal February 2009
A global reference database of crowdsourced cropland data collected using the Geo-Wiki platform dataset January 2017
Implementation of machine-learning classification in remote sensing: an applied review dataset January 2018
Edge effects enhance carbon uptake and its vulnerability to climate change in temperate broadleaf forests dataset January 2016
Bigearthnet: A Large-Scale Benchmark Archive for Remote Sensing Image Understanding conference July 2019
Estimating the Global Distribution of Field Size using Crowdsourcing dataset January 2018
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification dataset January 2018

Cited By (1)

Using satellite imagery to understand and promote sustainable development preprint January 2020