DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A globally sampled high-resolution hand-labeled validation dataset for evaluating surface water extent maps

Journal Article · · Earth System Science Data (Online)
 [1];  [2];  [3];  [3];  [3];  [3];  [3];  [4]
  1. Pacific Northwest National Laboratory (PNNL), Richland, WA (United States); Univ. of Arizona, Tucson, AZ (United States)
  2. NASA Goddard Space Flight Center (GSFC), Greenbelt, MD (United States)
  3. Univ. of Arizona, Tucson, AZ (United States)
  4. Pacific Northwest National Laboratory (PNNL), Richland, WA (United States); Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)

Effective monitoring of global water resources is increasingly critical due to climate change and population growth. Advancements in remote sensing technology, specifically in spatial, spectral, and temporal resolutions, are revolutionizing water resource monitoring, leading to more frequent and high-quality surface water extent maps using various techniques such as traditional image processing and machine learning algorithms. However, satellite imagery datasets contain trade-offs that result in inconsistencies in performance, such as disparities in measurement principles between optical (e.g., Sentinel-2) and radar (e.g., Sentinel-1) sensors and differences in spatial and spectral resolutions among optical sensors. Therefore, developing accurate and robust surface water mapping solutions requires independent validations from multiple datasets to identify potential biases within the imagery and algorithms. However, high-quality validation datasets are expensive to build, and few contain information on water resources. For this purpose, we introduce a globally sampled, high-spatial-resolution dataset labeled using 3 m PlanetScope imagery. Our surface water extent dataset comprises 100 images, each with a size of 1024×1024 pixels, which were sampled using a stratified random sampling strategy covering all 14 biomes. We highlighted urban and rural regions, lakes, and rivers, including braided rivers and coastal regions. We evaluated two surface water extent mapping methods using our dataset – Dynamic World, based on Sentinel-2, and the NASA IMPACT model, based on Sentinel-1. Dynamic World achieved a mean intersection over union (IoU) of 72.16 % and F1 score of 79.70 %, while the NASA IMPACT model had a mean IoU of 57.61 % and F1 score of 65.79 %. Performance varied substantially across biomes, highlighting the importance of evaluating models on diverse landscapes to assess their generalizability and robustness. Our dataset can be used to analyze satellite products and methods, providing insights into their advantages and drawbacks. Our dataset offers a unique tool for analyzing satellite products, aiding the development of more accurate and robust surface water monitoring solutions. The dataset can be accessed via https://doi.org/10.25739/03nt-4f29.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE; National Aeronautics and Space Administration (NASA)
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
2467419
Report Number(s):
PNNL-SA--204138
Journal Information:
Earth System Science Data (Online), Journal Name: Earth System Science Data (Online) Journal Issue: 9 Vol. 16; ISSN 1866-3516
Publisher:
Copernicus PublicationsCopyright Statement
Country of Publication:
United States
Language:
English

References (24)

Surface water detection and delineation using remote sensing images: a review of methods and algorithms journal July 2020
Overly optimistic prediction results on imbalanced data: a case study of flaws and benefits when applying over-sampling journal January 2021
Data and its (dis)contents: A survey of dataset development and use in machine learning research journal November 2021
On the merging of optical and SAR satellite imagery for surface water mapping applications journal June 2018
PlanetScope contributions compared to Sentinel-2, and Landsat-8 for LULC mapping journal August 2022
Virtual constellations for global terrestrial monitoring journal December 2015
Water observations from space: Mapping surface water from 25 years of Landsat imagery across Australia journal March 2016
Towards a global seasonal and permanent reference water product from Sentinel-1/2 data for improved flood mapping journal September 2022
Semantic segmentation of water bodies in very high-resolution satellite and aerial images journal March 2023
High-resolution mapping of global surface water and its long-term changes journal December 2016
Increasing drought under global warming in observations and models journal August 2012
Satellite imaging reveals increased proportion of population exposed to floods journal August 2021
Dynamic World, Near real-time global 10 m land use land cover mapping journal June 2022
Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels conference June 2016
Sen1Floods11: a georeferenced dataset to train and test deep learning flood algorithms for Sentinel-1 conference June 2020
Inferring the past: a combined CNN–LSTM deep learning framework to fuse satellites for historical inundation mapping conference June 2023
Improving flood mapping in arid areas using SENTINEL-1 time series data conference July 2017
Bigearthnet: A Large-Scale Benchmark Archive for Remote Sensing Image Understanding conference July 2019
Surface Water Mapping by Deep Learning journal November 2017
Terrestrial Ecoregions of the World: A New Map of Life on Earth journal January 2001
An improved elevation dataset for climate and ice-sheet modelling: validation with satellite imagery journal January 1997
Performance Evaluation of Sentinel-2 and Landsat 8 OLI Data for Land Cover/Use Classification Using a Comparison between Machine Learning Algorithms journal April 2021
Satellite Detection of Surface Water Extent: A Review of Methodology journal April 2022
Flood Detection in time Series of Optical and sar Images journal August 2020