skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Hybrid Classification Scheme for Mining Multisource Geospatial Data

Abstract

Supervised learning methods such as Maximum Likelihood (ML) are often used in land cover (thematic) classification of remote sensing imagery. ML classifier relies exclusively on spectral characteristics of thematic classes whose statistical distributions are often overlapping. The spectral response distributions of thematic classes are dependent on many factors including elevation, soil types, and atmospheric conditions present at the time of data acquisition. A second problem with statistical classifiers is the requirement of large number of accurate training samples, which are often costly and time consuming to acquire over large geographic regions. With the increasing availability of geospatial databases, it is possible to exploit the knowledge derived from these ancillary datasets to improve classification accuracies even when the class distributions are highly overlapping. Likewise newer semi-supervised techniques can be adopted to improve the parameter estimates of statistical model by utilizing a large number of easily available unlabeled training samples. Unfortunately there is no convenient multivariate statistical model that can be employed for mulitsource geospatial databases. In this paper we present a hybrid semi-supervised learning algorithm that effectively exploits freely available unlabeled training samples from multispectral remote sensing images and also incorporates ancillary geospatial databases. We have conducted several experiments on realmore » datasets, and our new hybrid approach shows over 15% improvement in classification accuracy over conventional classification schemes.« less

Authors:
 [1];  [1]
  1. ORNL
Publication Date:
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
Work for Others (WFO)
OSTI Identifier:
978780
DOE Contract Number:  
DE-AC05-00OR22725
Resource Type:
Conference
Resource Relation:
Conference: ICDM International Workshop on Spatial and Spatio-temporal Data Mining (SSTDM), Omaha, NE, USA, 20071028, 20071028
Country of Publication:
United States
Language:
English
Subject:
54 ENVIRONMENTAL SCIENCES; ACCURACY; ALGORITHMS; AVAILABILITY; CLASSIFICATION; DATA ACQUISITION; LEARNING; MINING; REMOTE SENSING; SOILS; SPECTRAL RESPONSE; STATISTICAL MODELS; TRAINING; MLC; EM; Semisupervised Learning

Citation Formats

Vatsavai, Raju, and Bhaduri, Budhendra L. A Hybrid Classification Scheme for Mining Multisource Geospatial Data. United States: N. p., 2007. Web.
Vatsavai, Raju, & Bhaduri, Budhendra L. A Hybrid Classification Scheme for Mining Multisource Geospatial Data. United States.
Vatsavai, Raju, and Bhaduri, Budhendra L. Mon . "A Hybrid Classification Scheme for Mining Multisource Geospatial Data". United States. doi:.
@article{osti_978780,
title = {A Hybrid Classification Scheme for Mining Multisource Geospatial Data},
author = {Vatsavai, Raju and Bhaduri, Budhendra L},
abstractNote = {Supervised learning methods such as Maximum Likelihood (ML) are often used in land cover (thematic) classification of remote sensing imagery. ML classifier relies exclusively on spectral characteristics of thematic classes whose statistical distributions are often overlapping. The spectral response distributions of thematic classes are dependent on many factors including elevation, soil types, and atmospheric conditions present at the time of data acquisition. A second problem with statistical classifiers is the requirement of large number of accurate training samples, which are often costly and time consuming to acquire over large geographic regions. With the increasing availability of geospatial databases, it is possible to exploit the knowledge derived from these ancillary datasets to improve classification accuracies even when the class distributions are highly overlapping. Likewise newer semi-supervised techniques can be adopted to improve the parameter estimates of statistical model by utilizing a large number of easily available unlabeled training samples. Unfortunately there is no convenient multivariate statistical model that can be employed for mulitsource geospatial databases. In this paper we present a hybrid semi-supervised learning algorithm that effectively exploits freely available unlabeled training samples from multispectral remote sensing images and also incorporates ancillary geospatial databases. We have conducted several experiments on real datasets, and our new hybrid approach shows over 15% improvement in classification accuracy over conventional classification schemes.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {Mon Jan 01 00:00:00 EST 2007},
month = {Mon Jan 01 00:00:00 EST 2007}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: