skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Adaptive Framework for Classification and Novel Class Detection over Evolving Data Streams with Limited Labeled Data.

Program Document ·
OSTI ID:1427231
 [1];  [1];  [1];  [2]
  1. Univ. of Texas, Dallas, TX (United States)
  2. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

Most approaches to classifying evolving data streams either divide the stream of data into fixed-size chunks or use gradual forgetting to address the problems of infinite length and concept drift. Finding the fixed size of the chunks or choosing a forgetting rate without prior knowledge about time-scale of change is not a trivial task. As a result, these approaches suffer from a trade-off between performance and sensitivity. To address this problem, we present a framework which uses change detection techniques on the classifier performance to determine chunk boundaries dynamically. Though this framework exhibits good performance, it is heavily dependent on the availability of true labels of data instances. However, labeled data instances are scarce in realistic settings and not readily available. Therefore, we present a second framework which is unsupervised in nature, and exploits change detection on classifier confidence values to determine chunk boundaries dynamically. In this way, it avoids the use of labeled data while still addressing the problems of infinite length and concept drift. Moreover, both of our proposed frameworks address the concept evolution problem by detecting outliers having similar values for the attributes. We provide theoretical proof that our change detection method works better than other state-of-the-art approaches in this particular scenario. Results from experiments on various benchmark and synthetic data sets also show the efficiency of our proposed frameworks.

Research Organization:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
AC04-94AL85000
OSTI ID:
1427231
Report Number(s):
SAND-2015-7513J; 603465
Country of Publication:
United States
Language:
English

Similar Records

Transfer Learning for Event Detection From PMU Measurements With Scarce Labels
Journal Article · Fri Jan 01 00:00:00 EST 2021 · IEEE Access · OSTI ID:1427231

Anomaly detection enhanced classification in computer intrusion detection
Conference · Tue Jan 01 00:00:00 EST 2002 · OSTI ID:1427231

Semantic role labeling for protein transport predicates
Journal Article · Wed Jun 11 00:00:00 EDT 2008 · BMC Bioinformatics · OSTI ID:1427231

Related Subjects