Efficient generalized boundary detection
Abstract
Fast, efficient, and robust compression-based methods for detecting boundaries in arbitrary datasets, including sequences (1D datasets), are desired. The methods, each employing three simple algorithms, approximate the information distance between two adjacent sliding windows within a dataset. One of the algorithms calculates an initial ordered list of subsequences; while a second algorithm updates the ordered list of subsequences by dropping a first entry and appending a last entry rather than calculating completely new ordered lists with each iteration. Large values in the distance metric are indicative of boundary locations. A smoothed z-score or a wavelet-based algorithm may then be used to locate peaks in the distance metric, thereby identifying boundary locations. An adaptive version of the method employs a collection of window sizes and corresponding weighting functions, making it more amenable to real datasets with unknown, complex, and changing structures.
- Inventors:
- Issue Date:
- Research Org.:
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Org.:
- USDOE National Nuclear Security Administration (NNSA)
- OSTI Identifier:
- 1987073
- Patent Number(s):
- 11563446
- Application Number:
- 17/219,217
- Assignee:
- National Technology & Engineering Solutions of Sandia, LLC (Albuquerque, NM)
- DOE Contract Number:
- NA0003525
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 03/31/2021
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Ting, Christina, Field, Jr., Richard V., Quach, Tu-Thach, and Bauer, Travis L. Efficient generalized boundary detection. United States: N. p., 2023.
Web.
Ting, Christina, Field, Jr., Richard V., Quach, Tu-Thach, & Bauer, Travis L. Efficient generalized boundary detection. United States.
Ting, Christina, Field, Jr., Richard V., Quach, Tu-Thach, and Bauer, Travis L. Tue .
"Efficient generalized boundary detection". United States. https://www.osti.gov/servlets/purl/1987073.
@article{osti_1987073,
title = {Efficient generalized boundary detection},
author = {Ting, Christina and Field, Jr., Richard V. and Quach, Tu-Thach and Bauer, Travis L.},
abstractNote = {Fast, efficient, and robust compression-based methods for detecting boundaries in arbitrary datasets, including sequences (1D datasets), are desired. The methods, each employing three simple algorithms, approximate the information distance between two adjacent sliding windows within a dataset. One of the algorithms calculates an initial ordered list of subsequences; while a second algorithm updates the ordered list of subsequences by dropping a first entry and appending a last entry rather than calculating completely new ordered lists with each iteration. Large values in the distance metric are indicative of boundary locations. A smoothed z-score or a wavelet-based algorithm may then be used to locate peaks in the distance metric, thereby identifying boundary locations. An adaptive version of the method employs a collection of window sizes and corresponding weighting functions, making it more amenable to real datasets with unknown, complex, and changing structures.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2023},
month = {1}
}
Works referenced in this record:
A survey of methods for time series change point detection
journal, September 2016
- Aminikhanghahi, Samaneh; Cook, Diane J.
- Knowledge and Information Systems, Vol. 51, Issue 2
Data-Driven Semi-Global Alignment Technique for Masquerade Detection in Stand-Alone and Cloud Computing Systems
patent-application, January 2017
- Kholidy, Hesham Abdelazim Ismail Mohamed; Azab, Abdulrahman; Baiardi, Fabrizio
- US Patent Application 15/213111; 20170019419
A fast compression-based similarity measure with applications to content-based image retrieval
journal, February 2012
- Cerra, Daniele; Datcu, Mihai
- Journal of Visual Communication and Image Representation, Vol. 23, Issue 2
The Similarity Metric
journal, December 2004
- Li, M.; Chen, X.; Li, X.
- IEEE Transactions on Information Theory, Vol. 50, Issue 12
An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance
conference, August 2017
- Raff, Edward; Nicholas, Charles
- Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
A universal algorithm for sequential data compression
journal, May 1977
- Ziv, J.; Lempel, A.
- IEEE Transactions on Information Theory, Vol. 23, Issue 3
Data-driven semi-global alignment technique for masquerade detection in stand-alone and cloud computing systems
patent, January 2019
- Kholidy, Hesham AbdElazim Ismail Mohamed; Azab, Abdulrahman; Baiardi, Fabrizio
- US Patent Document 10,193,904
Efficient Generalized Boundary Detection Using a Sliding Information Distance
journal, January 2020
- Field, Richard; Quach, Tu-Thach; Ting, Christina
- IEEE Transactions on Signal Processing, Vol. 68
Support Vector Machines Processing System
patent-application, March 2005
- Milenova, Boriana L.; Yarmus, Joseph S.; Campos, Marcos M.
- US Patent Application 10/927111; 20050049990
Dictionary based color image retrieval
journal, October 2008
- Macedonas, A.; Besiris, D.; Economou, G.
- Journal of Visual Communication and Image Representation, Vol. 19, Issue 7
Support Vector Machines Processing System
patent-application, May 2009
- Milenova, Boriana L.; Yarmus, Joseph S.; Campos, Marcos M.
- US Patent Application 12/356063; 20090132447
Support vector machine using iterative chunking and violators
patent, May 2011
- Milenova, Boriana; Yarmus, Joseph; Campos, Marcos M.
- US Patent Document 7,937,351
Support vector machines processing system
patent, July 2014
- Milenova, Boriana; Yarmus, Joseph; Campos, Marcos M.
- US Patent Document 8,781,978