Source identification by non-negative matrix factorization combined with semi-supervised clustering
Abstract
Machine-learning methods and apparatus are provided to solve blind source separation problems with an unknown number of sources and having a signal propagation model with features such as wave-like propagation, medium-dependent velocity, attenuation, diffusion, and/or advection, between sources and sensors. In exemplary embodiments, multiple trials of non-negative matrix factorization are performed for a fixed number of sources, with selection criteria applied to determine successful trials. A semi-supervised clustering procedure is applied to trial results, and the clustering results are evaluated for robustness using measures for reconstruction quality and cluster separation. The number of sources is determined by comparing these measures for different trial numbers of sources. Source locations and parameters of the signal propagation model can also be determined. Disclosed methods are applicable to a wide range of spatial problems including chemical dispersal, pressure transients, and electromagnetic signals, and also to non-spatial problems such as cancer mutation.
- Inventors:
- Issue Date:
- Research Org.:
- Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 2222289
- Patent Number(s):
- 11748657
- Application Number:
- 17/020,504
- Assignee:
- Triad National Security, LLC (Los Alamos, NM)
- Patent Classifications (CPCs):
-
G - PHYSICS G06 - COMPUTING G06F - ELECTRIC DIGITAL DATA PROCESSING
G - PHYSICS G06 - COMPUTING G06N - COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- DOE Contract Number:
- AC52-06NA25396
- Resource Type:
- Patent
- Resource Relation:
- Patent File Date: 09/14/2020
- Country of Publication:
- United States
- Language:
- English
Citation Formats
Alexandrov, Boian S., Alexandrov, Ludmil B., Iliev, Filip L., Stanev, Valentin G., and Vesselinov, Velimir V. Source identification by non-negative matrix factorization combined with semi-supervised clustering. United States: N. p., 2023.
Web.
Alexandrov, Boian S., Alexandrov, Ludmil B., Iliev, Filip L., Stanev, Valentin G., & Vesselinov, Velimir V. Source identification by non-negative matrix factorization combined with semi-supervised clustering. United States.
Alexandrov, Boian S., Alexandrov, Ludmil B., Iliev, Filip L., Stanev, Valentin G., and Vesselinov, Velimir V. Tue .
"Source identification by non-negative matrix factorization combined with semi-supervised clustering". United States. https://www.osti.gov/servlets/purl/2222289.
@article{osti_2222289,
title = {Source identification by non-negative matrix factorization combined with semi-supervised clustering},
author = {Alexandrov, Boian S. and Alexandrov, Ludmil B. and Iliev, Filip L. and Stanev, Valentin G. and Vesselinov, Velimir V.},
abstractNote = {Machine-learning methods and apparatus are provided to solve blind source separation problems with an unknown number of sources and having a signal propagation model with features such as wave-like propagation, medium-dependent velocity, attenuation, diffusion, and/or advection, between sources and sensors. In exemplary embodiments, multiple trials of non-negative matrix factorization are performed for a fixed number of sources, with selection criteria applied to determine successful trials. A semi-supervised clustering procedure is applied to trial results, and the clustering results are evaluated for robustness using measures for reconstruction quality and cluster separation. The number of sources is determined by comparing these measures for different trial numbers of sources. Source locations and parameters of the signal propagation model can also be determined. Disclosed methods are applicable to a wide range of spatial problems including chemical dispersal, pressure transients, and electromagnetic signals, and also to non-spatial problems such as cancer mutation.},
doi = {},
journal = {},
number = ,
volume = ,
place = {United States},
year = {2023},
month = {9}
}
Works referenced in this record:
A blind source separation technique using second-order statistics
journal, January 1997
- Belouchrani, A.; Abed-Meraim, K.; Cardoso, J. -F.
- IEEE Transactions on Signal Processing, Vol. 45, Issue 2
A Machine Learning Approach for Locating Acoustic Emission
journal, October 2010
- Ince, Nf; Kao, Chu-Shu; Kaveh, M.
- EURASIP Journal on Advances in Signal Processing, Vol. 2010, Issue 1
Clock-like mutational processes in human somatic cells
journal, November 2015
- Alexandrov, Ludmil B.; Jones, Philip H.; Wedge, David C.
- Nature Genetics, Vol. 47, Issue 12
Mutational signatures associated with tobacco smoking in human cancer
journal, November 2016
- Alexandrov, L. B.; Ju, Y. S.; Haase, K.
- Science, Vol. 354, Issue 6312
Learning the parts of objects by non-negative matrix factorization
journal, October 1999
- Lee, Daniel D.; Seung, H. Sebastian
- Nature, Vol. 401, Issue 6755
Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture
journal, July 1991
- Jutten, Christian; Herault, Jeanny
- Signal Processing, Vol. 24, Issue 1
Method and System for Fast Tensor-Vector Multiplication
patent-application, June 2014
- Dourbal, Pavel
- US Patent Application 13/726367; 20140181171
Reference-free time-based localization for an asynchronous target
journal, January 2012
- Wang, Yiyin; Leus, Geert
- EURASIP Journal on Advances in Signal Processing, Vol. 2012, Issue 1
Algorithm AS 136: A K-Means Clustering Algorithm
journal, January 1979
- Hartigan, J. A.; Wong, M. A.
- Applied Statistics, Vol. 28, Issue 1
ToQ.jl: A high-level programming language for D-Wave machines based on Julia
conference, September 2016
- O'Malley, Daniel; Vesselinov, Velimir V.
- 2016 IEEE High Performance Extreme Computing Conference (HPEC)
Integrating Volterra Series Model And Deep Neural Networks To Equalize Nonlinear Power amplifiers
patent-application, September 2022
- Li, Xiaohua; Thompson, Robert
- US Patent Application 17/234102; 11451419 B2
Novel Multi-layer Non-negative Tensor Factorization with Sparsity Constraints
book, January 2007
- Cichocki, Andrzej; Zdunek, Rafal; Choi, Seungjin
- Adaptive and Natural Computing Algorithms
An Approach to Bayesian Sensitivity Analysis
journal, November 1996
- Weiss, Robert
- Journal of the Royal Statistical Society: Series B (Methodological), Vol. 58, Issue 4
Fully Scalable Methods for Distributed Tensor Factorization
journal, January 2017
- Shin, Kijung; Sael, Lee; Kang, U.
- IEEE Transactions on Knowledge and Data Engineering, Vol. 29, Issue 1
Shifted factor analysis?Part II: Algorithms
journal, January 2003
- Hong, Sungjin; Harshman, Richard A.
- Journal of Chemometrics, Vol. 17, Issue 7
Automatic Analysis of Composite Physical Signals Using Non-Negative Factorization and Information Criterion
journal, March 2012
- Watanabe, Kenji; Hidaka, Akinori; Otsu, Nobuyuki
- PLoS ONE, Vol. 7, Issue 3
Shifted factor analysis?Part I: Models and properties
journal, January 2003
- Harshman, Richard A.; Hong, Sungjin; Lundy, Margaret E.
- Journal of Chemometrics, Vol. 17, Issue 7
Statistical source identification of metals in groundwater exposed to industrial contamination
journal, May 2007
- Tariq, Saadia R.; Shah, Munir H.; Shaheen, N.
- Environmental Monitoring and Assessment, Vol. 138, Issue 1-3
Multivariate statistical and GIS-based approach to identify heavy metal sources in soils
journal, October 2001
- Facchinelli, A.; Sacchi, E.; Mallen, L.
- Environmental Pollution, Vol. 114, Issue 3
On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming
journal, April 2005
- Wächter, Andreas; Biegler, Lorenz T.
- Mathematical Programming, Vol. 106, Issue 1
Understanding the origins of human cancer
journal, December 2015
- Alexandrov, L. B.
- Science, Vol. 350, Issue 6265
Artificial intelligence for management and control of pollution minimization and mitigation processes
journal, March 2003
- Chan, Christine W.; Huang, Guo H.
- Engineering Applications of Artificial Intelligence, Vol. 16, Issue 2
Bayesian Non-negative Matrix Factorization
book, January 2009
- Schmidt, Mikkel N.; Winther, Ole; Hansen, Lars Kai
- Independent Component Analysis and Signal Separation
Machine Learning Approach for Contamination Source Identification in Water Distribution Systems
conference, July 2012
- Rasekh, Amin; Brumbelow, Kelly
- World Environmental And Water Resources Congress 2012
Silhouettes: A graphical aid to the interpretation and validation of cluster analysis
journal, November 1987
- Rousseeuw, Peter J.
- Journal of Computational and Applied Mathematics, Vol. 20
Blind source separation for groundwater pressure analysis based on nonnegative matrix factorization
journal, September 2014
- Alexandrov, Boian S.; Vesselinov, Velimir V.
- Water Resources Research, Vol. 50, Issue 9
Method of driving a display using non-negative matrix factorization to determine a pair of matrices for representing features of pixel data in an image data matrix and determining weights of said features such that a product of the matrices approximates the image data matrix
patent, May 2011
- Smith, Euan; Routley, Paul; Foden, Clare
- US Patent Document 7,953,682
Probabilistic sensitivity analysis of complex models: a Bayesian approach
journal, August 2004
- Oakley, Jeremy E.; O'Hagan, Anthony
- Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 66, Issue 3
Characterization of guanidiniocarbonyl pyrroles in water by pH-dependent UV Raman spectroscopy and component analysis
journal, January 2008
- Srivastava, S. K.; Niebling, S.; Küstner, B.
- Physical Chemistry Chemical Physics, Vol. 10, Issue 45
Dynamical Model of Drug Accumulation in Bacteria: Sensitivity Analysis and Experimentally Testable Predictions
journal, November 2016
- Vesselinova, Neda; Alexandrov, Boian S.; Wall, Michael E.
- PLOS ONE, Vol. 11, Issue 11
Systems and methods for analysis and design of radiating and scattering objects
patent, June 2012
- Shaeffer, John Fredrick
- US Patent Document 8,209,138
Using Bayesian statistics in the estimation of heat source in radiation
journal, January 2005
- Wang, Jingbo; Zabaras, Nicholas
- International Journal of Heat and Mass Transfer, Vol. 48, Issue 1
A mutational signature in gastric cancer suggests therapeutic strategies
journal, October 2015
- Alexandrov, Ludmil B.; Nik-Zainal, Serena; Siu, Hoi Cheong
- Nature Communications, Vol. 6, Issue 1
Shifted Non-Negative Matrix Factorization
conference, August 2007
- Morup, Morten; Madsen, Kristoffer H.; Hansen, Lars K.
- 2007 IEEE Workshop on Machine Learning for Signal Processing
Blind Source Separation
book, January 1999
- Zarzoso, V.; Nandi, A. K.
- Blind Estimation Using Higher-Order Statistics
Mutational signatures: the patterns of somatic mutations hidden in cancer genomes
journal, February 2014
- Alexandrov, Ludmil B.; Stratton, Michael R.
- Current Opinion in Genetics & Development, Vol. 24
Three-dimensional numerical inversion of pneumatic cross-hole tests in unsaturated fractured tuff: 1. Methodology and borehole effects
journal, December 2001
- Vesselinov, Velimir V.; Neuman, Shlomo P.; Illman, Walter A.
- Water Resources Research, Vol. 37, Issue 12
F lexi F a CT: Scalable Flexible Factorization of Coupled Tensors on Hadoop
conference, April 2014
- Beutel, Alex; Talukdar, Partha Pratim; Kumar, Abhimanu
- Proceedings of the 2014 SIAM International Conference on Data Mining
Three-dimensional numerical inversion of pneumatic cross-hole tests in unsaturated fractured tuff: 2. Equivalent parameters, high-resolution stochastic imaging and scale effects
journal, December 2001
- Vesselinov, Velimir V.; Neuman, Shlomo P.; Illman, Walter A.
- Water Resources Research, Vol. 37, Issue 12
Robust Multilinear Tensor Rank Estimation Using Higher Order Singular Value Decomposition and Information Criteria
journal, March 2017
- Yokota, Tatsuya; Lee, Namgil; Cichocki, Andrzej
- IEEE Transactions on Signal Processing, Vol. 65, Issue 5
Applicability of statistical learning algorithms in groundwater quality modeling: GROUNDWATER MODELING BY LEARNING MACHINES
journal, May 2005
- Khalil, Abedalrazq; Almasri, Mohammad N.; McKee, Mac
- Water Resources Research, Vol. 41, Issue 5
A new structure identification scheme for ANFIS and its application for the simulation of virtual air pollution monitoring stations in urban areas
journal, May 2015
- Taheri Shahraiyni, Hamid; Sodoudi, Sahar; Kerschbaumer, Andreas
- Engineering Applications of Artificial Intelligence, Vol. 41
Broadly targeted CD8+ T cell responses restricted by major histocompatibility complex E
journal, January 2016
- Hansen, S. G.; Wu, H. L.; Burwitz, B. J.
- Science, Vol. 351, Issue 6274
Second-order optimization based adaptive PARAFAC decomposition of three-way tensors
journal, April 2017
- Nguyen, Viet-Dung; Abed-Meraim, Karim; Linh-Trung, Nguyen
- Digital Signal Processing, Vol. 63
Assessment of surface water quality using multivariate statistical techniques: A case study of the Fuji river basin, Japan
journal, April 2007
- Shrestha, S.; Kazama, F.
- Environmental Modelling & Software, Vol. 22, Issue 4
Identification of Pumping Influences in Long-Term Water Level Fluctuations
journal, April 2011
- Harp, Dylan R.; Vesselinov, Velimir V.
- Ground Water, Vol. 49, Issue 3
Document clustering based on non-negative matrix factorization
conference, January 2003
- Xu, Wei; Liu, Xin; Gong, Yihong
- Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03
Deciphering Signatures of Mutational Processes Operative in Human Cancer
journal, January 2013
- Alexandrov, Ludmil B.; Nik-Zainal, Serena; Wedge, David C.
- Cell Reports, Vol. 3, Issue 1
Automatic Relevance Determination in Nonnegative Matrix Factorization with the /spl beta/-Divergence
journal, July 2013
- Tan, V. Y. F.; Fevotte, C.
- IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, Issue 7
Simultaneous parameter estimation and contaminant source characterization for coupled groundwater flow and contaminant transport modelling
journal, July 1992
- Wagner, Brian J.
- Journal of Hydrology, Vol. 135, Issue 1-4
The case of arsenic contamination in the Sardinian Geopark, Italy, analyzed using symbolic machine learning: ALGORITHM QUASI-OPTIMAL LEARNING AND ARSENIC CONTAMINATION
journal, September 2013
- Manca, Germana; Cervone, Guido
- Environmetrics
Combining multiple clusterings by soft correspondence
patent, June 2012
- Long, Bo; Zhang, Zhongfei Mark
- US Patent Document 8,195,734
Rapid identification of structural phases in combinatorial thin-film libraries using x-ray diffraction and non-negative matrix factorization
journal, October 2009
- Long, C. J.; Bunker, D.; Li, X.
- Review of Scientific Instruments, Vol. 80, Issue 10
Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values
journal, June 1994
- Paatero, Pentti; Tapper, Unto
- Environmetrics, Vol. 5, Issue 2
Algorithm quasi‐optimal (AQ) learning
journal, March 2010
- Cervone, Guido; Franzese, Pasquale; Keesee, Allen P. K.
- Wiley Interdisciplinary Reviews: Computational Statistics, Vol. 2, Issue 2
Denoising acoustic signals using constrained non-negative matrix factorization
patent, September 2011
- Wilson, Kevin William; Divakaran, Ajay; Ramakrishnan, Bhiksha
- US Patent Document 8,015,003
Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-Way Data Analysis and Blind Source Separation
book, September 2009
- Cichocki, Andrzej; Zdunek, Rafal; Phan, Anh Huy