DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: A dictionary learning algorithm for compression and reconstruction of streaming data in preset order

Abstract

There has been growing interest in developing and applying dictionary learning (DL) to process massive datasets over the last decade. Many of these efforts, however, focus on employing DL to compress data and extract a set of important features, while treating the restoration of the original data from this set as a secondary goal. On the other hand, although several methods can process streaming data by updating the dictionary incrementally as new snapshots pass by, most of those algorithms are designed for the setting where the snapshots are randomly drawn from a probability distribution. In this paper, we present a new DL approach to compress and denoise massive datasets in real time, in which the data are streamed through in a preset order (examples include videos and temporal experimental data), so at any time we can only observe a biased sample set of the whole dataset. Our approach builds up the dictionary incrementally in a relatively simple manner: if the new snapshot is adequately explained by the current dictionary, we perform sparse coding to find its sparse representation; otherwise, we add the new snapshot to the dictionary, applying a Gram-Schmidt process to maintain orthogonality. To compress and denoise noisy datasets, we apply the denoising to the snapshot directly before sparse coding, which deviates from the traditional dictionary learning approach of achieving denoising via sparse coding. Compared to full-batch matrix decomposition methods, where the whole dataset is kept in memory, and to other mini-batch approaches, where unbiased sampling is often assumed, our approach has minimal requirements on data sampling and storage: i) each snapshot is seen only once and then discarded, and ii) the snapshots are drawn in a preset order, so they can be highly biased.
Through experiments on climate simulations and scanning transmission electron microscopy (STEM) data, we demonstrate that the proposed approach performs competitively with those methods in data reconstruction and denoising.
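The incremental update described in the abstract can be sketched as follows. This is an illustrative reconstruction from the abstract's description, not the authors' code: the threshold name `tol` is a hypothetical parameter, and a dense orthogonal projection stands in for the paper's sparse coding step (with orthonormal atoms, the projection coefficients are simply the inner products with the snapshot).

```python
import numpy as np

def stream_update(D, y, tol=0.1):
    """One step of the streaming dictionary update (illustrative sketch).

    D   : (n, k) array with orthonormal columns (current dictionary), or None.
    y   : (n,) new snapshot.
    tol : relative reconstruction-error threshold (hypothetical name).
    Returns the (possibly grown) dictionary and the code for y.
    """
    if D is None or D.shape[1] == 0:
        # First snapshot: seed the dictionary with its normalized direction.
        atom = y / np.linalg.norm(y)
        return atom[:, None], np.array([np.linalg.norm(y)])
    # Orthonormal columns => best coefficients are the projections <d_i, y>.
    c = D.T @ y
    residual = y - D @ c
    if np.linalg.norm(residual) <= tol * np.linalg.norm(y):
        # Snapshot adequately explained: keep the dictionary, return the code.
        return D, c
    # Otherwise, Gram-Schmidt: append the normalized residual, which is
    # orthogonal to every existing atom by construction.
    atom = residual / np.linalg.norm(residual)
    D = np.hstack([D, atom[:, None]])
    return D, np.append(c, np.linalg.norm(residual))
```

Each snapshot is touched once and then discarded, matching the single-pass, biased-order setting the abstract emphasizes; only the dictionary and the per-snapshot codes are retained.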

Authors:
Archibald, Richard [1]; Tran, Hoang [1]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Publication Date:
2022-04-01
Research Org.:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR). Scientific Discovery through Advanced Computing (SciDAC); USDOE Laboratory Directed Research and Development (LDRD) Program
OSTI Identifier:
1883981
Grant/Contract Number:  
AC05-00OR22725; AC02-05CH11231
Resource Type:
Accepted Manuscript
Journal Name:
Discrete and Continuous Dynamical Systems - Series S
Additional Journal Information:
Journal Volume: 15; Journal Issue: 4; Journal ID: ISSN 1937-1632
Publisher:
American Institute of Mathematical Sciences (AIMS)
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; dictionary learning; matrix factorization; online algorithm

Citation Formats

Archibald, Richard, and Tran, Hoang. A dictionary learning algorithm for compression and reconstruction of streaming data in preset order. United States: N. p., 2022. Web. doi:10.3934/dcdss.2021102.
Archibald, Richard, & Tran, Hoang. A dictionary learning algorithm for compression and reconstruction of streaming data in preset order. United States. https://doi.org/10.3934/dcdss.2021102
Archibald, Richard, and Tran, Hoang. 2022. "A dictionary learning algorithm for compression and reconstruction of streaming data in preset order". United States. https://doi.org/10.3934/dcdss.2021102. https://www.osti.gov/servlets/purl/1883981.
@article{osti_1883981,
title = {A dictionary learning algorithm for compression and reconstruction of streaming data in preset order},
author = {Archibald, Richard and Tran, Hoang},
abstractNote = {There has been growing interest in developing and applying dictionary learning (DL) to process massive datasets over the last decade. Many of these efforts, however, focus on employing DL to compress data and extract a set of important features, while treating the restoration of the original data from this set as a secondary goal. On the other hand, although several methods can process streaming data by updating the dictionary incrementally as new snapshots pass by, most of those algorithms are designed for the setting where the snapshots are randomly drawn from a probability distribution. In this paper, we present a new DL approach to compress and denoise massive datasets in real time, in which the data are streamed through in a preset order (examples include videos and temporal experimental data), so at any time we can only observe a biased sample set of the whole dataset. Our approach builds up the dictionary incrementally in a relatively simple manner: if the new snapshot is adequately explained by the current dictionary, we perform sparse coding to find its sparse representation; otherwise, we add the new snapshot to the dictionary, applying a Gram-Schmidt process to maintain orthogonality. To compress and denoise noisy datasets, we apply the denoising to the snapshot directly before sparse coding, which deviates from the traditional dictionary learning approach of achieving denoising via sparse coding. Compared to full-batch matrix decomposition methods, where the whole dataset is kept in memory, and to other mini-batch approaches, where unbiased sampling is often assumed, our approach has minimal requirements on data sampling and storage: i) each snapshot is seen only once and then discarded, and ii) the snapshots are drawn in a preset order, so they can be highly biased.
Through experiments on climate simulations and scanning transmission electron microscopy (STEM) data, we demonstrate that the proposed approach performs competitively with those methods in data reconstruction and denoising.},
doi = {10.3934/dcdss.2021102},
journal = {Discrete and Continuous Dynamical Systems - Series S},
number = 4,
volume = 15,
place = {United States},
year = {2022},
month = {apr}
}
