Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

NgramPPM: Compression Analytics without Compression

Technical Report ·
DOI:https://doi.org/10.2172/1822127· OSTI ID:1822127
 [1]
  1. Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

Arithmetic Coding (AC) using Prediction by Partial Matching (PPM) is a compression algorithm that can be used as a machine learning algorithm. This paper describes a new algorithm, NGram PPM. NGram PPM has all the predictive power of AC/PPM, but at a fraction of the computational cost. Unlike compression-based analytics, it is also amenable to a vector space interpretation, which creates the ability for integration with other traditional machine learning algorithms. AC/PPM is reviewed, including its application to machine learning. Then NGram PPM is described and test results are presented, comparing them to AC/PPM.

Research Organization:
Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
NA0003525
OSTI ID:
1822127
Report Number(s):
SAND2021-11185; 700061
Country of Publication:
United States
Language:
English

Similar Records

Identifying and Explaining Anomalous Activity in Surveillance Video with Compression Algorithms
Technical Report · Fri Sep 01 00:00:00 EDT 2023 · OSTI ID:2430331

Prediction and compression of lattice QCD data using machine learning algorithms on quantum annealer
Journal Article · Mon May 16 00:00:00 EDT 2022 · PoS - Proceedings of Science · OSTI ID:1887146

Cognition at the Point of Sensing
Technical Report · Tue Sep 29 00:00:00 EDT 2020 · OSTI ID:1668459

Related Subjects