Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Faster classification using compression analytics.

Conference ·

Abstract not provided.

Research Organization:
Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
NA0003525
OSTI ID:
1895259
Report Number(s):
SAND2021-13312C; 701255
Resource Relation:
Conference: Proposed for presentation at the International Conference on Data Mining 2021 in ,
Country of Publication:
United States
Language:
English

References (30)

Lempel-Ziv Jaccard Distance, an effective alternative to ssdeep and sdhash March 2018
Compression of individual sequences via variable-rate coding September 1978
A universal algorithm for sequential data compression May 1977
Authorship attribution of source code by using back propagation neural network based on particle swarm optimization November 2017
Finding the Jaccard Median January 2010
The minisum location problem for the Jaccard metric June 1981
The Similarity Metric December 2004
Clustering by Compression April 2005
A survey of modern authorship attribution methods December 2008
Compression-based Image Registration July 2006
Analyzing worms and network traffic using compression March 2007
Compression Analytics for Classification and Anomaly Detection Within Network Communication May 2019
A Survey on Using Kolmogorov Complexity in Cybersecurity December 2019
Mobile malware visual analytics and similarities of Attack Toolkits (Malware gene analysis) May 2013
Nearest neighbor pattern classification January 1967
Tracking concept drift in malware families October 2012
A note on the triangle inequality for the Jaccard distance April 2019
Streaming Malware Classification in the Presence of Concept Drift and Class Imbalance December 2013
Machine learning in computer forensics (and the lessons learned from machine learning in computer security) October 2011
Mining e-mail content for author identification forensics December 2001
File Fragment Classification-The Case for Specialized Approaches May 2009
On normalized compression distance and large malware December 2015
Sparse Coding for N-Gram Feature Extraction and Training for File Fragment Classification October 2018
Dictionary based color image retrieval October 2008
Automated Classification and Analysis of Internet Malware January 2007
Drebin: Effective and Explainable Detection of Android Malware in Your Pocket January 2014
A fast compression-based similarity measure with applications to content-based image retrieval February 2012
Generalized Boundary Detection Using Compression-based Analytics May 2019
Code Authorship Attribution February 2019
An Alternative to NCD for Large Sequences, Lempel-Ziv Jaccard Distance August 2017