skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Training a Quantum Annealing Based Restricted Boltzmann Machine on Cybersecurity Data

Journal Article · · IEEE Transactions on Emerging Topics in Computational Intelligence

A restricted Boltzmann machine (RBM) is a generative model that could be used in effectively balancing a cybersecurity dataset because the synthetic data a RBM generates follows the probability distribution of the training data. RBM training can be performed using contrastive divergence (CD) and quantum annealing (QA). QA-based RBM training is fundamentally different from CD and requires samples from a quantum computer. We present a real-world application that uses a quantum computer. Specifically, we train a RBM using QA for cybersecurity applications. The D-Wave 2000Q has been used to implement QA. RBMs are trained on the ISCX data, which is a benchmark dataset for cybersecurity. For comparison, RBMs are also trained using CD. CD is a commonly used method for RBM training. Our analysis of the ISCX data shows that the dataset is imbalanced. We present two different schemes to balance the training dataset before feeding it to a classifier. The first scheme is based on the undersampling of benign instances. The imbalanced training dataset is divided into five sub-datasets that are trained separately. A majority voting is then performed to get the result. Our results show the majority vote increases the classification accuracy up from 90.24% to 95.68%, in the case of CD. For the case of QA, the classification accuracy increases from 74.14% to 80.04%. In the second scheme, a RBM is used to generate synthetic data to balance the training dataset. We show that both QA and CD-trained RBM can be used to generate useful synthetic data. Balanced training data is used to evaluate several classifiers. Among the classifiers investigated, K-Nearest Neighbor (KNN) and Neural Network (NN) perform better than other classifiers. They both show an accuracy of 93%. Our results show a proof-of-concept that a QA-based RBM can be trained on a 64-bit binary dataset. The illustrative example suggests the possibility to migrate many practical classification problems to QA-based techniques. Further, we show that synthetic data generated from a RBM can be used to balance the original dataset.

Research Organization:
Purdue Univ., West Lafayette, IN (United States); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
SC0019215; AC05-00OR22725
OSTI ID:
1871582
Alternate ID(s):
OSTI ID: 1870258
Journal Information:
IEEE Transactions on Emerging Topics in Computational Intelligence, Vol. 6, Issue 3; ISSN 2471-285X
Publisher:
IEEECopyright Statement
Country of Publication:
United States
Language:
English

References (33)

Determination of the Lowest-Energy States for the Model Distribution of Trained Restricted Boltzmann Machines Using a 1000 Qubit D-Wave 2X Quantum Computer journal July 2017
Optimizing adiabatic quantum program compilation using a graph-theoretic framework journal April 2018
Intrusion Detection Using Random Forests Classifier with SMOTE and Feature Reduction conference November 2013
Research on Intrusion Detection Method Based on Improved Smote and XGBoost conference January 2018
A novel region adaptive SMOTE algorithm for intrusion detection on imbalanced problem conference December 2017
AESMOTE: Adversarial Reinforcement Learning With SMOTE for Anomaly Detection journal April 2021
Toward developing a systematic approach to generate benchmark datasets for intrusion detection journal May 2012
SMOTE Implementation on Phishing Data to Enhance Cybersecurity conference May 2018
Adiabatic Quantum Computation is Equivalent to Standard Quantum Computation journal January 2007
Comparison of Use of a 2000 Qubit D-Wave Quantum Annealer and MCMC for Sampling, Image Reconstruction, and Classification journal February 2021
Quantum annealing in the transverse Ising model journal November 1998
Quantum annealing: A new method for minimizing multidimensional functions journal March 1994
Support vector machines on the D-Wave quantum annealer journal March 2020
Quantum annealing for combinatorial clustering journal January 2018
Quantum Annealing for Prime Factorization journal December 2018
Electronic Structure Calculations and the Ising Hamiltonian journal October 2017
Intrusion detection using deep belief networks
  • Alom, Md. Zahangir; Bontupalli, VenkataRamesh; Taha, Tarek M.
  • NAECON 2015 - IEEE National Aerospace and Electronics Conference, 2015 National Aerospace and Electronics Conference (NAECON) https://doi.org/10.1109/NAECON.2015.7443094
conference June 2015
SMOTE: Synthetic Minority Over-sampling Technique journal January 2002
An evaluation of the performance of Restricted Boltzmann Machines as a model for anomaly network intrusion detection journal October 2018
Training Products of Experts by Minimizing Contrastive Divergence journal August 2002
A Hybrid Malicious Code Detection Method based on Deep Learning journal May 2015
SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary journal April 2018
Simple Proof of Equivalence between Adiabatic Quantum Computation and the Circuit Model journal August 2007
Toward an Online Anomaly Intrusion Detection System Based on Deep Learning conference December 2016
Network anomaly detection with the restricted Boltzmann machine journal December 2013
Toward sampling from undirected probabilistic graphical models using a D-Wave quantum annealer journal September 2020
Estimation of effective temperatures in quantum annealers for sampling applications: A case study with possible applications in deep learning journal August 2016
Data mining: practical machine learning tools and techniques with Java implementations journal March 2002
Comparison of D-Wave Quantum Annealing and Classical Simulated Annealing for Local Minima Determination journal August 2020
Training a Quantum Annealing Based Restricted Boltzmann Machine on Cybersecurity Data journal January 2021
A hybrid quantum enabled RBM advantage: convolutional autoencoders for quantum image compression and generative learning conference May 2020
Measuring the Impact of Accurate Feature Selection on the Performance of RBM in Comparison to State of the Art Machine Learning Algorithms journal July 2020
Some Remarks on Weakly Prime and Weakly Semiprime Submodules journal January 2012

Similar Records

Training Restricted Boltzmann Machines With a D-Wave Quantum Annealer
Journal Article · Tue Jun 29 00:00:00 EDT 2021 · Frontiers in Physics · OSTI ID:1871582

Restricted Boltzmann Machines for galaxy morphology classification with a quantum annealer
Journal Article · Thu Nov 14 00:00:00 EST 2019 · TBD · OSTI ID:1871582

Structure–activity relationship-based chemical classification of highly imbalanced Tox21 datasets
Journal Article · Tue Oct 27 00:00:00 EDT 2020 · Journal of Cheminformatics · OSTI ID:1871582

Related Subjects