Detecting anomalous packets in network transfers: investigations using PCA, autoencoder and isolation forest in TCP
Journal Article
·
· Machine Learning
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); University of Southern California
- Univ. of North Carolina, Chapel Hill, NC (United States)
- Univ. of Southern California, Los Angeles, CA (United States). Information Sciences Institute
Large-scale scientific workflows rely heavily on high-performance file transfers. These transfers require strict quality parameters such as guaranteed bandwidth, no packet loss or data duplication. To have successful file transfers, methods such as predetermined thresholds and statistical analysis need to be done to determine abnormal patterns. Network administrators routinely monitor and analyze network data for diagnosing and alleviating these, making decisions based on their experience. However, as networks grow and become complex, monitoring large data files and quickly processing them, makes it improbable to identify errors and rectify these. Abnormal file transfers have been classified by simply setting alert thresholds, via tools such as PerfSonar and TCP statistics (Tstat). This paper investigates the feasibility of unsupervised feature extraction methods for identifying network anomaly patterns with three unsupervised classification methods—principal component analysis, autoencoder and isolation forest. Here, we collect file transfer statistics from two experiment sets—synthetic iPerf generated traffic and 1000 Genome workflow runs, with synthetically introduced anomalies. Our results show that while PCA and a simple autoencoder finds it difficult to detect clusters, the tree-variant isolation forest is able to identify anomalous packets by breaking down TCP traces into tree classes early.
- Research Organization:
- Univ. of Southern California, Los Angeles, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- SC0012636
- OSTI ID:
- 1787027
- Journal Information:
- Machine Learning, Journal Name: Machine Learning Journal Issue: 5 Vol. 109; ISSN 0885-6125
- Publisher:
- Springer NatureCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Detecting Outliers in Network Transfers with Feature Extraction
An investigation of packet reordering in TCP traces (extended abstract)
Experiences with TCP/IP over an ATM OC12 WAN
Conference
·
Sun Jul 01 00:00:00 EDT 2018
·
OSTI ID:1468101
An investigation of packet reordering in TCP traces (extended abstract)
Conference
·
Wed Dec 31 23:00:00 EST 2003
·
OSTI ID:977651
Experiences with TCP/IP over an ATM OC12 WAN
Technical Report
·
Wed Dec 22 23:00:00 EST 1999
·
OSTI ID:764365