DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Exploratory analysis and performance prediction of big data transfer in High-performance Networks

Journal Article · · Engineering Applications of Artificial Intelligence
 [1]; ORCiD logo [2];  [2]; ORCiD logo [3];  [4]
  1. Harrisburg Univ., Harrisburg, PA (United States)
  2. New Jersey Institute of Technology, Newark, NJ (United States)
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
  4. Argonne National Lab. (ANL), Argonne, IL (United States)

Big data transfer in large-scale scientific and business applications is increasingly carried out over connections with guaranteed bandwidth provisioned in High-performance Networks (HPNs) via advance bandwidth reservation. Provisioning agents need to carefully schedule data transfer requests, compute network paths, and allocate appropriate bandwidths. Such reserved bandwidths, if not fully utilized, could be simply wasted due to the exclusive access during the approved time window, and cause extra overhead and complexity for resource management. This calls for accurate performance prediction to reserve bandwidths that match actual needs and avoid over-provisioning. We employ machine learning algorithms to predict big data transfer performance based on extensive performance measurements collected in the past several years from data transfer tests using different protocols and toolkits between various end sites on several real-life physical or emulated testbeds. We first analyze the performance patterns in response to a comprehensive list of parameters in end-host systems, network connections, and data transfer applications, which motivate the use of machine learning and also help us identify the effects of latent factors. We then propose threshold- and clustering-based methods to eliminate negative effects of latent factors in data preprocessing and build a robust performance predictor based on customized domain-oriented loss functions. The performance of the proposed methods is verified by extensive experiments using SVR and RFR as well as theoretical analysis of the general performance bound.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1820703
Journal Information:
Engineering Applications of Artificial Intelligence, Vol. 102; ISSN 0952-1976
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (21)

On the predictability of large transfer TCP throughput journal October 2007
Control Plane for Advance Bandwidth Scheduling in Ultra High-Speed Networks conference April 2006
Comparative analysis of big data transfer protocols in an international high-speed network conference December 2015
Profiling Optimization for Big Data Transfer over Dedicated Channels conference August 2016
A Machine Learning Approach to TCP Throughput Prediction journal August 2010
B4: experience with a globally-deployed software defined wan journal August 2013
On Performance Prediction of Big Data Transfer in High-performance Networks conference June 2020
Profiling transport performance for big data transfer over dedicated channels conference February 2015
UDT: UDP-based data transfer for high-speed wide area networks journal May 2007
Advising Big Data Transfer Over Dedicated Connections Based on Profiling Optimization journal December 2019
Modeling TCP Reno performance: a simple model and its empirical validation journal April 2000
End system optimizations for high-speed TCP journal April 2001
Explaining Wide Area Data Transfer Performance
  • Liu, Zhengchun; Balaprakash, Prasanna; Kettimuthu, Rajkumar
  • HPDC '17: The 26th International Symposium on High-Performance Parallel and Distributed Computing, Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing https://doi.org/10.1145/3078597.3078605
conference June 2017
Bandwidth Reservation Strategies for Scheduling Maximization in Dedicated Networks journal June 2018
On Source-Based Route Computation for Quickest Paths under Dynamic Bandwidth Constraints journal June 2003
Complexity Analysis and Algorithm Design for Advance Bandwidth Scheduling in Dedicated Networks journal February 2013
Time Series Analysis for Efficient Sample Transfers conference January 2019
Simple sample bound for feedforward sigmoid networks with bounded weights journal November 1999
Intra and Interdomain Circuit Provisioning Using the OSCARS Reservation System conference October 2006
Improving network performance on multicore systems: Impact of core affinities on high throughput flows journal March 2016
TCP Throughput Profiles Using Measurements over Dedicated Connections
  • Rao, Nageswara S. V.; Liu, Qiang; Sen, Satyabrata
  • Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing - HPDC '17 https://doi.org/10.1145/3078597.3078615
conference January 2017