Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Learning concave-convex profiles of data transport over dedicated connections

Conference ·
Dedicated data transport infrastructures are increasingly being deployed to support distributed big-data and high-performance computing scenarios. These infrastructures employ data transfer nodes that use sophisticated software stacks to support network transport among sites, which often house distributed file and storage systems. Throughput measurements collected over such infrastructures for a range of round trip times (RTTs) reflect the underlying complex end-to-end connections, and have revealed dichotomous throughput profiles as functions of RTT. In particular, concave regions of throughput profiles at lower RTTs indicate near-optimal performance, and convex regions at higher RTTs indicate bottlenecks due to factors such as buffer or credit limits. We present a machine learning method that explicitly infers these concave and convex regions and transitions between them using sigmoid functions. We also provide distribution-free confidence estimates for the generalization error of these concave-convex profile estimates. Throughput profiles for data transfers over 10 Gbps connections with 0–366 ms RTT provide important performance insights, including the near optimality of transfers performed with the XDD tool between XFS filesystems, and the performance limits of wide-area Lustre extensions using LNet routers. A direct application of generic machine learning packages does not adequately highlight these critical performance regions or provide as precise confidence estimates.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21); USDOE National Nuclear Security Administration (NNSA)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1513419
Country of Publication:
United States
Language:
English

References (12)

On Analytics of File Transfer Rates over Dedicated Wide-Area Connections conference October 2017
Overlay Networks of In Situ Instruments for Probabilistic Guarantees on Message Delays in Wide-Area Networks journal January 2004
UDT: UDP-based data transfer for high-speed wide area networks journal May 2007
Experiments and Analyses of Data Transfers over Wide-Area Dedicated Connections conference July 2017
Simple sample bound for feedforward sigmoid networks with bounded weights journal November 1999
Cross-geography scientific data transferring trends and behavior
  • Liu, Zhengchun; Kettimuthu, Rajkumar; Foster, Ian
  • Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing - HPDC '18 https://doi.org/10.1145/3208040.3208053
conference January 2018
A technique for moving large data sets over high-performance long distance networks conference May 2011
Experimental Analysis of File Transfer Rates over Wide-Area Dedicated Connections
  • Rao, Nageswara S. V.; Liu, Qiang; Sen, Satyabrata
  • 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) https://doi.org/10.1109/HPCC-SmartCity-DSS.2016.0038
conference December 2016
TCP Throughput Profiles Using Measurements over Dedicated Connections
  • Rao, Nageswara S. V.; Liu, Qiang; Sen, Satyabrata
  • Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing - HPDC '17 https://doi.org/10.1145/3078597.3078615
conference January 2017
Software as a service for data scientists journal February 2012
Wide-area lustre file system using LNet routers conference April 2018
Nonparametric estimation and classification using radial basis function nets and empirical risk minimization journal March 1996

Similar Records

Learning Concave-Convex Profiles of Data Transport Over Dedicated Connections
Conference · Mon Dec 31 23:00:00 EST 2018 · OSTI ID:1574712

Measurements and analytics of wide-area file transfers over dedicated connections
Conference · Mon Dec 31 23:00:00 EST 2018 · OSTI ID:1509527

On Data Transfers Over Wide-Area Dedicated Connections
Book · Sun Dec 31 23:00:00 EST 2017 · OSTI ID:1399399

Related Subjects