skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Time-Series Forecast Modeling on High-Bandwidth Network Measurements

Journal Article · · Journal of Grid Computing
 [1];  [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

With the increasing number of geographically distributed scientific collaborations and the growing sizes of scientific data, it has become challenging for users to achieve the best possible network performance on a shared network. In this paper, we have developed a model to forecast expected bandwidth utilization on high-bandwidth wide area networks. The forecast model can improve the efficiency of the resource utilization and scheduling of data movements on high-bandwidth networks to accommodate ever increasing data volume for large-scale scientific data applications. A univariate time-series forecast model is developed with the Seasonal decomposition of Time series by Loess (STL) and the AutoRegressive Integrated Moving Average (ARIMA) on Simple Network Management Protocol (SNMP) path utilization measurement data. Compared with the traditional approach such as Box-Jenkins methodology to train the ARIMA model, our forecast model reduces computation time up to 92.6 %. It also shows resilience against abrupt network usage changes. Finally, our forecast model conducts the large number of multi-step forecast, and the forecast errors are within the mean absolute deviation (MAD) of the monitored measurements.

Research Organization:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1377437
Journal Information:
Journal of Grid Computing, Vol. 14, Issue 3; ISSN 1570-7873
Publisher:
Springer
Country of Publication:
United States
Language:
English

References (28)

Network traffic characteristics of data centers in the wild conference January 2010
On a measure of lack of fit in time series models journal August 1978
On the predictability of large transfer TCP throughput
  • He, Qi; Dovrolis, Constantine; Ammar, Mostafa
  • Proceedings of the 2005 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '05 https://doi.org/10.1145/1080091.1080110
conference January 2005
The road to SDN: an intellectual history of programmable networks journal April 2014
Testing the null hypothesis of stationarity against the alternative of a unit root journal October 1992
Wide area traffic: the failure of Poisson modeling journal June 1995
A new look at the statistical model identification journal December 1974
Automatically inferring patterns of resource consumption in network traffic
  • Estan, Cristian; Savage, Stefan; Varghese, George
  • Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '03 https://doi.org/10.1145/863955.863972
conference January 2003
On the self-similar nature of Ethernet traffic (extended version) journal January 1994
Network bandwidth utilization forecast model on high bandwidth networks conference February 2015
Parallelization of an evolving Artificial Neural Networks system to Forecast Time Series using OPENMP and MPI conference May 2012
Empirical Evaluation of Techniques for Measuring Available Bandwidth conference May 2007
ARCH-Based Traffic Forecasting and Dynamic Bandwidth Provisioning for Periodically Measured Nonstationary Traffic journal June 2007
A Machine Learning Approach to TCP Throughput Prediction journal August 2010
The quest for bandwidth estimation techniques for large-scale distributed systems journal January 2010
A Flexible Reservation Algorithm for Advance Network Provisioning
  • Balman, Mehmet; Chaniotakisy, Evangelos; Shoshani, Arie
  • 2010 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis https://doi.org/10.1109/SC.2010.4
conference November 2010
Unified architecture for network measurement: The case of available bandwidth journal September 2012
Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting journal September 1988
A Data Throughput Prediction and Optimization Service for Widely Distributed Many-Task Computing journal June 2011
The Influence Curve and its Role in Robust Estimation journal June 1974
Self-similarity in World Wide Web traffic: evidence and possible causes journal January 1997
A predictability analysis of network traffic journal July 2002
Evaluation and characterization of available bandwidth probing techniques journal August 2003
Long-Term Forecasting of Internet Backbone Traffic journal September 2005
A state space framework for automatic forecasting using exponential smoothing methods journal July 2002
Multi-scale Internet traffic forecasting using neural networks and time series methods journal December 2010
Comparing Predictive Accuracy journal July 1995
Data cleaning for dynamic modeling and control conference August 1999