skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Time-Series Forecast Modeling on High-Bandwidth Network Measurements

Abstract

With the increasing number of geographically distributed scientific collaborations and the growing sizes of scientific data, it has become challenging for users to achieve the best possible network performance on a shared network. In this paper, we have developed a model to forecast expected bandwidth utilization on high-bandwidth wide area networks. The forecast model can improve the efficiency of the resource utilization and scheduling of data movements on high-bandwidth networks to accommodate ever increasing data volume for large-scale scientific data applications. A univariate time-series forecast model is developed with the Seasonal decomposition of Time series by Loess (STL) and the AutoRegressive Integrated Moving Average (ARIMA) on Simple Network Management Protocol (SNMP) path utilization measurement data. Compared with the traditional approach such as Box-Jenkins methodology to train the ARIMA model, our forecast model reduces computation time up to 92.6 %. It also shows resilience against abrupt network usage changes. Finally, our forecast model conducts the large number of multi-step forecast, and the forecast errors are within the mean absolute deviation (MAD) of the monitored measurements.

Authors:
 [1];  [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Publication Date:
Research Org.:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
OSTI Identifier:
1377437
DOE Contract Number:  
AC02-05CH11231
Resource Type:
Journal Article
Journal Name:
Journal of Grid Computing
Additional Journal Information:
Journal Volume: 14; Journal Issue: 3; Journal ID: ISSN 1570-7873
Publisher:
Springer
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; data modeling; time series; prediction model; network measurements; network traffic

Citation Formats

Yoo, Wucherl, and Sim, Alex. Time-Series Forecast Modeling on High-Bandwidth Network Measurements. United States: N. p., 2016. Web. doi:10.1007/s10723-016-9368-9.
Yoo, Wucherl, & Sim, Alex. Time-Series Forecast Modeling on High-Bandwidth Network Measurements. United States. https://doi.org/10.1007/s10723-016-9368-9
Yoo, Wucherl, and Sim, Alex. 2016. "Time-Series Forecast Modeling on High-Bandwidth Network Measurements". United States. https://doi.org/10.1007/s10723-016-9368-9. https://www.osti.gov/servlets/purl/1377437.
@article{osti_1377437,
title = {Time-Series Forecast Modeling on High-Bandwidth Network Measurements},
author = {Yoo, Wucherl and Sim, Alex},
abstractNote = {With the increasing number of geographically distributed scientific collaborations and the growing sizes of scientific data, it has become challenging for users to achieve the best possible network performance on a shared network. In this paper, we have developed a model to forecast expected bandwidth utilization on high-bandwidth wide area networks. The forecast model can improve the efficiency of the resource utilization and scheduling of data movements on high-bandwidth networks to accommodate ever increasing data volume for large-scale scientific data applications. A univariate time-series forecast model is developed with the Seasonal decomposition of Time series by Loess (STL) and the AutoRegressive Integrated Moving Average (ARIMA) on Simple Network Management Protocol (SNMP) path utilization measurement data. Compared with the traditional approach such as Box-Jenkins methodology to train the ARIMA model, our forecast model reduces computation time up to 92.6 %. It also shows resilience against abrupt network usage changes. Finally, our forecast model conducts the large number of multi-step forecast, and the forecast errors are within the mean absolute deviation (MAD) of the monitored measurements.},
doi = {10.1007/s10723-016-9368-9},
url = {https://www.osti.gov/biblio/1377437}, journal = {Journal of Grid Computing},
issn = {1570-7873},
number = 3,
volume = 14,
place = {United States},
year = {Fri Jun 24 00:00:00 EDT 2016},
month = {Fri Jun 24 00:00:00 EDT 2016}
}

Works referenced in this record:

Network traffic characteristics of data centers in the wild
conference, January 2010


On a measure of lack of fit in time series models
journal, August 1978


On the predictability of large transfer TCP throughput
conference, January 2005

  • He, Qi; Dovrolis, Constantine; Ammar, Mostafa
  • Proceedings of the 2005 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '05
  • https://doi.org/10.1145/1080091.1080110

The road to SDN: an intellectual history of programmable networks
journal, April 2014


Testing the null hypothesis of stationarity against the alternative of a unit root
journal, October 1992


Wide area traffic: the failure of Poisson modeling
journal, June 1995


A new look at the statistical model identification
journal, December 1974


Automatically inferring patterns of resource consumption in network traffic
conference, January 2003

  • Estan, Cristian; Savage, Stefan; Varghese, George
  • Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '03
  • https://doi.org/10.1145/863955.863972

On the self-similar nature of Ethernet traffic (extended version)
journal, January 1994


Network bandwidth utilization forecast model on high bandwidth networks
conference, February 2015


Parallelization of an evolving Artificial Neural Networks system to Forecast Time Series using OPENMP and MPI
conference, May 2012


Empirical Evaluation of Techniques for Measuring Available Bandwidth
conference, May 2007


ARCH-Based Traffic Forecasting and Dynamic Bandwidth Provisioning for Periodically Measured Nonstationary Traffic
journal, June 2007


A Machine Learning Approach to TCP Throughput Prediction
journal, August 2010


The quest for bandwidth estimation techniques for large-scale distributed systems
journal, January 2010


A Flexible Reservation Algorithm for Advance Network Provisioning
conference, November 2010

  • Balman, Mehmet; Chaniotakisy, Evangelos; Shoshani, Arie
  • 2010 SC - International Conference for High Performance Computing, Networking, Storage and Analysis, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
  • https://doi.org/10.1109/SC.2010.4

Unified architecture for network measurement: The case of available bandwidth
journal, September 2012


Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting
journal, September 1988


A Data Throughput Prediction and Optimization Service for Widely Distributed Many-Task Computing
journal, June 2011


The Influence Curve and its Role in Robust Estimation
journal, June 1974


Self-similarity in World Wide Web traffic: evidence and possible causes
journal, January 1997


A predictability analysis of network traffic
journal, July 2002


Evaluation and characterization of available bandwidth probing techniques
journal, August 2003


Long-Term Forecasting of Internet Backbone Traffic
journal, September 2005


A state space framework for automatic forecasting using exponential smoothing methods
journal, July 2002


Comparing Predictive Accuracy
journal, July 1995


Data cleaning for dynamic modeling and control
conference, August 1999