skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A Kriging based spatiotemporal approach for traffic volume data imputation

Journal Article · · PLoS ONE
ORCiD logo [1];  [2];  [3];  [1];  [4];  [5];  [5];
  1. National United Engineering Laboratory of Integrated and Intelligent Transportation, School of Transportation and Logistics, Southwest Jiaotong University, Hi-Tech Industrial Development Zone, Chengdu, Sichuan, China
  2. Model Risk Management, Bank of America, Charlotte, NC United States)
  3. Univ. of Tennessee, Knoxville, TN (United States). Dept. of Civil & Environmental Engineering
  4. School of Architecture and Design, Southwest Jiaotong University, Hi-Tech Industrial Development Zone, Chengdu, Sichuan, China
  5. Oak Ridge National Laboratory, Knoxville, TN (United States) Center for Transportation Analysis

Along with the rapid development of Intelligent Transportation Systems, traffic data collection technologies have progressed fast. The emergence of innovative data collection technologies such as remote traffic microwave sensor, Bluetooth sensor, GPS-based floating car method, and automated license plate recognition, has significantly increased the variety and volume of traffic data. Despite the development of these technologies, the missing data issue is still a problem that poses great challenge for data based applications such as traffic forecasting, real-time incident detection, dynamic route guidance, and massive evacuation optimization. A thorough literature review suggests most current imputation models either focus on the temporal nature of the traffic data and fail to consider the spatial information of neighboring locations or assume the data follow a certain distribution. These two issues reduce the imputation accuracy and limit the use of the corresponding imputation methods respectively. As a result, this paper presents a Kriging based data imputation approach that is able to fully utilize the spatiotemporal correlation in the traffic data and that does not assume the data follow any distribution. A set of scenarios with different missing rates are used to evaluate the performance of the proposed method. The performance of the proposed method was compared with that of two other widely used methods, historical average and K-nearest neighborhood. Comparison results indicate that the proposed method has the highest imputation accuracy and is more flexible compared to other methods.

Research Organization:
Oak Ridge National Lab (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1627861
Journal Information:
PLoS ONE, Vol. 13, Issue 4; ISSN 1932-6203
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (27)

Large-Scale Transportation Network Congestion Evolution Prediction Using Deep Learning Theory journal March 2015
Predicting Short-Term Subway Ridership and Prioritizing Its Influential Factors Using Gradient Boosting Decision Trees journal October 2016
Prioritizing Influential Factors for Freeway Incident Clearance Time Prediction Using the Gradient Boosting Decision Trees Method journal September 2017
Defining and Measuring Traffic Data Quality: White Paper on Recommended Approaches journal January 2004
Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results journal November 2003
New Algorithms for Filtering and Imputation of Real-Time and Archived Dual-Loop Detector Data in I-4 Data Warehouse
  • Al-Deek, Haitham M.; Venkata, Chilakamarri; Chandra, Srinivasa Ravi
  • Transportation Research Record: Journal of the Transportation Research Board, Vol. 1867, Issue 1 https://doi.org/10.3141/1867-14
journal January 2004
Markov Chain Monte Carlo Multiple Imputation Using Bayesian Networks for Incomplete Intelligent Transportation Systems Data journal January 2005
Missing traffic data: comparison of imputation methods journal February 2014
Matching Patterns for Updating Missing Values of Traffic Counts journal April 2006
Exploring Imputation Techniques for Missing Data in Transportation Management Systems
  • Smith, Brian L.; Scherer, William T.; Conklin, James H.
  • Transportation Research Record: Journal of the Transportation Research Board, Vol. 1836, Issue 1 https://doi.org/10.3141/1836-17
journal January 2003
Genetically Designed Models for Accurate Imputation of Missing Traffic Counts
  • Zhong, Ming; Sharma, Satish; Lingras, Pawan
  • Transportation Research Record: Journal of the Transportation Research Board, Vol. 1879, Issue 1 https://doi.org/10.3141/1879-09
journal January 2004
A BPCA based missing value imputing method for traffic flow volume data conference June 2008
Imputation of Missing Traffic Data during Holiday Periods journal October 2008
Urban Freeway Traffic Flow Prediction: Application of Seasonal Autoregressive Integrated Moving Average and Exponential Smoothing Models
  • Williams, Billy M.; Durvasula, Priya K.; Brown, Donald E.
  • Transportation Research Record: Journal of the Transportation Research Board, Vol. 1644, Issue 1 https://doi.org/10.3141/1644-14
journal January 1998
Travel-Time Prediction With Support Vector Regression journal December 2004
Short-term traffic flow prediction models-a comparison of neural network and nonparametric regression approaches conference January 1994
A Hidden Markov Model for short term prediction of traffic conditions on freeways journal June 2014
Accurate Tree-based Missing Data Imputation and Data Fusion within the Statistical Learning Paradigm journal June 2012
Spatial Copula Model for Imputing Traffic Flow Data from Remote Microwave Sensors journal September 2017
Space–time modeling of traffic flow journal March 2005
Short-Term Freeway Speed Profiling Based on Longitudinal Spatiotemporal Dynamics
  • Yang, Jianjiang; Han, Lee D.; Freeze, Phillip B.
  • Transportation Research Record: Journal of the Transportation Research Board, Vol. 2467, Issue 1 https://doi.org/10.3141/2467-07
journal January 2014
A multivariate state space approach for urban traffic flow modeling and prediction journal April 2003
The origins of kriging journal April 1990
Incorporating volatility updating into the historical simulation method for value-at-risk journal September 1998
Missing value estimation methods for DNA microarrays journal June 2001
Statewide Rural-Urban Bus Travel Demand and Network Evaluation: An Application in Tennessee journal September 2012
Use characteristics and demographics of rural transit riders: a case study in Tennessee journal December 2016

Cited By (2)

Traffic data reconstruction based on compressive sensing with neighbor regularization journal February 2020
Traffic Estimation for Large Urban Road Network with High Missing Data Ratio journal June 2019