DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Improving the accuracy of freight mode choice models: A case study using the 2017 CFS PUF data set and ensemble learning techniques

Journal Article · · Expert Systems with Applications

Here, the US Census Bureau has collected two rounds of experimental data from the Commodity Flow Survey, providing shipment-level characteristics of nationwide commodity movements, published in 2012 (i.e., Public Use Microdata) and in 2017 (i.e., Public Use File). With this information, data-driven methods have become increasingly valuable for understanding detailed patterns in freight logistics. In this study, we used the 2017 Commodity Flow Survey Public Use File data set to explore building a high-performance freight mode choice model, considering three main improvements: (1) constructing local models for each separate commodity/industry category; (2) extracting useful geographical features, particularly the derived distance of each freight mode between origin/destination zones; and (3) applying additional ensemble learning methods such as stacking or voting to combine results from local and unified models for improved performance. The proposed method achieved over 92% accuracy without incorporating external information, an over 19% increase compared to directly fitting Random Forests models over 10,000 samples. Furthermore, SHAP (Shapely Additive Explanations) values were computed to explain the outputs and major patterns obtained from the proposed model. The model framework could enhance the performance and interpretability of existing freight mode choice models.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
2251623
Alternate ID(s):
OSTI ID: 2369705
Journal Information:
Expert Systems with Applications, Journal Name: Expert Systems with Applications Journal Issue: 1 Vol. 240; ISSN 0957-4174
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (27)

A Stated preference freight mode choice model journal April 2003
Analysis of Travel Mode Choice in Seoul Using an Interpretable Machine Learning Approach journal March 2021
A joint framework for modeling freight mode and destination choice: Application to the US commodity flow survey data journal February 2021
A Freight Mode Choice Analysis Using a Binary Logit Model and GIS: The Case of Cereal Grains Transportation in the United States journal January 2012
Modeling freight mode choice using machine learning classifiers: a comparative study using Commodity Flow Survey (CFS) data journal May 2021
The transport geography of electric and autonomous vehicles in road freight networks journal October 2019
A behavioral analysis of freight mode choice decisions journal December 2011
Using big data to enhance the bosch production line performance: A Kaggle challenge conference December 2016
XGBoost: A Scalable Tree Boosting System conference January 2016
Freight Mode Choice: A Regret Minimization and Utility Maximization Based Hybrid Model journal June 2018
A stacking model using URL and HTML features for phishing webpage detection journal May 2019
Predicting Travel Mode of Individuals by Machine Learning journal January 2015
Modeling Freight Mode Choice in Greece journal January 2012
A comparative study of machine learning classifiers for modeling travel mode choice journal July 2017
An Analysis of Interstate Freight Mode Choice between Truck and Rail: A Case Study of Maryland, United States journal November 2013
Kaggle forecasting competitions: An overlooked learning opportunity journal April 2021
Prediction and behavioral analysis of travel mode choice: A comparison of machine learning and logit models journal July 2020
Analysis of human-factor-caused freight train accidents in the United States journal December 2019
A weighted logit freight mode-choice model journal November 2009
How shift scheduling practices contribute to fatigue amongst freight rail operating employees: Findings from Canadian accident investigations journal May 2019
An interpretable machine learning framework to understand bikeshare demand before and during the COVID-19 pandemic in New York City journal April 2023
Predicting antigenic variants of H1N1 influenza virus based on epidemics and pandemics using a stacking model journal December 2018
Joint model of freight mode choice and shipment size: A copula-based random regret minimization framework journal May 2019
Food supply chains during the COVID‐19 pandemic journal May 2020
A multimodal location and routing model for hazardous materials transportation journal August 2012
Predicting the travel mode choice with interpretable machine learning techniques: A comparative study journal October 2022
Applying a random forest method approach to model travel mode choice behavior journal January 2019