A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework

Zhu, Feng; Aziz, H. M.  Abdul; Qian, Xinwu; Ukkusuri, Satish V.

doi:10.1016/j.trc.2014.12.009

Title: A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework

Abstract

Our study develops a novel reinforcement learning algorithm for the challenging coordinated signal control problem. Traffic signals are modeled as intelligent agents interacting with the stochastic traffic environment. The model is built on the framework of coordinated reinforcement learning. The Junction Tree Algorithm (JTA) based reinforcement learning is proposed to obtain an exact inference of the best joint actions for all the coordinated intersections. Moreover, the algorithm is implemented and tested with a network containing 18 signalized intersections in VISSIM. Finally, our results show that the JTA based algorithm outperforms independent learning (Q-learning), real-time adaptive learning, and fixed timing plans in terms of average delay, number of stops, and vehicular emissions at the network level.

Authors:

Zhu, Feng ^[1]; Aziz, H. M. Abdul ^[1]; Qian, Xinwu ^[1]; Ukkusuri, Satish V. ^[1]

Purdue Univ., West Lafayette, IN (United States)

Publication Date:: Sat Jan 31 00:00:00 EST 2015

Research Org.:: Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)

Sponsoring Org.:: USDOE, National Science Foundation (NSF)

OSTI Identifier:: 1265896

Grant/Contract Number:: AC05-00OR22725; 1004528; 104IPY04

Resource Type:: Accepted Manuscript

Journal Name:: Transportation Research Part C: Emerging Technologies

Additional Journal Information:: Journal Volume: 58; Journal Issue: PC; Journal ID: ISSN 0968-090X

Publisher:: Elsevier

Country of Publication:: United States

Language:: English

Subject:: 97 MATHEMATICS AND COMPUTING; Traffic Control; Learning Algorithm; Multi-agent control; Connected Vehicle

Citation Formats


                    Zhu, Feng, Aziz, H. M.  Abdul, Qian, Xinwu, and Ukkusuri, Satish V. A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework.  United States: N. p., 2015. 
Web.  doi:10.1016/j.trc.2014.12.009.

Copy to clipboard


                    Zhu, Feng, Aziz, H. M.  Abdul, Qian, Xinwu, & Ukkusuri, Satish V. A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework.  United States.  https://doi.org/10.1016/j.trc.2014.12.009

Copy to clipboard


                    Zhu, Feng, Aziz, H. M.  Abdul, Qian, Xinwu, and Ukkusuri, Satish V. Sat .  
"A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework".  United States.  https://doi.org/10.1016/j.trc.2014.12.009.  https://www.osti.gov/servlets/purl/1265896.

Copy to clipboard


                    
@article{osti_1265896,

  title        = {A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework},

  author       = {Zhu, Feng and Aziz, H. M.  Abdul and Qian, Xinwu and Ukkusuri, Satish V.},

  abstractNote = {Our study develops a novel reinforcement learning algorithm for the challenging coordinated signal control problem. Traffic signals are modeled as intelligent agents interacting with the stochastic traffic environment. The model is built on the framework of coordinated reinforcement learning. The Junction Tree Algorithm (JTA) based reinforcement learning is proposed to obtain an exact inference of the best joint actions for all the coordinated intersections. Moreover, the algorithm is implemented and tested with a network containing 18 signalized intersections in VISSIM. Finally, our results show that the JTA based algorithm outperforms independent learning (Q-learning), real-time adaptive learning, and fixed timing plans in terms of average delay, number of stops, and vehicular emissions at the network level.},

  doi          = {10.1016/j.trc.2014.12.009},

  journal      = {Transportation Research Part C: Emerging Technologies},

  number       = PC,

  volume       = 58,

  place        = {United States},

  year         = {Sat Jan 31 00:00:00 EST 2015},

  month        = {Sat Jan 31 00:00:00 EST 2015}

}

Copy to clipboard

Journal Article:

Free Publicly Available Full Text

Accepted Manuscript (DOE)

Publisher's Version of Record

https://doi.org/10.1016/j.trc.2014.12.009

Other availability

Search WorldCat to find libraries that may hold this journal

Citation Metrics:

Cited by: 41 works

Citation information provided by
Web of Science

Save / Share:

Export Metadata

Save to My Library

Works referenced in this record:

Reinforcement Learning for True Adaptive Traffic Signal Control
journal, May 2003

Abdulhai, Baher; Pringle, Rob; Karakoulas, Grigoris J.
Journal of Transportation Engineering, Vol. 129, Issue 3
DOI: 10.1061/(ASCE)0733-947X(2003)129:3(278)

Store-and-forward based methods for the signal control problem in large-scale congested urban road networks
journal, April 2009

Aboudolas, K.; Papageorgiou, M.; Kosmatopoulos, E.
Transportation Research Part C: Emerging Technologies, Vol. 17, Issue 2
DOI: 10.1016/j.trc.2008.10.002

Unified Framework for Dynamic Traffic Assignment and Signal Control with Cell Transmission Model
journal, January 2012

Aziz, H. M. Abdul; Ukkusuri, Satish V.
Transportation Research Record: Journal of the Transportation Research Board, Vol. 2311, Issue 1
DOI: 10.3141/2311-07

A Distributed Approach for Coordination of Traffic Signal Agents
journal, January 2005

Bazzan, Ana L. C.
Autonomous Agents and Multi-Agent Systems, Vol. 10, Issue 1
DOI: 10.1007/s10458-004-6975-9

Learning in groups of traffic signals
journal, June 2010

Bazzan, Ana L. C.; de Oliveira, Denise; da Silva, Bruno C.
Engineering Applications of Artificial Intelligence, Vol. 23, Issue 4
DOI: 10.1016/j.engappai.2009.11.009

System Optimal Signal Optimization Formulation
journal, January 2006

Beard, Christopher; Ziliaskopoulos, Athanasios
Transportation Research Record: Journal of the Transportation Research Board, Vol. 1978
DOI: 10.3141/1978-15

The real-time urban traffic control system CRONOS: Algorithm and experiments
journal, February 2006

Boillot, Florence; Midenet, Sophie; Pierrelée, Jean-Claude
Transportation Research Part C: Emerging Technologies, Vol. 14, Issue 1
DOI: 10.1016/j.trc.2006.05.001

A multivariable regulator approach to traffic-responsive network-wide signal control
journal, February 2002

Diakaki, Christina; Papageorgiou, Markos; Aboudolas, Kostas
Control Engineering Practice, Vol. 10, Issue 2
DOI: 10.1016/S0967-0661(01)00121-6

Towards multi-agent reinforcement learning for integrated network of optimal traffic controllers (MARLIN-OTC)
journal, April 2010

El-Tantawy, Samah; Abdulhai, Baher
Transportation Letters, Vol. 2, Issue 2
DOI: 10.3328/TL.2010.02.02.89-110

A multiagent system for optimizing urban traffic
conference, January 2003

France, J.; Ghorbani, A. A.
2003 IEEE/WIC International Conference on Intelligent Agent Technology, IEEE/WIC International Conference on Intelligent Agent Technology, 2003. IAT 2003.
DOI: 10.1109/IAT.2003.1241110

Model predictive control for optimal coordination of ramp metering and variable speed limits
journal, June 2005

Hegyi, Andreas; De Schutter, Bart; Hellendoorn, Hans
Transportation Research Part C: Emerging Technologies, Vol. 13, Issue 3
DOI: 10.1016/j.trc.2004.08.001

Traffic Signal Optimization with Greedy Randomized Tabu Search Algorithm
journal, August 2012

Hu, Ta-Yin; Chen, Li-Wen
Journal of Transportation Engineering, Vol. 138, Issue 8
DOI: 10.1061/(ASCE)TE.1943-5436.0000404

Graphical Models
journal, February 2004

Jordan, Michael I.
Statistical Science, Vol. 19, Issue 1
DOI: 10.1214/088342304000000026

Evaluating the impacts of urban corridor traffic signal optimization on vehicle emissions and fuel consumption
journal, March 2012

Kwak, Jaeyoung; Park, Byungkyu; Lee, Jaesup
Transportation Planning and Technology, Vol. 35, Issue 2
DOI: 10.1080/03081060.2011.651877

An Enhanced 0–1 Mixed-Integer LP Formulation for Traffic Signal Control
journal, December 2004

Lin, W. -H.; Wang, C.
IEEE Transactions on Intelligent Transportation Systems, Vol. 5, Issue 4
DOI: 10.1109/TITS.2004.838217

A Cell-Based Traffic Control Formulation: Strategies and Benefits of Dynamic Timing Plans
journal, May 2001

Lo, Hong K.
Transportation Science, Vol. 35, Issue 2
DOI: 10.1287/trsc.35.2.148.10136

Utopia
book, January 1990

Mauro, V.; Di Taranto, C.
Control, Computers, Communications in Transportation
DOI: 10.1016/B978-0-08-037025-5.50042-6

Traffic signal control using reinforcement learning and the max-plus algorithm as a coordinating strategy
conference, September 2012

Medina, Juan C.; Benekohal, Rahim F.
2012 15th International IEEE Conference on Intelligent Transportation Systems - (ITSC 2012)
DOI: 10.1109/ITSC.2012.6338911

Genetic reinforcement learning for cooperative traffic signal control
conference, January 1994

Mikami, S.; Kakazu, Y.
Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence
DOI: 10.1109/ICEC.1994.350012

A real-time traffic signal control system: architecture, algorithms, and analysis
journal, December 2001

Mirchandani, Pitu; Head, Larry
Transportation Research Part C: Emerging Technologies, Vol. 9, Issue 6
DOI: 10.1016/S0968-090X(00)00047-4

Traffic Lights Control with Adaptive Group Formation Based on Swarm Intelligence
book, January 2006

de Oliveira, Denise; Bazzan, Ana L. C.
Ant Colony Optimization and Swarm Intelligence
DOI: 10.1007/11839088_61

Using cooperative mediation to coordinate traffic lights: a case study
conference, January 2005

de Oliveira, Denise; Bazzan, Ana L. C.; Lesser, Victor
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems - AAMAS '05
DOI: 10.1145/1082473.1082544

A Mathematical Logic Approach for the Transformation of the Linear Conditional Piecewise Functions of Dispersion-and-Store and Cell Transmission Traffic Flow Models into Linear Mixed-Integer Form
journal, February 2009

Pavlis, Yannis; Recker, Will
Transportation Science, Vol. 43, Issue 1
DOI: 10.1287/trsc.1080.0254

Tree consistency and bounds on the performance of the max-product algorithm and its generalizations
journal, April 2004

Wainwright, Martin; Jaakkola, Tommi; Willsky, Alan
Statistics and Computing, Vol. 14, Issue 2
DOI: 10.1023/B:STCO.0000021412.33763.d5

Simulation and optimization of traffic in a city
conference, January 2004

Wiering, M.; Vreeken, J.; van Veenen, J.
IEEE Intelligent Vehicles Symposium, 2004
DOI: 10.1109/IVS.2004.1336426

A Novel Signal-Scheduling Algorithm With Quality-of-Service Provisioning for an Isolated Intersection
journal, September 2008

Wunderlich, R.; Elhanany, I.
IEEE Transactions on Intelligent Transportation Systems, Vol. 9, Issue 3
DOI: 10.1109/TITS.2008.928266

Works referencing / citing this record:

A Simultaneous Solution for Reserve Capacity Maximization and Delay Minimization Problems in Signalized Road Networks
journal, May 2019

Baskan, Ozgur; Ceylan, Huseyin; Ozan, Cenk
Journal of Advanced Transportation, Vol. 2019
DOI: 10.1155/2019/6203137

Similar Records in DOE PAGES and OSTI.GOV collections:

Integration of Decentralized Graph-Based Multi-Agent Reinforcement Learning with Digital Twin for Traffic Signal Optimization

Journal Article Kumarasamy, Vijayalakshmi K. ; Saroj, Abhilasha Jairam ; Liang, Yu ; ... - Symmetry

Machine learning (ML) methods, particularly Reinforcement Learning (RL), have gained widespread attention for optimizing traffic signal control in intelligent transportation systems. However, existing ML approaches often exhibit limitations in scalability and adaptability, particularly within large traffic networks. This paper introduces an innovative solution by integrating decentralized graph-based multi-agent reinforcement learning (DGMARL) with a Digital Twin to enhance traffic signal optimization, targeting the reduction of traffic congestion and network-wide fuel consumption associated with vehicle stops and stop delays. In this approach, DGMARL agents are employed to learn traffic state patterns and make informed decisions regarding traffic signal control. The integration withmore »« less
https://doi.org/10.3390/sym16040448
Traffic Signal Optimization by Integrating Reinforcement Learning and Digital Twins

Conference Kumarasamy, Vijayalakshmi K. ; Saroj, Abhilasha ; Liang, Yu ; ...

Machine learning (ML) methods, especially reinforcement learning (RL), have been widely considered for traffic signal optimization in intelligent transportation systems. Most of these ML methods are centralized, lacking in scalability and adaptability in large traffic networks. Further, it is challenging to train such ML models due to the lack of training platforms and/or the cost of deploying and training in a real traffic networks. This paper presents an approach for the integration of decentralized graph-based multi-agent reinforcement learning (DGMARL) with a Digital Twin (DT) to optimize traffic signals for the reduction of traffic congestion and network-wide fuel consumption related tomore »« less
https://doi.org/10.1109/SWC57546.2023.10448974

Full Text Available
Investigating the Impact of Connected Vehicle Market Share on the Performance of Reinforcement-Learning Based Traffic Signal Control

Technical Report Aziz, H M Abdul ; Wang, Hong ; Young, Stanley ; ...

We aim to understand and explore the performance of reinforcement learning based signal control algorithms in a mixed environment with less than 100% market share of connected and automated vehicles (CAVs). Within a simulation environment, we have considered partial connectivity—less than 100% market share of CAVs—in the network and investigated the impact on the performance of the signal control algorithm. Two test networks including a four-intersection arterial in Lankershim Boulevard, CA, and a portion of downtown Springfield, IL, with 20 intersections. The first network is calibrated in the micro-simulator PTV Vissim with the US DOT provided NGSIM datasets. The resultsmore »« less
https://doi.org/10.2172/1566974

Full Text Available
Network-Wide Traffic Signal Control Using Bilinear System Modeling and Adaptive Optimization

Journal Article Wang, Hong ; Zhu, Meixin ; Hong, Wanshi ; ... - IEEE Transactions on Intelligent Transportation Systems

This study proposes a new multi-input multi-output optimal bilinear signal control method in which a bilinear dynamic model approximation is used to capture the nonlinear dynamics of the urban traffic networks. With signal green time splits as the control input and traffic delay changes as the output for each intersections in the network, a bilinear system model was developed, which, on the basis of linear system modeling, takes interactions among traffic delays and signal timing splits into consideration. Based on the bilinear system modeling framework, we conducted two steps in each time interval to derive traffic control strategies: (1) wemore »« less
https://doi.org/10.1109/tits.2022.3215537

Full Text Available
Minimizing Energy Consumption from Connected Signalized Intersections by Reinforcement Learning

Conference Bin Al Islam, S. M. A. ; Aziz, H. M. Abdul ; Wang, Hong ; ...

Explicit energy minimization objectives are often discouraged in signal optimization algorithms due to its negative impact on mobility performance. One potential direction to solve this problem is to provide a balanced objective function to achieve desired mobility with minimized energy consumption. This research developed a reinforcement learning (RL) based control with reward functions considering energy and mobility in a joint manner-a penalty function is introduced for number of stops. Further, we proposed a clustering-based technique to make the state-space finite which is critical for a tractable implementation of the RL algorithm. We implemented the algorithm in a calibrated NG-SIM networkmore »« less
https://doi.org/10.1109/ITSC.2018.8569891

Similar Records

Title: A junction-tree based learning algorithm to optimize network wide traffic control: A coordinated multi-agent framework

Abstract

Citation Formats

Reinforcement Learning for True Adaptive Traffic Signal Control journal, May 2003

Store-and-forward based methods for the signal control problem in large-scale congested urban road networks journal, April 2009

Unified Framework for Dynamic Traffic Assignment and Signal Control with Cell Transmission Model journal, January 2012

A Distributed Approach for Coordination of Traffic Signal Agents journal, January 2005

Learning in groups of traffic signals journal, June 2010

System Optimal Signal Optimization Formulation journal, January 2006

The real-time urban traffic control system CRONOS: Algorithm and experiments journal, February 2006

A multivariable regulator approach to traffic-responsive network-wide signal control journal, February 2002

Towards multi-agent reinforcement learning for integrated network of optimal traffic controllers (MARLIN-OTC) journal, April 2010

A multiagent system for optimizing urban traffic conference, January 2003

Model predictive control for optimal coordination of ramp metering and variable speed limits journal, June 2005

Traffic Signal Optimization with Greedy Randomized Tabu Search Algorithm journal, August 2012

Graphical Models journal, February 2004

Evaluating the impacts of urban corridor traffic signal optimization on vehicle emissions and fuel consumption journal, March 2012

An Enhanced 0–1 Mixed-Integer LP Formulation for Traffic Signal Control journal, December 2004

A Cell-Based Traffic Control Formulation: Strategies and Benefits of Dynamic Timing Plans journal, May 2001

Utopia book, January 1990

Traffic signal control using reinforcement learning and the max-plus algorithm as a coordinating strategy conference, September 2012

Genetic reinforcement learning for cooperative traffic signal control conference, January 1994

A real-time traffic signal control system: architecture, algorithms, and analysis journal, December 2001

Traffic Lights Control with Adaptive Group Formation Based on Swarm Intelligence book, January 2006

Using cooperative mediation to coordinate traffic lights: a case study conference, January 2005

A Mathematical Logic Approach for the Transformation of the Linear Conditional Piecewise Functions of Dispersion-and-Store and Cell Transmission Traffic Flow Models into Linear Mixed-Integer Form journal, February 2009

Tree consistency and bounds on the performance of the max-product algorithm and its generalizations journal, April 2004

Simulation and optimization of traffic in a city conference, January 2004

A Novel Signal-Scheduling Algorithm With Quality-of-Service Provisioning for an Isolated Intersection journal, September 2008

A Simultaneous Solution for Reserve Capacity Maximization and Delay Minimization Problems in Signalized Road Networks journal, May 2019

Reinforcement Learning for True Adaptive Traffic Signal Control
journal, May 2003

Store-and-forward based methods for the signal control problem in large-scale congested urban road networks
journal, April 2009

Unified Framework for Dynamic Traffic Assignment and Signal Control with Cell Transmission Model
journal, January 2012

A Distributed Approach for Coordination of Traffic Signal Agents
journal, January 2005

Learning in groups of traffic signals
journal, June 2010

System Optimal Signal Optimization Formulation
journal, January 2006

The real-time urban traffic control system CRONOS: Algorithm and experiments
journal, February 2006

A multivariable regulator approach to traffic-responsive network-wide signal control
journal, February 2002

Towards multi-agent reinforcement learning for integrated network of optimal traffic controllers (MARLIN-OTC)
journal, April 2010

A multiagent system for optimizing urban traffic
conference, January 2003

Model predictive control for optimal coordination of ramp metering and variable speed limits
journal, June 2005

Traffic Signal Optimization with Greedy Randomized Tabu Search Algorithm
journal, August 2012

Graphical Models
journal, February 2004

Evaluating the impacts of urban corridor traffic signal optimization on vehicle emissions and fuel consumption
journal, March 2012

An Enhanced 0–1 Mixed-Integer LP Formulation for Traffic Signal Control
journal, December 2004

A Cell-Based Traffic Control Formulation: Strategies and Benefits of Dynamic Timing Plans
journal, May 2001

Utopia
book, January 1990

Traffic signal control using reinforcement learning and the max-plus algorithm as a coordinating strategy
conference, September 2012

Genetic reinforcement learning for cooperative traffic signal control
conference, January 1994

A real-time traffic signal control system: architecture, algorithms, and analysis
journal, December 2001

Traffic Lights Control with Adaptive Group Formation Based on Swarm Intelligence
book, January 2006

Using cooperative mediation to coordinate traffic lights: a case study
conference, January 2005

A Mathematical Logic Approach for the Transformation of the Linear Conditional Piecewise Functions of Dispersion-and-Store and Cell Transmission Traffic Flow Models into Linear Mixed-Integer Form
journal, February 2009

Tree consistency and bounds on the performance of the max-product algorithm and its generalizations
journal, April 2004

Simulation and optimization of traffic in a city
conference, January 2004

A Novel Signal-Scheduling Algorithm With Quality-of-Service Provisioning for an Isolated Intersection
journal, September 2008

A Simultaneous Solution for Reserve Capacity Maximization and Delay Minimization Problems in Signalized Road Networks
journal, May 2019