
Title: Reinforcement learning based schemes to manage client activities in large distributed control systems

Journal Article · Physical Review Accelerators and Beams

Large distributed control systems can typically be modeled by a hierarchical structure with two physical layers: a console-level computer (CLC) layer and a front-end computer (FEC) layer. The control system of the Relativistic Heavy Ion Collider (RHIC) at Brookhaven National Laboratory (BNL) consists of more than 500 FECs, each acting as a server providing services to a large number of clients. Hence, the interactions between a server and its clients are crucial to overall system performance. These interactions arise in different scenarios. For instance, a server with limited processing ability may be queried by a large number of clients; such cases can create a bottleneck, as heavy traffic can slow down or even crash the system, making it momentarily unresponsive. In other cases, the server has adequate capacity to process all the traffic from its clients. We pursue different goals in these cases. In the first case, we would like to manage clients’ activities so that as many of their requests as possible are processed while the server remains operational. In the second case, we would like to find an operating point at which the server’s resources are utilized efficiently. Moreover, we add a real-world time constraint to the latter case: clients expect responses from the server within a given time window. In this work, we analyze these cases from a game-theoretic perspective. We model the underlying interactions as a repeated game among the clients, played in discrete time slots. To manage client activity, we apply a reinforcement learning procedure as a baseline to regulate clients’ behaviors, and we propose a memory scheme to improve its performance. Then, depending on the scenario, we design corresponding reward functions that incentivize clients to learn to optimize the different goals. Through extensive simulations, we show, first, that the memory structure significantly improves the learning ability of the baseline procedure and, second, that with appropriate reward functions, clients’ activities can be effectively managed to achieve the different optimization goals.
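
To make the setup concrete, below is a minimal, self-contained sketch of the kind of repeated game and learning rule the abstract describes, for the limited-capacity scenario. It is not the paper's implementation: the epsilon-greedy Q-learning update, the outcome memory of length 3, the reward values, and all names and parameters (Client, reward_fn, N_CLIENTS, CAPACITY, and so on) are assumptions made purely for illustration; the paper's baseline procedure and memory scheme differ in detail.

```python
# Illustrative sketch only (not the paper's code): N clients repeatedly
# decide, in discrete time slots, whether to query a server of limited
# capacity.  Each client runs an epsilon-greedy Q-learning rule over a
# short memory of recent outcomes; the reward function rewards served
# requests and penalizes overload.  All constants are assumed values.
import random
from collections import deque

N_CLIENTS = 50   # number of clients (assumed)
CAPACITY = 20    # requests the server can process per slot (assumed)
MEMORY = 3       # length of each client's outcome memory (assumed)
EPSILON = 0.1    # exploration rate
ALPHA = 0.1      # learning rate
N_SLOTS = 5000   # length of the repeated game

class Client:
    def __init__(self):
        # State = tuple of the last MEMORY outcomes ('served', 'dropped',
        # 'idle'); the memory lets the policy condition on recent history.
        self.memory = deque(['idle'] * MEMORY, maxlen=MEMORY)
        self.q = {}  # (state, action) -> estimated reward

    def state(self):
        return tuple(self.memory)

    def choose(self):
        # Epsilon-greedy over actions {0: stay silent, 1: send a request}.
        if random.random() < EPSILON:
            return random.randint(0, 1)
        s = self.state()
        return 1 if self.q.get((s, 1), 0.0) > self.q.get((s, 0), 0.0) else 0

    def update(self, action, outcome, reward):
        # One-step update: each slot is treated as a bandit round of the
        # repeated stage game (no bootstrapping).
        key = (self.state(), action)
        old = self.q.get(key, 0.0)
        self.q[key] = old + ALPHA * (reward - old)
        self.memory.append(outcome)

def reward_fn(action, served, load):
    # Reward shaping for the limited-server scenario: a served request
    # pays +1, a dropped request costs -1, and staying silent during an
    # overloaded slot earns a small bonus.  The paper's other scenarios
    # would use different reward functions.
    if action == 0:
        return 0.5 if load > CAPACITY else 0.0
    return 1.0 if served else -1.0

clients = [Client() for _ in range(N_CLIENTS)]
for t in range(N_SLOTS):
    actions = [c.choose() for c in clients]
    load = sum(actions)
    senders = [i for i, a in enumerate(actions) if a == 1]
    random.shuffle(senders)
    served_set = set(senders[:CAPACITY])  # server serves up to CAPACITY
    for i, c in enumerate(clients):
        served = i in served_set
        outcome = 'served' if served else ('dropped' if actions[i] else 'idle')
        c.update(actions[i], outcome, reward_fn(actions[i], served, load))

print(f"final-slot load: {load} / capacity {CAPACITY}")
```

Under these assumptions, clients that repeatedly observe 'dropped' outcomes learn to back off, driving the per-slot load toward the server's capacity. The paper's second scenario would instead swap in a reward function targeting efficient resource utilization under a response-time window.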

Research Organization:
Brookhaven National Lab. (BNL), Upton, NY (United States); Stony Brook Univ., NY (United States)
Sponsoring Organization:
USDOE; National Science Foundation (NSF)
Grant/Contract Number:
SC0012704; 1553385
OSTI ID:
1489305
Alternate ID(s):
OSTI ID: 1491685
Report Number(s):
BNL-210905-2019-JAAM; PRABCJ; 014601
Journal Information:
Physical Review Accelerators and Beams, Vol. 22, Issue 1; ISSN 2469-9888
Publisher:
American Physical Society
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 3 works (citation information provided by Web of Science)
