DOE PAGES
U.S. Department of Energy
Office of Scientific and Technical Information

Title: Reinforcement learning based schemes to manage client activities in large distributed control systems

Abstract

Large distributed control systems typically can be modeled by a hierarchical structure with two physical layers: console level computers (CLCs) layer and front end computers (FECs) layer. The control system of the Relativistic Heavy Ion Collider (RHIC) at Brookhaven National Laboratory (BNL) consists of more than 500 FECs, each acting as a server providing services to a large number of clients. Hence the interactions between the server and its clients become crucial to the overall system performance. There are different scenarios of the interactions. For instance, there are cases where the server has a limited processing ability and is queried by a large number of clients. Such cases can put a bottleneck in the system, as heavy traffic can slow down or even crash a system, making it momentarily unresponsive. Also, there are cases where the server has adequate ability to process all the traffic from its clients. We pursue different goals in those cases. For the first case, we would like to manage clients’ activities so that their requests are processed by the server as much as possible and the server remains operational. For the second case, we would like to explore an operation point at which the server’s resources get utilized efficiently. Moreover, we consider a real-world time constraint to the above case. The time constraint states that clients expect the responses from the server within a time window. In this work, we analyze those cases from a game theory perspective. We model the underlying interactions as a repeated game between clients, which is carried out in discrete time slots. For clients’ activity management, we apply a reinforcement learning procedure as a baseline to regulate clients’ behaviors. Then we propose a memory scheme to improve its performance. Next, depending on different scenarios, we design corresponding reward functions to stimulate clients in a proper way so that they can learn to optimize different goals. Through extensive simulations, we show that first, the memory structure improves the learning ability of the baseline procedure significantly. Second, by applying appropriate reward functions, clients’ activities can be effectively managed to achieve different optimization goals.
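The abstract outlines the approach without implementation detail. As a rough illustration only (not code from the paper), the Python sketch below simulates the first scenario: many clients querying a capacity-limited server in discrete time slots, each adjusting its send probability with a simple learning-automata-style (linear reward-inaction) update. The client count, server capacity, learning rate, reward values, and the update rule itself are all assumptions standing in for the paper's baseline procedure and scenario-specific reward functions.

import random

NUM_CLIENTS = 50      # assumed number of clients (illustrative)
CAPACITY = 20         # assumed requests the server can process per slot
LEARNING_RATE = 0.05  # assumed step size for the probability update
SLOTS = 5000          # discrete time slots, as in the repeated game

# Each client sends a request in a slot with its own probability p,
# which is nudged toward actions that earned a positive reward.
probs = [0.5] * NUM_CLIENTS

def reward(sent, served):
    """Illustrative reward for the overload scenario: a served request
    earns +1, a dropped request costs -1, staying silent is neutral."""
    if not sent:
        return 0.0
    return 1.0 if served else -1.0

for slot in range(SLOTS):
    sends = [random.random() < p for p in probs]
    # When the offered load exceeds capacity, the server can only
    # serve a random subset of the requests it received.
    senders = [i for i, s in enumerate(sends) if s]
    random.shuffle(senders)
    served_set = set(senders[:CAPACITY])
    for i in range(NUM_CLIENTS):
        r = reward(sends[i], i in served_set)
        if r > 0:
            # Reward-inaction: move p toward the action just taken
            # (a positive reward here only occurs after sending).
            probs[i] += LEARNING_RATE * r * (1.0 - probs[i])
        elif r < 0:
            # Penalty: back off toward not sending.
            probs[i] += LEARNING_RATE * (-r) * (0.0 - probs[i])

print(f"mean send probability after learning: {sum(probs)/NUM_CLIENTS:.3f}")
print(f"expected load per slot: {sum(probs):.1f} (capacity {CAPACITY})")

In this toy setup the penalty for dropped requests pushes the aggregate send rate toward the server's capacity, mirroring the paper's first goal (keep the server operational while processing as many requests as possible). The second scenario and its response-time-window constraint would need different reward functions, and the paper's memory scheme is not modeled here.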

Authors:
Gao, Y.; Chen, J.; Robertazzi, T.; Brown, K. A.
Publication Date:
2019-01-02
Research Org.:
Brookhaven National Lab. (BNL), Upton, NY (United States); Stony Brook Univ., NY (United States)
Sponsoring Org.:
USDOE; National Science Foundation (NSF)
OSTI Identifier:
1489305
Alternate Identifier(s):
OSTI ID: 1491685
Report Number(s):
BNL-210905-2019-JAAM
Journal ID: ISSN 2469-9888; PRABCJ; 014601
Grant/Contract Number:  
SC0012704; 1553385
Resource Type:
Published Article
Journal Name:
Physical Review Accelerators and Beams
Additional Journal Information:
Journal Volume: 22; Journal Issue: 1; Journal ID: ISSN 2469-9888
Publisher:
American Physical Society
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; collective behavior in networks; computational complexity; evolving networks; network formation & growth; coherent structures; collective dynamics; high dimensional systems; nonlinear time-delay systems

Citation Formats

Gao, Y., Chen, J., Robertazzi, T., and Brown, K. A. Reinforcement learning based schemes to manage client activities in large distributed control systems. United States: N. p., 2019. Web. doi:10.1103/PhysRevAccelBeams.22.014601.
Gao, Y., Chen, J., Robertazzi, T., & Brown, K. A. Reinforcement learning based schemes to manage client activities in large distributed control systems. United States. https://doi.org/10.1103/PhysRevAccelBeams.22.014601
Gao, Y., Chen, J., Robertazzi, T., and Brown, K. A. 2019. "Reinforcement learning based schemes to manage client activities in large distributed control systems". United States. https://doi.org/10.1103/PhysRevAccelBeams.22.014601.
@article{osti_1489305,
title = {Reinforcement learning based schemes to manage client activities in large distributed control systems},
author = {Gao, Y. and Chen, J. and Robertazzi, T. and Brown, K. A.},
abstractNote = {Large distributed control systems typically can be modeled by a hierarchical structure with two physical layers: console level computers (CLCs) layer and front end computers (FECs) layer. The control system of the Relativistic Heavy Ion Collider (RHIC) at Brookhaven National Laboratory (BNL) consists of more than 500 FECs, each acting as a server providing services to a large number of clients. Hence the interactions between the server and its clients become crucial to the overall system performance. There are different scenarios of the interactions. For instance, there are cases where the server has a limited processing ability and is queried by a large number of clients. Such cases can put a bottleneck in the system, as heavy traffic can slow down or even crash a system, making it momentarily unresponsive. Also, there are cases where the server has adequate ability to process all the traffic from its clients. We pursue different goals in those cases. For the first case, we would like to manage clients’ activities so that their requests are processed by the server as much as possible and the server remains operational. For the second case, we would like to explore an operation point at which the server’s resources get utilized efficiently. Moreover, we consider a real-world time constraint to the above case. The time constraint states that clients expect the responses from the server within a time window. In this work, we analyze those cases from a game theory perspective. We model the underlying interactions as a repeated game between clients, which is carried out in discrete time slots. For clients’ activity management, we apply a reinforcement learning procedure as a baseline to regulate clients’ behaviors. Then we propose a memory scheme to improve its performance. Next, depending on different scenarios, we design corresponding reward functions to stimulate clients in a proper way so that they can learn to optimize different goals. Through extensive simulations, we show that first, the memory structure improves the learning ability of the baseline procedure significantly. Second, by applying appropriate reward functions, clients’ activities can be effectively managed to achieve different optimization goals.},
doi = {10.1103/PhysRevAccelBeams.22.014601},
journal = {Physical Review Accelerators and Beams},
number = 1,
volume = 22,
place = {United States},
year = {2019},
month = {jan}
}

Journal Article:
Free Publicly Available Full Text

Citation Metrics:
Cited by: 3 works
Citation information provided by
Web of Science

Figures / Tables:

FIG. 1: RHIC system hardware architecture.


Works referenced in this record:

Multiagent learning using a variable learning rate
journal, April 2002

  • Bowling, Michael; Veloso, Manuela
  • Artificial Intelligence, Vol. 136, Issue 2
  • DOI: 10.1016/S0004-3702(02)00121-2

Learning, hypothesis testing, and Nash equilibrium
journal, October 2003

  • Foster, Dean P.; Young, H. Peyton
  • Games and Economic Behavior, Vol. 45, Issue 1

Conditional Universal Consistency
journal, October 1999

  • Fudenberg, Drew; Levine, David K.
  • Games and Economic Behavior, Vol. 29, Issue 1-2
  • DOI: 10.1006/game.1998.0705

A Simple Adaptive Procedure Leading to Correlated Equilibrium
journal, September 2000

  • Hart, Sergiu; Mas-Colell, Andreu
  • Econometrica, Vol. 68, Issue 5
  • DOI: 10.1111/1468-0262.00153

Calibrated Learning and Correlated Equilibrium
journal, October 1997

  • Foster, Dean P.; Vohra, Rakesh V.
  • Games and Economic Behavior, Vol. 21, Issue 1-2
  • DOI: 10.1006/game.1997.0595

RHIC control system
journal, March 2003

  • Barton, D. S.; Binello, S.; Buxton, W.
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 499, Issue 2-3
  • DOI: 10.1016/S0168-9002(02)01943-5