Multirobot Collaborative Pursuit Target Robot by Improved MADDPG
- School of Mechanical and Electronic Engineering, Wuhan University of Technology, Wuhan 430070, China
- Intelligent Transport Systems Research Center, Wuhan University of Technology, Wuhan 430063, China
Policy formulation is one of the main problems in multirobot systems, especially in multirobot pursuit-evasion scenarios, where sparse rewards and random environment changes make it difficult to find a good strategy. Existing multirobot decision-making methods mostly rely on environmental rewards alone to drive robots toward the target task, and this does not achieve good results. This paper proposes a multirobot pursuit method based on an improved multiagent deep deterministic policy gradient (MADDPG) algorithm, which addresses the sparse-reward problem in multirobot pursuit-evasion scenarios by combining an intrinsic reward with the reward from the external environment. A state similarity module based on a threshold constraint forms part of the intrinsic reward signal output by the intrinsic curiosity module; it is used to balance overexploration against insufficient exploration, so that the agent can use the intrinsic reward more effectively to learn better strategies. Simulation results show that the proposed method significantly improves both the reward value of the robots and the success rate of the pursuit task. The change is directly visible in the real-time distance between the pursuer and the escapee: a pursuer trained with the improved algorithm closes in on the escapee more quickly, and the average following distance also decreases.
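The abstract does not give the exact formulation, so the following is only a minimal sketch of one way a threshold-gated curiosity bonus could be added to an environment reward. The function names, the cosine-similarity measure, the gating direction (suppressing the bonus when consecutive states are nearly identical), and the `threshold` and `scale` values are all illustrative assumptions, not the authors' method.

```python
import numpy as np


def curiosity_reward(predicted_next, actual_next, scale=0.5):
    """Curiosity-style bonus: scaled half squared error of a forward-model prediction.

    `scale` is a hypothetical weighting factor, not a value from the paper.
    """
    err = np.asarray(predicted_next, dtype=float) - np.asarray(actual_next, dtype=float)
    return scale * 0.5 * float(np.dot(err, err))


def state_similarity(state, next_state):
    """Cosine similarity between two state vectors (an assumed similarity measure)."""
    a = np.asarray(state, dtype=float)
    b = np.asarray(next_state, dtype=float)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    # Treat a zero-length vector as "fully similar" to avoid division by zero.
    return float(np.dot(a, b) / denom) if denom > 0 else 1.0


def total_reward(extrinsic, state, next_state, predicted_next,
                 threshold=0.99, scale=0.5):
    """Environment reward plus a curiosity bonus gated by state similarity.

    Assumption: when consecutive states are at least `threshold` similar,
    the transition is treated as uninformative and the bonus is dropped.
    """
    if state_similarity(state, next_state) >= threshold:
        intrinsic = 0.0
    else:
        intrinsic = curiosity_reward(predicted_next, next_state, scale)
    return extrinsic + intrinsic
```

In a MADDPG-style training loop, a quantity like `total_reward` would stand in for the raw environment reward of each pursuer when transitions are stored in the replay buffer; the forward-model prediction would come from a learned network rather than being passed in by hand.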
- Sponsoring Organization:
- USDOE Office of Electricity (OE), Advanced Grid Research & Development. Power Systems Engineering Research
- OSTI ID:
- 1846468
- Journal Information:
- Journal Name: Computational Intelligence and Neuroscience; Vol. 2022; ISSN 1687-5265
- Publisher:
- Hindawi Publishing Corporation
- Country of Publication:
- Country unknown/Code not available
- Language:
- English