Minimization of energy costs for UAV management in a conflict task

2021 ◽  
Vol 279 ◽  
pp. 01006
Author(s):  
Viktor Lapshin ◽  
Stanislav Ivanov

This article considers a method of developing an evader control strategy for the nonlinear differential pursuit-evasion game problem. It is assumed that the pursuer resorts to the most probable control strategy in order to capture the evader, and that at each moment the evader knows its own and the enemy’s physical capabilities. This assumption makes it possible to reduce the game problem to a problem of unilateral evader control, without requiring that a saddle-point condition be fulfilled. The control is realised in the form of a synthesis and additionally ensures that the requirements for bringing the evader to a specified area, with terminal optimization of certain state variables, are satisfied. Solving this problem significantly reduces the energy losses of controlling an unmanned vehicle: in the pursuit task, the expected effect is a fuel saving of 15-20 % with a probability of 0.98.
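The reduction to unilateral evader control can be illustrated with a toy planar simulation in which the pursuer applies its most probable strategy (here taken to be pure pursuit) and the evader runs a fixed open-loop control. This is a minimal sketch under point-mass kinematics and constant speeds; the function name, the fixed evader heading, and all parameter values are illustrative assumptions, not the article's model:

```python
import numpy as np

def simulate(p0, e0, vp, ve, dt=0.01, t_max=60.0, capture_radius=0.5):
    """Pure-pursuit pursuer vs. a straight-running evader (illustrative).

    Returns the capture time, or None if the evader escapes over the horizon.
    """
    p, e = np.array(p0, float), np.array(e0, float)
    evader_dir = np.array([1.0, 0.0])  # evader's fixed heading (assumption)
    t = 0.0
    while t < t_max:
        los = e - p                      # line of sight, pursuer to evader
        dist = np.linalg.norm(los)
        if dist < capture_radius:
            return t                     # captured
        p += vp * dt * los / dist        # pursuer: "most probable" pure-pursuit control
        e += ve * dt * evader_dir        # evader: unilateral open-loop control
        t += dt
    return None
```

Because the pursuer's strategy is assumed known, the evader's side reduces to choosing its own control to shape the outcome, which is the unilateral-control viewpoint the abstract describes.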

Aerospace ◽  
2021 ◽  
Vol 8 (10) ◽  
pp. 299
Author(s):  
Bin Yang ◽  
Pengxuan Liu ◽  
Jinglang Feng ◽  
Shuang Li

This paper presents a novel and robust two-stage pursuit strategy for incomplete-information impulsive space pursuit-evasion missions under the J2 perturbation. The strategy first divides the impulsive pursuit-evasion game into a far-distance rendezvous stage and a close-distance game stage according to the perception range of the evader. The far-distance rendezvous stage is transformed into a rendezvous trajectory optimization problem, and a new objective function is proposed to obtain the pursuit trajectory with the optimal terminal pursuit capability. For the close-distance game stage, a closed-loop pursuit approach is proposed that uses a reinforcement learning algorithm, the deep deterministic policy gradient (DDPG), to solve and update the pursuit trajectory under incomplete information. The feasibility of this strategy and its robustness to different initial states of the pursuer and evader and to different evasion strategies are demonstrated for sun-synchronous orbit pursuit-evasion game scenarios. The results of the Monte Carlo tests show that the successful pursuit ratio of the proposed method exceeds 91% for all the given scenarios.
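The two-stage switching logic can be sketched as a dispatcher that hands control to a pre-optimized rendezvous plan outside the evader's perception range and to a trained closed-loop policy inside it. The names, the state layout (position in the first three components), and the range value are illustrative assumptions; the paper's actual implementation optimizes impulsive maneuvers and trains the close-range policy with DDPG:

```python
import numpy as np

PERCEPTION_RANGE = 50.0  # evader sensing radius (illustrative value)

def pursuit_control(pursuer_state, evader_state, rendezvous_plan, policy):
    """Two-stage dispatcher: open-loop rendezvous far away, learned policy up close."""
    rel = np.asarray(evader_state[:3], float) - np.asarray(pursuer_state[:3], float)
    if np.linalg.norm(rel) > PERCEPTION_RANGE:
        # Stage 1: follow the pre-optimized rendezvous maneuver schedule.
        return rendezvous_plan(pursuer_state)
    # Stage 2: close-distance game -- query the trained (e.g., DDPG) policy.
    return policy(np.concatenate([pursuer_state, evader_state]))
```

The split matters because outside its perception range the evader cannot react, so an open-loop optimal trajectory suffices; once the evader can maneuver in response, a closed-loop policy is needed.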


Author(s):  
J. A. Morgan

A qualitative account is given of a differential pursuit/evasion game. A criterion for the existence of an intercept solution is obtained using future cones that contain all attainable trajectories of the target or interceptor originating from an initial position. A necessary and sufficient condition that an opportunity to intercept always exists is that, after some initial time, the future cone of the target be contained within the future cone of the interceptor. The sufficient condition may be regarded as a kind of Nash equilibrium.
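The cone-containment criterion has a concrete form for point-mass players with bounded speeds: the future cone at elapsed time t is then a ball of radius v·t around the initial position, and containment of balls reduces to the inequality |x_T − x_I| + v_T·t ≤ v_I·t. A minimal sketch under that simplifying point-mass assumption (the function name is illustrative, and the paper's treatment is more general):

```python
import numpy as np

def containment_time(x_target, x_interceptor, v_target, v_interceptor):
    """Earliest elapsed time after which the target's future cone (a ball of
    radius v_T*t about x_T) lies inside the interceptor's (radius v_I*t about x_I).

    ball(x_T, v_T*t) <= ball(x_I, v_I*t)  iff  |x_T - x_I| <= (v_I - v_T) * t.
    Returns None when no such time exists (interceptor not strictly faster).
    """
    gap = np.linalg.norm(np.asarray(x_target, float) - np.asarray(x_interceptor, float))
    if gap == 0.0:
        return 0.0
    if v_interceptor <= v_target:
        return None  # cones never nest: intercept is not guaranteed
    return gap / (v_interceptor - v_target)
```

In this special case the criterion recovers the intuitive fact that a strictly faster interceptor can always guarantee an intercept opportunity after a finite delay proportional to the initial separation.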


Entropy ◽  
2021 ◽  
Vol 23 (11) ◽  
pp. 1433
Author(s):  
Kaifang Wan ◽  
Dingwei Wu ◽  
Yiwei Zhai ◽  
Bo Li ◽  
Xiaoguang Gao ◽  
...  

A pursuit–evasion game is a classical maneuver-confrontation problem in the multi-agent systems (MASs) domain. This paper develops an online decision technique based on deep reinforcement learning (DRL) to address environment sensing and decision-making in pursuit–evasion games. A control-oriented framework built on the multi-agent deep deterministic policy gradient (MADDPG) algorithm implements multi-agent cooperative decision-making, avoiding the tedious state-variable specification required by the traditionally complicated modeling process. To address the discrepancy between a model and a real scenario, this paper introduces adversarial disturbances and proposes a novel adversarial attack trick together with an adversarial learning MADDPG (A2-MADDPG) algorithm. By applying the adversarial attack trick to the agents themselves, real-world uncertainties are modeled, enabling robust training. During training, adversarial learning preprocesses the actions of the multiple agents so that they respond properly to uncertain dynamic changes in MASs. Experimental results verify that the proposed approach performs effectively for both pursuers and evaders, each of which learns the corresponding confrontation strategy during training.
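The idea of perturbing the agents' own actions during training can be approximated in a few lines: before each environment step, every agent's action receives bounded disturbance noise so the learned policies remain robust to model-reality mismatch. This is a simplified stand-in (random sign noise) for the learned adversarial attack of A2-MADDPG; the function name and the epsilon value are illustrative assumptions:

```python
import numpy as np

def perturb_actions(actions, epsilon=0.1, rng=None):
    """Inject bounded disturbance into each agent's action during training.

    A2-MADDPG uses a learned adversary to pick worst-case perturbations;
    here the 'attack' is random sign noise of magnitude epsilon, clipped
    to the usual [-1, 1] action range.
    """
    rng = np.random.default_rng() if rng is None else rng
    actions = np.asarray(actions, float)
    attack = epsilon * rng.choice([-1.0, 1.0], size=actions.shape)
    return np.clip(actions + attack, -1.0, 1.0)
```

Training against such perturbed actions forces each policy to succeed not just on the nominal model but on a neighborhood of it, which is the robustness goal the abstract describes.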


2021 ◽  
Author(s):  
Dingding Qi ◽  
Longyue Li ◽  
Hailong Xu ◽  
Ye Tian ◽  
Huizhen Zhao

2019 ◽  
Vol 9 (15) ◽  
pp. 3190
Author(s):  
Junfeng Zhou ◽  
Lin Zhao ◽  
Jianhua Cheng ◽  
Shuo Wang ◽  
Yipeng Wang

This paper studies the orbital pursuit-evasion-defense problem with continuous low-thrust propulsion. A control strategy for the pursuer is proposed based on fuzzy comprehensive evaluation and the differential game. First, the system is described by Lawden’s equations and simplified by introducing relative state variables and zero-effort-miss (ZEM) variables. Then, the objective function of the pursuer is designed based on the fuzzy comprehensive evaluation, and the analytical necessary conditions for the optimal control strategy are presented. Finally, a hybrid method combining a multi-objective genetic algorithm with the multiple shooting method is proposed to solve the orbital pursuit-evasion-defense problem. The simulation results show that the proposed control strategy can handle the orbital pursuit-evasion-defense problem effectively.
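The ZEM variables have a particularly simple form when the free drift is a double integrator: the predicted miss is the current relative position propagated by the relative velocity over the time-to-go. A sketch under that simplifying assumption (the paper itself uses Lawden's equations, where the free drift is propagated by a state transition matrix instead):

```python
import numpy as np

def zero_effort_miss(rel_pos, rel_vel, t_go):
    """Zero-effort miss: predicted relative position at the final time if
    neither player applies further control.

    Shown for double-integrator kinematics: ZEM = r + v * t_go. With
    Lawden/Clohessy-Wiltshire dynamics, r and v would instead be propagated
    through the state transition matrix Phi(t_f, t).
    """
    return np.asarray(rel_pos, float) + t_go * np.asarray(rel_vel, float)
```

Working in ZEM coordinates is useful because the uncontrolled drift is absorbed into the variable itself, leaving only the control-induced part of the miss to optimize.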

