Study on Reinforcement Learning-Based Missile Guidance Law

2020 ◽  
Vol 10 (18) ◽  
pp. 6567
Author(s):  
Daseon Hong ◽  
Minjeong Kim ◽  
Sungsu Park

Reinforcement learning is attracting considerable interest as a way to build guidance laws and to solve optimization problems that were previously difficult to solve. Because reinforcement learning-based guidance laws often show better robustness than previously optimized algorithms, several studies have been carried out on the subject. This paper presents a new approach to training a missile guidance law by reinforcement learning and introduces some of its notable characteristics. The novel missile guidance law shows better robustness to the controller model than proportional navigation guidance. The neural network in this paper has the same inputs as proportional navigation guidance, which makes the comparison fair and distinguishes this work from other research. The proposed guidance law is compared with proportional navigation guidance, which is widely regarded as a quasi-optimal missile guidance law. Our work aims to find effective methods for training missile guidance through reinforcement learning and to determine how much better the new method is. Additionally, using the derived policy, we examine which guidance law performs better and under which circumstances. A novel training methodology is proposed first, followed by the performance comparison results.
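For readers unfamiliar with the baseline, the following is a minimal sketch of planar proportional navigation, not the authors' simulation model. It assumes a constant-speed missile, a non-maneuvering target, and the pure-PN form in which the lateral acceleration command is the navigation gain times missile speed times line-of-sight rate. The function name `simulate_pn` and all initial conditions are hypothetical values chosen for illustration.

```python
import math


def simulate_pn(nav_gain=3.0, dt=0.001, t_max=60.0):
    """Planar pure-PN engagement; returns the closest missile-target range in metres."""
    # Hypothetical initial conditions (illustrative only, not from the paper).
    xm, ym = 0.0, 0.0                  # missile position [m]
    vm = 300.0                         # missile speed, held constant [m/s]
    gamma = math.radians(30.0)         # missile flight-path angle [rad]
    xt, yt = 5000.0, 0.0               # target position [m]
    vtx, vty = -100.0, 50.0            # target velocity, held constant [m/s]

    min_range = math.hypot(xt - xm, yt - ym)
    t = 0.0
    while t < t_max:
        # Relative position and velocity of the target with respect to the missile.
        rx, ry = xt - xm, yt - ym
        r = math.hypot(rx, ry)
        min_range = min(min_range, r)
        if r < 1e-6:
            return 0.0                 # numerically a direct hit

        vmx, vmy = vm * math.cos(gamma), vm * math.sin(gamma)
        rvx, rvy = vtx - vmx, vty - vmy

        # Stop once the range starts opening: closest approach has passed.
        closing_speed = -(rx * rvx + ry * rvy) / r
        if closing_speed < 0.0:
            break

        # Line-of-sight rate, then the pure-PN lateral acceleration command.
        los_rate = (rx * rvy - ry * rvx) / (r * r)
        a_cmd = nav_gain * vm * los_rate

        # Forward-Euler update; the command only rotates the velocity vector.
        gamma += (a_cmd / vm) * dt
        xm += vmx * dt
        ym += vmy * dt
        xt += vtx * dt
        yt += vty * dt
        t += dt
    return min_range
```

Calling `simulate_pn(nav_gain=3.0)` and `simulate_pn(nav_gain=0.0)` contrasts the guided and unguided closest-approach distances for this one scenario. The sketch deliberately omits autopilot lag and acceleration limits, which are the kind of controller-model effects the abstract refers to.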

2019 ◽  
Vol 123 (1262) ◽  
pp. 464-483
Author(s):  
X.L. Ai ◽  
L.L. Wang ◽  
Y.C. Shen

This study focuses on the co-operative salvo attack problem of multiple missiles against a stationary target under jointly connected switching topologies subject to time-varying communication delays. By carefully exploring certain features of the typical pure proportional navigation guidance law, a two-stage distributed guidance scheme is proposed without any information on time-to-go in this study to realise the simultaneous attack of multiple missiles. In the first guidance stage, a co-operative guidance law is proposed using local neighbouring communications only to achieve consensus on range-to-go and heading error to provide favourable initial conditions for the latter phase, in which switching topologies and time-varying communication delays are taken into account when obtaining sufficient conditions of consensus in terms of linear matrix inequalities. Then, missiles disconnect from each other and are guided individually by the typical pure proportional navigation guidance law with the same navigation gain to realise salvo attack in the second guidance phase. Finally, numerical simulations are carried out to clearly validate the theoretical results.


Author(s):  
Sheng Sun ◽  
Di Zhou ◽  
Jingyang Zhou ◽  
Kok Lay Teo

The true proportional navigation guidance law, the augmented proportional navigation guidance law, and the adaptive sliding-mode guidance law are each designed based on the planar target-to-missile relative motion dynamics. By a proper construction of a nonlinear Lyapunov function for the line-of-sight angular rates in the three-dimensional guidance dynamics, it is shown that the three guidance laws mentioned above are able to ensure the asymptotic convergence of the angular rates when they are directly applied to the three-dimensional guidance environment. Furthermore, considering the missile autopilot dynamics as a first-order lag, we design three-dimensional nonlinear guidance laws by using the backstepping technique for three cases: (1) the target does not maneuver; (2) the information of target acceleration can be acquired; and (3) the target acceleration is not available but its bound is known a priori. In the first step of the backstepping design of the control law, there is no need to cancel the nonlinear coupling terms in the three-dimensional guidance dynamics, so the final expressions of the proposed guidance laws are significantly simplified. Thus, the proposed nonlinear Lyapunov function for the line-of-sight angular rates is a generalized function for designing three-dimensional guidance laws. Simulation results of a missile interception mission show that the proposed guidance laws are highly effective.
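As background, the planar laws named in this abstract are commonly written in the textbook forms below. This is a sketch in assumed notation, not the paper's own equations: $N$ is the navigation gain, $V_c$ the closing speed, $\dot{\lambda}$ the line-of-sight angular rate, $a_T$ the target acceleration normal to the line of sight, $a_c$ the commanded missile acceleration, $a_M$ the achieved missile acceleration, and $\tau$ the autopilot time constant.

```latex
% True proportional navigation (command normal to the line of sight)
a_c = N V_c \dot{\lambda}

% Augmented proportional navigation (adds a target-acceleration term)
a_c = N V_c \dot{\lambda} + \frac{N}{2}\, a_T

% Autopilot modelled as a first-order lag
\dot{a}_M = \frac{1}{\tau}\left(a_c - a_M\right)
```

The lag equation shows why backstepping is a natural fit: the guidance command $a_c$ acts on the line-of-sight dynamics only through the intermediate state $a_M$.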


Author(s):  
P Gurfil

This paper derives a new non-linear guidance law aimed at interception of highly manoeuvring targets. The guidance law is developed based on the theory of control Lyapunov functions (CLFs), a methodology for universal stabilization of non-linear systems which is also inverse optimal with respect to some performance measure. The three-dimensional guidance dynamics are formulated in a fixed-line-of-sight coordinate system, yielding matching between the target and missile accelerations. Closed-form expressions for the CLF guidance commands are given. Simulation shows that the new guidance scheme significantly outperforms augmented proportional navigation in short-range engagements.


2013 ◽  
Vol 35 (5) ◽  
pp. 703-710
Author(s):  
Seyyed Sajjad Moosapour ◽  
Ghasem Alizadeh ◽  
Sohrab Khanmohammadi ◽  
Hamzeh Moosapour
