scholarly journals Time-varying Proportional Navigation Guidance using Deep Reinforcement Learning

Author(s):  
Hyeok-Joo Chae ◽  
Daniel Lee ◽  
Su-Jeong Park ◽  
Han-Lim Choi ◽  
Han-Sol Park ◽  
...  
2019 ◽  
Vol 123 (1262) ◽  
pp. 464-483
Author(s):  
X.L. Ai ◽  
L.L. Wang ◽  
Y.C. Shen

ABSTRACTThis study focuses on the co-operative salvo attack problem of multiple missiles against a stationary target under jointly connected switching topologies subject to time-varying communication delays. By carefully exploring certain features of the typical pure proportional navigation guidance law, a two-stage distributed guidance scheme is proposed without any information on time-to-go in this study to realise the simultaneous attack of multiple missiles. In the first guidance stage, a co-operative guidance law is proposed using local neighbouring communications only to achieve consensus on range-to-go and heading error to provide favourable initial conditions for the latter phase, in which switching topologies and time-varying communication delays are taken into account when obtaining sufficient conditions of consensus in terms of linear matrix inequalities. Then, missiles disconnect from each other and are guided individually by the typical pure proportional navigation guidance law with the same navigation gain to realise salvo attack in the second guidance phase. Finally, numerical simulations are carried out to clearly validate the theoretical results.


2018 ◽  
Vol 2018 ◽  
pp. 1-9
Author(s):  
Kaiwei Chen ◽  
Qunli Xia ◽  
Xiao Du ◽  
Yuemin Yao

This study analyzes the effects of disturbance rejection and radome error on the stability of guidance systems. First, disturbance rejection rate transfer function (DRRTF) models are established. Second, the Lyapunov stability of a proportional navigation guidance system is proposed. The passivity theorems are introduced for the analysis of the Lyapunov stability of the nonlinear time-varying system. Finally, the influence of the different DRRTF models on the stability of the guidance system is analyzed by mathematical simulations. The simulation results indicate that the stable boundary of the guidance system varies and its tolerance to disturbance differs significantly for various disturbance torque types and radome slope.


Author(s):  
Runle Du ◽  
Xinguang Zou ◽  
Di Zhou ◽  
Jiaqi Liu

This paper addresses a pursuer tracking problem where the pursuer's acceleration is given by a proportional navigation (PN) guidance law with a time-varying navigation ratio which varies with the relative range between the pursuer and its target. Based on a motion model that exactly describes the relative motion and the PN guidance law, a novel filter for tracking such a pursuer is designed using interactive multiple model (IMM) algorithm and unscented Kalman filtering (UKF) technique. This filter is able to accurately estimate the relative range, relative velocity, and the acceleration of pursuer even if the pursuer adopts a PN guidance law with time-varying navigation ratio. The proposed tracking method is evaluated in extensive Monte Carlo simulations. It is shown that accurate estimation results have been obtained, and the model probabilities in the IMM UKF filter are consistent with real situations.


2020 ◽  
Vol 10 (18) ◽  
pp. 6567
Author(s):  
Daseon Hong ◽  
Minjeong Kim ◽  
Sungsu Park

Reinforcement learning is generating considerable interest in terms of building guidance law and solving optimization problems that were previously difficult to solve. Since reinforcement learning-based guidance laws often show better robustness than a previously optimized algorithm, several studies have been carried out on the subject. This paper presents a new approach to training missile guidance law by reinforcement learning and introducing some notable characteristics. The novel missile guidance law shows better robustness to the controller-model compared to the proportional navigation guidance. The neural network in this paper has identical inputs with proportional navigation guidance, which makes the comparison fair, distinguishing it from other research. The proposed guidance law will be compared to the proportional navigation guidance, which is widely known as quasi-optimal of missile guidance law. Our work aims to find effective missile training methods through reinforcement learning, and how better the new method is. Additionally, with the derived policy, we contemplated which is better, and in which circumstances it is better. A novel methodology for the training will be proposed first, and the performance comparison results will be continued therefrom.


2019 ◽  
Vol 2019 ◽  
pp. 1-14
Author(s):  
Sijiang Chang ◽  
Shengfu Chen

In a bid to take advantage of natural characteristics of the proportional navigation guidance (PNG) in practical engineering, the PNG-based impact time control guidance (ITCG) continues to be a popular alternative for achieving the desired impact time of a missile. For most such ITCG, the performance is dependent on the accuracy of the time-to-go estimation. Along the lines of the development of PNG-based ITCG in earlier studies, a nonsingular ITCG is proposed on the basis of nonlinear formulations. It is demonstrated that, by theoretical analysis and numerical simulation, this proposed ITCG is shown to be advantageous in certain circumstances. By deriving a novel additional acceleration command, the proposed law is of lower dependence on time-to-go estimate and is capable of eliminating some singularities, leading to wider adjustable range of the desired impact time and better adaptability to more conditions. This research is expected to be supplementary to those presented in the current research literature.


Sign in / Sign up

Export Citation Format

Share Document