scholarly journals Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Author(s):  
Baturay Saglam ◽  
Enes Duran ◽  
Dogan C. Cicek ◽  
Furkan B. Mutlu ◽  
Suleyman S. Kozat
Author(s):  
Beijia Wang ◽  
Hongliang Wang ◽  
Lei Wu ◽  
Liuliu Cai ◽  
Dawei Pi ◽  
...  

Vehicle mass estimation is the key technology to improve vehicle stability. However, the existing mass estimation accuracy is easily affected by the change of road gradient, and there are few studies on the mass estimation method of the light truck. Aiming at this problem, this paper uses sensors to measure road gradient and rear suspension deformation and proposes a sensor-based vehicle mass estimation algorithm. First, factors that affect the mass estimation are analyzed, road gradient error correction method and mass estimation error correction method are established. Besides, the suspension deformation is decoupled from the road gradient. Second, the mass estimation algorithm model was established in Matlab/Simulink platform and compared with the mass estimation iterative algorithm. Finally, the road test was carried out under various conditions, the results show that the proposed mass estimation algorithm is robust, and the accuracy of the mass estimation will not be affected by the sudden change of road gradient.


Quantum ◽  
2019 ◽  
Vol 3 ◽  
pp. 183 ◽  
Author(s):  
Philip Andreasson ◽  
Joel Johansson ◽  
Simon Liljestrand ◽  
Mats Granath

We implement a quantum error correction algorithm for bit-flip errors on the topological toric code using deep reinforcement learning. An action-value Q-function encodes the discounted value of moving a defect to a neighboring site on the square grid (the action) depending on the full set of defects on the torus (the syndrome or state). The Q-function is represented by a deep convolutional neural network. Using the translational invariance on the torus allows for viewing each defect from a central perspective which significantly simplifies the state space representation independently of the number of defect pairs. The training is done using experience replay, where data from the algorithm being played out is stored and used for mini-batch upgrade of the Q-network. We find performance which is close to, and for small error rates asymptotically equivalent to, that achieved by the Minimum Weight Perfect Matching algorithm for code distances up to d=7. Our results show that it is possible for a self-trained agent without supervision or support algorithms to find a decoding scheme that performs on par with hand-made algorithms, opening up for future machine engineered decoders for more general error models and error correcting codes.


2020 ◽  
Vol 384 (17) ◽  
pp. 126353 ◽  
Author(s):  
Laia Domingo Colomer ◽  
Michalis Skotiniotis ◽  
Ramon Muñoz-Tapia

Energy ◽  
2022 ◽  
Vol 239 ◽  
pp. 122128
Author(s):  
Rui Yang ◽  
Hui Liu ◽  
Nikolaos Nikitas ◽  
Zhu Duan ◽  
Yanfei Li ◽  
...  

2014 ◽  
Vol 8 (5) ◽  
pp. 277-282 ◽  
Author(s):  
Øystein Marøy ◽  
Lars Lydersen ◽  
Magne Gudmundsen ◽  
Johannes Skaar

2011 ◽  
Vol E94-B (3) ◽  
pp. 649-657 ◽  
Author(s):  
Shigeaki TAGASHIRA ◽  
Yuhei KANEKIYO ◽  
Yutaka ARAKAWA ◽  
Teruaki KITASUKA ◽  
Akira FUKUDA

2010 ◽  
Vol 27 (2) ◽  
pp. 201-234 ◽  
Author(s):  
Myung Hwan Seo

Asymptotic theory for the estimation of nonlinear vector error correction models that exhibit regime-specific short-run dynamics is developed. In particular, regimes are determined by the error correction term, and the transition between regimes is allowed to be discontinuous, as in, e.g., threshold cointegration. Several nonregular problems are resolved. First of all, consistency—square rootnconsistency for the cointegrating vectorβ—is established for the least squares estimation of this general class of models. Second, the convergence rates are obtained for the least squares of threshold cointegration, which aren3/2andnforβandγ, respectively, whereγdenotes the threshold parameter. This fast rate forβin itself is of practical relevance because, unlike in smooth transition models, the estimation error inβdoes not affect the estimation of short-run parameters. We also derive asymptotic distributions for the smoothed least squares estimation of threshold cointegration.


Sign in / Sign up

Export Citation Format

Share Document