Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

Vehicle mass estimation is the key technology to improve vehicle stability. However, the existing mass estimation accuracy is easily affected by the change of road gradient, and there are few studies on the mass estimation method of the light truck. Aiming at this problem, this paper uses sensors to measure road gradient and rear suspension deformation and proposes a sensor-based vehicle mass estimation algorithm. First, factors that affect the mass estimation are analyzed, road gradient error correction method and mass estimation error correction method are established. Besides, the suspension deformation is decoupled from the road gradient. Second, the mass estimation algorithm model was established in Matlab/Simulink platform and compared with the mass estimation iterative algorithm. Finally, the road test was carried out under various conditions, the results show that the proposed mass estimation algorithm is robust, and the accuracy of the mass estimation will not be affected by the sudden change of road gradient.

Download Full-text

Quantum error correction for the toric code using deep reinforcement learning

Quantum ◽

10.22331/q-2019-09-02-183 ◽

2019 ◽

Vol 3 ◽

pp. 183 ◽

Cited By ~ 17

Author(s):

Philip Andreasson ◽

Joel Johansson ◽

Simon Liljestrand ◽

Mats Granath

Keyword(s):

Reinforcement Learning ◽

Error Correction ◽

Minimum Weight ◽

Error Correcting Codes ◽

Error Rates ◽

Quantum Error Correction ◽

Translational Invariance ◽

Quantum Error ◽

Q Function ◽

Toric Code

We implement a quantum error correction algorithm for bit-flip errors on the topological toric code using deep reinforcement learning. An action-value Q-function encodes the discounted value of moving a defect to a neighboring site on the square grid (the action) depending on the full set of defects on the torus (the syndrome or state). The Q-function is represented by a deep convolutional neural network. Using the translational invariance on the torus allows for viewing each defect from a central perspective which significantly simplifies the state space representation independently of the number of defect pairs. The training is done using experience replay, where data from the algorithm being played out is stored and used for mini-batch upgrade of the Q-network. We find performance which is close to, and for small error rates asymptotically equivalent to, that achieved by the Minimum Weight Perfect Matching algorithm for code distances up to d=7. Our results show that it is possible for a self-trained agent without supervision or support algorithms to find a decoding scheme that performs on par with hand-made algorithms, opening up for future machine engineered decoders for more general error models and error correcting codes.

Download Full-text

Reinforcement learning for optimal error correction of toric codes

Physics Letters A ◽

10.1016/j.physleta.2020.126353 ◽

2020 ◽

Vol 384 (17) ◽

pp. 126353 ◽

Cited By ~ 2

Author(s):

Laia Domingo Colomer ◽

Michalis Skotiniotis ◽

Ramon Muñoz-Tapia

Keyword(s):

Reinforcement Learning ◽

Error Correction ◽

Optimal Error ◽

Toric Codes

Download Full-text

Short-term wind speed forecasting using deep reinforcement learning with improved multiple error correction approach

Energy ◽

10.1016/j.energy.2021.122128 ◽

2022 ◽

Vol 239 ◽

pp. 122128

Author(s):

Rui Yang ◽

Hui Liu ◽

Nikolaos Nikitas ◽

Zhu Duan ◽

Yanfei Li ◽

...

Keyword(s):

Reinforcement Learning ◽

Wind Speed ◽

Error Correction ◽

Short Term ◽

Wind Speed Forecasting ◽

Multiple Error

Download Full-text

Error estimation, error correction and verification in quantum key distribution

IET Information Security ◽

10.1049/iet-ifs.2012.0333 ◽

2014 ◽

Vol 8 (5) ◽

pp. 277-282 ◽

Cited By ~ 2

Author(s):

Øystein Marøy ◽

Lars Lydersen ◽

Magne Gudmundsen ◽

Johannes Skaar

Keyword(s):

Error Correction ◽

Error Estimation ◽

Quantum Key Distribution ◽

Estimation Error ◽

Key Distribution

Download Full-text

Reinforcement Learning Algorithm for Partially Observable Environment Considering State Estimation Error Prediction

10.1007/978-981-16-5207-3_52 ◽

2021 ◽

pp. 615-628

Author(s):

Yuuya Watabe ◽

Takeshi Shibuya

Keyword(s):

Reinforcement Learning ◽

State Estimation ◽

Learning Algorithm ◽

Estimation Error ◽

Error Prediction ◽

Partially Observable ◽

Reinforcement Learning Algorithm

Download Full-text

Collaborative Filtering for Position Estimation Error Correction in WLAN Positioning Systems

IEICE Transactions on Communications ◽

10.1587/transcom.e94.b.649 ◽

2011 ◽

Vol E94-B (3) ◽

pp. 649-657 ◽

Cited By ~ 6

Author(s):

Shigeaki TAGASHIRA ◽

Yuhei KANEKIYO ◽

Yutaka ARAKAWA ◽

Teruaki KITASUKA ◽

Akira FUKUDA

Keyword(s):

Error Correction ◽

Collaborative Filtering ◽

Estimation Error ◽

Position Estimation ◽

Positioning Systems

Download Full-text

ESTIMATION OF NONLINEAR ERROR CORRECTION MODELS

Econometric Theory ◽

10.1017/s026646661000023x ◽

2010 ◽

Vol 27 (2) ◽

pp. 201-234 ◽

Cited By ~ 22

Author(s):

Myung Hwan Seo

Keyword(s):

Least Squares ◽

Error Correction ◽

Convergence Rates ◽

Estimation Error ◽

Least Squares Estimation ◽

Asymptotic Distributions ◽

Smooth Transition ◽

Threshold Cointegration ◽

Error Correction Models ◽

Short Run

Asymptotic theory for the estimation of nonlinear vector error correction models that exhibit regime-specific short-run dynamics is developed. In particular, regimes are determined by the error correction term, and the transition between regimes is allowed to be discontinuous, as in, e.g., threshold cointegration. Several nonregular problems are resolved. First of all, consistency—square rootnconsistency for the cointegrating vectorβ—is established for the least squares estimation of this general class of models. Second, the convergence rates are obtained for the least squares of threshold cointegration, which aren3/2andnforβandγ, respectively, whereγdenotes the threshold parameter. This fast rate forβin itself is of practical relevance because, unlike in smooth transition models, the estimation error inβdoes not affect the estimation of short-run parameters. We also derive asymptotic distributions for the smoothed least squares estimation of threshold cointegration.

Download Full-text

Object Shape Error Correction using Deep Reinforcement Learning for Multi-Station Assembly Systems

10.1109/indin45523.2021.9557359 ◽

2021 ◽

Author(s):

Sumit Sinha ◽

Pasquale Franciosa ◽

Dariusz Ceglarek

Keyword(s):

Reinforcement Learning ◽

Error Correction ◽

Assembly Systems ◽

Shape Error ◽

Object Shape

Download Full-text