Learning control of finite Markov chains with an explicit trade-off between estimation and control

IEEE Transactions on Systems Man and Cybernetics, 1988, Vol 18 (5), pp. 677-684, doi:10.1109/21.21595

Author(s): M. Sato, K. Abe, H. Takeda

Keyword(s): Markov Chains, Learning Control, Trade Off, Finite Markov Chains, Estimation And Control

Observer and control design in partially observable finite Markov chains

2019, Vol 110, pp. 108587, doi:10.1016/j.automatica.2019.108587

Author(s): Julio B. Clempner, Alexander S. Poznyak

Keyword(s): Markov Chains, Control Design, Finite Markov Chains, Partially Observable

Data-Driven Simulation Model for Quality-Induced Rework Cost Estimation and Control Using Absorbing Markov Chains

Journal of Construction Engineering and Management, 2018, Vol 144 (8), pp. 04018078, doi:10.1061/(asce)co.1943-7862.0001534

Author(s): Wenying Ji, Simaan M. AbouRizk

Keyword(s): Markov Chains, Simulation Model, Cost Estimation, Data Driven, Absorbing Markov Chains, Estimation And Control

Self-Learning Control of Finite Markov Chains

2018, doi:10.1201/9781482273274

Author(s): A.S. Poznyak, Kaddour Najim, E. Gomez-Ramirez

Keyword(s): Markov Chains, Learning Control, Finite Markov Chains

Learning control of finite Markov chains with periodically varying transition probabilities

1990, doi:10.1109/cdc.1990.203852

Author(s): M. Sato, H. Takeda

Keyword(s): Markov Chains, Transition Probabilities, Learning Control, Finite Markov Chains

Estimation and control in Markov chains

Advances in Applied Probability, 1974, Vol 6 (1), pp. 40-60, doi:10.2307/1426206

Author(s): P. Mandl

Keyword(s): Markov Chains, Asymptotic Properties, Control Policy, Contrast Method, The Central Limit Theorem, Large Numbers, Estimation And Control, And Control, Stationary Control, Optimal Stationary Control

We consider a finite controlled Markov chain whose description depends on an unknown parameter a, and investigate the following control policy. To each value of a, an optimal stationary control is associated. The parameter a is estimated recurrently from the trajectory by the minimum contrast method, and the optimal stationary control corresponding to the current estimate is used. We present asymptotic properties of the estimate and of the criterion function. They follow from the law of large numbers and from the central limit theorem for controlled Markov chains, derived with the aid of martingales.
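
The policy described in this abstract (estimate the parameter from the trajectory, then apply the stationary control that is optimal for the current estimate) can be illustrated with a small sketch. Everything concrete below is an assumption made for illustration and is not taken from the paper: the two-state model in `transition`, the reward equal to the state index, the finite candidate grid `CANDIDATES`, and all function names. The negative log-likelihood is used as a stand-in contrast function; the paper's minimum contrast method is more general.

```python
import itertools
import math
import random

STATES = (0, 1)
ACTIONS = (0, 1)
CANDIDATES = (0.2, 0.8)  # hypothetical finite grid of values for the unknown parameter a


def transition(a, x, u):
    """Hypothetical model: the row P_a(. | x, u) of a two-state chain.

    Action 0 keeps the current state with probability a, action 1 with 1 - a.
    """
    p_stay = a if u == 0 else 1.0 - a
    row = [0.0, 0.0]
    row[x] = p_stay
    row[1 - x] = 1.0 - p_stay
    return row


def optimal_policy(a):
    """Best stationary policy for parameter a, found by enumeration.

    Assumed reward is r(x) = x, so the long-run average reward of a policy
    equals the stationary probability of state 1 (closed form for two states).
    """
    best, best_gain = None, -1.0
    for policy in itertools.product(ACTIONS, repeat=len(STATES)):
        p01 = transition(a, 0, policy[0])[1]
        p10 = transition(a, 1, policy[1])[0]
        gain = p01 / (p01 + p10)
        if gain > best_gain:
            best, best_gain = policy, gain
    return best


def contrast(a, counts):
    """Stand-in contrast: negative log-likelihood of the observed transitions."""
    return -sum(n * math.log(transition(a, x, u)[y])
                for (x, u, y), n in counts.items())


def run(true_a, steps, seed=0):
    """Simulate the chain while re-estimating a after every transition."""
    rng = random.Random(seed)
    counts = {}
    x, estimate = 0, CANDIDATES[0]
    for _ in range(steps):
        u = optimal_policy(estimate)[x]  # control that is optimal for the estimate
        y = 0 if rng.random() < transition(true_a, x, u)[0] else 1
        counts[(x, u, y)] = counts.get((x, u, y), 0) + 1
        estimate = min(CANDIDATES, key=lambda a: contrast(a, counts))
        x = y
    return estimate, optimal_policy(estimate)


if __name__ == "__main__":
    print(run(true_a=0.8, steps=500))
```

In this toy model every action produces transitions whose probabilities depend on a, so the estimate can be corrected no matter which control is applied. That property is built into the example and should not be read as a condition stated in the abstract.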

Jointly optimal quantization, estimation, and control of hidden Markov chains

42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475), 2004, doi:10.1109/cdc.2003.1272714

Author(s): J.S. Baras, Xiaobo Tan, Wei Xi

Keyword(s): Markov Chains, Hidden Markov, Hidden Markov Chains, Optimal Quantization, Estimation And Control

Jointly Optimal Quantization, Estimation, and Control of Hidden Markov Chains

2003, doi:10.21236/ada439765

Author(s): John S. Baras, Xiaobo Tan, Wei Xi

Keyword(s): Markov Chains, Hidden Markov, Hidden Markov Chains, Optimal Quantization, Estimation And Control

Learning control of finite Markov chains with unknown transition probabilities

IEEE Transactions on Automatic Control, 1982, Vol 27 (2), pp. 502-505, doi:10.1109/tac.1982.1102893

Author(s): M. Sato, K. Abe, H. Takeda

Keyword(s): Markov Chains, Transition Probabilities, Learning Control, Finite Markov Chains

Estimation and Control of Markov Chains with Cyclically Changing Unknown Probabilities

Transactions of the Society of Instrument and Control Engineers, 1989, Vol 25 (8), pp. 860-866, doi:10.9746/sicetr1965.25.860

Author(s): Mitsuo SATO, Hiroshi TAKEDA, Tomomi IWASAKI

Keyword(s): Markov Chains, Estimation And Control

Self-learning control of finite Markov chains, by A. S. Poznyak, K. Najim and E. Gomez-Ramirez, Marcel Dekker, Inc., New York, 2000, 298pp, ISBN: 0-8247-9429-X

International Journal of Adaptive Control and Signal Processing, 2003, Vol 17 (10), pp. 801-803, doi:10.1002/acs.782

Author(s): Daniel W. Repperger

Keyword(s): New York, Markov Chains, Learning Control, Finite Markov Chains
