Minimizing the learning loss in adaptive control of Markov chains under the weak accessibility condition

1991 ◽  
Vol 28 (4) ◽  
pp. 779-790 ◽  
Author(s):  
Rajeev Agrawal

We consider the adaptive control of Markov chains under the weak accessibility condition with a view to minimizing the learning loss. A certainty equivalence control with a forcing scheme is constructed. We use a stationary randomized control scheme for forcing and compute a maximum likelihood estimate of the unknown parameter from the resulting observations. We obtain an exponential upper bound on the rate of decay of the probability of error. This allows us to choose the rate of forcing appropriately, whereby we achieve a o(f(n) log n) learning loss for any function as .

1991 ◽  
Vol 28 (04) ◽  
pp. 779-790 ◽  
Author(s):  
Rajeev Agrawal

We consider the adaptive control of Markov chains under the weak accessibility condition with a view to minimizing the learning loss. A certainty equivalence control with a forcing scheme is constructed. We use a stationary randomized control scheme for forcing and compute a maximum likelihood estimate of the unknown parameter from the resulting observations. We obtain an exponential upper bound on the rate of decay of the probability of error. This allows us to choose the rate of forcing appropriately, whereby we achieve a o(f(n) log n) learning loss for any function as .


Author(s):  
Vinodhini M.

The objective of this paper is to develop a Direct Model Reference Adaptive Control (DMRAC) algorithm for a MIMO process by extending the MIT rule adopted for a SISO system. The controller thus developed is implemented on Laboratory interacting coupled tank process through simulation. This can be regarded as the relevant process control in petrol and chemical industries. These industries involve controlling the liquid level and the flow rate in the presence of nonlinearity and disturbance which justifies the use of adaptive techniques such as DMRAC control scheme. For this purpose, mathematical models are obtained for each of the input-output combinations using white box approach and the respective controllers are developed. A detailed analysis on the performance of the chosen process with these controllers is carried out. Simulation studies reveal the effectiveness of proposed controller for multivariable process that exhibits nonlinear behaviour.


Sign in / Sign up

Export Citation Format

Share Document