Constrained stochastic optimal control with learned importance sampling: A path integral approach

2021, pp. 027836492110478
Author(s): Jan Carius, René Ranftl, Farbod Farshidian, Marco Hutter

Modern robotic systems are expected to operate robustly in partially unknown environments. This article proposes an algorithm capable of controlling a wide range of high-dimensional robotic systems in such challenging scenarios. Our method is based on the path integral formulation of stochastic optimal control, which we extend with constraint-handling capabilities. Under our control law, the optimal input is inferred from a set of stochastic rollouts of the system dynamics. These rollouts are simulated by a physics engine, placing minimal restrictions on the types of systems and environments that can be modeled. Although sampling-based algorithms are typically not suitable for online control, we demonstrate in this work how importance sampling and constraints can be used to effectively curb the sampling complexity and enable real-time control applications. Furthermore, the path integral framework provides a natural way of incorporating existing control architectures as ancillary controllers for shaping the sampling distribution. Our results reveal that even in cases where the ancillary controller would fail, our stochastic control algorithm provides an additional safety and robustness layer. Moreover, in the absence of an existing ancillary controller, our method can be used to train a parametrized importance sampling policy using data from the stochastic rollouts. The algorithm may thereby bootstrap itself by learning an importance sampling policy offline and then refining it to unseen environments during online control. We validate our results on three robotic systems, including hardware experiments on a quadrupedal robot.
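The core idea described in this abstract, inferring an input from weighted stochastic rollouts, can be illustrated with a minimal sketch. It is not the authors' constrained algorithm: there is no physics engine, no constraint handling, and no learned importance sampler. The 1-D integrator, the quadratic cost, and every name and parameter below (`rollout_cost`, `path_integral_step`, `temperature`, `sigma`) are assumptions chosen for illustration. The `nominal` input sequence loosely plays the role of an ancillary controller that shapes the sampling distribution.

```python
import math
import random


def path_integral_weights(costs, temperature):
    """Softmin weights w_i proportional to exp(-S_i / temperature), summing to 1."""
    min_cost = min(costs)  # subtract the minimum for numerical stability
    unnormalized = [math.exp(-(c - min_cost) / temperature) for c in costs]
    total = sum(unnormalized)
    return [w / total for w in unnormalized]


def rollout_cost(x0, inputs, dt, target):
    """Simulate a hypothetical 1-D integrator x' = u and sum a quadratic cost."""
    x, cost = x0, 0.0
    for u in inputs:
        x += u * dt
        cost += ((x - target) ** 2 + 0.01 * u ** 2) * dt
    return cost


def path_integral_step(x0, nominal, n_rollouts=256, sigma=1.0,
                       temperature=0.05, dt=0.05, target=1.0, rng=None):
    """Return an updated input sequence as the cost-weighted mean of noisy rollouts."""
    rng = rng or random.Random(0)
    # Sample input sequences around the nominal (assumed ancillary) inputs.
    perturbed = [[u + rng.gauss(0.0, sigma) for u in nominal]
                 for _ in range(n_rollouts)]
    costs = [rollout_cost(x0, seq, dt, target) for seq in perturbed]
    weights = path_integral_weights(costs, temperature)
    # Low-cost rollouts dominate the weighted average at every time step.
    return [sum(w * seq[t] for w, seq in zip(weights, perturbed))
            for t in range(len(nominal))]


if __name__ == "__main__":
    nominal = [0.0] * 20
    updated = path_integral_step(0.0, nominal)
    print(rollout_cost(0.0, nominal, 0.05, 1.0),
          rollout_cost(0.0, updated, 0.05, 1.0))
```

In a receding-horizon setting, one would typically apply only the first element of the updated sequence, shift the remainder forward, and resample. A lower `temperature` concentrates the weights on the best rollouts, which reduces the effective number of samples.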

1997, Vol 11 (04), pp. 129-138
Author(s): V. Sa-Yakanit, V. D. Lakhno, Klaus Haß

The generalized path integral approach is applied to calculate the ground state energy and the effective mass of an electron-plasmon interacting system for a wide range of densities. It is shown that in the self-consistent approximation an abrupt transition between the weak coupling and the strong coupling region of interaction exists. The transition occurs at low electron densities according to a value of 418 for r_s, when Wigner crystallization is possible. For densities of real metals, the electron bandwidth is calculated and a comparison with experimental results is given.


1995, Vol 389
Author(s): K. C. Saraswat, Y. Chen, L. Degertekin, B. T. Khuri-Yakub

A highly flexible Rapid Thermal Multiprocessing (RTM) reactor is described. This flexibility is the result of several innovations: a lamp system, an acoustic thermometer and a real-time control system. The new lamp has been optimally designed through the use of a “virtual reactor” methodology to obtain the best possible wafer temperature uniformity. It consists of multiple concentric rings composed of light bulbs with horizontal filaments. Each ring is independently and dynamically controlled, providing better control over the spatial and temporal optical flux profile and resulting in excellent temperature uniformity over a wide range of process conditions. An acoustic thermometer non-invasively allows complete wafer temperature tomography under all process conditions, a critically important measurement never obtained before. For real-time equipment and process control, a model-based multivariable control system has been developed. Extensive integration of computers and related technology for specification, communication, execution, monitoring, control, and diagnosis demonstrates the programmability of the RTM.

