scholarly journals A proof of the Caffarelli contraction theorem via entropic regularization

Author(s):  
Max Fathi ◽  
Nathael Gozlan ◽  
Maxime Prod’homme
2020 ◽  
Vol 39 (5) ◽  
pp. 7831-7841
Author(s):  
Nabanita Konwar

The aim of this paper is to define the notion of intuitionistic fuzzy b metric space (in short, IFbMS) along with some useful results. We establish some important Lemmas in order to study the Cauchy sequence in IFbMS. To further develop the work, we establish some fixed point theorems and study the existence of unique fixed point of some self mappings in IFbMS. We also develop the concept of Ćirić quasi-Contraction theorem in IFbMS. Examples are provided to validate the non-triviality of the results.


Author(s):  
Vipin Kumar ◽  
Muslim Malik

Abstract In this work, we investigate the controllability results of a fractional integro-differential equation with non-instantaneous impulses on time scales. Banach contraction theorem and the non-linear functional analysis have been used to establish these results. In support, a numerical example with simulation for different time scales is given to validate the obtained analytical outcomes.


Entropy ◽  
2019 ◽  
Vol 21 (7) ◽  
pp. 674
Author(s):  
Boris Belousov ◽  
Jan Peters

An optimal feedback controller for a given Markov decision process (MDP) can in principle be synthesized by value or policy iteration. However, if the system dynamics and the reward function are unknown, a learning agent must discover an optimal controller via direct interaction with the environment. Such interactive data gathering commonly leads to divergence towards dangerous or uninformative regions of the state space unless additional regularization measures are taken. Prior works proposed bounding the information loss measured by the Kullback–Leibler (KL) divergence at every policy improvement step to eliminate instability in the learning dynamics. In this paper, we consider a broader family of f-divergences, and more concretely α -divergences, which inherit the beneficial property of providing the policy improvement step in closed form at the same time yielding a corresponding dual objective for policy evaluation. Such entropic proximal policy optimization view gives a unified perspective on compatible actor-critic architectures. In particular, common least-squares value function estimation coupled with advantage-weighted maximum likelihood policy improvement is shown to correspond to the Pearson χ 2 -divergence penalty. Other actor-critic pairs arise for various choices of the penalty-generating function f. On a concrete instantiation of our framework with the α -divergence, we carry out asymptotic analysis of the solutions for different values of α and demonstrate the effects of the divergence function choice on common standard reinforcement learning problems.


1981 ◽  
Vol 23 (4) ◽  
pp. 1632-1638 ◽  
Author(s):  
Frank E. Harris ◽  
Bogumił Jeziorski ◽  
Hendrik J. Monkhorst

1995 ◽  
Vol 125 (5) ◽  
pp. 1085-1104 ◽  
Author(s):  
Volker Metz

Transition probabilities are calculated which make the construction of diffusions on finitely ramified fractals straightforward. In contrast to former approaches using Brouwer's Fixed Point Theorem, we consider an approximation procedure based on the iteration of a nonlinear mapL. Physically, this is done by ‘coarse-graining-renormalisation of finite electric resistor networks’. Mathematically, it is a convergence problem for quotients of Dirichlet forms on finite graphs. These graphs approximate finitely ramified fractals. The basic tool is a contraction theorem for the renormalisation mapLwhich allows the use of known results about nested fractals for non-nested (p.c.f. self-similar) ones. In general, the above contraction is not strict because several linear independent fixed points occur.


2020 ◽  
Vol 26 ◽  
pp. 103
Author(s):  
Simone Di Marino ◽  
Lénaïc Chizat

In this paper, we characterize a degenerate PDE as the gradient flow in the space of nonnegative measures endowed with an optimal transport-growth metric. The PDE of concern, of Hele-Shaw type, was introduced by Perthame et. al. as a mechanical model for tumor growth and the metric was introduced recently in several articles as the analogue of the Wasserstein metric for nonnegative measures. We show existence of solutions using minimizing movements and show uniqueness of solutions on convex domains by proving the Evolutional Variational Inequality. Our analysis does not require any regularity assumption on the initial condition. We also derive a numerical scheme based on the discretization of the gradient flow and the idea of entropic regularization. We assess the convergence of the scheme on explicit solutions. In doing this analysis, we prove several new properties of the optimal transport-growth metric, which generally have a known counterpart for the Wasserstein metric.


2013 ◽  
Vol 6 (1) ◽  
pp. 215-233
Author(s):  
T. J. Sullivan ◽  
◽  
M. Koslowski ◽  
F. Theil ◽  
Michael Ortiz ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document