reward criterion
Recently Published Documents


TOTAL DOCUMENTS

67
(FIVE YEARS 8)

H-INDEX

10
(FIVE YEARS 0)

Author(s):  
Rolando Cavazos-Cadena ◽  
Mario Cantú-Sifuentes ◽  
Imelda Cerda-Delgado

Kybernetika ◽  
2021 ◽  
pp. 474-492
Author(s):  
Rolando Cavazos-Cadena ◽  
Luis Rodríguez-Gutiérrez ◽  
Dulce María Sánchez-Guillermo

2021 ◽  
Vol 11 (3) ◽  
pp. 1098
Author(s):  
Norbert Kozłowski ◽  
Olgierd Unold

Initially, Anticipatory Classifier Systems (ACS) were designed to address both single and multistep decision problems. In the latter case, the objective was to maximize the total discounted rewards, usually based on Q-learning algorithms. Studies on other Learning Classifier Systems (LCS) revealed many real-world sequential decision problems where the preferred objective is the maximization of the average of successive rewards. This paper proposes a relevant modification toward the learning component, allowing us to address such problems. The modified system is called AACS2 (Averaged ACS2) and is tested on three multistep benchmark problems.


2020 ◽  
Vol 22 (02) ◽  
pp. 2040002
Author(s):  
Reinoud Joosten ◽  
Llea Samuel

Games with endogenous transition probabilities and endogenous stage payoffs (or ETP–ESP games for short) are stochastic games in which both the transition probabilities and the payoffs at any stage are continuous functions of the relative frequencies of all past action combinations chosen. We present methods to compute large sets of jointly-convergent pure-strategy rewards in two-player ETP–ESP games with communicating states under the limiting average reward criterion. Such sets are useful in determining feasible rewards in a game, and instrumental in obtaining the set of (Nash) equilibrium rewards.


2020 ◽  
Vol 47 (2) ◽  
pp. 225-253
Author(s):  
B. A. Escobedo-Trujillo ◽  
O. Hernández-Lerma ◽  
F. A. Alaffita-Hernández

2017 ◽  
Vol 62 (11) ◽  
pp. 6032-6038 ◽  
Author(s):  
Xiaofeng Jiang ◽  
Xiaodong Wang ◽  
Hongsheng Xi ◽  
Falin Liu

Sign in / Sign up

Export Citation Format

Share Document