Creating Affective Autonomous Characters Using Planning in Partially Observable Stochastic Domains

AbstractRecent advances in genomic selection (GS) have demonstrated the importance of not only the accuracy of genomic prediction but also the intelligence of selection strategies. The look ahead selection algorithm, for example, has been found to significantly outperform the widely used truncation selection approach in terms of genetic gain, thanks to its strategy of selecting breeding parents that may not necessarily be elite themselves but have the best chance of producing elite progeny in the future. This paper presents the look ahead trace back algorithm as a new variant of the look ahead approach, which introduces several improvements to further accelerate genetic gain especially under imperfect genomic prediction. Perhaps an even more significant contribution of this paper is the design of opaque simulators for evaluating the performance of GS algorithms. These simulators are partially observable, explicitly capture both additive and non-additive genetic effects, and simulate uncertain recombination events more realistically. In contrast, most existing GS simulation settings are transparent, either explicitly or implicitly allowing the GS algorithm to exploit certain critical information that may not be possible in actual breeding programs. Comprehensive computational experiments were carried out using a maize data set to compare a variety of GS algorithms under four simulators with different levels of opacity. These results reveal how differently a same GS algorithm would interact with different simulators, suggesting the need for continued research in the design of more realistic simulators. As long as GS algorithms continue to be trained in silico rather than in planta, the best way to avoid disappointing discrepancy between their simulated and actual performances may be to make the simulator as akin to the complex and opaque nature as possible.

Download Full-text

Benefits of combining dimensional attention and working memory for partially observable reinforcement learning problems

Proceedings of the 2021 ACM Southeast Conference ◽

10.1145/3409334.3452072 ◽

2021 ◽

Author(s):

Ngozi Omatu ◽

Joshua L. Phillips

Keyword(s):

Working Memory ◽

Reinforcement Learning ◽

Learning Problems ◽

Partially Observable

Download Full-text

Secure Control in Partially Observable Environments to Satisfy LTL Specifications

IEEE Transactions on Automatic Control ◽

10.1109/tac.2020.3039484 ◽

2020 ◽

pp. 1-1

Author(s):

Bhaskar Ramasubramanian ◽

Luyao Niu ◽

Andrew Clark ◽

Linda Bushnell ◽

Radha Poovendran

Keyword(s):

Secure Control ◽

Partially Observable

Download Full-text

Optimal adaptive inspection and maintenance for redundant systems

Proceedings of the Institution of Mechanical Engineers Part O Journal of Risk and Reliability ◽

10.1177/1748006x211020151 ◽

2021 ◽

pp. 1748006X2110201

Author(s):

Chaochao Lin ◽

Matteo Pozzi

Keyword(s):

Engineering Systems ◽

Discounted Cost ◽

Markov Decision ◽

Inspection And Maintenance ◽

And Performance ◽

Partially Observable ◽

Series Systems ◽

Selection Of ◽

Redundant Systems

Optimal exploration of engineering systems can be guided by the principle of Value of Information (VoI), which accounts for the topological important of components, their reliability and the management costs. For series systems, in most cases higher inspection priority should be given to unreliable components. For redundant systems such as parallel systems, analysis of one-shot decision problems shows that higher inspection priority should be given to more reliable components. This paper investigates the optimal exploration of redundant systems in long-term decision making with sequential inspection and repairing. When the expected, cumulated, discounted cost is considered, it may become more efficient to give higher inspection priority to less reliable components, in order to preserve system redundancy. To investigate this problem, we develop a Partially Observable Markov Decision Process (POMDP) framework for sequential inspection and maintenance of redundant systems, where the VoI analysis is embedded in the optimal selection of exploratory actions. We investigate the use of alternative approximate POMDP solvers for parallel and more general systems, compare their computation complexities and performance, and show how the inspection priorities depend on the economic discount factor, the degradation rate, the inspection precision, and the repair cost.

Download Full-text

Suboptimal Control of a Class of Stochastic System with Random, Partially Observable Parameters

10.23919/acc.1983.4788298 ◽

1983 ◽

Cited By ~ 1

Author(s):

M.H. Lee ◽

W.J. Kolodziej ◽

R.R. Mohler

Keyword(s):

Stochastic System ◽

Suboptimal Control ◽

Partially Observable

Download Full-text