Solving Partially Observable Reinforcement Learning Problems with Recurrent Neural Networks

Recurrent neural networks (RNNs) have been widely used to deal with sequence learning problems. The input-dependent transition function, which folds new observations into hidden states to sequentially construct fixed-length representations of arbitrary-length sequences, plays a critical role in RNNs. Based on single space composition, transition functions in existing RNNs often have difficulty in capturing complicated long-range dependencies. In this paper, we introduce a new Multi-zone Unit (MZU) for RNNs. The key idea is to design a transition function that is capable of modeling multiple space composition. The MZU consists of three components: zone generation, zone composition, and zone aggregation. Experimental results on multiple datasets of the character-level language modeling task and the aspect-based sentiment analysis task demonstrate the superiority of the MZU.

Integrating recurrent neural networks and reinforcement learning for dynamic service composition

Future Generation Computer Systems ◽ 10.1016/j.future.2020.02.030 ◽ 2020 ◽ Vol 107 ◽ pp. 551-563 ◽ Cited By ~ 1

Author(s): Hongbing Wang ◽ Jiajie Li ◽ Qi Yu ◽ Tianjing Hong ◽ Jia Yan ◽ ...

Keyword(s): Neural Networks ◽ Reinforcement Learning ◽ Recurrent Neural Networks ◽ Service Composition ◽ Dynamic Service Composition

ANALYTICAL REVIEW OF MULTI-AGENT REINFORCEMENT LEARNING PROBLEMS

Vestnik komp iuternykh i informatsionnykh tekhnologii ◽ 10.14489/vkit.2020.06.pp.048-056 ◽ 2020 ◽ pp. 48-56

Author(s): Yu. V. Dubenko

Keyword(s): Reinforcement Learning ◽ Intelligent Agents ◽ Russian Language ◽ Learning Problems ◽ Multi Agent Systems ◽ Hierarchical Reinforcement Learning ◽ Collective Interaction ◽ Analytical Review ◽ Multi Agent ◽ Partially Observable

This paper addresses collective artificial intelligence: how groups of intelligent agents solve problems in external environments. Such environments may be fully or partially observable, deterministic or stochastic, static or dynamic, discrete or continuous. The paper identifies problems of collective interaction that arise when intelligent agents solve tasks requiring coordinated group action, e.g., exploring the territory of a complex infrastructure facility. It finds that reinforcement learning in multi-agent systems is poorly covered in the literature, especially in Russian-language publications. The article analyzes reinforcement learning, describes hierarchical reinforcement learning, and presents basic methods for implementing reinforcement learning. It introduces the concept of macro-actions performed by agents integrated into groups. It identifies the main problems of collective interaction among intelligent agents (calculating individual rewards for each agent; coordinating agents; applying macro-actions by agents integrated into groups; and exchanging experience generated by different agents while solving a collective problem). The model of multi-agent reinforcement learning is described in detail. The article describes the problems of this approach, building on existing solutions. The basic problems of multi-agent reinforcement learning are formulated in the conclusion.

A networked smart home system based on recurrent neural networks and reinforcement learning

Systems Science & Control Engineering ◽ 10.1080/21642583.2021.2001769 ◽ 2021 ◽ Vol 9 (1) ◽ pp. 775-783

Author(s): Zhongwang Li ◽ Bin Deng

Keyword(s): Neural Networks ◽ Reinforcement Learning ◽ Recurrent Neural Networks ◽ Smart Home

Reinforcement learning under incomplete perception using stochastic gradient ascent and recurrent neural networks

IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028) ◽ 10.1109/icsmc.1999.815598 ◽ 2003 ◽ Cited By ~ 1

Author(s): A. Onat ◽ N. Kosino ◽ M. Kuramitu ◽ H. Kita

Keyword(s): Neural Networks ◽ Reinforcement Learning ◽ Recurrent Neural Networks ◽ Stochastic Gradient ◽ Gradient Ascent ◽ Stochastic Gradient Ascent

A Monte Carlo EM Approach for Partially Observable Diffusion Processes: Theory and Applications to Neural Networks

Neural Computation ◽ 10.1162/08997660260028593 ◽ 2002 ◽ Vol 14 (7) ◽ pp. 1507-1544 ◽ Cited By ~ 14

Author(s): Javier R. Movellan ◽ Paul Mineiro ◽ R. J. Williams

Keyword(s): Neural Networks ◽ Monte Carlo ◽ Speech Recognition ◽ Recurrent Neural Networks ◽ Diffusion Processes ◽ Visual Speech ◽ Stochastic Version ◽ Inner Products ◽ Partially Observable ◽ Monte Carlo EM

We present a Monte Carlo approach for training partially observable diffusion processes. We apply the approach to diffusion networks, a stochastic version of continuous recurrent neural networks. The approach is aimed at learning probability distributions of continuous paths, not just expected values. Interestingly, the relevant activation statistics used by the learning rule presented here are inner products in the Hilbert space of square integrable functions. These inner products can be computed using Hebbian operations and do not require backpropagation of error signals. Moreover, standard kernel methods could potentially be applied to compute such inner products. We propose that the main reason that recurrent neural networks have not worked well in engineering applications (e.g., speech recognition) is that they implicitly rely on a very simplistic likelihood model. The diffusion network approach proposed here is much richer and may open new avenues for applications of recurrent neural networks. We present some analysis and simulations to support this view. Very encouraging results were obtained on a visual speech recognition task in which neural networks outperformed hidden Markov models.

Fuzzy inference-based reinforcement learning of dynamic recurrent neural networks

10.1109/sice.1997.624934 ◽ 2002

Author(s): Hyo-Byung Jun ◽ Dong-Wook Lee ◽ Dae-Joon Kim ◽ Kwee-Bo Sim

Keyword(s): Neural Networks ◽ Reinforcement Learning ◽ Recurrent Neural Networks ◽ Fuzzy Inference

Local stability conditions for discrete-time cascade locally recurrent neural networks

International Journal of Applied Mathematics and Computer Science ◽ 10.2478/v10006-010-0002-x ◽ 2010 ◽ Vol 20 (1) ◽ pp. 23-34 ◽ Cited By ~ 4

Author(s): Krzysztof Patan

Keyword(s): Neural Networks ◽ Discrete Time ◽ Recurrent Neural Networks ◽ Local Stability ◽ Learning Problems ◽ Stability Conditions ◽ Neuron Models ◽ Optimization Task ◽ Dynamic Type ◽ Locally Recurrent

The paper deals with a specific kind of discrete-time recurrent neural network designed with dynamic neuron models. Dynamics are reproduced within each single neuron, hence the network considered is locally recurrent and globally feedforward. A crucial problem with neural networks of the dynamic type is stability, as well as stabilization in learning problems. The paper formulates local stability conditions for the analysed class of neural networks using Lyapunov's first method. Moreover, a stabilization problem is defined and solved as a constrained optimization task. In order to tackle this problem, a gradient projection method is adopted. The efficiency and usefulness of the proposed approach are demonstrated through a number of experiments.
