continuous state space
Recently Published Documents

TOTAL DOCUMENTS: 122 (five years: 19)
H-INDEX: 15 (five years: 2)
2021 · Vol 3 (6)
Author(s): Ogbonnaya Anicho, Philip B. Charlesworth, Gurvinder S. Baicher, Atulya K. Nagar

This work analyses the performance of Reinforcement Learning (RL) versus Swarm Intelligence (SI) for coordinating multiple unmanned High Altitude Platform Stations (HAPS) for communications area coverage. It builds upon previous work that examined various elements of both algorithms. The main aim of this paper is to address the continuous state-space challenge by using partitioning to manage the high-dimensionality problem. This enabled a comparison of the classical cases of both RL and SI, establishing a baseline for future comparisons of improved versions. In previous work, SI was observed to perform better across various key performance indicators. However, even after tuning parameters and empirically choosing a suitable partitioning ratio for the RL state space, the SI algorithm still maintained superior coordination capability, achieving higher mean overall user coverage (about 20% better than the RL algorithm) in addition to faster convergence. Although the RL technique showed better average peak user coverage, its unpredictable coverage dips were a key weakness, making SI the more suitable algorithm within the context of this work.
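The state-space partitioning the abstract refers to can be illustrated with a minimal sketch: equal-width bins map each continuous state dimension to a discrete index so that a tabular RL method can be applied. The bin counts, ranges, and helper names below are illustrative assumptions, not the authors' code.

```python
def make_partition(low, high, n_bins):
    """Precompute equal-width bin edges for one continuous state dimension."""
    width = (high - low) / n_bins
    return [low + i * width for i in range(1, n_bins)]

def discretize(state, edges_per_dim):
    """Map a continuous state vector to a tuple of bin indices,
    usable as a key into a tabular Q-function."""
    idx = []
    for value, edges in zip(state, edges_per_dim):
        idx.append(sum(1 for e in edges if value >= e))
    return tuple(idx)

# Two-dimensional state in [0, 1] x [0, 1], 4 bins per dimension.
edges = [make_partition(0.0, 1.0, 4), make_partition(0.0, 1.0, 4)]
```

The partitioning ratio (bins per dimension) trades resolution against table size, which is why the abstract notes it had to be chosen empirically.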


2020
Author(s): Caleb Ki, Jonathan Terhorst

In statistical genetics, the sequentially Markov coalescent (SMC) is an important framework for approximating the distribution of genetic variation data under complex evolutionary models. Methods based on SMC are widely used in genetics and evolutionary biology, with significant applications to genotype phasing and imputation, recombination rate estimation, and inferring population history. SMC allows for likelihood-based inference using hidden Markov models (HMMs), where the latent variable represents a genealogy. Because genealogies are continuous, while HMMs are discrete, SMC requires discretizing the space of trees in a way that is complicated and can lead to bias. In this work, we propose a method that circumvents this requirement, enabling SMC-based inference to be performed in the natural setting of a continuous state space. We derive fast, exact methods for frequentist and Bayesian inference using SMC. Compared to existing methods, ours requires minimal user intervention or parameter tuning, avoids numerical optimization and expectation-maximization (EM), and is faster and more accurate.
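The discretization the authors circumvent exists because the standard HMM forward recursion sums over a finite state set. A minimal sketch of that discrete forward algorithm (toy two-state parameters, not the SMC model itself) makes the issue concrete:

```python
import math

def forward_loglik(obs, init, trans, emit):
    """Discrete HMM forward algorithm. The sums below run over a finite
    state set, which is why continuous latent spaces (such as genealogies)
    are usually discretized before HMM inference can be applied."""
    n = len(init)
    # Initial step: prior times emission probability of the first symbol.
    alpha = [init[s] * emit[s][obs[0]] for s in range(n)]
    for o in obs[1:]:
        # Recursion: propagate through the transition matrix, then emit.
        alpha = [emit[s][o] * sum(alpha[r] * trans[r][s] for r in range(n))
                 for s in range(n)]
    return math.log(sum(alpha))
```

Replacing the finite sums with integrals over a continuous state space is what requires the specialized machinery the abstract describes.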


2020 · Vol 2020 · pp. 1-9
Author(s): Danning Yu, Kun Ni, Yunlong Liu

While deep reinforcement learning (DRL) has achieved great success in some large domains, most related algorithms assume that the state of the underlying system is fully observable. However, many real-world problems are only partially observable. For systems with continuous observations, most related algorithms, e.g., the deep Q-network (DQN) and the deep recurrent Q-network (DRQN), use observation histories to represent states; however, this is often computationally expensive and ignores the information carried by actions. Predictive state representations (PSRs) offer a powerful framework for modelling partially observable dynamical systems with discrete or continuous state spaces, representing the latent state using completely observable actions and observations. In this paper, we present a PSR model-based DQN approach that combines the strengths of the PSR model and DQN planning. We use a recurrent network to establish the recurrent PSR model, which can fully learn the dynamics of a partially observable environment with continuous observations. The model is then used for the state representation and state update of DQN, so that DQN no longer relies on a fixed number of history observations or a recurrent neural network (RNN) to represent states in partially observable environments. The strong performance of the proposed approach is demonstrated on a set of robotic control tasks from OpenAI Gym by comparison with the memory-based DRQN and the state-of-the-art recurrent predictive state policy (RPSP) networks. Source code is available at https://github.com/RPSR-DQN/paper-code.git.
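The fixed-length observation-history representation that DQN-style methods rely on in partially observable settings (and that the PSR model is meant to replace) can be sketched as follows; the class name and padding convention are illustrative assumptions, not the paper's code:

```python
from collections import deque

class HistoryState:
    """Fixed-length window of past (action, observation) pairs used as the
    'state' fed to a Q-network under partial observability. Note the fixed
    window both bounds memory and discards anything older than `length`
    steps, which is the limitation the abstract points at."""
    def __init__(self, length, pad=(0, 0.0)):
        self.window = deque([pad] * length, maxlen=length)

    def update(self, action, observation):
        """Append the newest (action, observation) pair, dropping the
        oldest, and return the window as a hashable tuple."""
        self.window.append((action, observation))
        return tuple(self.window)
```

A learned recurrent state, by contrast, compresses the entire history into a fixed-size vector rather than truncating it.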


Mathematics · 2020 · Vol 8 (4) · pp. 622
Author(s): Lizbeth Naranjo, Luz Judith R. Esparza, Carlos J. Pérez

A Bayesian approach was developed, tested, and applied to model ordinal response data in monotone non-decreasing processes with measurement errors. An inhomogeneous hidden Markov model with a continuous state space was considered to incorporate measurement errors in the categorical response while preserving the non-decreasing patterns. The computational difficulties were avoided by including latent variables that allowed an efficient Markov chain Monte Carlo method to be implemented. A simulation-based analysis was carried out to validate the approach, and the proposed approach was applied to analyze aortic aneurysm progression data.
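A minimal data-generating sketch of the setting described (a monotone non-decreasing latent process whose ordinal category is observed with measurement error); the increment distribution, cutpoints, and noise level below are illustrative assumptions, not the authors' model:

```python
import random

def simulate(n_steps, cutpoints, noise_sd=0.3, seed=1):
    """Simulate a monotone non-decreasing latent process and ordinal
    responses observed with measurement error (hypothetical generator)."""
    rng = random.Random(seed)
    latent = 0.0
    categories = []
    for _ in range(n_steps):
        latent += abs(rng.gauss(0.0, 0.5))        # non-decreasing increments
        noisy = latent + rng.gauss(0.0, noise_sd)  # measurement error
        # Ordinal category = number of cutpoints the noisy value exceeds.
        categories.append(sum(1 for c in cutpoints if noisy >= c))
    return categories
```

The measurement error is what makes the observed categories occasionally dip even though the underlying process never decreases, which is the pattern the hidden Markov model is built to handle.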

