predictive state representations Latest Research Papers

Predictive state representations (PSRs) are models of controlled non-Markov observation sequences which exhibit the same generative process governing POMDP observations without relying on an underlying latent state. In that respect, a PSR is indistinguishable from the corresponding POMDP. However, PSRs notoriously ignore the notion of rewards, which undermines the general utility of PSR models for control, planning, or reinforcement learning. Therefore, we describe a sufficient and necessary accuracy condition which determines whether a PSR is able to accurately model POMDP rewards, we show that rewards can be approximated even when the accuracy condition is not satisfied, and we find that a non-trivial number of POMDPs taken from a well-known third-party repository do not satisfy the accuracy condition. We propose reward-predictive state representations (R-PSRs), a generalization of PSRs which accurately models both observations and rewards, and develop value iteration for R-PSRs. We show that there is a mismatch between optimal POMDP policies and the optimal PSR policies derived from approximate rewards. On the other hand, optimal R-PSR policies perfectly match optimal POMDP policies, reconfirming R-PSRs as accurate state-less generative models of observations and rewards.

Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments

Frontiers in Neurorobotics ◽

10.3389/fnbot.2020.578675 ◽

2020 ◽

Vol 14 ◽

Author(s):

Jeffery Dick ◽

Pawel Ladosz ◽

Eseoghene Ben-Iwhiwhu ◽

Hideyasu Shimadzu ◽

Peter Kinnell ◽

...

Keyword(s):

Predictive Models ◽

Probability Distributions ◽

Statistical Tests ◽

Decision Processes ◽

The Novel ◽

Predictive State Representations ◽

Markov Decision ◽

Novel Method ◽

Two Phases ◽

Partially Observable

The ability of an agent to detect changes in an environment is key to successful adaptation. This ability involves at least two phases: learning a model of an environment, and detecting that a change is likely to have occurred when this model is no longer accurate. This task is particularly challenging in partially observable environments, such as those modeled with partially observable Markov decision processes (POMDPs). Some predictive learners are able to infer the state from observations and thus perform better with partial observability. Predictive state representations (PSRs) and neural networks are two such tools that can be trained to predict the probabilities of future observations. However, most such existing methods focus primarily on static problems in which only one environment is learned. In this paper, we propose an algorithm that uses statistical tests to estimate the probability of different predictive models to fit the current environment. We exploit the underlying probability distributions of predictive models to provide a fast and explainable method to assess and justify the model's beliefs about the current environment. Crucially, by doing so, the method can label incoming data as fitting different models, and thus can continuously train separate models in different environments. This new method is shown to prevent catastrophic forgetting when new environments, or tasks, are encountered. The method can also be of use when AI-informed decisions require justifications because its beliefs are based on statistical evidence from observations. We empirically demonstrate the benefit of the novel method with simulations in a set of POMDP environments.

Basis selection in spectral learning of predictive state representations

Neurocomputing ◽

10.1016/j.neucom.2018.04.079 ◽

2018 ◽

Vol 310 ◽

pp. 183-189 ◽

Cited By ~ 1

Author(s):

Chunqing Huang ◽

Yisheng An ◽

Sun Zhou ◽

Zhezheng Hong ◽

Yunlong Liu

Keyword(s):

Spectral Learning ◽

Predictive State Representations

Group sparse optimization for learning predictive state representations

Information Sciences ◽

10.1016/j.ins.2017.05.023 ◽

2017 ◽

Vol 412-413 ◽

pp. 1-13 ◽

Cited By ~ 1

Author(s):

Yifeng Zeng ◽

Biyang Ma ◽

Bilian Chen ◽

Jing Tang ◽

Mengda He

Keyword(s):

Sparse Optimization ◽

Predictive State Representations

Novel Approach for the Recognition and Prediction of Multi-Function Radar Behaviours Based on Predictive State Representations

Sensors ◽

10.3390/s17030632 ◽

2017 ◽

Vol 17 (3) ◽

pp. 632 ◽

Cited By ~ 2

Author(s):

Jian Ou ◽

Yongguang Chen ◽

Feng Zhao ◽

Jin Liu ◽

Shunping Xiao

Keyword(s):

Novel Approach ◽

Predictive State Representations

Method for operating mode identification of multi‐function radars based on predictive state representations

IET Radar Sonar & Navigation ◽

10.1049/iet-rsn.2016.0182 ◽

2017 ◽

Vol 11 (3) ◽

pp. 426-433 ◽

Cited By ~ 2

Author(s):

Jian Ou ◽

Yongguang Chen ◽

Feng Zhao ◽

Jin Liu ◽

Shunping Xiao

Keyword(s):

Operating Mode ◽

Mode Identification ◽

Predictive State Representations

Learning of scanning strategies for electronic support using predictive state representations

2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP) ◽

10.1109/mlsp.2015.7324365 ◽

2015 ◽

Author(s):

Hadrien Claude ◽

Cyrille Enderli ◽

Jean-Francois Grandin ◽

Olivier Pietquin

Keyword(s):

Predictive State Representations ◽

Scanning Strategies ◽

Electronic Support

Learning Predictive State Representations for planning

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros.2015.7353855 ◽

2015 ◽

Author(s):

Johannes A. Stork ◽

Carl Henrik Ek ◽

Danica Kragic

Keyword(s):

Predictive State Representations

Closing the learning-planning loop with predictive state representations

The International Journal of Robotics Research ◽

10.1177/0278364911404092 ◽

2011 ◽

Vol 30 (7) ◽

pp. 954-966 ◽

Cited By ~ 40

Author(s):

Byron Boots ◽

Sajid M Siddiqi ◽

Geoffrey J Gordon

Keyword(s):

Predictive State Representations

Closing the Learning-Planning Loop with Predictive State Representations

Robotics ◽

10.7551/mitpress/9123.003.0040 ◽

2011 ◽

Keyword(s):

Predictive State Representations

predictive state representations
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Reconciling Rewards with Predictive State Representations

Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments

Basis selection in spectral learning of predictive state representations

Group sparse optimization for learning predictive state representations

Novel Approach for the Recognition and Prediction of Multi-Function Radar Behaviours Based on Predictive State Representations

Method for operating mode identification of multi‐function radars based on predictive state representations

Learning of scanning strategies for electronic support using predictive state representations

Learning Predictive State Representations for planning

Closing the learning-planning loop with predictive state representations

Closing the Learning-Planning Loop with Predictive State Representations

Export Citation Format

predictive state representationsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Reconciling Rewards with Predictive State Representations

Detecting Changes and Avoiding Catastrophic Forgetting in Dynamic Partially Observable Environments

Basis selection in spectral learning of predictive state representations

Group sparse optimization for learning predictive state representations

Novel Approach for the Recognition and Prediction of Multi-Function Radar Behaviours Based on Predictive State Representations

Method for operating mode identification of multi‐function radars based on predictive state representations

Learning of scanning strategies for electronic support using predictive state representations

Learning Predictive State Representations for planning

Closing the learning-planning loop with predictive state representations

Closing the Learning-Planning Loop with Predictive State Representations

predictive state representations
Recently Published Documents