Mining sequences with exceptional transition behaviour of varying order using quality measures based on information-theoretic scoring functions

AbstractDiscrete Markov chains are frequently used to analyse transition behaviour in sequential data. Here, the transition probabilities can be estimated using varying order Markov chains, where order k specifies the length of the sequence history that is used to model these probabilities. Generally, such a model is fitted to the entire dataset, but in practice it is likely that some heterogeneity in the data exists and that some sequences would be better modelled with alternative parameter values, or with a Markov chain of a different order. We use the framework of Exceptional Model Mining (EMM) to discover these exceptionally behaving sequences. In particular, we propose an EMM model class that allows for discovering subgroups with transition behaviour of varying order. To that end, we propose three new quality measures based on information-theoretic scoring functions. Our findings from controlled experiments show that all three quality measures find exceptional transition behaviour of varying order and are reasonably sensitive. The quality measure based on Akaike’s Information Criterion is most robust for the number of observations. We furthermore add to existing work by seeking for subgroups of sequences, as opposite to subgroups of transitions. Since we use sequence-level descriptive attributes, we form subgroups of entire sequences, which is practically relevant in situations where you want to identify the originators of exceptional sequences, such as patients. We show this relevance by analysing sequences of blood glucose values of adult persons with diabetes type 2. In the experiments, we find subgroups of patients based on age and glycated haemoglobin (HbA1c), a measure known to correlate with average blood glucose values. Clinicians and domain experts confirmed the transition behaviour as estimated by the fitted Markov chain models.

Download Full-text

A Bayesian model for binary Markov chains

International Journal of Mathematics and Mathematical Sciences ◽

10.1155/s0161171204202319 ◽

2004 ◽

Vol 2004 (8) ◽

pp. 421-429 ◽

Cited By ~ 2

Author(s):

Souad Assoudou ◽

Belkheir Essebbar

Keyword(s):

Monte Carlo ◽

Markov Chain ◽

Markov Chains ◽

Bayesian Estimation ◽

Bayesian Model ◽

Transition Probabilities ◽

Simulated Data ◽

Bayesian Estimator ◽

Jeffreys Prior ◽

Data Set

This note is concerned with Bayesian estimation of the transition probabilities of a binary Markov chain observed from heterogeneous individuals. The model is founded on the Jeffreys' prior which allows for transition probabilities to be correlated. The Bayesian estimator is approximated by means of Monte Carlo Markov chain (MCMC) techniques. The performance of the Bayesian estimates is illustrated by analyzing a small simulated data set.

Download Full-text

Infinitely divisible random transition probabilities with application to dependent Markov chains

The Journal of the Australian Mathematical Society Series B Applied Mathematics ◽

10.1017/s0334270000004215 ◽

1984 ◽

Vol 25 (4) ◽

pp. 463-472

Author(s):

Peter L. Chesson

Keyword(s):

Markov Chain ◽

Markov Chains ◽

White Noise ◽

Transition Probabilities ◽

Transition Probability ◽

Transition Rates ◽

Infinitely Divisible ◽

Random Transition ◽

Transition Probability Matrices ◽

Probability Matrices

AbstractRandom transition probability matrices with stationary independent factors define “white noise” environment processes for Markov chains. Two examples are considered in detail. Such environment processes can be used to construct several Markov chains which are dependent, have the same transition probabilities and are jointly a Markov chain. Transition rates for such processes are evaluated. These results have application to the study of animal movements.

Download Full-text

Perturbation theory and finite Markov chains

Journal of Applied Probability ◽

10.2307/3212261 ◽

1968 ◽

Vol 5 (2) ◽

pp. 401-413 ◽

Cited By ~ 211

Author(s):

Paul J. Schweitzer

Keyword(s):

Perturbation Theory ◽

Markov Chain ◽

Markov Chains ◽

Stationary Distribution ◽

Fundamental Matrix ◽

Transition Probabilities ◽

Semi Group ◽

Partial Derivatives ◽

Finite Markov Chains ◽

Derivatives Of

A perturbation formalism is presented which shows how the stationary distribution and fundamental matrix of a Markov chain containing a single irreducible set of states change as the transition probabilities vary. Expressions are given for the partial derivatives of the stationary distribution and fundamental matrix with respect to the transition probabilities. Semi-group properties of the generators of transformations from one Markov chain to another are investigated. It is shown that a perturbation formalism exists in the multiple subchain case if and only if the change in the transition probabilities does not alter the number of, or intermix the various subchains. The formalism is presented when this condition is satisfied.

Download Full-text

Single-shelf library-type Markov chains with infinitely many books

Journal of Applied Probability ◽

10.1017/s0021900200104978 ◽

1977 ◽

Vol 14 (02) ◽

pp. 298-308 ◽

Cited By ~ 2

Author(s):

Peter R. Nelson

Keyword(s):

Markov Chain ◽

Markov Chains ◽

Transition Probabilities ◽

Natural Order ◽

Order Book ◽

Exit Boundary ◽

One Step ◽

Step Transition ◽

The Right ◽

Positive Recurrent

In a single-shelf library having infinitely many books B 1 , B 2 , …, the probability of selecting each book is assumed known. Books are removed one at a time and replaced in position k prior to the next removal. Books are moved either to the right or the left as is necessary to vacate position k. Those arrangements of books where after some finite position all the books are in natural order (book i occupies position i) are considered as states in an infinite Markov chain. When k > 1, we show that the chain can never be positive recurrent. When k = 1, we find the limits of ratios of one-step transition probabilities; and when k = 1 and the chain is transient, we find the Martin exit boundary.

Download Full-text

Hitting Time and Inverse Problems for Markov Chains

Journal of Applied Probability ◽

10.1017/s0021900200004617 ◽

2008 ◽

Vol 45 (03) ◽

pp. 640-649

Author(s):

Victor de la Peña ◽

Henryk Gzyl ◽

Patrick McDonald

Keyword(s):

Markov Chain ◽

Inverse Problems ◽

Markov Chains ◽

Transition Probabilities ◽

Hitting Time ◽

First Hitting Time ◽

Joint Distributions ◽

Finite Set

Let W n be a simple Markov chain on the integers. Suppose that X n is a simple Markov chain on the integers whose transition probabilities coincide with those of W n off a finite set. We prove that there is an M > 0 such that the Markov chain W n and the joint distributions of the first hitting time and first hitting place of X n started at the origin for the sets {-M, M} and {-(M + 1), (M + 1)} algorithmically determine the transition probabilities of X n .

Download Full-text

The robustness of positive recurrence and recurrence of Markov chains under perturbations of the transition probabilities

Journal of Applied Probability ◽

10.1017/s0021900200048701 ◽

1975 ◽

Vol 12 (04) ◽

pp. 744-752 ◽

Cited By ~ 3

Author(s):

Richard L. Tweedie

Keyword(s):

Markov Chain ◽

Markov Chains ◽

Finite Number ◽

Transition Probabilities ◽

General State ◽

Transition Matrices ◽

Positive Recurrence ◽

Markov Chain Models ◽

Countable Space ◽

General State Space

In many Markov chain models, the immediate characteristic of importance is the positive recurrence of the chain. In this note we investigate whether positivity, and also recurrence, are robust properties of Markov chains when the transition laws are perturbed. The chains we consider are on a fairly general state space : when specialised to a countable space, our results are essentially that, if the transition matrices of two irreducible chains coincide on all but a finite number of columns, then positivity of one implies positivity of both; whilst if they coincide on all but a finite number of rows and columns, recurrence of one implies recurrence of both. Examples are given to show that these results (and their general analogues) cannot in general be strengthened.

Download Full-text

Vulnerability of networks of interacting Markov chains

Philosophical Transactions of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rsta.2010.0036 ◽

2010 ◽

Vol 368 (1918) ◽

pp. 2205-2219 ◽

Cited By ~ 4

Author(s):

L. Kocarev ◽

N. Zlatanov ◽

D. Trajanov

Keyword(s):

Markov Chain ◽

Markov Chains ◽

Transition Probabilities ◽

Power Grid ◽

Current Status ◽

Topological Structures ◽

Influence Model ◽

The Status ◽

The Eu

The concept of vulnerability is introduced for a model of random, dynamical interactions on networks. In this model, known as the influence model, the nodes are arranged in an arbitrary network, while the evolution of the status at a node is according to an internal Markov chain, but with transition probabilities that depend not only on the current status of that node but also on the statuses of the neighbouring nodes. Vulnerability is treated analytically and numerically for several networks with different topological structures, as well as for two real networks—the network of infrastructures and the EU power grid—identifying the most vulnerable nodes of these networks.

Download Full-text

A MARKOV CHAIN CHOICE PROBLEM

Probability in the Engineering and Informational Sciences ◽

10.1017/s0269964812000290 ◽

2012 ◽

Vol 27 (1) ◽

pp. 53-55

Author(s):

Sheldon M. Ross

Keyword(s):

Markov Chain ◽

Markov Chains ◽

Transition Probabilities ◽

Choice Problem ◽

Prior Probabilities ◽

Myopic Policy ◽

Initial States ◽

State 1

Consider two independent Markov chains having states 0, 1, and identical transition probabilities. At each stage one of the chains is observed, and a reward equal to the observed state is earned. Assuming prior probabilities on the initial states of the chains it is shown that the myopic policy that always chooses to observe the chain most likely to be in state 1 stochastically maximizes the sequence of rewards earned in each period.

Download Full-text

On the Embedding Problem for Discrete-Time Markov Chains

Journal of Applied Probability ◽

10.1017/s002190020001370x ◽

2013 ◽

Vol 50 (04) ◽

pp. 918-930 ◽

Cited By ~ 4

Author(s):

Marie-Anne Guerry

Keyword(s):

Markov Chain ◽

Markov Chains ◽

Discrete Time ◽

Transition Matrix ◽

Transition Probabilities ◽

Sufficient Conditions ◽

Embedding Problem ◽

Time Intervals ◽

Square Roots ◽

Probability Matrices

When a discrete-time homogenous Markov chain is observed at time intervals that correspond to its time unit, then the transition probabilities of the chain can be estimated using known maximum likelihood estimators. In this paper we consider a situation when a Markov chain is observed on time intervals with length equal to twice the time unit of the Markov chain. The issue then arises of characterizing probability matrices whose square root(s) are also probability matrices. This characterization is referred to in the literature as the embedding problem for discrete time Markov chains. The probability matrix which has probability root(s) is called embeddable. In this paper for two-state Markov chains, necessary and sufficient conditions for embeddability are formulated and the probability square roots of the transition matrix are presented in analytic form. In finding conditions for the existence of probability square roots for (k x k) transition matrices, properties of row-normalized matrices are examined. Besides the existence of probability square roots, the uniqueness of these solutions is discussed: In the case of nonuniqueness, a procedure is introduced to identify a transition matrix that takes into account the specificity of the concrete context. In the case of nonexistence of a probability root, the concept of an approximate probability root is introduced as a solution of an optimization problem related to approximate nonnegative matrix factorization.

Download Full-text

A Note on Approximating Mean Occupation Times of Continuous-Time Markov Chains

Probability in the Engineering and Informational Sciences ◽

10.1017/s0269964800000796 ◽

1988 ◽

Vol 2 (2) ◽

pp. 267-268

Author(s):

Sheldon M. Ross

Keyword(s):

Markov Chain ◽

Markov Chains ◽

Continuous Time ◽

Transition Probabilities ◽

Random Variables ◽

Continuous Time Markov Chain ◽

Occupation Times ◽

Continuous Time Markov Chains ◽

Exponential Random Variables

In [1] an approach to approximate the transition probabilities and mean occupation times of a continuous-time Markov chain is presented. For the chain under consideration, let Pij(t) and Tij(t) denote respectively the probability that it is in state j at time t, and the total time spent in j by time t, in both cases conditional on the chain starting in state i. Also, let Y1,…, Yn be independent exponential random variables each with rate λ = n/t, which are also independent of the Markov chain.

Download Full-text