Short-range template switching in great ape genomes explored using pair hidden Markov models

Many complex genomic rearrangements arise through template switch errors, which occur in DNA replication when there is a transient polymerase switch to an alternate template nearby in three-dimensional space. While typically investigated at kilobase-to-megabase scales, the genomic and evolutionary consequences of this mutational process are not well characterised at smaller scales, where they are often interpreted as clusters of independent substitutions, insertions and deletions. Here we present an improved statistical approach using pair hidden Markov models, and use it to detect and describe short-range template switches underlying clusters of mutations in the multi-way alignment of hominid genomes. Using robust statistics derived from evolutionary genomic simulations, we show that template switch events have been widespread in the evolution of the great apes’ genomes and provide a parsimonious explanation for the presence of many complex mutation clusters in their phylogenetic context. Larger-scale mechanisms of genome rearrangement are typically associated with structural features around breakpoints, and accordingly we show that atypical patterns of secondary structure formation and DNA bending are present at the initial template switch loci. Our methods improve on previous non-probabilistic approaches for computational detection of template switch mutations, allowing the statistical significance of events to be assessed. By specifying realistic evolutionary parameters based on the genomes and taxa involved, our methods can be readily adapted to other intra- or inter-species comparisons.

Download Full-text

QUESTION CLASSIFICATION USING PROFILE HIDDEN MARKOV MODELS

International Journal of Artificial Intelligence Tools ◽

10.1142/s0218213010000066 ◽

2010 ◽

Vol 19 (01) ◽

pp. 121-131 ◽

Cited By ~ 2

Author(s):

YAN PAN ◽

YONG TANG ◽

YE-MIN LUO ◽

LU-XIAN LIN ◽

GUI-BIN WU

Keyword(s):

Hidden Markov Models ◽

Question Answering ◽

Markov Models ◽

Hidden Markov ◽

Critical Role ◽

Structural Features ◽

Profile Hidden Markov Models ◽

Question Classification ◽

Question Answering Systems ◽

Selective Substitution

Recently, Question Answering has been a hot topic in the research of information retrieval. Question Classification plays a critical role in most Question Answering systems. In this paper, a new approach to classifying questions using Profile Hidden Markov Models (PHMMs) is proposed. The generalization strategies to extract the pattern instances of questions by selective substitution are discussed. Then the classification method with pattern instances' structural features is investigated. Experimental results show that the PHMM based question classifier can reach the accuracy of 92.2% and significantly outperforms most of the state-of-the-art systems.

Download Full-text

Statistical Significance of Probabilistic Sequence Alignment and Related Local Hidden Markov Models

Journal of Computational Biology ◽

10.1089/10665270152530845 ◽

2001 ◽

Vol 8 (3) ◽

pp. 249-282 ◽

Cited By ~ 35

Author(s):

Yi-Kuo Yu ◽

Terence Hwa

Keyword(s):

Hidden Markov Models ◽

Sequence Alignment ◽

Markov Models ◽

Hidden Markov ◽

Statistical Significance

Download Full-text

Faculty Opinions recommendation of Statistical significance of probabilistic sequence alignment and related local hidden Markov models.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.1003058.29555 ◽

2001 ◽

Author(s):

Stephen Altschul

Keyword(s):

Hidden Markov Models ◽

Sequence Alignment ◽

Markov Models ◽

Hidden Markov ◽

Statistical Significance

Download Full-text

Classification of EEG Single Trial Microstates Using Local Global Graphs and Discrete Hidden Markov Models

International Journal of Neural Systems ◽

10.1142/s0129065716500362 ◽

2016 ◽

Vol 26 (06) ◽

pp. 1650036 ◽

Cited By ~ 8

Author(s):

Kostas Michalopoulos ◽

Michalis Zervakis ◽

Marie-Pierre Deiber ◽

Nikolaos Bourbakis

Keyword(s):

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov ◽

Event Related Potentials ◽

Structural Features ◽

Distance Measures ◽

Syntactic Analysis ◽

Input Stimulus ◽

Eeg Topography ◽

Related Potentials

We present a novel synergistic methodology for the spatio-temporal analysis of single Electroencephalogram (EEG) trials. This new methodology is based on the novel synergy of Local Global Graph (LG graph) to characterize define the structural features of the EEG topography as a global descriptor for robust comparison of dominant topographies (microstates) and Hidden Markov Models (HMM) to model the topographic sequence in a unique way. In particular, the LG graph descriptor defines similarity and distance measures that can be successfully used for the difficult comparison of the extracted LG graphs in the presence of noise. In addition, hidden states represent periods of stationary distribution of topographies that constitute the equivalent of the microstates in the model. The transitions between the different microstates and the formed syntactic patterns can reveal differences in the processing of the input stimulus between different pathologies. We train the HMM model to learn the transitions between the different microstates and express the syntactic patterns that appear in the single trials in a compact and efficient way. We applied this methodology in single trials consisting of normal subjects and patients with Progressive Mild Cognitive Impairment (PMCI) to discriminate these two groups. The classification results show that this approach is capable to efficiently discriminate between control and Progressive MCI single trials. Results indicate that HMMs provide physiologically meaningful results that can be used in the syntactic analysis of Event Related Potentials.

Download Full-text

Estimating Personality Impression from Speech Record Using Hidden Markov Models

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.135.1517 ◽

2015 ◽

Vol 135 (12) ◽

pp. 1517-1523 ◽

Cited By ~ 1

Author(s):

Yicheng Jin ◽

Takuto Sakuma ◽

Shohei Kato ◽

Tsutomu Kunitachi

Keyword(s):

Hidden Markov Models ◽

Markov Models ◽

Hidden Markov

Download Full-text

Hidden Markov Processes

10.23943/princeton/9780691133157.001.0001 ◽

2014 ◽

Cited By ~ 2

Author(s):

M. Vidyasagar

Keyword(s):

Hidden Markov Models ◽

Markov Processes ◽

Viterbi Algorithm ◽

Markov Models ◽

Hidden Markov ◽

Local Alignment ◽

Biological Applications ◽

Standard Material ◽

Hidden Markov Processes ◽

Genomics And Proteomics

This book explores important aspects of Markov and hidden Markov processes and the applications of these ideas to various problems in computational biology. It starts from first principles, so that no previous knowledge of probability is necessary. However, the work is rigorous and mathematical, making it useful to engineers and mathematicians, even those not interested in biological applications. A range of exercises is provided, including drills to familiarize the reader with concepts and more advanced problems that require deep thinking about the theory. Biological applications are taken from post-genomic biology, especially genomics and proteomics. The topics examined include standard material such as the Perron–Frobenius theorem, transient and recurrent states, hitting probabilities and hitting times, maximum likelihood estimation, the Viterbi algorithm, and the Baum–Welch algorithm. The book contains discussions of extremely useful topics not usually seen at the basic level, such as ergodicity of Markov processes, Markov Chain Monte Carlo (MCMC), information theory, and large deviation theory for both i.i.d and Markov processes. It also presents state-of-the-art realization theory for hidden Markov models. Among biological applications, it offers an in-depth look at the BLAST (Basic Local Alignment Search Technique) algorithm, including a comprehensive explanation of the underlying theory. Other applications such as profile hidden Markov models are also explored.

Download Full-text