Persistent Homology Analysis of RNA

AbstractTopological data analysis has been recently used to extract meaningful information frombiomolecules. Here we introduce the application of persistent homology, a topological data analysis tool, for computing persistent features (loops) of the RNA folding space. The scaffold of the RNA folding space is a complex graph from which the global features are extracted by completing the graph to a simplicial complex via the notion of clique and Vietoris-Rips complexes. The resulting simplicial complexes are characterised in terms of topological invariants, such as the number of holes in any dimension, i.e. Betti numbers. Our approach discovers persistent structural features, which are the set of smallest components to which the RNA folding space can be reduced. Thanks to this discovery, which in terms of data mining can be considered as a space dimension reduction, it is possible to extract a new insight that is crucial for understanding the mechanism of the RNA folding towards the optimal secondary structure. This structure is composed by the components discovered during the reduction step of the RNA folding space and is characterized by minimum free energy.

Download Full-text

Using Topological Data Analysis (TDA) and Persistent Homology to Analyze the Stock Markets in Singapore and Taiwan

Frontiers in Physics ◽

10.3389/fphy.2021.572216 ◽

2021 ◽

Vol 9 ◽

Author(s):

Peter Tsung-Wen Yen ◽

Siew Ann Cheong

Keyword(s):

Data Analysis ◽

Stock Markets ◽

Persistent Homology ◽

Betti Numbers ◽

Stock Index ◽

Topological Data Analysis ◽

Series Data ◽

Topological Features ◽

Market Crashes ◽

Topological Data

In recent years, persistent homology (PH) and topological data analysis (TDA) have gained increasing attention in the fields of shape recognition, image analysis, data analysis, machine learning, computer vision, computational biology, brain functional networks, financial networks, haze detection, etc. In this article, we will focus on stock markets and demonstrate how TDA can be useful in this regard. We first explain signatures that can be detected using TDA, for three toy models of topological changes. We then showed how to go beyond network concepts like nodes (0-simplex) and links (1-simplex), and the standard minimal spanning tree or planar maximally filtered graph picture of the cross correlations in stock markets, to work with faces (2-simplex) or any k-dim simplex in TDA. By scanning through a full range of correlation thresholds in a procedure called filtration, we were able to examine robust topological features (i.e. less susceptible to random noise) in higher dimensions. To demonstrate the advantages of TDA, we collected time-series data from the Straits Times Index and Taiwan Capitalization Weighted Stock Index (TAIEX), and then computed barcodes, persistence diagrams, persistent entropy, the bottleneck distance, Betti numbers, and Euler characteristic. We found that during the periods of market crashes, the homology groups become less persistent as we vary the characteristic correlation. For both markets, we found consistent signatures associated with market crashes in the Betti numbers, Euler characteristics, and persistent entropy, in agreement with our theoretical expectations.

Download Full-text

Persistent Homology: Application To Monitoring Hydraulic Fracturing

Russian Digital Libraries Journal ◽

10.26907/1562-5419-2020-23-6-1192-1212 ◽

2020 ◽

Vol 23 (6) ◽

pp. 1192-1212

Author(s):

Kirill Yurevich Erofeev ◽

Mansur Tagirovich Ziiatdinov ◽

Evgenii Vladimirovich Mokshin

Keyword(s):

Data Analysis ◽

Hydraulic Fracturing ◽

Topological Structure ◽

Prior Information ◽

Persistent Homology ◽

Topological Data Analysis ◽

Analysis Tool ◽

Data Analysis Tool ◽

Natural Way ◽

Topological Data

Persistent homology is a topological data analysis tool which is reflecting changes in topological structure of data along its scale. Application of persistent homology to monitoring hydraulic fracturing which is allowing researchers to consider prior information in a natural way is given in the article

Download Full-text

Classification of apatite structures via topological data analysis: a framework for a ‘Materials Barcode’ representation of structure maps

Scientific Reports ◽

10.1038/s41598-021-90070-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Scott Broderick ◽

Ruhil Dongol ◽

Tianmu Zhang ◽

Krishna Rajan

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Crystal Chemistry ◽

Persistent Homology ◽

Hierarchical Classification ◽

Topological Data Analysis ◽

Learning Tool ◽

Coordination Polyhedra ◽

Machine Learning Tool ◽

Topological Data

AbstractThis paper introduces the use of topological data analysis (TDA) as an unsupervised machine learning tool to uncover classification criteria in complex inorganic crystal chemistries. Using the apatite chemistry as a template, we track through the use of persistent homology the topological connectivity of input crystal chemistry descriptors on defining similarity between different stoichiometries of apatites. It is shown that TDA automatically identifies a hierarchical classification scheme within apatites based on the commonality of the number of discrete coordination polyhedra that constitute the structural building units common among the compounds. This information is presented in the form of a visualization scheme of a barcode of homology classifications, where the persistence of similarity between compounds is tracked. Unlike traditional perspectives of structure maps, this new “Materials Barcode” schema serves as an automated exploratory machine learning tool that can uncover structural associations from crystal chemistry databases, as well as to achieve a more nuanced insight into what defines similarity among homologous compounds.

Download Full-text

Topological data analysis for true step detection in periodic piecewise constant signals

Proceedings of The Royal Society A Mathematical Physical and Engineering Sciences ◽

10.1098/rspa.2018.0027 ◽

2018 ◽

Vol 474 (2218) ◽

pp. 20180027 ◽

Cited By ~ 4

Author(s):

Firas A. Khasawneh ◽

Elizabeth Munch

Keyword(s):

Data Analysis ◽

Persistent Homology ◽

Topological Data Analysis ◽

Future Research ◽

Accurate Identification ◽

Piecewise Constant ◽

Higher Dimensional ◽

Powerful Approach ◽

Hadamard Transforms ◽

Topological Data

This paper introduces a simple yet powerful approach based on topological data analysis for detecting true steps in a periodic, piecewise constant (PWC) signal. The signal is a two-state square wave with randomly varying in-between-pulse spacing, subject to spurious steps at the rising or falling edges which we call digital ringing. We use persistent homology to derive mathematical guarantees for the resulting change detection which enables accurate identification and counting of the true pulses. The approach is tested using both synthetic and experimental data obtained using an engine lathe instrumented with a laser tachometer. The described algorithm enables accurate and automatic calculations of the spindle speed without any choice of parameters. The results are compared with the frequency and sequency methods of the Fourier and Walsh–Hadamard transforms, respectively. Both our approach and the Fourier analysis yield comparable results for pulses with regular spacing and digital ringing while the latter causes large errors using the Walsh–Hadamard method. Further, the described approach significantly outperforms the frequency/sequency analyses when the spacing between the peaks is varied. We discuss generalizing the approach to higher dimensional PWC signals, although using this extension remains an interesting question for future research.

Download Full-text

On Topological Data Analysis for SHM: An Introduction to Persistent Homology

10.1007/978-3-030-76004-5_20 ◽

2021 ◽

pp. 169-184

Author(s):

T. Gowdridge ◽

N. Dervilis ◽

K. Worden

Keyword(s):

Data Analysis ◽

Persistent Homology ◽

Topological Data Analysis ◽

Topological Data

Download Full-text

Topological Data Analysis as a Morphometric Method: Using Persistent Homology to Demarcate a Leaf Morphospace

Frontiers in Plant Science ◽

10.3389/fpls.2018.00553 ◽

2018 ◽

Vol 9 ◽

Cited By ~ 18

Author(s):

Mao Li ◽

Hong An ◽

Ruthie Angelovici ◽

Clement Bagaza ◽

Albert Batushansky ◽

...

Keyword(s):

Data Analysis ◽

Persistent Homology ◽

Topological Data Analysis ◽

Morphometric Method ◽

Topological Data

Download Full-text

Topological data analysis and diagnostics of compressible magnetohydrodynamic turbulence

Journal of Plasma Physics ◽

10.1017/s0022377818000752 ◽

2018 ◽

Vol 84 (4) ◽

Cited By ~ 2

Author(s):

I. Makarenko ◽

P. Bushby ◽

A. Fletcher ◽

R. Henderson ◽

N. Makarenko ◽

...

Keyword(s):

Data Analysis ◽

Random Fields ◽

Large Scale ◽

Betti Numbers ◽

Mean Field ◽

Topological Data Analysis ◽

Topological Measures ◽

Magnetohydrodynamic Simulations ◽

Random Flows ◽

Topological Data

The predictions of mean-field electrodynamics can now be probed using direct numerical simulations of random flows and magnetic fields. When modelling astrophysical magnetohydrodynamics, it is important to verify that such simulations are in agreement with observations. One of the main challenges in this area is to identify robust quantitative measures to compare structures found in simulations with those inferred from astrophysical observations. A similar challenge is to compare quantitatively results from different simulations. Topological data analysis offers a range of techniques, including the Betti numbers and persistence diagrams, that can be used to facilitate such a comparison. After describing these tools, we first apply them to synthetic random fields and demonstrate that, when the data are standardized in a straightforward manner, some topological measures are insensitive to either large-scale trends or the resolution of the data. Focusing upon one particular astrophysical example, we apply topological data analysis to H iobservations of the turbulent interstellar medium (ISM) in the Milky Way and to recent magnetohydrodynamic simulations of the random, strongly compressible ISM. We stress that these topological techniques are generic and could be applied to any complex, multi-dimensional random field.

Download Full-text

Persistent homology as a new method of the assessment of heart rate variability

PLoS ONE ◽

10.1371/journal.pone.0253851 ◽

2021 ◽

Vol 16 (7) ◽

pp. e0253851

Author(s):

Grzegorz Graff ◽

Beata Graff ◽

Paweł Pilarczyk ◽

Grzegorz Jabłoński ◽

Dariusz Gąsecki ◽

...

Keyword(s):

Heart Rate ◽

Heart Rate Variability ◽

Data Analysis ◽

Persistent Homology ◽

Assessment Tools ◽

Topological Data Analysis ◽

Topological Descriptors ◽

Time Interval ◽

Analysis Tool ◽

Frequency Domain Methods

Heart rate variability (hrv) is a physiological phenomenon of the variation in the length of the time interval between consecutive heartbeats. In many cases it could be an indicator of the development of pathological states. The classical approach to the analysis of hrv includes time domain methods and frequency domain methods. However, attempts are still being made to define new and more effective hrv assessment tools. Persistent homology is a novel data analysis tool developed in the recent decades that is rooted at algebraic topology. The Topological Data Analysis (TDA) approach focuses on examining the shape of the data in terms of connectedness and holes, and has recently proved to be very effective in various fields of research. In this paper we propose the use of persistent homology to the hrv analysis. We recall selected topological descriptors used in the literature and we introduce some new topological descriptors that reflect the specificity of hrv, and we discuss their relation to the standard hrv measures. In particular, we show that this novel approach provides a collection of indices that might be at least as useful as the classical parameters in differentiating between series of beat-to-beat intervals (RR-intervals) in healthy subjects and patients suffering from a stroke episode.

Download Full-text

Flow Estimation Only from Image Data, based on Persistent Homology

10.21203/rs.3.rs-330050/v1 ◽

2021 ◽

Author(s):

Anna Suzuki ◽

Miyuki Miyazawa ◽

James Minto ◽

Takeshi Tsuji ◽

Ippei Obayashi ◽

...

Keyword(s):

Data Analysis ◽

Persistent Homology ◽

Image Data ◽

Fracture Network ◽

Topological Data Analysis ◽

Geometric Information ◽

Flow Estimation ◽

Flow Phenomena ◽

Network Patterns ◽

Topological Data

Abstract Topological data analysis is an emerging concept of data analysis for characterizing shapes. A state-of-the-art tool in topological data analysis is persistent homology, which is expected to summarize quantified topological and geometric features. Although persistent homology is useful for revealing the topological and geometric information, it is difficult to interpret the parameters of persistent homology themselves and difficult to directly relate the parameters to physical properties. In this study, we focus on connectivity and apertures of flow channels detected from persistent homology analysis. We propose a method to estimate permeability in fracture networks from parameters of persistent homology. Synthetic 3D fracture network patterns and their direct flow simulations are used for the validation. The results suggest that the persistent homology can estimate fluid flow in fracture network based on the image data. This method can easily derive the flow phenomena based on the information of the structure.

Download Full-text

Persistent homology approach distinguishes potential pattern between “Early” and “Not Early” hepatic decompensation groups using MRI modalities

Current Directions in Biomedical Engineering ◽

10.1515/cdbme-2021-2124 ◽

2021 ◽

Vol 7 (2) ◽

pp. 488-491

Author(s):

Yashbir Singh ◽

William Jons ◽

Gian Marco Conte ◽

Jaidip Jagtap ◽

Kuan Zhang ◽

...

Keyword(s):

Data Analysis ◽

Liver Failure ◽

Algebraic Topology ◽

Persistent Homology ◽

Topological Data Analysis ◽

Topological Representation ◽

Hepatic Decompensation ◽

Topological Data

Abstract Primary sclerosis cholangitis (PSC) predisposes individuals to liver failure, but it is challenging for radiologists examining radiologic images to predict which patients with PSC will ultimately develop liver failure. Motivated by algebraic topology, a topological data analysis - inspired framework was adopted in the study of the imaging pattern between the “Early Decompensation” and “Not Early” groups. The results demonstrate that the proposed methodology discriminates “Early Decompensation” and “Not Early” groups. Our study is the first attempt to provide a topological representation-based method into early hepatic decompensation and not early groups.

Download Full-text