Corrigendum to: Fast Estimation of Recombination Rates Using Topological Data Analysis

AbstractAccurate estimation of recombination rates is critical for studying the origins and maintenance of genetic diversity. Because the inference of recombination rates under a full evolutionary model is computationally expensive, an alternative approach using topological data analysis (TDA) has been proposed. Previous TDA methods used information contained solely in the first Betti number (β1)of the cloud of genomes, which relates to the number of loops that can be detected within a genealogy. While these methods are considerably less computationally intensive than current biological model-based methods, these explorations have proven difficult to connect to the theory of the underlying biological process of recombination, and consequently have unpredictable behavior under different perturbations of the data. We introduce a new topological feature with a natural connection to coalescent models, which we call ψ. We show that ψ and β1 are differentially affected by changes to the structure of the data and use them in conjunction to provide a robust, efficient, and accurate estimator of recombination rates, TREE. Compared to previous TDA methods, TREE more closely approximates of the results of commonly used model-based methods. These characteristics make TREE well suited as a first-pass estimator of recombination rate heterogeneity or hotspots throughout the genome. In addition, we present novel arguments relating β1 to population genetic models; our work justifies the use of topological statistics as summaries of distributions of genome sequences and describes a new, unintuitive relationship between topological summaries of distance and the footprint of recombination on genome sequences.

Download Full-text

Topological Data Analysis beyond Genomics

Topological Data Analysis for Genomics and Evolution ◽

10.1017/9781316671665.011 ◽

2019 ◽

pp. 423-442

Keyword(s):

Data Analysis ◽

Topological Data Analysis ◽

Topological Data

Download Full-text

Voronoi Graph Traversal in High Dimensions with Applications to Topological Data Analysis and Piecewise Linear Interpolation

Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining ◽

10.1145/3394486.3403266 ◽

2020 ◽

Author(s):

Vladislav Polianskii ◽

Florian T. Pokorny

Keyword(s):

Data Analysis ◽

Piecewise Linear ◽

Linear Interpolation ◽

Topological Data Analysis ◽

High Dimensions ◽

Graph Traversal ◽

Piecewise Linear Interpolation ◽

Voronoi Graph ◽

Topological Data

Download Full-text

Topological Data Analysis For Evaluating PDE-based Denoising Models

Journal of Physics Conference Series ◽

10.1088/1742-6596/1897/1/012006 ◽

2021 ◽

Vol 1897 (1) ◽

pp. 012006

Author(s):

Ahmed K. Al-Jaberi ◽

Ehsan M. Hameed

Keyword(s):

Data Analysis ◽

Topological Data Analysis ◽

Topological Data

Download Full-text

Topological Data Analysis for Classification of DeepSat-4 Dataset

2020 10th International Symposium onTelecommunications (IST) ◽

10.1109/ist50524.2020.9345829 ◽

2020 ◽

Author(s):

Mehdi Hosseini Moghadam ◽

Mir Mohsen Pedram

Keyword(s):

Data Analysis ◽

Topological Data Analysis ◽

Topological Data

Download Full-text

Scalable topological data analysis for life science applications

Proceedings of the 18th ACM International Conference on Computing Frontiers ◽

10.1145/3457388.3459983 ◽

2021 ◽

Author(s):

Ananth Kalyanaraman

Keyword(s):

Data Analysis ◽

Life Science ◽

Topological Data Analysis ◽

Topological Data

Download Full-text

Topological Data Analysis Approaches to Uncovering the Timing of Ring Structure Onset in Filamentous Networks

Bulletin of Mathematical Biology ◽

10.1007/s11538-020-00847-3 ◽

2021 ◽

Vol 83 (3) ◽

Author(s):

Maria-Veronica Ciocanel ◽

Riley Juenemann ◽

Adriana T. Dawes ◽

Scott A. McKinley

Keyword(s):

Time Series ◽

Data Analysis ◽

Actin Filaments ◽

Time Series Data ◽

Polymer Networks ◽

Topological Data Analysis ◽

Series Data ◽

Topological Features ◽

Topological Data

AbstractIn developmental biology as well as in other biological systems, emerging structure and organization can be captured using time-series data of protein locations. In analyzing this time-dependent data, it is a common challenge not only to determine whether topological features emerge, but also to identify the timing of their formation. For instance, in most cells, actin filaments interact with myosin motor proteins and organize into polymer networks and higher-order structures. Ring channels are examples of such structures that maintain constant diameters over time and play key roles in processes such as cell division, development, and wound healing. Given the limitations in studying interactions of actin with myosin in vivo, we generate time-series data of protein polymer interactions in cells using complex agent-based models. Since the data has a filamentous structure, we propose sampling along the actin filaments and analyzing the topological structure of the resulting point cloud at each time. Building on existing tools from persistent homology, we develop a topological data analysis (TDA) method that assesses effective ring generation in this dynamic data. This method connects topological features through time in a path that corresponds to emergence of organization in the data. In this work, we also propose methods for assessing whether the topological features of interest are significant and thus whether they contribute to the formation of an emerging hole (ring channel) in the simulated protein interactions. In particular, we use the MEDYAN simulation platform to show that this technique can distinguish between the actin cytoskeleton organization resulting from distinct motor protein binding parameters.

Download Full-text

Classification of apatite structures via topological data analysis: a framework for a ‘Materials Barcode’ representation of structure maps

Scientific Reports ◽

10.1038/s41598-021-90070-4 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Scott Broderick ◽

Ruhil Dongol ◽

Tianmu Zhang ◽

Krishna Rajan

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Crystal Chemistry ◽

Persistent Homology ◽

Hierarchical Classification ◽

Topological Data Analysis ◽

Learning Tool ◽

Coordination Polyhedra ◽

Machine Learning Tool ◽

Topological Data

AbstractThis paper introduces the use of topological data analysis (TDA) as an unsupervised machine learning tool to uncover classification criteria in complex inorganic crystal chemistries. Using the apatite chemistry as a template, we track through the use of persistent homology the topological connectivity of input crystal chemistry descriptors on defining similarity between different stoichiometries of apatites. It is shown that TDA automatically identifies a hierarchical classification scheme within apatites based on the commonality of the number of discrete coordination polyhedra that constitute the structural building units common among the compounds. This information is presented in the form of a visualization scheme of a barcode of homology classifications, where the persistence of similarity between compounds is tracked. Unlike traditional perspectives of structure maps, this new “Materials Barcode” schema serves as an automated exploratory machine learning tool that can uncover structural associations from crystal chemistry databases, as well as to achieve a more nuanced insight into what defines similarity among homologous compounds.

Download Full-text

The Shape of Things: Topological Data Analysis

CHANCE ◽

10.1080/09332480.2021.1915036 ◽

2021 ◽

Vol 34 (2) ◽

pp. 59-64

Author(s):

Nicole Lazar ◽

Hyunnam Ryu

Keyword(s):

Data Analysis ◽

Topological Data Analysis ◽

Topological Data

Download Full-text