computational genomics
Recently Published Documents


TOTAL DOCUMENTS

109
(FIVE YEARS 27)

H-INDEX

17
(FIVE YEARS 3)

Mathematics ◽  
2021 ◽  
Vol 9 (24) ◽  
pp. 3239
Author(s):  
Ginés Almagro-Hernández ◽  
Juana-María Vivo ◽  
Manuel Franco ◽  
Jesualdo Tomás Fernández-Breis

Computational genomics aim at supporting the discovery of how the functionality of the genome of the organism under study is affected both by its own sequence and structure, and by the network of interaction between this genome and different biological or physical factors. In this work, we focus on the analysis of ChIP-seq data, for which many methods have been proposed in the recent years. However, to the best of our knowledge, those methods lack an appropriate mathematical formalism. We have developed a method based on multivariate models for the analysis of the set of peaks obtained from a ChIP-seq experiment. This method can be used to characterize an individual experiment and to compare different experiments regardless of where and when they were conducted. The method is based on a multivariate hypergeometric distribution, which fits the complexity of the biological data and is better suited to deal with the uncertainty generated in this type of experiments than the dichotomous models used by the state of the art methods. We have validated this method with Arabidopsis thaliana datasets obtained from the Remap2020 database, obtaining results in accordance with the original study of these samples. Our work shows a novel way for analyzing ChIP-seq data.


Life ◽  
2021 ◽  
Vol 11 (11) ◽  
pp. 1211
Author(s):  
Yuriy L. Orlov ◽  
Anastasia A. Anashkina

This Special Issue, “Life: Computational Genomics”, presents research articles on systems biology applications, computational genomics, and bioinformatics methods in life sciences [...]


2021 ◽  
Author(s):  
Jacob Schreiber ◽  
Surag Nair ◽  
Akshay Balsubramani ◽  
Anshul Kundaje

In-silico saturation mutagenesis (ISM) is a popular approach in computational genomics for calculating feature attributions on biological sequences that proceeds by systematically perturbing each position in a sequence and recording the difference in model output. However, this method can be slow because systematically perturbing each position requires performing a number of forward passes proportional to the length of the sequence being examined. In this work, we propose a modification of ISM that leverages the principles of compressed sensing to require only a constant number of forward passes, regardless of sequence length, when applied to models that contain operations with a limited receptive field, such as convolutions. Our method, named Yuzu, can reduce the time that ISM spends in convolution operations by several orders of magnitude and, consequently, Yuzu can speed up ISM on several commonly used architectures in genomics by over an order of magnitude. Notably, we found that Yuzu provides speedups that increase with the complexity of the convolution operation and the length of the sequence being analyzed, suggesting that Yuzu provides large benefits in realistic settings. We have made this tool available at https://github.com/kundajelab/yuzu.


Author(s):  
Ariya Shajii ◽  
Ibrahim Numanagić ◽  
Alexander T. Leighton ◽  
Haley Greenyer ◽  
Saman Amarasinghe ◽  
...  

Author(s):  
Naman Mangukia ◽  
Priyashi Rao ◽  
Kamlesh Patel ◽  
Himanshu Pandya ◽  
Rakesh M. Rawal

2021 ◽  
Vol 18 (4) ◽  
pp. 1227-1229
Author(s):  
Dong-Qing Wei ◽  
Aman Chandra Kaushik ◽  
Gurudeeban Selvaraj ◽  
Yi Pan

2021 ◽  
Vol 16 (1) ◽  
Author(s):  
Sven Schrinner ◽  
Manish Goel ◽  
Michael Wulfert ◽  
Philipp Spohr ◽  
Korbinian Schneeberger ◽  
...  

AbstractGenome assembly is one of the most important problems in computational genomics. Here, we suggest addressing an issue that arises in homology-based scaffolding, that is, when linking and ordering contigs to obtain larger pseudo-chromosomes by means of a second incomplete assembly of a related species. The idea is to use alignments of binned regions in one contig to find the most homologous contig in the other assembly. We show that ordering the contigs of the other assembly can be expressed by a new string problem, the longest run subsequence problem (LRS). We show that LRS is NP-hard and present reduction rules and two algorithmic approaches that, together, are able to solve large instances of LRS to provable optimality. All data used in the experiments as well as our source code are freely available. We demonstrate its usefulness within an existing larger scaffolding approach by solving realistic instances resulting from partial Arabidopsis thaliana assemblies in short computation time.


Sign in / Sign up

Export Citation Format

Share Document