TUMOR HAPLOTYPE ASSEMBLY ALGORITHMS FOR CANCER GENOMICS

AbstractBackgroundHaplotype information is essential for many genetic and genomic analyses, including genotype-phenotype associations in human, animals and plants. Haplotype assembly is a method for reconstructing haplotypes from DNA sequencing reads. By the advent of new sequencing technologies, new algorithms are needed to ensure long and accurate haplotypes. While a few linked-read haplotype assembly algorithms are available for diploid genomes, there are no algorithms yet for polyploids.ResultsThe first haplotyping algorithm designed for 10X linked reads generated from a polyploid genome is presented, built on a typical short-read haplotyping method, SDhaP. Using the input aligned reads and called variants, the haplotype-relevant information is extracted. Next, reads with the same barcodes are combined to produce molecule-specific fragments. Then, these fragments are clustered into strongly connected components which are then used as input of a haplotype assembly core in order to estimate accurate and long haplotypes.ConclusionsHap10 is a novel algorithm for haplotype assembly of polyploid genomes using linked reads. The performance of the algorithms is evaluated in a number of simulation scenarios and its applicability is demonstrated on a real dataset of sweet potato.

Download Full-text

Personalized Medicine and Cancer Genomics

PsycEXTRA Dataset ◽

10.1037/e542752012-001 ◽

2009 ◽

Keyword(s):

Personalized Medicine ◽

Cancer Genomics

Download Full-text

A Study of DNA Fragment Assembly Algorithms

Journal of Applied Physics and Engineering ◽

10.26524/jap2 ◽

2016 ◽

Vol 1 (1) ◽

pp. 10-16

Author(s):

Satyanarayana Reddy Beeram ◽

Edara Srinivasa Reddy

Keyword(s):

Fragment Assembly ◽

Dna Fragment ◽

Assembly Algorithms ◽

Dna Fragment Assembly

Download Full-text

Faculty Opinions recommendation of Optimal algorithms for haplotype assembly from whole-genome sequence data.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.13339986.14707085 ◽

2011 ◽

Author(s):

Alejandro Schaffer

Keyword(s):

Genome Sequence ◽

Sequence Data ◽

Whole Genome Sequence ◽

Whole Genome ◽

Optimal Algorithms ◽

Genome Sequence Data ◽

Haplotype Assembly

Download Full-text

Faculty Opinions recommendation of The first five years of single-cell cancer genomics and beyond.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.725824177.793535257 ◽

2017 ◽

Author(s):

Wafik El-Deiry

Keyword(s):

Single Cell ◽

Cancer Genomics ◽

Cell Cancer

Download Full-text

Cancer Genomics & Proteomics

10.21873/cgp ◽

2018 ◽

Cited By ~ 1

Keyword(s):

Cancer Genomics

Download Full-text

Bayesian graphical models for modern biological applications

Statistical Methods & Applications ◽

10.1007/s10260-021-00572-8 ◽

2021 ◽

Author(s):

Yang Ni ◽

Veerabhadran Baladandayuthapani ◽

Marina Vannucci ◽

Francesco C. Stingo

Keyword(s):

Graphical Models ◽

Cancer Genomics ◽

Graph Structure ◽

Biological Processes ◽

Limited Sample ◽

Large Networks ◽

Bayesian Approaches ◽

Complex Sampling ◽

Complex Dependence ◽

Bayesian Graphical Models

AbstractGraphical models are powerful tools that are regularly used to investigate complex dependence structures in high-throughput biomedical datasets. They allow for holistic, systems-level view of the various biological processes, for intuitive and rigorous understanding and interpretations. In the context of large networks, Bayesian approaches are particularly suitable because it encourages sparsity of the graphs, incorporate prior information, and most importantly account for uncertainty in the graph structure. These features are particularly important in applications with limited sample size, including genomics and imaging studies. In this paper, we review several recently developed techniques for the analysis of large networks under non-standard settings, including but not limited to, multiple graphs for data observed from multiple related subgroups, graphical regression approaches used for the analysis of networks that change with covariates, and other complex sampling and structural settings. We also illustrate the practical utility of some of these methods using examples in cancer genomics and neuroimaging.

Download Full-text

Ranking cancer drivers via betweenness-based outlier detection and random walks

BMC Bioinformatics ◽

10.1186/s12859-021-03989-w ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Cesim Erten ◽

Aissa Houdjedj ◽

Hilal Kazan

Keyword(s):

Cancer Genomics ◽

Interaction Network ◽

Molecular Data ◽

Alternative Methods ◽

Patient Specific ◽

Cancer Genes ◽

Driver Genes ◽

Cancer Driver ◽

Protein Protein Interaction ◽

Genomic Studies

Abstract Background Recent cancer genomic studies have generated detailed molecular data on a large number of cancer patients. A key remaining problem in cancer genomics is the identification of driver genes. Results We propose BetweenNet, a computational approach that integrates genomic data with a protein-protein interaction network to identify cancer driver genes. BetweenNet utilizes a measure based on betweenness centrality on patient specific networks to identify the so-called outlier genes that correspond to dysregulated genes for each patient. Setting up the relationship between the mutated genes and the outliers through a bipartite graph, it employs a random-walk process on the graph, which provides the final prioritization of the mutated genes. We compare BetweenNet against state-of-the art cancer gene prioritization methods on lung, breast, and pan-cancer datasets. Conclusions Our evaluations show that BetweenNet is better at recovering known cancer genes based on multiple reference databases. Additionally, we show that the GO terms and the reference pathways enriched in BetweenNet ranked genes and those that are enriched in known cancer genes overlap significantly when compared to the overlaps achieved by the rankings of the alternative methods.

Download Full-text

Advancing global equity in cancer genomics – challenges and opportunities in Sub-Saharan Africa

Current Opinion in Genetics & Development ◽

10.1016/j.gde.2020.11.006 ◽

2021 ◽

Vol 66 ◽

pp. 20-24

Author(s):

Nchangwi S Munung ◽

Melvin A Ambele ◽

Pontsho Moela

Keyword(s):

Cancer Genomics ◽

Sub Saharan Africa ◽

Challenges And Opportunities ◽

Sub Saharan

Download Full-text

The Role of Network Science in Glioblastoma

Cancers ◽

10.3390/cancers13051045 ◽

2021 ◽

Vol 13 (5) ◽

pp. 1045

Author(s):

Marta B. Lopes ◽

Eduarda P. Martins ◽

Susana Vinga ◽

Bruno M. Costa

Keyword(s):

Personalized Medicine ◽

Drug Development ◽

Information Flow ◽

Clinical Studies ◽

Network Science ◽

Cancer Genomics ◽

High Dimensional ◽

Network Discovery ◽

Software Implementations

Network science has long been recognized as a well-established discipline across many biological domains. In the particular case of cancer genomics, network discovery is challenged by the multitude of available high-dimensional heterogeneous views of data. Glioblastoma (GBM) is an example of such a complex and heterogeneous disease that can be tackled by network science. Identifying the architecture of molecular GBM networks is essential to understanding the information flow and better informing drug development and pre-clinical studies. Here, we review network-based strategies that have been used in the study of GBM, along with the available software implementations for reproducibility and further testing on newly coming datasets. Promising results have been obtained from both bulk and single-cell GBM data, placing network discovery at the forefront of developing a molecularly-informed-based personalized medicine.

Download Full-text