omics data integration
Recently Published Documents


TOTAL DOCUMENTS

97
(FIVE YEARS 57)

H-INDEX

12
(FIVE YEARS 6)

2022 ◽  
Vol 4 (1) ◽  
Author(s):  
Maria Tsagiopoulou ◽  
Nikolaos Pechlivanis ◽  
Maria Christina Maniou ◽  
Fotis Psomopoulos

ABSTRACT The integration of multi-omics data can greatly facilitate the advancement of research in Life Sciences by highlighting new interactions. However, there is currently no widespread procedure for meaningful multi-omics data integration. Here, we present a robust framework, called InterTADs, for integrating multi-omics data derived from the same sample, and considering the chromatin configuration of the genome, i.e. the topologically associating domains (TADs). Following the integration process, statistical analysis highlights the differences between the groups of interest (normal versus cancer cells) relating to (i) independent and (ii) integrated events through TADs. Finally, enrichment analysis using KEGG database, Gene Ontology and transcription factor binding sites and visualization approaches are available. We applied InterTADs to multi-omics datasets from 135 patients with chronic lymphocytic leukemia (CLL) and found that the integration through TADs resulted in a dramatic reduction of heterogeneity compared to individual events. Significant differences for individual events and on TADs level were identified between patients differing in the somatic hypermutation status of the clonotypic immunoglobulin genes, the core biological stratifier in CLL, attesting to the biomedical relevance of InterTADs. In conclusion, our approach suggests a new perspective towards analyzing multi-omics data, by offering reasonable execution time, biological benchmarking and potentially contributing to pattern discovery through TADs.


2021 ◽  
Vol 8 (1) ◽  
Author(s):  
Alyssa Imbert ◽  
Magali Rompais ◽  
Mohammed Selloum ◽  
Florence Castelli ◽  
Emmanuelle Mouton-Barbosa ◽  
...  

AbstractGenes are pleiotropic and getting a better knowledge of their function requires a comprehensive characterization of their mutants. Here, we generated multi-level data combining phenomic, proteomic and metabolomic acquisitions from plasma and liver tissues of two C57BL/6 N mouse models lacking the Lat (linker for activation of T cells) and the Mx2 (MX dynamin-like GTPase 2) genes, respectively. Our dataset consists of 9 assays (1 preclinical, 2 proteomics and 6 metabolomics) generated with a fully non-targeted and standardized approach. The data and processing code are publicly available in the ProMetIS R package to ensure accessibility, interoperability, and reusability. The dataset thus provides unique molecular information about the physiological role of the Lat and Mx2 genes. Furthermore, the protocols described herein can be easily extended to a larger number of individuals and tissues. Finally, this resource will be of great interest to develop new bioinformatic and biostatistic methods for multi-omics data integration.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Sarmistha Das ◽  
Indranil Mukhopadhyay

AbstractMulti-omics data integration is widely used to understand the genetic architecture of disease. In multi-omics association analysis, data collected on multiple omics for the same set of individuals are immensely important for biomarker identification. But when the sample size of such data is limited, the presence of partially missing individual-level observations poses a major challenge in data integration. More often, genotype data are available for all individuals under study but gene expression and/or methylation information are missing for different subsets of those individuals. Here, we develop a statistical model TiMEG, for the identification of disease-associated biomarkers in a case–control paradigm by integrating the above-mentioned data types, especially, in presence of missing omics data. Based on a likelihood approach, TiMEG exploits the inter-relationship among multiple omics data to capture weaker signals, that remain unidentified in single-omic analysis or common imputation-based methods. Its application on a real tuberous sclerosis dataset identified functionally relevant genes in the disease pathway.


Author(s):  
Antoine Bodein ◽  
Marie-Pier Scott-Boyer ◽  
Olivier Perin ◽  
Kim-Anh Lê Cao ◽  
Arnaud Droit

Abstract Motivation Multi-omics data integration enables the global analysis of biological systems and discovery of new biological insights. Multi-omics experimental designs have been further extended with a longitudinal dimension to study dynamic relationships between molecules. However, methods that integrate longitudinal multi-omics data are still in their infancy. Results We introduce the R package timeOmics, a generic analytical framework for the integration of longitudinal multi-omics data. The framework includes pre-processing, modeling and clustering to identify molecular features strongly associated with time. We illustrate this framework in a case study to detect seasonal patterns of mRNA, metabolites, gut taxa and clinical variables in patients with diabetes mellitus from the integrative Human Microbiome Project. Availabilityand implementation timeOmics is available on Bioconductor and github.com/abodein/timeOmics. Supplementary information Supplementary data are available at Bioinformatics online.


2021 ◽  
Author(s):  
Matti Hoch ◽  
Suchi Smita Gupta ◽  
Konstantin Cesnulevicius ◽  
David Lescheid ◽  
Myron Schultz ◽  
...  

Disease maps have emerged as computational knowledge bases for exploring and modeling disease-specific molecular processes. By capturing molecular interactions, disease-associated processes, and phenotypes in standardized representations, disease maps provide a platform for applying bioinformatics and systems biology approaches. Applications range from simple map exploration to algorithm-driven target discovery and network perturbation. The web-based MINERVA environment for disease maps provides a platform to develop tools not only for mapping experimental data but also to identify, analyze and simulate disease-specific regulatory networks. We have developed a MINERVA plugin suite based on network topology and enrichment analyses that facilitate multi-omics data integration and enable in silico perturbation experiments on disease maps. We demonstrate workflows by analyzing two RNA-seq datasets on the Atlas of Inflammation Resolution (AIR). Our approach improves usability and increases the functionality of disease maps by providing easy access to available data and integration of self-generated data. It supports efficient and intuitive analysis of omics data, with a focus on disease maps.


2021 ◽  
Author(s):  
Pia Rautenstrauch ◽  
Anna Hendrika Cornelia Vlot ◽  
Sepideh Saran ◽  
Uwe Ohler

2021 ◽  
Vol 17 (8) ◽  
pp. e1009224
Author(s):  
Ran Duan ◽  
Lin Gao ◽  
Yong Gao ◽  
Yuxuan Hu ◽  
Han Xu ◽  
...  

Computational integrative analysis has become a significant approach in the data-driven exploration of biological problems. Many integration methods for cancer subtyping have been proposed, but evaluating these methods has become a complicated problem due to the lack of gold standards. Moreover, questions of practical importance remain to be addressed regarding the impact of selecting appropriate data types and combinations on the performance of integrative studies. Here, we constructed three classes of benchmarking datasets of nine cancers in TCGA by considering all the eleven combinations of four multi-omics data types. Using these datasets, we conducted a comprehensive evaluation of ten representative integration methods for cancer subtyping in terms of accuracy measured by combining both clustering accuracy and clinical significance, robustness, and computational efficiency. We subsequently investigated the influence of different omics data on cancer subtyping and the effectiveness of their combinations. Refuting the widely held intuition that incorporating more types of omics data always produces better results, our analyses showed that there are situations where integrating more omics data negatively impacts the performance of integration methods. Our analyses also suggested several effective combinations for most cancers under our studies, which may be of particular interest to researchers in omics data analysis.


Sign in / Sign up

Export Citation Format

Share Document