Advancing Biopharmaceutical Process Development by System-Level Data Analysis and Integration of Omics Data

Author(s):  
Jochen Schaub ◽  
Christoph Clemens ◽  
Hitto Kaufmann ◽  
Torsten W. Schulz
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Benjamin Ulfenborg

Abstract Background Studies on multiple modalities of omics data such as transcriptomics, genomics and proteomics are growing in popularity, since they allow us to investigate complex mechanisms across molecular layers. It is widely recognized that integrative omics analysis holds the promise to unlock novel and actionable biological insights into health and disease. Integration of multi-omics data remains challenging, however, and requires combination of several software tools and extensive technical expertise to account for the properties of heterogeneous data. Results This paper presents the miodin R package, which provides a streamlined workflow-based syntax for multi-omics data analysis. The package allows users to perform analysis of omics data either across experiments on the same samples (vertical integration), or across studies on the same variables (horizontal integration). Workflows have been designed to promote transparent data analysis and reduce the technical expertise required to perform low-level data import and processing. Conclusions The miodin package is implemented in R and is freely available for use and extension under the GPL-3 license. Package source, reference documentation and user manual are available at https://gitlab.com/algoromics/miodin.


2016 ◽  
Vol 15s3 ◽  
pp. CIN.S39458 ◽  
Author(s):  
Wenying Yan ◽  
Wenjin Xue ◽  
Jiajia Chen ◽  
Guang Hu

Due to its extraordinary heterogeneity and complexity, cancer is often proposed as a model case of a systems biology disease or network disease. There is a critical need of effective biomarkers for cancer diagnosis and/or outcome prediction from system level analyses. Methods based on integrating omics data into networks have the potential to revolutionize the identification of cancer biomarkers. Deciphering the biological networks underlying cancer is undoubtedly important for understanding the molecular mechanisms of the disease and identifying effective biomarkers. In this review, the networks constructed for cancer biomarker discovery based on different omics level data are described and illustrated from recent advances in the field.


2020 ◽  
Author(s):  
Christoph Ogris ◽  
Yue Hu ◽  
Janine Arloth ◽  
Nikola S. Müller

AbstractConstantly decreasing costs of high-throughput profiling on many molecular levels generate vast amounts of so-called multi-omics data. Studying one biomedical question on two or more omic levels provides deeper insights into underlying molecular processes or disease pathophysiology. For the majority of multi-omics data projects, the data analysis is performed level-wise, followed by a combined interpretation of results. Few exceptions exist, for example the pairwise integration for quantitative trait analysis. However, the full potential of integrated data analysis is not leveraged yet, presumably due to the complexity of the data and the lacking toolsets. Here we propose a versatile approach, to perform a multi-level integrated analysis: The Knowledge guIded Multi-Omics Network inference approach, KiMONo. KiMONo performs network inference using statistical modeling on top of a powerful knowledge-guided strategy exploiting prior information from biological sources. Within the resulting network, nodes represent features of all input types and edges refer to associations between them, e.g. underlying a disease. Our method infers the network by combining sparse grouped-LASSO regression with a genomic position-confined Biogrid protein-protein interaction prior. In a comprehensive evaluation, we demonstrate that our method is robust to noise and still performs on low-sample size data. Applied to the five-level data set of the publicly available Pan-cancer collection, KiMONO integrated mutation, epigenetics, transcriptomics, proteomics and clinical information, detecting cancer specific omic features. Moreover, we analysed a four-level data set from a major depressive disorder cohort, including genetic, epigenetic, transcriptional and clinical data. Here we demonstrated KiMONo’s analytical power to identify expression quantitative trait methylation sites and loci and show it’s advantage to state-of-the-art methods. Our results show the general applicability to the full spectrum multi-omics data and demonstrating that KiMONo is a powerful approach towards leveraging the full potential of data sets. The method is freely available as an R package (https://github.com/cellmapslab/kimono).


2018 ◽  
Author(s):  
Benjamin Ulfenborg

AbstractBackgroundStudies on multiple modalities of omics data such as transcriptomics, genomics and proteomics are growing in popularity, since they allow us to investigate complex mechanisms across molecular layers. It is widely recognized that integrative omics analysis holds the promise to unlock novel and actionable biological insights to health and disease. Integration of multi-omics data remains challenging, however, and requires combination of several software tools and extensive technical expertise to account for the properties of heterogeneous data.ResultsThis paper presents the miodin R package, which provides a streamlined workflow-based syntax for multi-omics data analysis. The package allows users to perform analysis and integration of omics data either across experiments on the same samples, or across studies on the same variables. Workflows have been designed to promote transparent data analysis and reduce the technical expertise required to perform low-level data import and processing.ConclusionsThe miodin package is implemented in R and is freely available for use and extension under the GPL-3 license. Package source, reference documentation and user manual are available at https://gitlab.com/algoromics/miodin.


Author(s):  
Hirotaka Iijima ◽  
Gabrielle Gilmer ◽  
Kai Wang ◽  
Sruthi Sivakumar ◽  
Christopher Evans ◽  
...  

Abstract Increased mechanistic insight into the pathogenesis of knee osteoarthritis (KOA) is needed to develop efficacious disease-modifying treatments. Though age-related pathogenic mechanisms are most relevant to the majority of clinically-presenting KOA, the bulk of our mechanistic understanding of KOA has been derived using surgically induced post-traumatic OA (PTOA) models. Here, we took an integrated approach of meta-analysis and multi-omics data analysis to elucidate pathogenic mechanisms of age-related KOA in mice. Protein-level data were integrated with transcriptomic profiling to reveal inflammation, autophagy, and cellular senescence as primary hallmarks of age-related KOA. Importantly, the molecular profiles of cartilage aging were unique from those observed following PTOA, with less than 3% overlap between the two models. At the nexus of the three aging hallmarks, Advanced Glycation End-Product (AGE)/Receptor for AGE emerged as the most statistically robust pathway associated with age-related KOA. This pathway was further supported by analysis of mass spectrometry data. Notably, the change in AGE-RAGE signaling over time was exclusively observed in male mice, suggesting sexual dimorphism in the pathogenesis of age-induced KOA in murine models. Collectively, these findings implicate dysregulation of AGE-RAGE signaling as a sex-dependent driver of age-related KOA.


Author(s):  
Eun-Young Mun ◽  
Anne E. Ray

Integrative data analysis (IDA) is a promising new approach in psychological research and has been well received in the field of alcohol research. This chapter provides a larger unifying research synthesis framework for IDA. Major advantages of IDA of individual participant-level data include better and more flexible ways to examine subgroups, model complex relationships, deal with methodological and clinical heterogeneity, and examine infrequently occurring behaviors. However, between-study heterogeneity in measures, designs, and samples and systematic study-level missing data are significant barriers to IDA and, more broadly, to large-scale research synthesis. Based on the authors’ experience working on the Project INTEGRATE data set, which combined individual participant-level data from 24 independent college brief alcohol intervention studies, it is also recognized that IDA investigations require a wide range of expertise and considerable resources and that some minimum standards for reporting IDA studies may be needed to improve transparency and quality of evidence.


2021 ◽  
Vol 20 (1) ◽  
Author(s):  
Jingru Zhou ◽  
Yingping Zhuang ◽  
Jianye Xia

Abstract Background Genome-scale metabolic model (GSMM) is a powerful tool for the study of cellular metabolic characteristics. With the development of multi-omics measurement techniques in recent years, new methods that integrating multi-omics data into the GSMM show promising effects on the predicted results. It does not only improve the accuracy of phenotype prediction but also enhances the reliability of the model for simulating complex biochemical phenomena, which can promote theoretical breakthroughs for specific gene target identification or better understanding the cell metabolism on the system level. Results Based on the basic GSMM model iHL1210 of Aspergillus niger, we integrated large-scale enzyme kinetics and proteomics data to establish a GSMM based on enzyme constraints, termed a GEM with Enzymatic Constraints using Kinetic and Omics data (GECKO). The results show that enzyme constraints effectively improve the model’s phenotype prediction ability, and extended the model’s potential to guide target gene identification through predicting metabolic phenotype changes of A. niger by simulating gene knockout. In addition, enzyme constraints significantly reduced the solution space of the model, i.e., flux variability over 40.10% metabolic reactions were significantly reduced. The new model showed also versatility in other aspects, like estimating large-scale $$k_{{cat}}$$ k cat values, predicting the differential expression of enzymes under different growth conditions. Conclusions This study shows that incorporating enzymes’ abundance information into GSMM is very effective for improving model performance with A. niger. Enzyme-constrained model can be used as a powerful tool for predicting the metabolic phenotype of A. niger by incorporating proteome data. In the foreseeable future, with the fast development of measurement techniques, and more precise and rich proteomics quantitative data being obtained for A. niger, the enzyme-constrained GSMM model will show greater application space on the system level.


2021 ◽  
Vol 49 ◽  
pp. 107739
Author(s):  
Parminder S. Reel ◽  
Smarti Reel ◽  
Ewan Pearson ◽  
Emanuele Trucco ◽  
Emily Jefferson

2018 ◽  
Vol 19 (S14) ◽  
Author(s):  
Diogo Manuel Carvalho Leite ◽  
Xavier Brochet ◽  
Grégory Resch ◽  
Yok-Ai Que ◽  
Aitana Neves ◽  
...  

Rhizosphere ◽  
2017 ◽  
Vol 3 ◽  
pp. 222-229 ◽  
Author(s):  
Richard Allen White ◽  
Mark I. Borkum ◽  
Albert Rivas-Ubach ◽  
Aivett Bilbao ◽  
Jason P. Wendler ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document