scholarly journals Orymold: ontology based gene expression data integration and analysis tool applied to rice.

2009 ◽  
Vol 10 (1) ◽  
pp. 158 ◽  
Author(s):  
Jaume Mercade ◽  
Antonio Espinosa ◽  
Jose Enrique Adsuara ◽  
Rosa Adrados ◽  
Jordi Segura ◽  
...  
2007 ◽  
Vol 46 (05) ◽  
pp. 608-613 ◽  
Author(s):  
M. Rosolowski ◽  
H. Berger ◽  
C. Schwaenen ◽  
S. Wessendorf ◽  
M. Loeffler ◽  
...  

Summary Objectives: Array-comparative genomic hybridization (aCGH) is a high-throughput method to detect and map copy number aberrations in the genome. Multi-step analysis of high-dimensional data requires an integrated suite of bioinformatic tools. In this paperwe detail an analysis pipeline for array CGH data. Methods: We developed an analysis tool for array CGH data which supports single and multi-chip analyses as well as combined analyses with paired mRNA gene expression data. The functions supporting relevant steps of analysis were implemented using the open source software R and combined as package aCGHPipeline. Analysis methods were illustrated using 189 CGH arrays of aggressive B-cell lymphomas. Results: The package covers data input, quality control, normalization, segmentation and classification. For multi-chip analysis aCGHPipeline offers an algorithm for automatic delineation of recurrent regions. This task was performed manuallyup to now. The package also supports combined analysis with mRNA gene expression data. Outputs consist of HTML documents to facilitate communication with clinical partners. Conclusions: The R package aCGHPipeline supports basic tasks of single and multi-chip analysis of array CGH data.


2012 ◽  
Vol 28 (17) ◽  
pp. 2283-2284
Author(s):  
Peter M. Krempl ◽  
Juergen Mairhofer ◽  
Gerald Striedner ◽  
Gerhard G. Thallinger

2016 ◽  
Author(s):  
Alina Frolova ◽  
Vladyslav Bondarenko ◽  
Maria Obolenska

AbstractBackgroundAccording to major public repositories statistics an overwhelming majority of the existing and newly uploaded data originates from microarray experiments. Unfortunately, the potential of this data to bring new insights is limited by the effects of individual study-specific biases due to small number of biological samples. Increasing sample size by direct microarray data integration increases the statistical power to obtain a more precise estimate of gene expression in a population of individuals resulting in lower false discovery rates. However, despite numerous recommendations for gene expression data integration, there is a lack of a systematic comparison of different processing approaches aimed to asses microarray platforms diversity and ambiguous probesets to genes correspondence, leading to low number of studies applying integration.ResultsHere, we investigated five different approaches of the microarrays data processing in comparison with RNA-seq data on breast cancer samples. We aimed to evaluate different probesets annotations as well as different procedures of choosing between probesets mapped to the same gene. We show that pipelines rankings are mostly preserved across Affymetrix and Illumina platforms. BrainArray approach based on updated annotation and redesigned probesets definition and choosing probeset with the maximum average signal across the samples have best correlation with RNA-seq, while averaging probesets signals as well as scoring the quality of probes sequences mapping to the transcripts of the targeted gene have worse correlation. Finally, randomly selecting probeset among probesets mapped to the same gene significantly decreases the correlation with RNA-seq.ConclusionWe show that methods, which rely on actual probesets signal intensities, are advantageous to methods considering biological characteristics of the probes sequences only and that cross-platform integration of datasets improves correlation with the RNA-seq data. We consider the results obtained in this paper contributive to the integrative analysis as a worthwhile alternative to the classical meta-analysis of the multiple gene expression datasets.


2012 ◽  
Vol 14 (4) ◽  
pp. 469-490 ◽  
Author(s):  
C. Lazar ◽  
S. Meganck ◽  
J. Taminau ◽  
D. Steenhoff ◽  
A. Coletta ◽  
...  

2021 ◽  
Author(s):  
Yang Xu ◽  
Edmon Begoli ◽  
Rachel Patton McCord

The booming single-cell technologies bring a surge of high dimensional data that come from different sources and represent cellular systems from different views. With advances in single-cell technologies, integrating single-cell data across modalities arises as a new computational challenge and gains more and more attention within the community. Here, we present a novel adversarial approach, sciCAN, to integrate single-cell chromatin accessibility and gene expression data in an unsupervised manner. We benchmarked sciCAN with 3 state-of-the-art (SOTA) methods in 5 scATAC-seq/scRNA-seq datasets, and we demonstrated that our method dealt with data integration with better balance of mutual transferring between modalities than the other 3 SOTA methods. We further applied sciCAN to 10X Multiome data and confirmed the integrated representation preserves information of the hematopoietic hierarchy. Finally, we investigated CRSIPR-perturbed single-cell K562 ATAC-seq and RNA-seq data to identify cells with related responses to different perturbations in these different modalities.


Sign in / Sign up

Export Citation Format

Share Document