scholarly journals Breast cancer PAM50 subtypes: Correlation between RNA-Seq and multiplexed gene expression platforms

2017 ◽  
Vol 28 ◽  
pp. v56-v57
Author(s):  
A.C. Picornell ◽  
I. Echavarria Diaz-Guardamino ◽  
E.L. Alvarez Castillo ◽  
S. Lopez-Tarruella Cobo ◽  
Y. Jerez ◽  
...  
2012 ◽  
Vol 30 (30_suppl) ◽  
pp. 56-56
Author(s):  
Byung-In Lee ◽  
Kahuku Oades ◽  
Lien Vo ◽  
Jerry Lee ◽  
Mark Landers ◽  
...  

56 Background: Gene expression profiling has been shown to be effective in analyzing postoperative tumor samples in various cancers. However, in analyzing small specimens such as core biopsies, the limited amount of available material makes multi-gene analyses difficult or impossible. Microarray-based analyses also provide limited dynamic range. We describe the development of targeted RNA-sequencing methodology which combines the power of a universal RNA amplification with NGS for an ultra-deep expression analysis of multiple target genes, enabling <100 ng of sample input for multi-gene analysis in a single tube format. Methods: The gene expression patterns of triple-negative breast cancer FFPE samples were analyzed using a 96-gene breast cancer biomarker panel across three different platforms: Affymetrix Human Gene ST 1.0 microarrays, a pre-developed OncoScore qRT-PCR panel, and targeted RNA-seq. For targeted RNA-seq analysis, the 96-gene panel was amplified using a universal, single-tube “XP-PCR” amplification strategy followed by sequence analysis using the Ion-Torrent Personal Genome Machine. Results: Targeted RNA-seq provided the most sensitivity in terms of detection rates with <100 ng FFPE RNA input and provides unlimited dynamic range with increased sequencing depth. Expression ratio compression issues typically associated with a high number of pre-amplification cycles in standard multiplex-primed methods were not observed here. Low expressing genes, undetectable by qRT-PCR analysis from 1,000 ng input FFPE RNA, were detected and eligible for expression analysis with a significant number of sequencing reads. Alternative transcription/splicing analysis is also possible from sequence analysis of the target transcripts using targeted RNA-seq. Conclusions: By combining universally primed pre-amplification and NGS in multi-gene expression analysis, targeted RNA-seq provides the most sensitive gene expression analysis methodology.


2020 ◽  
Author(s):  
Qiang Song ◽  
Man Huang ◽  
Guicheng Wu ◽  
Lu Dou ◽  
Wenjin Zhang ◽  
...  

Abstract Background Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) is the most sensitive technique for evaluating gene expression levels. Choosing appropriate reference genes (RGs) is critical for normalizing and evaluating changes in the expression of target genes. However, uniform and reliable RGs for breast cancer research have not been identified, limiting the value of target gene expression studies. Here, we provide a novel approach for mining RGs by using the RNA-seq dataset to identify reliable and accurate RGs that can be applied to different types of breast cancer tissues and cell lines. Methods First, we compiled the transcriptome profiling data from the TCGA database involving 1217 samples to identify novel RGs and then ten genes (SF1, TARDBP, THRAP3, QRICH1, TRA2B, SRSF3, YY1, DNAJC8, RNF10, and RHOA) with relatively stable expression levels were chosen as novel candidate RGs. Additionally, six conventional RGs (ACTB, TUBA1A, RPL13A, B2M, GAPDH, and GUSB) were also selected. To determine and validate the optimal RGs we performed qRT-PCR experiments on 87 samples from 5 types of surgically excised breast tumor specimens including HR+HER2-, HR+HER2+, HR-HER2-, HR-HER2+, breast cancer after neoadjuvant chemotherapy (NAC) and their matched para-carcinoma tissues, furthermore, we also included a benign breast tumor sample. Six biological replicates were included for each tissue. Moreover, we assessed 7 breast cancer cell lines (MCF-10A, MCF-7, T-47D, MDA-MB-231, MDA-MB-468, as well as MDA-MB-231 with either CNR2 knockdown or overexpression; 3 biological replicates for each line). Five statistical algorithms (geNorm, NormFinder, ΔCt method, BestKeeper, and ComprFinder) were used to assess the stability of expression of each RG across all breast cancer tissues and cell lines. Results Our results show that RG combinations SF1+TRA2B+THRAP3 and THRAP3+RHOA+QRICH1 showed stable expression in breast cancer tissues and cell lines, respectively, and that these two combinations displayed good interchangeability. Therefore, we propose that the above two combinations are optimal triplet RGs for breast cancer research. Conclusions In summary, we identified novel and reliable RG combinations for breast cancer research based on a public RNA-seq dataset which lays a solid foundation for accurate normalization of qRT-PCR results across different breast cancer tissues and cells.


2020 ◽  
Author(s):  
Xiaomei Li ◽  
Lin Liu ◽  
Greg Goodall ◽  
Andreas Schreiber ◽  
Taosheng Xu ◽  
...  

AbstractBreast cancer prognosis is challenging due to the heterogeneity of the disease. Various computational methods using bulk RNA-seq data have been proposed for breast cancer prognosis. However, these methods suffer from limited performances or ambiguous biological relevance, as a result of the neglect of intra-tumor heterogeneity. Recently, single cell RNA-sequencing (scRNA-seq) has emerged for studying tumor heterogeneity at cellular levels. In this paper, we propose a novel method, scPrognosis, to improve breast cancer prognosis with scRNA-seq data. scPrognosis uses the scRNA-seq data of the biological process Epithelial-to-Mesenchymal Transition (EMT). It firstly infers the EMT pseudotime and a dynamic gene co-expression network, then uses an integrative model to select genes important in EMT based on their expression variation and differentiation in different stages of EMT, and their roles in the dynamic gene co-expression network. To validate and apply the selected signatures to breast cancer prognosis, we use them as the features to build a prediction model with bulk RNA-seq data. The experimental results show that scPrognosis outperforms other benchmark breast cancer prognosis methods that use bulk RNA-seq data. Moreover, the dynamic changes in the expression of the selected signature genes in EMT may provide clues to the link between EMT and clinical outcomes of breast cancer. scPrognosis will also be useful when applied to scRNA-seq datasets of different biological processes other than EMT.Author summaryVarious computational methods have been developed for breast cancer prognosis. However, those methods mainly use the gene expression data generated by the bulk RNA sequencing techniques, which average the expression level of a gene across different cell types. As breast cancer is a heterogenous disease, the bulk gene expression may not be the ideal resource for cancer prognosis. In this study, we propose a novel method to improve breast cancer prognosis using scRNA-seq data. The proposed method has been applied to the EMT scRNA-seq dataset for identifying breast cancer signatures for prognosis. In comparison with existing bulk expression data based methods in breast cancer prognosis, our method shows a better performance. Our single-cell-based signatures provide clues to the relation between EMT and clinical outcomes of breast cancer. In addition, the proposed method can also be useful when applied to scRNA-seq datasets of different biological processes other than EMT.


2016 ◽  
Author(s):  
Alina Frolova ◽  
Vladyslav Bondarenko ◽  
Maria Obolenska

AbstractBackgroundAccording to major public repositories statistics an overwhelming majority of the existing and newly uploaded data originates from microarray experiments. Unfortunately, the potential of this data to bring new insights is limited by the effects of individual study-specific biases due to small number of biological samples. Increasing sample size by direct microarray data integration increases the statistical power to obtain a more precise estimate of gene expression in a population of individuals resulting in lower false discovery rates. However, despite numerous recommendations for gene expression data integration, there is a lack of a systematic comparison of different processing approaches aimed to asses microarray platforms diversity and ambiguous probesets to genes correspondence, leading to low number of studies applying integration.ResultsHere, we investigated five different approaches of the microarrays data processing in comparison with RNA-seq data on breast cancer samples. We aimed to evaluate different probesets annotations as well as different procedures of choosing between probesets mapped to the same gene. We show that pipelines rankings are mostly preserved across Affymetrix and Illumina platforms. BrainArray approach based on updated annotation and redesigned probesets definition and choosing probeset with the maximum average signal across the samples have best correlation with RNA-seq, while averaging probesets signals as well as scoring the quality of probes sequences mapping to the transcripts of the targeted gene have worse correlation. Finally, randomly selecting probeset among probesets mapped to the same gene significantly decreases the correlation with RNA-seq.ConclusionWe show that methods, which rely on actual probesets signal intensities, are advantageous to methods considering biological characteristics of the probes sequences only and that cross-platform integration of datasets improves correlation with the RNA-seq data. We consider the results obtained in this paper contributive to the integrative analysis as a worthwhile alternative to the classical meta-analysis of the multiple gene expression datasets.


Author(s):  
Isaac Raplee ◽  
Alexei Evsikov ◽  
Caralina Marín de Evsikova

The rapid expansion of transcriptomics from increased affordability of next-generation sequencing (NGS) technologies generates rocketing amounts of gene expression data across biology and medicine, and notably in cancer research. Concomitantly, many bioinformatics tools were developed to streamline gene expression analysis and quantification. We tested the concordance of NGS RNA sequencing (RNA-seq) analysis outcomes between the two predominant programs for reads alignment, HISAT2 and STAR, and the two most popular programs for quantifying gene expression in NGS experiments, edgeR and DESeq2, using RNA-seq data from a series of breast cancer progression specimens, which include histologically confirmed normal, early neoplasia, ductal carcinoma in situ and infiltrating ductal carcinoma samples microdissected from formalin fixed, paraffin embedded (FFPE) breast tissue blocks. We identified significant differences in aligners&rsquo; performance: HISAT2 was prone to misalign reads to retrogene genomic loci, STAR generated more precise alignments, especially for early neoplasia samples. edgeR and DESeq2 produced similar lists of differentially expressed genes in stage comparisons, with edgeR producing more conservative, though shorter, lists of genes. Albeit, Gene Ontology (GO) enrichment analysis revealed no skewness in significant GO categories identified among differentially expressed genes by edgeR vs DESeq2. As transcriptome analysis of archived FFPE samples becomes a vanguard of precision medicine, identification and fine-tuning of bioinformatics tools becomes critical for clinical research. Our results indicate that STAR and edgeR are well-suited tools for differential gene expression analysis from FFPE samples.


2019 ◽  
Author(s):  
Shikha Roy ◽  
Rakesh Kumar ◽  
Vaibhav Mittal ◽  
Dinesh Gupta

AbstractEarly detection of breast cancer and its correct stage determination are important for prognosis and rendering appropriate personalized clinical treatment to breast cancer patients. However, despite considerable efforts and progress, there is a need to identify the specific genomic factors responsible for, or accompanying Invasive Ductal Carcinoma (IDC) progression stages, which can aid the determination of the correct cancer stages. We have developed two-class machine-learning classification models to differentiate the early and late stages of invasive ductal carcinoma. The prediction models are trained with RNA-seq gene expression profiles representing different IDC stages of 610 patients, obtained from The Cancer Genome Atlas (TCGA). Different supervised learning algorithms were trained and evaluated with an enriched model learning, facilitated by different feature selection methods. We also developed a machine-learning classifier trained on the same datasets with training sets reduced data corresponding to IDC driver genes. Based on these two classifiers, we have developed a web-server Duct-BRCA-CSP to predict early stage from late stages of IDC based on input RNA-seq gene expression profiles. The analysis conducted by us also enables deeper insights into the stage-dependent molecular events accompanying breast ductal carcinoma progression. The server is publicly available at http://bioinfo.icgeb.res.in/duct-BRCA-CSP.


Author(s):  
Abah Moses Owoicho ◽  
Joseph Luper Tsenum ◽  
Deborah Oganya Ogenyi ◽  
Ogu Stephen ◽  
Ujah Moses Okwori

The study seeks to compare the transcriptomic response of pancreatic and breast cancer cells to Anarcadic Acid and Olaparib via the preparation of Pancreatic Cancer Cell Culture which involves the seeding of PANC-1 cells in 6-well plates (5× 105 cells per well). 24hours later, cells will be untreated or treated by 5mM anacardic acid, 2mM olaparib or a combination of anacardic acid (5mM) and olaparib (2mM) for 48hours; after which Pancreatic Cancer Cell’s mRNA Library will be Prepared and Sequenced using the Illumina TruSeq™ RNA Sample Prep Kit v2. Samples will be sequenced on the Illumina HiSeq 2500, 2× 100bp paired-end reads, to a minimum depth of 30 million reads per sample. Thereafter, the Computational Analyses of Pancreatic Cancer RNA-seq Data will be done by obtaining a total of 240 million obtained reads of high quality clean tags which will then be mapped and annotated via human reference genome using Bioconductor package biomaRt (http://www.bioconductor.org) (Durinck et al 2009). Mapped reads with mapping quality 10 or more will be defined as uniquely mapped reads and used in the downstream analyses. Biological networks and pathways related to anachardic acid, olaparib and the combination will be analyzed with Ingenuity Pathway Analysis (IPA) software (Qiagen, CA, USA). The lists of all genes identified in gene expression analysis will be uploaded into the IPA software. For the analysis of networks and pathways, the cutoff values will be set at P≤ 1× 10−5 and FC≥ |2| respectively.Validation of RNA-seq Results by qRT-PCR via the expression of mRNA which will be determined in all 4 samples using Power SYBR® Green RNA-to-CT™ 1-Step Kit (Life Technologies, CA, USA). The Western blotting for the selected proteins will be performed, as described by Yue (Yue et al 2015). Thereafter, the Breast Cancer Cell Culture will be prepared and treated. Breast Cancer Cell’s mRNA RNA-seq will be prepared. The Truseq Stranded mRNA kit (Illumina) will be used to prepare mRNA libraries from 1 µg total RNA. Libraries will be confirmed on the Agilent 2100 Bioanalyzer and quantitated using the Illumina Library Quantification Kit, ABI Prism qPCR Mix from Kapa Biosystems and the ABI7900HT real-time PCR instrument. The differential Gene Expression will be analysed RNA-seq reads will be assembled according to the hg19.gtf annotation file (downloaded from ENSEMBL) (Flicek et al 2014) using Cufflinks (version 2.2.1) (Trapnell et al 2012). For each comparison, both cufflinks assemblies shall be merged, and the resulting merged gtf file serves as the transcript input for differential gene expression analysis in Gene Ontology and KEGG pathways. For three of the comparisons, a p-value cutoff ≤0.05 shall be used to determine differential expression. In-silico pathway and network analysis of differentially expressed genes shall be performed in MetaCore version 6.27 (GeneGO, Thomson Reuters, New York, N.Y.) (Bolser et al 2012). The results obtained will be statistically analysed. The results of RT-PCR shall be normalized to expression of GAPDH using the formula 2∆ CT. One-way ANOVA shall be used for comparing treatment with the combination of anacardic acid and olaparib to the untreated control. A P value less than 0.05 will be considered statistically significant.


Sign in / Sign up

Export Citation Format

Share Document