A Novel, Improved, Application for the Normalization of RNA-seq Based Expression Data in Complex Polyploids.

Mapping Intimacies ◽

10.1101/2021.07.27.453947 ◽

2021 ◽

Author(s):

Dyfed Lloyd Evans

Keyword(s):

Transcript Level ◽

Cancer Tissue ◽

Expression Data ◽

Gene Length ◽

Rna Seq ◽

Qrt Pcr ◽

Transcript Length ◽

Gc Bias ◽

Microarray Analyses

Much of the work on the normalization of RNA-seq data has been performed on human tissue, notably cancer tissue. Little work has been done in plants, particularly polyploids and species with incomplete or no reference genomes. We present a novel implementation of GeTMM (Gene Length Corrected TMM) that accounts for GC bias and works at the transcript level. The algorithm also employs transcript length as a factor, allowing for incomplete and alternate transcripts. This significantly improves overall normalization. The GCGeTMM methodology also allows for simultaneous determination of differentially expressed transcripts (and, by extension, genes) and of stably expressed genes that can act as references for qRT-PCR and microarray analyses.
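As a rough illustration of the length-corrected TMM idea this abstract describes, the sketch below divides counts by transcript length (reads per kilobase) and then rescales each sample by a trimmed mean of log ratios against a reference sample. It is not the author's GCGeTMM implementation: it omits the GC-bias correction, uses an unweighted trimmed mean instead of the weighted TMM found in edgeR, and all function names (`rpk`, `scaling_factor`, `getmm_like`) and the toy data are hypothetical.

```python
import math

def rpk(counts, lengths_bp):
    """Reads per kilobase for a transcripts x samples matrix (list of rows)."""
    return [[c / (length / 1000.0) for c in row]
            for row, length in zip(counts, lengths_bp)]

def column(matrix, j):
    """Extract column j from a list-of-rows matrix."""
    return [row[j] for row in matrix]

def scaling_factor(sample, ref, trim=0.3):
    """Simplified TMM: unweighted trimmed mean of log2 ratios vs. a reference."""
    s_tot, r_tot = sum(sample), sum(ref)
    m = sorted(
        math.log2((s / s_tot) / (r / r_tot))
        for s, r in zip(sample, ref)
        if s > 0 and r > 0  # skip transcripts absent from either sample
    )
    if not m:
        return 1.0
    k = int(len(m) * trim)          # number of values trimmed from each tail
    core = m[k:len(m) - k] or m
    return 2.0 ** (sum(core) / len(core))

def getmm_like(counts, lengths_bp, ref_col=0):
    """Length-correct counts, then scale each sample to a per-million value."""
    x = rpk(counts, lengths_bp)
    n_samples = len(x[0])
    ref = column(x, ref_col)
    factors = [scaling_factor(column(x, j), ref) for j in range(n_samples)]
    # Rescale so the factors have a geometric mean of 1.
    gmean = math.exp(sum(math.log(f) for f in factors) / n_samples)
    factors = [f / gmean for f in factors]
    # Effective library size = RPK column total times its scaling factor.
    eff = [sum(column(x, j)) * factors[j] for j in range(n_samples)]
    norm = [[row[j] / eff[j] * 1e6 for j in range(n_samples)] for row in x]
    return norm, factors

# Toy data (assumed): sample 2 is sample 1 sequenced twice as deeply.
counts = [[10, 20], [20, 40], [30, 60], [40, 80]]
lengths = [1000, 2000, 1000, 500]
norm, factors = getmm_like(counts, lengths)
```

In this toy case the two samples differ only in sequencing depth, so both scaling factors come out as 1 and the normalized columns are identical.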


PlanExp: intuitive integration of complex RNA-seq datasets with planarian omics resources

Bioinformatics ◽

10.1093/bioinformatics/btz802 ◽

2019 ◽

Cited By ~ 3

Author(s):

S Castillo-Lara ◽

E Pascual-Carreras ◽

J F Abril

Keyword(s):

Single Cell ◽

Web Application ◽

Transcript Level ◽

Supplementary Information ◽

Expression Data ◽

Rna Seq ◽

Making Sense ◽

Sequencing Technologies ◽

Planarian Regeneration ◽

Current Sequence

Motivation: There is an increasing amount of transcriptomic and genomic data available for planarians with the advent of both traditional and single-cell RNA sequencing technologies. Therefore, exploring, visualizing and making sense of all these data in order to understand planarian regeneration and development can be challenging. Results: In this work, we present PlanExp, a web application to explore and visualize gene expression data from different RNA-seq experiments (both traditional and single-cell RNA-seq) for the planarian Schmidtea mediterranea. PlanExp provides tools for creating different interactive plots, such as heatmaps and scatterplots, and links them with the current sequence annotations at both the genome and the transcript level thanks to its integration with the PlanNET web application. PlanExp also provides a full gene/protein network editor, a prediction of genetic interactions from single-cell RNA-seq data, and a network expression mapper that will help researchers to close the gap between systems biology and planarian regeneration. Availability and implementation: PlanExp is freely available at https://compgen.bio.ub.edu/PlanNET/planexp. The source code is available at https://compgen.bio.ub.edu/PlanNET/downloads. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.


Statistical calibration of qRT-PCR, microarray and RNA-Seq gene expression data with measurement error models

The Annals of Applied Statistics ◽

10.1214/14-aoas721 ◽

2014 ◽

Vol 8 (2) ◽

pp. 1022-1044 ◽

Cited By ~ 2

Author(s):

Zhaonan Sun ◽

Thomas Kuczek ◽

Yu Zhu

Keyword(s):

Gene Expression ◽

Measurement Error ◽

Gene Expression Data ◽

Expression Data ◽

Rna Seq ◽

Error Models ◽

Qrt Pcr ◽

Measurement Error Models ◽

Pcr Microarray ◽

Statistical Calibration


Faculty of 1000 evaluation for Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences.

F1000 - Post-publication peer review of the biomedical literature ◽

10.3410/f.726079641.793513319 ◽

2016 ◽

Author(s):

Wolfgang Huber

Keyword(s):

Transcript Level ◽

Rna Seq ◽

Gene Level


Comprehensive Analysis of Transcriptomics and Metabolomics between the Heads and Tails of Angelica Sinensis: Genes Related to Phenylpropanoid Biosynthesis Pathway

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207323999201103221952 ◽

2020 ◽

Vol 23 ◽

Author(s):

Jie Yang ◽

Chi Zhang ◽

Wei-Hong Li ◽

Tian-Er Zhang ◽

Guang-Zhong Fan ◽

...

Keyword(s):

Ferulic Acid ◽

Metabolic Regulation ◽

Comprehensive Analysis ◽

Angelica Sinensis ◽

Rna Seq ◽

Targeted Metabolomics ◽

Kegg Pathways ◽

Qrt Pcr ◽

Phenylpropanoid Biosynthesis ◽

Combined Strategy

Background: In Traditional Chinese Medicine (TCM), the heads and tails of Angelica sinensis (Oliv.) Diels (AS) are used in treating different diseases due to their different pharmaceutical efficacies. The underlying mechanisms, however, have not been fully explored. Objective: Novel mechanisms responsible for the discrepant activities between AS heads and tails were explored by a combined strategy of transcriptomics and metabolomics. Method: Six pairs of the heads and tails of AS roots were collected in Min County, China. Total RNA and metabolites, used for RNA-seq and untargeted metabolomics analysis respectively, were isolated from each AS sample (0.1 g) with Trizol and methanol reagents. Subsequently, differentially expressed genes (DEGs) and discrepant pharmaceutical metabolites were identified by comparing AS heads and tails. Key DEGs and metabolites were quantified by qRT-PCR and targeted metabolomics experiments. Results: Comprehensive analysis of the transcriptomic and metabolomic results suggested that five KEGG pathways with significant differences included 57 DEGs. In particular, fourteen DEGs and six key metabolites were related to the metabolic regulation of the phenylpropanoid biosynthesis (PB) pathway. Results of qRT-PCR and targeted metabolomics indicated that higher expression levels of crucial genes in the PB pathway, such as PAL, CAD, COMT and peroxidase, in the tails of AS were positively correlated with levels of ferulic acid-related metabolites. The average content of ferulic acid in tails (569.58 ± 162.39 nmol/g) was higher than that in heads (168.73 ± 67.30 nmol/g) (P < 0.01); the same held for caffeic acid in tails (3.82 ± 0.88 nmol/g) vs heads (1.37 ± 0.41 nmol/g) (P < 0.01), and for cinnamic acid in tails (0.24 ± 0.09 nmol/g) vs heads (0.14 ± 0.02 nmol/g) (P < 0.05).
Conclusion: Our work demonstrated that overexpressed genes and accumulated metabolites derived from the PB pathway might be responsible for the discrepant pharmaceutical efficacies between AS heads and tails.


Quantitative Estimation of Oxidative Stress in Cancer Tissue Cells Through Gene Expression Data Analyses

Frontiers in Genetics ◽

10.3389/fgene.2020.00494 ◽

2020 ◽

Vol 11 ◽

Author(s):

Liyang Liu ◽

Haining Cui ◽

Ying Xu

Keyword(s):

Oxidative Stress ◽

Gene Expression ◽

Gene Expression Data ◽

Quantitative Estimation ◽

Cancer Tissue ◽

Expression Data ◽

Data Analyses ◽

Tissue Cells


nanotatoR: a tool for enhanced annotation of genomic structural variants

BMC Genomics ◽

10.1186/s12864-020-07182-w ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Surajit Bhattacharya ◽

Hayk Barseghyan ◽

Emmanuèle C. Délot ◽

Eric Vilain

Keyword(s):

De Novo ◽

Genome Mapping ◽

Gene List ◽

Sufficient Information ◽

Rna Seq ◽

Structural Variants ◽

De Novo Genome Assembly ◽

Pathogenic Variants ◽

Increased Sensitivity

Background: Whole genome sequencing is effective at identification of small variants, but because it is based on short reads, assessment of structural variants (SVs) is limited. The advent of Optical Genome Mapping (OGM), which utilizes long fluorescently labeled DNA molecules for de novo genome assembly and SV calling, has allowed for increased sensitivity and specificity in SV detection. However, compared to small variant annotation tools, OGM-based SV annotation software has seen little development, and currently available SV annotation tools do not provide sufficient information for determination of variant pathogenicity. Results: We developed an R-based package, nanotatoR, which provides comprehensive annotation as a tool for SV classification. nanotatoR uses both external (DGV; DECIPHER; Bionano Genomics BNDB) and internal (user-defined) databases to estimate SV frequency. Human genome reference GRCh37/38-based BED files are used to annotate SVs with overlapping, upstream, and downstream genes. Overlap percentages and distances for nearest genes are calculated and can be used for filtration. A primary gene list is extracted from public databases based on the patient’s phenotype and used to filter genes overlapping SVs, providing the analyst with an easy way to prioritize variants. If available, expression of overlapping or nearby genes of interest is extracted (e.g. from an RNA-Seq dataset, allowing the user to assess the effects of SVs on the transcriptome). Most quality-control filtration parameters are customizable by the user. The output is given in an Excel file format, subdivided into multiple sheets based on SV type and inheritance pattern (INDELs, inversions, translocations, de novo, etc.). nanotatoR passed all quality and run time criteria of Bioconductor, where it was accepted in the April 2019 release.
We evaluated nanotatoR’s annotation capabilities using publicly available reference datasets: the singleton sample NA12878, mapped with two types of enzyme labeling, and the NA24143 trio. nanotatoR was also able to accurately filter the known pathogenic variants in a cohort of patients with Duchenne Muscular Dystrophy for which we had previously demonstrated the diagnostic ability of OGM. Conclusions: The extensive annotation enables users to rapidly identify potential pathogenic SVs, a critical step toward use of OGM in the clinical setting.


Pan-cancer analysis of transcripts encoding novel open-reading frames (nORFs) and their potential biological functions

npj Genomic Medicine ◽

10.1038/s41525-020-00167-4 ◽

2021 ◽

Vol 6 (1) ◽

Cited By ~ 2

Author(s):

Chaitanya Erady ◽

Adam Boxall ◽

Shraddha Puntambekar ◽

N. Suhas Jagannathan ◽

Ruchi Chauhan ◽

...

Keyword(s):

Transcript Level ◽

Open Reading Frames ◽

The Cancer Genome Atlas ◽

Small Subset ◽

Cancer Tissue ◽

Post Translational Modifications ◽

Cancer Genome Atlas ◽

Systematic Identification ◽

Reading Frames

Uncharacterized and unannotated open-reading frames, which we refer to as novel open reading frames (nORFs), may sometimes encode peptides that remain unexplored for novel therapeutic opportunities. To our knowledge, no systematic identification and characterization of transcripts encoding nORFs or their translation products in cancer, or in any other physiological process has been performed. We use our curated nORFs database (nORFs.org), together with RNA-Seq data from The Cancer Genome Atlas (TCGA) and Genotype-Expression (GTEx) consortiums, to identify transcripts containing nORFs that are expressed frequently in cancer or matched normal tissue across 22 cancer types. We show nORFs are subject to extensive dysregulation at the transcript level in cancer tissue and that a small subset of nORFs are associated with overall patient survival, suggesting that nORFs may have prognostic value. We also show that nORF products can form protein-like structures with post-translational modifications. Finally, we perform in silico screening for inhibitors against nORF-encoded proteins that are disrupted in stomach and esophageal cancer, showing that they can potentially be targeted by inhibitors. We hope this work will guide and motivate future studies that perform in-depth characterization of nORF functions in cancer and other diseases.


Transcriptome Analysis of Responses to Dengue Virus 2 Infection in Aedes albopictus (Skuse) C6/36 Cells

Viruses ◽

10.3390/v13020343 ◽

2021 ◽

Vol 13 (2) ◽

pp. 343

Author(s):

Manjin Li ◽

Dan Xing ◽

Duo Su ◽

Di Wang ◽

Heting Gao ◽

...

Keyword(s):

Dengue Virus ◽

Aedes Albopictus ◽

Bioinformatics Analysis ◽

Interaction Mechanism ◽

Transcriptional Analysis ◽

Functional Verification ◽

Mosquito Vector ◽

Rna Seq ◽

Sequencing Data ◽

Qrt Pcr

Dengue virus (DENV), a member of the Flavivirus genus of the Flaviviridae family, can cause dengue fever (DF) and more serious diseases and thus imposes a heavy burden worldwide. As the main vector of DENV, mosquitoes are a serious hazard. After infection, they induce a complex host–pathogen interaction mechanism. Our goal is to further study the interaction mechanism of viruses in homologous, sensitive, and repeatable C6/36 cell vectors. Transcriptome sequencing (RNA-Seq) technology was applied to the host transcript profiles of C6/36 cells infected with DENV2. Then, bioinformatics analysis was used to identify significant differentially expressed genes (DEGs) and the associated biological processes. Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) was performed to verify the sequencing data. A total of 1239 DEGs were found by transcriptional analysis of DENV-infected versus uninfected Aedes albopictus C6/36 cells, of which 1133 were upregulated and 106 were downregulated. Further bioinformatics analysis showed that the upregulated DEGs were significantly enriched in signaling pathways such as MAPK, Hippo, FoxO, Wnt, mTOR, and Notch, as well as in metabolic pathways and cellular physiological processes such as autophagy, endocytosis, and apoptosis. Downregulated DEGs were mainly enriched in DNA replication, pyrimidine metabolism, and repair pathways, including BER, NER, and MMR. The concordance between the RNA-Seq and qRT-PCR data was very high (92.3%). The results of this study provide more information about DENV2 infection of C6/36 cells at the transcriptome level, laying a foundation for further research on mosquito vector–virus interactions. These data provide candidate antiviral genes that can be used for further functional verification in the future.


Meta-analysis of RNA-seq expression data across species, tissues and studies

Genome Biology ◽

10.1186/s13059-015-0853-4 ◽

2015 ◽

Vol 16 (1) ◽

Cited By ~ 64

Author(s):

Peter H. Sudmant ◽

Maria S. Alexis ◽

Christopher B. Burge

Keyword(s):

Meta Analysis ◽

Expression Data ◽

Rna Seq


Time Dependent Gene Expression Changes in the Liver of Mice Treated with Benzene

Biomarker Insights ◽

10.4137/bmi.s590 ◽

2008 ◽

Vol 3 ◽

pp. BMI.S590 ◽

Cited By ~ 3

Author(s):

Han-Jin Park ◽

Jung Hwa Oh ◽

Seokjoo Yoon ◽

S.V.S. Rana

Keyword(s):

Gene Expression ◽

Expression Profiles ◽

Gene Expression Profiles ◽

General Purpose ◽

Time Dependent ◽

Expression Data ◽

Benzene Exposure ◽

Microarray Analyses ◽

First Time ◽

Affymetrix Gene Chip

Benzene is used as a general purpose solvent. Benzene metabolism starts from phenol and ends with p-benzoquinone and o-benzoquinone. Liver injury induced by benzene still remains a toxicologic problem. Tumor-related genes and immune-responsive genes have been studied in patients suffering from benzene exposure. However, gene expression profiles and pathways related to its hepatotoxicity are not known. This study reports the results obtained in the liver of BALB/C mice (SLC, Inc., Japan) administered 0.05 ml/100 g body weight of 2% benzene for six days. Serum ALT, AST and ALP were determined using an automated analyzer (Fuji, Japan). Histopathological observations were made to support gene expression data. cDNA microarray analyses were performed using the Affymetrix Gene-chip system. After six days of benzene exposure, twenty-five genes were down-regulated whereas nineteen genes were up-regulated. These gene expression changes were found to be related to pathways of biotransformation, detoxification, apoptosis, oxidative stress and cell cycle. It has been shown for the first time that genes corresponding to circadian rhythms are affected by benzene. Results suggest that gene expression profiles might serve as potential biomarkers of hepatotoxicity during benzene exposure.
