Oxford Nanopore MinION Direct RNA-Seq for Systems Biology

Long-read direct RNA sequencing developed by Oxford Nanopore Technologies (ONT) is quickly gaining popularity for transcriptome studies, while fast turnaround time and low cost make it an attractive instrument for clinical applications. There is a growing interest to utilize transcriptome data to unravel activated biological processes responsible for disease progression and response to therapies. This trend is of particular interest for precision medicine which aims at single-patient analysis. Here we evaluated whether gene abundances measured by MinION direct RNA sequencing are suited to produce robust estimates of pathway activation for single sample scoring methods. We performed multiple RNA-seq analyses for a single sample that originated from the HepG2 cell line, namely five ONT replicates, and three replicates using Illumina NovaSeq. Two pathway scoring methods were employed—ssGSEA and singscore. We estimated the ONT performance in terms of detected protein-coding genes and average pairwise correlation between pathway activation scores using an exhaustive computational scheme for all combinations of replicates. In brief, we found that at least two ONT replicates are required to obtain reproducible pathway scores for both algorithms. We hope that our findings may be of interest to researchers planning their ONT direct RNA-seq experiments.

Download Full-text

The assembled and annotated genome of the pigeon louse Columbicola columbae, a model ectoparasite

G3 Genes|Genome|Genetics ◽

10.1093/g3journal/jkab009 ◽

2021 ◽

Vol 11 (2) ◽

Author(s):

James G Baldwin-Brown ◽

Scott M Villa ◽

Anna I Vickrey ◽

Kevin P Johnson ◽

Sarah E Bush ◽

...

Keyword(s):

Low Cost ◽

Single Copy ◽

Odorant Receptors ◽

Rna Seq ◽

Pediculus Humanus ◽

Final Assembly ◽

Oxford Nanopore ◽

Genes Encoding ◽

Host Parasite ◽

G Protein Coupled

Abstract The pigeon louse Columbicola columbae is a longstanding and important model for studies of ectoparasitism and host-parasite coevolution. However, a deeper understanding of its evolution and capacity for rapid adaptation is limited by a lack of genomic resources. Here, we present a high-quality draft assembly of the C. columbae genome, produced using a combination of Oxford Nanopore, Illumina, and Hi-C technologies. The final assembly is 208 Mb in length, with 12 chromosome-size scaffolds representing 98.1% of the assembly. For gene model prediction, we used a novel clustering method (wavy_choose) for Oxford Nanopore RNA-seq reads to feed into the MAKER annotation pipeline. High recovery of conserved single-copy orthologs (BUSCOs) suggests that our assembly and annotation are both highly complete and highly accurate. Consistent with the results of the only other assembled louse genome, Pediculus humanus, we find that C. columbae has a relatively low density of repetitive elements, the majority of which are DNA transposons. Also similar to P. humanus, we find a reduced number of genes encoding opsins, G protein-coupled receptors, odorant receptors, insulin signaling pathway components, and detoxification proteins in the C. columbae genome, relative to other insects. We propose that such losses might characterize the genomes of obligate, permanent ectoparasites with predictable habitats, limited foraging complexity, and simple dietary regimes. The sequencing and analysis for this genome were relatively low cost, and took advantage of a new clustering technique for Oxford Nanopore RNAseq reads that will be useful to future genome projects.

Download Full-text

Emerging Roles of Estrogen-Regulated Enhancer and Long Non-Coding RNAs

International Journal of Molecular Sciences ◽

10.3390/ijms21103711 ◽

2020 ◽

Vol 21 (10) ◽

pp. 3711

Author(s):

Melina J. Sedano ◽

Alana L. Harrison ◽

Mina Zilaie ◽

Chandrima Das ◽

Ramesh Choudhari ◽

...

Keyword(s):

Rna Sequencing ◽

Expression Patterns ◽

Biological Significance ◽

Rna Seq ◽

Biological Functions ◽

Protein Coding ◽

Rna Molecules ◽

Non Coding Rna ◽

Genome Wide ◽

Non Coding Rnas

Genome-wide RNA sequencing has shown that only a small fraction of the human genome is transcribed into protein-coding mRNAs. While once thought to be “junk” DNA, recent findings indicate that the rest of the genome encodes many types of non-coding RNA molecules with a myriad of functions still being determined. Among the non-coding RNAs, long non-coding RNAs (lncRNA) and enhancer RNAs (eRNA) are found to be most copious. While their exact biological functions and mechanisms of action are currently unknown, technologies such as next-generation RNA sequencing (RNA-seq) and global nuclear run-on sequencing (GRO-seq) have begun deciphering their expression patterns and biological significance. In addition to their identification, it has been shown that the expression of long non-coding RNAs and enhancer RNAs can vary due to spatial, temporal, developmental, or hormonal variations. In this review, we explore newly reported information on estrogen-regulated eRNAs and lncRNAs and their associated biological functions to help outline their markedly prominent roles in estrogen-dependent signaling.

Download Full-text

Comparative analysis of RNA enrichment methods for preparation of Cryptococcus neoformans RNA sequencing libraries

10.1101/2021.03.01.433483 ◽

2021 ◽

Author(s):

Calla L. Telzrow ◽

Paul J. Zwack ◽

Shannon Esher Righi ◽

Fred S. Dietrich ◽

Cliburn Chan ◽

...

Keyword(s):

Cryptococcus Neoformans ◽

Rna Sequencing ◽

Rnase H ◽

Rrna Genes ◽

Rna Seq ◽

Protein Coding ◽

Expression Levels ◽

Non Coding Rna ◽

Long Non Coding Rna ◽

Enrichment Methods

ABSTRACTRibosomal RNA (rRNA) is the major RNA constituent of cells, therefore most RNA sequencing (RNA-Seq) experiments involve removal of rRNA. This process, called RNA enrichment, is done primarily to reduce cost: without rRNA removal, deeper sequencing would need to be performed to balance the sequencing reads wasted on rRNA. The ideal RNA enrichment method would remove all rRNA without affecting other RNA in the sample. We have tested the performance of three RNA enrichment methods on RNA isolated from Cryptococcus neoformans, a fungal pathogen of humans. We show that the RNase H depletion method unambiguously outperforms the commonly used Poly(A) isolation method: the RNase H method more efficiently depletes rRNA while more accurately recapitulating the expression levels of other RNA observed in an unenriched “gold standard”. The RNase H depletion method is also superior to the Ribo-Zero depletion method as measured by rRNA depletion efficiency and recapitulation of protein-coding gene expression levels, while the Ribo-Zero depletion method performs moderately better in preserving non-coding RNA (ncRNA). Finally, we have leveraged this dataset to identify novel long non-coding RNA (lncRNA) genes and to accurately map the C. neoformans mitochondrial rRNA genes.ARTICLE SUMMARYWe compare the efficacy of three different RNA enrichment methods for RNA-Seq in Cryptococcus neoformans: RNase H depletion, Ribo-Zero depletion, and Poly(A) isolation. We show that the RNase H depletion method, which is evaluated in C. neoformans samples for the first time here, is highly efficient and specific in removing rRNA. Additionally, using data generated through these analyses, we identify novel long non-coding RNA genes in C. neoformans. We conclude that RNase H depletion is an effective and reliable method for preparation of C. neoformans RNA-Seq libraries.

Download Full-text

Direct RNA Nanopore Sequencing of SARS-CoV-2 Extracted from Critical Material from Swabs

Life ◽

10.3390/life12010069 ◽

2022 ◽

Vol 12 (1) ◽

pp. 69

Author(s):

Davide Vacca ◽

Antonino Fiannaca ◽

Fabio Tramuto ◽

Valeria Cancila ◽

Laura La Paglia ◽

...

Keyword(s):

Rna Sequencing ◽

Systematic Errors ◽

N Gene ◽

Nanopore Sequencing ◽

Rna Seq ◽

Bioinformatics Pipeline ◽

Critical Material ◽

Oxford Nanopore ◽

Oropharyngeal Swab ◽

Generation Sequencing

In consideration of the increasing prevalence of COVID-19 cases in several countries and the resulting demand for unbiased sequencing approaches, we performed a direct RNA sequencing (direct RNA seq.) experiment using critical oropharyngeal swab samples collected from Italian patients infected with SARS-CoV-2 from the Palermo region in Sicily. Here, we identified the sequences SARS-CoV-2 directly in RNA extracted from critical samples using the Oxford Nanopore MinION technology without prior cDNA retrotranscription. Using an appropriate bioinformatics pipeline, we could identify mutations in the nucleocapsid (N) gene, which have been reported previously in studies conducted in other countries. In conclusion, to the best of our knowledge, the technique used in this study has not been used for SARS-CoV-2 detection previously owing to the difficulties in the extraction of RNA of sufficient quantity and quality from routine oropharyngeal swabs. Despite these limitations, this approach provides the advantages of true native RNA sequencing and does not include amplification steps that could introduce systematic errors. This study can provide novel information relevant to the current strategies adopted in SARS-CoV-2 next-generation sequencing.

Download Full-text

Comparative analysis of RNA enrichment methods for preparation of Cryptococcus neoformans RNA sequencing libraries

G3 Genes|Genome|Genetics ◽

10.1093/g3journal/jkab301 ◽

2021 ◽

Author(s):

Calla L Telzrow ◽

Paul J Zwack ◽

Shannon Esher Righi ◽

Fred S Dietrich ◽

Cliburn Chan ◽

...

Keyword(s):

Cryptococcus Neoformans ◽

Rna Sequencing ◽

Rnase H ◽

Rrna Genes ◽

Rna Seq ◽

Protein Coding ◽

Mitochondrial Rrna ◽

Non Coding Rna ◽

Rrna Depletion ◽

Enrichment Methods

Abstract RNA sequencing (RNA-Seq) experiments focused on gene expression involve removal of ribosomal RNA (rRNA) because it is the major RNA constituent of cells. This process, called RNA enrichment, is done primarily to reduce cost: without rRNA removal, deeper sequencing must be performed to compensate for the sequencing reads wasted on rRNA. The ideal RNA enrichment method removes all rRNA without affecting other RNA in the sample. We tested the performance of three RNA enrichment methods on RNA isolated from Cryptococcus neoformans, a fungal pathogen of humans. We find that the RNase H depletion method is more efficient in depleting rRNA and more specific in recapitulating non-rRNA levels present in unenriched controls than the commonly-used Poly(A) isolation method. The RNase H depletion method is also more effective than the Ribo-Zero depletion method as measured by rRNA depletion efficiency and recapitulation of protein-coding RNA levels present in unenriched controls, while the Ribo-Zero depletion method more closely recapitulates annotated non-coding RNA (ncRNA) levels. Finally, we leverage these data to accurately map the C. neoformans mitochondrial rRNA genes, and also demonstrate that RNA-Seq data generated with the RNase H and Ribo-Zero depletion methods can be used to explore novel C. neoformans long non-coding RNA genes.

Download Full-text

Nanopore sequencing of RNA and cDNA molecules in Escherichia coli

RNA ◽

10.1261/rna.078937.121 ◽

2021 ◽

pp. rna.078937.121

Author(s):

Felix Grünberger ◽

Sébastien Ferreira-Cerca ◽

Dina Grohmann

Keyword(s):

Escherichia Coli ◽

Rna Sequencing ◽

Single Molecule ◽

High Throughput Sequencing ◽

Model Organism ◽

Cost Effective ◽

Rna Seq ◽

Sequencing Platform ◽

Quantitative Measurements ◽

Oxford Nanopore

High-throughput sequencing dramatically changed our view of transcriptome architectures and allowed for ground-breaking discoveries in RNA biology. Recently, sequencing of full-length transcripts based on the single-molecule sequencing platform from Oxford Nanopore Technologies (ONT) was introduced and is widely employed to sequence eukaryotic and viral RNAs. However, experimental approaches implementing this technique for prokaryotic transcriptomes remain scarce. Here, we present an experimental and bioinformatic workflow for ONT RNA-seq in the bacterial model organism Escherichia coli, which can be applied to any microorganism. Our study highlights critical steps of library preparation and computational analysis and compares the results to gold standards in the field. Furthermore, we comprehensively evaluate the applicability and advantages of different ONT-based RNA sequencing protocols, including direct RNA, direct cDNA, and PCR-cDNA. We find that (PCR)-cDNA-seq offers improved yield and accuracy compared to direct RNA sequencing. Notably, (PCR)-cDNA-seq is suitable for quantitative measurements and can be readily used for simultaneous and accurate detection of transcript 5'and 3' boundaries, analysis of transcriptional units and transcriptional heterogeneity. In summary, based on our comprehensive study, we show that Nanopore RNA-seq to be a ready-to-use tool allowing rapid, cost-effective, and accurate annotation of multiple transcriptomic features. Thereby Nanopore RNA-seq holds the potential to become a valuable alternative method for RNA analysis in prokaryotes.

Download Full-text

Highly parallel direct RNA sequencing on an array of nanopores

10.1101/068809 ◽

2016 ◽

Cited By ~ 26

Author(s):

Daniel R. Garalde ◽

Elizabeth A. Snell ◽

Daniel Jachimowicz ◽

Andrew J. Heron ◽

Mark Bruce ◽

...

Keyword(s):

Rna Sequencing ◽

Reverse Transcription ◽

Biological Information ◽

Rna Seq ◽

Sequencing Technology ◽

Rna Sequences ◽

Oxford Nanopore ◽

The Status ◽

And Function ◽

Oxford Nanopore Technologies

AbstractRibonucleic acid sequencing can allow us to monitor the RNAs present in a sample. This enables us to detect the presence and nucleotide sequence of viruses, or to build a picture of how active transcriptional processes are changing – information that is useful for understanding the status and function of a sample. Oxford Nanopore Technologies’ sequencing technology is capable of electronically analysing a sample’s DNA directly, and in real-time. In this manuscript we demonstrate the ability of an array of nanopores to sequence RNA directly, and we apply it to a range of biological situations. Nanopore technology is the only available sequencing technology that can sequence RNA directly, rather than depending on reverse transcription and PCR. There are several potential advantages of this approach over other RNA-seq strategies, including the absence of amplification and reverse transcription biases, the ability to detect nucleotide analogues and the ability to generate full-length, strand-specific RNA sequences. Direct RNA sequencing is a completely new way of analysing the sequence of RNA samples and it will improve the ease and speed of RNA analysis, while yielding richer biological information.

Download Full-text

Caught between Two Genes: Accounting for Operonic Gene Structure Improves Prokaryotic RNA Sequencing Quantification

mSystems ◽

10.1128/msystems.01256-20 ◽

2021 ◽

Vol 6 (1) ◽

Author(s):

Taylor Reiter

Keyword(s):

Rna Sequencing ◽

Low Cost ◽

Software Tool ◽

Transcriptome Profiling ◽

Tool Development ◽

Rna Seq ◽

Computational Tools ◽

Biological Interpretation ◽

Wet Lab ◽

Polycistronic Transcripts

ABSTRACT RNA sequencing (RNA-seq) has matured into a reliable and low-cost assay for transcriptome profiling and has been deployed across a range of systems. The computational tool space for the analysis of RNA-seq data has kept pace with advances in sequencing. Yet tool development has largely centered around the human transcriptome. While eukaryotic and prokaryotic transcriptomes are similar, key differences in transcribed units limit the transfer of wet-lab and computational tools between the two domains. The article by M. Chung, R. S. Adkins, J. S. A. Mattick, K. R. Bradwell, et al. (mSystems 6:e00917-20, 2021, https://doi.org/10.1128/mSystems.00917-20), demonstrates that integrating prokaryote-specific strategies into existing RNA-seq analyses improves read quantification. Unlike in eukaryotes, polycistronic transcripts derived from operons lead to sequencing reads that span multiple neighboring genes. Chung et al. introduce FADU, a software tool that performs a correction for such reads and thereby improves read quantification and biological interpretation of prokaryotic RNA sequencing.

Download Full-text

Nanopore sequencing of RNA and cDNA molecules expands the transcriptomic toolbox in prokaryotes

10.1101/2021.06.14.448286 ◽

2021 ◽

Author(s):

Felix Gruenberger ◽

Sebastien Ferreira-Cerca ◽

Dina Grohmann

Keyword(s):

Rna Sequencing ◽

Single Molecule ◽

High Throughput Sequencing ◽

Model Organism ◽

Cost Effective ◽

Rna Seq ◽

Sequencing Platform ◽

Transcript Quantification ◽

Oxford Nanopore ◽

Bacterial Model

High-throughput sequencing dramatically changed our view of transcriptome architectures and allowed for ground-breaking discoveries in RNA biology. Recently, sequencing of full-length transcripts based on the single-molecule sequencing platform from Oxford Nanopore Technologies (ONT) was introduced and is widely employed to sequence eukaryotic and viral RNAs. However, experimental approaches implementing this technique for prokaryotic transcriptomes remain scarce. Here, we present an experimental and bioinformatic workflow for ONT RNA-seq in the bacterial model organism Escherichia coli, which can be applied to any microorganism. Our study highlights critical steps of library preparation and computational analysis and compares the results to gold standards in the field. Furthermore, we comprehensively evaluate the applicability and advantages of different ONT-based RNA sequencing protocols, including direct RNA, direct cDNA, and PCR-cDNA. We find that cDNA-seq offers improved yield and accuracy without bias in quantification compared to direct RNA sequencing. Notably, cDNA-seq can be readily used for simultaneous transcript quantification, accurate detection of transcript 5 ′ and 3′ boundaries, analysis of transcriptional units and transcriptional heterogeneity. In summary, we establish Nanopore RNA-seq to be a ready-to-use tool allowing rapid, cost-effective, and accurate annotation of multiple transcriptomic features thereby advancing it to become a standard method for RNA analysis in prokaryotes.

Download Full-text

Telomere-to-telomere genome assembly of asparaginase-producing Trichoderma simmonsii

BMC Genomics ◽

10.1186/s12864-021-08162-4 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Dawoon Chung ◽

Yong Min Kwon ◽

Youngik Yang

Keyword(s):

Rna Sequencing ◽

Draft Genome ◽

Sequencing Analysis ◽

Sequencing Data ◽

Terrestrial Plants ◽

Protein Coding ◽

Oxford Nanopore ◽

Wide Range ◽

The Family ◽

Encoding Genes

Abstract Background Trichoderma is a genus of fungi in the family Hypocreaceae and includes species known to produce enzymes with commercial use. They are largely found in soil and terrestrial plants. Recently, Trichoderma simmonsii isolated from decaying bark and decorticated wood was newly identified in the Harzianum clade of Trichoderma. Due to a wide range of applications in agriculture and other industries, genomes of at least 12 Trichoderma spp. have been studied. Moreover, antifungal and enzymatic activities have been extensively characterized in Trichoderma spp. However, the genomic information and bioactivities of T. simmonsii from a particular marine-derived isolate remain largely unknown. While we screened for asparaginase-producing fungi, we observed that T. simmonsii GH-Sj1 strain isolated from edible kelp produced asparaginase. In this study, we report a draft genome of T. simmonsii GH-Sj1 using Illumina and Oxford Nanopore technologies. Furthermore, to facilitate biotechnological applications of this species, RNA-sequencing was performed to elucidate the transcriptional profile of T. simmonsii GH-Sj1 in response to asparaginase-rich conditions. Results We generated ~ 14 Gb of sequencing data assembled in a ~ 40 Mb genome. The T. simmonsii GH-Sj1 genome consisted of seven telomere-to-telomere scaffolds with no sequencing gaps, where the N50 length was 6.4 Mb. The total number of protein-coding genes was 13,120, constituting ~ 99% of the genome. The genome harbored 176 tRNAs, which encode a full set of 20 amino acids. In addition, it had an rRNA repeat region consisting of seven repeats of the 18S-ITS1–5.8S-ITS2–26S cluster. The T. simmonsii genome also harbored 7 putative asparaginase-encoding genes with potential medical applications. Using RNA-sequencing analysis, we found that 3 genes among the 7 putative genes were significantly upregulated under asparaginase-rich conditions. Conclusions The genome and transcriptome of T. simmonsii GH-Sj1 established in the current work represent valuable resources for future comparative studies on fungal genomes and asparaginase production.

Download Full-text