Yet another de novo genome assembler

AbstractSummaryPossibility to generate large RNA-seq datasets has led to development of various reference-based and de novo transcriptome assemblers with their own strengths and limitations. While reference-based tools are widely used in various transcriptomic studies, their application is limited to the model organisms with finished and annotated genomes. De novo transcriptome reconstruction from short reads remains an open challenging problem, which is complicated by the varying expression levels across different genes, alternative splicing and paralogous genes. In this paper we describe a novel transcriptome assembler called rnaSPAdes, which is developed on top of SPAdes genome assembler and explores surprising computational parallels between assembly of transcriptomes and single-cell genomes. We also present quality assessment reports for rnaSPAdes assemblies, compare it with modern transcriptome assembly tools using several evaluation approaches on various RNA-Seq datasets, and briefly highlight strong and weak points of different assemblers.Availability and implementationrnaSPAdes is implemented in C++ and Python and is freely available at cab.spbu.ru/software/rnaspades/.

Download Full-text

SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale

2016 45th International Conference on Parallel Processing (ICPP) ◽

10.1109/icpp.2016.29 ◽

2016 ◽

Cited By ~ 12

Author(s):

Jintao Meng ◽

Sangmin Seo ◽

Pavan Balaji ◽

Yanjie Wei ◽

Bingqiang Wang ◽

...

Keyword(s):

De Novo ◽

Extreme Scale ◽

Genome Assembler

Download Full-text

Raven: a de novo genome assembler for long reads

10.1101/2020.08.07.242461 ◽

2020 ◽

Cited By ~ 5

Author(s):

Robert Vaser ◽

Mile Šikić

Keyword(s):

Human Genome ◽

Genome Assembly ◽

De Novo ◽

De Novo Genome Assembly ◽

New Methods ◽

Long Reads ◽

Long Read ◽

Comparable Accuracy ◽

Genome Assembler ◽

Genome Dataset

We present new methods for the improvement of long-read de novo genome assembly incorporated into a straightforward tool called Raven (https://github.com/lbcb-sci/raven). Compared with other assemblers, Raven is one of two fastest, it reconstructs the sequenced genome in the least amount of fragments, has better or comparable accuracy, and maintains similar performance for various genomes. Raven takes 500 CPU hours to assemble a 44x human genome dataset in only 259 fragments.

Download Full-text

Spaler: Spark and GraphX based de novo genome assembler

2015 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata.2015.7363853 ◽

2015 ◽

Cited By ~ 12

Author(s):

Anas Abu-Doleh ◽

Umit V. Catalyurek

Keyword(s):

De Novo ◽

Genome Assembler

Download Full-text

RepAHR: an improved approach for de novo repeat identification by assembly of the high-frequency reads

BMC Bioinformatics ◽

10.1186/s12859-020-03779-w ◽

2020 ◽

Vol 21 (1) ◽

Author(s):

Xingyu Liao ◽

Xin Gao ◽

Xiankai Zhang ◽

Fang-Xiang Wu ◽

Jianxin Wang

Keyword(s):

High Frequency ◽

Structural Variation ◽

De Novo ◽

Repetitive Sequences ◽

Data Sets ◽

Sequence Coverage ◽

Coverage Ratio ◽

Next Generation Sequencing Ngs ◽

Genome Assembler ◽

Generation Sequencing

Abstract Background Repetitive sequences account for a large proportion of eukaryotes genomes. Identification of repetitive sequences plays a significant role in many applications, such as structural variation detection and genome assembly. Many existing de novo repeat identification pipelines or tools make use of assembly of the high-frequency k-mers to obtain repeats. However, a certain degree of sequence coverage is required for assemblers to get the desired assemblies. On the other hand, assemblers cut the reads into shorter k-mers for assembly, which may destroy the structure of the repetitive regions. For the above reasons, it is difficult to obtain complete and accurate repetitive regions in the genome by using existing tools. Results In this study, we present a new method called RepAHR for de novo repeat identification by assembly of the high-frequency reads. Firstly, RepAHR scans next-generation sequencing (NGS) reads to find the high-frequency k-mers. Secondly, RepAHR filters the high-frequency reads from whole NGS reads according to certain rules based on the high-frequency k-mer. Finally, the high-frequency reads are assembled to generate repeats by using SPAdes, which is considered as an outstanding genome assembler with NGS sequences. Conlusions We test RepAHR on five data sets, and the experimental results show that RepAHR outperforms RepARK and REPdenovo for detecting repeats in terms of N50, reference alignment ratio, coverage ratio of reference, mask ratio of Repbase and some other metrics.

Download Full-text

Integrative Meta-Assembly Pipeline (IMAP): Chromosome-level genome assembler combining multiple de novo assemblies

PLoS ONE ◽

10.1371/journal.pone.0221858 ◽

2019 ◽

Vol 14 (8) ◽

pp. e0221858 ◽

Cited By ~ 2

Author(s):

Giltae Song ◽

Jongin Lee ◽

Juyeon Kim ◽

Seokwoo Kang ◽

Hoyong Lee ◽

...

Keyword(s):

De Novo ◽

Assembly Pipeline ◽

Genome Assembler ◽

Chromosome Level

Download Full-text

rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data

GigaScience ◽

10.1093/gigascience/giz100 ◽

2019 ◽

Vol 8 (9) ◽

Cited By ~ 52

Author(s):

Elena Bushmanova ◽

Dmitry Antipov ◽

Alla Lapidus ◽

Andrey D Prjibelski

Keyword(s):

Rna Sequencing ◽

De Novo ◽

Transcriptome Assembly ◽

The Novel ◽

Rna Seq ◽

De Novo Transcriptome ◽

Weak Points ◽

Transcriptome Reconstruction ◽

Evaluation Approaches ◽

Genome Assembler

Abstract Background The possibility of generating large RNA-sequencing datasets has led to development of various reference-based and de novo transcriptome assemblers with their own strengths and limitations. While reference-based tools are widely used in various transcriptomic studies, their application is limited to the organisms with finished and well-annotated genomes. De novo transcriptome reconstruction from short reads remains an open challenging problem, which is complicated by the varying expression levels across different genes, alternative splicing, and paralogous genes. Results Herein we describe the novel transcriptome assembler rnaSPAdes, which has been developed on top of the SPAdes genome assembler and explores computational parallels between assembly of transcriptomes and single-cell genomes. We also present quality assessment reports for rnaSPAdes assemblies, compare it with modern transcriptome assembly tools using several evaluation approaches on various RNA-sequencing datasets, and briefly highlight strong and weak points of different assemblers. Conclusions Based on the performed comparison between different assembly methods, we infer that it is not possible to detect the absolute leader according to all quality metrics and all used datasets. However, rnaSPAdes typically outperforms other assemblers by such important property as the number of assembled genes and isoforms, and at the same time has higher accuracy statistics on average comparing to the closest competitors.

Download Full-text

The Morphology of the Rat Kidney Following Folic Acid Administration

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100067996 ◽

1970 ◽

Vol 28 ◽

pp. 200-201

Author(s):

Aline Byrnes ◽

Elsa E. Ramos ◽

Minoru Suzuki ◽

E.D. Mayfield

Keyword(s):

Folic Acid ◽

De Novo ◽

Specific Activity ◽

Rat Kidney ◽

Present Report ◽

Intracellular Localization ◽

Male Rats ◽

Renal Hypertrophy ◽

Electron Microscopic ◽

Biochemical Alterations

Renal hypertrophy was induced in 100 g male rats by the injection of 250 mg folic acid (FA) dissolved in 0.3 M NaHCO3/kg body weight (i.v.). Preliminary studies of the biochemical alterations in ribonucleic acid (RNA) metabolism of the renal tissue have been reported recently (1). They are: RNA content and concentration, orotic acid-c14 incorporation into RNA and acid soluble nucleotide pool, intracellular localization of the newly synthesized RNA, and the specific activity of enzymes of the de novo pyrimidine biosynthesis pathway. The present report describes the light and electron microscopic observations in these animals. For light microscopy, kidney slices were fixed in formalin, embedded, sectioned, and stained with H & E and PAS.

Download Full-text

Early ossicle growth in the regenerating disc of a brittle star

Proceedings, annual meeting, Electron Microscopy Society of America ◽

10.1017/s0424820100169559 ◽

1994 ◽

Vol 52 ◽

pp. 364-365

Author(s):

M. Shlepr ◽

R. L. Turner

Keyword(s):

Cell Membrane ◽

Light Microscopy ◽

De Novo ◽

Current Model ◽

Syncytium Formation ◽

Brittle Star ◽

Intracellular Location ◽

Limited Volume ◽

Tissue Calcification

Calcification in the echinoderms occurs within a limited-volume cavity enclosed by cytoplasmic extensions of the mineral depositing cells, the sclerocytes. The current model of this process maintains that the sheath formed from these cytoplasmic extensions is syncytial. Prior studies indicate that syncytium formation might be dependent on sclerocyte density and not required for calcification. This model further envisions that ossicles formed de novo nucleate and grow intracellularly until the ossicle effectively outgrows the vacuole. Continued ossicle growth occurs within the sheath but external to the cell membrane. The initial intracellular location has been confirmed only for elements of the echinoid tooth.The regenerating aboral disc integument of ophiophragmus filograneus was used to test the current echinoderm calcification model. This tissue is free of calcite fragments, thus avoiding questions of cellular engulfment, and ossicles are formed de novo. The tissue calcification pattern was followed by light microscopy in both living and fixed preparations.

Download Full-text

Yet another de novo genome assembler

A de novo genome assembler based on MapReduce and bi-directed de Bruijn graph

rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data

SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Extreme Scale

Raven: a de novo genome assembler for long reads

Spaler: Spark and GraphX based de novo genome assembler

RepAHR: an improved approach for de novo repeat identification by assembly of the high-frequency reads

Integrative Meta-Assembly Pipeline (IMAP): Chromosome-level genome assembler combining multiple de novo assemblies

rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data

The Morphology of the Rat Kidney Following Folic Acid Administration

Early ossicle growth in the regenerating disc of a brittle star

Export Citation Format