scholarly journals SNP discovery and functional annotation in the Panax japonicus var. major transcriptome

RSC Advances ◽  
2019 ◽  
Vol 9 (37) ◽  
pp. 21513-21517
Author(s):  
Jian Li ◽  
Ding-Ping Bai ◽  
Xi-Feng Zhang

Due to the lack of a Panax japonicus var. major reference genome, we assembled a reference transcriptome from P. japonicus C. A. Mey transcriptome sequencing data, and 203 283 unigenes were obtained.

2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Nae-Chyun Chen ◽  
Brad Solomon ◽  
Taher Mun ◽  
Sheila Iyer ◽  
Ben Langmead

AbstractMost sequencing data analyses start by aligning sequencing reads to a linear reference genome, but failure to account for genetic variation leads to reference bias and confounding of results downstream. Other approaches replace the linear reference with structures like graphs that can include genetic variation, incurring major computational overhead. We propose the reference flow alignment method that uses multiple population reference genomes to improve alignment accuracy and reduce reference bias. Compared to the graph aligner vg, reference flow achieves a similar level of accuracy and bias avoidance but with 14% of the memory footprint and 5.5 times the speed.


2017 ◽  
Vol 18 (6) ◽  
pp. 1110 ◽  
Author(s):  
Wolfgang Kaisers ◽  
Johannes Ptok ◽  
Holger Schwender ◽  
Heiner Schaal

2018 ◽  
Vol 35 (15) ◽  
pp. 2654-2656 ◽  
Author(s):  
Guoli Ji ◽  
Wenbin Ye ◽  
Yaru Su ◽  
Moliang Chen ◽  
Guangzao Huang ◽  
...  

Abstract Summary Alternative splicing (AS) is a well-established mechanism for increasing transcriptome and proteome diversity, however, detecting AS events and distinguishing among AS types in organisms without available reference genomes remains challenging. We developed a de novo approach called AStrap for AS analysis without using a reference genome. AStrap identifies AS events by extensive pair-wise alignments of transcript sequences and predicts AS types by a machine-learning model integrating more than 500 assembled features. We evaluated AStrap using collected AS events from reference genomes of rice and human as well as single-molecule real-time sequencing data from Amborella trichopoda. Results show that AStrap can identify much more AS events with comparable or higher accuracy than the competing method. AStrap also possesses a unique feature of predicting AS types, which achieves an overall accuracy of ∼0.87 for different species. Extensive evaluation of AStrap using different parameters, sample sizes and machine-learning models on different species also demonstrates the robustness and flexibility of AStrap. AStrap could be a valuable addition to the community for the study of AS in non-model organisms with limited genetic resources. Availability and implementation AStrap is available for download at https://github.com/BMILAB/AStrap. Supplementary information Supplementary data are available at Bioinformatics online.


2015 ◽  
Vol 14 ◽  
pp. CIN.S26470 ◽  
Author(s):  
Richard P. Finney ◽  
Qing-Rong Chen ◽  
Cu V. Nguyen ◽  
Chih Hao Hsu ◽  
Chunhua Yan ◽  
...  

The name Alview is a contraction of the term Alignment Viewer. Alview is a compiled to native architecture software tool for visualizing the alignment of sequencing data. Inputs are files of short-read sequences aligned to a reference genome in the SAM/BAM format and files containing reference genome data. Outputs are visualizations of these aligned short reads. Alview is written in portable C with optional graphical user interface (GUI) code written in C, C++, and Objective-C. The application can run in three different ways: as a web server, as a command line tool, or as a native, GUI program. Alview is compatible with Microsoft Windows, Linux, and Apple OS X. It is available as a web demo at https://cgwb.nci.nih.gov/cgi-bin/alview . The source code and Windows/Mac/Linux executables are available via https://github.com/NCIP/alview .


2021 ◽  
Author(s):  
Myung-Shin Kim ◽  
Taeyoung Lee ◽  
Jeonghun Baek ◽  
Ji Hong Kim ◽  
Changhoon Kim ◽  
...  

AbstractMassive resequencing efforts have been undertaken to catalog allelic variants in major crop species including soybean, but the scope of the information for genetic variation often depends on short sequence reads mapped to the extant reference genome. Additional de novo assembled genome sequences provide a unique opportunity to explore a dispensable genome fraction in the pan-genome of a species. Here, we report the de novo assembly and annotation of Hwangkeum, a popular soybean cultivar in Korea. The assembly was constructed using PromethION nanopore sequencing data and two genetic maps, and was then error-corrected using Illumina short-reads and PacBio SMRT reads. The 933.12 Mb assembly was annotated 79,870 transcripts for 58,550 genes using RNA-Seq data and the public soybean annotation set. Comparison of the Hwangkeum assembly with the Williams 82 soybean reference genome sequence revealed 1.8 million single-nucleotide polymorphisms, 0.5 million indels, and 25 thousand putative structural variants. However, there was no natural megabase-scale chromosomal rearrangement. Incidentally, by adding two novel groups, we found that soybean contains four clearly separated groups of centromeric satellite repeats. Analyses of satellite repeats and gene content suggested that the Hwangkeum assembly is a high-quality assembly. This was further supported by comparison of the marker arrangement of anthocyanin biosynthesis genes and of gene arrangement at the Rsv3 locus. Therefore, the results indicate that the de novo assembly of Hwangkeum is a valuable additional reference genome resource for characterizing traits for the improvement of this important crop species.


2022 ◽  
Vol 9 ◽  
Author(s):  
Han Wang ◽  
Bowen Cui ◽  
Huiying Sun ◽  
Fang Zhang ◽  
Jianan Rao ◽  
...  

GATA2 is a transcription factor that is critical for the generation and survival of hematopoietic stem cells (HSCs). It also plays an important role in the regulation of myeloid differentiation. Accordingly, GATA2 expression is restricted to HSCs and hematopoietic progenitors as well as early erythroid cells and megakaryocytic cells. Here we identified aberrant GATA2 expression in B-cell acute lymphoblastic leukemia (B-ALL) by analyzing transcriptome sequencing data obtained from St. Jude Cloud. Differentially expressed genes upon GATA2 activation showed significantly myeloid-like transcription signature. Further analysis identified several tumor-associated genes as targets of GATA2 activation including BAG3 and EPOR. In addition, the correlation between KMT2A-USP2 fusion and GATA2 activation not only indicates a potential trans-activating mechanism of GATA2 but also suggests that GATA2 is a target of KMT2A-USP2. Furthermore, by integrating whole-genome and transcriptome sequencing data, we showed that GATA2 is also cis activated. A somatic focal deletion located in the GATA2 neighborhood that disrupts the boundaries of topologically associating domains was identified in one B-ALL patient with GATA2 activation. These evidences support the hypothesis that GATA2 could be involved in leukemogenesis of B-ALL and can be transcriptionally activated through multiple mechanisms. The findings of aberrant activation of GATA2 and its molecular function extend our understanding of transcriptional factor dysregulation in B-ALL.


2020 ◽  
Vol 21 (22) ◽  
pp. 8640
Author(s):  
Kijeong Lee ◽  
Mi-Ryung Han ◽  
Ji Woo Yeon ◽  
Byoungjae Kim ◽  
Tae Hoon Kim

Dendritic cells (DCs) play critical roles in atopic diseases, orchestrating both innate and adaptive immune systems. Nevertheless, limited information is available regarding the mechanism through which DCs induce hyperresponsiveness in patients with allergies. This study aims to reveal novel genetic alterations and future therapeutic target molecules in the DCs from patients with allergies using whole transcriptome sequencing. Transcriptome sequencing of human BDCA-3+/CD11c+ DCs sorted from peripheral blood monocytes obtained from six patients with allergies and four healthy controls was conducted. Gene expression profile data were analyzed, and an ingenuity pathway analysis was performed. A total of 1638 differentially expressed genes were identified at p-values < 0.05, with 11 genes showing a log2-fold change ≥1.5. The top gene network was associated with cell death/survival and organismal injury/abnormality. In validation experiments, amphiregulin (AREG) showed consistent results with transcriptome sequencing data, with increased mRNA expression in THP-1-derived DCs after Der p 1 stimulation and higher protein expression in myeloid DCs obtained from patients with allergies. This study suggests an alteration in the expression of DCs in patients with allergies, proposing related altered functions and intracellular mechanisms. Notably, AREG might play a crucial role in DCs by inducing the Th2 immune response.


Sign in / Sign up

Export Citation Format

Share Document