DNA Research
Latest Publications


TOTAL DOCUMENTS

1156
(FIVE YEARS 91)

H-INDEX

94
(FIVE YEARS 8)

Published By Oxford University Press

1756-1663, 1340-2838

DNA Research ◽  
2021 ◽  
Vol 28 (6) ◽  
Author(s):  
Yuya Kiguchi ◽  
Suguru Nishijima ◽  
Naveen Kumar ◽  
Masahira Hattori ◽  
Wataru Suda
Keyword(s):  

DNA Research ◽  
2021 ◽  
Author(s):  
Yuya Kiguchi ◽  
Suguru Nishijima ◽  
Naveen Kumar ◽  
Masahira Hattori ◽  
Wataru Suda

Abstract The human gut bacteriophage community (phageome) plays an important role in the host’s health and disease; however, the entire structure is poorly understood, partly owing to the generation of many incomplete genomes in conventional short-read metagenomics. Here, we show long-read metagenomics of amplified DNA of low-biomass phageomes with multiple displacement amplification (MDA), involving the development of a novel bioinformatics tool, SACRA, that efficiently preprocessed numerous chimeric reads generated through MDA. Using five samples, SACRA markedly reduced the average chimera ratio from 72% to 1.5% in PacBio reads with an average length of 1.8-kb. De novo assembly of chimera-less PacBio long reads reconstructed contigs of ≥ 5-kb with an average proportion of 27%, which was 1% in contigs from MiSeq short reads, thereby dramatically improving contig length and genome completeness. Comparison of PacBio and MiSeq contigs found MiSeq contig fragmentations frequently near local repeats and hypervariable regions in the phage genomes, and those caused by multiple homologous phage genomes coexisting in the community. We also developed a reference-independent method to assess the completeness of the linear phage genomes. Overall, we established a SACRA-coupled long-read metagenomics robust to highly diverse gut phageomes, identifying high-quality circular and linear phage genomes with adequate sequence quantity.


DNA Research ◽  
2021 ◽  
Vol 28 (5) ◽  
Author(s):  
Ding Huang ◽  
Ruhong Ming ◽  
Shiqiang Xu ◽  
Jihua Wang ◽  
Shaochang Yao ◽  
...  

Abstract Gynostemma pentaphyllum (Thunb.) Makino is an economically valuable medicinal plant belonging to the Cucurbitaceae family that produces the bioactive compound gypenoside. Despite several transcriptomes having been generated for G. pentaphyllum, a reference genome is still unavailable, which has limited the understanding of the gypenoside biosynthesis and regulatory mechanism. Here, we report a high-quality G. pentaphyllum genome with a total length of 582 Mb comprising 1,232 contigs and a scaffold N50 of 50.78 Mb. The G. pentaphyllum genome comprised 59.14% repetitive sequences and 25,285 protein-coding genes. Comparative genome analysis revealed that G. pentaphyllum was related to Siraitia grosvenorii, with an estimated divergence time dating to the Paleogene (∼48 million years ago). By combining transcriptome data from seven tissues, we reconstructed the gypenoside biosynthetic pathway and potential regulatory network using tissue-specific gene co-expression network analysis. Four UDP-glucuronosyltransferases (UGTs), belonging to the UGT85 subfamily and forming a gene cluster, were involved in catalyzing glycosylation in leaf-specific gypenoside biosynthesis. Furthermore, candidate biosynthetic genes and transcription factors involved in the gypenoside regulatory network were identified. The genetic information obtained in this study provides insights into gypenoside biosynthesis and lays the foundation for further exploration of the gypenoside regulatory mechanism.


DNA Research ◽  
2021 ◽  
Vol 28 (5) ◽  
Author(s):  
Fengqi Zang ◽  
Yan Ma ◽  
Xiaolong Tu ◽  
Ping Huang ◽  
Qichao Wu ◽  
...  

Abstract Rosa rugosa is an important shrub with economic, ecological, and pharmaceutical value. A high-quality chromosome-scale genome for R. rugosa sequences was assembled using PacBio and Hi-C technologies. The final assembly genome sequences size was about 407.1 Mb, the contig N50 size was 2.85 Mb, and the scaffold N50 size was 56.6 Mb. More than 98% of the assembled genome sequences were anchored to seven pseudochromosomes (402.9 Mb). The genome contained 37,512 protein-coding genes, with 37,016 genes (98.68%) that were functionally annotated, and 206.67 Mb (50.76%) of the assembled sequences are repetitive sequences. Phylogenetic analyses indicated that R. rugosa diverged from Rosa chinensis ∼6.6 million years ago, and no lineage-specific whole-genome duplication event occurred after divergence from R. chinensis. Chromosome synteny analysis demonstrated highly conserved synteny between R. rugosa and R. chinensis, between R. rugosa and Prunus persica as well. Comparative genome and transcriptome analysis revealed genes related to colour, scent, and environment adaptation. The chromosome-level reference genome provides important genomic resources for molecular-assisted breeding and horticultural comparative genomics research.


DNA Research ◽  
2021 ◽  
Vol 28 (5) ◽  
Author(s):  
Chengyi Tang

Abstract Increasing genome data are coming out. Genome size estimation plays an essential role in guiding genome assembly. Several months ago, other researchers were the first to publish a draft genome of the red gromwell (i.e. Lithospermum erythrorhizon). However, we considered that the genome size they estimated and assembled was incorrect. This study meticulously estimated the L. erythrorhizon genome size to should be ∼708.74 Mb and further provided a reliable genome version (size ≈ 693.34 Mb; contigN50 length ≈ 238.08 Kb) to support our objection. Furthermore, according to our genome, we identified a gene family of the alkannin/shikonin O-acyltransferases (i.e. AAT/SAT) that catalysed enantiomer-specific acylations in the alkannin/shikonin biosynthesis (a characteristic metabolic pathway in L. erythrorhizon’s roots) and further explored its evolutionary process. The results indicated that the existing AAT/SAT were not generated from only one round of gene duplication but three rounds; after different rounds of gene duplication, the existing AAT/SAT and their recent ancestors were under positive selection at different amino acid sites. These suggested that a combined power from gene duplication plus positive selection plausibly propelled AAT/SAT’s functional differentiation in evolution.


DNA Research ◽  
2021 ◽  
Author(s):  
Renqiang Yuan ◽  
Jiaman Zhang ◽  
Yujie Wang ◽  
Xingxing Zhu ◽  
Silu Hu ◽  
...  

Abstract Myofibers (primary and secondary myofiber) are the basic structure of muscle and the determinant of muscle mass. To explore the skeletal muscle developmental processes from primary myofibers to secondary myofibers, we conducted an integrative 3 D structure of genome and transcriptomic characterization of longissimus dorsi muscle (LDM) of pig from primary myofiber formation stage [embryonic day 35 (E35)] to secondary myofiber formation stage (E80). In the hierarchical genomic structure, we found that 11.43% of genome switched compartment A/B status, 14.53% of topologically associating domains (TADs) are changed intra-domain interactions (D-scores) and 3,166 genes with differential promoter-enhancer interactions (PEIs) and (or) enhancer activity from E35 to E80. The alterations of genome architecture were found to affect expression of genes that play significant roles in neuromuscular junction (NMJ), embryonic morphogenesis, and skeletal muscle development or metabolism, typically NEFL, MuSK, SLN, Mef2D and GCK. Significantly, Sox6 and MATN2 play important roles in the process of primary to secondary myofibers form and increase the RPS and genes expression in it. In brief, we reveal the genomic reorganization from E35 to E80 and construct genome-wide high-resolution interaction maps that provide a resource for studying long-range control of gene expression from E35 to E80.


DNA Research ◽  
2021 ◽  
Author(s):  
Kenta Shirasawa ◽  
Akihiro Itai ◽  
Sachiko Isobe

Abstract Aim We analyzed the genome sequence of a Japanese pear (Pyrus pyrifolia) to facilitate its genetics and genomics as well as breeding programs, in which a variety ′Nijisseiki′ with superior flesh texture has been used as a parent for most Japanese pear cultivars. Methods and results De novo assembly of long sequence reads covered 136× of the Japanese pear genome and generated 503.9 Mb contigs consisting of 114 sequences with an N50 value of 7.6 Mb. Contigs were assigned to Japanese pear genetic maps to establish 17 chromosome-scale sequences. In total, 44,876 high-confidence protein-encoding genes were predicted, 84.3% of which were supported by predicted genes and transcriptome data from Japanese pear relatives. As expected, evidence of genome-wide duplication was observed, consistent with related species. Conclusion and Perspective This is the first chromosome-scale genome sequence analysis reported for Japanese pear, and this resource will support breeding programs and provide new insights into the physiology and evolutionary history of Japanese pear.


DNA Research ◽  
2021 ◽  
Vol 28 (1) ◽  
Author(s):  
Murukarthick Jayakodi ◽  
Mona Schreiber ◽  
Nils Stein ◽  
Martin Mascher

Abstract Pan-genomic studies aim at representing the entire sequence diversity within a species to provide useful resources for evolutionary studies, functional genomics and breeding of cultivated plants. Cost reductions in high-throughput sequencing and advances in sequence assembly algorithms have made it possible to create multiple reference genomes along with a catalogue of all forms of genetic variations in plant species with large and complex or polyploid genomes. In this review, we summarize the current approaches to building pan-genomes as an in silico representation of plant sequence diversity and outline relevant methods for their effective utilization in linking structural with phenotypic variation. We propose as future research avenues (i) transcriptomic and epigenomic studies across multiple reference genomes and (ii) the development of user-friendly and feature-rich pan-genome browsers.


Sign in / Sign up

Export Citation Format

Share Document