scholarly journals Genome sequence of Hydrangea macrophylla and its application in analysis of the double flower phenotype

DNA Research ◽  
2020 ◽  
Author(s):  
Kenji Nashima ◽  
Kenta Shirasawa ◽  
Andrea Ghelfi ◽  
Hideki Hirakawa ◽  
Sachiko Isobe ◽  
...  

Abstract Owing to its high ornamental value, the double flower phenotype of hydrangea (Hydrangea macrophylla) is one of its most important traits. In this study, genome sequence information was obtained to explore effective DNA markers and the causative genes for double flower production in hydrangea. Single-molecule real-time sequencing data followed by a Hi-C analysis were employed. Two haplotype-phased sequences were obtained from the heterozygous genome of hydrangea. One assembly consisted of 3,779 scaffolds (2.256 Gb in length and N50 of 1.5 Mb), the other also contained 3,779 scaffolds (2.227 Gb in length, and N50 of 1.4 Mb). A total of 36,930 genes were predicted in the sequences, of which 32,205 and 32,222 were found in each haplotype. A pair of 18 pseudomolecules was constructed along with a high-density single-nucleotide polymorphism (SNP) genetic linkage map. Using the genome sequence data, and two F2 populations, the SNPs linked to double flower loci (djo and dsu) were discovered. DNA markers linked to djo and dsu were developed, and these could distinguish the recessive double flower allele for each locus, respectively. The LEAFY gene is a very likely candidate as the causative gene for dsu, since frameshift was specifically observed in the double flower accession with dsu.

2020 ◽  
Author(s):  
K Nashima ◽  
K Shirasawa ◽  
A Ghelfi ◽  
H Hirakawa ◽  
S Isobe ◽  
...  

AbstractOwing to its high ornamental value, the double flower phenotype of hydrangea (Hydrangea macrophylla) is one of its most important traits. In this study, genome sequence information was obtained to explore effective DNA markers and the causative genes for double flower production in hydrangea. Single molecule real-time sequencing data followed by a HiC analysis was employed. The resultant haplotype-phased sequences consisted of 3,779 sequences (2.256 Gb in length and N50 of 1.5 Mb), and 18 pseudomolecules comprising 1.08 Gb scaffold sequences along with a high-density SNP genetic linkage map. Using the genome sequence data obtained from two breeding populations, the SNPs linked to double flower loci (Djo and Dsu), were discovered for each breeding population. DNA markers J01 linked to Djo and S01 linked to Dsu were developed, and these could be used successfully to distinguish the recessive double flower allele for each locus respectively. The LEAFY gene was suggested as the causative gene for Dsu, since frameshift was specifically observed in double flower accession with dsu. The genome information obtained in this study will facilitate a wide range of genomic studies on hydrangea in the future.


2021 ◽  
Author(s):  
Kenta Shirasawa ◽  
Nobuo Kobayashi ◽  
Akira Nakatsuka ◽  
Hideya Ohta ◽  
Sachiko Isobe

To enhance the genomics and genetics of azalea, the whole-genome sequences of two species of Rhododendron were determined and analyzed in this study: Rhododendron ripense, the cytoplasmic donor and ancestral species of large-flowered and evergreen azalea cultivars, respectively; and Rhododendron kiyosumense, a native of Chiba prefecture (Japan) seldomly bred and cultivated. A chromosome-level genome sequence assembly of R. ripense was constructed by single-molecule real-time (SMRT) sequencing and genetic mapping, while the genome sequence of R. kiyosumense was assembled using the single-tube long fragment read (stLFR) sequencing technology. The R. ripense genome assembly contained 319 contigs (506.7 Mb; N50 length: 2.5 Mb) and was assigned to the genetic map to establish 13 pseudomolecule sequences. On the other hand, the genome of R. kiyosumense was assembled into 32,308 contigs (601.9 Mb; N50 length: 245.7 kb). A total of 34,606 genes were predicted in the R. ripense genome, while 35,785 flower and 48,041 leaf transcript isoforms were identified in R. kiyosumense through Iso-Seq analysis. Overall, the genome sequence information generated in this study enhances our understanding of genome evolution in the Ericales and reveals the phylogenetic relationship of closely-related species. This information will also facilitate the development of phenotypically attractive azalea cultivars.


2017 ◽  
Vol 5 (23) ◽  
Author(s):  
Urmi Halder ◽  
Aparna Banerjee ◽  
Vasvi Chaudhry ◽  
Rajeev K. Varshney ◽  
Shrikant Mantri ◽  
...  

ABSTRACT Here, we present the draft genome sequence of Bacillus altitudinis SORB11, which is tolerant to UV radiation. The strain was isolated from the Indian sector of the Southern Ocean at a depth of 3.8 km. The genome sequence information reported here for B. altitudinis SORB11 gives the basis of its UV resistance mechanism and provides data for further comparative studies with other bacteria resistant to UV radiation.


2020 ◽  
Vol 9 (24) ◽  
Author(s):  
Indranil Biswas

ABSTRACT Streptococcus mutans is known to produce various antimicrobial peptides called mutacins. Two clinical isolates, T8 and UA140, are well characterized regarding their mutacin production, but genome sequence information was previously unavailable. Complete genome sequences of these two mutacin-producing strains are reported here.


PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e5895 ◽  
Author(s):  
Thomas Andreas Kohl ◽  
Christian Utpatel ◽  
Viola Schleusener ◽  
Maria Rosaria De Filippo ◽  
Patrick Beckert ◽  
...  

Analyzing whole-genome sequencing data of Mycobacterium tuberculosis complex (MTBC) isolates in a standardized workflow enables both comprehensive antibiotic resistance profiling and outbreak surveillance with highest resolution up to the identification of recent transmission chains. Here, we present MTBseq, a bioinformatics pipeline for next-generation genome sequence data analysis of MTBC isolates. Employing a reference mapping based workflow, MTBseq reports detected variant positions annotated with known association to antibiotic resistance and performs a lineage classification based on phylogenetic single nucleotide polymorphisms (SNPs). When comparing multiple datasets, MTBseq provides a joint list of variants and a FASTA alignment of SNP positions for use in phylogenomic analysis, and identifies groups of related isolates. The pipeline is customizable, expandable and can be used on a desktop computer or laptop without any internet connection, ensuring mobile usage and data security. MTBseq and accompanying documentation is available from https://github.com/ngs-fzb/MTBseq_source.


2016 ◽  
Vol 4 (5) ◽  
Author(s):  
Kai Bernd Stadermann ◽  
Daniela Holtgräwe ◽  
Bernd Weisshaar

A publicly available data set from Pacific Biosciences was used to create an assembly of the chloroplast genome sequence of theArabidopsis thalianagenotype Landsbergerecta. The assembly is solely based on single-molecule, real-time sequencing data and hence provides high resolution of the two inverted repeat regions typically contained in chloroplast genomes.


Sign in / Sign up

Export Citation Format

Share Document