large repeat
Recently Published Documents


TOTAL DOCUMENTS

20
(FIVE YEARS 4)

H-INDEX

8
(FIVE YEARS 1)

2021 ◽  
Vol 12 ◽  
Author(s):  
Min Liao ◽  
Xin-Fen Gao ◽  
Jun-Yi Zhang ◽  
Heng-Ning Deng ◽  
Bo Xu

The taxonomy and evolutionary history of Sophora L., a genus with high economic and medicinal value, remain uncertain due to the absence of genetic resource (especially in China) and low polymorphism of molecular markers. Our aim was to elucidate the molecular evolution and phylogenetic relationships in chloroplast genomes of Sophora species in the early-diverging legume subfamily Papilionoideae (Fabaceae). We reported nine Sophora chloroplast genome from China using Illumina sequencing. We performed a series of analyses with previously published genomes of Sophora species to investigate their genomic characteristics, identified simple sequence repeats, large repeat sequences, tandem repeats, and highly polymorphic loci. The genomes were 152,953–158,087 bp in length, and contained 111–113 unique genes, including 76–78 protein coding, 31 tRNA, and 4 rRNA. The expansion of inverted repeat boundary of Sophora resulted in rps12 entering into the LSC region and loss of trnT-CGU gene in some species. Also, we found an approximately 23 kb inversion between trnC-GCA and trnF-GAA within the genus. In addition, we identified seven highly polymorphic loci (pi (π) > 0.035) suitable for inferring the phylogeny of Sophora species. Among these, three regions also co-occurred with large repeat sequences and support use of repeats as a proxy for the identification of polymorphic loci. Based on whole chloroplast genome and protein-coding sequences data-set, a well-supported phylogenetic tree of Sophora and related taxa showed that this genus is monophyletic, but sect. Disamaea and sect. Sophora, are incongruent with traditional taxonomic classifications based on fruit morphology. Our finding provides significant genetic resources to support further investigation into the phylogenetic relationship and evolution of the genus Sophora.


2021 ◽  
Vol 10 ◽  
Author(s):  
Edith Heard ◽  
Alexander D Johnson ◽  
Jan O Korbel ◽  
Charles Lee ◽  
Michael P Snyder ◽  
...  

While the human genome represents the most accurate vertebrate reference assembly to date, it still contains numerous gaps, including centromeric and other large repeat-containing regions – often termed the “dark side” of the genome – many of which are of fundamental biological importance. Miga et al. present the first gapless assembly of the human X chromosome, with the help of ultra-long-read nanopore reads generated for the haploid complete hydatidiform mole (CHM13) genome. They reconstruct the ~3.1 megabase centromeric satellite DNA array and map DNA methylation patterns across complex tandem repeats and satellite arrays. This Telomere-to-Telomere assembly provides a superior human X chromosome reference enabling future sex-determination and X-linked disease research, and provides a path towards finishing the entire human genome sequence.


2020 ◽  
Vol 9 (42) ◽  
Author(s):  
Mehmet Dayi ◽  
Simo Sun ◽  
Yasunobu Maeda ◽  
Ryusei Tanaka ◽  
Akemi Yoshida ◽  
...  

ABSTRACT Bursaphelenchus xylophilus has been destroying pine forests in East Asia and western Europe. Here, we report its nearly complete genomic sequence containing five ∼12-Mb scaffolds and one ∼15-Mb scaffold representing six chromosomes. Large repeat regions that were previously unidentified are now reasonably integrated, particularly in the ∼15-Mb scaffold.


2020 ◽  
Vol 432 (20) ◽  
pp. 5696-5710 ◽  
Author(s):  
Lucia Motlova ◽  
Nela Klimova ◽  
Radovan Fiser ◽  
Peter Sebo ◽  
Ladislav Bumba
Keyword(s):  
Type I ◽  

IUCrJ ◽  
2018 ◽  
Vol 5 (6) ◽  
pp. 681-698 ◽  
Author(s):  
C. Delacotte ◽  
G. F. S. Whitehead ◽  
M. J. Pitcher ◽  
C. M. Robertson ◽  
P. M. Sharp ◽  
...  

Hexaferrites are an important class of magnetic oxides with applications in data storage and electronics. Their crystal structures are highly modular, consisting of Fe- or Ba-rich close-packed blocks that can be stacked in different sequences to form a multitude of unique structures, producing large anisotropic unit cells with lattice parameters typically >100 Å along the stacking axis. This has limited atomic-resolution structure solutions to relatively simple examples such as Ba2Zn2Fe12O22, whilst longer stacking sequences have been modelled only in terms of block sequences, with no refinement of individual atomic coordinates or occupancies. This paper describes the growth of a series of complex hexaferrite crystals, their atomic-level structure solution by high-resolution synchrotron X-ray diffraction, electron diffraction and imaging methods, and their physical characterization by magnetometry. The structures include a new hexaferrite stacking sequence, with the longest lattice parameter of any hexaferrite with a fully determined structure.


2017 ◽  
Author(s):  
Emily J. Brown ◽  
Alison H. Nguyen ◽  
Doris Bachtrog

Heterochromatin suppresses repetitive DNA, and a loss of heterochromatin has been observed in aged cells of several species, including humans and Drosophila. Males often contain substantially more heterochromatic DNA than females, due to the presence of a large, repeat-rich Y chromosome, and male flies generally have shorter average life spans than females. Here we show that repetitive DNA becomes de-repressed more rapidly in old male flies relative to females, and repeats on the Y chromosome are disproportionally mis-expressed during aging. This is associated with a loss of heterochromatin at repetitive elements during aging in male flies, and a general loss of repressive chromatin in aged males away from pericentromeric regions and the Y. By generating flies with different sex chromosome karyotypes (XXY females; X0 and XYY males), we show that repeat de-repression and average lifespan is directly correlated with the number of Y chromosomes. Thus, sex-specific chromatin differences contribute to sex-specific aging in flies.


2016 ◽  
Author(s):  
Bernardo J. Clavijo ◽  
Luca Venturini ◽  
Christian Schudoma ◽  
Gonzalo Garcia Accinelli ◽  
Gemy Kaithakottil ◽  
...  

Advances in genome sequencing and assembly technologies are generating many high quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimised data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents more than 78% of the genome with a scaffold N50 of 88.8kbp that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNAseq and PacBio full-length cDNAs to identify 104,091 high confidence protein-coding genes and 10,156 non-coding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop. [Supplemental material is available for this article.]


2016 ◽  
Vol 17 (1) ◽  
Author(s):  
Song Gao ◽  
Denis Bertrand ◽  
Burton K. H. Chia ◽  
Niranjan Nagarajan

2015 ◽  
Author(s):  
Song Gao ◽  
Denis Bertrand ◽  
Burton KH Chia ◽  
Niranjan Nagarajan

The assembly of large, repeat-rich eukaryotic genomes continues to represent a significant challenge in genomics. While long-read technologies have made the high-quality assembly of small, microbial genomes increasingly feasible, data generation can be prohibitively expensive for larger genomes. Fundamental advances in assembly algorithms are thus essential to exploit the characteristics of short and long-read sequencing technologies to consistently and reliably provide high-qualities assemblies in a cost-efficient manner. Here we present a scalable, exact algorithm (OPERA-LG) for the scaffold assembly of large, repeat-rich genomes that exhibits almost an order of magnitude improvement over the state-of-the-art programs in both correctness (>5X on average) and contiguity (>10X). This provides a systematic approach for combining data from different sequencing technologies, as well as a rigorous framework for scaffolding of repetitive sequences. OPERA-LG represents the first in a new class of algorithms that can efficiently assemble large genomes while providing formal guarantees about assembly quality, providing an avenue for systematic augmentation and improvement of 1000s of existing draft eukaryotic genome assemblies.


BMC Genomics ◽  
2013 ◽  
Vol 14 (1) ◽  
pp. 448 ◽  
Author(s):  
Daryl T Morishige ◽  
Patricia E Klein ◽  
Josie L Hilley ◽  
Sayed Mohammad Sahraeian ◽  
Arun Sharma ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document