scholarly journals Comparative Genomics Analysis Reveals High Levels of Differential Retrotransposition among Primates from the Hominidae and the Cercopithecidae Families

2019 ◽  
Vol 11 (11) ◽  
pp. 3309-3325 ◽  
Author(s):  
Wanxiangfu Tang ◽  
Ping Liang

Abstract Mobile elements (MEs), making ∼50% of primate genomes, are known to be responsible for generating inter- and intra-species genomic variations and play important roles in genome evolution and gene function. Using a bioinformatics comparative genomics approach, we performed analyses of species-specific MEs (SS-MEs) in eight primate genomes from the families of Hominidae and Cercopithecidae, focusing on retrotransposons. We identified a total of 230,855 SS-MEs, with which we performed normalization based on evolutionary distances, and we also analyzed the most recent SS-MEs in these genomes. Comparative analysis of SS-MEs reveals striking differences in ME transposition among these primate genomes. Interesting highlights of our results include: 1) the baboon genome has the highest number of SS-MEs with a strong bias for SINEs, while the crab-eating macaque genome has a sustained extremely low transposition for all ME classes, suggesting the existence of a genome-wide mechanism suppressing ME transposition; 2) while SS-SINEs represent the dominant class in general, the orangutan genome stands out by having SS-LINEs as the dominant class; 3) the human genome stands out among the eight genomes by having the largest number of recent highly active ME subfamilies, suggesting a greater impact of ME transposition on its recent evolution; and 4) at least 33% of the SS-MEs locate to genic regions, including protein coding regions, presenting significant potentials for impacting gene function. Our study, as the first of its kind, demonstrates that mobile elements evolve quite differently among these primates, suggesting differential ME transposition as an important mechanism in primate evolution.

2019 ◽  
Author(s):  
Wanxiangfu Tang ◽  
Ping Liang

ABSTRACTMobile elements generated via DNA transposition constitute ∼50% of the primate genomes. As a result of past and ongoing activity, DNA transposition is responsible for generating inter- and intra-species genomic variations, and it plays important roles in shaping genome evolution and impacting gene function. While limited analysis of mobile elements has been performed in many primate genomes, a large-scale comparative genomic analysis examining the impact of DNA transposition on primate evolution is still missing.Using a bioinformatics comparative genomics approach, we performed analysis of species-specific mobile elements (SS-MEs) in eight primate genomes, which include human, chimpanzee, gorilla, orangutan, green monkey, crab-eating macaque, rhesus monkey, and baboon. These species have good representations for the top two primate families, Hominidae (great apes) and the Cercopithecidae (old world monkeys), for which draft genome sequences are available.Our analysis identified a total of 230,855 SS-MEs from the eight primate genomes, which collectively contribute to ∼82 Mbp genome sequences, ranging from 14 to 25 Mbp for individual genomes. Several new interesting observations were made based on these SS-MEs. First, the DNA transposition activity level reflected by the numbers of SS-MEs was shown to be drastically different across species with the highest (baboon genome) being more than 30 times higher than the lowest (crab-eating macaque genome). Second, the compositions of SS-MEs, as well as the top active ME subfamilies, also differ significantly across genomes. By the copy numbers of SS-MEs divided into major ME classes, SINE represents the dominant class in all genomes, but more so in the Cercopithecidae genomes than in the Hominidae genomes in general with the orangutan genome being the outliner of this trend by having LINE as the dominant class. While AluY represents the major SINE groups in the Hominidae genomes, AluYRa1 is the dominant SINE in the Cercopithecidae genomes. For LINEs, each Hominidae genome seems to have a unique most active L1 subfamily, but all Cercopithecidae genomes have L1RS2 as the most active LINEs. While genomes with a high number of SS-MEs all have one or more very active ME subfamilies, the crab-eating macaque genome, being the one with an extremely low level of DNA transposition, has no single ME class being very active, suggesting the existence of a genome-wide mechanism suppressing DNA transposition. Third, DNA transposons, despite being considered dead in primate genomes, were in fact shown to have a certain level of activity in all genomes examined with a total of ∼2,400 entries as SS-MEs. Among these SS-MEs, at least 23% locate to genic regions, including exons and regulatory elements, presenting significant potentials for their impact on gene function. Very interestingly, our data demonstrate that, among the eight primates included in this study, the human genome is shown to be the most actively evolving genome via DNA transposition as having the highest most recent activity of many ME subfamilies, notably the AluYa5/Yb8/Yb9, L1HS, and SVA-D subfamilies.Representing the first of its kind, our large-scale comparative genomics study has shown that mobile elements evolved quite differently among different groups and species of primates, indicating that differential DNA transposition has served as an important mechanism in primate evolution.


2022 ◽  
Vol 9 ◽  
Author(s):  
Jonathan Filée ◽  
Marie Merle ◽  
Héloïse Bastide ◽  
Florence Mougel ◽  
Jean-Michel Bérenger ◽  
...  

We provide in this study a very large DNA dataset on Rhodnius species including 36 samples representing 16 valid species of the three Rhodnius groups, pictipes, prolixus and pallescens. Samples were sequenced at low-depth with whole-genome shotgun sequencing (Illumina technology). Using phylogenomics including 15 mitochondrial genes (13.3 kb), partial nuclear rDNA (5.2 kb) and 51 nuclear protein-coding genes (36.3 kb), we resolve sticking points in the Rhodnius phylogeny. At the species level, we confirmed the species-specific status of R. montenegrensis and R. marabaensis and we agree with the synonymy of R. taquarussuensis with R. neglectus. We also invite to revisit the species-specific status of R. milesi that is more likely R. nasutus. We proposed to define a robustus species complex that comprises the four close relative species: R. marabaensis, R. montenegrensis, R. prolixus and R. robustus. As Psammolestes tertius was included in the Rhodnius clade, we strongly recommend reclassifying this species as R. tertius. At the Rhodnius group level, molecular data consistently supports the clustering of the pictipes and pallescens groups, more related to each other than they are to the prolixus group. Moreover, comparing mitochondrial and nuclear tree topologies, our results demonstrated that various introgression events occurred in all the three Rhodnius groups, in laboratory strains but also in wild specimens. We demonstrated that introgressions occurred frequently in the prolixus group, involving the related species of the robustus complex but also the pairwise R. nasutus and R. neglectus. A genome wide analysis highlighted an introgression event in the pictipes group between R. stali and R. brethesi and suggested a complex gene flow between the three species of the pallescens group, R. colombiensis, R. pallescens and R. ecuadoriensis. The molecular data supports also a sylvatic distribution of R. prolixus in Brazil (Pará state) and the monophyly of R. robustus. As we detected extensive introgression events and selective pressure on mitochondrial genes, we strongly recommend performing separate mitochondrial and nuclear phylogenies and to take advantages of mito-nuclear conflicts in order to have a comprehensive evolutionary vision of this genus.


2016 ◽  
Author(s):  
Wanxiangfu Tang ◽  
Seyoung Mun ◽  
Adiya Joshi ◽  
Kyundong Han ◽  
Ping Liang

AbstractMobile elements (MEs) collectively constituted to at least 51% of the human genome. Due to their past incremental accumulation and ongoing DNA transposition of members from certain subfamilies, MEs serve as a significant source for both inter- and intra-species genetic diversity during primate and human evolution. Since MEs can exert direct impact on gene function via a plethora of mechanism, it is believed that the ME-derived genetic diversity has contributed to the phenotypic differences between human and non-human primates, as well as among human populations and individuals. To define the specific contribution of MEs in making Human sapiens as a biologically unique species, we aim to compile a complete list of MEs that are only uniquely present in the human genome, i.e., human-specific MEs (HS-MEs).By making use of the most recent reference genome sequences for human and many other primates and a unbiased more robust and integrative multi-way comparative genomic approach, we identified a total of 15,463 HS-MEs. This list of HS-MEs represents a 120% increase from prior studies with over 8,000 being newly identified as HS-MEs. Collectively, these ~15,000 HS-MEs have contributed to a total of 15 million base pair (Mbp) sequence increase through insertion, generation of target site duplications, and transductions, as well as a 0.5 Mbp sequence loss via insertion- mediated deletions, leading to a net total of 14.5 Mbp genome size increase. Other new observations made with these HS-MEs include: 1) identification of several additional ME subfamilies with significant transposition activities not visible with prior smaller datasets (e.g. L1HS, L1PA2, and HERV-K); 2) A clear similarity of the retrotransposition mechanism among L1, Alus, and SVAs that is distinct from HERVs based on the pre- integration site sequence motifs; 3) Y-chromosome as a strikingly hot target for HS-MEs, particularly for LTRs, which showed an insertion rate 15 times higher than the genome average; 4) among the ME types, SVAs seem to show a very strong bias in inserting into existing SVAs. Among the HS-MEs, more than 8,000 elements were integrated into the vicinity of ~4900 unique genes, in regions including CDS, untranslated exon regions, promoters, and introns of protein coding genes, as well as promoters and exons of non- coding RNAs. In seven cases, MEs participate in protein coding. Furthermore, 1,213 HS-MEs contributed to a total of 3,124 experimentally identified binding sites for 146 of the 161 transcriptional factors in association with 622 genes. All these data suggest that these HS-MEs, despite being very young, already showed sufficient sign for their participation in gene function via regulation of transcription, splicing, and protein coding, with more potential for future participation.In conclusion, our results demonstrate that the amount of MEs uniquely occurred in the human genome is much higher than previously known, and we predict that the same is true regarding their impact on human genome evolution and function. The comprehensive list of HS-MEs provides an important reference resource for studying the impact of DNA transposition in human genome evolution and gene function.


2014 ◽  
Author(s):  
Adrienne Ressayre ◽  
Sylvain Glemin ◽  
Pierre Montalent ◽  
Laurana Serres-Giardi ◽  
Christine Dillmann ◽  
...  

Plant genomes are large, intron-rich and present a wide range of variation in coding region G+C content. Concerning coding regions, a sort of syndrome can be described in plants: the increase in G+C content is associated with both the increase in heterogeneity among genes within a genome and the increase in variation across genes. Taking advantage of the large number of genes composing plant genomes and the wide range of variation in gene intron number, we performed a comprehensive survey of the patterns of variation in G+C content at different scales from the nucleotide level to the genome scale in two species Arabidopsis thaliana and Oryza sativa, comparing the patterns in genes with different intron numbers. In both species, we observed a pervasive effect of gene intron number and location along genes on G+C content, codon and amino acid frequencies suggesting that in both species, introns have a barrier effect structuring G+C content along genes. In external gene regions (located upstream first or downstream last intron), species-specific factors are shaping G+C content while in internal gene regions (surrounded by introns), G+C content is constrained to remain within a range common to both species. In rice, introns appear as a major determinant of gene G+C content while in A. thaliana introns have a weaker but significant effect. The structuring effect of introns in both species is susceptible to explain the G+C content syndrome observed in plants.


Author(s):  
Noé Cochetel ◽  
Andrea Minio ◽  
Mélanie Massonnet ◽  
Amanda M Vondras ◽  
Rosa Figueroa-Balderas ◽  
...  

Abstract Muscadinia rotundifolia, the muscadine grape, has been cultivated for centuries in the southeastern United States. M. rotundifolia is resistant to many of the pathogens that detrimentally affect Vitis vinifera, the grape species commonly used for winemaking. For this reason, M. rotundifolia is a valuable genetic resource for breeding. Single-molecule real-time reads were combined with optical maps to reconstruct the two haplotypes of each of the 20 M. rotundifolia cv. Trayshed chromosomes. The completeness and accuracy of the assembly were confirmed using a high-density linkage map of M. rotundifolia. Protein-coding genes were annotated using an integrated and comprehensive approach. This included using Full-length cDNA sequencing (Iso-Seq) to improve gene structure and hypothetical spliced variant predictions. Our data strongly support that Muscadinia chromosomes 7 and 20 are fused in Vitis and pinpoint the location of the fusion in Cabernet Sauvignon and PN40024 chromosome 7. Disease-related gene numbers in Trayshed and Cabernet Sauvignon were similar, but their clustering locations were different. A dramatic expansion of the Toll/Interleukin-1 Receptor-like Nucleotide-Binding Site Leucine-Rich Repeat (TIR-NBS-LRR) class was detected on Trayshed chromosome 12 at the Resistance to Uncinula necator 1 (RUN1)/ Resistance to Plasmopara viticola 1 (RPV1) locus, which confers strong dominant resistance to powdery and downy mildews. A genome browser for Trayshed, its annotation, and an associated Blast tool are available at .www.grapegenomics.com


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Guifang Lin ◽  
Cheng He ◽  
Jun Zheng ◽  
Dal-Hoe Koo ◽  
Ha Le ◽  
...  

Abstract Background The maize inbred line A188 is an attractive model for elucidation of gene function and improvement due to its high embryogenic capacity and many contrasting traits to the first maize reference genome, B73, and other elite lines. The lack of a genome assembly of A188 limits its use as a model for functional studies. Results Here, we present a chromosome-level genome assembly of A188 using long reads and optical maps. Comparison of A188 with B73 using both whole-genome alignments and read depths from sequencing reads identify approximately 1.1 Gb of syntenic sequences as well as extensive structural variation, including a 1.8-Mb duplication containing the Gametophyte factor1 locus for unilateral cross-incompatibility, and six inversions of 0.7 Mb or greater. Increased copy number of carotenoid cleavage dioxygenase 1 (ccd1) in A188 is associated with elevated expression during seed development. High ccd1 expression in seeds together with low expression of yellow endosperm 1 (y1) reduces carotenoid accumulation, accounting for the white seed phenotype of A188. Furthermore, transcriptome and epigenome analyses reveal enhanced expression of defense pathways and altered DNA methylation patterns of the embryonic callus. Conclusions The A188 genome assembly provides a high-resolution sequence for a complex genome species and a foundational resource for analyses of genome variation and gene function in maize. The genome, in comparison to B73, contains extensive intra-species structural variations and other genetic differences. Expression and network analyses identify discrete profiles for embryonic callus and other tissues.


2021 ◽  
Vol 22 (4) ◽  
pp. 2183
Author(s):  
Nurhani Mat Razali ◽  
Siti Norvahida Hisham ◽  
Ilakiya Sharanee Kumar ◽  
Rohit Nandan Shukla ◽  
Melvin Lee ◽  
...  

Proper management of agricultural disease is important to ensure sustainable food security. Staple food crops like rice, wheat, cereals, and other cash crops hold great export value for countries. Ensuring proper supply is critical; hence any biotic or abiotic factors contributing to the shortfall in yield of these crops should be alleviated. Rhizoctonia solani is a major biotic factor that results in yield losses in many agriculturally important crops. This paper focuses on genome informatics of our Malaysian Draft R. solani AG1-IA, and the comparative genomics (inter- and intra- AG) with four AGs including China AG1-IA (AG1-IA_KB317705.1), AG1-IB, AG3, and AG8. The genomic content of repeat elements, transposable elements (TEs), syntenic genomic blocks, functions of protein-coding genes as well as core orthologous genic information that underlies R. solani’s pathogenicity strategy were investigated. Our analyses show that all studied AGs have low content and varying profiles of TEs. All AGs were dominant for Class I TE, much like other basidiomycete pathogens. All AGs demonstrate dominance in Glycoside Hydrolase protein-coding gene assignments suggesting its importance in infiltration and infection of host. Our profiling also provides a basis for further investigation on lack of correlation observed between number of pathogenicity and enzyme-related genes with host range. Despite being grouped within the same AG with China AG1-IA, our Draft AG1-IA exhibits differences in terms of protein-coding gene proportions and classifications. This implies that strains from similar AG do not necessarily have to retain similar proportions and classification of TE but must have the necessary arsenal to enable successful infiltration and colonization of host. In a larger perspective, all the studied AGs essentially share core genes that are generally involved in adhesion, penetration, and host colonization. However, the different infiltration strategies will depend on the level of host resilience where this is clearly exhibited by the gene sets encoded for the process of infiltration, infection, and protection from host.


2020 ◽  
Vol 21 (24) ◽  
pp. 9746
Author(s):  
Shahina Akter ◽  
Sun-Young Lee ◽  
Muhammad Zubair Siddiqi ◽  
Sri Renukadevi Balusamy ◽  
Md. Ashrafudoulla ◽  
...  

It is essential to develop and discover alternative eco-friendly antibacterial agents due to the emergence of multi-drug-resistant microorganisms. In this study, we isolated and characterized a novel bacterium named Terrabacter humi MAHUQ-38T, utilized for the eco-friendly synthesis of silver nanoparticles (AgNPs) and the synthesized AgNPs were used to control multi-drug-resistant microorganisms. The novel strain was Gram stain positive, strictly aerobic, milky white colored, rod shaped and non-motile. The optimal growth temperature, pH and NaCl concentration were 30 °C, 6.5 and 0%, respectively. Based on 16S rRNA gene sequence, strain MAHUQ-38T belongs to the genus Terrabacter and is most closely related to several Terrabacter type strains (98.2%–98.8%). Terrabacter humi MAHUQ-38T had a genome of 5,156,829 bp long (19 contigs) with 4555 protein-coding genes, 48 tRNA and 5 rRNA genes. The culture supernatant of strain MAHUQ-38T was used for the eco-friendly and facile synthesis of AgNPs. The transmission electron microscopy (TEM) image showed the spherical shape of AgNPs with a size of 6 to 24 nm, and the Fourier transform infrared (FTIR) analysis revealed the functional groups responsible for the synthesis of AgNPs. The synthesized AgNPs exhibited strong anti-bacterial activity against multi-drug-resistant pathogens, Escherichia coli and Pseudomonas aeruginosa. Minimal inhibitory/bactericidal concentrations against E. coli and P. aeruginosa were 6.25/50 and 12.5/50 μg/mL, respectively. The AgNPs altered the cell morphology and damaged the cell membrane of pathogens. This study encourages the use of Terrabacter humi for the ecofriendly synthesis of AgNPs to control multi-drug-resistant microorganisms.


Genes ◽  
2021 ◽  
Vol 12 (5) ◽  
pp. 643
Author(s):  
Thibaud Kuca ◽  
Brandy M. Marron ◽  
Joana G. P. Jacinto ◽  
Julia M. Paris ◽  
Christian Gerspach ◽  
...  

Genodermatosis such as hair disorders mostly follow a monogenic mode of inheritance. Congenital hypotrichosis (HY) belong to this group of disorders and is characterized by abnormally reduced hair since birth. The purpose of this study was to characterize the clinical phenotype of a breed-specific non-syndromic form of HY in Belted Galloway cattle and to identify the causative genetic variant for this recessive disorder. An affected calf born in Switzerland presented with multiple small to large areas of alopecia on the limbs and on the dorsal part of the head, neck, and back. A genome-wide association study using Swiss and US Belted Galloway cattle encompassing 12 cases and 61 controls revealed an association signal on chromosome 29. Homozygosity mapping in a subset of cases refined the HY locus to a 1.5 Mb critical interval and subsequent Sanger sequencing of protein-coding exons of positional candidate genes revealed a stop gain variant in the HEPHL1 gene that encodes a multi-copper ferroxidase protein so-called hephaestin like 1 (c.1684A>T; p.Lys562*). A perfect concordance between the homozygous presence of this most likely pathogenic loss-of-function variant and the HY phenotype was found. Genotyping of more than 700 purebred Swiss and US Belted Galloway cattle showed the global spread of the mutation. This study provides a molecular test that will permit the avoidance of risk matings by systematic genotyping of relevant breeding animals. This rare recessive HEPHL1-related form of hypotrichosis provides a novel large animal model for similar human conditions. The results have been incorporated in the Online Mendelian Inheritance in Animals (OMIA) database (OMIA 002230-9913).


Sign in / Sign up

Export Citation Format

Share Document