de novo gene
Recently Published Documents


TOTAL DOCUMENTS

111
(FIVE YEARS 39)

H-INDEX

24
(FIVE YEARS 6)

2022 ◽  
Author(s):  
Nicholas Delihas

A short non-coding sequence present between the gamma-glutamyltransferase 1 (GGT1) and gamma-glutamyltransferase 5 (GGT5) genes, termed a spacer sequence has been detected in the genomes of Mus musculus, the house mouse and in Philippine tarsier, a primitive ancestral primate. It is highly conserved during primate evolution with certain sequences being totally invariant from mouse to humans. Evidence is presented to show this intergenic sequence serves as a nucleation site for the initiation of diverse genes. We also outline the birth of the human lincRNA gene BCRP3 (BCR activator of RhoGEF and GTPase 3 pseudogene) during primate evolution. The gene developmental process involves sequence initiation, addition of a complex of tandem transposable elements and addition of a segment of another gene. The sequence, initially formed in the Old World Monkeys such as the Rhesus monkey (Macaca mulatta) and the baboon (Papio anubis), develops into different primate genes before evolving into the human BCRP3 gene; it appears to also include trial and error during sequence/gene formation. The protein gene, GGT5 may have also formed by spacer sequence initiation in an ancient ancestor such as zebrafish, but spacer and GGT5 gene sequence drift during evolution produced a divergence that precludes further assessment.


2021 ◽  
Author(s):  
Chris Papadopoulos ◽  
Isabelle Callebaut ◽  
Jean-Christophe Gelly ◽  
Isabelle Hatin ◽  
Olivier Namy ◽  
...  

The noncoding genome plays an important role in de novo gene birth and in the emergence of genetic novelty. Nevertheless, how noncoding sequences’ properties could promote the birth of novel genes and shape the evolution and the structural diversity of proteins remains unclear. Therefore, by combining different bioinformatic approaches, we characterized the fold potential diversity of the amino acid sequences encoded by all intergenic open reading frames (ORFs) of S. cerevisiae with the aim of (1) exploring whether the structural states’ diversity of proteomes is already present in noncoding sequences, and (2) estimating the potential of the noncoding genome to produce novel protein bricks that could either give rise to novel genes or be integrated into pre-existing proteins, thus participating in protein structure diversity and evolution. We showed that amino acid sequences encoded by most yeast intergenic ORFs contain the elementary building blocks of protein structures. Moreover, they encompass the large structural state diversity of canonical proteins, with the majority predicted as foldable. Then, we investigated the early stages of de novo gene birth by reconstructing the ancestral sequences of 70 yeast de novo genes and characterized the sequence and structural properties of intergenic ORFs with a strong translation signal. This enabled us to highlight sequence and structural factors determining de novo gene emergence. Finally, we showed a strong correlation between the fold potential of de novo proteins and one of their ancestral amino acid sequences, reflecting the relationship between the noncoding genome and the protein structure universe.


Genetics ◽  
2021 ◽  
Author(s):  
Julie M Cridland ◽  
Alex C Majane ◽  
Li Zhao ◽  
David J Begun

Abstract Early work on de novo gene discovery in Drosophila was consistent with the idea that many such genes have male-biased patterns of expression, including a large number expressed in the testis. However, there has been little formal analysis of variation in the abundance and properties of de novo genes expressed in different tissues. Here we investigate the population biology of recently evolved de novo genes expressed in the D. melanogaster accessory gland, a somatic male tissue that plays an important role in male and female fertility and the post mating response of females, using the same collection of inbred lines used previously to identify testis-expressed de novo genes, thus allowing for direct cross tissue comparisons of these genes in two tissues of male reproduction. Using RNA-seq data we identify candidate de novo genes located in annotated intergenic and intronic sequence and determine the properties of these genes including chromosomal location, expression, abundance, and coding capacity. Generally, we find major differences between the tissues in terms of gene abundance and expression, though other properties such as transcript length and chromosomal distribution are more similar. We also explore differences between regulatory mechanisms of de novo genes in the two tissues and how such differences may interact with selection to produce differences in D. melanogaster de novo genes expressed in the two tissues.


Genes ◽  
2021 ◽  
Vol 12 (11) ◽  
pp. 1777
Author(s):  
Xuan Zhuang ◽  
C.-H. Christina Cheng

The de novo birth of functional genes from non-coding DNA as an important contributor to new gene formation is increasingly supported by evidence from diverse eukaryotic lineages. However, many uncertainties remain, including how the incipient de novo genes would continue to evolve and the molecular mechanisms underlying their evolutionary trajectory. Here we address these questions by investigating evolutionary history of the de novo antifreeze glycoprotein (AFGP) gene and gene family in gadid (codfish) lineages. We examined AFGP phenotype on a phylogenetic framework encompassing a broad sampling of gadids from freezing and non-freezing habitats. In three select species representing different AFGP-bearing clades, we analyzed all AFGP gene family members and the broader scale AFGP genomic regions in detail. Codon usage analyses suggest that motif duplication produced the intragenic AFGP tripeptide coding repeats, and rapid sequence divergence post-duplication stabilized the recombination-prone long repetitive coding region. Genomic loci analyses support AFGP originated once from a single ancestral genomic origin, and shed light on how the de novo gene proliferated into a gene family. Results also show the processes of gene duplication and gene loss are distinctive in separate clades, and both genotype and phenotype are commensurate with differential local selective pressures.


2021 ◽  
Author(s):  
Nikolaos Vakirlis ◽  
Kate M. Duggan ◽  
Aoife McLysaght

We now have a growing understanding that functional short proteins can be translated out of small Open Reading Frames (sORF). Such ″microproteins″ can perform crucial biological tasks and can have considerable phenotypic consequences. However, their size makes them less amenable to genomic analysis, and their evolutionary origins and conservation are poorly understood. Given their short length it is plausible that some of these functional microproteins have recently originated entirely de novo from non-coding sequence. Here we test the possibility that de novo gene birth can produce microproteins that are functional ″out-of-the-box″. We reconstructed the evolutionary origins of human microproteins previously found to have measurable, statistically significant fitness effects. By tracing the appearance of each ORF and its transcriptional activation, we were able to show that, indeed, novel small proteins with significant phenotypic effects have emerged de novo throughout animal evolution, including many after the human-chimpanzee split. We show that traditional methods for assessing the coding potential of such sequences often fall short, due to the high variability present in the alignments and the absence of telltale evolutionary signatures that are not yet measurable. Thus we provide evidence that the functional potential intrinsic to sORFs can be rapidly, and frequently realised through de novo gene birth.


Author(s):  
Carlos P Cantalapiedra ◽  
Ana Hernández-Plaza ◽  
Ivica Letunic ◽  
Peer Bork ◽  
Jaime Huerta-Cepas

Abstract Even though automated functional annotation of genes represents a fundamental step in most genomic and metagenomic workflows, it remains challenging at large scales. Here, we describe a major upgrade to eggNOG-mapper, a tool for functional annotation based on precomputed orthology assignments, now optimized for vast (meta)genomic data sets. Improvements in version 2 include a full update of both the genomes and functional databases to those from eggNOG v5, as well as several efficiency enhancements and new features. Most notably, eggNOG-mapper v2 now allows for: (i) de novo gene prediction from raw contigs, (ii) built-in pairwise orthology prediction, (iii) fast protein domain discovery, and (iv) automated GFF decoration. eggNOG-mapper v2 is available as a standalone tool or as an online service at http://eggnog-mapper.embl.de.


2021 ◽  
Author(s):  
Bo Yun Lee ◽  
Jun Kim ◽  
Junho Lee

Genes embed their evolutionary history in the form of various alleles. Presence absence variants (PAVs) are extreme cases of such alleles, where a gene present in one haplotype does not exist in another. Since PAVs may result from either birth or death of a gene, PAV genes and their alternative alleles, if available, can represent a basis for rapid intraspecific gene evolution. Here, we traced a possible evolution of PAV genes in the PD1074 and CB4856 C. elegans strains as well as their alternative alleles found in other 14 wild strains, using long-read sequencing technologies. We updated the CB4856 genome by filling 18 gaps and identified 50 novel genes and 7,460 novel isoforms from both strains. We verified 328 PAV genes, out of which 48 were C. elegans-specific. Among these possible newly-born genes, 13 had alternative alleles in other wild strains and, in particular, alternative alleles of three genes showed signatures active transposons. Alternative alleles of four other genes showed another type of signature reflected in accumulation of small insertions or deletions. Our results exemplify that research on gene evolution using both species-specific PAV genes and their alternative alleles is expected to provide new perspectives for how genes evolve.


Author(s):  
Timothy B Yates ◽  
Kai Feng ◽  
Jin Zhang ◽  
Vasanth Singan ◽  
Sara S Jawdy ◽  
...  

Abstract Orphan genes are characteristic genomic features that have no detectable homology to genes in any other species and represent an important attribute of genome evolution as sources of novel genetic functions. Here, we identified 445 genes specific to Populus trichocarpa. Of these, we performed deeper reconstruction of 13 orphan genes to provide evidence of de novo gene evolution. Populus and its sister genera Salix are particularly well suited for the study of orphan gene evolution because of the Salicoid whole-genome duplication event (WGD) which resulted in highly syntenic sister chromosomal segments across the Salicaceae. We leveraged this genomic feature to reconstruct de novo gene evolution from inter-genera, inter-species, and intra-genomic perspectives by comparing the syntenic regions within the P. trichocarpa reference, then P. deltoides, and finally Salix purpurea. Furthermore, we demonstrated that 86.5% of the putative orphan genes had evidence of transcription. Additionally, we also utilized the Populus genome-wide association mapping panel (GWAS), a collection of 1,084 undomesticated P. trichocarpa genotypes to further determine putative regulatory networks of orphan genes using expression quantitative trait loci (eQTL) mapping. Functional enrichment of these eQTL subnetworks identified common biological themes associated with orphan genes such as response to stress and defense response. We also identify a putative cis-element for a de novo gene and leverage conserved synteny to describe evolution of a putative transcription factor binding site. Overall, 45% of orphan genes were captured in trans-eQTL networks.


2021 ◽  
Author(s):  
Somya Mani ◽  
Tsvi Tlusty

Contrary to long-held views, recent evidence indicates that de novo birth of genes is not only possible but is surprisingly prevalent: a substantial fraction of eukaryotic genomes are composed of orphan genes, which show no homology with any conserved genes. And a remarkably large proportion of orphan genes likely originated denovo from non-genic regions. Here, using a parsimonious mathematical model, we investigate the probability and timescale of de novo gene birth due to spontaneous mutations. We trace how an initially non-genic locus accumulates beneficial mutations to become a gene. We sample across a wide range of biologically feasible distributions of fitness effects (DFE) of mutations, and calculate the conditions conducive to gene birth. We find that in a time frame of millions of years, gene birth is highly likely for a wide range of DFEs. Moreover, when we allow DFEs to fluctuate, which is expected given the long time frame, gene birth in the model becomes practically inevitable. This supports the idea that gene birth is a ubiquitous process, and should occur in a wide variety of organisms. Our results also demonstrate that intergenic regions are not inactive and silent but are more like dynamic storehouses of potential genes.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Helen Lesser ◽  
Jason E. Cohn

AbstractA 14-year-old African American female presented to the emergency department with spontaneous, sudden-onset lip swelling for 1 h. On examination, there was significant water-bag edema of the upper lip extending to the philtrum and premaxilla. Nasopharyngeal laryngoscopy revealed a patent airway without edema. She was initiated on intravenous dexamethasone, famotidine, and diphenhydramine, after which her edema improved but did not resolve. She was subsequently transferred to a local pediatric hospital and upon further testing she was found to have a C1 esterase inhibitor de novo gene mutation. Angioedema causes localized, non-pitting edema of the dermis, subcutaneous and submucosal tissue, and often manifests in the lips, face, mouth, and throat. Signs of laryngeal involvement include change in voice, stridor, dysphagia, and dyspnea. When laryngeal edema is present, it may necessitate definitive airway management and patients should be monitored in the intensive care unit.


Sign in / Sign up

Export Citation Format

Share Document