scholarly journals A screen for gene paralogies delineating evolutionary branching order of early Metazoa

2019 ◽  
Author(s):  
Albert Erives ◽  
Bernd Fritzsch

The evolutionary diversification of animals is one of Earth’s greatest triumphs, yet its origins are still shrouded in mystery. Animals, the monophyletic clade known as Metazoa, evolved wildly divergent multicellular life strategies featuring ciliated sensory epithelia. In many lineages epithelial sensoria became coupled to increasingly complex nervous systems. Currently, different phylogenetic analyses of single-copy genes support mutually-exclusive possibilities that either Porifera or Ctenophora is sister to all other animals. Resolving this dilemma would advance the ecological and evolutionary understanding of the first animals and the evolution of nervous systems. Here we describe a comparative phylogenetic approach based on gene duplications. We computationally identify and analyze gene families with early metazoan duplications using an approach that mitigates apparent gene loss resulting from the miscalling of paralogs. In the transmembrane channel-like (TMC) family of mechano-transducing channels, we find ancient duplications that define separate clades for Eumetazoa (Placozoa + Cnidaria + Bilateria) versus Ctenophora, and one duplication that is shared only by Eumetazoa and Porifera. In the MLX/MLXIP family of bHLH-ZIP regulators of metabolism, we find that all major lineages from Eumetazoa and Porifera (sponges) share a duplication, absent in Ctenophora. These results suggest a new avenue for deducing deep phylogeny by choosing rather than avoiding ancient gene paralogies.

2019 ◽  
Vol 10 (2) ◽  
pp. 811-826 ◽  
Author(s):  
Albert Erives ◽  
Bernd Fritzsch

The evolutionary diversification of animals is one of Earth’s greatest marvels, yet its earliest steps are shrouded in mystery. Animals, the monophyletic clade known as Metazoa, evolved wildly divergent multicellular life strategies featuring ciliated sensory epithelia. In many lineages epithelial sensoria became coupled to increasingly complex nervous systems. Currently, different phylogenetic analyses of single-copy genes support mutually-exclusive possibilities that either Porifera or Ctenophora is sister to all other animals. Resolving this dilemma would advance the ecological and evolutionary understanding of the first animals and the evolution of nervous systems. Here we describe a comparative phylogenetic approach based on gene duplications. We computationally identify and analyze gene families with early metazoan duplications using an approach that mitigates apparent gene loss resulting from the miscalling of paralogs. In the transmembrane channel-like (TMC) family of mechano-transducing channels, we find ancient duplications that define separate clades for Eumetazoa (Placozoa + Cnidaria + Bilateria) vs. Ctenophora, and one duplication that is shared only by Eumetazoa and Porifera. In the Max-like protein X (MLX and MLXIP) family of bHLH-ZIP regulators of metabolism, we find that all major lineages from Eumetazoa and Porifera (sponges) share a duplicated gene pair that is sister to the single-copy gene maintained in Ctenophora. These results suggest a new avenue for deducing deep phylogeny by choosing rather than avoiding ancient gene paralogies.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Mehmet Dayi ◽  
Natsumi Kanzaki ◽  
Simo Sun ◽  
Tatsuya Ide ◽  
Ryusei Tanaka ◽  
...  

AbstractCaenorhabditis auriculariae, which was morphologically described in 1999, was re-isolated from a Platydema mushroom-associated beetle. Based on the re-isolated materials, some morphological characteristics were re-examined and ascribed to the species. In addition, to clarify phylogenetic relationships with other Caenorhabditis species and biological features of the nematode, the whole genome was sequenced and assembled into 109.5 Mb with 16,279 predicted protein-coding genes. Molecular phylogenetic analyses based on ribosomal RNA and 269 single-copy genes revealed the species is closely related to C. sonorae and C. monodelphis placing them at the most basal clade of the genus. C. auriculariae has morphological characteristics clearly differed from those two species and harbours a number of species-specific gene families, indicating its usefulness as a new outgroup species for Caenorhabditis evolutionary studies. A comparison of carbohydrate-active enzyme (CAZy) repertoires in genomes, which we found useful to speculate about the lifestyle of Caenorhabditis nematodes, suggested that C. auriculariae likely has a life-cycle with tight-association with insects.


2018 ◽  
Author(s):  
Meng Wu ◽  
Jamie L. Kostyun ◽  
Leonie C. Moyle

ABSTRACTWithin the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ∼80 species. Here we report the whole-genome sequencing, assembly, and annotation, of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long-reads (25X) and Illumina short-reads (148X) achieved an assembly of approximately 1.45 Gb, spanning ∼96% of the estimated genome. 96% of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (∼80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1-2 million years.Computational gene prediction, in conjunction with a merged transcriptome dataset from 11 tissues, identified 34725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group, and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Mingzheng Duan ◽  
Haiying Bao ◽  
Tolgor Bau

AbstractIn this study, we report a de novo assembly of the first high-quality genome for a wild mushroom species Leucocalocybe mongolica (LM). We performed high-throughput transcriptome sequencing to analyze the genetic basis for the life history of LM. Our results show that the genome size of LM is 46.0 Mb, including 26 contigs with a contig N50 size of 3.6 Mb. In total, we predicted 11,599 protein-coding genes, of which 65.7% (7630) could be aligned with high confidence to annotated homologous genes in other species. We performed phylogenetic analyses using genes form 3269 single-copy gene families and showed support for distinguishing LM from the genus Tricholoma (L.) P.Kumm., in which it is sometimes circumscribed. We believe that one reason for limited wild occurrences of LM may be the loss of key metabolic genes, especially carbohydrate-active enzymes (CAZymes), based on comparisons with other closely related species. The results of our transcriptome analyses between vegetative (mycelia) and reproductive (fruiting bodies) organs indicated that changes in gene expression among some key CAZyme genes may help to determine the switch from asexual to sexual reproduction. Taken together, our genomic and transcriptome data for LM comprise a valuable resource for both understanding the evolutionary and life history of this species.


2017 ◽  
Author(s):  
Abigail J. Moore ◽  
Jurriaan M. de Vos ◽  
Lillian P. Hancock ◽  
Eric Goolsby ◽  
Erika J. Edwards

ABSTRACTHybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, single copy loci are relatively uncommon elements of most genomes, and as such may provide a biased evolutionary history. Furthermore, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the “portullugo” (Caryophyllales), a moderately sized lineage of flowering plants (~2200 species) that includes the cacti and harbors many evolutionary transitions to C4 and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C4 and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C4 and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75–218 loci across 74 taxa, with ~50% matrix completeness across datasets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve with strong support the sister lineage of the cacti: Anacampserotaceae + Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees for the resolution of the sister group of the cacti. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.


2008 ◽  
Vol 105 (40) ◽  
pp. 15487-15492 ◽  
Author(s):  
Susanne Dreborg ◽  
Görel Sundström ◽  
Tomas A. Larsson ◽  
Dan Larhammar

The opioid peptides and receptors have prominent roles in pain transmission and reward mechanisms in mammals. The evolution of the opioid receptors has so far been little studied, with only a few reports on species other than tetrapods. We have investigated species representing a broader range of vertebrates and found that the four opioid receptor types (delta, kappa, mu, and NOP) are present in most of the species. The gene relationships were deduced by using both phylogenetic analyses and chromosomal location relative to 20 neighboring gene families in databases of assembled genomes. The combined results show that the vertebrate opioid receptor gene family arose by quadruplication of a large chromosomal block containing at least 14 other gene families. The quadruplication seems to coincide with, and, therefore, probably resulted from, the two proposed genome duplications in early vertebrate evolution. We conclude that the quartet of opioid receptors was already present at the origin of jawed vertebrates ≈450 million years ago. A few additional opioid receptor gene duplications have occurred in bony fishes. Interestingly, the ancestral receptor gene duplications coincide with the origin of the four opioid peptide precursor genes. Thus, the complete vertebrate opioid system was already established in the first jawed vertebrates.


2021 ◽  
Author(s):  
Alberto Cenci ◽  
Mairenys Concepci&oacuten-Hernández ◽  
Geert Angenon ◽  
Mathieu Rouard

GDSL-type esterase/lipase (GELP) enzymes have multiple functions in plants, spanning from developmental processes to the response to biotic and abiotic stresses. Genes encoding GELP belong to a large gene family with several tens to more than hundred members per species in angiosperms. Here, we applied iterative phylogenic analyses to identify 10 main clusters subdivided into 44 expert-curated reference orthogroups (OGs) using three monocot and five dicot genomes. Our results show that some GELP OGs expanded while others were maintained as single copy genes. This semi-automatic approach proves to be effective to characterize large gene families and provides a solid classification framework for the GELP members in angiosperms. The orthogroup-based reference will be useful to perform comparative studies, infer gene functions and better understand the evolutionary history of this gene family.


Genetics ◽  
2020 ◽  
Vol 215 (4) ◽  
pp. 1153-1169 ◽  
Author(s):  
Riddhiman K. Garge ◽  
Jon M. Laurent ◽  
Aashiq H. Kachroo ◽  
Edward M. Marcotte

Many gene families have been expanded by gene duplications along the human lineage, relative to ancestral opisthokonts, but the extent to which the duplicated genes function similarly is understudied. Here, we focused on structural cytoskeletal genes involved in critical cellular processes, including chromosome segregation, macromolecular transport, and cell shape maintenance. To determine functional redundancy and divergence of duplicated human genes, we systematically humanized the yeast actin, myosin, tubulin, and septin genes, testing ∼81% of human cytoskeletal genes across seven gene families for their ability to complement a growth defect induced by inactivation or deletion of the corresponding yeast ortholog. In five of seven families—all but α-tubulin and light myosin, we found at least one human gene capable of complementing loss of the yeast gene. Despite rescuing growth defects, we observed differential abilities of human genes to rescue cell morphology, meiosis, and mating defects. By comparing phenotypes of humanized strains with deletion phenotypes of their interaction partners, we identify instances of human genes in the actin and septin families capable of carrying out essential functions, but failing to fully complement the cytoskeletal roles of their yeast orthologs, thus leading to abnormal cell morphologies. Overall, we show that duplicated human cytoskeletal genes appear to have diverged such that only a few human genes within each family are capable of replacing the essential roles of their yeast orthologs. The resulting yeast strains with humanized cytoskeletal components now provide surrogate platforms to characterize human genes in simplified eukaryotic contexts.


2020 ◽  
Vol 21 (5) ◽  
pp. 1752 ◽  
Author(s):  
Jin Yang ◽  
Jie Zhou ◽  
Hong-Jun Zhou ◽  
Mang-Mang Wang ◽  
Ming-Ming Liu ◽  
...  

Phosphate (Pi) transporters play critical roles in Pi acquisition and homeostasis. However, currently little is known about these genes in oil crops. In this study, we aimed to characterize the five Pi transporter gene families (PHT1-5) in allotetraploid Brassica napus. We identified and characterized 81 putative PHT genes in B. napus (BnaPHTs), including 45 genes in PHT1 family (BnaPHT1s), four BnaPHT2s, 10 BnaPHT3s, 13 BnaPHT4s and nine BnaPHT5s. Phylogenetic analyses showed that the largest PHT1 family could be divided into two groups (Group I and II), while PHT4 may be classified into five, Groups I-V. Gene structure analysis revealed that the exon-intron pattern was conservative within the same family or group. The sequence characteristics of these five families were quite different, which may contribute to their functional divergence. Transcription factor (TF) binding network analyses identified many potential TF binding sites in the promoter regions of candidates, implying their possible regulating patterns. Collinearity analysis demonstrated that most BnaPHTs were derived from an allopolyploidization event (~40.7%) between Brassica rapa and Brassica oleracea ancestors, and small-scale segmental duplication events (~39.5%) in the descendant. RNA-Seq analyses proved that many BnaPHTs were preferentially expressed in leaf and flower tissues. The expression profiles of most colinearity-pairs in B. napus are highly correlated, implying functional redundancy, while a few pairs may have undergone neo-functionalization or sub-functionalization during evolution. The expression levels of many BnaPHTs tend to be up-regulated by different hormones inductions, especially for IAA, ABA and 6-BA treatments. qRT-PCR assay demonstrated that six BnaPHT1s (BnaPHT1.11, BnaPHT1.14, BnaPHT1.20, BnaPHT1.35, BnaPHT1.41, BnaPHT1.44) were significantly up-regulated under low- and/or rich- Pi conditions in B. napus roots. This work analyzes the evolution and expression of the PHT family in Brassica napus, which will help further research on their role in Pi transport.


2021 ◽  
Vol 15 (8) ◽  
pp. e0009665
Author(s):  
Shuai Xu ◽  
Zhenpeng Li ◽  
Yuanming Huang ◽  
Lichao Han ◽  
Yanlin Che ◽  
...  

Nocardia is a complex and diverse genus of aerobic actinomycetes that cause complex clinical presentations, which are difficult to diagnose due to being misunderstood. To date, the genetic diversity, evolution, and taxonomic structure of the genus Nocardia are still unclear. In this study, we investigated the pan-genome of 86 Nocardia type strains to clarify their genetic diversity. Our study revealed an open pan-genome for Nocardia containing 265,836 gene families, with about 99.7% of the pan-genome being variable. Horizontal gene transfer appears to have been an important evolutionary driver of genetic diversity shaping the Nocardia genome and may have caused historical taxonomic confusion from other taxa (primarily Rhodococcus, Skermania, Aldersonia, and Mycobacterium). Based on single-copy gene families, we established a high-accuracy phylogenomic approach for Nocardia using 229 genome sequences. Furthermore, we found 28 potentially new species and reclassified 16 strains. Finally, by comparing the topology between a phylogenomic tree and 384 phylogenetic trees (from 384 single-copy genes from the core genome), we identified a novel locus for inferring the phylogeny of this genus. The dapb1 gene, which encodes dipeptidyl aminopeptidase BI, was far superior to commonly used markers for Nocardia and yielded a topology almost identical to that of genome-based phylogeny. In conclusion, the present study provides insights into the genetic diversity, contributes a robust framework for the taxonomic classification, and elucidates the evolutionary relationships of Nocardia. This framework should facilitate the development of rapid tests for the species identification of highly variable species and has given new insight into the behavior of this genus.


Sign in / Sign up

Export Citation Format

Share Document