scholarly journals Homology to peptide pattern for annotation of carbohydrate-active enzymes and prediction of function

2017 ◽  
Vol 18 (1) ◽  
Author(s):  
P. K. Busk ◽  
B. Pilgaard ◽  
M. J. Lezyk ◽  
A. S. Meyer ◽  
L. Lange
2013 ◽  
Vol 79 (11) ◽  
pp. 3380-3391 ◽  
Author(s):  
Peter Kamp Busk ◽  
Lene Lange

ABSTRACTFunctional prediction of carbohydrate-active enzymes is difficult due to low sequence identity. However, similar enzymes often share a few short motifs, e.g., around the active site, even when the overall sequences are very different. To exploit this notion for functional prediction of carbohydrate-active enzymes, we developed a simple algorithm, peptide pattern recognition (PPR), that can divide proteins into groups of sequences that share a set of short conserved sequences. When this method was used on 118 glycoside hydrolase 5 proteins with 9% average pairwise identity and representing four characterized enzymatic functions, 97% of the proteins were sorted into groups correlating with their enzymatic activity. Furthermore, we analyzed 8,138 glycoside hydrolase 13 proteins including 204 experimentally characterized enzymes with 28 different functions. There was a 91% correlation between group and enzyme activity. These results indicate that the function of carbohydrate-active enzymes can be predicted with high precision by finding short, conserved motifs in their sequences. The glycoside hydrolase 61 family is important for fungal biomass conversion, but only a few proteins of this family have been functionally characterized. Interestingly, PPR divided 743 glycoside hydrolase 61 proteins into 16 subfamilies useful for targeted investigation of the function of these proteins and pinpointed three conserved motifs with putative importance for enzyme activity. Furthermore, the conserved sequences were useful for cloning of new, subfamily-specific glycoside hydrolase 61 proteins from 14 fungi. In conclusion, identification of conserved sequence motifs is a new approach to sequence analysis that can predict carbohydrate-active enzyme functions with high precision.


2021 ◽  
Vol 7 (3) ◽  
pp. 207
Author(s):  
Lene Lange ◽  
Kristian Barrett ◽  
Anne S. Meyer

Fungal genome sequencing data represent an enormous pool of information for enzyme discovery. Here, we report a new approach to identify and quantitatively compare biomass-degrading capacity and diversity of fungal genomes via integrated function-family annotation of carbohydrate-active enzymes (CAZymes) encoded by the genomes. Based on analyses of 1932 fungal genomes the most potent hotspots of fungal biomass processing CAZymes are identified and ranked according to substrate degradation capacity. The analysis is achieved by a new bioinformatics approach, Conserved Unique Peptide Patterns (CUPP), providing for CAZyme-family annotation and robust prediction of molecular function followed by conversion of the CUPP output to lists of integrated “Function;Family” (e.g., EC 3.2.1.4;GH5) enzyme observations. An EC-function found in several protein families counts as different observations. Summing up such observations allows for ranking of all analyzed genome sequenced fungal species according to richness in CAZyme function diversity and degrading capacity. Identifying fungal CAZyme hotspots provides for identification of fungal species richest in cellulolytic, xylanolytic, pectinolytic, and lignin modifying enzymes. The fungal enzyme hotspots are found in fungi having very different lifestyle, ecology, physiology and substrate/host affinity. Surprisingly, most CAZyme hotspots are found in enzymatically understudied and unexploited species. In contrast, the most well-known fungal enzyme producers, from where many industrially exploited enzymes are derived, are ranking unexpectedly low. The results contribute to elucidating the evolution of fungal substrate-digestive CAZyme profiles, ecophysiology, and habitat adaptations, and expand the knowledge base for novel and improved biomass resource utilization.


2018 ◽  
Vol 115 (17) ◽  
pp. 4429-4434 ◽  
Author(s):  
Thies Gehrmann ◽  
Jordi F. Pelkmans ◽  
Robin A. Ohm ◽  
Aurin M. Vos ◽  
Anton S. M. Sonnenberg ◽  
...  

Many fungi are polykaryotic, containing multiple nuclei per cell. In the case of heterokaryons, there are different nuclear types within a single cell. It is unknown what the different nuclear types contribute in terms of mRNA expression levels in fungal heterokaryons. Each cell of the mushroomAgaricus bisporuscontains two to 25 nuclei of two nuclear types originating from two parental strains. Using RNA-sequencing data, we assess the differential mRNA contribution of individual nuclear types and its functional impact. We studied differential expression between genes of the two nuclear types, P1 and P2, throughout mushroom development in various tissue types. P1 and P2 produced specific mRNA profiles that changed through mushroom development. Differential regulation occurred at the gene level, rather than at the locus, chromosomal, or nuclear level. P1 dominated mRNA production throughout development, and P2 showed more differentially up-regulated genes in important functional groups. In the vegetative mycelium, P2 up-regulated almost threefold more metabolism genes and carbohydrate active enzymes (cazymes) than P1, suggesting phenotypic differences in growth. We identified widespread transcriptomic variation between the nuclear types ofA. bisporus. Our method enables studying nucleus-specific expression, which likely influences the phenotype of a fungus in a polykaryotic stage. Our findings have a wider impact to better understand gene regulation in fungi in a heterokaryotic state. This work provides insight into the transcriptomic variation introduced by genomic nuclear separation.


PLoS Genetics ◽  
2014 ◽  
Vol 10 (11) ◽  
pp. e1004773 ◽  
Author(s):  
Magali Boutard ◽  
Tristan Cerisy ◽  
Pierre-Yves Nogue ◽  
Adriana Alberti ◽  
Jean Weissenbach ◽  
...  

Author(s):  
Brandi Cantarel ◽  
Pedro Coutinho ◽  
Bernard Henrissat

Author(s):  
G.S. Dotsenko ◽  
A.S. Dotsenko

Mining protein data is a recent promising area of modern bioinformatics. In this work, we suggested a novel approach for mining protein data – conserved peptides recognition by ensemble of neural networks (CPRENN). This approach was applied for mining lytic polysaccharide monooxygenases (LPMOs) in 19 ascomycete, 18 basidiomycete, and 18 bacterial proteomes. LPMOs are recently discovered enzymes and their mining is of high relevance for biotechnology of lignocellulosic materials. CPRENN was compared with two conventional bioinformatic methods for mining protein data – profile hidden Markov models (HMMs) search (HMMER program) and peptide pattern recognition (PPR program combined with Hotpep application). The maximum number of hypothetical LPMO amino acid sequences was discovered by HMMER. Profile HMMs search proved to be more sensitive method for mining LPMOs than conserved peptides recognition. Totally, CPRENN found 76 %, 67 %, and 65 % of hypothetical ascomycete, basidiomycete, and bacterial LPMOs discovered by HMMER, respectively. For AA9, AA10, and AA11 families which contain the major part of all LPMOs in the carbohydrate-active enzymes database (CAZy), CPRENN and PPR + Hotpep found 69–98 % and 62–95 % of amino acid sequences discovered by HMMER, respectively. In contrast with PPR + Hotpep, CPRENN possessed perfect precision and provided more complete mining of basidiomycete and bacterial LPMOs.


2021 ◽  
Author(s):  
Gabriele Cerutti ◽  
Elena Gugole ◽  
Linda Celeste Montemiglio ◽  
Annick Turbé-Doan ◽  
Dehbia Chena ◽  
...  

Abstract Background: Fungal glucose dehydrogenases (GDHs) are FAD-dependent enzymes belonging to the glucose-methanol-choline oxidoreductase superfamily. These enzymes are classified in the “Auxiliary Activity” family 3 (AA3) of the Carbohydrate-Active enZymes database, and more specifically in subfamily AA3_2, that also includes the closely related flavoenzymes aryl-alcohol oxidase and glucose 1-oxidase. Based on sequence similarity to known fungal GDHs, an AA3_2 enzyme active on glucose was identified in the genome of Pycnoporus cinnabarinus, a model Basidiomycete able to completely degrade lignin.Results: In our work, substrate screening and functional characterization showed an unexpected preferential activity of this enzyme toward oligosaccharides containing a b(1à3) glycosidic bond, with the highest efficiency observed for the disaccharide laminaribiose. Despite its sequence similarity to GDHs, we defined a novel enzymatic activity, namely oligosaccharide dehydrogenase (ODH), for this enzyme. The crystallographic structures of ODH in the sugar-free form and in complex with glucose and laminaribiose unveiled a peculiar saccharide recognition mechanism which is not shared with previously characterized AA3 oxidoreductases and accounts for ODH preferential activity toward oligosaccharides. The sugar molecules in the active site of ODH are mainly stabilized through CH-p interactions with aromatic residues rather than through hydrogen bonds with highly conserved residues, as observed instead for the fungal glucose dehydrogenases and oxidases characterized to date. Finally, three sugar-binding sites were identified on ODH external surface, which were not previously observed and might be of importance in the physiological scenario.Conclusions: Structure-function analysis of ODH is consistent with its role as an auxiliary enzyme in lignocellulose degradation and unveils yet another enzymatic function within the AA3 family of the Carbohydrate-Active enZymes database. Our findings allow deciphering the molecular determinants of substrate binding and provide insight into the physiological role of ODH, opening new perspectives to exploit biodiversity for lignocellulose transformation into fuels and chemicals.


Sign in / Sign up

Export Citation Format

Share Document