Homology to peptide pattern for annotation of carbohydrate-active enzymes and prediction of function

ABSTRACTFunctional prediction of carbohydrate-active enzymes is difficult due to low sequence identity. However, similar enzymes often share a few short motifs, e.g., around the active site, even when the overall sequences are very different. To exploit this notion for functional prediction of carbohydrate-active enzymes, we developed a simple algorithm, peptide pattern recognition (PPR), that can divide proteins into groups of sequences that share a set of short conserved sequences. When this method was used on 118 glycoside hydrolase 5 proteins with 9% average pairwise identity and representing four characterized enzymatic functions, 97% of the proteins were sorted into groups correlating with their enzymatic activity. Furthermore, we analyzed 8,138 glycoside hydrolase 13 proteins including 204 experimentally characterized enzymes with 28 different functions. There was a 91% correlation between group and enzyme activity. These results indicate that the function of carbohydrate-active enzymes can be predicted with high precision by finding short, conserved motifs in their sequences. The glycoside hydrolase 61 family is important for fungal biomass conversion, but only a few proteins of this family have been functionally characterized. Interestingly, PPR divided 743 glycoside hydrolase 61 proteins into 16 subfamilies useful for targeted investigation of the function of these proteins and pinpointed three conserved motifs with putative importance for enzyme activity. Furthermore, the conserved sequences were useful for cloning of new, subfamily-specific glycoside hydrolase 61 proteins from 14 fungi. In conclusion, identification of conserved sequence motifs is a new approach to sequence analysis that can predict carbohydrate-active enzyme functions with high precision.

Download Full-text

New Method for Identifying Fungal Kingdom Enzyme Hotspots from Genome Sequences

Journal of Fungi ◽

10.3390/jof7030207 ◽

2021 ◽

Vol 7 (3) ◽

pp. 207

Author(s):

Lene Lange ◽

Kristian Barrett ◽

Anne S. Meyer

Keyword(s):

Fungal Species ◽

Fungal Genome ◽

Sequencing Data ◽

Carbohydrate Active Enzymes ◽

New Approach ◽

Fungal Enzyme ◽

Biomass Processing ◽

Fungal Genomes ◽

Robust Prediction ◽

Fungal Kingdom

Fungal genome sequencing data represent an enormous pool of information for enzyme discovery. Here, we report a new approach to identify and quantitatively compare biomass-degrading capacity and diversity of fungal genomes via integrated function-family annotation of carbohydrate-active enzymes (CAZymes) encoded by the genomes. Based on analyses of 1932 fungal genomes the most potent hotspots of fungal biomass processing CAZymes are identified and ranked according to substrate degradation capacity. The analysis is achieved by a new bioinformatics approach, Conserved Unique Peptide Patterns (CUPP), providing for CAZyme-family annotation and robust prediction of molecular function followed by conversion of the CUPP output to lists of integrated “Function;Family” (e.g., EC 3.2.1.4;GH5) enzyme observations. An EC-function found in several protein families counts as different observations. Summing up such observations allows for ranking of all analyzed genome sequenced fungal species according to richness in CAZyme function diversity and degrading capacity. Identifying fungal CAZyme hotspots provides for identification of fungal species richest in cellulolytic, xylanolytic, pectinolytic, and lignin modifying enzymes. The fungal enzyme hotspots are found in fungi having very different lifestyle, ecology, physiology and substrate/host affinity. Surprisingly, most CAZyme hotspots are found in enzymatically understudied and unexploited species. In contrast, the most well-known fungal enzyme producers, from where many industrially exploited enzymes are derived, are ranking unexpectedly low. The results contribute to elucidating the evolution of fungal substrate-digestive CAZyme profiles, ecophysiology, and habitat adaptations, and expand the knowledge base for novel and improved biomass resource utilization.

Download Full-text

Bisphenol A alters volatile fatty acids accumulation during sludge anaerobic fermentation by affecting amino acid metabolism, material transport and carbohydrate-active enzymes

Bioresource Technology ◽

10.1016/j.biortech.2020.124588 ◽

2021 ◽

Vol 323 ◽

pp. 124588

Author(s):

Xiupeng Jiang ◽

Yuanyuan Yan ◽

Leiyu Feng ◽

Feng Wang ◽

Yingqing Guo ◽

...

Keyword(s):

Fatty Acids ◽

Amino Acid ◽

Bisphenol A ◽

Amino Acid Metabolism ◽

Volatile Fatty Acids ◽

Acid Metabolism ◽

Anaerobic Fermentation ◽

Material Transport ◽

Carbohydrate Active Enzymes

Download Full-text

68 Comparative transcriptomic analysis of the carbohydrate active enzymes that are expressed to degrade plant cell wall carbohydrates by diverse genera of anaerobic fungi isolated from the rumen.

Journal of Animal Science ◽

10.1093/jas/sky404.868 ◽

2018 ◽

Vol 96 (suppl_3) ◽

pp. 395-396

Author(s):

R Gruninger ◽

T Nguyen ◽

I Reid ◽

J Yanke ◽

P Wang ◽

...

Keyword(s):

Cell Wall ◽

Plant Cell ◽

Plant Cell Wall ◽

Transcriptomic Analysis ◽

Anaerobic Fungi ◽

Carbohydrate Active Enzymes ◽

Cell Wall Carbohydrates ◽

Comparative Transcriptomic Analysis

Download Full-text

Nucleus-specific expression in the multinuclear mushroom-forming fungusAgaricus bisporusreveals different nuclear regulatory programs

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.1721381115 ◽

2018 ◽

Vol 115 (17) ◽

pp. 4429-4434 ◽

Cited By ~ 15

Author(s):

Thies Gehrmann ◽

Jordi F. Pelkmans ◽

Robin A. Ohm ◽

Aurin M. Vos ◽

Anton S. M. Sonnenberg ◽

...

Keyword(s):

Agaricus Bisporus ◽

Sequencing Data ◽

Carbohydrate Active Enzymes ◽

Nuclear Level ◽

Specific Expression ◽

Phenotypic Differences ◽

Gene Level ◽

Nuclear Separation ◽

Insight Into ◽

Specific Mrna

Many fungi are polykaryotic, containing multiple nuclei per cell. In the case of heterokaryons, there are different nuclear types within a single cell. It is unknown what the different nuclear types contribute in terms of mRNA expression levels in fungal heterokaryons. Each cell of the mushroomAgaricus bisporuscontains two to 25 nuclei of two nuclear types originating from two parental strains. Using RNA-sequencing data, we assess the differential mRNA contribution of individual nuclear types and its functional impact. We studied differential expression between genes of the two nuclear types, P1 and P2, throughout mushroom development in various tissue types. P1 and P2 produced specific mRNA profiles that changed through mushroom development. Differential regulation occurred at the gene level, rather than at the locus, chromosomal, or nuclear level. P1 dominated mRNA production throughout development, and P2 showed more differentially up-regulated genes in important functional groups. In the vegetative mycelium, P2 up-regulated almost threefold more metabolism genes and carbohydrate active enzymes (cazymes) than P1, suggesting phenotypic differences in growth. We identified widespread transcriptomic variation between the nuclear types ofA. bisporus. Our method enables studying nucleus-specific expression, which likely influences the phenotype of a fungus in a polykaryotic stage. Our findings have a wider impact to better understand gene regulation in fungi in a heterokaryotic state. This work provides insight into the transcriptomic variation introduced by genomic nuclear separation.

Download Full-text

Functional Diversity of Carbohydrate-Active Enzymes Enabling a Bacterium to Ferment Plant Biomass

PLoS Genetics ◽

10.1371/journal.pgen.1004773 ◽

2014 ◽

Vol 10 (11) ◽

pp. e1004773 ◽

Cited By ~ 19

Author(s):

Magali Boutard ◽

Tristan Cerisy ◽

Pierre-Yves Nogue ◽

Adriana Alberti ◽

Jean Weissenbach ◽

...

Keyword(s):

Functional Diversity ◽

Plant Biomass ◽

Carbohydrate Active Enzymes

Download Full-text

Contribution of Domain Interactions and Calcium Binding to the Stability of Carbohydrate-Active Enzymes

Bioenergy ◽

10.1128/9781555815547.ch9 ◽

2014 ◽

pp. 115-127

Author(s):

Lars G. Ljungdahl ◽

Irina A. Kataeva ◽

Vladimir N. Uversky

Keyword(s):

Calcium Binding ◽

Carbohydrate Active Enzymes ◽

Domain Interactions ◽

The Stability

Download Full-text

Carbohydrate-Active Enzymes Database, Metagenomic Expert Resource

Encyclopedia of Metagenomics ◽

10.1007/978-1-4614-6418-1_25-10 ◽

2012 ◽

pp. 1-7

Author(s):

Brandi Cantarel ◽

Pedro Coutinho ◽

Bernard Henrissat

Keyword(s):

Carbohydrate Active Enzymes

Download Full-text

Conserved Peptides Recognition by Ensemble of Neural Networks for Mining Protein Data – LPMO Case Study

Математическая биология и биоинформатика ◽

10.17537/2020.15.429 ◽

2020 ◽

Vol 15 (2) ◽

pp. 429-440

Author(s):

G.S. Dotsenko ◽

A.S. Dotsenko

Keyword(s):

Neural Networks ◽

Amino Acid ◽

Markov Models ◽

Amino Acid Sequences ◽

Peptide Pattern ◽

Novel Approach ◽

Polysaccharide Monooxygenases ◽

Promising Area ◽

Peptide Pattern Recognition ◽

Bacterial Proteomes

Mining protein data is a recent promising area of modern bioinformatics. In this work, we suggested a novel approach for mining protein data – conserved peptides recognition by ensemble of neural networks (CPRENN). This approach was applied for mining lytic polysaccharide monooxygenases (LPMOs) in 19 ascomycete, 18 basidiomycete, and 18 bacterial proteomes. LPMOs are recently discovered enzymes and their mining is of high relevance for biotechnology of lignocellulosic materials. CPRENN was compared with two conventional bioinformatic methods for mining protein data – profile hidden Markov models (HMMs) search (HMMER program) and peptide pattern recognition (PPR program combined with Hotpep application). The maximum number of hypothetical LPMO amino acid sequences was discovered by HMMER. Profile HMMs search proved to be more sensitive method for mining LPMOs than conserved peptides recognition. Totally, CPRENN found 76 %, 67 %, and 65 % of hypothetical ascomycete, basidiomycete, and bacterial LPMOs discovered by HMMER, respectively. For AA9, AA10, and AA11 families which contain the major part of all LPMOs in the carbohydrate-active enzymes database (CAZy), CPRENN and PPR + Hotpep found 69–98 % and 62–95 % of amino acid sequences discovered by HMMER, respectively. In contrast with PPR + Hotpep, CPRENN possessed perfect precision and provided more complete mining of basidiomycete and bacterial LPMOs.

Download Full-text

Identification of an Oligosaccharide Dehydrogenase from Pycnoporus Cinnabarinus Provides Insights into Fungal Breakdown of Lignocellulose

10.21203/rs.3.rs-237113/v1 ◽

2021 ◽

Author(s):

Gabriele Cerutti ◽

Elena Gugole ◽

Linda Celeste Montemiglio ◽

Annick Turbé-Doan ◽

Dehbia Chena ◽

...

Keyword(s):

Function Analysis ◽

Sequence Similarity ◽

Functional Characterization ◽

Physiological Role ◽

Free Form ◽

Lignocellulose Degradation ◽

Carbohydrate Active Enzymes ◽

Recognition Mechanism ◽

Pycnoporus Cinnabarinus ◽

Enzymatic Function

Abstract Background: Fungal glucose dehydrogenases (GDHs) are FAD-dependent enzymes belonging to the glucose-methanol-choline oxidoreductase superfamily. These enzymes are classified in the “Auxiliary Activity” family 3 (AA3) of the Carbohydrate-Active enZymes database, and more specifically in subfamily AA3_2, that also includes the closely related flavoenzymes aryl-alcohol oxidase and glucose 1-oxidase. Based on sequence similarity to known fungal GDHs, an AA3_2 enzyme active on glucose was identified in the genome of Pycnoporus cinnabarinus, a model Basidiomycete able to completely degrade lignin.Results: In our work, substrate screening and functional characterization showed an unexpected preferential activity of this enzyme toward oligosaccharides containing a b(1à3) glycosidic bond, with the highest efficiency observed for the disaccharide laminaribiose. Despite its sequence similarity to GDHs, we defined a novel enzymatic activity, namely oligosaccharide dehydrogenase (ODH), for this enzyme. The crystallographic structures of ODH in the sugar-free form and in complex with glucose and laminaribiose unveiled a peculiar saccharide recognition mechanism which is not shared with previously characterized AA3 oxidoreductases and accounts for ODH preferential activity toward oligosaccharides. The sugar molecules in the active site of ODH are mainly stabilized through CH-p interactions with aromatic residues rather than through hydrogen bonds with highly conserved residues, as observed instead for the fungal glucose dehydrogenases and oxidases characterized to date. Finally, three sugar-binding sites were identified on ODH external surface, which were not previously observed and might be of importance in the physiological scenario.Conclusions: Structure-function analysis of ODH is consistent with its role as an auxiliary enzyme in lignocellulose degradation and unveils yet another enzymatic function within the AA3 family of the Carbohydrate-Active enZymes database. Our findings allow deciphering the molecular determinants of substrate binding and provide insight into the physiological role of ODH, opening new perspectives to exploit biodiversity for lignocellulose transformation into fuels and chemicals.

Download Full-text