new protein families
Recently Published Documents


TOTAL DOCUMENTS

8
(FIVE YEARS 3)

H-INDEX

5
(FIVE YEARS 2)

PLoS ONE ◽  
2020 ◽  
Vol 15 (4) ◽  
pp. e0231085 ◽  
Author(s):  
Arturo Medrano-Soto ◽  
Faezeh Ghazi ◽  
Kevin J. Hendargo ◽  
Gabriel Moreno-Hagelsieb ◽  
Scott Myers ◽  
...  

2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Yan Wang ◽  
Qiang Shi ◽  
Pengshuo Yang ◽  
Chengxin Zhang ◽  
S. M. Mortuza ◽  
...  

Abstract Introduction The ocean microbiome represents one of the largest microbiomes and produces nearly half of the primary energy on the planet through photosynthesis or chemosynthesis. Using recent advances in marine genomics, we explore new applications of oceanic metagenomes for protein structure and function prediction. Results By processing 1.3 TB of high-quality reads from the Tara Oceans data, we obtain 97 million non-redundant genes. Of the 5721 Pfam families that lack experimental structures, 2801 have at least one member associated with the oceanic metagenomics dataset. We apply C-QUARK, a deep-learning contact-guided ab initio structure prediction pipeline, to model 27 families, where 20 are predicted to have a reliable fold with estimated template modeling score (TM-score) at least 0.5. Detailed analyses reveal that the abundance of microbial genera in the ocean is highly correlated to the frequency of occurrence in the modeled Pfam families, suggesting the significant role of the Tara Oceans genomes in the contact-map prediction and subsequent ab initio folding simulations. Of interesting note, PF15461, which has a majority of members coming from ocean-related bacteria, is identified as an important photosynthetic protein by structure-based function annotations. The pipeline is extended to a set of 417 Pfam families, built on the combination of Tara with other metagenomics datasets, which results in 235 families with an estimated TM-score over 0.5. Conclusions These results demonstrate a new avenue to improve the capacity of protein structure and function modeling through marine metagenomics, especially for difficult proteins with few homologous sequences.


2017 ◽  
Author(s):  
André Veríssimo ◽  
Jean-Etienne Bassard ◽  
Alice Julien-Laferrière ◽  
Marie-France Sagot ◽  
Susana Vinga

AbstractSummaryCurrent workflows for sequence analysis heavily depend on user input and manual curation. New specialized tools and methods are appearing all the time, but the actions required for a full analysis are disconnected and very time-consuming. The software we propose, MassBlast, combines BLAST+ and an automated workflow analysis to filter the results and significantly improve the annotation of multiple sequencing databases for exploring new biosynthetic pathways and new protein families, among other applications. MassBlast is fully configurable and reproducible.Availability and ImplementationThe MassBlast package is written in Ruby. Source code and releases are freely available from Github (https://github.com/averissimo/mass-blast) for all major platforms (Linux, MS Windows and OS X) under the GPLv3 [email protected]


2013 ◽  
Vol 14 (2) ◽  
pp. R16 ◽  
Author(s):  
Mario Looso ◽  
Jens Preussner ◽  
Konstantinos Sousounis ◽  
Marc Bruckskotten ◽  
Christian S Michel ◽  
...  

2002 ◽  
Vol 184 (22) ◽  
pp. 6301-6315 ◽  
Author(s):  
Johannes Gescher ◽  
Annette Zaar ◽  
Magdy Mohamed ◽  
Hermann Schägger ◽  
Georg Fuchs

ABSTRACT A new pathway for aerobic benzoate oxidation has been postulated for Azoarcus evansii and for a Bacillus stearothermophilus-like strain. Benzoate is first transformed into benzoyl coenzyme A (benzoyl-CoA), which subsequently is oxidized to 3-hydroxyadipyl-CoA and then to 3-ketoadipyl-CoA; all intermediates are CoA thioesters. The genes coding for this benzoate-induced pathway were investigated in the β-proteobacterium A. evansii. They were identified on the basis of N-terminal amino acid sequences of purified benzoate metabolic enzymes and of benzoate-induced proteins identified on two-dimensional gels. Fifteen genes probably coding for the benzoate pathway were found to be clustered on the chromosome. These genes code for the following functions: a putative ATP-dependent benzoate transport system, benzoate-CoA ligase, a putative benzoyl-CoA oxygenase, a putative isomerizing enzyme, a putative ring-opening enzyme, enzymes for β-oxidation of CoA-activated intermediates, thioesterase, and lactone hydrolase, as well as completely unknown enzymes belonging to new protein families. An unusual putative regulator protein consists of a regulator protein and a shikimate kinase I-type domain. A deletion mutant with a deletion in one gene (boxA) was unable to grow with benzoate as the sole organic substrate, but it was able to grow with 3-hydroxybenzoate and adipate. The data support the proposed pathway, which postulates operation of a new type of ring-hydroxylating dioxygenase acting on benzoyl-CoA and nonoxygenolytic ring cleavage. A β-oxidation-like metabolism of the ring cleavage product is thought to lead to 3-ketoadipyl-CoA, which finally is cleaved into succinyl-CoA and acetyl-CoA.


Sign in / Sign up

Export Citation Format

Share Document