Predicting HIV-1 Protease Cleavage Sites With Positive-Unlabeled Learning

Understanding the substrate specificity of HIV-1 protease plays an essential role in the prevention of HIV infection. A variety of computational models have thus been developed to predict substrate sites that are cleaved by HIV-1 protease, but most of them normally follow a supervised learning scheme to build classifiers by considering experimentally verified cleavable sites as positive samples and unknown sites as negative samples. However, certain noisy can be contained in the negative set, as false negative samples are possibly existed. Hence, the performance of the classifiers is not as accurate as they could be due to the biased prediction results. In this work, unknown substrate sites are regarded as unlabeled samples instead of negative ones. We propose a novel positive-unlabeled learning algorithm, namely PU-HIV, for an effective prediction of HIV-1 protease cleavage sites. Features used by PU-HIV are encoded from different perspectives of substrate sequences, including amino acid identities, coevolutionary patterns and chemical properties. By adjusting the weights of errors generated by positive and unlabeled samples, a biased support vector machine classifier can be built to complete the prediction task. In comparison with state-of-the-art prediction models, benchmarking experiments using cross-validation and independent tests demonstrated the superior performance of PU-HIV in terms of AUC, PR-AUC, and F-measure. Thus, with PU-HIV, it is possible to identify previously unknown, but physiologically existed substrate sites that are able to be cleaved by HIV-1 protease, thus providing valuable insights into designing novel HIV-1 protease inhibitors for HIV treatment.

Download Full-text

An Ensemble Learning Algorithm for Predicting HIV-1 Protease Cleavage Sites

Intelligent Computing Theories and Application - Lecture Notes in Computer Science ◽

10.1007/978-3-030-84532-2_46 ◽

2021 ◽

pp. 509-521

Author(s):

Zhenfeng Li ◽

Pengwei Hu ◽

Lun Hu

Keyword(s):

Ensemble Learning ◽

Learning Algorithm ◽

Cleavage Sites ◽

Protease Cleavage ◽

Protease Cleavage Sites ◽

Ensemble Learning Algorithm ◽

Hiv 1

Download Full-text

The identification of variable-length coevolutionary patterns for predicting HIV-1 protease cleavage sites

2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc42975.2020.9283082 ◽

2020 ◽

Author(s):

Zhenfeng Li ◽

Lun Hu

Keyword(s):

Variable Length ◽

Cleavage Sites ◽

Protease Cleavage ◽

Protease Cleavage Sites ◽

Hiv 1

Download Full-text

Detection of HIV-1 Protease Cleavage Sites via Hidden Markov Model and Physicochemical Properties of Amino Acids

Nonlinear Systems and Complexity - Numerical Solutions of Realistic Nonlinear Phenomena ◽

10.1007/978-3-030-37141-8_10 ◽

2020 ◽

pp. 171-193

Author(s):

Elif Doğan Dar ◽

Vilda Purutçuoğlu ◽

Eda Purutçuoğlu

Keyword(s):

Amino Acids ◽

Markov Model ◽

Physicochemical Properties ◽

Hidden Markov Model ◽

Hidden Markov ◽

Cleavage Sites ◽

Protease Cleavage ◽

Protease Cleavage Sites ◽

Hiv 1

Download Full-text

Protease Cleavage Sites in HIV-1 gp120 Recognized by Antigen Processing Enzymes Are Conserved and Located at Receptor Binding Sites

Journal of Virology ◽

10.1128/jvi.01765-09 ◽

2009 ◽

Vol 84 (3) ◽

pp. 1513-1526 ◽

Cited By ~ 19

Author(s):

Bin Yu ◽

Dora P. A. J. Fonseca ◽

Sara M. O'Rourke ◽

Phillip W. Berman

Keyword(s):

Immune Response ◽

Receptor Binding ◽

Neutralizing Antibodies ◽

Antigen Processing ◽

Cleavage Sites ◽

Antiviral Immune Response ◽

Processing Enzymes ◽

Protease Cleavage ◽

Protease Cleavage Sites ◽

Hiv 1

ABSTRACT The identification of vaccine immunogens able to elicit broadly neutralizing antibodies (bNAbs) is a major goal in HIV vaccine research. Although it has been possible to produce recombinant envelope glycoproteins able to adsorb bNAbs from HIV-positive sera, immunization with these proteins has failed to elicit antibody responses effective against clinical isolates of HIV-1. Thus, the epitopes recognized by bNAbs are present on recombinant proteins, but they are not immunogenic. These results led us to consider the possibility that changes in the pattern of antigen processing might alter the immune response to the envelope glycoprotein to better elicit protective immunity. In these studies, we have defined protease cleavage sites on HIV gp120 recognized by three major human proteases (cathepsins L, S, and D) important for antigen processing and presentation. Remarkably, six of the eight sites identified in gp120 were highly conserved and clustered in regions of the molecule associated with receptor binding and/or the binding of neutralizing antibodies. These results suggested that HIV may have evolved to take advantage of major histocompatibility complex (MHC) class II antigen processing enzymes in order to evade or direct the antiviral immune response.

Download Full-text