scholarly journals ClassiPhages 2.0: Sequence-based classification of phages using Artificial Neural Networks

2019 ◽  
Author(s):  
Cynthia Maria Chibani ◽  
Florentin Meinecke ◽  
Anton Farr ◽  
Sascha Dietrich ◽  
Heiko Liesegang

AbstractBackground/ MotivationIn the era of affordable next generation sequencing technologies we are facing an exploding amount of new phage genome sequences. This requests high throughput phage classification tools that meet the standards of the International Committee on Taxonomy of Viruses (ICTV). However, an accurate prediction of phage taxonomic classification derived from phage sequences still poses a challenge due to the lack of performant taxonomic markers. Since machine learning methods have proved to be efficient for the classification of biological data we investigated how artificial neural networks perform on the task of phage taxonomy.ResultsIn this work, 5,920 constructed and refined profile Hidden Markov Models (HMMs), derived from 8,721 phage sequences classified into 12 well known phage families, were used to scan phage proteome datasets. The resulting Phage Family-proteome to Phage-derived-HMMs scoring matrix was used to develop and train an Artificial Neural Network (ANN) to find patterns for phage classification into one of the phage families. Results show that using the 100 fold cross-validation test, the proposed method achieved an overall accuracy of 84.18 %. The ANN was tested on a set of unclassified phages and resulted in a taxonomic prediction. The ANN prediction was benchmarked against the prediction resulting of multi-HMM hits, and showed that the ANN performance is dependent on the quality of the input matrix.ConclusionsWe believe that, as long as some phage families on public databases are underrepresented, multi-HMM hits can be used as a classification method to populate those phage families, which in turn will improve the performance and accuracy of the ANN. We believe that the proposed method is an effective and promising method for phage classification. The good performance of the ANN and HMM based predictor indicates the efficiency of the method for phage classification, where we foresee its improvement with an increasing number of sequenced viral genomes.

2021 ◽  
Vol 23 ◽  
pp. 100313
Author(s):  
Nicholas A. Thurn ◽  
Taylor Wood ◽  
Mary R. Williams ◽  
Michael E. Sigman

2017 ◽  
Vol 70 (4) ◽  
pp. 492-498 ◽  
Author(s):  
Leandro S Santos ◽  
Roberta M D Cardozo ◽  
Natália Moreiria Nunes ◽  
Andréia B Inácio ◽  
Ana Clarissa dos S Pires ◽  
...  

2006 ◽  
Vol 41 (3) ◽  
pp. 257-263 ◽  
Author(s):  
Robespierre Santos ◽  
Horst G. Haack ◽  
Des Maddalena ◽  
Ross D. Hansen ◽  
John E. Kellow

2016 ◽  
Vol 19 (1) ◽  
pp. 49-59 ◽  
Author(s):  
Nina Pavlin-Bernardić ◽  
◽  
Silvija Ravić ◽  
Ivan Pavao Matić ◽  
◽  
...  

Artificial neural networks have a wide use in the prediction and classification of different variables, but their application in the area of educational psychology is still relatively rare. The aim of this study was to examine the accuracy of artificial neural networks in predicting students’ general giftedness. The participants were 221 fourth grade students from one Croatian elementary school. The input variables for artificial neural networks were teachers’ and peers’ nominations, school grades, earlier school readiness assessment and parents’ education. The output variable was the result on the Standard Progressive Matrices (Raven, 1994), according to which students were classified as gifted or non-gifted. We tested two artificial neural networks’ algorithms: multilayer perceptron and radial basis function. Within each algorithm, a number of different types of activation functions were tested. 80% of the sample was used for training the network and the remaining 20% to test the network. For a criterion according to which students were classified as gifted if their result on the Standard Progressive Matrices was in the 95th centile or above, the best model was obtained by the hyperbolic tangent multilayer perceptron, which had a high accuracy of 100% of correctly classified non-gifted students and 75% correctly classified gifted students in the test sample. When the criterion was the 90th centile or above, the best model was also obtained by the hyperbolic tangent multilayer perceptron, but the accuracy was lower: 94.7% in the classification of non-gifted students and 66.7% in the classification of gifted students. The study has shown artificial neural networks’ potential in this area, which should be further explored. Keywords: gifted students, identification of gifted students, artificial neural networks


1996 ◽  
Vol 57 (2) ◽  
pp. 79-87 ◽  
Author(s):  
Abdelgadir A. Abuelgasim ◽  
Sucharita Gopal ◽  
James R. Irons ◽  
Alan H. Strahler

Sign in / Sign up

Export Citation Format

Share Document