Classification of spammer and nonspammer content in online social network using genetic algorithm-based feature selection

2020 ◽  
Vol 14 (5) ◽  
pp. 710-736
Author(s):  
Somya Ranjan Sahoo ◽  
B. B. Gupta
Author(s):  
Sourav Das ◽  
Anup Kumar Kolya ◽  
Dipankar Das

Twitter-based research for sentiment analysis is popular for quite some time now. This is used to represent documents in a corpus usually. This increases the time of classification and also increases space complexity. It is hence very natural to say that non-redundant feature reduction of the input space for a classifier will improve the generalization property of a classifier. In this approach, the researchers have tried to do feature selection using Genetic Algorithm (GA) which will reduce the set of features into a smaller subset. The researchers have also tried to put forward an approach using Genetic Algorithm to reduce the modelling complexity and training time of classification algorithm for 10k Twitter data based on GST. They aim to improve the accuracy of the classification that the researchers have obtained in a preface work to this work and achieved an accuracy of 87% through this work. Hence the Genetic Algorithm will do the feature selection to reduce the complexity of the classifier and give us a better accuracy of the classification of the tweet.


Sign in / Sign up

Export Citation Format

Share Document