scholarly journals English Sentiment Classification using Only the Sentiment Lexicons with a JOHNSON Coefficient in a Parallel Network Environment

2018 ◽  
Vol 11 (1) ◽  
pp. 38-65 ◽  
Author(s):  
Vo Ngoc Phu ◽  
Vo Thi Ngoc Tran
Author(s):  
Nguyen Duy Dat ◽  
Vo Ngoc Phu ◽  
Vo Thi Ngoc Tran ◽  
Vo Thi Ngoc Chau ◽  
Tuan A. Nguyen

Sentiment classification is significant in everyday life of everyone, in political activities, activities of commodity production, commercial activities. In this research, we propose a new model for Big Data sentiment classification in the parallel network environment. Our new model uses STING Algorithm (SA) (in the data mining field) for English document-level sentiment classification with Hadoop Map (M)/Reduce (R) based on the 90,000 English sentences of the training data set in a Cloudera parallel network environment — a distributed system. In the world there is not any scientific study which is similar to this survey. Our new model can classify sentiment of millions of English documents with the shortest execution time in the parallel network environment. We test our new model on the 25,000 English documents of the testing data set and achieved on 61.2% accuracy. Our English training data set includes 45,000 positive English sentences and 45,000 negative English sentences.


2017 ◽  
Vol 53 (3) ◽  
pp. 579-636 ◽  
Author(s):  
Vo Ngoc Phu ◽  
Vo Thi Ngoc Chau ◽  
Nguyen Duy Dat ◽  
Vo Thi Ngoc Tran ◽  
Tuan A. Nguyen

2016 ◽  
Vol 46 (3) ◽  
pp. 717-738 ◽  
Author(s):  
Vo Ngoc Phu ◽  
Nguyen Duy Dat ◽  
Vo Thi Ngoc Tran ◽  
Vo Thi Ngoc Chau ◽  
Tuan A. Nguyen

Author(s):  
Qiang Ma ◽  
Wenbo Chen ◽  
Zhihao Shang
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document