scholarly journals A public opinion classification algorithm based on micro-blog text sentiment intensity: Design and implementation

Author(s):  
Mingjun Xin ◽  
Hanxiang Wu ◽  
Weimin Li ◽  
Zhihua Niu
2014 ◽  
Vol 519-520 ◽  
pp. 58-61 ◽  
Author(s):  
Jian Xu ◽  
Bin Ma

In the light of the excellent distributed storage and parallel processing feature of hadoop cluster, a new kind of network public opinion classification method based on Naive Bayes algorithm in hadoop environment is studied. The collected public opinion documents are stored locally according to the HDFS architecture, and whose character words are extracted paralleled in Mapreduce process. Thus the naive Bayesian classification algorithm is parallel encapsulated on cloud computing platform. The MapReduce packaged Naive Bayesian classification algorithm performance is verified and the results show that the algorithm execution speed are significantly improved compared to a single server. Its public opinion classification accuracy rate is more than 85%, which can effectively improve the classification performance of network public opinion and classification efficiency.


2013 ◽  
Vol 347-350 ◽  
pp. 2506-2510
Author(s):  
Yun Qi Gao ◽  
Chun Lin Peng

With the development of Internet, Network public opinion has been serving an import role in reflection of social public opinion. As there are a large number of websites and forums on the Internet, we need a powerful crawler system which can meet the demands of opinion mining. However, common crawler systems concern more about ranking and recommendation algorithms, which is less important in opinion mining. In this article, we introduced the design and implementation of a distributed crawler system for opinion mining. We also introduced some extra parameters such as keywords count and published time into the ranking and refreshing strategies. Experimental results demonstrate that the system can well support different sites, and the improved strategies can greatly enhance the crawling and monitoring efficiency.


2014 ◽  
Vol 635-637 ◽  
pp. 1624-1627
Author(s):  
Jian Xu ◽  
Bin Ma

A new kind of network public opinion classification method based on K_ nearest neighbor (K_NN) classification algorithm in Hadoop environment is studied in this paper. In the light of distributed storage and parallel processing Characteristics of Hadoop platform, the parallel K_NN classification algorithm in the frame of MapReduce is designed. The classification ability and execution efficiency of proposed scheme is verified and the results show that the parallel K_NN algorithm enhances the network public opinion classification precision and execution efficiently.


Sign in / Sign up

Export Citation Format

Share Document