scholarly journals The Power of Communities: A Text Classification Model with Automated Labeling Process Using Network Community Detection

Author(s):  
Minjun Kim ◽  
Hiroki Sayama
2016 ◽  
Vol 35 (2) ◽  
pp. 244-261 ◽  
Author(s):  
Frederic Guerrero-Solé

In November 9, 2014, the Catalan government called Catalan people to participate in a straw poll about the independence of Catalonia from Spain. This article analyzes the use of Twitter between November 8 and 10, 2014. Drawing on a methodology developed by Guerrero-Solé, Corominas-Murtra, and Lopez-Gonzalez, this work examines the structure of the retweet overlap network (RON), formed by those users whose communities of retweeters have nonzero overlapping, to detect the community structure of the network. The results show a high polarization of the resulting network and prove that the RON is a reliable method to determinate network community structures and users’ political leaning in political discussions.


2019 ◽  
Vol 14 (1) ◽  
pp. 124-134 ◽  
Author(s):  
Shuai Zhang ◽  
Yong Chen ◽  
Xiaoling Huang ◽  
Yishuai Cai

Online feedback is an effective way of communication between government departments and citizens. However, the daily high number of public feedbacks has increased the burden on government administrators. The deep learning method is good at automatically analyzing and extracting deep features of data, and then improving the accuracy of classification prediction. In this study, we aim to use the text classification model to achieve the automatic classification of public feedbacks to reduce the work pressure of administrator. In particular, a convolutional neural network model combined with word embedding and optimized by differential evolution algorithm is adopted. At the same time, we compared it with seven common text classification models, and the results show that the model we explored has good classification performance under different evaluation metrics, including accuracy, precision, recall, and F1-score.


Author(s):  
Noha Ali ◽  
Ahmed H. AbuEl-Atta ◽  
Hala H. Zayed

<span id="docs-internal-guid-cb130a3a-7fff-3e11-ae3d-ad2310e265f8"><span>Deep learning (DL) algorithms achieved state-of-the-art performance in computer vision, speech recognition, and natural language processing (NLP). In this paper, we enhance the convolutional neural network (CNN) algorithm to classify cancer articles according to cancer hallmarks. The model implements a recent word embedding technique in the embedding layer. This technique uses the concept of distributed phrase representation and multi-word phrases embedding. The proposed model enhances the performance of the existing model used for biomedical text classification. The result of the proposed model overcomes the previous model by achieving an F-score equal to 83.87% using an unsupervised technique that trained on PubMed abstracts called PMC vectors (PMCVec) embedding. Also, we made another experiment on the same dataset using the recurrent neural network (RNN) algorithm with two different word embeddings Google news and PMCVec which achieving F-score equal to 74.9% and 76.26%, respectively.</span></span>


2018 ◽  
Vol 29 (02) ◽  
pp. 1850011 ◽  
Author(s):  
Chun Gui ◽  
Ruisheng Zhang ◽  
Zhili Zhao ◽  
Jiaxuan Wei ◽  
Rongjing Hu

In order to deal with stochasticity in center node selection and instability in community detection of label propagation algorithm, this paper proposes an improved label propagation algorithm named label propagation algorithm based on community belonging degree (LPA-CBD) that employs community belonging degree to determine the number and the center of community. The general process of LPA-CBD is that the initial community is identified by the nodes with the maximum degree, and then it is optimized or expanded by community belonging degree. After getting the rough structure of network community, the remaining nodes are labeled by using label propagation algorithm. The experimental results on 10 real-world networks and three synthetic networks show that LPA-CBD achieves reasonable community number, better algorithm accuracy and higher modularity compared with other four prominent algorithms. Moreover, the proposed algorithm not only has lower algorithm complexity and higher community detection quality, but also improves the stability of the original label propagation algorithm.


Sign in / Sign up

Export Citation Format

Share Document