Similar Cluster Based Continuous Bag-of-Words for Word Vector Training

Author(s):  
Weikai Sun ◽  
Yinghua Ma ◽  
Shenghong Li ◽  
Shiyi Zhang
Keyword(s):  
2013 ◽  
Vol 34 (9) ◽  
pp. 2064-2070 ◽  
Author(s):  
Chun-hui Zhao ◽  
Ying Wang ◽  
KANEKO Masahide

2010 ◽  
Vol 28 (2) ◽  
pp. 204-226 ◽  
Author(s):  
Tom Botterill ◽  
Steven Mills ◽  
Richard Green

2012 ◽  
Author(s):  
Berkan Solmaz ◽  
Soumyabrata Dey ◽  
A. Ravishankar Rao ◽  
Mubarak Shah
Keyword(s):  

2020 ◽  
Vol 1 (4) ◽  
pp. 419-441
Author(s):  
Caio L.M. Jeronimo ◽  
Leandro B. Marinho ◽  
Cclaudio E.C. Carmpelo ◽  
Adriano Veloso ◽  
Allan S. Da Costa Melo

While many works investigate spread patterns of fake news in social networks, we focus on the textual content. Instead of relying on syntactic representations of documents (aka Bag of Words) as many works do, we seek more robust representations that may better differentiate fake from legitimate news. We propose to consider the subjectivity of news under the assumption that the subjectivity levels of legitimate and fake news are significantly different. For computing the subjectivity level of news, we rely on a set subjectivity lexicons for both Brazilian Portuguese and English languages. We then build subjectivity feature vectors for each news article by calculating the Word Mover's Distance (WMD) between the news and these lexicons considering the embedding the news words lie in, in order to analyze and classify the documents. The results demonstrate that our method is robust, especially in scenarios where training and test domains are different.


Sign in / Sign up

Export Citation Format

Share Document