Cross-lingual web spam classification

Author(s):  
András Garzó ◽  
Bálint Daróczy ◽  
Tamás Kiss ◽  
Dávid Siklósi ◽  
András A. Benczúr
Keyword(s):  
Web Spam ◽  
2012 ◽  
Author(s):  
Xin Liu ◽  
Xiaobin Zhou ◽  
Jianjun Zhu ◽  
Jing-Jen Wang

2015 ◽  
Author(s):  
Qiang Chen ◽  
Wenjie Li ◽  
Yu Lei ◽  
Xule Liu ◽  
Yanxiang He

Author(s):  
Xiaodan Zhuang ◽  
Arnab Ghoshal ◽  
Antti-Veikko Rosti ◽  
Matthias Paulik ◽  
Daben Liu

2019 ◽  
Vol 12 (3) ◽  
pp. 202-211
Author(s):  
Yuancheng Li ◽  
Rong Huang ◽  
Xiangqian Nie

Background: With the rapid development of the Internet, the volume of web spam has increased dramatically in recent years, wasting search engine storage and computing power on a massive scale. To identify web spam effectively, the content features, link features, hidden features and quality features of web pages are integrated into a web spam identification index system. However, the resulting index system is high-dimensional and its indexes are highly correlated. Methods: A stacked autoencoder neural network (SAE), an improved form of the autoencoder, is used to reduce the web spam identification index system. Results: The experimental results show that the method effectively reduces the number of web spam indexes and significantly improves the recognition rate in subsequent detection. Conclusion: An autoencoder-based method for reducing web spam indexes is proposed in this paper. The experimental results show that it greatly reduces the time and space complexity of the subsequent web spam detection model.
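
The reduction step described in the abstract can be illustrated with a small sketch of a stacked autoencoder trained greedily, one layer at a time. This is not the authors' implementation: the layer sizes (12 to 6 to 3), the tanh encoder with a linear decoder, the learning rate, and the synthetic correlated "index" matrix are all assumptions chosen for illustration, and the helper names `train_autoencoder` and `stacked_encode` are hypothetical.

```python
import numpy as np

def train_autoencoder(X, n_hidden, epochs=500, lr=0.01, seed=0):
    """Train one autoencoder (tanh encoder, linear decoder) by batch
    gradient descent. Returns encoder weights, bias, and loss history."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, n_hidden))   # encoder weights
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.1, (n_hidden, d))   # decoder weights
    b2 = np.zeros(d)
    losses = []
    for _ in range(epochs):
        H = np.tanh(X @ W1 + b1)               # encode
        R = H @ W2 + b2                        # decode
        err = R - X
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the squared reconstruction error,
        # summed over features and averaged over samples.
        gR = 2.0 * err / n
        gW2 = H.T @ gR
        gb2 = gR.sum(axis=0)
        gZ = (gR @ W2.T) * (1.0 - H ** 2)      # back through tanh
        gW1 = X.T @ gZ
        gb1 = gZ.sum(axis=0)
        W1 -= lr * gW1
        b1 -= lr * gb1
        W2 -= lr * gW2
        b2 -= lr * gb2
    return W1, b1, losses

def stacked_encode(X, layer_sizes):
    """Greedy layer-wise training: each autoencoder is fitted on the
    codes produced by the previous one; decoders are discarded."""
    reps, params, history = X, [], []
    for i, k in enumerate(layer_sizes):
        W, b, losses = train_autoencoder(reps, k, seed=i)
        params.append((W, b))
        history.append(losses)
        reps = np.tanh(reps @ W + b)
    return reps, params, history

# Synthetic stand-in for a correlated index system (assumption):
# 12 observed indexes driven by 3 latent factors plus small noise.
rng = np.random.default_rng(42)
latent = rng.normal(size=(200, 3))
mix = rng.normal(size=(3, 12))
X = latent @ mix + 0.1 * rng.normal(size=(200, 12))
X = (X - X.mean(axis=0)) / X.std(axis=0)       # standardize each index

Z, params, history = stacked_encode(X, layer_sizes=[6, 3])
print("input shape:", X.shape, "reduced shape:", Z.shape)
```

In this sketch the reduced matrix `Z` would replace the original indexes as input to a downstream spam classifier; in practice the code dimension and the training schedule would need to be tuned on real features.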


Author(s):  
Yu Tsou ◽  
Deng-Neng Chen ◽  
Chia-Yu Lai