Topic Crawling Strategy Based on Wikipedia and Analysis of Pages' Similarity
2012 ◽
Vol 220-223
◽
pp. 2407-2412
Considering the weaknesses existing in the present topic crawling strategies, this paper puts forward a new method which is based on Wikipedia and the analysis of page similarity. Firstly, the topic is described via Wikipedia. Then, handle the downloaded web. Finally, calculate the priorities of the links through text relativity and analysis of the web links. The result indicates that this new method is better than the traditional in terms of searching results and topic relativity and is worth popularizing.
2012 ◽
Vol 170-173
◽
pp. 2924-2928
Keyword(s):
2019 ◽
Vol 492
(1)
◽
pp. 589-602
◽
Keyword(s):
1996 ◽
Vol 07
(01)
◽
pp. 33-41
◽
Keyword(s):
2013 ◽
Vol 380-384
◽
pp. 2695-2698
2011 ◽
Vol 328-330
◽
pp. 1619-1622
◽
Keyword(s):