document compression
Recently Published Documents


TOTAL DOCUMENTS

22
(FIVE YEARS 0)

H-INDEX

6
(FIVE YEARS 0)

Author(s):  
Erwin Yudi Hidayat ◽  
Fahri Firdausillah ◽  
Khafiizh Hastuti ◽  
Ika Novita Dewi ◽  
Azhari Azhari

In this paper, we present Latent Drichlet Allocation in automatic text summarization to improve accuracy in document clustering. The experiments involving 398 data set from public blog article obtained by using python scrapy crawler and scraper. Several steps of clustering in this research are preprocessing, automatic document compression using feature method, automatic document compression using LDA, word weighting and clustering algorithm The results show that automatic document summarization with LDA reaches 72% in LDA 40%, compared to traditional k-means method which only reaches 66%.


2013 ◽  
Vol 22 (6) ◽  
pp. 2420-2428 ◽  
Author(s):  
A. Zaghetto ◽  
R. L. de Queiroz

Author(s):  
Alexandre Zaghetto ◽  
Bruno Macchiavello ◽  
Ricardo L. de Queiroz
Keyword(s):  

2011 ◽  
Vol 20 (6) ◽  
pp. 1611-1626 ◽  
Author(s):  
E Haneda ◽  
C A Bouman

2010 ◽  
Vol 36 (3) ◽  
pp. 411-441 ◽  
Author(s):  
James Clarke ◽  
Mirella Lapata

Sentence compression holds promise for many applications ranging from summarization to subtitle generation. The task is typically performed on isolated sentences without taking the surrounding context into account, even though most applications would operate over entire documents. In this article we present a discourse-informed model which is capable of producing document compressions that are coherent and informative. Our model is inspired by theories of local coherence and formulated within the framework of integer linear programming. Experimental results show significant improvements over a state-of-the-art discourse agnostic approach.


Author(s):  
Jiri Dvorsky ◽  
Jan Martinovic ◽  
Jan Platos ◽  
Vaclav Snasel

Sign in / Sign up

Export Citation Format

Share Document