Document Compression: Latest Research Papers

In this paper, we present Latent Dirichlet Allocation (LDA) in automatic text summarization to improve accuracy in document clustering. The experiments involve a data set of 398 public blog articles collected with the Python Scrapy crawler and scraper. The clustering steps in this research are preprocessing, automatic document compression using the feature method, automatic document compression using LDA, word weighting, and the clustering algorithm. The results show that automatic document summarization with LDA reaches 72% in LDA 40%, compared to the traditional k-means method, which only reaches 66%.
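The word weighting and clustering steps named in the abstract can be illustrated with a minimal sketch. It assumes TF-IDF as the word weighting and plain k-means as the clustering algorithm; the abstract does not state the exact weighting scheme, and the LDA compression step is not reproduced here. The function names (`tokenize`, `tfidf_vectors`, `kmeans`) and the sample documents are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and split on non-letters (a stand-in for real preprocessing)."""
    cleaned = "".join(ch.lower() if ch.isalpha() else " " for ch in text)
    return cleaned.split()


def tfidf_vectors(docs):
    """Return one L2-normalised TF-IDF vector per document (assumed weighting scheme)."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({w for toks in tokenized for w in toks})
    n_docs = len(docs)
    doc_freq = Counter(w for toks in tokenized for w in set(toks))
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        # Smoothed IDF: a term that occurs in every document still gets a positive weight.
        vec = [counts[w] * (math.log((1 + n_docs) / (1 + doc_freq[w])) + 1.0) for w in vocab]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=100):
    """Plain k-means with deterministic farthest-first initialisation."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        # Pick the point that is farthest from its nearest existing centroid.
        centroids.append(max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids)))
    labels = None
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical sample documents: two about cats, two about stock markets.
docs = [
    "The cat sat on the mat with another cat",
    "A cat and a kitten play on the mat",
    "Stock markets fell as investors sold shares",
    "Investors bought shares when stock markets rose",
]
labels = kmeans(tfidf_vectors(docs), k=2)
print(labels)
```

In a pipeline like the one the abstract describes, the summarized (compressed) documents rather than the raw text would presumably be passed to the weighting step, so that clustering operates on the reduced vocabulary.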


EFFICIENT DOCUMENT COMPRESSION USING INTRA FRAME PREDICTION TECHNIQUE

International Journal of Research in Engineering and Technology ◽ 10.15623/ijret.2014.0319132 ◽ 2014 ◽ Vol 03 (19) ◽ pp. 737-741

Author(s): Chitradevi S.

Keyword(s): Document Compression


Scanned Document Compression Using Block-Based Hybrid Video Codec

IEEE Transactions on Image Processing ◽ 10.1109/tip.2013.2251641 ◽ 2013 ◽ Vol 22 (6) ◽ pp. 2420-2428 ◽ Cited By ~ 6

Author(s): A. Zaghetto ◽ R. L. de Queiroz

Keyword(s): Video Codec ◽ Block Based ◽ Document Compression


HEVC-based scanned document compression

2012 19th IEEE International Conference on Image Processing ◽ 10.1109/icip.2012.6466986 ◽ 2012

Author(s): Alexandre Zaghetto ◽ Bruno Macchiavello ◽ Ricardo L. de Queiroz

Keyword(s): Document Compression


Text Segmentation for MRC Document Compression

IEEE Transactions on Image Processing ◽ 10.1109/tip.2010.2101611 ◽ 2011 ◽ Vol 20 (6) ◽ pp. 1611-1626 ◽ Cited By ~ 15

Author(s): E Haneda ◽ C A Bouman

Keyword(s): Text Segmentation ◽ Document Compression


Discourse Constraints for Document Compression

Computational Linguistics ◽ 10.1162/coli_a_00004 ◽ 2010 ◽ Vol 36 (3) ◽ pp. 411-441 ◽ Cited By ~ 11

Author(s): James Clarke ◽ Mirella Lapata

Keyword(s): Linear Programming ◽ Integer Linear Programming ◽ State Of The Art ◽ Experimental Results ◽ Sentence Compression ◽ Local Coherence ◽ Document Compression

Sentence compression holds promise for many applications ranging from summarization to subtitle generation. The task is typically performed on isolated sentences without taking the surrounding context into account, even though most applications would operate over entire documents. In this article we present a discourse-informed model which is capable of producing document compressions that are coherent and informative. Our model is inspired by theories of local coherence and formulated within the framework of integer linear programming. Experimental results show significant improvements over a state-of-the-art discourse-agnostic approach.
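To make the integer-programming framing concrete, the sketch below treats compression of one sentence as a 0/1 decision per word: keep the subset with the highest total score, subject to constraints. It is not the authors' model. The scores, the head indices, the length limit, and the `must_keep` set (standing in for a discourse constraint that retains words salient in the surrounding context) are all hypothetical, and exhaustive search replaces an ILP solver, which is only practical for very short sentences.

```python
from itertools import product


def compress(words, scores, heads, must_keep, max_len):
    """Pick a 0/1 keep decision per word that maximises total score under constraints.

    heads[i] is the index of the syntactic head of word i, or -1 for the root.
    must_keep is a set of word indices that have to be retained (assumed
    discourse constraint). Raises ValueError if no assignment is feasible.
    """
    best, best_score = None, float("-inf")
    for keep in product((0, 1), repeat=len(words)):
        # Length constraint: keep at most max_len words.
        if sum(keep) > max_len:
            continue
        # Grammar constraint: a kept word requires its head to be kept as well.
        if any(keep[i] and heads[i] >= 0 and not keep[heads[i]] for i in range(len(words))):
            continue
        # Discourse constraint (assumption): salient words are always retained.
        if any(not keep[i] for i in must_keep):
            continue
        total = sum(s for s, k in zip(scores, keep) if k)
        if total > best_score:
            best, best_score = keep, total
    if best is None:
        raise ValueError("no compression satisfies the constraints")
    return [w for w, k in zip(words, best) if k]


# Hypothetical example sentence with made-up importance scores.
words = ["the", "committee", "finally", "approved", "the", "controversial", "budget"]
scores = [0.1, 0.8, 0.2, 1.0, 0.1, 0.4, 0.9]
heads = [1, 3, 3, -1, 6, 6, 3]   # "approved" is the root
must_keep = {1}                  # "committee" assumed salient in the preceding context
result = compress(words, scores, heads, must_keep, max_len=4)
print(" ".join(result))
```

Expressing the same objective and constraints as linear inequalities over binary variables is what allows an ILP solver to handle full-length sentences and whole documents; the enumeration here only shows the shape of the problem.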
