An Enhanced Short Text Compression Scheme for Smart Devices

Md. Rafiqul Islam; S. A. Ahsan Rajon

doi:10.4304/jcp.5.1.49-58

An Enhanced Short Text Compression Scheme for Smart Devices

Journal of Computers ◽

10.4304/jcp.5.1.49-58 ◽

2010 ◽

Vol 5 (1) ◽

Author(s):

Md. Rafiqul Islam ◽

S. A. Ahsan Rajon

Keyword(s):

Smart Devices ◽

Text Compression ◽

Compression Scheme ◽

Short Text

Download Full-text

Short text compression for smart devices

2008 11th International Conference on Computer and Information Technology ◽

10.1109/iccitechn.2008.4803017 ◽

2008 ◽

Cited By ~ 4

Author(s):

Md. Rafiqul Islam ◽

S. A. Ahsan Rajon ◽

Anonda Podder

Keyword(s):

Smart Devices ◽

Text Compression ◽

Short Text

Download Full-text

A text compression scheme that allows fast searching directly in the compressed file

ACM Transactions on Information Systems ◽

10.1145/248625.248639 ◽

1997 ◽

Vol 15 (2) ◽

pp. 124-136 ◽

Cited By ~ 62

Author(s):

Udi Manber

Keyword(s):

Text Compression ◽

Compression Scheme ◽

Fast Searching

Download Full-text

Better Adaptive Text Compression Scheme

JOURNAL OF EDUCATION AND SCIENCE ◽

10.33899/edusj.2018.147575 ◽

2018 ◽

Vol 27 (2) ◽

pp. 48-57

Author(s):

Duha Amir Sultan

Keyword(s):

Text Compression ◽

Compression Scheme

Download Full-text

Automatic Correction of Arabic Dyslexic Text

Computers ◽

10.3390/computers8010019 ◽

2019 ◽

Vol 8 (1) ◽

pp. 19 ◽

Cited By ~ 1

Author(s):

Maha Alamri ◽

William Teahan

Keyword(s):

Word Processing ◽

Language Model ◽

Text Compression ◽

Arabic Text ◽

Automatic Correction ◽

Compression Scheme ◽

Candidate List ◽

Arabic Word ◽

Processing Software ◽

Correction System

This paper proposes an automatic correction system that detects and corrects dyslexic errors in Arabic text. The system uses a language model based on the Prediction by Partial Matching (PPM) text compression scheme that generates possible alternatives for each misspelled word. Furthermore, the generated candidate list is based on edit operations (insertion, deletion, substitution and transposition), and the correct alternative for each misspelled word is chosen on the basis of the compression codelength of the trigram. The system is compared with widely-used Arabic word processing software and the Farasa tool. The system provided good results compared with the other tools, with a recall of 43%, precision 89%, F1 58% and accuracy 81%.

Download Full-text

A bit-level text compression scheme based on the ACW algorithm

International Journal of Automation and Computing ◽

10.1007/s11633-010-0123-6 ◽

2010 ◽

Vol 7 (1) ◽

pp. 123-131 ◽

Cited By ~ 7

Author(s):

Hussein Al-Bahadili ◽

Shakir M. Hussain

Keyword(s):

Text Compression ◽

Compression Scheme

Download Full-text

Two-Level Dictionary-Based Text Compression Scheme

2008 11th International Conference on Computer and Information Technology ◽

10.1109/iccitechn.2008.4803026 ◽

2008 ◽

Cited By ~ 2

Author(s):

Ziaul Karim Zia ◽

Dewan Fayzur Rahman ◽

Chowdhury Mofizur Rahman

Keyword(s):

Text Compression ◽

Compression Scheme

Download Full-text

Text Compression and Encryption through Smart Devices for Mobile Communication

2013 Seventh International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing ◽

10.1109/imis.2013.121 ◽

2013 ◽

Cited By ~ 7

Author(s):

Raffaele Pizzolante ◽

Bruno Carpentieri ◽

Aniello Castiglione ◽

Arcangelo Castiglione ◽

Francesco Palmieri

Keyword(s):

Mobile Communication ◽

Smart Devices ◽

Text Compression

Download Full-text

A text compression scheme that allows fast searching directly in the compressed file

Combinatorial Pattern Matching - Lecture Notes in Computer Science ◽

10.1007/3-540-58094-8_10 ◽

1994 ◽

pp. 113-124 ◽

Cited By ~ 12

Author(s):

Udi Manber

Keyword(s):

Text Compression ◽

Compression Scheme ◽

Fast Searching

Download Full-text

A BIT-LEVEL TEXT COMPRESSION SCHEME BASED ON THE HCDC ALGORITHM

International Journal of Computers and Applications ◽

10.2316/journal.202.2010.3.202-2914 ◽

2010 ◽

Vol 32 (3) ◽

Cited By ~ 3

Author(s):

H. Al-Bahadili ◽

A. Rababa’a

Keyword(s):

Text Compression ◽

Compression Scheme

Download Full-text

A Syllable-Based Technique for Uyghur Text Compression

Information ◽

10.3390/info11030172 ◽

2020 ◽

Vol 11 (3) ◽

pp. 172 ◽

Cited By ~ 3

Author(s):

Wayit Abliz ◽

Hao Wu ◽

Maihemuti Maimaiti ◽

Jiamila Wushouer ◽

Kahaerjiang Abiderexiti ◽

...

Keyword(s):

Compression Ratio ◽

Data Transmission ◽

Text Compression ◽

Compression Process ◽

Short Text ◽

Coding Scheme ◽

Coding Schemes ◽

Code Table ◽

Compression Coding ◽

Lzw Algorithm

To improve utilization of text storage resources and efficiency of data transmission, we proposed two syllable-based Uyghur text compression coding schemes. First, according to the statistics of syllable coverage of the corpus text, we constructed a 12-bit and 16-bit syllable code tables and added commonly used symbols—such as punctuation marks and ASCII characters—to the code tables. To enable the coding scheme to process Uyghur texts mixed with other language symbols, we introduced a flag code in the compression process to distinguish the Unicode encodings that were not in the code table. The experiments showed that the 12-bit coding scheme had an average compression ratio of 0.3 on Uyghur text less than 4 KB in size and that the 16-bit coding scheme had an average compression ratio of 0.5 on text less than 2 KB in size. Our compression schemes outperformed GZip, BZip2, and the LZW algorithm on short text and could be effectively applied to the compression of Uyghur short text for storage and applications.

Download Full-text