Character-Based N-gram Model for Uyghur Text Retrieval

Biometric Recognition - Lecture Notes in Computer Science ◽ 10.1007/978-3-319-97909-0_72 ◽ 2018 ◽ pp. 678-688

Author(s): Turdi Tohti ◽ Lirui Xu ◽ Jimmy Huang ◽ Winira Musajan ◽ Askar Hamdulla

Keyword(s): Text Retrieval ◽ N Gram

Evaluation of n-gram conflation approaches for Arabic text retrieval

Journal of the American Society for Information Science and Technology ◽ 10.1002/asi.21063 ◽ 2009 ◽ Vol 60 (7) ◽ pp. 1448-1465 ◽ Cited By ~ 14

Author(s): Farag Ahmed ◽ Andreas Nürnberger

Keyword(s): Text Retrieval ◽ Arabic Text ◽ N Gram

N-gram and Local Context Analysis for Persian text retrieval

10.1109/isspa.2007.4555345 ◽ 2007 ◽ Cited By ~ 9

Author(s): Abolfazl Aleahmad ◽ Parsia Hakimian ◽ Farzad Mahdikhani ◽ Farhad Oroumchian

Keyword(s): Text Retrieval ◽ Local Context ◽ Context Analysis ◽ N Gram

Character N-Gram Tokenization for European Language Text Retrieval

Information Retrieval ◽ 10.1023/b:inrt.0000009441.78971.be ◽ 2004 ◽ Vol 7 (1/2) ◽ pp. 73-97 ◽ Cited By ~ 129

Author(s): Paul McNamee ◽ James Mayfield

Keyword(s): Text Retrieval ◽ European Language ◽ N Gram ◽ Language Text

A Word & Character N-Gram based Arabic OCR Error Simulation model

INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY ◽ 10.24297/ijct.v12i8.2999 ◽ 2014 ◽ Vol 12 (8) ◽ pp. 3758-3767 ◽ Cited By ~ 1

Author(s): Mostafa Ezzat ◽ Tarek Ahmed ElGhazaly ◽ Mervat Gheith

Keyword(s): Simulation Model ◽ High Precision ◽ Text Retrieval ◽ Search Query ◽ New Model ◽ Retrieval Effectiveness ◽ Proposed Model ◽ Arabic Ocr ◽ N Gram ◽ Error Simulation

This paper presents a new model aimed at improving the retrieval effectiveness of Arabic OCR-degraded text. The proposed model simulates Arabic OCR recognition mistakes using both word-based and character N-gram approaches. The user's search query is then expanded with the expected OCR errors. The expanded query gives higher precision and recall than the original query when searching Arabic OCR-degraded text. The proposed model showed a significant increase in degraded-text retrieval effectiveness over previous models: the retrieval effectiveness of the new model is 93%, while the best published effectiveness was 84% for the word-based approach and 56% for the character-based approach.
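
The query-expansion idea described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' model: the confusion table, the function names `expand_term` and `expand_query`, and the limit of one substituted character per variant are all hypothetical choices made for the example.

```python
# Hypothetical confusion table (an assumption for illustration, not data
# from the paper): each character maps to characters that an OCR engine
# might plausibly output in its place.
CONFUSIONS = {
    "ب": ["ت", "ث"],
    "ت": ["ب", "ث"],
    "ح": ["ج", "خ"],
}


def expand_term(term, confusions=CONFUSIONS):
    """Return the term followed by every variant with one substituted character."""
    variants = [term]
    for i, ch in enumerate(term):
        for wrong in confusions.get(ch, []):
            candidate = term[:i] + wrong + term[i + 1:]
            if candidate not in variants:
                variants.append(candidate)
    return variants


def expand_query(query, confusions=CONFUSIONS):
    """Expand each whitespace-separated term into a list of alternatives.

    A search engine could treat each inner list as an OR-group, so that a
    document containing a misrecognized form of a term still matches.
    """
    return [expand_term(term, confusions) for term in query.split()]
```

A real system would derive the confusion table from aligned pairs of clean and OCR-degraded text and would likely weight each variant by how probable the error is; the sketch treats all variants equally.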

N-gram based Language Model for the QWERTY Keyboard Input Errors in a Touch Screen Environment

Korean Institute of Smart Media ◽ 10.30693/smj.2018.7.2.54 ◽ 2018 ◽ Vol 7 (2) ◽ pp. 54-59

Author(s): Yoon Gee Ong ◽ Seung Shik Kang

Keyword(s): Language Model ◽ Touch Screen ◽ Keyboard Input ◽ N Gram

A Ranking model of proximal and structural text retrieval based on region algebra

Proceedings of the conference on SIGGRAPH 2004 course notes - GRAPH '04 ◽ 10.3115/1075178.1075185 ◽ 2003 ◽ Cited By ~ 3

Author(s): Katsuya Masuda

Keyword(s): Text Retrieval ◽ Ranking Model

Learning N-Gram Language Models from Uncertain Data

10.21437/interspeech.2016-1093 ◽ 2016 ◽ Cited By ~ 4

Author(s): Vitaly Kuznetsov ◽ Hank Liao ◽ Mehryar Mohri ◽ Michael Riley ◽ Brian Roark

Keyword(s): Uncertain Data ◽ Language Models ◽ N Gram

Rescore in a Flash: Compact, Cache Efficient Hashing Data Structures for n-Gram Language Models

10.21437/interspeech.2020-1939 ◽ 2020

Author(s): Grant P. Strimel ◽ Ariya Rastrow ◽ Gautam Tiwari ◽ Adrien Piérard ◽ Jon Webb

Keyword(s): Data Structures ◽ Language Models ◽ Cache Efficient ◽ N Gram

OCR correction for Indonesian historic newspapers using word repetition, stemmer and n-gram

Journal of Physics Conference Series ◽ 10.1088/1742-6596/1193/1/012032 ◽ 2019 ◽ Vol 1193 ◽ pp. 012032

Author(s): D Purwantoro ◽ H Akbar ◽ A Hidayati ◽ Sfenrianto

Keyword(s): Word Repetition ◽ N Gram

Robustness of Word and Character N-gram Combinations in Detecting Deceptive and Truthful Opinions

Journal of Data and Information Quality ◽ 10.1145/3349536 ◽ 2020 ◽ Vol 12 (1) ◽ pp. 1-24 ◽ Cited By ~ 1

Author(s): Al Hafiz Akbar Maulana Siagian ◽ Masayoshi Aritsugi

Keyword(s): N Gram