Arabic Text Mining a Systematic Review of the Published Literature 2002-2014

The need for designing Arabic text mining systems for the use on social media posts is increasingly becoming a significant and attractive research area. It serves and enhances the knowledge needed in various domains. The main focus of this paper is to propose a novel framework combining sentiment analysis with subjective analysis on Arabic social media posts to determine whether people are interested or not interested in a defined subject. For those purposes, text classification methods—including preprocessing and machine learning mechanisms—are applied. Essentially, the performance of the framework is tested using Twitter as a data source, where possible volunteers on a certain subject are identified based on their posted tweets along with their subject-related information. Twitter is considered because of its popularity and its rich content from online microblogging services. The results obtained are very promising with an accuracy of 89%, thereby encouraging further research.

Download Full-text

A Comparative Study of Root -Based and Stem -Based Approaches for Measuring the Similarity Between Arabic Words for Arabic Text Mining Applications

Advanced Computing An International Journal ◽

10.5121/acij.2012.3607 ◽

2012 ◽

Vol 3 (6) ◽

pp. 55-67 ◽

Cited By ~ 13

Author(s):

Hanane FROUD

Keyword(s):

Text Mining ◽

Comparative Study ◽

Arabic Text

Download Full-text

Knowledge discovery out of text data: a systematic review via text mining

Journal of Knowledge Management ◽

10.1108/jkm-11-2017-0517 ◽

2018 ◽

Vol 22 (7) ◽

pp. 1471-1488 ◽

Cited By ~ 11

Author(s):

Antonio Usai ◽

Marco Pironti ◽

Monika Mital ◽

Chiraz Aouina Mejri

Keyword(s):

Systematic Review ◽

Text Mining ◽

Knowledge Discovery ◽

Research Collaboration ◽

Collaborative Writing ◽

Web Based ◽

Diverse Range ◽

Content Type ◽

Mining Technique ◽

Database Technology

Purpose The aim of this work is to increase awareness of the potential of the technique of text mining to discover knowledge and further promote research collaboration between knowledge management and the information technology communities. Since its emergence, text mining has involved multidisciplinary studies, focused primarily on database technology, Web-based collaborative writing, text analysis, machine learning and knowledge discovery. However, owing to the large amount of research in this field, it is becoming increasingly difficult to identify existing studies and therefore suggest new topics. Design/methodology/approach This article offers a systematic review of 85 academic outputs (articles and books) focused on knowledge discovery derived from the text mining technique. The systematic review is conducted by applying “text mining at the term level, in which knowledge discovery takes place on a more focused collection of words and phrases that are extracted from and label each document” (Feldman et al., 1998, p. 1). Findings The results revealed that the keywords extracted to be associated with the main labels, id est, knowledge discovery and text mining, can be categorized in two periods: from 1998 to 2009, the term knowledge and text were always used. From 2010 to 2017 in addition to these terms, sentiment analysis, review manipulation, microblogging data and knowledgeable users were the other terms frequently used. Besides this, it is possible to notice the technical, engineering nature of each term present in the first decade. Whereas, a diverse range of fields such as business, marketing and finance emerged from 2010 to 2017 owing to a greater interest in the online environment. Originality/value This is a first comprehensive systematic review on knowledge discovery and text mining through the use of a text mining technique at term level, which offers to reduce redundant research and to avoid the possibility of missing relevant publications.

Download Full-text

A systematic review on techniques of feature selection and classification for text mining

International Journal of Business Information Systems ◽

10.1504/ijbis.2018.10014636 ◽

2018 ◽

Vol 28 (4) ◽

pp. 504

Author(s):

P. Sivakumar ◽

K. Sridharan

Keyword(s):

Systematic Review ◽

Feature Selection ◽

Text Mining

Download Full-text

Measure of fuzzy presence of descriptors on Arabic Text Mining

2012 Colloquium in Information Science and Technology ◽

10.1109/cist.2012.6388063 ◽

2012 ◽

Author(s):

Ibtissam El Hassani ◽

Abdelaziz Kriouile ◽

Youssef BenGhabrit

Keyword(s):

Text Mining ◽

Arabic Text

Download Full-text

Erratum to: Using text mining for study identification in systematic reviews: a systematic review of current approaches

Systematic Reviews ◽

10.1186/s13643-015-0031-5 ◽

2015 ◽

Vol 4 (1) ◽

Cited By ~ 17

Author(s):

Alison O’Mara-Eves ◽

James Thomas ◽

John McNaught ◽

Makoto Miwa ◽

Sophia Ananiadou

Keyword(s):

Systematic Review ◽

Text Mining ◽

Systematic Reviews

Download Full-text

Text mining for market prediction: A systematic review

Expert Systems with Applications ◽

10.1016/j.eswa.2014.06.009 ◽

2014 ◽

Vol 41 (16) ◽

pp. 7653-7670 ◽

Cited By ~ 225

Author(s):

Arman Khadjeh Nassirtoussi ◽

Saeed Aghabozorgi ◽

Teh Ying Wah ◽

David Chek Ling Ngo

Keyword(s):

Systematic Review ◽

Text Mining

Download Full-text

Arabic text mining based on clustering and coreference resolution

2017 International Conference on Current Research in Computer Science and Information Technology (ICCIT) ◽

10.1109/crcsit.2017.7965549 ◽

2017 ◽

Author(s):

Salma Mahmood ◽

Faiez Musa Lahmood Al-Rufaye

Keyword(s):

Text Mining ◽

Coreference Resolution ◽

Arabic Text

Download Full-text

Text mining for Indonesian translation of the Quran: A systematic review

2017 International Conference on Computing, Engineering, and Design (ICCED) ◽

10.1109/ced.2017.8308122 ◽

2017 ◽

Cited By ~ 3

Author(s):

Syopiansyah Jaya Putra ◽

Teddy Mantoro ◽

Muhamad Nur Gunawan

Keyword(s):

Systematic Review ◽

Text Mining

Download Full-text

Arabic Text Mining Using Rule Based Classification

Journal of Information & Knowledge Management ◽

10.1142/s0219649212500062 ◽

2012 ◽

Vol 11 (01) ◽

pp. 1250006 ◽

Cited By ~ 5

Author(s):

Fadi Thabtah ◽

Omar Gharaibeh ◽

Rashid Al-Zubaidy

Keyword(s):

Text Mining ◽

Text Classification ◽

Business Intelligence ◽

Classification Problem ◽

Decision Making Process ◽

Classification Algorithms ◽

Arabic Text ◽

Essential Information ◽

Rule Based ◽

Arabic Text Classification

A well-known classification problem in the domain of text mining is text classification, which concerns about mapping textual documents into one or more predefined category based on its content. Text classification arena recently attracted many researchers because of the massive amounts of online documents and text archives which hold essential information for a decision-making process. In this field, most of such researches focus on classifying English documents while there are limited studies conducted on other languages like Arabic. In this respect, the paper proposes to investigate the problem of Arabic text classification comprehensively. More specifically the study measures the performance of different rule based classification approaches adopted from machine learning and data mining towards the problem of text Arabic classification. In particular, four different rule based classification approaches: Decision trees (C4.5), Rule Induction (RIPPER), Hybrid (PART) and Simple Rule (One Rule) are evaluated against the published Corpus of Contemporary Arabic Arabic text collection. This experimentation is carried out by employing a modified version of WEKA business intelligence tool. Through analysing the produced results from the experimentation, we determine the most suitable classification algorithms for classifying Arabic texts.

Download Full-text