ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition

Arabic text classification assigns Arabic text content to appropriate categories based on its context. In this paper, a novel deep learning computer-aided recognition system for Arabic text (ArCAR) is proposed to represent and recognize Arabic text at the character level. Each character of the input text is quantized as a 1D vector, and these vectors are stacked into a 2D array that serves as the input to the ArCAR system. The ArCAR system is validated with 5-fold cross-validation on two applications: Arabic text document classification and Arabic sentiment analysis. For document classification, ArCAR achieves its best performance on the Alarabiya-balance dataset, with overall accuracy, recall, precision, and F1-score of 97.76%, 94.08%, 94.16%, and 94.09%, respectively. For Arabic sentiment analysis, ArCAR achieves its best performance on the balanced Hotel Arabic Reviews Dataset (HARD), with overall accuracy and F1-score of 93.58% and 93.23%, respectively. The proposed ArCAR appears to provide a practical solution for accurate Arabic text representation, understanding, and classification.
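The character-level quantization described in the abstract can be illustrated with a minimal sketch. The alphabet (28 basic Arabic letters plus a space), the fixed length `max_len`, and the names `ALPHABET`, `CHAR_TO_INDEX`, and `quantize` are assumptions made for illustration; the abstract does not state the character set or input length that ArCAR actually uses.

```python
# Hypothetical alphabet: the 28 basic Arabic letters plus a space.
# The real ArCAR character set is not given in the abstract.
ALPHABET = "ابتثجحخدذرزسشصضطظعغفقكلمنهوي "
CHAR_TO_INDEX = {ch: i for i, ch in enumerate(ALPHABET)}


def quantize(text, max_len=16):
    """Encode text as a max_len x len(ALPHABET) array of 0/1 values.

    Each row is the one-hot (1D) vector of one character. Characters
    outside the alphabet, and positions beyond the end of the text,
    are left as all-zero rows. Text longer than max_len is truncated.
    """
    matrix = [[0] * len(ALPHABET) for _ in range(max_len)]
    for row, ch in enumerate(text[:max_len]):
        col = CHAR_TO_INDEX.get(ch)
        if col is not None:
            matrix[row][col] = 1
    return matrix


if __name__ == "__main__":
    encoded = quantize("كتاب", max_len=6)
    print(len(encoded), "rows x", len(encoded[0]), "columns")
```

A 2D array of this kind could then be fed to a 1D convolutional network, with one row per character position. That is a common design for character-level models, but whether ArCAR follows it is not stated here.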

Correction to: Benchmarking performance of machine and deep learning-based methodologies for Urdu text document classification

Neural Computing and Applications, 2020, DOI 10.1007/s00521-020-05435-z

Author(s): Muhammad Nabeel Asim, Muhammad Usman Ghani, Muhammad Ali Ibrahim, Waqar Mahmood, Andreas Dengel, ...

Keyword(s): Deep Learning, Document Classification, Text Document, Text Document Classification, Benchmarking Performance

Benchmarking performance of machine and deep learning-based methodologies for Urdu text document classification

Neural Computing and Applications, 2020, DOI 10.1007/s00521-020-05321-8

Author(s): Muhammad Nabeel Asim, Muhammad Usman Ghani, Muhammad Ali Ibrahim, Waqar Mahmood, Andreas Dengel, ...

Keyword(s): Deep Learning, Document Classification, Text Document, Text Document Classification, Benchmarking Performance

Arabic Sentiment Analysis Using Deep Learning and Ensemble Methods

Arabian Journal for Science and Engineering, 2021, DOI 10.1007/s13369-021-05475-0

Author(s): Amal Alharbi, Manal Kalkatawi, Mounira Taileb

Keyword(s): Deep Learning, Sentiment Analysis, Ensemble Methods, Arabic Sentiment Analysis

Standard and Dialectal Arabic Text Classification for Sentiment Analysis

Model and Data Engineering - Lecture Notes in Computer Science, 2018, pp. 282-291, DOI 10.1007/978-3-030-00856-7_18

Cited by ~ 1

Author(s): Mohcine Maghfour, Abdeljalil Elouardighi

Keyword(s): Sentiment Analysis, Text Classification, Arabic Text, Dialectal Arabic, Arabic Text Classification

Multi-Label Arabic Text Classification Based On Deep Learning

2021 12th International Conference on Information and Communication Systems (ICICS), 2021, DOI 10.1109/icics52457.2021.9464538

Author(s): Batool Alsukhni

Keyword(s): Deep Learning, Text Classification, Arabic Text, Arabic Text Classification

Hybrid Neural Architecture for Intelligent Recommender System Classification Unit Design

Intelligent Techniques in Recommendation Systems, 2013, pp. 192-213, DOI 10.4018/978-1-4666-2542-6.ch010

Author(s): Emmanuel Buabin

Keyword(s): Recommender System, Document Classification, Research Field, Neural Systems, Fully Integrated, Text Document, Unit Design, Boosting Algorithms, New Research, Text Document Classification

The objective is to design the classification unit of an intelligent recommender system using hybrid neural techniques. In particular, a neuroscience-based hybrid neural classifier by Buabin (2011a) is introduced, explained, and examined for its potential in real-world text document classification on the ModApte version of the Reuters news text corpus. This neuroscience-based model (termed Hy-RNC) is fully integrated with a novel boosting algorithm to improve text document classification. Hy-RNC is reported to outperform existing works and to open up an entirely new research field in machine learning. The main contribution of this book chapter is a step-by-step approach to modeling the hybrid system using underlying concepts such as boosting algorithms, recurrent neural networks, and hybrid neural systems. The experimental results show impressive performance by the hybrid neural classifier, even with a minimal number of neurons in its constituent structures.

A Novel Inherent Distinguishing Feature Selector for Highly Skewed Text Document Classification

Arabian Journal for Science and Engineering, 2020, Vol 45 (12), pp. 10471-10491, DOI 10.1007/s13369-020-04763-5

Author(s): Muhammad Sajid Ali, Kashif Javed

Keyword(s): Document Classification, Text Document, Feature Selector, Text Document Classification

Hindi Text Document Classification System Using SVM and Fuzzy

International Journal of Rough Sets and Data Analysis, 2018, Vol 5 (4), pp. 1-31, DOI 10.4018/ijrsda.2018100101

Cited by ~ 8

Author(s): Shalini Puri, Satya Prakash Singh

Keyword(s): Classification System, Character Recognition, Optical Character Recognition, Document Classification, Data Availability, Support Vector, Handwritten Documents, Text Document, Survey Report, Text Document Classification

In recent years, many information retrieval, character recognition, and feature extraction methodologies have been proposed for Devanagari, and especially for Hindi, across different domains. Given the enormous amount of scanned data now available, and in order to advance existing Hindi automated systems beyond optical character recognition, a new Hindi printed and handwritten document classification system using a support vector machine and fuzzy logic is introduced. The system first pre-processes textual document images and then classifies them into predefined categories. Building on this concept, the article presents a feasibility study of such systems for Hindi, a survey report of statistical measurements of Hindi keywords obtained from different sources, and the inherent challenges found in printed and handwritten documents. Technical reviews are provided and represented graphically to compare the parameters, content types, document forms, and classifiers used in various existing techniques.
