An Empirical Evaluation Of Lazy Learning Classifiers For Text Categorization

2012 ◽  
Vol 001 (001) ◽  
pp. 12-15
Author(s):  
P. Umar Sathic Ali ◽  
◽  
C. Jothi Venkateswaran ◽  
Author(s):  
Ali Bou Nassif ◽  
Abdollah Masoud Darya ◽  
Ashraf Elnagar

This work presents a detailed comparison of the performance of deep learning models such as convolutional neural networks, long short-term memory, gated recurrent units, their hybrids, and a selection of shallow learning classifiers for sentiment analysis of Arabic reviews. Additionally, the comparison includes state-of-the-art models such as the transformer architecture and the araBERT pre-trained model. The datasets used in this study are multi-dialect Arabic hotel and book review datasets, which are some of the largest publicly available datasets for Arabic reviews. Results showed deep learning outperforming shallow learning for binary and multi-label classification, in contrast with the results of similar work reported in the literature. This discrepancy in outcome was caused by dataset size as we found it to be proportional to the performance of deep learning models. The performance of deep and shallow learning techniques was analyzed in terms of accuracy and F1 score. The best performing shallow learning technique was Random Forest followed by Decision Tree, and AdaBoost. The deep learning models performed similarly using a default embedding layer, while the transformer model performed best when augmented with araBERT.


Author(s):  
G. L. GENTILI ◽  
M. MARINILLI ◽  
A. MICARELLI ◽  
F. SCIARRONE

This paper presents a text categorization system, capable of analyzing HTML/text documents collected from the Web. The system is a component of a more extensive intelligent agent for adaptive information filtering on the Web. It is based on a hybrid case-based architecture, where two multilayer perceptrons are integrated into a case-based reasoner. An empirical evaluation of the system was performed by means of a confidence interval technique. The experimental results obtained are encouraging and support the choice of a hybrid case-based approach to text categorization.


Author(s):  
Ankita Dhar ◽  
Himadri Mukherjee ◽  
Sk. Md. Obaidullah ◽  
K. C. Santosh ◽  
Niladri Sekhar Dash ◽  
...  

1986 ◽  
Vol 47 (7) ◽  
pp. 1149-1154
Author(s):  
Le Quang Rang ◽  
D. Voslamber

Sign in / Sign up

Export Citation Format

Share Document