A systematic review on techniques of feature selection and classification for text mining

This study aims to clarify tweets on twitter using the Support Vector Machine and Information Gain methods. The clarification itself aims to find a hyperplane that separates the negative and positive classes. In the research stage, there is a system process, namely text mining, text processing which has stages of tokenizing, filtering, stemming, and term weighting. After that, a feature selection is made by information gain which calculates the entropy value of each word. After that, clarify based on the features that have been selected and the output is in the form of identifying whether the tweet is bully or not. The results of this study found that the Support Vector Machine and Information Gain methods have sufficiently maximum results.

Download Full-text

Particle Swarm Optimization Based Two-Stage Feature Selection in Text Mining

2018 IEEE Congress on Evolutionary Computation (CEC) ◽

10.1109/cec.2018.8477773 ◽

2018 ◽

Cited By ~ 7

Author(s):

Xiaohan Bai ◽

Xiaoying Gao ◽

Bing Xue

Keyword(s):

Feature Selection ◽

Particle Swarm Optimization ◽

Text Mining ◽

Particle Swarm ◽

Two Stage ◽

Swarm Optimization

Download Full-text

Feature Selection Optimization for Highlighting Opinions Using Supervised and Unsupervised Learning on Arabic Language

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2021/251022021 ◽

2021 ◽

Vol 10 (2) ◽

pp. 636-642

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Text Mining ◽

Unsupervised Learning ◽

Arabic Language ◽

Machine Learning Techniques ◽

Business Decision ◽

Supervised And Unsupervised Learning ◽

Proposed Model ◽

The Impact

Text mining utilizes machine learning (ML) and natural language processing (NLP) for text implicit knowledge recognition, such knowledge serves many domains as translation, media searching, and business decision making. Opinion mining (OM) is one of the promised text mining fields, which are used for polarity discovering via text and has terminus benefits for business. ML techniques are divided into two approaches: supervised and unsupervised learning, since we herein testified an OM feature selection(FS)using four ML techniques. In this paper, we had implemented number of experiments via four machine learning techniques on the same three Arabic language corpora. This paper aims at increasing the accuracy of opinion highlighting on Arabic language, by using enhanced feature selection approaches. FS proposed model is adopted for enhancing opinion highlighting purpose. The experimental results show the outperformance of the proposed approaches in variant levels of supervisory,i.e. different techniques via distinct data domains. Multiple levels of comparison are carried out and discussed for further understanding of the impact of proposed model on several ML techniques.

Download Full-text

Text mining based on tax comments as big data analysis using SVM and feature selection

2018 International Conference on Information and Communications Technology (ICOIACT) ◽

10.1109/icoiact.2018.8350743 ◽

2018 ◽

Cited By ~ 1

Author(s):

Mihuandayani ◽

Ema Utami ◽

Emha Taufiq Luthfi

Keyword(s):

Feature Selection ◽

Big Data ◽

Data Analysis ◽

Text Mining ◽

Big Data Analysis

Download Full-text

Semantic Scoring Based on Small-World Phenomenon for Feature Selection in Text Mining

Advanced Data Mining and Applications - Lecture Notes in Computer Science ◽

10.1007/11811305_70 ◽

2006 ◽

pp. 636-643

Author(s):

Chong Huang ◽

Yonghong Tian ◽

Tiejun Huang ◽

Wen Gao

Keyword(s):

Feature Selection ◽

Text Mining ◽

Small World

Download Full-text

Optimized Swarm Search-Based Feature Selection for Text Mining in Sentiment Analysis

2015 IEEE International Conference on Data Mining Workshop (ICDMW) ◽

10.1109/icdmw.2015.231 ◽

2015 ◽

Cited By ~ 4

Author(s):

Simon Fong ◽

Elisa Gao ◽

Raymond Wong

Keyword(s):

Feature Selection ◽

Text Mining ◽

Sentiment Analysis ◽

Selection For

Download Full-text

Knowledge discovery out of text data: a systematic review via text mining

Journal of Knowledge Management ◽

10.1108/jkm-11-2017-0517 ◽

2018 ◽

Vol 22 (7) ◽

pp. 1471-1488 ◽

Cited By ~ 11

Author(s):

Antonio Usai ◽

Marco Pironti ◽

Monika Mital ◽

Chiraz Aouina Mejri

Keyword(s):

Systematic Review ◽

Text Mining ◽

Knowledge Discovery ◽

Research Collaboration ◽

Collaborative Writing ◽

Web Based ◽

Diverse Range ◽

Content Type ◽

Mining Technique ◽

Database Technology

Purpose The aim of this work is to increase awareness of the potential of the technique of text mining to discover knowledge and further promote research collaboration between knowledge management and the information technology communities. Since its emergence, text mining has involved multidisciplinary studies, focused primarily on database technology, Web-based collaborative writing, text analysis, machine learning and knowledge discovery. However, owing to the large amount of research in this field, it is becoming increasingly difficult to identify existing studies and therefore suggest new topics. Design/methodology/approach This article offers a systematic review of 85 academic outputs (articles and books) focused on knowledge discovery derived from the text mining technique. The systematic review is conducted by applying “text mining at the term level, in which knowledge discovery takes place on a more focused collection of words and phrases that are extracted from and label each document” (Feldman et al., 1998, p. 1). Findings The results revealed that the keywords extracted to be associated with the main labels, id est, knowledge discovery and text mining, can be categorized in two periods: from 1998 to 2009, the term knowledge and text were always used. From 2010 to 2017 in addition to these terms, sentiment analysis, review manipulation, microblogging data and knowledgeable users were the other terms frequently used. Besides this, it is possible to notice the technical, engineering nature of each term present in the first decade. Whereas, a diverse range of fields such as business, marketing and finance emerged from 2010 to 2017 owing to a greater interest in the online environment. Originality/value This is a first comprehensive systematic review on knowledge discovery and text mining through the use of a text mining technique at term level, which offers to reduce redundant research and to avoid the possibility of missing relevant publications.

Download Full-text

Arabic Text Mining a Systematic Review of the Published Literature 2002-2014

2015 International Conference on Cloud Computing (ICCC) ◽

10.1109/cloudcomp.2015.7149632 ◽

2015 ◽

Cited By ~ 4

Author(s):

Hind Al-Mahmoud ◽

Muna Al-Razgan

Keyword(s):

Systematic Review ◽

Text Mining ◽

Arabic Text

Download Full-text