Local Latent Semantic Analysis Based on Support Vector Machine for Imbalanced Text Categorization

At present, in the mainstream sentiment analysis methods represented by the Support Vector Machine, the vocabulary and the latent semantic information involved in the text are not well considered, and sentiment analysis of text is dependent overly on the statistics of sentiment words. Thus, a Fisher kernel function based on Probabilistic Latent Semantic Analysis is proposed in this paper for sentiment analysis by Support Vector Machine. The Fisher kernel function based on the model is derived from the Probabilistic Latent Semantic Analysis model. By means of this method, latent semantic information involving the probability characteristics can be used as the classification characteristics, along with the improvement of the effect of classification for support vector machine, and the problem of ignoring the latent semantic characteristics in text sentiment analysis can be addressed. The results show that the effect of the method proposed in this paper, compared with the comparison method, is obviously improved.

Download Full-text

A Method Based on Support Vector Machine for Feature Selection of Latent Semantic Features

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.181-182.830 ◽

2011 ◽

Vol 181-182 ◽

pp. 830-835

Author(s):

Min Song Li

Keyword(s):

Support Vector Machine ◽

Text Categorization ◽

Latent Semantic Indexing ◽

Classification Performance ◽

Compact Representation ◽

Support Vector ◽

Semantic Features ◽

Semantic Indexing ◽

Feature Extraction Method ◽

Feature Subspace

Latent Semantic Indexing(LSI) is an effective feature extraction method which can capture the underlying latent semantic structure between words in documents. However, it is probably not the most appropriate for text categorization to use the method to select feature subspace, since the method orders extracted features according to their variance,not the classification power. We proposed a method based on support vector machine to extract features and select a Latent Semantic Indexing that be suited for classification. Experimental results indicate that the method improves classification performance with more compact representation.

Download Full-text

Solving multi-label text categorization problem using support vector machine approach with membership function

Neurocomputing ◽

10.1016/j.neucom.2011.07.001 ◽

2011 ◽

Vol 74 (17) ◽

pp. 3682-3689 ◽

Cited By ~ 15

Author(s):

Tai-Yue Wang ◽

Huei-Min Chiang

Keyword(s):

Support Vector Machine ◽

Membership Function ◽

Text Categorization ◽

Support Vector

Download Full-text

Pairwise Protein Substring Alignment with Latent Semantic Analysis and Support Vector Machines to Detect Remote Protein Homology

Communications in Computer and Information Science - Ubiquitous Computing and Multimedia Applications ◽

10.1007/978-3-642-20998-7_60 ◽

2011 ◽

pp. 526-546 ◽

Cited By ~ 2

Author(s):

Surayati Ismail ◽

Razib M. Othman ◽

Shahreen Kasim

Keyword(s):

Support Vector Machines ◽

Latent Semantic Analysis ◽

Semantic Analysis ◽

Support Vector ◽

Protein Homology ◽

Vector Machines

Download Full-text

Categorizing Natural Language-Based Customer Satisfaction: An Implementation Method Using Support Vector Machine and Long Short-Term Memory Neural Network

International Journal of Integrated Engineering ◽

10.30880/ijie.2021.13.04.007 ◽

2021 ◽

Vol 13 (4) ◽

Author(s):

Ralph Sherwin A. Corpuz ◽

Keyword(s):

Neural Network ◽

Support Vector Machine ◽

Natural Language ◽

Text Categorization ◽

Short Term Memory ◽

Support Vector ◽

Feature Engineering ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Analyzing natural language-based Customer Satisfaction (CS) is a tedious process. This issue is practically true if one is to manually categorize large datasets. Fortunately, the advent of supervised machine learning techniques has paved the way toward the design of efficient categorization systems used for CS. This paper presents the feasibility of designing a text categorization model using two popular and robust algorithms – the Support Vector Machine (SVM) and Long Short-Term Memory (LSTM) Neural Network, in order to automatically categorize complaints, suggestions, feedbacks, and commendations. The study found that, in terms of training accuracy, SVM has best rating of 98.63% while LSTM has best rating of 99.32%. Such results mean that both SVM and LSTM algorithms are at par with each other in terms of training accuracy, but SVM is significantly faster than LSTM by approximately 35.47s. The training performance results of both algorithms are attributed on the limitations of the dataset size, high-dimensionality of both English and Tagalog languages, and applicability of the feature engineering techniques used. Interestingly, based on the results of actual implementation, both algorithms are found to be 100% effective in accurately predicting the correct CS categories. Hence, the extent of preference between the two algorithms boils down on the available dataset and the skill in optimizing these algorithms through feature engineering techniques and in implementing them toward actual text categorization applications.

Download Full-text

Predicting Students Grades Using Artificial Neural Networks and Support Vector Machine

Advanced Methodologies and Technologies in Modern Education Delivery - Advances in Educational Technologies and Instructional Design ◽

10.4018/978-1-5225-7365-4.ch059 ◽

2019 ◽

pp. 751-766

Author(s):

Sajid Umair ◽

Muhammad Majid Sharif

Keyword(s):

Neural Networks ◽

Support Vector Machine ◽

Artificial Neural Networks ◽

Student Performance ◽

Semantic Analysis ◽

Vital Role ◽

Support Vector ◽

Data Set ◽

Important Research Topic ◽

Artificial Neural

Prediction of student performance on the basis of habits has been a very important research topic in academics. Studies show that selection of the correct data set also plays a vital role in these predictions. In this chapter, the authors took data from different schools that contains student habits and their comments, analyzed it using latent semantic analysis to get semantics, and then used support vector machine to classify the data into two classes, important for prediction and not important. Finally, they used artificial neural networks to predict the grades of students. Regression was also used to predict data coming from support vector machine, while giving only the important data for prediction.

Download Full-text