Performance Evaluation of Keyword Extraction Methods and Visualization for Student Online Comments

Topic keyword extraction (as a typical task in information retrieval) refers to extracting the core keywords from document topics. In an online environment, students often post comments in subject forums. The automatic and accurate extraction of keywords from these comments are beneficial to lecturers (particular when it comes to repeatedly delivered subjects). In this paper, we compare the performance of traditional machine learning algorithms and two deep learning methods in extracting topic keywords from student comments posted in subject forums. For this purpose, we collected student comment data from a period of two years, manually tagging part of the raw data for our experiments. Based on this dataset, we comprehensively compared the five typical algorithms of naïve Bayes, logistic regression, support vector machine, convolutional neural networks, and Long Short-Term Memory with Attention (Att-LSTM). The performances were measured by the four evaluation metrics. We further examined the keywords by visualization. From the results of our experiment and visualization, we conclude that the Att-LSTM method is the best approach for topic keyword extraction from student comments. Further, the results from the algorithms and visualization are symmetry, to some degree. In particular, the extracted topics from the comments posted at the same stages of different teaching sessions are, almost, reflection symmetry.

Download Full-text

Using Machine Learning Algorithms on Prediction of Stock Price

Journal of Modeling and Optimization ◽

10.32732/jmo.2020.12.2.84 ◽

2020 ◽

Vol 12 (2) ◽

pp. 84-99

Author(s):

Li-Pang Chen

Keyword(s):

Machine Learning ◽

Stock Price ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Support Vector ◽

Short Term ◽

Learning Techniques ◽

Historical Database ◽

Long Short Term Memory

In this paper, we investigate analysis and prediction of the time-dependent data. We focus our attention on four different stocks are selected from Yahoo Finance historical database. To build up models and predict the future stock price, we consider three different machine learning techniques including Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN) and Support Vector Regression (SVR). By treating close price, open price, daily low, daily high, adjusted close price, and volume of trades as predictors in machine learning methods, it can be shown that the prediction accuracy is improved.

Download Full-text

A semiautomatic annotation approach for sentiment analysis

Journal of Information Science ◽

10.1177/01655515211006594 ◽

2021 ◽

pp. 016555152110065

Author(s):

Rahma Alahmary ◽

Hmood Al-Dossari

Keyword(s):

Deep Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Support Vector ◽

Short Term ◽

Term Memory ◽

Annotation Process ◽

Learning Classifiers ◽

Long Short Term Memory

Sentiment analysis (SA) aims to extract users’ opinions automatically from their posts and comments. Almost all prior works have used machine learning algorithms. Recently, SA research has shown promising performance in using the deep learning approach. However, deep learning is greedy and requires large datasets to learn, so it takes more time for data annotation. In this research, we proposed a semiautomatic approach using Naïve Bayes (NB) to annotate a new dataset in order to reduce the human effort and time spent on the annotation process. We created a dataset for the purpose of training and testing the classifier by collecting Saudi dialect tweets. The dataset produced from the semiautomatic model was then used to train and test deep learning classifiers to perform Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested in this research were convolutional neural network (CNN), long short-term memory (LSTM) and bidirectional long short-term memory (Bi-LSTM). Support vector machine (SVM) was used as the baseline for comparison. Overall, the performance of the deep learning classifiers exceeded that of SVM. The results showed that CNN reported the highest performance. On one hand, the performance of Bi-LSTM was higher than that of LSTM and SVM, and, on the other hand, the performance of LSTM was higher than that of SVM. The proposed semiautomatic annotation approach is usable and promising to increase speed and save time and effort in the annotation process.

Download Full-text

Multi-Step-Ahead Prediction with Long Short Term Memory Networks and Support Vector Regression

2018 37th Chinese Control Conference (CCC) ◽

10.23919/chicc.2018.8484066 ◽

2018 ◽

Cited By ~ 3

Author(s):

Sai Li ◽

Huajing Fang ◽

Bing Shi

Keyword(s):

Support Vector Regression ◽

Short Term Memory ◽

Support Vector ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

Categorizing Natural Language-Based Customer Satisfaction: An Implementation Method Using Support Vector Machine and Long Short-Term Memory Neural Network

International Journal of Integrated Engineering ◽

10.30880/ijie.2021.13.04.007 ◽

2021 ◽

Vol 13 (4) ◽

Author(s):

Ralph Sherwin A. Corpuz ◽

Keyword(s):

Neural Network ◽

Support Vector Machine ◽

Natural Language ◽

Text Categorization ◽

Short Term Memory ◽

Support Vector ◽

Feature Engineering ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Analyzing natural language-based Customer Satisfaction (CS) is a tedious process. This issue is practically true if one is to manually categorize large datasets. Fortunately, the advent of supervised machine learning techniques has paved the way toward the design of efficient categorization systems used for CS. This paper presents the feasibility of designing a text categorization model using two popular and robust algorithms – the Support Vector Machine (SVM) and Long Short-Term Memory (LSTM) Neural Network, in order to automatically categorize complaints, suggestions, feedbacks, and commendations. The study found that, in terms of training accuracy, SVM has best rating of 98.63% while LSTM has best rating of 99.32%. Such results mean that both SVM and LSTM algorithms are at par with each other in terms of training accuracy, but SVM is significantly faster than LSTM by approximately 35.47s. The training performance results of both algorithms are attributed on the limitations of the dataset size, high-dimensionality of both English and Tagalog languages, and applicability of the feature engineering techniques used. Interestingly, based on the results of actual implementation, both algorithms are found to be 100% effective in accurately predicting the correct CS categories. Hence, the extent of preference between the two algorithms boils down on the available dataset and the skill in optimizing these algorithms through feature engineering techniques and in implementing them toward actual text categorization applications.

Download Full-text

Short-Term Building Load Forecast Based on Patch Learning with Long Short-Term Memory Network and Support Vector Regression

10.1007/978-981-16-5188-5_53 ◽

2021 ◽

pp. 742-756

Author(s):

Zhaohui Dan ◽

Bo Wang ◽

Huijin Fan ◽

Lei Liu

Keyword(s):

Support Vector Regression ◽

Short Term Memory ◽

Support Vector ◽

Short Term ◽

Term Memory ◽

Load Forecast ◽

Memory Network ◽

Building Load ◽

Long Short Term Memory

Download Full-text

Pressure prediction and abnormal working conditions detection of water supply network based on LSTM

Water Science & Technology Water Supply ◽

10.2166/ws.2020.013 ◽

2020 ◽

Vol 20 (3) ◽

pp. 963-974 ◽

Cited By ~ 2

Author(s):

Zhe Xu ◽

Zhihao Ying ◽

Yuquan Li ◽

Bishi He ◽

Yun Chen

Keyword(s):

Water Supply ◽

Short Term Memory ◽

Learning Performance ◽

Support Vector ◽

Supply Network ◽

Short Term ◽

Water Supply Network ◽

Vector Machines ◽

Long Short Term Memory ◽

Deep Learning Model

Abstract In this study, a deep learning model based on LSTM (Long Short-Term Memory) is used to predict the state of a water supply network due to its highly complex nonlinearity. The inputs of the model include state information on the pressures at measuring points, as well as control information on the water supply pressure and flow at each entry point. In order to enhance the performance of the model in feature extraction and identification and improve prediction accuracy, a parallel LSTM tandem DNN deep neural network model (PLDNN) is proposed. The experimental results indicate that the model has better learning performance and accuracy compared with traditional prediction methods (artificial neural networks, support vector machines, etc.) and general LSTM models.

Download Full-text

Potent pairing: ensemble of long short-term memory networks and support vector machine for chemical-protein relation extraction

Database ◽

10.1093/database/bay120 ◽

2018 ◽

Vol 2018 ◽

Cited By ~ 3

Author(s):

Farrokh Mehryary ◽

Jari Björne ◽

Tapio Salakoski ◽

Filip Ginter

Keyword(s):

Support Vector Machine ◽

Short Term Memory ◽

Relation Extraction ◽

Support Vector ◽

Short Term ◽

Term Memory ◽

Long Short Term Memory

Download Full-text

An Improved Long Short-Term Memory Model for Dam Displacement Prediction

Mathematical Problems in Engineering ◽

10.1155/2019/6792189 ◽

2019 ◽

Vol 2019 ◽

pp. 1-14 ◽

Cited By ~ 3

Author(s):

Jun Zhang ◽

Xiyao Cao ◽

Jiemin Xie ◽

Pangao Kou

Keyword(s):

Short Term Memory ◽

Water Pressure ◽

Extreme Temperature ◽

Regression Tree ◽

Support Vector ◽

Temperature Structure ◽

Short Term ◽

Boosted Regression Tree ◽

Term Memory ◽

Long Short Term Memory

Displacement plays a vital role in dam safety monitoring data, which adequately responds to security risks such as the flood water pressure, extreme temperature, structure deterioration, and bottom bedrock damage. To make accurate predictions, former researchers established various models. However, these models’ input variables cannot efficiently reflect the delays between the external environment and displacement. Therefore, a long short-term memory (LSTM) model is proposed to make full use of the historical data to reflect the delays. Furthermore, the LSTM model is improved to optimize the performance by making variables more physically reasonable. Finally, a real-world radial displacement dataset is used to compare the performance of LSTM models, multiple linear regression (MLR), multilayer perceptron (MLP) neural networks, support vector machine (SVM), and boosted regression tree (BRT). The results indicate that (1) the LSTM models can efficiently reflect the delays and make the variables selection more convenient and (2) the improved LSTM model achieves the best performance by optimizing the input form and network structure based on a clearer physical meaning.

Download Full-text