scholarly journals Implementasi Long Short-Term Memory untuk Identifikasi Berita Hoax Berbahasa Inggris pada Media Sosial

Author(s):  
Jasman Pardede ◽  
Raka Gemi Ibrahim

Hoax atau berita palsu menyebar sangat cepat di media sosial. Berita itu dapat memengaruhi pembaca dan menjadi racun pikiran. Masalah seperti ini harus diselesaikan secara strategis untuk mengidentifikasi berita yang dibaca yang disebarluaskan di media sosial. Beberapa metode yang diusulkan untuk memprediksi tipuan adalah menggunakan Support Vector Classifier, Logistic Regression, dan MultinomialNaiveBayes. Dalam studi ini, para peneliti menerapkan Long Short-Term Memory untuk mengidentifikasi hoax. Kinerja sistem diukur berdasarkan nilai precision, recall, accuracy, dan F-Measure. Berdasarkan hasil eksperimen yang dilakukan pada data tipuan diperoleh nilai rata-rata precision, recall, accuracy, dan F-Measure masing-masing 0,94, 0,96, 0,94, dan 0,95. Berdasarkan hasil eksperimen ditemukan bahwa Long Short-Term Memory yang diusulkan memiliki kinerja yang lebih baik dibandingkan dengan metode sebelumnya.

2021 ◽  
pp. 016555152110065
Author(s):  
Rahma Alahmary ◽  
Hmood Al-Dossari

Sentiment analysis (SA) aims to extract users’ opinions automatically from their posts and comments. Almost all prior works have used machine learning algorithms. Recently, SA research has shown promising performance in using the deep learning approach. However, deep learning is greedy and requires large datasets to learn, so it takes more time for data annotation. In this research, we proposed a semiautomatic approach using Naïve Bayes (NB) to annotate a new dataset in order to reduce the human effort and time spent on the annotation process. We created a dataset for the purpose of training and testing the classifier by collecting Saudi dialect tweets. The dataset produced from the semiautomatic model was then used to train and test deep learning classifiers to perform Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested in this research were convolutional neural network (CNN), long short-term memory (LSTM) and bidirectional long short-term memory (Bi-LSTM). Support vector machine (SVM) was used as the baseline for comparison. Overall, the performance of the deep learning classifiers exceeded that of SVM. The results showed that CNN reported the highest performance. On one hand, the performance of Bi-LSTM was higher than that of LSTM and SVM, and, on the other hand, the performance of LSTM was higher than that of SVM. The proposed semiautomatic annotation approach is usable and promising to increase speed and save time and effort in the annotation process.


Author(s):  
Ralph Sherwin A. Corpuz ◽  

Analyzing natural language-based Customer Satisfaction (CS) is a tedious process. This issue is practically true if one is to manually categorize large datasets. Fortunately, the advent of supervised machine learning techniques has paved the way toward the design of efficient categorization systems used for CS. This paper presents the feasibility of designing a text categorization model using two popular and robust algorithms – the Support Vector Machine (SVM) and Long Short-Term Memory (LSTM) Neural Network, in order to automatically categorize complaints, suggestions, feedbacks, and commendations. The study found that, in terms of training accuracy, SVM has best rating of 98.63% while LSTM has best rating of 99.32%. Such results mean that both SVM and LSTM algorithms are at par with each other in terms of training accuracy, but SVM is significantly faster than LSTM by approximately 35.47s. The training performance results of both algorithms are attributed on the limitations of the dataset size, high-dimensionality of both English and Tagalog languages, and applicability of the feature engineering techniques used. Interestingly, based on the results of actual implementation, both algorithms are found to be 100% effective in accurately predicting the correct CS categories. Hence, the extent of preference between the two algorithms boils down on the available dataset and the skill in optimizing these algorithms through feature engineering techniques and in implementing them toward actual text categorization applications.


2019 ◽  
Vol 2019 ◽  
pp. 1-14 ◽  
Author(s):  
Jun Zhang ◽  
Xiyao Cao ◽  
Jiemin Xie ◽  
Pangao Kou

Displacement plays a vital role in dam safety monitoring data, which adequately responds to security risks such as the flood water pressure, extreme temperature, structure deterioration, and bottom bedrock damage. To make accurate predictions, former researchers established various models. However, these models’ input variables cannot efficiently reflect the delays between the external environment and displacement. Therefore, a long short-term memory (LSTM) model is proposed to make full use of the historical data to reflect the delays. Furthermore, the LSTM model is improved to optimize the performance by making variables more physically reasonable. Finally, a real-world radial displacement dataset is used to compare the performance of LSTM models, multiple linear regression (MLR), multilayer perceptron (MLP) neural networks, support vector machine (SVM), and boosted regression tree (BRT). The results indicate that (1) the LSTM models can efficiently reflect the delays and make the variables selection more convenient and (2) the improved LSTM model achieves the best performance by optimizing the input form and network structure based on a clearer physical meaning.


Author(s):  
Iin Kurniasari ◽  
Kusrini Kusrini ◽  
Hanif Al Fatta

Perkembangan teknologi dewasa ini mendorong masyarakat untuk selalu tanggap teknologi, terlebih di era pandemi covid-19 yang selalu mengedepankan social distancing. Media sosial digunakan sebagai suatu alat untuk menyampaikan opini masyarakat kepada khalayak. Dalam penelitian ini, penulis melakukan penelitian tentang opini masyaraat pada media sosial instagram dengan mengguakan Support Vector Machine. Setelah dilakukan uji akurasi dan presisi ternyata SVM belum sesuai digunakan sebagai algoritma yang dapat menangkap urutan karena susunan kata yang dibolak-balik meskipun maknanya berbeda tetap bermakna sama oleh mesin SVM, hal ini dibuktikan juga dengan jumlah akurasi yang kecil.yaitu 59%. Sehingga diperlukan langkah untuk bisa diteliti dengan algoritma lain misalnya algoritma HRRN (Highest Response Ratio Next) atau LSTM (Long Short-Term Memory) yang memperhatikan urutan dan proses dengan rasio respon paling tinggi. Jika berdasarkan pendekatan ekstraksi fitur SVM dengan pendekatan count vector, tf-idf word level, tf-idf ngram level dan tf-idf char level. Dalam skenario ini nilai akurasi tertinggi terdapat pada perhitungan dengan menggunakan ekstraksi fitur count vector dan tf-idf ngram level.


Author(s):  
Satria Wiro Agung ◽  
◽  
Kelvin Supranata Wangkasa Rianto ◽  
Antoni Wibowo

- Foreign Exchange (Forex) is the exchange / trading of currencies from different countries with the aim of making profit. Exchange rates on Forex markets are always changing and it is hard to predict. Many factors affect exchange rates of certain currency pairs like inflation rates, interest rates, government debt, term of trade, political stability of certain countries, recession and many more. Uncertainty in Forex prediction can be reduced with the help of technology by using machine learning. There are many machine learning methods that can be used when predicting Forex. The methods used in this paper are Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU), Support Vector Regression (SVR). XGBOOST, and ARIMA. The outcome of this paper will be comparison results that show how other major currency pairs have influenced the performance and accuracy of different methods. From the results, it was proven that XGBoost outperformed other models by 0.36% compared to ARIMA model, 4.4% compared to GRU model, 8% compared to LSTM model, 9.74% compared to SVR model. Keywords— Forex Forecasting, Long Short Term Memory, Gated Recurrent Unit, Support Vector Regression, ARIMA, Extreme Gradient Boosting


Sign in / Sign up

Export Citation Format

Share Document