scholarly journals Potent pairing: ensemble of long short-term memory networks and support vector machine for chemical-protein relation extraction

Database ◽  
2018 ◽  
Vol 2018 ◽  
Author(s):  
Farrokh Mehryary ◽  
Jari Björne ◽  
Tapio Salakoski ◽  
Filip Ginter
Author(s):  
Ralph Sherwin A. Corpuz ◽  

Analyzing natural language-based Customer Satisfaction (CS) is a tedious process. This issue is practically true if one is to manually categorize large datasets. Fortunately, the advent of supervised machine learning techniques has paved the way toward the design of efficient categorization systems used for CS. This paper presents the feasibility of designing a text categorization model using two popular and robust algorithms – the Support Vector Machine (SVM) and Long Short-Term Memory (LSTM) Neural Network, in order to automatically categorize complaints, suggestions, feedbacks, and commendations. The study found that, in terms of training accuracy, SVM has best rating of 98.63% while LSTM has best rating of 99.32%. Such results mean that both SVM and LSTM algorithms are at par with each other in terms of training accuracy, but SVM is significantly faster than LSTM by approximately 35.47s. The training performance results of both algorithms are attributed on the limitations of the dataset size, high-dimensionality of both English and Tagalog languages, and applicability of the feature engineering techniques used. Interestingly, based on the results of actual implementation, both algorithms are found to be 100% effective in accurately predicting the correct CS categories. Hence, the extent of preference between the two algorithms boils down on the available dataset and the skill in optimizing these algorithms through feature engineering techniques and in implementing them toward actual text categorization applications.


Author(s):  
Iin Kurniasari ◽  
Kusrini Kusrini ◽  
Hanif Al Fatta

Perkembangan teknologi dewasa ini mendorong masyarakat untuk selalu tanggap teknologi, terlebih di era pandemi covid-19 yang selalu mengedepankan social distancing. Media sosial digunakan sebagai suatu alat untuk menyampaikan opini masyarakat kepada khalayak. Dalam penelitian ini, penulis melakukan penelitian tentang opini masyaraat pada media sosial instagram dengan mengguakan Support Vector Machine. Setelah dilakukan uji akurasi dan presisi ternyata SVM belum sesuai digunakan sebagai algoritma yang dapat menangkap urutan karena susunan kata yang dibolak-balik meskipun maknanya berbeda tetap bermakna sama oleh mesin SVM, hal ini dibuktikan juga dengan jumlah akurasi yang kecil.yaitu 59%. Sehingga diperlukan langkah untuk bisa diteliti dengan algoritma lain misalnya algoritma HRRN (Highest Response Ratio Next) atau LSTM (Long Short-Term Memory) yang memperhatikan urutan dan proses dengan rasio respon paling tinggi. Jika berdasarkan pendekatan ekstraksi fitur SVM dengan pendekatan count vector, tf-idf word level, tf-idf ngram level dan tf-idf char level. Dalam skenario ini nilai akurasi tertinggi terdapat pada perhitungan dengan menggunakan ekstraksi fitur count vector dan tf-idf ngram level.


2020 ◽  
Vol 9 (4) ◽  
pp. 365-374
Author(s):  
Sri Suning Kusumawardani ◽  
Syukron Abu Ishaq Alfarozi

Pada saat ini, penyelenggaraan sistem pembelajaran daring menjadi hal yang penting di tengah pandemi untuk menekan persebaran virus COVID-19. Namun, sistem ini sangat sulit menjaga motivasi dan tingkat keterlibatan mahasiswa karena tidak ada interaksi langsung antara pengajar dengan mahasiswa. Makalah ini meninjau penggunaan data log mahasiswa untuk kebutuhan analisis pembelajaran guna memprediksi kinerja atau kecenderungan drop-out mahasiswa dari suatu mata kuliah dengan melihat pada data log interaksi mahasiswa dengan sistem dan data demografis mahasiswa menggunakan suatu data terbuka, yaitu Open University Learning Analytics Dataset (OULAD). Dari tinjauan beberapa artikel penelitian yang merujuk pada dataset tersebut, ada beberapa hal yang perlu ditinjau: 1) permasalahan yang sering diangkat, yaitu prediksi kecenderungan gagal dari mata kuliah tertentu, prediksi kinerja, dan prediksi keterlibatan mahasiswa; 2) fitur yang digunakan pada saat pemodelan, yaitu fitur demografis dan interaksi, baik yang diringkas secara harian atau mingguan dengan berbagai representasi fitur; 3) metode analisis pembelajaran yang secara khusus menggunakan metode pembelajaran mesin yang sering digunakan, yaitu Decision Tree (DT), Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), dan Long Short-Term Memory (LSTM). Makalah ini juga mendiskusikan proses mitigasi dari mahasiswa yang berisiko, perancangan sistem data yang mendukung analisis pembelajaran, dan permasalahan yang sering ditemui pada saat proses pemodelan.


2020 ◽  
Vol 8 (1) ◽  
pp. 50
Author(s):  
Abdulbasit K. Al-Talabani

The recognition of the poetry meter in spoken lines is a natural language processing application that aims to identify a stressed and unstressed syllabic pattern in a line of a poem. Stateof-the-art studies include few works on the automatic recognition of Arud meters, all of which are text-based models, and none is voice based. Poetry meter recognition is not easy for an ordinary reader, it is very difficult for the listener and it is usually performed manually by experts. This paper proposes a model to detect the poetry meter from a single spoken line (“Bayt”) of an Arabic poem. Data of 230 samples collected from 10 poems of Arabic poetry, including three meters read by two speakers, are used in this work. The work adopts the extraction of linear prediction cepstrum coefficient and Mel frequency cepstral coefficient (MFCC) features, as a time series input to the proposed long short-term memory (LSTM) classifier, in addition to a global feature set that is computed using some statistics of the features across all of the frames to feed the support vector machine (SVM) classifier. The results show that the SVM model achieves the highest accuracy in the speakerdependent approach. It improves results by 3%, as compared to the state-of-the-art studies, whereas for the speaker-independent approach, the MFCC feature using LSTM exceeds the other proposed models.


Energies ◽  
2021 ◽  
Vol 14 (21) ◽  
pp. 7269
Author(s):  
Grzegorz Kłosowski ◽  
Tomasz Rymarczyk ◽  
Konrad Niderla ◽  
Magdalena Rzemieniak ◽  
Artur Dmowski ◽  
...  

Electrical tomography is a non-invasive method of monitoring the interior of objects, which is used in various industries. In particular, it is possible to monitor industrial processes inside reactors and tanks using tomography. Tomography enables real-time observation of crystals or gas bubbles growing in a liquid. However, obtaining high-resolution tomographic images is problematic because it involves solving the so-called ill-posed inverse problem. Noisy input data cause problems, too. Therefore, the use of appropriate hardware solutions to eliminate this phenomenon is necessary. An important cause of obtaining accurate tomographic images may also be the incorrect selection of algorithmic methods used to convert the measurements into the output images. In a dynamically changing environment of a tank reactor, selecting the optimal algorithmic method used to create a tomographic image becomes an optimization problem. This article presents the machine learning method’s original concept of intelligent selection depending on the reconstructed case. The long short-term memory network was used to classify the methods to choose one of the five homogenous methods—elastic net, linear regression with the least-squares learner, linear regression with support vector machine learner, support vector machine model, or artificial neural networks. In the presented research, tomographic images of selected measurement cases, reconstructed using five methods, were compared. Then, the selection methods’ accuracy was verified thanks to the long short-term memory network used as a classifier. The results proved that the new concept of long short-term memory classification ensures better tomographic reconstructions efficiency than imaging all measurement cases with single homogeneous methods.


Renewable energy has recently gained considerable attention. In particular, interest in wind energy is rapidly increasing globally. However, the characteristics of instability and volatility in wind energy systems also have a significant on power systems. To address these issues, numerous studies have been carried out to predict wind speed and power. Methods used to forecast wind energy are divided into three categories: physical, data-driven (statistical and artificial intelligence methods), and hybrid methods. In this study, among artificial intelligence methods, we compare short-term wind power using a support vector machine (SVM) and long short-term memory (LSTM). The method using an SVM is a short-term wind power forecast that considers the wind speed and direction on Jeju Island, whereas the method using LSTM does not consider the wind speed and direction. As the experiment results indicate, the SVM method achieves an excellent performance when considering the wind speed and direction.


2021 ◽  
pp. 016555152110065
Author(s):  
Rahma Alahmary ◽  
Hmood Al-Dossari

Sentiment analysis (SA) aims to extract users’ opinions automatically from their posts and comments. Almost all prior works have used machine learning algorithms. Recently, SA research has shown promising performance in using the deep learning approach. However, deep learning is greedy and requires large datasets to learn, so it takes more time for data annotation. In this research, we proposed a semiautomatic approach using Naïve Bayes (NB) to annotate a new dataset in order to reduce the human effort and time spent on the annotation process. We created a dataset for the purpose of training and testing the classifier by collecting Saudi dialect tweets. The dataset produced from the semiautomatic model was then used to train and test deep learning classifiers to perform Saudi dialect SA. The accuracy achieved by the NB classifier was 83%. The trained semiautomatic model was used to annotate the new dataset before it was fed into the deep learning classifiers. The three deep learning classifiers tested in this research were convolutional neural network (CNN), long short-term memory (LSTM) and bidirectional long short-term memory (Bi-LSTM). Support vector machine (SVM) was used as the baseline for comparison. Overall, the performance of the deep learning classifiers exceeded that of SVM. The results showed that CNN reported the highest performance. On one hand, the performance of Bi-LSTM was higher than that of LSTM and SVM, and, on the other hand, the performance of LSTM was higher than that of SVM. The proposed semiautomatic annotation approach is usable and promising to increase speed and save time and effort in the annotation process.


Sign in / Sign up

Export Citation Format

Share Document