Quadratic tuned kernel parameter in Non-linear support vector machine (SVM) for agarwood oil compounds quality classification

<span>This paper presents an analysis of agarwood oil compound quality classification by tuning the quadratic kernel parameter in a Support Vector Machine (SVM). The experimental work involved agarwood oil samples of low and high quality. The input is the abundances (%) of the agarwood oil compounds and the output is the quality of the oil, either high or low. The input and output data were processed in two stages: i) data processing, which covers normalization, randomization and splitting the data into training and testing datasets (ratio of 80%:20%), and ii) data analysis, which covers SVM development by tuning the quadratic kernel parameter. The training dataset was used to train the SVM model and the testing dataset was used to test the developed SVM model. All analytical work was performed in MATLAB version R2013a. The results showed that the quadratic tuned kernel parameter in the SVM model was successful, since it passed all the performance criteria: accuracy, precision, confusion matrix, sensitivity and specificity. The findings of this paper are vital to agarwood oil research, especially to agarwood oil compound classification systems.</span>
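The preprocessing steps and the quadratic kernel named in this abstract can be sketched as follows. This is only an illustration in Python, not the authors' MATLAB R2013a code. The abstract does not say which normalization was used, so min-max scaling is an assumption, as are the kernel offset `c`, the helper names, and the toy numbers.

```python
import random


def quadratic_kernel(x, y, c=1.0):
    """Polynomial kernel of degree 2: (x . y + c) ** 2 (offset c is assumed)."""
    dot = sum(a * b for a, b in zip(x, y))
    return (dot + c) ** 2


def min_max_normalize(rows):
    """Scale every feature column to [0, 1] (assumed normalization scheme)."""
    cols = list(zip(*rows))
    lows = [min(col) for col in cols]
    highs = [max(col) for col in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def shuffle_and_split(rows, labels, train_fraction=0.8, seed=0):
    """Randomize the samples, then split them into training and testing parts."""
    paired = list(zip(rows, labels))
    random.Random(seed).shuffle(paired)
    cut = int(round(train_fraction * len(paired)))
    return paired[:cut], paired[cut:]


# Toy abundances (%) for two hypothetical compounds; not data from the paper.
abundances = [[1.0, 10.0], [3.0, 20.0], [2.0, 15.0], [1.5, 12.0], [2.5, 18.0],
              [1.2, 11.0], [2.8, 19.0], [2.2, 16.0], [1.8, 13.0], [2.6, 17.0]]
quality = ["low", "high", "high", "low", "high",
           "low", "high", "high", "low", "high"]

train, test = shuffle_and_split(min_max_normalize(abundances), quality)
```

With ten samples the 80%:20% ratio gives eight training and two testing samples. The kernel function would then be evaluated on pairs of normalized training vectors inside an SVM solver, which is not shown here.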


Radial Basis Function (RBF) tuned Kernel Parameter of Agarwood Oil Compound for Quality Classification using Support Vector Machine (SVM)

2018 9th IEEE Control and System Graduate Research Colloquium (ICSGRC) ◽

10.1109/icsgrc.2018.8657524 ◽

2018 ◽

Author(s):

Mohamad Amirul Aiman Ngadilan ◽

Nurlaila Ismail ◽

Mohd Hezri Fazalul Rahiman ◽

Mohd Nasir Taib ◽

Nor Azah Mohd Ali ◽

...

Keyword(s):

Support Vector Machine ◽

Radial Basis Function ◽

Basis Function ◽

Support Vector ◽

Quality Classification ◽

Kernel Parameter ◽

Radial Basis ◽

Agarwood Oil


Opinion mining on newspaper headlines using SVM and NLP

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v9i3.pp2152-2163 ◽

2019 ◽

Vol 9 (3) ◽

pp. 2152 ◽

Cited By ~ 1

Author(s):

Chaudhary Jashubhai Rameshbhai ◽

Joy Paulose

Keyword(s):

Support Vector Machine ◽

Natural Language Processing ◽

Language Processing ◽

Opinion Mining ◽

Confusion Matrix ◽

Support Vector ◽

Text Data ◽

Mining Technique ◽

Svm Model ◽

Linear Svm

<p>Opinion mining, also known as sentiment analysis, is a technique that uses Natural Language Processing (NLP) to classify the outcome from text. Various NLP tools are available for processing text data. Multiple studies have addressed opinion mining for online blogs, Twitter, Facebook, etc. This paper proposes a new opinion mining technique using Support Vector Machine (SVM) and NLP tools on newspaper headlines. Relative words are generated using Stanford CoreNLP and passed to the SVM using a count vectorizer. A comparison of three models using confusion matrices indicates that Tf-idf with linear SVM provides better accuracy for the smaller dataset, while for the larger dataset the SGD and linear SVM model outperforms the other models.</p>
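To make the count-vectorizer and Tf-idf steps concrete, here is a minimal pure-Python sketch. It is not the paper's pipeline: the headlines are invented, tokenization is a plain whitespace split rather than Stanford CoreNLP output, and the unsmoothed `log(N / df)` weighting is one of several Tf-idf variants, chosen here as an assumption.

```python
import math
from collections import Counter


def count_vectorize(docs):
    """Return a sorted vocabulary and one row of term counts per document."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({word for tokens in tokenized for word in tokens})
    rows = []
    for tokens in tokenized:
        counts = Counter(tokens)
        rows.append([counts[word] for word in vocab])
    return vocab, rows


def tf_idf(rows):
    """Weight raw counts by log(N / document frequency) for each term."""
    n_docs = len(rows)
    n_terms = len(rows[0])
    doc_freq = [sum(1 for row in rows if row[j] > 0) for j in range(n_terms)]
    idf = [math.log(n_docs / doc_freq[j]) for j in range(n_terms)]
    return [[row[j] * idf[j] for j in range(n_terms)] for row in rows]


# Invented headlines, for illustration only.
headlines = ["Markets rise", "markets fall"]
vocab, counts = count_vectorize(headlines)
weights = tf_idf(counts)
```

A term that appears in every document, such as "markets" here, gets weight zero under this variant. The resulting rows are the feature vectors a linear classifier would be trained on.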


Bearing Fault Diagnosis Using a Support Vector Machine Optimized by an Improved Ant Lion Optimizer

Shock and Vibration ◽

10.1155/2019/9303676 ◽

2019 ◽

Vol 2019 ◽

pp. 1-20 ◽

Cited By ~ 2

Author(s):

Dalian Yang ◽

Jingjing Miao ◽

Fanyu Zhang ◽

Jie Tao ◽

Guangbin Wang ◽

...

Keyword(s):

Support Vector Machine ◽

Kernel Function ◽

Gaussian Kernel ◽

Support Vector ◽

Model Parameters ◽

Bearing Faults ◽

Ring Fault ◽

Penalty Factor ◽

Kernel Parameter ◽

Svm Model

Bearings are important mechanical components that fail easily in harsh working environments. Support vector machines can be used to diagnose bearing faults; however, the recognition ability of the model is greatly affected by the kernel function and its parameters. Unfortunately, optimal parameters are difficult to select. To address these limitations, an escape mechanism and adaptive convergence conditions were introduced to the ant lion optimizer (ALO) algorithm. The resulting EALO method was applied to select SVM model parameters more accurately. To assess the model, the vibration acceleration signals of normal, inner ring fault, outer ring fault, and ball fault bearings were collected at different rotation speeds (1500 r/min, 1800 r/min, 2100 r/min, and 2400 r/min). The vibration signals were decomposed using the variational mode decomposition (VMD) method. The features were extracted through the kernel function to fuse the energy value of each VMD component. In these experiments, the two most important parameters for the support vector machine, the Gaussian kernel parameter σ and the penalty factor C, were optimized using the EALO algorithm, ALO algorithm, genetic algorithm (GA), and particle swarm optimization (PSO) algorithm. The performance of these four methods in optimizing the two parameters was then compared and analyzed, with the EALO method performing best. The recognition rates for bearing faults at the different tested rotation speeds improved when the SVM model parameters optimized by the EALO were used.
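For reference, the roles of the two tuned parameters can be written out. The abstract does not state the exact formulation used, so the block below assumes one common parameterization of the Gaussian kernel and the standard binary soft-margin SVM; the bearing problem itself has four classes, and multi-class handling is not shown.

```latex
% Gaussian (RBF) kernel with width parameter \sigma (assumed parameterization)
K(\mathbf{x}_i, \mathbf{x}_j)
  = \exp\!\left( -\frac{\lVert \mathbf{x}_i - \mathbf{x}_j \rVert^{2}}{2\sigma^{2}} \right)

% Soft-margin SVM primal problem with penalty factor C
\min_{\mathbf{w},\, b,\, \boldsymbol{\xi}} \;
  \frac{1}{2} \lVert \mathbf{w} \rVert^{2} + C \sum_{i=1}^{n} \xi_i
\quad \text{subject to} \quad
  y_i \left( \mathbf{w}^{\top} \phi(\mathbf{x}_i) + b \right) \ge 1 - \xi_i,
  \qquad \xi_i \ge 0 .
```

A small σ makes the kernel respond only to very close samples, while a large C penalizes training errors more heavily. This interaction is why the two parameters are usually searched jointly.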


NeuroCS: A Tool to Predict Cleavage Sites of Neuropeptide Precursors

Protein and Peptide Letters ◽

10.2174/0929866526666191112150636 ◽

2020 ◽

Vol 27 (4) ◽

pp. 337-345 ◽

Cited By ~ 1

Author(s):

Ying Wang ◽

Juanjuan Kang ◽

Ning Li ◽

Yuwei Zhou ◽

Zhongjie Tang ◽

...

Keyword(s):

Support Vector Machine ◽

Characteristic Curve ◽

Training Dataset ◽

Support Vector ◽

Feature Subset ◽

Cleavage Sites ◽

Accurate Identification ◽

Testing Dataset ◽

Optimal Feature Subset ◽

Independent Testing Dataset

Background: Neuropeptides are a class of bioactive peptides produced from neuropeptide precursors through a series of extremely complex processes, mediating neuronal regulation in many aspects. Accurate identification of cleavage sites of neuropeptide precursors is of great significance for the development of neuroscience and brain science. Objective: With the explosive growth of neuropeptide precursor data, bioinformatics methods are needed to predict neuropeptide precursors’ cleavage sites quickly and efficiently. Method: We started by processing the neuropeptide precursor data from SwissProt and NeuroPedia into two sets, a training dataset and a testing dataset. Subsequently, six feature extraction schemes were applied to generate different feature sets, and feature selection methods were then used to find the optimal feature subset of each. Thereafter, the support vector machine was used to build models for the different feature types. Finally, the performance of the models was evaluated with the independent testing dataset. Results: Six models were built with the support vector machine. Among them, the enhanced amino acid composition-based model reached the highest accuracy of 91.60% in 5-fold cross validation. When evaluated with the independent testing dataset, it also performed well, with an accuracy of 90.37% and an area under the Receiver Operating Characteristic curve of up to 0.9576. Conclusion: The performance of the developed model was decent. Moreover, for users’ convenience, an online web server called NeuroCS was built, which is freely available at http://i.uestc.edu.cn/NeuroCS/dist/index.html#/. NeuroCS can be used to predict neuropeptide precursors’ cleavage sites effectively.
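The area under the ROC curve reported above can be computed from classifier scores as the fraction of (positive, negative) pairs that are ranked correctly. The sketch below uses that rank-based definition with invented labels and scores. It is not the NeuroCS code, and the function name is hypothetical.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a positive outscores a negative (ties count 0.5)."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Invented example: 1 marks a cleavage site, 0 a non-cleavage site.
example_auc = roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

In this toy case three of the four positive-negative pairs are ordered correctly, so the AUC is 0.75. The pairwise loop is quadratic in the sample count, which is fine for illustration but slow for large test sets.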


Laboratory Testing Implications of Risk-Stratification and Management of COVID-19 Patients

Frontiers in Medicine ◽

10.3389/fmed.2021.699706 ◽

2021 ◽

Vol 8 ◽

Author(s):

Caidong Liu ◽

Ziyu Wang ◽

Wei Wu ◽

Changgang Xiang ◽

Lingxiang Wu ◽

...

Keyword(s):

Support Vector Machine ◽

High Risk ◽

Risk Stratification ◽

Random Sampling ◽

Early Stage ◽

Viral Pneumonia ◽

Low Risk ◽

Training Dataset ◽

Support Vector ◽

Testing Dataset

Objective: To distinguish COVID-19 patients from non-COVID-19 viral pneumonia patients and to classify COVID-19 patients as low-risk or high-risk at admission using laboratory indicators. Materials and methods: In this retrospective cohort, a total of 3,563 COVID-19 patients and 118 non-COVID-19 pneumonia patients were included. There were two cohorts of COVID-19 patients: 548 patients in the training dataset and 3,015 patients in the testing dataset. Laboratory indicators were measured during hospitalization for all patients. Based on laboratory indicators, we used the support vector machine and joint random sampling to perform risk stratification for COVID-19 patients at admission. Based on laboratory indicators detected within the 1st week after admission, we used logistic regression and joint random sampling to develop the survival model. The laboratory indicators of COVID-19 and non-COVID-19 patients were also compared. Results: We first identified the significant laboratory indicators related to the severity of COVID-19 in the training dataset. Neutrophil percentage, lymphocyte percentage, creatinine, and blood urea nitrogen, each with AUC >0.7, were included in the model. These indicators were further used to build a support vector machine model to classify patients as low-risk or high-risk at admission in the testing dataset. Results showed that this model could stratify the patients in the testing dataset effectively (AUC = 0.89). The model still performed well at different times (mean AUC: 0.71, 0.72, 0.72 for 3, 5, and 7 days after admission, respectively). Moreover, laboratory indicators detected within the 1st week after admission were able to estimate the probability of death (AUC = 0.95). We identified six indicators with permutation p < 0.05: eosinophil percentage (p = 0.007), white blood cell count (p = 0.045), albumin (p = 0.041), aspartate transaminase (p = 0.043), lactate dehydrogenase (p = 0.002), and hemoglobin (p = 0.031). We could diagnose COVID-19 and differentiate it from other kinds of viral pneumonia based on these laboratory indicators. Conclusions: Our risk-stratification model based on laboratory indicators could help to diagnose, monitor, and predict severity at an early stage of COVID-19. In addition, laboratory findings could be used to distinguish COVID-19 from non-COVID-19 viral pneumonia.


Analisis Sentimen Sistem Ganjil Genap di Tol Bekasi Menggunakan Algoritma Support Vector Machine

Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi) ◽

10.29207/resti.v3i2.1050 ◽

2019 ◽

Vol 3 (2) ◽

pp. 243-250

Author(s):

Heru Sukma Utama ◽

Didi Rosiyadi ◽

Bobby Suryo Prakoso ◽

Dedi Ariadarma

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Opinion Mining ◽

Confusion Matrix ◽

Support Vector ◽

Support Vector Machine Algorithm ◽

Toll Road ◽

Svm Algorithm ◽

Svm Model ◽

Textual Data

This sentiment analysis of the odd-even system on the Bekasi toll road using the Support Vector Machine algorithm is a process of automatically understanding, extracting, and processing textual data from social media. The purpose of this study was to determine the accuracy, recall and precision of opinion mining produced with the Support Vector Machine algorithm, in order to provide information on community sentiment on social media towards the effectiveness of the odd-even system on the Bekasi toll road. The research method was text mining of comments on posts about the odd-even system on the Bekasi toll road on Twitter, Instagram, YouTube and Facebook. The steps taken were preprocessing, transformation, data mining and evaluation, followed by information gain feature selection, select by weight, and application of the SVM algorithm model. The confusion matrix results obtained with the SVM model were an accuracy of 78.18%, a precision of 74.03%, and a sensitivity (recall) of 86.82%. The study concludes that the Support Vector Machine algorithm can be used to analyze sentiment about the odd-even system on the Bekasi toll road.
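Accuracy, precision and recall all derive from the four cells of a binary confusion matrix. The following sketch shows the standard definitions. The counts are invented for illustration and are not the study's confusion matrix, which the abstract does not give.

```python
def binary_metrics(tp, fp, fn, tn):
    """Standard metrics from true/false positive and negative counts."""
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total,   # share of all predictions that are correct
        "precision": tp / (tp + fp),     # share of predicted positives that are correct
        "recall": tp / (tp + fn),        # share of actual positives found (sensitivity)
        "specificity": tn / (tn + fp),   # share of actual negatives found
    }


# Invented counts, for illustration only.
metrics = binary_metrics(tp=6, fp=2, fn=3, tn=9)
```

Recall can sit well above precision, as in the reported 86.82% versus 74.03%, when the classifier labels many samples positive: few true positives are missed, at the cost of more false positives.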


Combination of Support Vector Machine and K-Fold cross-validation for prediction of long-term degradation of the compressive strength of marine concrete

International Journal of Computational Physics Series ◽

10.29167/a1i1p120-130 ◽

2018 ◽

Vol 1 (1) ◽

pp. 120-130 ◽

Cited By ~ 1

Author(s):

Chunxiang Qian ◽

Wence Kang ◽

Hao Ling ◽

Hua Dong ◽

Chengyao Liang ◽

...

Keyword(s):

Support Vector Machine ◽

Environmental Factors ◽

Cross Validation ◽

Concrete Strength ◽

Simulation Method ◽

Support Vector ◽

Svm Model ◽

Artificial Neural Network Ann ◽

Influence Degree ◽

Fold Cross Validation

A Support Vector Machine (SVM) model optimized by K-Fold cross-validation was built to predict and evaluate the degradation of concrete strength in a complicated marine environment. Several other models, such as an Artificial Neural Network (ANN) and a Decision Tree (DT), were also built and compared with the SVM to determine which could make the most accurate predictions. Both the material factors and the environmental factors that influence the results were considered. The material factors mainly involved the original concrete strength and the amount of cement replaced by fly ash and slag. The environmental factors consisted of the concentrations of Mg²⁺, SO₄²⁻ and Cl⁻, the temperature, and the exposure time. The prediction results indicated that the optimized SVM model performed better than the other models in predicting the concrete strength. Based on the SVM model, a simulation method of variable limitation was used to determine the sensitivity of the various factors and their degree of influence on the degradation of concrete strength.
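K-Fold cross-validation partitions the samples into K folds and uses each fold once as the held-out set while training on the rest. A minimal index-splitting sketch is given below. The abstract does not describe how the folds were drawn or how they were used for tuning, so the contiguous, unshuffled folds and the function name are assumptions.

```python
def k_fold_indices(n_samples, k):
    """Return k (train_indices, test_indices) pairs covering every sample once."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    indices = list(range(n_samples))
    folds = []
    start = 0
    for size in fold_sizes:
        test = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        folds.append((train, test))
        start += size
    return folds


folds = k_fold_indices(10, 3)
```

For model selection, each candidate parameter setting would be scored by averaging the held-out error over the K folds, and the setting with the best average kept. In practice the sample order is usually shuffled first.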


ABC-Gly: identifying protein lysine glycation sites with artificial bee colony algorithm

Current Proteomics ◽

10.2174/1570164617666191227120136 ◽

2019 ◽

Vol 17 ◽

Author(s):

Yanqiu Yao ◽

Xiaosa Zhao ◽

Qiao Ning ◽

Junping Zhou

Keyword(s):

Support Vector Machine ◽

Amino Acid ◽

Artificial Bee Colony Algorithm ◽

Artificial Bee Colony ◽

Training Dataset ◽

Support Vector ◽

Supplementary File ◽

Feature Subset ◽

Lipid Molecule ◽

Bee Colony

Background: Glycation is a nonenzymatic post-translational modification process in which a sugar molecule attaches to a protein or lipid molecule. It may impair the function and change the characteristics of proteins, which may lead to some metabolic diseases. To understand the underlying molecular mechanisms of glycation, computational prediction methods have been developed because of their convenience and high speed. However, developing a more effective computational tool remains a challenging task in computational biology. Methods: In this study, we present an accurate identification tool named ABC-Gly for predicting lysine glycation sites. First, we used three informative features, namely position-specific amino acid propensity, secondary structure, and the composition of k-spaced amino acid pairs, to encode the peptides. Moreover, to exploit discriminative features sufficiently and thereby improve the prediction and generalization ability of the model, we developed a two-step feature selection that combines the Fisher score with an improved binary artificial bee colony algorithm based on the support vector machine. Finally, based on the optimal feature subset, we constructed the model using the Support Vector Machine on the training dataset. Results: The proposed predictor ABC-Gly achieved a sensitivity of 76.43%, a specificity of 91.10%, a balanced accuracy of 83.76%, an area under the receiver-operating characteristic curve (AUC) of 0.9313, and a Matthew’s Correlation Coefficient (MCC) of 0.6861 in 10-fold cross-validation on the training dataset, and a balanced accuracy of 59.05% on the independent dataset. Compared to the state-of-the-art predictors on the training dataset, the proposed predictor improved the AUC by 0.156 and the MCC by 0.336. Conclusion: The detailed analysis results indicate that our predictor may serve as a powerful complementary tool to other existing methods for predicting protein lysine glycation. The source code and datasets of ABC-Gly are provided in Supplementary File 1.
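Two of the reported metrics have simple closed forms: balanced accuracy is the mean of sensitivity and specificity, and MCC is computed from the four confusion-matrix counts. The sketch below uses the standard definitions. It is not taken from the ABC-Gly source, and the function names are hypothetical.

```python
import math


def balanced_accuracy(sensitivity, specificity):
    """Mean of sensitivity and specificity (same units as the inputs)."""
    return (sensitivity + specificity) / 2.0


def mcc(tp, fp, fn, tn):
    """Matthews correlation coefficient; defined as 0.0 when a margin is empty."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Reported cross-validation sensitivity and specificity, in percent.
cv_balanced_accuracy = balanced_accuracy(76.43, 91.10)
```

The mean of 76.43% and 91.10% is about 83.765%, which agrees with the reported balanced accuracy of 83.76% up to rounding. MCC ranges from -1 (total disagreement) to 1 (perfect prediction).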


Single Channel EEG signal for Automatic Detection of Absence Seizure using Convolutional Neural Network

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666191122114608 ◽

2019 ◽

Vol 13 ◽

Author(s):

Niha Kamal Basha ◽

Aisha Banu Wahab

Keyword(s):

Neural Network ◽

Support Vector Machine ◽

Convolutional Neural Network ◽

Monitoring System ◽

Single Channel ◽

Confusion Matrix ◽

Sudden Change ◽

Automatic Detection ◽

Support Vector ◽

Absence Seizure

Absence seizure is a type of brain disorder in which the subject experiences sudden lapses in attention, reflecting a sudden change in brain stimulation. This type of disorder is most widely found in children (5-18 years). Electroencephalogram (EEG) signals are captured with a long-term monitoring system and analyzed individually. In this paper, a Convolutional Neural Network is proposed to strengthen the monitoring system through automatic detection of absence seizure. After pre-processing, the network extracts single-channel EEG seizure features, such as power, log sum of wavelet transform, cross correlation, and mean phase variance of each frame in a window, and classifies them into the normal or absence seizure class. The training data is collected from normal and absence seizure subjects in the form of electroencephalograms. The objective is to perform automatic detection of absence seizure using a single-channel EEG signal as input. The data is used to train the proposed Convolutional Neural Network to extract and classify absence seizure. The Convolutional Neural Network consists of three layers: 1) a convolutional layer, which extracts the features in the form of a vector; 2) a pooling layer, which reduces the dimensionality of the output from the convolutional layer; and 3) a fully connected layer, in which the soft-max activation function is used to find the probability distribution over the output classes. This paper covers the automatic detection of absence seizure in detail and provides a comparative analysis of classification between the Support Vector Machine and the Convolutional Neural Network. The proposed approach outperforms the Support Vector Machine by 80% in automatic detection of absence seizure, as validated using a confusion matrix.
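The soft-max step in the fully connected layer turns raw class scores into a probability distribution. A minimal, numerically stable sketch follows. The two-class scores are invented, and the paper's network and its weights are not reproduced here.

```python
import math


def softmax(logits):
    """Map raw scores to probabilities that are positive and sum to 1."""
    peak = max(logits)  # subtracting the maximum avoids overflow in exp()
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Invented scores for the classes (normal, absence seizure).
probabilities = softmax([1.2, 3.4])
predicted_class = probabilities.index(max(probabilities))
```

The class with the largest score always receives the largest probability, so the predicted label is the index of the maximum. Here that is index 1, the absence seizure class in this toy ordering.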


Three-Dimensional Site Characterization Model of Bangalore Using Support Vector Machine

ISRN Soil Science ◽

10.5402/2012/346439 ◽

2012 ◽

Vol 2012 ◽

pp. 1-10

Author(s):

Pijush Samui

Keyword(s):

Support Vector Machine ◽

Three Dimensional ◽

Site Characterization ◽

Standard Penetration Test ◽

Support Vector ◽

Characterization Model ◽

Svm Model ◽

Input Variables ◽

Learning Machine ◽

A Site

The main objective of site characterization is the prediction of in situ soil properties at any half-space point at a site based on limited tests. In this study, the Support Vector Machine (SVM) has been used to develop a three-dimensional site characterization model for Bangalore, India, based on a large amount of Standard Penetration Test (SPT) data. SVM is a novel type of learning machine based on statistical learning theory that performs regression by introducing an ε-insensitive loss function. The database consists of 766 boreholes, with more than 2700 field SPT values spread over a 220 sq km area of Bangalore. The model is applied to the corrected SPT values. The three input variables used for the SVM model were the coordinates of each point in Bangalore. The output of the SVM was the SPT data. The results presented in this paper clearly highlight that the SVM is a robust tool for site characterization. A sensitivity analysis of the SVM parameters (including σ and ε) is also presented.
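The ε-insensitive loss mentioned above ignores prediction errors smaller than ε and penalizes larger ones linearly. A one-function sketch of the standard definition is shown below. The numbers are invented and unrelated to the Bangalore SPT data.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon):
    """Zero inside the epsilon tube around the target, linear outside it."""
    return max(0.0, abs(y_true - y_pred) - epsilon)


# Invented values: one prediction inside the tube, one outside.
inside_tube = epsilon_insensitive_loss(10.0, 10.25, epsilon=0.5)
outside_tube = epsilon_insensitive_loss(10.0, 12.0, epsilon=0.5)
```

A larger ε widens the tube, so more training points contribute no loss and the fitted function typically depends on fewer support vectors. This is one reason a sensitivity analysis over ε is informative.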
