Global Nonlinear Kernel Prediction for Large Data Set With a Particle Swarm-Optimized Interval Support Vector Regression

In this study, we proposed a general pruning procedure to reduce the dimension of a large database so that the properties of the extracted subset can be well defined. Since learning functions have been widely applied, we take this group of functions as an example to demonstrate the proposed procedure. Based on the concept of Support Vector Machine (SVM), three major stages of preliminary pruning, fitting function, and refining are proposed to discover a subset that possess the characteristics of some learning function from the given large data set. Three models were used to illustrate and evaluate the proposed pruning procedure and the results have shown to be promising in application.

Download Full-text

In SilicoLogPPrediction for a Large Data Set with Support Vector Machines, Radial Basis Neural Networks and Multiple Linear Regression

Chemical Biology & Drug Design ◽

10.1111/j.1747-0285.2009.00840.x ◽

2009 ◽

Vol 74 (2) ◽

pp. 142-147 ◽

Cited By ~ 16

Author(s):

Hai-Feng Chen

Keyword(s):

Neural Networks ◽

Support Vector Machines ◽

Linear Regression ◽

Multiple Linear Regression ◽

Large Data ◽

Support Vector ◽

Data Set ◽

Large Data Set ◽

Radial Basis Neural Networks ◽

Vector Machines

Download Full-text

Particle Swarm Optimization-Based Support Vector Regression for Tourist Arrivals Forecasting

Computational Intelligence and Neuroscience ◽

10.1155/2018/6076475 ◽

2018 ◽

Vol 2018 ◽

pp. 1-13 ◽

Cited By ~ 8

Author(s):

Hsiou-Hsiang Liu ◽

Lung-Cheng Chang ◽

Chien-Wei Li ◽

Cheng-Hong Yang

Keyword(s):

Particle Swarm Optimization ◽

Support Vector Regression ◽

Particle Swarm ◽

Forecast Accuracy ◽

Tourism Industry ◽

Support Vector ◽

Tourism Demand ◽

Economic Sectors ◽

Data Set ◽

Swarm Optimization

The tourism industry has become one of the most important economic sectors for governments worldwide. Accurately forecasting tourism demand is crucial because it provides useful information to related industries and governments, enabling stakeholders to adjust plans and policies. To develop a forecasting tool for the tourism industry, this study proposes a method that combines feature selection (FS) and support vector regression (SVR) with particle swarm optimization (PSO), named FS–PSOSVR. To ensure high forecast accuracy, FS and a PSO algorithm are employed to, respectively, select reliable input variables and to identify the optimal initial parameters of SVR. The proposed method was tested using a data set of monthly tourist arrivals to Taiwan from January 2006 to December 2016. The results reveal that the errors obtained using FS–PSOSVR are comparatively smaller than those obtained using other methods, indicating that FS–PSOSVR is an effective method for forecasting tourism demand.

Download Full-text

Active Learning Based Support Vector Data Description for Large Data Set Novelty Detection

Lecture Notes in Electrical Engineering - Proceedings of 2017 Chinese Intelligent Automation Conference ◽

10.1007/978-981-10-6445-6_32 ◽

2017 ◽

pp. 283-293 ◽

Cited By ~ 1

Author(s):

Lili Yin ◽

Huangang Wang ◽

Wenhui Fan ◽

Qingkai Wang

Keyword(s):

Active Learning ◽

Novelty Detection ◽

Large Data ◽

Support Vector ◽

Support Vector Data Description ◽

Vector Data ◽

Data Set ◽

Data Description ◽

Large Data Set

Download Full-text

RISK ASSESSMENT AND CLASSIFICATION FOR DETENTION BASINS BASED ON PARTICLE SWARM OPTIMIZATION - SUPPORT VECTOR REGRESSION (PSO-SVR) IN HUAIHE RIVER BASIN, CHINA

Environmental Engineering and Management Journal ◽

10.30638/eemj.2013.226 ◽

2013 ◽

Vol 12 (9) ◽

pp. 1843-1848 ◽

Cited By ~ 1

Author(s):

Junfei Chen ◽

Shihao Zhao ◽

Huimin Wang ◽

Shufang Zhao

Keyword(s):

Risk Assessment ◽

Particle Swarm Optimization ◽

Support Vector Regression ◽

River Basin ◽

Particle Swarm ◽

Support Vector ◽

Huaihe River Basin ◽

Huaihe River ◽

Swarm Optimization ◽

Detention Basins

Download Full-text

In silico Prediction of Inhibitory Constant of Thrombin Inhibitors Using Machine Learning

Combinatorial Chemistry & High Throughput Screening ◽

10.2174/1386207322666181220130232 ◽

2019 ◽

Vol 21 (9) ◽

pp. 662-669 ◽

Cited By ~ 1

Author(s):

Junnan Zhao ◽

Lu Zhu ◽

Weineng Zhou ◽

Lingfeng Yin ◽

Yuchen Wang ◽

...

Keyword(s):

Machine Learning ◽

Prediction Models ◽

Regression Tree ◽

Large Data ◽

Thrombin Inhibitors ◽

Coagulation Cascade ◽

Gradient Boosting ◽

Support Vector ◽

Data Set ◽

Descriptor Selection

Background: Thrombin is the central protease of the vertebrate blood coagulation cascade, which is closely related to cardiovascular diseases. The inhibitory constant Ki is the most significant property of thrombin inhibitors. Method: This study was carried out to predict Ki values of thrombin inhibitors based on a large data set by using machine learning methods. Taking advantage of finding non-intuitive regularities on high-dimensional datasets, machine learning can be used to build effective predictive models. A total of 6554 descriptors for each compound were collected and an efficient descriptor selection method was chosen to find the appropriate descriptors. Four different methods including multiple linear regression (MLR), K Nearest Neighbors (KNN), Gradient Boosting Regression Tree (GBRT) and Support Vector Machine (SVM) were implemented to build prediction models with these selected descriptors. Results: The SVM model was the best one among these methods with R2=0.84, MSE=0.55 for the training set and R2=0.83, MSE=0.56 for the test set. Several validation methods such as yrandomization test and applicability domain evaluation, were adopted to assess the robustness and generalization ability of the model. The final model shows excellent stability and predictive ability and can be employed for rapid estimation of the inhibitory constant, which is full of help for designing novel thrombin inhibitors.

Download Full-text

An enhanced extrapolation method based on particle swarm optimization-support vector regression to determine the friction coefficient between aircraft tire and runway surface

Review of Scientific Instruments ◽

10.1063/1.5090915 ◽

2019 ◽

Vol 90 (9) ◽

pp. 095110

Author(s):

Liwei Zhan ◽

Fang Ma ◽

Xiaojun Song ◽

Hongwei Sun ◽

Chengwei Li

Keyword(s):

Particle Swarm Optimization ◽

Friction Coefficient ◽

Support Vector Regression ◽

Particle Swarm ◽

Extrapolation Method ◽

Support Vector ◽

Swarm Optimization

Download Full-text

Correlation between the structure and skin permeability of compounds

Scientific Reports ◽

10.1038/s41598-021-89587-5 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Ruolan Zeng ◽

Jiyong Deng ◽

Limin Dang ◽

Xinliang Yu

Keyword(s):

Large Data ◽

Qsar Model ◽

Coefficient Of Determination ◽

Support Vector ◽

Skin Permeability ◽

Data Set ◽

Test Set ◽

Svm Algorithm ◽

Svm Model ◽

Toxicity Relationship

AbstractA three-descriptor quantitative structure–activity/toxicity relationship (QSAR/QSTR) model was developed for the skin permeability of a sufficiently large data set consisting of 274 compounds, by applying support vector machine (SVM) together with genetic algorithm. The optimal SVM model possesses the coefficient of determination R2 of 0.946 and root mean square (rms) error of 0.253 for the training set of 139 compounds; and a R2 of 0.872 and rms of 0.302 for the test set of 135 compounds. Compared with other models reported in the literature, our SVM model shows better statistical performance in a model that deals with more samples in the test set. Therefore, applying a SVM algorithm to develop a nonlinear QSAR model for skin permeability was achieved.

Download Full-text