Robust weighted linear loss twin multi-class support vector regression for large-scale classification

Forecasting the development of large-scale landslides is a contentious and complicated issue. In this study, we put forward the use of multi-factor support vector regression machines (SVRMs) for predicting the displacement rate of a large-scale landslide. The relative relationships between the main monitoring factors were analyzed based on the long-term monitoring data of the landslide and the grey correlation analysis theory. We found that the average correlation between landslide displacement and rainfall is 0.894, and the correlation between landslide displacement and reservoir water level is 0.338. Finally, based on an in-depth analysis of the basic characteristics, influencing factors, and development of landslides, three main factors (i.e., the displacement rate, reservoir water level, and rainfall) were selected to build single-factor, two-factor, and three-factor SVRM models. The key parameters of the models were determined using a grid-search method, and the models showed high accuracies. Moreover, the accuracy of the two-factor SVRM model (displacement rate and rainfall) is the highest with the smallest standard error (RMSE) of 0.00614; it is followed by the three-factor and single-factor SVRM models, the latter of which has the lowest prediction accuracy, with the largest RMSE of 0.01644.

Download Full-text

Development of an SVR Model for the Fault Diagnosis of Large-Scale Doubly-Fed Wind Turbines Using SCADA Data

Energies ◽

10.3390/en12173396 ◽

2019 ◽

Vol 12 (17) ◽

pp. 3396 ◽

Cited By ~ 2

Author(s):

Mingzhu Tang ◽

Wei Chen ◽

Qi Zhao ◽

Huawei Wu ◽

Wen Long ◽

...

Keyword(s):

Fault Diagnosis ◽

Wind Turbine ◽

Support Vector Regression ◽

Confidence Intervals ◽

Wind Turbines ◽

Large Scale ◽

Support Vector ◽

Effective Response ◽

Support Vector Regression Model ◽

Doubly Fed

Fault diagnosis and forecasting contribute significantly to the reduction of operating and maintenance associated costs, as well as to improve the resilience of wind turbine systems. Different from the existing fault diagnosis approaches using monitored vibration and acoustic data from the auxiliary equipment, this research presents a novel fault diagnosis and forecasting approach underpinned by a support vector regression model using data obtained by the supervisory control and data acquisition system (SCADA) of wind turbines (WT). To operate, the extraction of fault diagnosis features is conducted by measuring SCADA parameters. After that, confidence intervals are set up to guide the fault diagnosis implemented by the support vector regression (SVR) model. With the employment of confidence intervals as the performance indicators, an SVR-based fault detecting approach is then developed. Based on the WT SCADA data and the SVR model, a fault diagnosis strategy for large-scale doubly-fed wind turbine systems is investigated. A case study including a one-year monitoring SCADA data collected from a wind farm in Southern China is employed to validate the proposed methodology and demonstrate how it works. Results indicate that the proposed strategy can support the troubleshooting of wind turbine systems with high precision and effective response.

Download Full-text

Weighted Linear Loss Support Vector Machine for Large Scale Problems

Procedia Computer Science ◽

10.1016/j.procs.2014.05.311 ◽

2014 ◽

Vol 31 ◽

pp. 639-647 ◽

Cited By ~ 4

Author(s):

Yuan-Hai Shao ◽

Zhen Wang ◽

Zhi-Min Yang ◽

Nai-Yang Deng

Keyword(s):

Support Vector Machine ◽

Large Scale ◽

Support Vector ◽

Large Scale Problems ◽

Linear Loss

Download Full-text

Initial Optimal Parameters of Artificial Neural Network and Support Vector Regression

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v8i5.pp3341-3348 ◽

2018 ◽

Vol 8 (5) ◽

pp. 3341 ◽

Cited By ~ 1

Author(s):

Edy Fradinata ◽

Sakesun Suthummanon ◽

Wannarat Suntiamorntut

Keyword(s):

Neural Network ◽

Artificial Neural Network ◽

Support Vector Regression ◽

Kernel Function ◽

Activation Function ◽

Support Vector ◽

Optimal Parameters ◽

Artificial Neural ◽

Hidden Layer ◽

Linear Loss

This paper presents architecture of backpropagation Artificial Neural Network (ANN) and Support Vector Regression (SVR) models in supervised learning process for cement demand dataset. This study aims to identify the effectiveness of each parameter of mean square error (MSE) indicators for time series dataset. The study varies different random sample in each demand parameter in the network of ANN and support vector function as well. The variations of percent datasets from activation function, learning rate of sigmoid and purelin, hidden layer, neurons, and training function should be applied for ANN. Furthermore, SVR is varied in kernel function, lost function and insensitivity to obtain the best result from its simulation. The best results of this study for ANN activation function is Sigmoid. The amount of data input is 100% or 96 of data, 150 learning rates, one hidden layer, trinlm training function, 15 neurons and 3 total layers. The best results for SVR are six variables that run in optimal condition, kernel function is linear, loss function is ౬-insensitive, and insensitivity was 1. The better results for both methods are six variables. The contribution of this study is to obtain the optimal parameters for specific variables of ANN and SVR.

Download Full-text

Improvement of transportation cost estimation for prefabricated construction using geo-fence-based large-scale GPS data feature extraction and support vector regression

Advanced Engineering Informatics ◽

10.1016/j.aei.2019.101012 ◽

2020 ◽

Vol 43 ◽

pp. 101012 ◽

Cited By ~ 2

Author(s):

SangJun Ahn ◽

SangUk Han ◽

Mohamed Al-Hussein

Keyword(s):

Feature Extraction ◽

Support Vector Regression ◽

Cost Estimation ◽

Large Scale ◽

Transportation Cost ◽

Support Vector ◽

Gps Data

Download Full-text

Nonlinear Proximal Support Vector Machine Classifiers Aiming At Large Scale Classification Problems

Proceedings of the 11th Joint Conference on Information Sciences (JCIS) ◽

10.2991/jcis.2008.106 ◽

2008 ◽

Cited By ~ 1

Author(s):

Xiaoming Xu ◽

Ning Ye ◽

Qiaolin Ye

Keyword(s):

Support Vector Machine ◽

Large Scale ◽

Support Vector ◽

Classification Problems ◽

Proximal Support Vector Machine ◽

Scale Classification

Download Full-text

The Mechanism Analysis of the Accelerator for Support Vector Regression Based on Data Partition

Fuzzy Systems and Data Mining VI - Frontiers in Artificial Intelligence and Applications ◽

10.3233/faia200730 ◽

2020 ◽

Author(s):

Yunsheng Song ◽

Fangyi Li ◽

Jianyu Liu ◽

Juao Zhang

Keyword(s):

Support Vector Regression ◽

Large Scale ◽

Real Life ◽

Support Vector ◽

Mechanism Analysis ◽

Data Partition ◽

Price Forecast ◽

Before And After ◽

Important Solution ◽

The Difference

Support vector regression is an important algorithm in machine learning, and it is widely used in real life for its good performance, such as house price forecast, disease prediction, weather forecast, and so on. However, it cannot efficiently process large-scale data, because it has a high time complexity in the training process. Data partition as an important solution to solve the large-scale learning problem mainly focuses on the classification task, it trains the classifiers over the divided subsets produced by data partition and obtain the final classifier by combining those classifiers. Meanwhile, the most existing method rarely study the influence of data partition on the regressor performance, so that it is difficult to keep its generation ability. To solve this problem, we obtain the estimation of the difference in objective function before and after the data partition. Mini-Batch K-Means clustering is adopted to largely reduce this difference, and an improved algorithm is proposed. This proposed algorithm includes training stage and prediction stage. In training stag, it uses Mini-Batch K-Means clustering to divide the input space into some disjoint sub-regions of equal sample size, then it trains the regressor on each divided sub-region using support vector regression algorithm. In the prediction stage, the regressor merely offers the predicted label for the unlabeled instances that are in the same sub-region. Experiment results on real datasets illustrate that the proposed algorithm obtains the similar generation ability as the original algorithm, but it has less execution time than other acceleration algorithms.

Download Full-text