Support vector machine based on low-rank tensor train decomposition for big data applications

Big data is a new trend at present, forcing the significant impacts on information technologies. In big data applications, one of the most concerned issues is dealing with large-scale data sets that often require computation resources provided by public cloud services. How to analyze big data efficiently becomes a big challenge. In this paper, we collaborate interval regression with the smooth support vector machine (SSVM) to analyze big data. Recently, the smooth support vector machine (SSVM) was proposed as an alternative of the standard SVM that has been proved more efficient than the traditional SVM in processing large-scale data. In addition the soft margin method is proposed to modify the excursion of separation margin and to be effective in the gray zone that the distribution of data becomes hard to be described and the separation margin between classes.

Download Full-text

Distributed Nonlinear Semiparametric Support Vector Machine for Big Data Applications on Spark Frameworks

IEEE Transactions on Systems Man and Cybernetics Systems ◽

10.1109/tsmc.2018.2858778 ◽

2020 ◽

Vol 50 (11) ◽

pp. 4664-4675 ◽

Cited By ~ 1

Author(s):

Roberto Diaz-Morales ◽

Angel Navia-Vazquez

Keyword(s):

Support Vector Machine ◽

Big Data ◽

Support Vector ◽

Big Data Applications

Download Full-text

Throat polyp detection based on compressed big data of voice with support vector machine algorithm

EURASIP Journal on Advances in Signal Processing ◽

10.1186/1687-6180-2014-1 ◽

2014 ◽

Vol 2014 (1) ◽

Cited By ~ 64

Author(s):

Wei Wang ◽

Zhangliang Chen ◽

Jiasong Mu ◽

Tingting Han

Keyword(s):

Support Vector Machine ◽

Big Data ◽

Support Vector ◽

Support Vector Machine Algorithm ◽

Polyp Detection ◽

Throat Polyp Detection

Download Full-text

KLASIFIKASI TEKS SOSIAL MEDIA TWITTER MENGGUNAKAN SUPPORT VECTOR MACHINE (Studi Kasus Penusukan Wiranto)

Jurnal Informatika dan Rekayasa Elektronik ◽

10.36595/jire.v2i2.117 ◽

2019 ◽

Vol 2 (2) ◽

pp. 43

Author(s):

Lalu Mutawalli ◽

Mohammad Taufan Asri Zaen ◽

Wire Bagye

Keyword(s):

Social Media ◽

Support Vector Machine ◽

Big Data ◽

Mass Communication ◽

Confusion Matrix ◽

Classification Model ◽

Support Vector ◽

Large Set ◽

Appearance Time ◽

Data Production

In the era of technological disruption of mass communication, social media became a reference in absorbing public opinion. The digitalization of data is very rapidly produced by social media users because it is an attempt to represent the feelings of the audience. Data production in question is the user posts the status and comments on social media. Data production by the public in social media raises a very large set of data or can be referred to as big data. Big data is a collection of data sets in very large numbers, complex, has a relatively fast appearance time, so that makes it difficult to handle. Analysis of big data with data mining methods to get knowledge patterns in it. This study analyzes the sentiments of netizens on Twitter social media on Mr. Wiranto stabbing case. The results of the sentiment analysis showed 41% gave positive comments, 29% commented neutrally, and 29% commented negatively on events. Besides, modeling of the data is carried out using a support vector machine algorithm to create a system capable of classifying positive, neutral, and negative connotations. The classification model that has been made is then tested using the confusion matrix technique with each result is a precision value of 83%, a recall value of 80%, and finally, as much as 80% obtained in testing the accuracy.

Download Full-text

Research on Parallel Support Vector Machine Based on Spark Big Data Platform

Scientific Programming ◽

10.1155/2021/7998417 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Yao Huimin

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Big Data ◽

Support Vector Machines ◽

Cross Validation ◽

Machine Learning Algorithms ◽

Support Vector ◽

Lambda Architecture ◽

Vector Machines ◽

Data Platform

With the development of cloud computing and distributed cluster technology, the concept of big data has been expanded and extended in terms of capacity and value, and machine learning technology has also received unprecedented attention in recent years. Traditional machine learning algorithms cannot solve the problem of effective parallelization, so a parallelization support vector machine based on Spark big data platform is proposed. Firstly, the big data platform is designed with Lambda architecture, which is divided into three layers: Batch Layer, Serving Layer, and Speed Layer. Secondly, in order to improve the training efficiency of support vector machines on large-scale data, when merging two support vector machines, the “special points” other than support vectors are considered, that is, the points where the nonsupport vectors in one subset violate the training results of the other subset, and a cross-validation merging algorithm is proposed. Then, a parallelized support vector machine based on cross-validation is proposed, and the parallelization process of the support vector machine is realized on the Spark platform. Finally, experiments on different datasets verify the effectiveness and stability of the proposed method. Experimental results show that the proposed parallelized support vector machine has outstanding performance in speed-up ratio, training time, and prediction accuracy.

Download Full-text

Predictive big data analytic on demonetization data using support vector machine

Cluster Computing ◽

10.1007/s10586-018-2384-8 ◽

2018 ◽

Vol 22 (S6) ◽

pp. 14709-14720 ◽

Cited By ~ 11

Author(s):

Nattar Kannan ◽

S. Sivasubramanian ◽

M. Kaliappan ◽

S. Vimal ◽

A. Suresh

Keyword(s):

Support Vector Machine ◽

Big Data ◽

Support Vector ◽

Data Analytic

Download Full-text

CPI Big Data Prediction Based on Wavelet Twin Support Vector Machine

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001421590138 ◽

2020 ◽

pp. 2159013

Author(s):

Yiqing Fan ◽

Zhihui Sun

Keyword(s):

Support Vector Machine ◽

Big Data ◽

Consumer Price Index ◽

Low Frequency ◽

Prediction Method ◽

Twin Support Vector Machine ◽

Support Vector ◽

Reconstruction Method ◽

Data Prediction ◽

Svm Model

In order to effectively improve the accuracy of Consumer Price Index (CPI) prediction so as to more truly reflect the overall level of the country’s macroeconomic situation, a CPI big data prediction method based on wavelet twin support vector machine (SVM) is proposed. First, the historical CPI data are decomposed into high-frequency part and low-frequency part by wavelet transform. Then a more advanced twin SVM is used to build a prediction model to obtain two kinds of prediction results. Finally, the wavelet reconstruction method is used to fuse the two kinds of prediction results to obtain the final CPI prediction results. The wavelet twin SVM model is used to fit and predict CPI index. Experimental results show that compared with the similar prediction methods, the proposed prediction method has higher fitting accuracy and smaller root mean square error.

Download Full-text

Big data approach to batch process monitoring: Simultaneous fault detection and diagnosis using nonlinear support vector machine-based feature selection

Computers & Chemical Engineering ◽

10.1016/j.compchemeng.2018.03.025 ◽

2018 ◽

Vol 115 ◽

pp. 46-63 ◽

Cited By ~ 36

Author(s):

Melis Onel ◽

Chris A. Kieslich ◽

Yannis A. Guzman ◽

Christodoulos A. Floudas ◽

Efstratios N. Pistikopoulos

Keyword(s):

Support Vector Machine ◽

Feature Selection ◽

Big Data ◽

Process Monitoring ◽

Batch Process ◽

Fault Detection And Diagnosis ◽

Support Vector ◽

Batch Process Monitoring ◽

Detection And Diagnosis ◽

Nonlinear Support

Download Full-text

Quantum Support Vector Machine for Big Data Classification

Physical Review Letters ◽

10.1103/physrevlett.113.130503 ◽

2014 ◽

Vol 113 (13) ◽

Cited By ~ 323

Author(s):

Patrick Rebentrost ◽

Masoud Mohseni ◽

Seth Lloyd

Keyword(s):

Support Vector Machine ◽

Big Data ◽

Data Classification ◽

Support Vector ◽

Big Data Classification

Download Full-text

Reflections on the Innovation of University Scientific Research Management in the Era of Big Data

Scientific Programming ◽

10.1155/2022/7674486 ◽

2022 ◽

Vol 2022 ◽

pp. 1-8

Author(s):

Yiming Li

Keyword(s):

Big Data ◽

Scientific Research ◽

Research Management ◽

Support Vector ◽

Optimization Strategy ◽

Web Database ◽

Computing Platform ◽

Research And Innovation ◽

Software Configuration ◽

Big Data Applications

In China, universities are important centers for SR (scientific research) and innovation, and the quality of SR management has a significant impact on university innovation. The informatization of SR management is a critical component of university development in the big data environment. As a result, it is crucial to figure out how to improve SR management. As a result, this paper builds a four-tier B/W/D/C (Browser/Web/Database/Client) university SR management innovation information system based on big data technology and thoroughly examines the system’s hardware and software configuration. The SVM-WNB (Support Vector Machine-Weighted NB) classification algorithm is proposed, and the improved algorithm runs in parallel on the Hadoop cloud computing platform, allowing the algorithm to process large amounts of data efficiently. The optimization strategy proposed in this paper can effectively optimize the execution of scientific big data applications according to a large number of simulation experiments and real-world multidata center environment experiments.

Download Full-text