Analyzing Big Data with the Hybrid Interval Regression Methods

Support Vector Machines in Big Data Classification: A Systematic Literature Review

10.21203/rs.3.rs-663359/v1; 2021

Author(s): Mohammad Hassan Almaspoor, Ali Safaei, Afshin Salajegheh, Behrouz Minaei-Bidgoli

Keyword(s): Machine Learning, Big Data, Large Scale, Support Vector, Research Areas, Large Scale Data, Training Samples, Big Data Classification, Scale Data

Classification is one of the most important and widely used tasks in machine learning; its purpose is to create a rule, based on a set of training samples, for assigning data to pre-existing categories. Employed successfully in many scientific and engineering areas, the Support Vector Machine (SVM) is among the most promising classification methods in machine learning. With the advent of big data, many machine learning methods have been challenged by big data characteristics. The standard SVM was proposed for batch learning, in which all data are available at the same time. The SVM has a high time complexity, i.e., increasing the number of training samples intensifies the need for computational resources and memory. Hence, many attempts have been made to adapt the SVM to online learning conditions and large-scale data. This paper focuses on the analysis, identification, and classification of existing methods for adapting the SVM to online conditions and large-scale data. These methods might be employed to classify big data, and the paper proposes research areas for future studies. Considering its advantages, the SVM can be among the first options for big data classification. For this purpose, appropriate data preprocessing techniques should be developed in order to convert data into a form suitable for learning. Existing frameworks for parallel and distributed processing should also be employed so that SVMs can be made scalable and properly online, and thus able to handle big data.
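To make the contrast between batch and online SVM training concrete, the following is a minimal sketch of one well-known online approach, a Pegasos-style stochastic subgradient update for a linear SVM. It is not taken from the reviewed paper; the function names `train_online_svm` and `predict`, the regularization value `lam`, and the toy samples are illustrative assumptions. The learner sees one sample at a time and stores only the weight vector, so memory use does not grow with the number of training samples.

```python
def train_online_svm(stream, dim, lam=0.01):
    """Pegasos-style online linear SVM (hinge loss, L2 penalty, no bias term).

    `stream` yields (x, y) pairs with x a list of floats and y in {-1, +1}.
    `lam` is a hypothetical regularization strength chosen for illustration.
    """
    w = [0.0] * dim
    for t, (x, y) in enumerate(stream, start=1):
        eta = 1.0 / (lam * t)  # step size decays as more samples arrive
        margin = y * sum(wi * xi for wi, xi in zip(w, x))
        # Shrink the weights: gradient step on the L2 penalty.
        w = [(1.0 - eta * lam) * wi for wi in w]
        # Hinge loss is active only when the margin is violated.
        if margin < 1.0:
            w = [wi + eta * y * xi for wi, xi in zip(w, x)]
    return w


def predict(w, x):
    """Return +1 or -1 depending on the side of the hyperplane."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0.0 else -1


# Toy stream (assumed data): two well-separated groups of points.
toy_stream = [([2.0, 1.5], 1), ([-1.8, -2.2], -1),
              ([1.2, 2.5], 1), ([-2.4, -1.1], -1)]
w = train_online_svm(iter(toy_stream), dim=2)
```

Because each sample is consumed once and then discarded, the same loop could in principle be fed from a file or a network stream; a kernelized or bias-aware variant would need additional bookkeeping that this sketch omits.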

Affordances of Data Science in Agriculture, Manufacturing, and Education

Web Services; 10.4018/978-1-5225-7501-6.ch052; 2019; pp. 953-978

Author(s): Krishnan Umachandran, Debra Sharon Ferdinand-James

Keyword(s): Big Data, Large Scale, Data Science, Data Generation, Large Scale Data, Big Data Applications, Effective Decision, Effective Decision Making, Text Images, Scale Data

Continued technological advancements of the 21st century afford massive data generation in sectors of our economy, including the domains of agriculture, manufacturing, and education. However, harnessing such large-scale data with modern technologies for effective decision-making appears to be an evolving science that requires knowledge of Big Data management and analytics. Big data in agriculture, manufacturing, and education are varied, including voluminous text, images, and graphs. Applying Big Data science techniques (e.g., functional algorithms) to extract intelligence data affords decision makers a quick response to productivity, market resilience, and student enrollment challenges in today's unpredictable markets. This chapter employs data science to seek potential solutions for Big Data applications in the sectors of agriculture, manufacturing, and, to a lesser extent, education, using modern technological tools such as Hadoop, Hive, Sqoop, and MongoDB.

Affordances of Data Science in Agriculture, Manufacturing, and Education

Privacy and Security Policies in Big Data - Advances in Information Security, Privacy, and Ethics; 10.4018/978-1-5225-2486-1.ch002; 2017; pp. 14-40; Cited By ~ 2

Author(s): Krishnan Umachandran, Debra Sharon Ferdinand-James

Keyword(s): Big Data, Large Scale, Data Science, Data Generation, Large Scale Data, Big Data Applications, Effective Decision, Effective Decision Making, Text Images, Scale Data

Continued technological advancements of the 21st century afford massive data generation in sectors of our economy, including the domains of agriculture, manufacturing, and education. However, harnessing such large-scale data with modern technologies for effective decision-making appears to be an evolving science that requires knowledge of Big Data management and analytics. Big data in agriculture, manufacturing, and education are varied, including voluminous text, images, and graphs. Applying Big Data science techniques (e.g., functional algorithms) to extract intelligence data affords decision makers a quick response to productivity, market resilience, and student enrollment challenges in today's unpredictable markets. This chapter employs data science to seek potential solutions for Big Data applications in the sectors of agriculture, manufacturing, and, to a lesser extent, education, using modern technological tools such as Hadoop, Hive, Sqoop, and MongoDB.

Influencing Factors of e-Commerce Enterprise Development Based on Mobile Computing Big Data Analysis

Wireless Communications and Mobile Computing; 10.1155/2021/8750111; 2021; Vol 2021; pp. 1-12

Author(s): Yixue Zhu, Boyue Chai

Keyword(s): Big Data, Data Analysis, Large Scale, Big Data Analysis, Support Vector, Data Sets, Large Scale Data, Vector Machines, Physical Information, Scale Data

With the development of increasingly advanced information and electronic technology, especially physical information systems, cloud computing systems, and social services, big data will be widely visible, creating benefits for people while also posing huge challenges. In addition, with the advent of the era of big data, the scale of data sets is getting larger and larger. Traditional data analysis methods can no longer handle large-scale data sets, and the hidden information behind big data is being dug out, especially in the field of e-commerce, where it has become a key factor in competition among enterprises. We use a support vector machine method based on parallel computing to analyze the data. First, the training samples are divided into several working subsets through the SOM self-organizing neural network classification method. The method finally merges the training results of each working set, so as to quickly deal with the problem of massive data prediction and analysis. This paper proposes that big data offers flexibility of expansion and a quality assessment system, so it is meaningful to replace the double-sidedness of quality assessment with big data. Finally, considering the excellent performance of parallel support vector machines in data mining and analysis, we apply this method to the big data analysis of e-commerce. The research results show that parallel support vector machines can solve the problem of processing large-scale data sets. The emergence of dirty data problems has increased the effective rate by at least 70%.
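The partition, train, and merge pattern described in this abstract can be sketched as follows. This is a hedged illustration, not the authors' implementation: the abstract partitions samples with a SOM network, whereas this sketch substitutes a simple round-robin split; the per-subset trainer is a small linear SVM fitted by subgradient descent; and merging by averaging the weight vectors is an assumption, since the abstract does not say how results are merged. All function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def fit_linear_svm(samples, dim, lam=0.01, epochs=5):
    """Fit a linear SVM (hinge loss, L2 penalty) on one working subset."""
    w = [0.0] * dim
    t = 0
    for _ in range(epochs):
        for x, y in samples:
            t += 1
            eta = 1.0 / (lam * t)
            margin = y * sum(wi * xi for wi, xi in zip(w, x))
            w = [(1.0 - eta * lam) * wi for wi in w]
            if margin < 1.0:
                w = [wi + eta * y * xi for wi, xi in zip(w, x)]
    return w


def partition(samples, k):
    """Round-robin split into k working subsets (stand-in for SOM clustering)."""
    return [samples[i::k] for i in range(k)]


def train_partitioned(samples, dim, k=4):
    """Train one model per non-empty subset concurrently, then average them.

    Assumes `samples` is a non-empty list of (x, y) pairs with y in {-1, +1}.
    """
    subsets = [s for s in partition(samples, k) if s]
    with ThreadPoolExecutor(max_workers=len(subsets)) as pool:
        models = list(pool.map(lambda s: fit_linear_svm(s, dim), subsets))
    # Merge step (assumed): element-wise mean of the subset weight vectors.
    return [sum(ws) / len(models) for ws in zip(*models)]


def classify(w, x):
    """Return +1 or -1 depending on the side of the merged hyperplane."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) >= 0.0 else -1
```

Threads are used here only to keep the sketch self-contained; in CPython they will not speed up this CPU-bound loop, and a real deployment would more likely use separate processes or a distributed framework.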

Cluster Reduction Support Vector Machine for Large-Scale Data Set Classification

2008 IEEE Pacific-Asia Workshop on Computational Intelligence and Industrial Application; 10.1109/paciia.2008.43; 2008

Author(s): Guangxi Chen, Yan Cheng, Jian Xu

Keyword(s): Support Vector Machine, Large Scale, Support Vector, Data Set, Large Scale Data, Scale Data, Cluster Reduction

Survey of Large-Scale Data Management Systems for Big Data Applications

Journal of Computer Science and Technology; 10.1007/s11390-015-1511-8; 2015; Vol 30 (1); pp. 163-183; Cited By ~ 26

Author(s): Lengdong Wu, Liyan Yuan, Jiahuai You

Keyword(s): Big Data, Data Management, Large Scale, Management Systems, Data Management Systems, Large Scale Data, Big Data Applications, Scale Data

A Survey of Cloud-Based Services Leveraged by Big Data Applications

Web Services; 10.4018/978-1-5225-7501-6.ch088; 2019; pp. 1706-1716

Author(s): S. ZerAfshan Goher, Barkha Javed, Peter Bloodsworth

Keyword(s): Big Data, Data Storage, Data Analytics, Large Scale, Future Trends, Advantages And Disadvantages, Large Scale Data, Big Data Applications, Big Data Storage, Scale Data

Due to the growing interest in harnessing the hidden significance of data, more and more enterprises are moving to data analytics. Data analytics requires the analysis and management of large-scale data to find hidden patterns among various data components and gain useful insight. The derived information is then used to predict future trends that can help a business flourish, such as customers' likes and dislikes, the reasons behind customer churn, and more. In this paper, several techniques for big data analysis are investigated along with their advantages and disadvantages. The significance of cloud computing for big data storage is also discussed. Finally, techniques for the robust and efficient use of big data are discussed.

An online incremental learning support vector machine for large-scale data

Neural Computing and Applications; 10.1007/s00521-011-0793-1; 2012; Vol 22 (5); pp. 1023-1035; Cited By ~ 39

Author(s): Jun Zheng, Furao Shen, Hongjun Fan, Jinxi Zhao

Keyword(s): Support Vector Machine, Incremental Learning, Large Scale, Support Vector, Learning Support, Large Scale Data, Online Incremental Learning, Scale Data

An Online Incremental Learning Support Vector Machine for Large-scale Data

Artificial Neural Networks – ICANN 2010 - Lecture Notes in Computer Science; 10.1007/978-3-642-15822-3_9; 2010; pp. 76-81; Cited By ~ 4

Author(s): Jun Zheng, Hui Yu, Furao Shen, Jinxi Zhao

Keyword(s): Support Vector Machine, Incremental Learning, Large Scale, Support Vector, Learning Support, Large Scale Data, Online Incremental Learning, Scale Data

A Survey of Cloud-Based Services Leveraged by Big Data Applications

Advances in Data Mining and Database Management - Managing and Processing Big Data in Cloud Computing; 10.4018/978-1-4666-9767-6.ch008; 2016; pp. 121-131

Author(s): S. ZerAfshan Goher, Barkha Javed, Peter Bloodsworth

Keyword(s): Big Data, Data Storage, Data Analytics, Large Scale, Future Trends, Advantages And Disadvantages, Large Scale Data, Big Data Applications, Big Data Storage, Scale Data

Due to the growing interest in harnessing the hidden significance of data, more and more enterprises are moving to data analytics. Data analytics requires the analysis and management of large-scale data to find hidden patterns among various data components and gain useful insight. The derived information is then used to predict future trends that can help a business flourish, such as customers' likes and dislikes, the reasons behind customer churn, and more. In this paper, several techniques for big data analysis are investigated along with their advantages and disadvantages. The significance of cloud computing for big data storage is also discussed. Finally, techniques for the robust and efficient use of big data are discussed.
