scholarly journals Stream Data Load Prediction for Resource Scaling Using Online Support Vector Regression

Algorithms ◽  
2019 ◽  
Vol 12 (2) ◽  
pp. 37 ◽  
Author(s):  
Zhigang Hu ◽  
Hui Kang ◽  
Meiguang Zheng

A distributed data stream processing system handles real-time, changeable and sudden streaming data load. Its elastic resource allocation has become a fundamental and challenging problem with a fixed strategy that will result in waste of resources or a reduction in QoS (quality of service). Spark Streaming as an emerging system has been developed to process real time stream data analytics by using micro-batch approach. In this paper, first, we propose an improved SVR (support vector regression) based stream data load prediction scheme. Then, we design a spark-based maximum sustainable throughput of time window (MSTW) performance model to find the optimized number of virtual machines. Finally, we present a resource scaling algorithm TWRES (time window resource elasticity scaling algorithm) with MSTW constraint and streaming data load prediction. The evaluation results show that TWRES could improve resource utilization and mitigate SLA (service level agreement) violation.

2010 ◽  
Vol 145 ◽  
pp. 153-158
Author(s):  
Fei He ◽  
Zhi Guo Liang ◽  
Min Li ◽  
Jin Wu Xu

In order to predict product quality and optimize production process, the product quality model needs to be built. According to the fact that the common methods always cost long training time and can not realize real-time update, an online product quality model based on the online support vector regression is here proposed. The real field data of zinc coating weights from strip hot-dip galvanizing are used for validation. The results show that the models based on the online support vector regression have a higher prediction precision and shorter training time than traditional support vector regression, which is convenient to complete the real-time update. The zinc coating weights forecasting model based on the online support vector regression for multi-group data has an average of the relative prediction error of 4.35%, thus for the model will be used as an analysis tools for the quality control.


2009 ◽  
Vol 53 (2) ◽  
pp. 191-201 ◽  
Author(s):  
Paola Bermolen ◽  
Dario Rossi

Sign in / Sign up

Export Citation Format

Share Document