Distributed Caching Based Memory Optimizing Technology for Stream Data of IoV

The main purpose of our provenance research for DSP (distributed stream processing) systems is to analyze abnormal results. Provenance for these systems is nontrivial because of the ephemerality of stream data and the instant data processing mode of modern DSP systems. Challenges include, but are not limited to, avoiding excessive runtime overhead, reducing provenance-related data storage, and providing provenance in an easy-to-use fashion. Without prior knowledge of which kinds of data may eventually lead to abnormal results, we have to track all transformations in detail, which potentially imposes a heavy burden on the system. This paper proposes s2p (Stream Process Provenance), which mainly consists of online provenance and offline provenance, to provide fine- and coarse-grained provenance at different precision. We base the design of s2p on the fact that, for a mature online DSP system, abnormal results are rare, and results that require detailed analysis are even rarer. We also consider state transitions in our provenance explanation. We implement s2p on Apache Flink, as s2p-flink, and conduct three experiments to evaluate its scalability, efficiency, and overhead in terms of end-to-end cost, throughput, and space overhead. Our evaluation shows that s2p-flink incurs a 13% to 32% cost overhead, an 11% to 24% decline in throughput, and little additional space cost in the online provenance phase. Experiments also demonstrate that s2p-flink scales well. A case study is presented to demonstrate the feasibility of the whole s2p solution.


Finding Frequent Structures in XML Stream Data

2009 International Conference on Computational Science and Its Applications ◽ 10.1109/iccsa.2009.17 ◽ 2009 ◽ Cited By ~ 3

Author(s): Jeong Hee Hwang ◽ Mi Sug Gu

Keyword(s): Stream Data ◽ Xml Stream


The Real Time Railway Monitoring System suitable for Multi-View Object based on Sensor Stream Data Tracking

2020 International Conference on Information Science and Communications Technologies (ICISCT) ◽ 10.1109/icisct50599.2020.9351474 ◽ 2020

Author(s): Abbos Abduvaytov ◽ Rakhimov Mukhammad Abdu Kayumbek ◽ Heung Seok Jeon ◽ Ryumduck Oh

Keyword(s): Real Time ◽ Monitoring System ◽ Stream Data ◽ The Real ◽ Object Based ◽ Data Tracking


Damped sliding based utility oriented pattern mining over stream data

Knowledge-Based Systems ◽ 10.1016/j.knosys.2020.106653 ◽ 2021 ◽ Vol 213 ◽ pp. 106653

Author(s): Heonho Kim ◽ Unil Yun ◽ Yoonji Baek ◽ Hyunsoo Kim ◽ Hyoju Nam ◽ ...

Keyword(s): Pattern Mining ◽ Stream Data


A Mining Frequent Itemsets Algorithm in Stream Data Based on Sliding Time Decay Window

Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition ◽ 10.1145/3430199.3430226 ◽ 2020

Author(s): Xin Lu ◽ Shaonan Jin ◽ Xun Wang ◽ Jiao Yuan ◽ Kun Fu ◽ ...

Keyword(s): Frequent Itemsets ◽ Time Decay ◽ Stream Data ◽ Mining Frequent Itemsets


Rethinking Operators Placement of Stream Data Application in the Edge

Proceedings of the 29th ACM International Conference on Information & Knowledge Management ◽ 10.1145/3340531.3412116 ◽ 2020

Author(s): Thomas Lambert ◽ David Guyon ◽ Shadi Ibrahim

Keyword(s): Stream Data ◽ Data Application


Stream Data Load Prediction for Resource Scaling Using Online Support Vector Regression

Algorithms ◽ 10.3390/a12020037 ◽ 2019 ◽ Vol 12 (2) ◽ pp. 37 ◽ Cited By ~ 3

Author(s): Zhigang Hu ◽ Hui Kang ◽ Meiguang Zheng

Keyword(s): Support Vector Regression ◽ Real Time ◽ Virtual Machines ◽ Time Window ◽ Performance Model ◽ Streaming Data ◽ Support Vector ◽ Load Prediction ◽ Stream Data ◽ Online Support Vector Regression

A distributed data stream processing system must handle real-time streaming data loads that are variable and prone to sudden bursts. Elastic resource allocation for such a system is a fundamental and challenging problem, because a fixed allocation strategy results in either wasted resources or reduced QoS (quality of service). Spark Streaming, an emerging system, processes real-time stream data using a micro-batch approach. In this paper, we first propose an improved SVR (support vector regression) based stream data load prediction scheme. We then design a Spark-based maximum sustainable throughput of time window (MSTW) performance model to find the optimal number of virtual machines. Finally, we present TWRES (time window resource elasticity scaling algorithm), a resource scaling algorithm that combines the MSTW constraint with streaming data load prediction. The evaluation results show that TWRES can improve resource utilization and mitigate SLA (service level agreement) violations.
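
The general idea of turning a predicted load into a virtual machine count can be illustrated with a minimal Python sketch. This is not the TWRES algorithm or the MSTW model from the paper: the function name `vms_needed`, the `headroom` margin, the `min_vms`/`max_vms` bounds, and the assumption that sustainable throughput scales linearly with the number of VMs are all hypothetical simplifications, and the predicted load is taken as given rather than produced by an SVR model.

```python
import math


def vms_needed(predicted_load, per_vm_throughput, headroom=0.1,
               min_vms=1, max_vms=64):
    """Return the smallest VM count whose assumed capacity covers the load.

    predicted_load:     forecast records/second for the next time window
                        (assumed to come from some load predictor).
    per_vm_throughput:  assumed maximum sustainable records/second per VM;
                        linear scaling across VMs is an assumption here.
    headroom:           hypothetical safety margin added to the forecast.
    """
    if per_vm_throughput <= 0:
        raise ValueError("per_vm_throughput must be positive")
    required = predicted_load * (1.0 + headroom)
    count = math.ceil(required / per_vm_throughput)
    # Clamp to the allowed cluster size.
    return max(min_vms, min(max_vms, count))


# Plan a VM count for each upcoming time window from illustrative forecasts.
forecasts = [2000, 10000, 500]
plan = [vms_needed(load, per_vm_throughput=3000) for load in forecasts]
print(plan)
```

Scaling up ahead of a predicted burst is what reduces SLA violations, while scaling down during quiet windows is what improves utilization; a real performance model would replace the linear capacity assumption used here.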


Multivariable stream data classification using motifs and their temporal relations

Information Sciences ◽ 10.1016/j.ins.2009.06.036 ◽ 2009 ◽ Vol 179 (20) ◽ pp. 3489-3504 ◽ Cited By ~ 2

Author(s): Sungbo Seo ◽ Jaewoo Kang ◽ Keun Ho Ryu

Keyword(s): Data Classification ◽ Temporal Relations ◽ Stream Data


