Active replication for latency-sensitive stream processing in Apache Flink

Author(s):  
Guillaume Rosinosky ◽  
Florian Schmidt ◽  
Oleh Bodunov ◽  
Christof Fetzer ◽  
Andre Martin ◽  
...  
2020 ◽  
Vol 140 (9) ◽  
pp. 1030-1039
Author(s):  
W.A. Shanaka P. Abeysiriwardhana ◽  
Janaka L. Wijekoon ◽  
Hiroaki Nishi

2009 ◽  
Vol 29 (10) ◽  
pp. 2786-2790
Author(s):  
Xiao-jia YIN ◽  
Shi-guang JU ◽  
Ying-jie WANG

Author(s):  
Martin Hirzel ◽  
Guillaume Baudart

2021 ◽  
Vol 11 (12) ◽  
pp. 5523
Author(s):  
Qian Ye ◽  
Minyan Lu

The main purpose of our provenance research for DSP (distributed stream processing) systems is to analyze abnormal results. Provenance for these systems is nontrivial because of the ephemerality of stream data and the instant data processing mode of modern DSP systems. Challenges include, but are not limited to, avoiding excessive runtime overhead, reducing the storage required for provenance-related data, and presenting provenance in an easy-to-use fashion. Without prior knowledge of which kinds of data may eventually lead to abnormal results, we have to track all transformations in detail, which can place a heavy burden on the system. This paper proposes s2p (Stream Process Provenance), which consists mainly of online provenance and offline provenance, to provide fine- and coarse-grained provenance at different levels of precision. We base our design of s2p on the fact that, for a mature online DSP system, abnormal results are rare, and results that require a detailed analysis are even rarer. We also consider state transitions in our provenance explanation. We implement s2p on Apache Flink as s2p-flink and conduct three experiments to evaluate its scalability, efficiency, and overhead in terms of end-to-end cost, throughput, and space overhead. Our evaluation shows that s2p-flink incurs a 13% to 32% cost overhead, an 11% to 24% decline in throughput, and little additional space cost in the online provenance phase. The experiments also demonstrate that s2p-flink scales well. A case study is presented to demonstrate the feasibility of the whole s2p solution.
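
The two-phase idea in the abstract can be illustrated with a toy sketch. This is not s2p's actual mechanism and does not use the Flink API. It assumes that the online phase records only a coarse link from each output to its input window, and that the offline phase replays a single window in detail when a result is flagged as abnormal. All names here (`CoarseProvenanceLog`, `run_online`, `explain_offline`) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class CoarseProvenanceLog:
    """Online phase (assumed): map each output index to its input window index."""
    window_of_output: Dict[int, int] = field(default_factory=dict)


def run_online(windows: List[List[int]],
               op: Callable[[List[int]], int],
               log: CoarseProvenanceLog) -> List[int]:
    """Apply op to every window and record only the coarse output-to-window link."""
    outputs: List[int] = []
    for window_id, records in enumerate(windows):
        out_id = len(outputs)
        outputs.append(op(records))
        log.window_of_output[out_id] = window_id
    return outputs


def explain_offline(out_id: int,
                    windows: List[List[int]],
                    log: CoarseProvenanceLog,
                    contributes: Callable[[int], bool]) -> List[int]:
    """Offline phase (assumed): revisit one window and keep the suspicious records."""
    window = windows[log.window_of_output[out_id]]
    return [record for record in window if contributes(record)]


# Toy input: two windows of integers, aggregated with sum.
windows = [[1, 2, 3], [4, -50, 6]]
log = CoarseProvenanceLog()
outputs = run_online(windows, sum, log)

# Only results flagged as abnormal (here: negative sums) get a detailed analysis.
abnormal_ids = [i for i, value in enumerate(outputs) if value < 0]
culprits = {i: explain_offline(i, windows, log, lambda r: r < 0) for i in abnormal_ids}
```

The sketch keeps the online bookkeeping to one integer per output, which mirrors the abstract's premise that abnormal results are rare and detailed tracking can be deferred until it is needed.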

