Imputation for Missing Items in a Stream Data Based on Gamma Distribution

Author(s):  
Zhipeng Sun ◽  
Guosun Zeng ◽  
Chunling Ding
2020 ◽  
Vol 6 (8(77)) ◽  
pp. 13-17
Author(s):  
Azimkhan Kurmankozhayev ◽  
Elmira Seilbekovna Yesbergenova

Presented the results of evaluation of structural connection, identity and interchangeability of main asymmetric types of theoretical distributions most often acceptable for assessing the distributions of various indicators in geology and technology. The method of empirical analysis and statistical inference was used with the involvement of nonparametric facts according to the distribution patterns. The analysis of the empirical results of the application of the lognormal, gamma distribution and the Weibull distribution with the involvement of extensive statistical data from literary and research sources is carried out. The characteristic features and statistical regularities of distributions inherent to them are revealed, estimated statistical conclusions are obtained, according to which structural relationships between the functions of the lognormal, gamma and Weibull distributions are revealed. The identity and authenticity of the development of probabilistic frequencies in their application have been established, the complex geometric "image" of asymmetry inherent to these types of distributions is generalized. Structural relationships and interchangeability of asymmetric types of distributions are recommended to increase the reliability and credibility of the estimated choice of distribution in conditions of uncertainty and insignificance of statistical data when solving problems associated with forecasts, technological and computer developments.


1985 ◽  
Vol 50 (11) ◽  
pp. 2545-2557
Author(s):  
Pavel Hasal ◽  
Vladimír Kudrna ◽  
Jitka Vyhlídková

The paper is focused on a theoretical analysis of the function of continuous flow mixer with the so-called gamma-distribution of fluid residence times, used as a linear filter smoothing undesirable fluctuations of input properties. A relation is derived expressing the degree of smoothing of the signal passing through the system, as a function of statistical parameters of this signal and of gamma-distribution of fluid-residence times in the mixer. The analysis of this relation leads to conclusions concerning the prediction of the operation of smoothing mixers or the design of their basic parameters.


2021 ◽  
Vol 11 (12) ◽  
pp. 5523
Author(s):  
Qian Ye ◽  
Minyan Lu

The main purpose of our provenance research for DSP (distributed stream processing) systems is to analyze abnormal results. Provenance for these systems is not nontrivial because of the ephemerality of stream data and instant data processing mode in modern DSP systems. Challenges include but are not limited to an optimization solution for avoiding excessive runtime overhead, reducing provenance-related data storage, and providing it in an easy-to-use fashion. Without any prior knowledge about which kinds of data may finally lead to the abnormal, we have to track all transformations in detail, which potentially causes hard system burden. This paper proposes s2p (Stream Process Provenance), which mainly consists of online provenance and offline provenance, to provide fine- and coarse-grained provenance in different precision. We base our design of s2p on the fact that, for a mature online DSP system, the abnormal results are rare, and the results that require a detailed analysis are even rarer. We also consider state transition in our provenance explanation. We implement s2p on Apache Flink named as s2p-flink and conduct three experiments to evaluate its scalability, efficiency, and overhead from end-to-end cost, throughput, and space overhead. Our evaluation shows that s2p-flink incurs a 13% to 32% cost overhead, 11% to 24% decline in throughput, and few additional space costs in the online provenance phase. Experiments also demonstrates the s2p-flink can scale well. A case study is presented to demonstrate the feasibility of the whole s2p solution.


Sign in / Sign up

Export Citation Format

Share Document