stream analytics
Recently Published Documents


TOTAL DOCUMENTS

81
(FIVE YEARS 38)

H-INDEX

7
(FIVE YEARS 1)

Algorithms ◽  
2021 ◽  
Vol 14 (8) ◽  
pp. 221
Author(s):  
Zhihui Du ◽  
Oliver Alvarado Rodriguez ◽  
Joseph Patchett ◽  
David A. Bader

Data from emerging applications, such as cybersecurity and social networking, can be abstracted as graphs whose edges are updated sequentially in the form of a stream. The challenging problem of interactive graph stream analytics is the quick response of the queries on terabyte and beyond graph stream data from end users. In this paper, a succinct and efficient double index data structure is designed to build the sketch of a graph stream to meet general queries. A single pass stream model, which includes general sketch building, distributed sketch based analysis algorithms and regression based approximation solution generation, is developed, and a typical graph algorithm—triangle counting—is implemented to evaluate the proposed method. Experimental results on power law and normal distribution graph streams show that our method can generate accurate results (mean relative error less than 4%) with a high performance. All our methods and code have been implemented in an open source framework, Arkouda, and are available from our GitHub repository, Bader-Research. This work provides the large and rapidly growing Python community with a powerful way to handle terabyte and beyond graph stream data using their laptops.


2021 ◽  
Vol 19 (3) ◽  
Author(s):  
László Toka

AbstractNovel applications will require extending traditional cloud computing infrastructure with compute resources deployed close to the end user. Edge and fog computing tightly integrated with carrier networks can fulfill this demand. The emphasis is on integration: the rigorous delay constraints, ensuring reliability on the distributed, remote compute nodes, and the sheer scale of the system altogether call for a powerful resource provisioning platform that offers the applications the best of the underlying infrastructure. We therefore propose Kubernetes-edge-scheduler that provides high reliability for applications in the edge, while provisioning less than 10% of resources for this purpose, and at the same time, it guarantees compliance with the latency requirements that end users expect. We present a novel topology clustering method that considers application latency requirements, and enables scheduling applications even on a worldwide scale of edge clusters. We demonstrate that in a potential use case, a distributed stream analytics application, our orchestration system can reduce the job completion time to 40% of the baseline provided by the default Kubernetes scheduler.


2021 ◽  
pp. 175-208
Author(s):  
Sumit Misra ◽  
Sanjoy Kumar Saha ◽  
Chandan Mazumdar
Keyword(s):  

Author(s):  
András Benczúr ◽  
Ferenc Béres ◽  
Domokos Kelen ◽  
Róbert Pálovics
Keyword(s):  

Author(s):  
Paul Brogan ◽  
Andres Jarmillo ◽  
Xueqin Amy Liu ◽  
John Hastings ◽  
David Laverty ◽  
...  
Keyword(s):  

2021 ◽  
Author(s):  
Arun Manicka Raja M ◽  
Swamynathan Sankaranarayanan

Abstract Stream analytics, a new paradigm in data analytics, has gained mo- mentum due to the voluminous stream data generation. With the huge increase in the edits performed on Wikipedia topics, it is tedious for the digital knowledge discovery users to nd their domain updates immediately. The users need to go through large information and spend more time to nd the potential data. There is a need for retrieving the Wikipedia edits based on the meta data of the article edits for later retriev-al. Hence, the clustering technique may be employed in order to group the Wikipedia article edits domain wise. Hence, in this paper, hierarchi- cal stream clustering is applied in order to retrieve the edits based on the user interest. Over a period of month, the data from Wikipedia is collected and used as a dataset. Our method is compared with the state-of-the-art clus-tering system WikiAutoCat and it is observed that the accuracy is improved by 10% and the clustering time is reduced by 20%.


Author(s):  
Ms. Shailaja B. Jadhav ◽  
Dr. D. V. Kodavade

Nowadays, big data processing systems are evolving to be more stream-oriented; where each data record is processed as it arrives by distributed and low latency computational frameworks [18]. Data streams have been extensively used in several fields of computational analytics such as data mining, business intelligence etc. [17]. In every field, the data stream can be considered as an ordered sequence of data items, as they continuously arrive over the period. Due to this characteristic, streaming data analytics is a challenging area of research [5, 11]. This paper aims to present data stream processing as a growing research field , along with streaming analytics frameworks as a rich focus area. The paper also contributes to evaluate the efficacy of available stream analytics frameworks. One of the Industry 4.0 use case - predictive maintenance rail transportation - has been illustrated here as a case study design mapped with streaming analytics framework.


Author(s):  
Priyan Malarvizhi KUMAR ◽  
Choong Seon Hong ◽  
Fatemeh Afghah ◽  
Gunasekaran Manogaran ◽  
Keping Yu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document