HCache: A Hash-based Hybrid Caching Model for Real-Time Streaming Data Analytics

The volume of streaming sensor data from various environmental sensors continues to increase rapidly due to wider deployments of IoT devices at much greater scales than ever before. This, in turn, causes massive increase in the fog, cloud network traffic which leads to heavily delayed network operations. In streaming data analytics, the ability to obtain real time data insight is crucial for computational sustainability for many IoT enabled applications such as environmental monitors, pollution and climate surveillance, traffic control or even E-commerce applications. However, such network delays prevent us from achieving high quality real-time data analytics of environmental information. In order to address this challenge, we propose the Fog Sampling Node Selector (Fossel) technique that can significantly reduce the IoT network and processing delays by algorithmically selecting an optimal subset of fog nodes to perform the sensor data sampling. In addition, our technique performs a simple type of query executions within the fog nodes in order to further reduce the network delays by processing the data near the data producing devices. Our extensive evaluations show that Fossel technique outperforms the state-of-the-art in terms of latency reduction as well as in bandwidth consumption, network usage and energy consumption.

Download Full-text

Efficient Real-Time Decision Making Using Streaming Data Analytics in IoT Environment

International Conference on Advanced Computing Networking and Informatics - Advances in Intelligent Systems and Computing ◽

10.1007/978-981-13-2673-8_19 ◽

2018 ◽

pp. 165-173 ◽

Cited By ~ 1

Author(s):

S. Valliappan ◽

P. Bagavathi Sivakumar ◽

V. Ananthanarayanan

Keyword(s):

Decision Making ◽

Real Time ◽

Data Analytics ◽

Streaming Data

Download Full-text

A Real-Time Patient Monitoring Framework for Fall Detection

Wireless Communications and Mobile Computing ◽

10.1155/2019/9507938 ◽

2019 ◽

Vol 2019 ◽

pp. 1-13 ◽

Cited By ~ 8

Author(s):

Dharmitha Ajerla ◽

Sazia Mahfuz ◽

Farhana Zulkernine

Keyword(s):

Real Time ◽

Data Analytics ◽

Short Term Memory ◽

Detection System ◽

Fall Detection ◽

Edge Computing ◽

Streaming Data ◽

Real Time Analysis ◽

Long Short Term Memory ◽

Lstm Network

Fall detection is a major problem in the healthcare department. Elderly people are more prone to fall than others. There are more than 50% of injury-related hospitalizations in people aged over 65. Commercial fall detection devices are expensive and charge a monthly fee for their services. A more affordable and adaptable system is necessary for retirement homes and clinics to build a smart city powered by IoT and artificial intelligence. An effective fall detection system would detect a fall and send an alarm to the appropriate authorities. We propose a framework that uses edge computing where instead of sending data to the cloud, wearable devices send data to a nearby edge device like a laptop or mobile device for real-time analysis. We use cheap wearable sensor devices from MbientLab, an open source streaming engine called Apache Flink for streaming data analytics, and a long short-term memory (LSTM) network model for fall classification. The model is trained using a published dataset called “MobiAct.” Using the trained model, we analyse optimal sampling rates, sensor placement, and multistream data correction. Our edge computing framework can perform real-time streaming data analytics to detect falls with an accuracy of 95.8%.

Download Full-text

Understanding the End-Users and Technical Requirements for Real-Time Streaming Data Analytics and Visualisation

10.23919/icac50006.2021.9594257 ◽

2021 ◽

Author(s):

Fateme Dinmohammadi ◽

Duncan Wilson

Keyword(s):

Real Time ◽

Data Analytics ◽

End Users ◽

Streaming Data ◽

Technical Requirements

Download Full-text

Fog-Cloud Collaboration for Real-Time Streaming Applications

Advances in Wireless Technologies and Telecommunication - Handbook of Research on the IoT, Cloud Computing, and Wireless Network Optimization ◽

10.4018/978-1-5225-7335-7.ch007 ◽

2019 ◽

pp. 128-147

Author(s):

Biji Nair ◽

S. Mary Saira Bhanu

Keyword(s):

Real Time ◽

Data Analytics ◽

High Speed ◽

Fog Computing ◽

Data Representation ◽

Streaming Data ◽

Cloud Data ◽

Streaming Applications ◽

Stream Management

Real-time streaming applications (RTSAs) generate huge volumes of temporally ordered, infinite, continuous, high speed data streams demanding both real-time and long-term data analytics. Fog computing is a reliable solution for processing and analyzing real-time streaming data as it offers low latency, location-aware, geographically distributed service at fog node and provides long-term services at the cloud data center (DC). This chapter addresses the challenge of coordinating the fog nodes and cloud for efficient processing of real-time streaming data in motion and at rest. The fog-cloud collaboration framework proposed in this chapter employs data stream management system (DSMS) schema at the fog node for real-time stream data processing and response generation. The data representation in micro-clusters at fog node and macro-clusters at DC facilitates accurate data analytics. The coordination between fog node and DC is through local ontology and global ontology respectively.

Download Full-text

Designing Framework for Real Time Twitter Data Analytics using Apache Flume and Pig

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.f7726.038620 ◽

2020 ◽

Vol 8 (6) ◽

pp. 4474-4477

Keyword(s):

Real Time ◽

Data Analytics ◽

Social Issues ◽

Analytical Framework ◽

Streaming Data ◽

Data Streaming ◽

Text Data ◽

Breaking News ◽

Unstructured Text ◽

Twitter Data

In the world of technology, people prefer social media to express themselves. Record says Twitter has more than 321 million active users with 100 million users posting approximately 340 million tweets a day. Twitter is the largest source of breaking news on social issues specially election-related where people can express their views also suggest their opinion. Twitter is generating unlimited unstructured text data. Hadoop is one of the finest tools accessible for analyzing twitter data because it supports processing of distributed big data, streaming data, time stamped data, text data etc. Whereas Apache Flume is used to extract real time twitter data into HDFS. This study attempts to establish an analytical framework to derive and interpret structured as well as unstructured Twitter data. The proposed framework comprises of real time twitter data insertion, its processing, and data visualization utilizing Apache Flume and pig. In this project we fetch positive and negative tweets on election data from twitter and analyzing the party status and the probability to win the election.

Download Full-text

Developing a Real-Time Data Analytics Framework for Twitter Streaming Data

2017 IEEE International Congress on Big Data (BigData Congress) ◽

10.1109/bigdatacongress.2017.49 ◽

2017 ◽

Cited By ~ 4

Author(s):

Babak Yadranjiaghdam ◽

Seyedfaraz Yasrobi ◽

Nasseh Tabrizi

Keyword(s):

Real Time ◽

Data Analytics ◽

Streaming Data ◽

Time Data ◽

Real Time Data

Download Full-text

Opinion Mining with Real Time Ontology Streaming Data

International Journal of Psychosocial Rehabilitation ◽

10.37200/ijpr/v23i1/pr190244 ◽

2019 ◽

Vol 23 (1) ◽

pp. 346-357

Author(s):

Vithya G ◽

Naren J ◽

Varun V

Keyword(s):

Real Time ◽

Opinion Mining ◽

Streaming Data

Download Full-text

Stream Data Load Prediction for Resource Scaling Using Online Support Vector Regression

Algorithms ◽

10.3390/a12020037 ◽

2019 ◽

Vol 12 (2) ◽

pp. 37 ◽

Cited By ~ 3

Author(s):

Zhigang Hu ◽

Hui Kang ◽

Meiguang Zheng

Keyword(s):

Support Vector Regression ◽

Real Time ◽

Virtual Machines ◽

Time Window ◽

Performance Model ◽

Streaming Data ◽

Support Vector ◽

Load Prediction ◽

Stream Data ◽

Online Support Vector Regression

A distributed data stream processing system handles real-time, changeable and sudden streaming data load. Its elastic resource allocation has become a fundamental and challenging problem with a fixed strategy that will result in waste of resources or a reduction in QoS (quality of service). Spark Streaming as an emerging system has been developed to process real time stream data analytics by using micro-batch approach. In this paper, first, we propose an improved SVR (support vector regression) based stream data load prediction scheme. Then, we design a spark-based maximum sustainable throughput of time window (MSTW) performance model to find the optimized number of virtual machines. Finally, we present a resource scaling algorithm TWRES (time window resource elasticity scaling algorithm) with MSTW constraint and streaming data load prediction. The evaluation results show that TWRES could improve resource utilization and mitigate SLA (service level agreement) violation.

Download Full-text

Move Real-Time Data Analytics to the Cloud: A Case Study on Heron to Dataflow Migration

10.1109/bigdata52589.2021.9671294 ◽

2021 ◽

Author(s):

Huijun Wu ◽

Xiaoyao Qian ◽

Aleks Shulman ◽

Kanishk Karanawat ◽

Tushar Singh ◽

...

Keyword(s):

Real Time ◽

Data Analytics ◽

Time Data ◽

Real Time Data

Download Full-text