Performance evaluation of machine learning based big data processing framework for prediction of heart disease

Author(s): Abderrahmane Ed-Daoudy, Khalil Maalmi

2017, Vol 102 (3), pp. 2099-2116
Author(s): Gunasekaran Manogaran, V. Vijayakumar, R. Varatharajan, Priyan Malarvizhi Kumar, Revathi Sundarasekar, ...

2019, Vol 8 (9), pp. 387
Author(s): Silvino Pedro Cumbane, Gyozo Gidófalvi

Natural hazards result in devastating losses to human life, environmental assets, and personal, regional, and national economies. The availability of different kinds of big data, such as satellite imagery, Global Positioning System (GPS) traces, mobile Call Detail Records (CDRs), and social media posts, in conjunction with advances in data analytic techniques (e.g., data mining, big data processing, machine learning, and deep learning), can facilitate the extraction of geospatial information that is critical for rapid and effective disaster response. However, developing disaster response systems usually requires integrating data from different sources (streaming data sources and data sources at rest) with different characteristics and types, which consequently have different processing needs. Deciding which processing framework to use for a specific kind of big data and a given task is usually a challenge for researchers in the disaster management field. This paper therefore contributes in four ways. First, potential big data sources are described and characterized. Second, big data processing frameworks are characterized and grouped based on the sources of data they handle. Third, each big data processing framework is briefly described, and the frameworks in each group are compared on the main aspects related to specific processing needs: computing cluster architecture, data flow, data processing model, fault tolerance, scalability, latency, back-pressure mechanism, programming languages, and support for machine learning libraries. Finally, a link between big data sources and processing frameworks is established, based on the processing needs of essential tasks in the response phase of disaster management.
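To make the streaming-versus-at-rest distinction concrete, below is a minimal sketch (not from the paper) of how one stream-processing framework, Apache Spark Structured Streaming, can join a streaming source with data at rest. The input paths, the post schema, and the shelter table are illustrative assumptions, not artifacts of the survey.

# A minimal sketch, assuming hypothetical input paths and schema;
# it is not a reference implementation from the surveyed paper.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (SparkSession.builder
         .appName("disaster-response-sketch")
         .getOrCreate())

# Data at rest: a hypothetical static table of shelter locations.
shelters = spark.read.parquet("/data/shelters.parquet")

# Streaming source: hypothetical JSON posts arriving in a directory.
posts = (spark.readStream
         .schema("user STRING, region STRING, text STRING, ts TIMESTAMP")
         .json("/data/incoming_posts/"))

# Stream-static join: enrich each incoming post with shelter info for its region.
enriched = posts.join(shelters, on="region", how="left")

# Windowed count per region; the watermark bounds the state the engine keeps,
# one of the latency/fault-tolerance trade-offs the survey compares.
counts = (enriched
          .withWatermark("ts", "10 minutes")
          .groupBy(F.window("ts", "5 minutes"), "region")
          .count())

query = (counts.writeStream
         .outputMode("update")
         .format("console")
         .start())
query.awaitTermination()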


Author(s): Snigdha Sen, Sonali Agarwal, Pavan Chakraborty, Krishna Pratap Singh

Procedia CIRP, 2019, Vol 83, pp. 661-664
Author(s): Yinghao Ye, Meilin Wang, Shuhong Yao, Jarvis N. Jiang, Qing Liu

2020, Vol 10 (14), pp. 4901
Author(s): Waleed Albattah, Rehan Ullah Khan, Khalil Khan

Processing big data requires serious computing resources, so big data processing is a challenge not only for algorithms but also for the infrastructure that runs them. This article analyzes a large dataset from several points of view, one of which is whether reduced collections of big data can be processed with fewer computing resources. To that end, the study analyzed 40 GB of data to test various data-reduction strategies. The goal is to reduce the data without compromising detection and model learning in machine learning. Several alternatives were analyzed, and it was found that in many cases and settings the data can be reduced to some extent without compromising detection performance. Tests on 200 attributes showed that more than 80% of the data could be ignored at a performance loss of only 4%. The results of the study thus provide useful insights into big data analytics.
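As an illustration of the reduction idea, the following sketch trains a classifier on all 200 attributes and again on the top 20% of them, then reports the accuracy gap. It is not the authors' pipeline: the synthetic dataset, the random-forest model, the f_classif selector, and the 20% keep-ratio are all assumptions chosen to mirror the 200-attribute setting.

# A minimal sketch of the data-reduction idea; dataset, model, selector,
# and keep-ratio are illustrative assumptions, not the study's pipeline.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Synthetic stand-in for a wide dataset with 200 attributes.
X, y = make_classification(n_samples=5000, n_features=200,
                           n_informative=30, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Baseline: train on all 200 attributes.
full = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)
acc_full = accuracy_score(y_te, full.predict(X_te))

# Reduced: keep the top 40 of 200 attributes, discarding ~80% of the data.
selector = SelectKBest(f_classif, k=40).fit(X_tr, y_tr)
reduced = RandomForestClassifier(random_state=0).fit(selector.transform(X_tr), y_tr)
acc_reduced = accuracy_score(y_te, reduced.predict(selector.transform(X_te)))

print(f"full: {acc_full:.3f}  reduced: {acc_reduced:.3f}  "
      f"loss: {acc_full - acc_reduced:.3f}")

How large a loss this trade costs in practice depends on how much redundancy the attributes carry, which is exactly what the study's 40 GB tests probe.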

