A Survey on Prediction Using Big Data Analytics

International Journal of Big Data and Analytics in Healthcare ◽

10.4018/ijbdah.2017010101 ◽

2017 ◽

Vol 2 (1) ◽

pp. 1-15

Author(s):

M. Supriya ◽

A.J. Deepa

Keyword(s):

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Community Services ◽

Medical Data ◽

The Body ◽

Prediction Methods ◽

Accuracy Rate ◽

Accurate Analysis

This article describes how nowadays, the growth of big data in bio-medical and healthcare community services is increasing rapidly. The early detection of diseases and patient care are analyzed with the help of accurate analysis of medical data includes diagnosed patients' details. The analysis of accuracy rate is considerably reduced when the quality of medical data is unclear since every part of the body has unique characteristics of certain regional diseases that may suppress the prediction of diseases. This article reviews the detailed survey of different prediction methods developed for analyzing the accuracy rate of disease affected patients in 2015-2016 mainly focuses on choosing the efficient predictions based on the quality of medical data not only provides the overall view of prediction methods but also gives the idea of big data analytics in medical data further discusses the methods, techniques used and the pros and cons of prediction methods.

Download Full-text

Big Data Analytics in Health Care: A Review Paper

International Journal of Computer Science and Information Technology ◽

10.5121/ijcsit.2021.13202 ◽

2021 ◽

Vol 13 (2) ◽

pp. 17-28

Author(s):

Maria Mohammad Yousef

Keyword(s):

Health Care ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Medical Data ◽

Health Records ◽

Quality Of Patient Care ◽

Data Content ◽

Tools And Techniques

The application of big data in health care is a fast-growing field, with many discoveries and methodologies published in the last five years. Big data refers to datasets that are not only big but also high in variety and velocity, which makes them difficult to handle using traditional tools and techniques. Moreover, medical data is one of the most growing data, as it is obtained from Electronic Health Records (EHRs) or patients themselves. Due to the rapid growth of such medical data, we need to provide suitable tools and techniques in order to handle and extract value and knowledge from these datasets to improve the quality of patient care and reduces healthcare costs. Furthermore, such value can be provided using big data analytics, which is the application of advanced analytics techniques on big data. This paper presents an overview of big data content, sources, technologies, tools, and challenges in health care. It also intends to identify the strategies to overcome the challenges.

Download Full-text

Big Data Analytics in Healthcare

BioMed Research International ◽

10.1155/2015/370194 ◽

2015 ◽

Vol 2015 ◽

pp. 1-16 ◽

Cited By ~ 178

Author(s):

Ashwin Belle ◽

Raghuram Thiagarajan ◽

S. M. Reza Soroushmehr ◽

Fatemeh Navidi ◽

Daniel A. Beard ◽

...

Keyword(s):

Big Data ◽

Data Analytics ◽

Care Delivery ◽

Healthcare Delivery ◽

Big Data Analytics ◽

Healthcare Systems ◽

Medical Data ◽

Unstructured Data ◽

Process Of Care ◽

Multimodal Data

The rapidly expanding field of big data analytics has started to play a pivotal role in the evolution of healthcare practices and research. It has provided tools to accumulate, manage, analyze, and assimilate large volumes of disparate, structured, and unstructured data produced by current healthcare systems. Big data analytics has been recently applied towards aiding the process of care delivery and disease exploration. However, the adoption rate and research development in this space is still hindered by some fundamental problems inherent within the big data paradigm. In this paper, we discuss some of these major challenges with a focus on three upcoming and promising areas of medical research: image, signal, and genomics based analytics. Recent research which targets utilization of large volumes of medical data while combining multimodal data from disparate sources is discussed. Potential areas of research within this field which have the ability to provide meaningful impact on healthcare delivery are also examined.

Download Full-text

SMOTE-BD: An Exact and Scalable Oversampling Method for Imbalanced Classification in Big Data

Journal of Computer Science and Technology ◽

10.24215/16666038.18.e23 ◽

2018 ◽

Vol 18 (03) ◽

pp. e23 ◽

Cited By ~ 7

Author(s):

María José Basgall ◽

Waldo Hasperué ◽

Marcelo Naiouf ◽

Alberto Fernández ◽

Francisco Herrera

Keyword(s):

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Model Design ◽

Minority Class ◽

Imbalanced Classification ◽

Design And Implementation ◽

Learning Issues ◽

Intelligent Model

The volume of data in today's applications has meant a change in the way Machine Learning issues are addressed. Indeed, the Big Data scenario involves scalability constraints that can only be achieved through intelligent model design and the use of distributed technologies. In this context, solutions based on the Spark platform have established themselves as a de facto standard. In this contribution, we focus on a very important framework within Big Data Analytics, namely classification with imbalanced datasets. The main characteristic of this problem is that one of the classes is underrepresented, and therefore it is usually more complex to find a model that identifies it correctly. For this reason, it is common to apply preprocessing techniques such as oversampling to balance the distribution of examples in classes. In this work we present SMOTE-BD, a fully scalable preprocessing approach for imbalanced classification in Big Data. It is based on one of the most widespread preprocessing solutions for imbalanced classification, namely the SMOTE algorithm, which creates new synthetic instances according to the neighborhood of each example of the minority class. Our novel development is made to be independent of the number of partitions or processes created to achieve a higher degree of efficiency. Experiments conducted on different standard and Big Data datasets show the quality of the proposed design and implementation.

Download Full-text

Big Data Analytics Tools and Platform in Big Data Landscape

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Handbook of Research on Pattern Engineering System Development for Big Data Analytics ◽

10.4018/978-1-5225-3870-7.ch006 ◽

2018 ◽

pp. 80-89

Author(s):

Mohd Imran ◽

Mohd Vasim Ahamad ◽

Misbahul Haque ◽

Mohd Shoaib

Keyword(s):

Big Data ◽

Comparative Analysis ◽

Comparative Study ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Apache Hadoop ◽

Pros And Cons ◽

The Comparative Study ◽

Analytical Tools

The term big data analytics refers to mining and analyzing of the voluminous amount of data in big data by using various tools and platforms. Some of the popular tools are Apache Hadoop, Apache Spark, HBase, Storm, Grid Gain, HPCC, Casandra, Pig, Hive, and No SQL, etc. These tools are used depending on the parameter taken for big data analysis. So, we need a comparative analysis of such analytical tools to choose best and simpler way of analysis to gain more optimal throughput and efficient mining. This chapter contributes to a comparative study of big data analytics tools based on different aspects such as their functionality, pros, and cons based on characteristics that can be used to determine the best and most efficient among them. Through the comparative study, people are capable of using such tools in a more efficient way.

Download Full-text

Big Data Analytics Tools and Platform in Big Data Landscape

10.4018/978-1-6684-3662-2.ch029 ◽

2022 ◽

pp. 622-631

Author(s):

Mohd Imran ◽

Mohd Vasim Ahamad ◽

Misbahul Haque ◽

Mohd Shoaib

Keyword(s):

Big Data ◽

Comparative Analysis ◽

Comparative Study ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Apache Hadoop ◽

Pros And Cons ◽

The Comparative Study ◽

Analytical Tools

The term big data analytics refers to mining and analyzing of the voluminous amount of data in big data by using various tools and platforms. Some of the popular tools are Apache Hadoop, Apache Spark, HBase, Storm, Grid Gain, HPCC, Casandra, Pig, Hive, and No SQL, etc. These tools are used depending on the parameter taken for big data analysis. So, we need a comparative analysis of such analytical tools to choose best and simpler way of analysis to gain more optimal throughput and efficient mining. This chapter contributes to a comparative study of big data analytics tools based on different aspects such as their functionality, pros, and cons based on characteristics that can be used to determine the best and most efficient among them. Through the comparative study, people are capable of using such tools in a more efficient way.

Download Full-text

Using Big Data Analytics to Assist a Smart City to Prevent Cyber Security Threats

Examining the Socio-Technical Impact of Smart Cities - Advances in Human and Social Aspects of Technology ◽

10.4018/978-1-7998-5326-8.ch005 ◽

2021 ◽

pp. 107-124

Author(s):

Fenio Annansingh

Keyword(s):

Big Data ◽

Cyber Security ◽

Smart City ◽

Data Analytics ◽

Life Quality ◽

Big Data Analytics ◽

Review Of The Literature ◽

Smart Services ◽

Prevention Model

The concept of a smart city as a means to enhance the life quality of citizens has been gaining increasing importance in recent years globally. A smart city consists of city infrastructure, which includes smart services, devices, and institutions. Every second, these components of the smart city infrastructure are generating data. The vast amount of data is called big data. This chapter explores the possibilities of using big data analytics to prevent cybersecurity threats in a smart city. It also analyzed how big data tools and concepts can solve cybersecurity challenges and detect and prevent attacks. Using interviews and an extensive review of the literature have developed the data analytics and cyber prevention model. The chapter concludes by indicating that big data analytics allow a smart city to identify and solve cybersecurity challenges quickly and efficiently.

Download Full-text

IoT Based Agriculture as a Cloud and Big Data Service

Securing the Internet of Things ◽

10.4018/978-1-5225-9866-4.ch069 ◽

2020 ◽

pp. 1499-1521

Author(s):

Sukhpal Singh Gill ◽

Inderveer Chana ◽

Rajkumar Buyya

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Cloud Services ◽

Cloud Environment ◽

Big Data Technologies ◽

Sensor Networking ◽

New Applications

Cloud computing has transpired as a new model for managing and delivering applications as services efficiently. Convergence of cloud computing with technologies such as wireless sensor networking, Internet of Things (IoT) and Big Data analytics offers new applications' of cloud services. This paper proposes a cloud-based autonomic information system for delivering Agriculture-as-a-Service (AaaS) through the use of cloud and big data technologies. The proposed system gathers information from various users through preconfigured devices and IoT sensors and processes it in cloud using big data analytics and provides the required information to users automatically. The performance of the proposed system has been evaluated in Cloud environment and experimental results show that the proposed system offers better service and the Quality of Service (QoS) is also better in terms of QoS parameters.

Download Full-text

Mobile network quality of experience using big data analytics approach

2017 8th International Conference on Information Technology (ICIT) ◽

10.1109/icitech.2017.8079923 ◽

2017 ◽

Cited By ~ 3

Author(s):

Ayisat W. Yusuf-Asaju ◽

Zulkhairi B. Dahalin ◽

Azman Ta'a

Keyword(s):

Big Data ◽

Data Analytics ◽

Quality Of Experience ◽

Big Data Analytics ◽

Mobile Network ◽

Network Quality

Download Full-text

Identifying Requirements for Big Data Analytics and Mapping to Hadoop Tools

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c5524.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 4384-4392

Keyword(s):

Big Data ◽

Data Storage ◽

Data Analytics ◽

Big Data Analytics ◽

Management Tools ◽

Trade Offs ◽

Category Comparison ◽

Pros And Cons ◽

Hadoop Ecosystem ◽

Functional Areas

Big data is being generating in a wide variety of formats at an exponential rate. Big data analytics deals with processing and analyzing voluminous data to provide useful insight for guided decision making. The traditional data storage and management tools are not well-equipped to handle big data and its application. Apache Hadoop is a popular open-source platform that supports storage and processing of extremely large datasets. For the purposes of big data analytics, Hadoop ecosystem provides a variety of tools. However, there is a need to select a tool that is best suited for a specific requirement of big data analytics. The tools have their own advantages and drawbacks over each other. Some of them have overlapping business use cases however they differ in critical functional areas. So, there is a need to consider the trade-offs between usability and suitability while selecting a tool from Hadoop ecosystem. This paper identifies the requirements for Big Data Analytics (BDA) and maps tools of the Hadoop framework that are best suited for them. For this, we have categorized Hadoop tools according to their functionality and usage. Different Hadoop tools are discussed from the users’ perspective along with their pros and cons, if any. Also, for each identified category, comparison of Hadoop tools based on important parameters is presented. The tools have been thoroughly studied and analyzed based on their suitability for the different requirements of big data analytics. A mapping of big data analytics requirements to the Hadoop tools has been established for use by the data analysts and predictive modelers.

Download Full-text