Identifying Requirements for Big Data Analytics and Mapping to Hadoop Tools

The term big data analytics refers to mining and analyzing of the voluminous amount of data in big data by using various tools and platforms. Some of the popular tools are Apache Hadoop, Apache Spark, HBase, Storm, Grid Gain, HPCC, Casandra, Pig, Hive, and No SQL, etc. These tools are used depending on the parameter taken for big data analysis. So, we need a comparative analysis of such analytical tools to choose best and simpler way of analysis to gain more optimal throughput and efficient mining. This chapter contributes to a comparative study of big data analytics tools based on different aspects such as their functionality, pros, and cons based on characteristics that can be used to determine the best and most efficient among them. Through the comparative study, people are capable of using such tools in a more efficient way.

Download Full-text

Big Data Analytics Tools and Platform in Big Data Landscape

10.4018/978-1-6684-3662-2.ch029 ◽

2022 ◽

pp. 622-631

Author(s):

Mohd Imran ◽

Mohd Vasim Ahamad ◽

Misbahul Haque ◽

Mohd Shoaib

Keyword(s):

Big Data ◽

Comparative Analysis ◽

Comparative Study ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Apache Hadoop ◽

Pros And Cons ◽

The Comparative Study ◽

Analytical Tools

The term big data analytics refers to mining and analyzing of the voluminous amount of data in big data by using various tools and platforms. Some of the popular tools are Apache Hadoop, Apache Spark, HBase, Storm, Grid Gain, HPCC, Casandra, Pig, Hive, and No SQL, etc. These tools are used depending on the parameter taken for big data analysis. So, we need a comparative analysis of such analytical tools to choose best and simpler way of analysis to gain more optimal throughput and efficient mining. This chapter contributes to a comparative study of big data analytics tools based on different aspects such as their functionality, pros, and cons based on characteristics that can be used to determine the best and most efficient among them. Through the comparative study, people are capable of using such tools in a more efficient way.

Download Full-text

Big Data, Decision Models, and Public Health

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph17186723 ◽

2020 ◽

Vol 17 (18) ◽

pp. 6723

Author(s):

Chien-Lung Chan ◽

Chi-Chang Chang

Keyword(s):

Public Health ◽

Decision Making ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Decision Models ◽

Medical Decision ◽

Mining Machine ◽

Special Issue ◽

Trade Offs

Unlike most daily decisions, medical decision making often has substantial consequences and trade-offs. Recently, big data analytics techniques such as statistical analysis, data mining, machine learning and deep learning can be applied to construct innovative decision models. With complex decision making, it can be difficult to comprehend and compare the benefits and risks of all available options to make a decision. For these reasons, this Special Issue focuses on the use of big data analytics and forms of public health decision making based on the decision model, spanning from theory to practice. A total of 64 submissions were carefully blind peer reviewed by at least two referees and, finally, 23 papers were selected for this Special Issue.

Download Full-text

FACTORS AFFECTING THE ADOPTION OF BIG DATA ANALYTICS IN COMPANIES

Revista de Administração de Empresas ◽

10.1590/s0034-759020190607 ◽

2019 ◽

Vol 59 (6) ◽

pp. 415-429 ◽

Cited By ~ 1

Author(s):

JUAN-PEDRO CABRERA-SÁNCHEZ ◽

ÁNGEL F VILLAREJO-RAMOS

Keyword(s):

Big Data ◽

Data Storage ◽

Perceived Risk ◽

Data Analytics ◽

Big Data Analytics ◽

Structural Models ◽

Free Software ◽

Unified Theory ◽

Factors Affecting ◽

Use Of Technology

ABSTRACT With the total quantity of data doubling every two years, the low price of computing and data storage, make Big Data analytics (BDA) adoption desirable for companies, as a tool to get competitive advantage. Given the availability of free software, why have some companies failed to adopt these techniques? To answer this question, we extend the unified theory of technology adoption and use of technology model (UTAUT) adapted for the BDA context, adding two variables: resistance to use and perceived risk. We used the level of implementation of these techniques to divide companies into users and non-users of BDA. The structural models were evaluated by partial least squares (PLS). The results show the importance of good infrastructure exceeds the difficulties companies face in implementing it. While companies planning to use Big Data expect strong results, current users are more skeptical about its performance.

Download Full-text

Big Data on Machine Learning – A Review

Engineering and Scientific International Journal ◽

10.30726/esij/v8.i3.2021.83018 ◽

2021 ◽

Vol 8 (3) ◽

Author(s):

Balasree K ◽

Dharmarajan K

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Storage ◽

Data Analytics ◽

Rapid Development ◽

Learning Algorithms ◽

Big Data Analytics ◽

Machine Learning Algorithms ◽

Data Sets ◽

Big Data Technology

In rapid development of Big Data technology over the recent years, this paper discussing about the Machine Learning (ML) playing role that is based on methods and algorithms to Big Data Processing and Big Data Analytics. In evolutionary fields and computing fields of developments that both are complementing each other. Big Data: The rapid growth of such data solutions needed to be studied and provided to handle then to gain the knowledge from datasets and extracting values due to the data sets are very high in velocity and variety. The Big data analytics are involving and indicating the appropriate data storage and computational outline that enhanced by using Scalable Machine Learning Algorithms and Big Data Analytics then the analytics to reveal the massive amounts of hidden data’s and secret correlations. This type of Analytic information useful for organizations and companies to gain deeper knowledge, development and getting advantages over the competition. When using this Analytics we can predict the accurate implementation over the data. This paper presented about the detailed review of state-of-the-art developments and overview of advantages and challenges in Machine Learning Algorithms over big data analytics.

Download Full-text

Data Mining, Big Data, Data Analytics

Web Services ◽

10.4018/978-1-5225-7501-6.ch006 ◽

2019 ◽

pp. 89-104

Author(s):

Priya P. Panigrahi ◽

Tiratha Raj Singh

Keyword(s):

Big Data ◽

Data Storage ◽

Data Analytics ◽

Big Data Analytics ◽

Biological Data ◽

Smart Devices ◽

Data Life Cycle ◽

The Impact ◽

Special Relevance ◽

Generation Sequencing

In this digital and computing world, data formation and collection rate are growing very rapidly. With these improved proficiencies of data storage and fast computation along with the real-time distribution of data through the internet, the usual everyday ingestion of data is mounting exponentially. With the continuous advancement in data storage and accessibility of smart devices, the impact of big data will continue to develop. This chapter provides the fundamental concepts of big data, its benefits, probable pitfalls, big data analytics and its impact in Bioinformatics. With the generation of the deluge of biological data through next generation sequencing projects, there is a need to handle this data trough big data techniques. The chapter also presents a discussion of the tools for analytics, development of a novel data life cycle on big data, details of the problems and challenges connected with big data with special relevance to bioinformatics.

Download Full-text

Data Mining, Big Data, Data Analytics

Library and Information Services for Bioinformatics Education and Research - Advances in Library and Information Science ◽

10.4018/978-1-5225-1871-6.ch005 ◽

2017 ◽

pp. 91-111

Author(s):

Priya P. Panigrahi ◽

Tiratha Raj Singh

Keyword(s):

Big Data ◽

Data Storage ◽

Data Analytics ◽

Big Data Analytics ◽

Biological Data ◽

Smart Devices ◽

Data Life Cycle ◽

The Impact ◽

Special Relevance ◽

Generation Sequencing

In this digital and computing world, data formation and collection rate are growing very rapidly. With these improved proficiencies of data storage and fast computation along with the real-time distribution of data through the internet, the usual everyday ingestion of data is mounting exponentially. With the continuous advancement in data storage and accessibility of smart devices, the impact of big data will continue to develop. This chapter provides the fundamental concepts of big data, its benefits, probable pitfalls, big data analytics and its impact in Bioinformatics. With the generation of the deluge of biological data through next generation sequencing projects, there is a need to handle this data trough big data techniques. The chapter also presents a discussion of the tools for analytics, development of a novel data life cycle on big data, details of the problems and challenges connected with big data with special relevance to bioinformatics.

Download Full-text

Sustainability-Oriented Cost Management Tools and Big Data Analytics

10.1201/9781003090045-6 ◽

2021 ◽

pp. 105-130

Author(s):

Mohamed Abdelmounem Serag

Keyword(s):

Big Data ◽

Data Analytics ◽

Cost Management ◽

Big Data Analytics ◽

Management Tools

Download Full-text

Effects of Pros and Cons of Applying Big Data Analytics to Consumers’ Responses in an E-Commerce Context

Sustainability ◽

10.3390/su9050798 ◽

2017 ◽

Vol 9 (5) ◽

pp. 798 ◽

Cited By ~ 8

Author(s):

Thi Le ◽

Shu-Yi Liaw

Keyword(s):

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Pros And Cons

Download Full-text

Towards Big Data Analytics in the e-Learning Space

Cybernetics and Information Technologies ◽

10.2478/cait-2019-0023 ◽

2019 ◽

Vol 19 (3) ◽

pp. 16-24 ◽

Cited By ~ 2

Author(s):

Ivan P. Popchev ◽

Daniela A. Orozova

Keyword(s):

Big Data ◽

Data Storage ◽

Data Analytics ◽

Big Data Analytics ◽

Students At Risk ◽

Learning Space ◽

E Learning ◽

New Research ◽

Big Data Storage

Abstract The issues related to the analysis and management of Big Data, aspects of the security, stability and quality of the data, represent a new research, and engineering challenge. In the present paper, techniques for Big Data storage, search, analysis and management in the area of the virtual e-Learning space and the problems in front of them are considered. A numerical example for explorative analysis of data about the students from Burgas Free University is applied, using instrument for Data Mining of Orange. The analysis is a base for a system for localization of students at risk.

Download Full-text