Big Data Tools—Hadoop Ecosystem, Spark and NoSQL Databases

Benchmarking Big Data OLAP NoSQL Databases

Ubiquitous Networking - Lecture Notes in Computer Science ◽

10.1007/978-3-030-02849-7_8 ◽

2018 ◽

pp. 82-94 ◽

Cited By ~ 2

Author(s):

Mohammed El Malki ◽

Arlind Kopliku ◽

Essaid Sabir ◽

Olivier Teste

Keyword(s):

Big Data ◽

Nosql Databases

Download Full-text

Formalizing the Mapping of UML Conceptual Schemas to Column-Oriented Databases

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2018070103 ◽

2018 ◽

Vol 14 (3) ◽

pp. 44-68 ◽

Cited By ~ 1

Author(s):

Fatma Abdelhedi ◽

Amal Ait Brahim ◽

Gilles Zurfluh

Keyword(s):

Big Data ◽

Data Warehouse ◽

Relational Databases ◽

Traditional Approach ◽

Physical Models ◽

Decision Making Process ◽

Nosql Databases ◽

Care Field ◽

Sufficient Degree

Nowadays, most organizations need to improve their decision-making process using Big Data. To achieve this, they have to store Big Data, perform an analysis, and transform the results into useful and valuable information. To perform this, it's necessary to deal with new challenges in designing and creating data warehouse. Traditionally, creating a data warehouse followed well-governed process based on relational databases. The influence of Big Data challenged this traditional approach primarily due to the changing nature of data. As a result, using NoSQL databases has become a necessity to handle Big Data challenges. In this article, the authors show how to create a data warehouse on NoSQL systems. They propose the Object2NoSQL process that generates column-oriented physical models starting from a UML conceptual model. To ensure efficient automatic transformation, they propose a logical model that exhibits a sufficient degree of independence so as to enable its mapping to one or more column-oriented platforms. The authors provide experiments of their approach using a case study in the health care field.

Download Full-text

Big Data Management Performance Evaluation in Hadoop Ecosystem

2017 3rd International Conference on Big Data Computing and Communications (BIGCOM) ◽

10.1109/bigcom.2017.26 ◽

2017 ◽

Cited By ~ 1

Author(s):

Qing Liu ◽

Yinjin Fu ◽

Guiqiang Ni ◽

Jianmin Mei

Keyword(s):

Big Data ◽

Performance Evaluation ◽

Data Management ◽

Management Performance ◽

Hadoop Ecosystem

Download Full-text

NoSQL Databases

Advances in Data Mining and Database Management - Handbook of Research on Cloud Infrastructures for Big Data Analytics ◽

10.4018/978-1-4666-5864-6.ch008 ◽

2014 ◽

pp. 186-215 ◽

Cited By ~ 2

Author(s):

Ganesh Chandra Deka

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Processing ◽

Open Source ◽

Data Storage ◽

Big Data Processing ◽

Nosql Databases ◽

Data Intensive ◽

Huge Data ◽

Data Intensive Applications

NoSQL databases are designed to meet the huge data storage requirements of cloud computing and big data processing. NoSQL databases have lots of advanced features in addition to the conventional RDBMS features. Hence, the “NoSQL” databases are popularly known as “Not only SQL” databases. A variety of NoSQL databases having different features to deal with exponentially growing data-intensive applications are available with open source and proprietary option. This chapter discusses some of the popular NoSQL databases and their features on the light of CAP theorem.

Download Full-text

Big Data Analytics

Handbook of Research on Cloud Computing and Big Data Applications in IoT - Advances in Computer and Electrical Engineering ◽

10.4018/978-1-5225-8407-0.ch004 ◽

2019 ◽

pp. 67-81

Author(s):

Nitigya Sambyal ◽

Poonam Saini ◽

Rupali Syal

Keyword(s):

Big Data ◽

Open Problem ◽

Data Analytics ◽

Big Data Analytics ◽

Data Capture ◽

Data Sets ◽

Full Potential ◽

Processing Application ◽

Nosql Databases ◽

The World

The world is increasingly driven by huge amounts of data. Big data refers to data sets that are so large or complex that traditional data processing application software are inadequate to deal with them. Healthcare analytics is a prominent area of big data analytics. It has led to significant reduction in morbidity and mortality associated with a disease. In order to harness full potential of big data, various tools like Apache Sentry, BigQuery, NoSQL databases, Hadoop, JethroData, etc. are available for its processing. However, with such enormous amounts of information comes the complexity of data management, other big data challenges occur during data capture, storage, analysis, search, transfer, information privacy, visualization, querying, and update. The chapter focuses on understanding the meaning and concept of big data, analytics of big data, its role in healthcare, various application areas, trends and tools used to process big data along with open problem challenges.

Download Full-text

Secure search for encrypted personal health records from big data NoSQL databases in cloud

Computing ◽

10.1007/s00607-019-00762-z ◽

2019 ◽

Vol 102 (6) ◽

pp. 1521-1545 ◽

Cited By ~ 1

Author(s):

Lanxiang Chen ◽

Nan Zhang ◽

Hung-Min Sun ◽

Chin-Chen Chang ◽

Shui Yu ◽

...

Keyword(s):

Big Data ◽

Personal Health Records ◽

Personal Health ◽

Health Records ◽

Nosql Databases ◽

Secure Search

Download Full-text

Big Data Migration and Sentiment Analysis of Real Time Events Using Hadoop Ecosystem

International Conference on Intelligent Data Communication Technologies and Internet of Things (ICICI) 2018 - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-3-030-03146-6_87 ◽

2018 ◽

pp. 764-770

Author(s):

R. Chandana ◽

D. Harshitha ◽

Meenakshi ◽

A. C. Ramachandra

Keyword(s):

Big Data ◽

Sentiment Analysis ◽

Real Time ◽

Data Migration ◽

Hadoop Ecosystem

Download Full-text

Sentiment Analysis of Twitter Data Using Big Data Tools and Hadoop Ecosystem

Proceedings of the International Conference on ISMAC in Computational Vision and Bio-Engineering 2018 (ISMAC-CVB) - Lecture Notes in Computational Vision and Biomechanics ◽

10.1007/978-3-030-00665-5_83 ◽

2019 ◽

pp. 857-863

Author(s):

Monica Malik ◽

Sameena Naaz ◽

Iffat Rehman Ansari

Keyword(s):

Big Data ◽

Sentiment Analysis ◽

Twitter Data ◽

Hadoop Ecosystem

Download Full-text

When Relational-Based Applications Go to NoSQL Databases: A Survey

Information ◽

10.3390/info10070241 ◽

2019 ◽

Vol 10 (7) ◽

pp. 241

Author(s):

Geomar A. Schreiner ◽

Denio Duarte ◽

Ronaldo dos S. Melo

Keyword(s):

Big Data ◽

Comparative Analysis ◽

Relational Databases ◽

Research Area ◽

Relational Data ◽

Data Sets ◽

System Architectures ◽

Nosql Databases ◽

State Of Art ◽

Open Issues

Several data-centric applications today produce and manipulate a large volume of data, the so-called Big Data. Traditional databases, in particular, relational databases, are not suitable for Big Data management. As a consequence, some approaches that allow the definition and manipulation of large relational data sets stored in NoSQL databases through an SQL interface have been proposed, focusing on scalability and availability. This paper presents a comparative analysis of these approaches based on an architectural classification that organizes them according to their system architectures. Our motivation is that wrapping is a relevant strategy for relational-based applications that intend to move relational data to NoSQL databases (usually maintained in the cloud). We also claim that this research area has some open issues, given that most approaches deal with only a subset of SQL operations or give support to specific target NoSQL databases. Our intention with this survey is, therefore, to contribute to the state-of-art in this research area and also provide a basis for choosing or even designing a relational-to-NoSQL data wrapping solution.

Download Full-text

A survey of open source tools for machine learning with big data in the Hadoop ecosystem

Journal Of Big Data ◽

10.1186/s40537-015-0032-1 ◽

2015 ◽

Vol 2 (1) ◽

Cited By ~ 179

Author(s):

Sara Landset ◽

Taghi M. Khoshgoftaar ◽

Aaron N. Richter ◽

Tawfiq Hasanin

Keyword(s):

Machine Learning ◽

Big Data ◽

Open Source ◽

Hadoop Ecosystem

Download Full-text