A Study on MapReduce: Challenges and Trends

Sachin Arun Thanekar; K. Subrahmanyam; A. B. Bagwan

doi:10.11591/ijeecs.v4.i1.pp176-183

Big Data and MapReduce Challenges, Opportunities and Trends

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v6i6.pp2911-2919 ◽

2016 ◽

Vol 6 (6) ◽

pp. 2911

Author(s):

Sachin Arun Thanekar ◽

K. Subrahmanyam ◽

A. B. Bagwan

Keyword(s):

Big Data ◽

High Velocity ◽

Structured Data ◽

Unstructured Data ◽

Map Reduce ◽

Huge Amount ◽

Commodity Hardware ◽

Survey Paper ◽

Recent Trends ◽

Large Clusters

We are all surrounded by big data today. The term ‘Big Data’ denotes huge volume, high velocity, variety and veracity (that is, uncertainty of data), which have given rise to new difficulties and challenges. Big data may be structured, semi-structured or unstructured. Existing databases and systems have great difficulty processing, analyzing, storing and managing such data. The Big Data challenges are protection, curation, capture, analysis, searching, visualization, storage, transfer and sharing. MapReduce is a framework for writing applications that process huge amounts of data in parallel, reliably, on large clusters of commodity hardware. Researchers have put considerable effort into making it simple, easy, effective and efficient. In this survey paper we emphasize the working of MapReduce, its challenges, opportunities and recent trends, so that researchers can consider further improvements.
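As a rough illustration of the programming model the abstract describes, the following is a minimal single-process sketch of the map, shuffle and reduce steps applied to word counting. It is not taken from the paper: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and a real framework such as Hadoop would run the map and reduce calls on different machines and handle failures, which this sketch does not attempt.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values collected for one key into one result."""
    return key, sum(values)


def word_count(documents):
    """Run map over every split, shuffle by key, then reduce each group."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    splits = ["big data needs big clusters", "map reduce handles big data"]
    print(word_count(splits))
```

Because each call to `map_phase` depends only on its own split, and each call to `reduce_phase` only on its own key, both steps could in principle be distributed across a cluster, which is the property the abstract refers to as processing "in parallel".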

Big Data and MapReduce Challenges, Opportunities and Trends

International Journal of Electrical and Computer Engineering (IJECE) ◽

10.11591/ijece.v6i6.10555 ◽

2016 ◽

Vol 6 (6) ◽

pp. 2911

Author(s):

Sachin Arun Thanekar ◽

K. Subrahmanyam ◽

A. B. Bagwan

Keyword(s):

Big Data ◽

High Velocity ◽

Structured Data ◽

Unstructured Data ◽

Map Reduce ◽

Huge Amount ◽

Commodity Hardware ◽

Survey Paper ◽

Recent Trends ◽

Large Clusters

We are all surrounded by big data today. The term ‘Big Data’ denotes huge volume, high velocity, variety and veracity (that is, uncertainty of data), which have given rise to new difficulties and challenges. Big data may be structured, semi-structured or unstructured. Existing databases and systems have great difficulty processing, analyzing, storing and managing such data. The Big Data challenges are protection, curation, capture, analysis, searching, visualization, storage, transfer and sharing. MapReduce is a framework for writing applications that process huge amounts of data in parallel, reliably, on large clusters of commodity hardware. Researchers have put considerable effort into making it simple, easy, effective and efficient. In this survey paper we emphasize the working of MapReduce, its challenges, opportunities and recent trends, so that researchers can consider further improvements.

Applications of Big Data in the Digital India: Opportunities and Challenges

IRA-International Journal of Technology & Engineering (ISSN 2455-4480) ◽

10.21013/jte.v3.n3.p7 ◽

2016 ◽

Vol 3 (3) ◽

Author(s):

Vinay Kumar ◽

Arpana Chaturvedi

Keyword(s):

Big Data ◽

Social Networking ◽

Exponential Growth ◽

Social Networking Sites ◽

Unstructured Data ◽

Threat Perception ◽

Data Repository ◽

Huge Amount ◽

Real Challenge ◽

Area Of Application

With the advent of Social Networking Sites (SNS), large volumes of data are generated daily. Most of these data are multimedia and unstructured, and they are growing exponentially. This exponential growth in the variety, volume and complexity of structured and unstructured data leads to the concept of big data. Managing big data and harnessing its benefits is a real challenge. As access to big data repositories increases across applications, security and access control become another aspect to consider when managing big data. We discuss the areas of application of big data, the opportunities it provides, and the challenges faced in managing such huge amounts of data for various applications. Issues related to securing big data against different perceived threats are also discussed.

Insight Into Big Data Analytics

Advances in Systems Analysis, Software Engineering, and High Performance Computing - Handbook of Research on Pattern Engineering System Development for Big Data Analytics ◽

10.4018/978-1-5225-3870-7.ch005 ◽

2018 ◽

pp. 67-79

Author(s):

Mohd Vasim Ahamad ◽

Misbahul Haque ◽

Mohd Imran

Keyword(s):

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Healthcare Services ◽

Huge Amount ◽

Business Decision ◽

Digital Era ◽

Data Formats ◽

Recent Trends ◽

Analytical Approaches

In the present digital era, more data are generated and collected than ever before. But, this huge amount of data is of no use until it is converted into some useful information. This huge amount of data, coming from a number of sources in various data formats and having more complexity, is called big data. To convert the big data into meaningful information, the authors use different analytical approaches. Information extracted, after applying big data analytics methods over big data, can be used in business decision making, fraud detection, healthcare services, education sector, machine learning, extreme personalization, etc. This chapter presents the basics of big data and big data analytics. Big data analysts face many challenges in storing, managing, and analyzing big data. This chapter provides details of challenges in all mentioned dimensions. Furthermore, recent trends of big data analytics and future directions for big data researchers are also described.

The Impact of Big Data on Security

Big Data ◽

10.4018/978-1-4666-9840-6.ch068 ◽

2016 ◽

pp. 1495-1518

Author(s):

Mohammad Alaa Hussain Al-Hamami

Keyword(s):

Social Media ◽

Big Data ◽

Management System ◽

Database Management ◽

Database Systems ◽

Structured Data ◽

Database Management System ◽

Unstructured Data ◽

And Behavior ◽

The Impact

Big Data is comprised systems, to remain competitive by techniques emerging due to Big Data. Big Data includes structured, semi-structured and unstructured data. Structured data are data formatted for use in a database management system. Semi-structured and unstructured data include all types of unformatted data, including multimedia and social media content. Among practitioners and applied researchers, the reaction to data available through blogs, Twitter, Facebook, or other social media can be described as a “data rush” promising new insights about consumers' choices and behavior and many other issues. In the past, Big Data was used only by very large organizations, governments and large enterprises able to build their own infrastructure for hosting and mining large amounts of data. This chapter presents the requirements for protecting Big Data environments using the same rigorous security strategies applied to traditional database systems.

Privacy Preserving Data Mining on Unstructured Data

Privacy and Security Policies in Big Data - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-5225-2486-1.ch008 ◽

2017 ◽

pp. 167-190

Author(s):

Trupti Vishwambhar Kenekar ◽

Ajay R. Dani

Keyword(s):

Data Mining ◽

Big Data ◽

Structure Data ◽

Data Privacy ◽

Differential Privacy ◽

Unstructured Data ◽

Map Reduce ◽

Individual Data ◽

Data Set ◽

Privacy Preserving Data Mining

As Big Data is a collection of structured, unstructured and semi-structured data gathered from various sources, it is important to mine it while protecting the privacy of individual data. Differential privacy is one of the best measures, providing a strong privacy guarantee. The chapter proposes differentially private frequent itemset mining using MapReduce, which requires less time for privately mining large datasets. The chapter discusses the problem of preserving data privacy, the challenges of preserving it in a big data environment, and data privacy techniques and their applications to unstructured data. An analysis of experimental results on structured and unstructured data sets is also presented.
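To make the idea of differentially private counting in a map/reduce style more concrete, here is a small sketch under stated assumptions. It is not the chapter's algorithm: it is simplified to single items (1-itemsets), it uses the standard Laplace mechanism, and every name in it (`laplace_noise`, `map_counts`, `private_frequent_items`, `universe`, `max_items`) is hypothetical. It assumes a fixed, publicly known item universe passed as a set, and it caps each transaction at `max_items` items so that adding or removing one transaction changes the count vector by at most `max_items` in L1 norm.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two i.i.d. exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def map_counts(transactions, universe, max_items):
    """Map: count items in one partition, capping each transaction's contribution."""
    counts = Counter()
    for transaction in transactions:
        # Keep at most max_items distinct items that belong to the public universe.
        kept = sorted(set(transaction) & universe)[:max_items]
        counts.update(kept)
    return counts


def private_frequent_items(partitions, universe, max_items, epsilon,
                           threshold, seed=None):
    """Reduce: sum partial counts, add Laplace noise, keep items above threshold."""
    rng = random.Random(seed)
    total = Counter()
    for partition in partitions:
        total.update(map_counts(partition, universe, max_items))
    # L1 sensitivity is max_items, so the noise scale is max_items / epsilon.
    scale = max_items / epsilon
    # Noise is added for every item in the universe, including zero counts.
    noisy = {item: total[item] + laplace_noise(scale, rng)
             for item in sorted(universe)}
    return {item: count for item, count in noisy.items() if count >= threshold}


if __name__ == "__main__":
    parts = [[["a", "b"], ["a"]], [["a", "c"], ["b"]]]
    print(private_frequent_items(parts, {"a", "b", "c"}, max_items=2,
                                 epsilon=1.0, threshold=1.5, seed=0))
```

A smaller `epsilon` gives a stronger privacy guarantee but noisier counts, so the set of reported items becomes less reliable. Extending the sketch to larger itemsets would require a separate sensitivity analysis, which is outside what this example covers.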

A Survey on Big Data Analytics Using HADOOP

Asian Journal of Computer Science and Technology ◽

10.51983/ajcst-2019.8.s3.2091 ◽

2019 ◽

Vol 8 (S3) ◽

pp. 35-40

Author(s):

S. Mamatha ◽

T. Sudha

Keyword(s):

Big Data ◽

Social Networking Sites ◽

Data Analytics ◽

Business Processes ◽

Big Data Analytics ◽

Large Data ◽

Structured Data ◽

Map Reduce ◽

Data Set ◽

Digital World

In this digital world, as organizations evolve rapidly around data-centric assets, the volume of data and the size of databases have been growing exponentially. Data is generated from different sources such as business processes, transactions, social networking sites and web servers, and exists in structured as well as unstructured form. The term “Big data” is used for large data sets whose size is beyond the ability of commonly used software tools to capture, manage, and process within a tolerable elapsed time. Big data sizes range from a few dozen terabytes to many petabytes in a single data set. Difficulties include capture, storage, search, sharing, analytics and visualization. Big data is available in structured, unstructured and semi-structured formats. Relational databases fail to store this multi-structured data. Apache Hadoop is an efficient, robust, reliable and scalable framework to store, process, transform and extract big data. The Hadoop framework is open-source, free software available from the Apache Software Foundation. In this paper we present Hadoop, HDFS, Map Reduce and a c-means big data algorithm to minimize the effort of big data analysis using Map Reduce code. The objective of this paper is to summarize the state-of-the-art efforts in clinical big data analytics and highlight what might be needed to enhance the outcomes of clinical big data analytics tools and related fields.

Big Data Mining using Map Reduce: A Survey Paper

IOSR Journal of Computer Engineering ◽

10.9790/0661-16673740 ◽

2014 ◽

Vol 16 (6) ◽

pp. 37-40 ◽

Cited By ~ 2

Author(s):

Shital Suryawanshi ◽

Prof. V.S Wadne

Keyword(s):

Data Mining ◽

Big Data ◽

Map Reduce ◽

Survey Paper ◽

Big Data Mining

The Rise of Big Data, Cloud, and Internet of Things

Critical Research on Scalability and Security Issues in Virtual Cloud Environments - Advances in Information Security, Privacy, and Ethics ◽

10.4018/978-1-5225-3029-9.ch010 ◽

2018 ◽

pp. 201-222 ◽

Cited By ~ 1

Author(s):

Reema Abdulraziq ◽

Muneer Bani Yassein ◽

Shadi Aljawarneh

Keyword(s):

Cloud Computing ◽

Big Data ◽

Internet Of Things ◽

End Users ◽

Structured Data ◽

The Internet ◽

Huge Amount ◽

Volume Velocity ◽

The Cost ◽

The Internet Of Things

Big data refers to the huge amount of data that is being used in commercial, industrial and economic environments. There are three types of big data: structured, unstructured and semi-structured. In discussions of big data, the three major aspects considered its main dimensions are the volume, velocity, and variety of the data. This data is collected, analysed and checked for use by end users. Cloud computing and the Internet of Things (IoT) enable this huge amount of collected data to be stored and connected to the Internet. These technologies reduce time and cost and can accommodate this large amount of data regardless of its size. This chapter focuses on how big data, with the emergence of cloud computing and the Internet of Things (IoT), can be used via several applications and technologies.

Security Issues and Challenges Related to Big Data

Big Data Management and the Internet of Things for Improved Health Systems - Advances in Healthcare Information Systems and Administration ◽

10.4018/978-1-5225-5222-2.ch006 ◽

2018 ◽

pp. 86-101

Author(s):

Jaimin N. Undavia ◽

Atul Patel ◽

Sheenal Patel

Keyword(s):

Big Data ◽

Data Analysis ◽

Database Systems ◽

Heterogeneous Data ◽

Unstructured Data ◽

Huge Amount ◽

Current Time ◽

Security Issues ◽

Future Prediction ◽

Data Term

The availability of huge amounts of data has opened up a new area, and a new challenge, in analyzing these data. Such analysis has become essential for every organization and may yield useful information about its future prospects. Traditional database systems are neither adequate for nor capable of storing, managing and analyzing such huge amounts of data, so a new term has been introduced: “Big Data”. This term refers to huge amounts of data used for analysis and for future prediction or forecasting. Big Data may consist of a combination of structured, semi-structured or unstructured data, and managing such data is currently a big challenge. Such heterogeneous data must be maintained in a very secure and specific way. In this chapter, we try to identify these challenges and issues and to resolve them with specific tools.
