Big Data Analytics

Author(s):  
Nitigya Sambyal ◽  
Poonam Saini ◽  
Rupali Syal

The world is increasingly driven by huge amounts of data. Big data refers to data sets that are so large or complex that traditional data processing application software are inadequate to deal with them. Healthcare analytics is a prominent area of big data analytics. It has led to significant reduction in morbidity and mortality associated with a disease. In order to harness full potential of big data, various tools like Apache Sentry, BigQuery, NoSQL databases, Hadoop, JethroData, etc. are available for its processing. However, with such enormous amounts of information comes the complexity of data management, other big data challenges occur during data capture, storage, analysis, search, transfer, information privacy, visualization, querying, and update. The chapter focuses on understanding the meaning and concept of big data, analytics of big data, its role in healthcare, various application areas, trends and tools used to process big data along with open problem challenges.

Author(s):  
Wajid Ali ◽  
Muhammad Usman Shafique ◽  
Muhammad Arslan Majeed ◽  
Ali Raza

A key ingredient in the world of cloud computing is a database that can be used by a great number of users. Distributed storage mechanisms become the de-facto method for data storage used by companies for the new generation of web applications. In the world of data storage, NoSQL (usually interpreted as "not only SQL" by developers) database is a growing trend. It is said that NoSQL alternates with the most widely used relational databases for the data storage, but, as the name implies, it does not fully replace the SQL. In this paper we will discuss about SQL and NoSQL databases, comparison of traditional SQL with NoSQL databases for Big Data analytics, NoSQL data models, types of NoSQL data stores, characteristics and features of each data store, advantages and disadvantages of NoSQL and RDBMS.


Author(s):  
Mohammad Abu Kausar ◽  
Mohammad Nasar

Background: Nowadays, the digital world is rising rapidly and becoming very difficult in nature's quantity, diversity, and speed. Recently, there have been two major changes in data management, which are NoSQL databases and Big Data Analytics. While evolving with the diverse reasons, their independent growths balance each other and their convergence would greatly benefit organization to make decisions on-time with the amount of multifaceted data sets that might be semi structured, structured, and unstructured. Though several software solutions have come out to support Big Data analytics on the one hand, on the other hand, there have been several packages of NoSQL database available in the market. Methods: The main goal of this article is to give comprehension of their perspective and a complete study to associate the future of the emerging several important NoSQL data models. Results: Evaluating NoSQL databases for Big Data analytics with traditional SQL performance shows that NoSQL database is a superior alternative for industry condition need high-performance analytics, adaptability, simplicity, and distributed large data scalability. Conclusion: This paper conclude with industry's current adoption status of NoSQL databases.


2018 ◽  
Vol 20 (1) ◽  
Author(s):  
Tiko Iyamu

Background: Over the years, big data analytics has been statically carried out in a programmed way, which does not allow for translation of data sets from a subjective perspective. This approach affects an understanding of why and how data sets manifest themselves into various forms in the way that they do. This has a negative impact on the accuracy, redundancy and usefulness of data sets, which in turn affects the value of operations and the competitive effectiveness of an organisation. Also, the current single approach lacks a detailed examination of data sets, which big data deserve in order to improve purposefulness and usefulness.Objective: The purpose of this study was to propose a multilevel approach to big data analysis. This includes examining how a sociotechnical theory, the actor network theory (ANT), can be complementarily used with analytic tools for big data analysis.Method: In the study, the qualitative methods were employed from the interpretivist approach perspective.Results: From the findings, a framework that offers big data analytics at two levels, micro- (strategic) and macro- (operational) levels, was developed. Based on the framework, a model was developed, which can be used to guide the analysis of heterogeneous data sets that exist within networks.Conclusion: The multilevel approach ensures a fully detailed analysis, which is intended to increase accuracy, reduce redundancy and put the manipulation and manifestation of data sets into perspectives for improved organisations’ competitiveness.


Author(s):  
Nirmit Singhal ◽  
Amita Goel, ◽  
Nidhi Sengar ◽  
Vasudha Bahl

The world generated 52 times the amount of data in 2010 and 76 times the number of information sources in 2022. The ability to use this data creates enormous opportunities, and in order to make these opportunities a reality, people must use data to solve problems. Unfortunately, in the midst of a global pandemic, when people all over the world seek reliable, trustworthy information about COVID-19 (Coronavirus). Tableau plays a key role in this scenario because it is an extremely powerful tool for quickly visualizing large amounts of data. It has a simple drag-and-drop interface. Beautiful infographics are simple to create and take little time. Tableau works with a wide variety of data sources. COVID-19 (Coronavirus)analytics with Tableau will allow you to create dashboards that will assist you. Tableau is a tool that deals with big data analytics and generates output in a visualization technique, making it more understandable and presentable. Data blending, real-time reporting, and data collaboration are one of its features. Ultimately, this paper provides a clear picture of the growing COVID19 (Coronavirus) data and the tools that can assist more effectively, accurately, and efficiently. Keywords: Data Visualization, Tableau, Data Analysis, Covid-19 analysis, Covid-19 data


Author(s):  
Pethuru Raj

The implications of the digitization process among a bevy of trends are definitely many and memorable. One is the abnormal growth in data generation, gathering, and storage due to a steady increase in the number of data sources, structures, scopes, sizes, and speeds. In this chapter, the author shows some of the impactful developments brewing in the IT space, how the tremendous amount of data getting produced and processed all over the world impacts the IT and business domains, how next-generation IT infrastructures are accordingly getting refactored, remedied, and readied for the impending big data-induced challenges, how likely the move of the big data analytics discipline towards fulfilling the digital universe requirements of extracting and extrapolating actionable insights for the knowledge-parched is, and finally, the establishment and sustenance of the dreamt smarter planet.


Web Services ◽  
2019 ◽  
pp. 1430-1443
Author(s):  
Louise Leenen ◽  
Thomas Meyer

The Governments, military forces and other organisations responsible for cybersecurity deal with vast amounts of data that has to be understood in order to lead to intelligent decision making. Due to the vast amounts of information pertinent to cybersecurity, automation is required for processing and decision making, specifically to present advance warning of possible threats. The ability to detect patterns in vast data sets, and being able to understanding the significance of detected patterns are essential in the cyber defence domain. Big data technologies supported by semantic technologies can improve cybersecurity, and thus cyber defence by providing support for the processing and understanding of the huge amounts of information in the cyber environment. The term big data analytics refers to advanced analytic techniques such as machine learning, predictive analysis, and other intelligent processing techniques applied to large data sets that contain different data types. The purpose is to detect patterns, correlations, trends and other useful information. Semantic technologies is a knowledge representation paradigm where the meaning of data is encoded separately from the data itself. The use of semantic technologies such as logic-based systems to support decision making is becoming increasingly popular. However, most automated systems are currently based on syntactic rules. These rules are generally not sophisticated enough to deal with the complexity of decisions required to be made. The incorporation of semantic information allows for increased understanding and sophistication in cyber defence systems. This paper argues that both big data analytics and semantic technologies are necessary to provide counter measures against cyber threats. An overview of the use of semantic technologies and big data technologies in cyber defence is provided, and important areas for future research in the combined domains are discussed.


Author(s):  
Abid Ali ◽  
Nursyarizal Mohd Nor ◽  
Taib Ibrahim ◽  
Mohd Fakhizan Romlie ◽  
Kishore Bingi

This chapter proposes Big Data Analytics for the sizing and locating of solar photovoltaic farms to reduce the total energy loss in distribution networks. The Big Data Analytics, which uses the advance statistical and computational tools for the handling of large data sets, has been adopted for modeling the 15 years of solar weather data. Total Power Loss Index (TPLI) is formulated as the main objective function for the optimization problem and meanwhile bus voltage deviations and penetrations of the PV farms are calculated. To solve the optimization problem, this study adopts the Mixed Integer Optimization using Genetic Algorithm (MIOGA) technique. By considering different time varying voltage dependent load models, the proposed algorithm is applied on IEEE 33 bus and IEEE 69 bus test distribution networks and optimum results are acquired. From the results, it is revealed that compared to single PV farm, the integration of two PV farms reduced more energy loss and reduced the total size of PV farms. Big Data Analytics is found very effective for the storing, handling, processing and the visualizing of the weather Big Data.


2020 ◽  
pp. 1839-1857
Author(s):  
Mamata Rath

Currently, there is an expanding interest for additional medical data from patients about their healthcare choices and related decisions, and they further need investment in their basic health issues. Big data provides patients presumptuous data to help them settle on the best choice and align with their medicinal treatment plan. One of the very advanced concepts related to the synthesis of big data sets to reveal the hidden pattern in them is big data analytics. It involves demanding techniques to mine and extract relevant data that includes the actions of piercing a database, effectively mine the data, query and inspect the data and is committed to enhance the technical execution of various task segments. The capacity to synthesize a lot of data can enable an association to manage data that can influence the business. In this way, the primary goal of big data analytics is to help business relationships to have enhanced comprehension of data, and subsequently, settle on proficient and very much educated decisions. Big data analytics empowers data diggers and researchers to examine an extensive volume of data that may not be outfit utilizing customary apparatuses. Big data analytics require advances and statistical instruments that can change a lot of organized, unstructured, and semi-organized data into more reasonable data and metadata designed for explanatory procedures. There is tremendous positive potential concerning the application of big data in human health care services and many related major applications are still in their developmental stages. The deployment of big data in health service demonstrates enhancing health care results and controlling the expenses of common people due to treatment, as proven by some developing use cases. Keeping in view such powerful processing capacity of big data analytics in various technical fields of modern civilization related to health care, the current research article presents a comprehensive study and investigation on big data analytics and its application in multiple sectors of society with significance in health care applications.


2022 ◽  
pp. 67-76
Author(s):  
Dineshkumar Bhagwandas Vaghela

The term big data has come due to rapid generation of data in various organizations. In big data, the big is the buzzword. Here the data are so large and complex that the traditional database applications are not able to process (i.e., they are inadequate to deal with such volume of data). Usually the big data are described by 5Vs (volume, velocity, variety, variability, veracity). The big data can be structured, semi-structured, or unstructured. Big data analytics is the process to uncover hidden patterns, unknown correlations, predict the future values from large and complex data sets. In this chapter, the following topics will be covered more in detail. History of big data and business analytics, big data analytics technologies and tools, and big data analytics uses and challenges.


Sign in / Sign up

Export Citation Format

Share Document