9.1. Big data analytics

Audit and Financial Analysis ◽

10.38097/afa.2020.84.50.037 ◽

2020 ◽

Author(s):

В.Т. Чая ◽

Н.И. Чупахина

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analysis ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Digital Economy ◽

Big Data Technologies

В связи с развитием технологий цифровой экономики возрастает по экспоненте и объем оцифрованной информации. Но информация имеет ценность, только если она анализируется определенным образом. Большие же объемы информации привычными методами анализировать невозможно. Речь уже идет о больших данных и технологиях больших данных. В статье описаны особенности больших данных. Рассмотрены методы и инструменты анализа больших данных. Подробно рассматривается такой метод решения задач на основе больших данных, как машинное обучение. In connection with the development of digital economy technologies, the volume of digitized information is growing exponentially. But information has value only if it is analyzed in a certain way. It is impossible to analyze large amounts of information using the usual methods. We are already talking about big data and big data technologies. The article describes the features of big data. Methods and tools for big data analysis are considered. Such a method of solving problems based on big data as machine learning is considered in detail.

Download Full-text

Big Data Analysis for Trend Recognition Using Machine Learning Techniques

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327910666200304141238 ◽

2020 ◽

Vol 10 (4) ◽

pp. 540-550

Author(s):

Cerene Mariam Abraham ◽

Mannathazhathu Sudheep Elayidom ◽

Thankappan Santhanakrishnan

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Data Analysis ◽

Data Analytics ◽

Research Work ◽

Big Data Analytics ◽

Big Data Analysis ◽

Machine Learning Techniques ◽

Derivative Market

Background: Machine learning is one of the most popular research areas today. It relates closely to the field of data mining, which extracts information and trends from large datasets. Aims: The objective of this paper is to (a) illustrate big data analytics for the Indian derivative market and (b) identify trends in the data. Methods: Based on input from experts in the equity domain, the data are verified statistically using data mining techniques. Specifically, ten years of daily derivative data is used for training and testing purposes. The methods that are adopted for this research work include model generation using ARIMA, Hadoop framework which comprises mapping and reducing for big data analysis. Results: The results of this work are the observation of a trend that indicates the rise and fall of price in derivatives , generation of time-series similarity graph and plotting of frequency of temporal data. Conclusion: Big data analytics is an underexplored topic in the Indian derivative market and the results from this paper can be used by investors to earn both short-term and long-term benefits.

Download Full-text

BIG DATA ANALYTICS AND PRECISION ANIMAL AGRICULTURE SYMPOSIUM: Machine learning and data mining advance predictive big data analysis in precision animal agriculture1

Journal of Animal Science ◽

10.1093/jas/sky014 ◽

2018 ◽

Vol 96 (4) ◽

pp. 1540-1550 ◽

Cited By ~ 42

Author(s):

Gota Morota ◽

Ricardo V Ventura ◽

Fabyano F Silva ◽

Masanori Koyama ◽

Samodha C Fernando

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Data Analysis ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Animal Agriculture

Download Full-text

A multilevel approach to big data analysis using analytic tools and actor network theory

SA Journal of Information Management ◽

10.4102/sajim.v20i1.914 ◽

2018 ◽

Vol 20 (1) ◽

Cited By ~ 4

Author(s):

Tiko Iyamu

Keyword(s):

Big Data ◽

Data Analysis ◽

Network Theory ◽

Data Analytics ◽

Big Data Analytics ◽

Actor Network Theory ◽

Big Data Analysis ◽

Data Sets ◽

Multilevel Approach ◽

Actor Network

Background: Over the years, big data analytics has been statically carried out in a programmed way, which does not allow for translation of data sets from a subjective perspective. This approach affects an understanding of why and how data sets manifest themselves into various forms in the way that they do. This has a negative impact on the accuracy, redundancy and usefulness of data sets, which in turn affects the value of operations and the competitive effectiveness of an organisation. Also, the current single approach lacks a detailed examination of data sets, which big data deserve in order to improve purposefulness and usefulness.Objective: The purpose of this study was to propose a multilevel approach to big data analysis. This includes examining how a sociotechnical theory, the actor network theory (ANT), can be complementarily used with analytic tools for big data analysis.Method: In the study, the qualitative methods were employed from the interpretivist approach perspective.Results: From the findings, a framework that offers big data analytics at two levels, micro- (strategic) and macro- (operational) levels, was developed. Based on the framework, a model was developed, which can be used to guide the analysis of heterogeneous data sets that exist within networks.Conclusion: The multilevel approach ensures a fully detailed analysis, which is intended to increase accuracy, reduce redundancy and put the manipulation and manifestation of data sets into perspectives for improved organisations’ competitiveness.

Download Full-text

Techniques and Methods That Help to Make Big Data the Simplest Recipe for Success

Big Data Analytics for Entrepreneurial Success - Advances in Business Information Systems and Analytics ◽

10.4018/978-1-5225-7609-9.ch006 ◽

2019 ◽

pp. 161-194

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analysis ◽

Statistical Analysis ◽

Data Analytics ◽

Big Data Analysis ◽

Customer Segmentation ◽

Learning Context ◽

Feature Vectors

Data analytics has grown in a machine learning context. Whatever the reason data is used or exploited, customer segmentation or marketing targeting, it must be processed first and represented on feature vectors. Many algorithms, such as clustering, regression, classification, and others, need to be represented and clarified in order to facilitate processing and statistical analysis. If we have seen, through the previous chapters, the importance of big data analysis (the Why?), as with every major innovation, the biggest confusion lies in the exact scope (What?) and its implementation (How?). In this chapter, we will take a look at the different algorithms and techniques analytics that we can use in order to exploit the large amounts of data.

Download Full-text

Hidden big data analytics issues in the healthcare industry

Health Informatics Journal ◽

10.1177/1460458219854603 ◽

2019 ◽

Vol 26 (2) ◽

pp. 981-998 ◽

Cited By ~ 3

Author(s):

Kenneth David Strang ◽

Zhaohao Sun

Keyword(s):

Big Data ◽

Data Analysis ◽

Data Analytics ◽

Research Method ◽

Big Data Analytics ◽

Big Data Analysis ◽

Healthcare Industry ◽

Statistical Control ◽

Subject Matter Experts ◽

Journal Articles

The goal of the study was to identify big data analysis issues that can impact empirical research in the healthcare industry. To accomplish that the author analyzed big data related keywords from a literature review of peer reviewed journal articles published since 2011. Topics, methods and techniques were summarized along with strengths and weaknesses. A panel of subject matter experts was interviewed to validate the intermediate results and synthesize the key problems that would likely impact researchers conducting quantitative big data analysis in healthcare studies. The systems thinking action research method was applied to identify and describe the hidden issues. The findings were similar to the extant literature but three hidden fatal issues were detected. Methodical and statistical control solutions were proposed to overcome the three fatal healthcare big data analysis issues.

Download Full-text

Research of Big Data Processing Platform

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.484-485.922 ◽

2014 ◽

Vol 484-485 ◽

pp. 922-926

Author(s):

Xiang Ju Liu

Keyword(s):

Cloud Computing ◽

Big Data ◽

Data Analysis ◽

Network Architecture ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Computing Platform ◽

Operational Characteristics ◽

Analysis Platform

This paper introduces the operational characteristics of the era of big data and the current era of big data challenges, and exhaustive research and design of big data analytics platform based on cloud computing, including big data analytics platform architecture system, big data analytics platform software architecture , big data analytics platform network architecture big data analysis platform unified program features and so on. The paper also analyzes the cloud computing platform for big data analysis program unified competitive advantage and development of business telecom operators play a certain role in the future.

Download Full-text

Studying Students' Knowledge of the Benefits, Challenges, and Applications of Big Data Analytics In Healthcare

10.21203/rs.2.22540/v1 ◽

2020 ◽

Author(s):

Elham Nazari ◽

Maryam Edalati Khodabandeh ◽

Ali Dadashi ◽

Marjan Rasoulian ◽

hamed tabesh

Keyword(s):

Big Data ◽

Data Analysis ◽

Work Experience ◽

Medical Records ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Information Modeling ◽

Field Of Study ◽

Cross Sectional

Abstract Introdution Today, with the advent of technologies and the production of huge amounts of data, Big Data analytics has received much attention especially in healthcare. Understanding this field and recognizing its benefits, applications and challenges provide useful background for conducting efficient research. Therefore, the purpose of this study was to evaluate the students' familiarity from different universities of Mashhad with the benefits, applications and challenges of Big Data analysis.Method This is a cross-sectional study that was conducted on students of Medical Engineering, Medical Informatics, Medical Records and Health Information Management in Mashhad-Iran. A questionnaire was designed based on literature review in pubmed, google scholar, science direct and EMBASE databases, using Delphi method and presence of 10 experts from different fields of study. The designed questionnaire evaluated the opinion of students regarding benefits, challenges and applications of Big Data analytics. 200 students participated in the study and completed the designed questionnaire. Participants' opinions were evaluated descriptively and analytically. Result Most students were between 20 and 30 years old. 63% of them were male and 43.5% had no work experience. Current and previous field of study of most of the students were HIT, HIM, and Medical Records. Most of the participants in this study were undergraduates. 61.5% were economically active, 54.5% were exposed to Big Data. The mean scores of participants in benefits, applications, and challenges section were 3.71, 3.68, and 3.71, respectively, and process management was significant in different age groups (p=0.046), information, modeling, research, and health informatics across different fields of studies were significant (p=0.015, 0.033, 0.001, 0.024) Information and research were significantly different between groups (p=0.043 and 0.019), research in groups with / without economic activity was significant (p= 0.017) and information in exposure / non exposure to Big Data groups was significant (p=0.02). Conclusion Despite the importance and benefits of Big Data analytics, students' lack of familiarity with the necessity and importance of these analytics in industries and research is significant. The field of study and level of study do not appear to have an effect on the degree of knowledge of individuals regarding Big Data analysis. The design of technical training courses in this field may increase the level of knowledge of individuals regarding Big Data analysis.

Download Full-text

Comprehensive Contemplation of Probabilistic Aspects in Intelligent Analytics

International Journal of Service Science Management Engineering and Technology ◽

10.4018/ijssmet.2020010108 ◽

2020 ◽

Vol 11 (1) ◽

pp. 116-141 ◽

Cited By ~ 2

Author(s):

Neeti Sangwan ◽

Vishal Bhatnagar

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Big Data Analysis ◽

Future Research ◽

Learning Approaches ◽

Text Analytics ◽

Review Of Literature ◽

Classification Framework

In Big Data analysis, the application of machine learning has proven to be a revolutionary. The systematic review of literature shows that research has been carried out on the domain of big data analytics particularly text analytics with the inclusion of machine learning approaches. This extensive survey deals with the data at hand that provides different ways and issues while combining the machine learning approaches with the text. During the course of the survey, various publications in the field of synchronous application of machine learning in text analytics were searched and studied. Classification framework is proposed as the contribution of machine learning in text analytics. A classification framework represented the various application areas to motivate researchers for future research on the application of two emerging technologies.

Download Full-text

Big Data Classification and Internet of Things in Healthcare

International Journal of E-Health and Medical Communications ◽

10.4018/ijehmc.2020040102 ◽

2020 ◽

Vol 11 (2) ◽

pp. 20-37 ◽

Cited By ~ 1

Author(s):

Amine Rghioui ◽

Jaime Lloret ◽

Abedlmajid Oumnad

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analysis ◽

Big Data Analytics ◽

Data Classification ◽

Big Data Analysis ◽

Machine Intelligence ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Daunting Task

Every single day, a massive amount of data is generated by different medical data sources. Processing this wealth of data is indeed a daunting task, and it forces us to adopt smart and scalable computational strategies, including machine intelligence, big data analytics, and data classification. The authors can use the Big Data analysis for effective decision making in healthcare domain using the existing machine learning algorithms with some modification to it. The fundamental purpose of this article is to summarize the role of Big Data analysis in healthcare, and to provide a comprehensive analysis of the various techniques involved in mining big data. This article provides an overview of Big Data, applicability of it in healthcare, some of the work in progress and a future works. Therefore, in this article, the use of machine learning techniques is proposed for real-time diabetic patient data analysis from IoT devices and gateways.

Download Full-text

Implementasi Big Data Analytical Untuk Perguruan Tinggi Menggunakan Machine Learning

Journal of Informatic and Information Security ◽

10.31599/jiforty.v2i1.633 ◽

2021 ◽

Vol 2 (1) ◽

pp. 77-88

Author(s):

Rakhmat Purnomo ◽

Wowon Priatna ◽

Tri Dharma Putra

Keyword(s):

Higher Education ◽

Machine Learning ◽

Decision Making ◽

Big Data ◽

Data Analysis ◽

Student Performance ◽

Big Data Analytics ◽

Big Data Analysis ◽

Process Data ◽

Grouping Students

The dynamics of higher education are changing and emphasize the need to adapt quickly. Higher education is under the supervision of accreditation agencies, governments and other stakeholders to seek new ways to improve and monitor student success and other institutional policies. Many agencies fail to make efficient use of the large amounts of available data. With the use of big data analytics in higher education, it can be obtained more insight into students, academics, and the process in higher education so that it supports predictive analysis and improves decision making. The purpose of this research is to implement big data analytical to increase the decision making of the competent party. This research begins with the identification of process data based on analytical learning, academic and process in the campus environment. The data used in this study is a public dataset from UCI machine learning, from the 33 available varibales, 4 varibales are used to measure student performance. Big data analysis in this study uses spark apace as a library to operate pyspark so that python can process big data analysis. The data already in the master slave is grouped using k-mean clustering to get the best performing student group. The results of this study succeeded in grouping students into 5 clusters, cluster 1 including the best student performance and cluster 5 including the lowest student performance

Download Full-text