Predicting Survival on Titanic by Applying Exploratory Data Analytics and Machine Learning Techniques

Data analytics is the essential component in deriving insights from data obtained from multiple sources. It represents the technology, methods and techniques used to obtain insights from massive datasets. As data increases, companies are looking for ways to gain relevant business insights underneath layers of data and information, to help them better understand new business ventures, opportunities, business trends and complex challenges. However, to date, while the extensive benefits of business data analytics to large organizations are widely published, micro, small, and medium sized organisations have not fully grasped the potential benefits to be gained from data analytics using machine learning techniques. This study is guided by the research question of how data analytics using machine learning techniques can benefit small businesses. Using the case study method, this paper outlines how small businesses in two different industries i.e. healthcare and retail can leverage data analytics and machine learning techniques to gain competitive advantage from the data. Details on the respective benefits gained by the small business owners featured in the two case studies provide important answers to the research question.

Download Full-text

Application Of Machine Learning Techniques, Big Data Analytics In Health Care Sector – A Literature Survey

2018 2nd International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), 2018 2nd International Conference on ◽

10.1109/i-smac.2018.8653654 ◽

2018 ◽

Author(s):

M. Sughasiny ◽

J. Rajeshwari

Keyword(s):

Machine Learning ◽

Health Care ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Health Care Sector ◽

Literature Survey ◽

Machine Learning Techniques ◽

Learning Techniques

Download Full-text

Big Mac: A Distributed PaaS Framework for on Demand Big Data Processing Using Machine Learning Techniques

Journal of Computational and Theoretical Nanoscience ◽

10.1166/jctn.2020.8634 ◽

2020 ◽

Vol 17 (1) ◽

pp. 92-100

Author(s):

Balanand Jha ◽

Kumar Abhishek ◽

Akshay Deepak ◽

Prakhar Shrivastav ◽

Suraj Thakre ◽

...

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Machine Learning Techniques ◽

Computing Power ◽

On Demand ◽

Learning Techniques ◽

Ease Of Access ◽

Start Ups

In the age of start-ups and technical research, the demand for high-end computing power and loads of space is ever increasing. Machine learning techniques have become an inseparable part of the big data analytics. Setting up one’s own infrastructure to deal with all this vastness is usually not feasible due to high expenses and lack of desired expertise. As a solution to this problem, this paper proposes a system for Big-Data Analytics and Machine Learning based on Hadoop and Spark frameworks that also supports Operating System (OS) Rental Services. Machine Learning (ML) services provide option to use both existing inbuilt popular models or create one’s own model. OS Rental services provide users with high end infrastructure on their low-end devices on rent. The entire implementation has been made open source for ease of access and facilitating extensibility.

Download Full-text

REVIEW OF MACHINE LEARNING TECHNIQUES FOR VOLUMINOUS INFORMATION MANAGEMENT

Journal of Soft Computing Paradigm - September 2019 ◽

10.36548/jscp.2019.2.005 ◽

2019 ◽

Vol 2019 (2) ◽

pp. 103-112

Author(s):

Dr. Pasumpon pandian

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Learning Algorithms ◽

Big Data Analytics ◽

Machine Learning Algorithms ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Technological Growth ◽

Rapid Pace

The recent technological growth at a rapid pace has paved way for the big data that denotes to the exponential growth of the information’s. The big data analytics are the trending concepts that have emerged as the promising technology that offers more enhanced perceptions from the huge set of the data that have been produced from the diverse areas. The review in the paper proceeds with the methods of the big-data-analytics and the machine-learning in handling, the huge set of data flow. The overview of the utilization of the machine-learning algorithms in the analytics of high voluminous data would provide with the deeper and the richer analysis of the huge set of information gathered to extract the valuable and turn it into actionable information’s. The paper is to review the part of machine-learning algorithms in the analytics of high voluminous data

Download Full-text

Discovering Knowledge Hidden in Big Data from Machine-Learning Techniques

Advances in Data Mining and Database Management - Web Data Mining and the Development of Knowledge-Based Decision Support Systems ◽

10.4018/978-1-5225-1877-8.ch010 ◽

2017 ◽

pp. 167-183 ◽

Cited By ~ 1

Author(s):

Adiraju Prashantha Rao

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Decision Makers ◽

Machine Learning Techniques ◽

Human Beings ◽

Raw Data ◽

Learning Techniques ◽

Data Analytic ◽

Information Growth

As the speed of information growth exceeds in this new century, excessive data is making great troubles to human beings. However, there are so much potential and highly useful values hidden in the huge volume of data. Big Data has drawn huge attention from researchers in information sciences, policy and decision makers in governments and enterprises. Data analytic is the science of examining raw data with the purpose of drawing conclusions about that information. Data analytics is about discovering knowledge from large volumes data and applying it to the business. Machine learning is ideal for exploiting the opportunities hidden in big data. This chapter able to discover and display the patterns buried in the data using machine learning.

Download Full-text

Identification of Losses in Turbomachinery With Machine Learning

Volume 1: Aircraft Engine; Fans and Blowers ◽

10.1115/gt2020-15337 ◽

2020 ◽

Author(s):

Gino Angelini ◽

Alessandro Corsini ◽

Giovanni Delibra ◽

Marco Giovannelli

Keyword(s):

Machine Learning ◽

Principal Component ◽

Machine Learning Techniques ◽

Post Processing ◽

Pressure Losses ◽

Learning Techniques ◽

Flow Features ◽

Exploratory Data ◽

Unsupervised Approach ◽

Important Design

Abstract One of the issues of handling large CFD datasets and process them to derive important design correlations is the limitation in automating the post-processing of data. Machine learning techniques, developed to process large unlabelled dataset, can play a key role on this subject. In this work an unsupervised approach to isolate different flow features inside a 2D cascade is proposed and validated. The approach relies on machine learning methods and in particular on Exploratory Data Analysis (EDA) and Principal Component Analysis for the pre-processing of the data and on K-means clustering for the post-processing. The K-means algorithm was trained on a Design of Experiments (DoE) of over 140 cases of 2D linear cascade configurations to identify the boundary layer on the profiles and the wake downstream. Validation resulted in a perfect capability of identifying the regions of interest. Then a possible exploitation of this method is presented, to compute pressure losses downstream of the cascade and train an artificial neural network to make a regression able to extend data to all the possible combinations of geometrical and operating parameters of the cascade. The same algorithm was applied to 3D flow cascades of profiles with sinusoidal leading edges to stress its extrapolation capability in case of flow regimes not present in the training DoE.

Download Full-text

Big Traffic Data Analytics For Smart Urban Intelligent Traffic System Using Machine Learning Techniques

2020 IEEE 9th Global Conference on Consumer Electronics (GCCE) ◽

10.1109/gcce50665.2020.9291790 ◽

2020 ◽

Author(s):

Su Su Hlaing ◽

Mie Mie Tin ◽

Mie Mie Khin ◽

Phyo Phyo Wai ◽

G R Sinha

Keyword(s):

Machine Learning ◽

Data Analytics ◽

Machine Learning Techniques ◽

Traffic Data ◽

Traffic System ◽

Learning Techniques ◽

Intelligent Traffic System ◽

Big Traffic Data

Download Full-text

Industrial Big Data Analytics for Cognitive Internet of Things: Wireless Sensor Networks, Smart Computing Algorithms, and Machine Learning Techniques

Analysis and Metaphysics ◽

10.22381/am1820193 ◽

2019 ◽

Vol 18 (0) ◽

pp. 23 ◽

Cited By ~ 7

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Big Data Analytics ◽

Machine Learning Techniques ◽

Wireless Sensor ◽

Learning Techniques ◽

Industrial Big Data ◽

Smart Computing ◽

Cognitive Internet Of Things

Download Full-text

Credit Rating Forecasting Using Machine Learning Techniques

Advances in Data Mining and Database Management - Managerial Perspectives on Intelligent Big Data Analytics ◽

10.4018/978-1-5225-7277-0.ch010 ◽

2019 ◽

pp. 180-198 ◽

Cited By ~ 1

Author(s):

Mark Wallis ◽

Kuldeep Kumar ◽

Adrian Gepp

Keyword(s):

Machine Learning ◽

Big Data ◽

Data Analytics ◽

Credit Ratings ◽

Relative Effectiveness ◽

Big Data Analytics ◽

Parametric Models ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Missing Variables

Credit ratings are an important metric for business managers and a contributor to economic growth. Forecasting such ratings might be a suitable application of big data analytics. As machine learning is one of the foundations of intelligent big data analytics, this chapter presents a comparative analysis of traditional statistical models and popular machine learning models for the prediction of Moody's long-term corporate debt ratings. Machine learning techniques such as artificial neural networks, support vector machines, and random forests generally outperformed their traditional counterparts in terms of both overall accuracy and the Kappa statistic. The parametric models may be hindered by missing variables and restrictive assumptions about the underlying distributions in the data. This chapter reveals the relative effectiveness of non-parametric big data analytics to model a complex process that frequently arises in business, specifically determining credit ratings.

Download Full-text