scholarly journals Predicting Survival on Titanic by Applying Exploratory Data Analytics and Machine Learning Techniques

2018 ◽  
Vol 179 (44) ◽  
pp. 32-38 ◽  
Author(s):  
Yogesh Kakde ◽  
Shefali Agrawal
Author(s):  
Rathimala Kannan ◽  
Intan Soraya Rosdi ◽  
Kannan Ramakrishna ◽  
Haziq Riza Abdul Rasid ◽  
Mohamed Haryz Izzudin Mohamed Rafy ◽  
...  

Data analytics is the essential component in deriving insights from data obtained from multiple sources. It represents the technology, methods and techniques used to obtain insights from massive datasets. As data increases, companies are looking for ways to gain relevant business insights underneath layers of data and information, to help them better understand new business ventures, opportunities, business trends and complex challenges. However, to date, while the extensive benefits of business data analytics to large organizations are widely published, micro, small, and medium sized organisations have not fully grasped the potential benefits to be gained from data analytics using machine learning techniques. This study is guided by the research question of how data analytics using machine learning techniques can benefit small businesses. Using the case study method, this paper outlines how small businesses in two different industries i.e. healthcare and retail can leverage data analytics and machine learning techniques to gain competitive advantage from the data. Details on the respective benefits gained by the small business owners featured in the two case studies provide important answers to the research question.


2020 ◽  
Vol 17 (1) ◽  
pp. 92-100
Author(s):  
Balanand Jha ◽  
Kumar Abhishek ◽  
Akshay Deepak ◽  
Prakhar Shrivastav ◽  
Suraj Thakre ◽  
...  

In the age of start-ups and technical research, the demand for high-end computing power and loads of space is ever increasing. Machine learning techniques have become an inseparable part of the big data analytics. Setting up one’s own infrastructure to deal with all this vastness is usually not feasible due to high expenses and lack of desired expertise. As a solution to this problem, this paper proposes a system for Big-Data Analytics and Machine Learning based on Hadoop and Spark frameworks that also supports Operating System (OS) Rental Services. Machine Learning (ML) services provide option to use both existing inbuilt popular models or create one’s own model. OS Rental services provide users with high end infrastructure on their low-end devices on rent. The entire implementation has been made open source for ease of access and facilitating extensibility.


2019 ◽  
Vol 2019 (2) ◽  
pp. 103-112
Author(s):  
Dr. Pasumpon pandian

The recent technological growth at a rapid pace has paved way for the big data that denotes to the exponential growth of the information’s. The big data analytics are the trending concepts that have emerged as the promising technology that offers more enhanced perceptions from the huge set of the data that have been produced from the diverse areas. The review in the paper proceeds with the methods of the big-data-analytics and the machine-learning in handling, the huge set of data flow. The overview of the utilization of the machine-learning algorithms in the analytics of high voluminous data would provide with the deeper and the richer analysis of the huge set of information gathered to extract the valuable and turn it into actionable information’s. The paper is to review the part of machine-learning algorithms in the analytics of high voluminous data


Author(s):  
Adiraju Prashantha Rao

As the speed of information growth exceeds in this new century, excessive data is making great troubles to human beings. However, there are so much potential and highly useful values hidden in the huge volume of data. Big Data has drawn huge attention from researchers in information sciences, policy and decision makers in governments and enterprises. Data analytic is the science of examining raw data with the purpose of drawing conclusions about that information. Data analytics is about discovering knowledge from large volumes data and applying it to the business. Machine learning is ideal for exploiting the opportunities hidden in big data. This chapter able to discover and display the patterns buried in the data using machine learning.


Author(s):  
Gino Angelini ◽  
Alessandro Corsini ◽  
Giovanni Delibra ◽  
Marco Giovannelli

Abstract One of the issues of handling large CFD datasets and process them to derive important design correlations is the limitation in automating the post-processing of data. Machine learning techniques, developed to process large unlabelled dataset, can play a key role on this subject. In this work an unsupervised approach to isolate different flow features inside a 2D cascade is proposed and validated. The approach relies on machine learning methods and in particular on Exploratory Data Analysis (EDA) and Principal Component Analysis for the pre-processing of the data and on K-means clustering for the post-processing. The K-means algorithm was trained on a Design of Experiments (DoE) of over 140 cases of 2D linear cascade configurations to identify the boundary layer on the profiles and the wake downstream. Validation resulted in a perfect capability of identifying the regions of interest. Then a possible exploitation of this method is presented, to compute pressure losses downstream of the cascade and train an artificial neural network to make a regression able to extend data to all the possible combinations of geometrical and operating parameters of the cascade. The same algorithm was applied to 3D flow cascades of profiles with sinusoidal leading edges to stress its extrapolation capability in case of flow regimes not present in the training DoE.


Author(s):  
Mark Wallis ◽  
Kuldeep Kumar ◽  
Adrian Gepp

Credit ratings are an important metric for business managers and a contributor to economic growth. Forecasting such ratings might be a suitable application of big data analytics. As machine learning is one of the foundations of intelligent big data analytics, this chapter presents a comparative analysis of traditional statistical models and popular machine learning models for the prediction of Moody's long-term corporate debt ratings. Machine learning techniques such as artificial neural networks, support vector machines, and random forests generally outperformed their traditional counterparts in terms of both overall accuracy and the Kappa statistic. The parametric models may be hindered by missing variables and restrictive assumptions about the underlying distributions in the data. This chapter reveals the relative effectiveness of non-parametric big data analytics to model a complex process that frequently arises in business, specifically determining credit ratings.


Sign in / Sign up

Export Citation Format

Share Document