Comparative Analysis of Machine Learning Techniques Using Predictive Modeling

Introduction: Machine learning is an intelligent technology that works as a bridge between businesses and data science. With the involvement of data science, the business goal focuses on findings to get valuable insights on available data. The large part of Indian Cinema is Bollywood which is a multi-million dollar industry. This paper attempts to predict whether the upcoming Bollywood Movie would be Blockbuster, Superhit, Hit, Average or Flop. For this Machine Learning techniques (classification and prediction) will be applied. To make classifier or prediction model first step is the learning stage in which we need to give the training data set to train the model by applying some technique or algorithm and after that different rules are generated which helps to make a model and predict future trends in different types of organizations. Methods: All the techniques related to classification and Prediction such as Support Vector Machine(SVM), Random Forest, Decision Tree, Naïve Bayes, Logistic Regression, Adaboost, and KNN will be applied and try to find out efficient and effective results. All these functionalities can be applied with GUI Based workflows available with various categories such as data, Visualize, Model, and Evaluate. Result: To make classifier or prediction model first step is learning stage in which we need to give the training data set to train the model by applying some technique or algorithm and after that different rules are generated which helps to make a model and predict future trends in different types of organizations Conclusion: This paper focuses on Comparative Analysis that would be performed based on different parameters such as Accuracy, Confusion Matrix to identify the best possible model for predicting the movie Success. By using Advertisement Propaganda, they can plan for the best time to release the movie according to the predicted success rate to gain higher benefits. Discussion: Data Mining is the process of discovering different patterns from large data sets and from that various relationships are also discovered to solve various problems that come in business and helps to predict the forthcoming trends. This Prediction can help Production Houses for Advertisement Propaganda and also they can plan their costs and by assuring these factors they can make the movie more profitable.

Download Full-text

Application of Machine Learning Techniques As a Means of Mooring Integrity Monitoring

Volume 3: Structures, Safety, and Reliability ◽

10.1115/omae2019-96411 ◽

2019 ◽

Author(s):

Jonathan M. Gumley ◽

Hayden Marcollo ◽

Stuart Wales ◽

Andrew E. Potts ◽

Christopher J. Carra

Keyword(s):

Machine Learning ◽

Data Science ◽

Single Point ◽

Original System ◽

Training Data ◽

Machine Learning Techniques ◽

Mooring Line ◽

Artificial Noise ◽

Data Set ◽

Learning Techniques

Abstract There is growing importance in the offshore floating production sector to develop reliable and robust means of continuously monitoring the integrity of mooring systems for FPSOs and FPUs, particularly in light of the upcoming introduction of API-RP-2MIM. Here, the limitations of the current range of monitoring techniques are discussed, including well established technologies such as load cells, sonar, or visual inspection, within the context of the growing mainstream acceptance of data science and machine learning. Due to the large fleet of floating production platforms currently in service, there is a need for a readily deployable solution that can be retrofitted to existing platforms to passively monitor the performance of floating assets on their moorings, for which machine learning based systems have particular advantages. An earlier investigation conducted in 2016 on a shallow water, single point moored FPSO employed host facility data from in-service field measurements before and after a single mooring line failure event. This paper presents how the same machine learning techniques were applied to a deep water, semi taut, spread moored system where there was no host facility data available, therefore requiring a calibrated hydrodynamic numerical model to be used as the basis for the training data set. The machine learning techniques applied to both real and synthetically generated data were successful in replicating the response of the original system, even with the latter subjected to different variations of artificial noise. Furthermore, utilizing a probability-based approach, it was demonstrated that replicating the response of the underlying system was a powerful technique for predicting changes in the mooring system.

Download Full-text

A sentiment analysis system for social media using machine learning techniques: Social enablement

Digital Scholarship in the Humanities ◽

10.1093/llc/fqy037 ◽

2018 ◽

Vol 34 (3) ◽

pp. 569-581 ◽

Cited By ~ 1

Author(s):

Sujata Rani ◽

Parteek Kumar

Keyword(s):

Machine Learning ◽

Social Media ◽

Sentiment Analysis ◽

Media Analysis ◽

Training Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Analysis Tool ◽

Data Set ◽

Learning Techniques

Abstract In this article, an innovative approach to perform the sentiment analysis (SA) has been presented. The proposed system handles the issues of Romanized or abbreviated text and spelling variations in the text to perform the sentiment analysis. The training data set of 3,000 movie reviews and tweets has been manually labeled by native speakers of Hindi in three classes, i.e. positive, negative, and neutral. The system uses WEKA (Waikato Environment for Knowledge Analysis) tool to convert these string data into numerical matrices and applies three machine learning techniques, i.e. Naive Bayes (NB), J48, and support vector machine (SVM). The proposed system has been tested on 100 movie reviews and tweets, and it has been observed that SVM has performed best in comparison to other classifiers, and it has an accuracy of 68% for movie reviews and 82% in case of tweets. The results of the proposed system are very promising and can be used in emerging applications like SA of product reviews and social media analysis. Additionally, the proposed system can be used in other cultural/social benefits like predicting/fighting human riots.

Download Full-text

Rotor Unbalance Kind and Severity Identification by Current Signature Analysis with Adaptative Update to Multiclass Machine Learning Algorithms

Studies in Engineering and Technology ◽

10.11114/set.v8i1.5213 ◽

2021 ◽

Vol 8 (1) ◽

pp. 28

Author(s):

S. L. Ávila ◽

H. M. Schaberle ◽

S. Youssef ◽

F. S. Pacheco ◽

C. A. Penz

Keyword(s):

Machine Learning ◽

Machine Learning Algorithms ◽

Training Data ◽

Machine Learning Techniques ◽

Support Vector ◽

Signature Analysis ◽

Data Set ◽

Learning Techniques ◽

Environmental Variations ◽

Current Signature

The health of a rotating electric machine can be evaluated by monitoring electrical and mechanical parameters. As more information is available, it easier can become the diagnosis of the machine operational condition. We built a laboratory test bench to study rotor unbalance issues according to ISO standards. Using the electric stator current harmonic analysis, this paper presents a comparison study among Support-Vector Machines, Decision Tree classifies, and One-vs-One strategy to identify rotor unbalance kind and severity problem – a nonlinear multiclass task. Moreover, we propose a methodology to update the classifier for dealing better with changes produced by environmental variations and natural machinery usage. The adaptative update means to update the training data set with an amount of recent data, saving the entire original historical data. It is relevant for engineering maintenance. Our results show that the current signature analysis is appropriate to identify the type and severity of the rotor unbalance problem. Moreover, we show that machine learning techniques can be effective for an industrial application.

Download Full-text

Auroral Classification Ergonomics and the Implications for Machine Learning

10.5194/gi-2019-41 ◽

2020 ◽

Author(s):

Derek McKay ◽

Andreas Kvammen

Keyword(s):

Machine Learning ◽

Large Data ◽

Research Community ◽

Training Data ◽

Machine Learning Techniques ◽

Data Set ◽

Potential Source ◽

Learning Techniques ◽

Training Samples ◽

Learning Research

Abstract. The machine learning research community has focused greatly on bias in algorithms and have identified different manifestations of it. Bias in the training samples is recognised as a potential source of prejudice in machine learning. It can be introduced by human experts who define the training sets. As machine learning techniques are being applied to auroral classification, it is important to identify and address potential sources of expert-injected bias. In an ongoing study, 13 947 auroral images were manually classified with significant differences between classifications. This large data set allowed identification of some of these biases, especially those originating as a result of the ergonomics of the classification process. These findings are presented in this paper, to serve as a checklist for improving training data integrity, not just for expert classifications, but also for crowd-sourced, citizen science projects. As the application of machine learning techniques to auroral research is relatively new, it is important that biases are identified and addressed before they become endemic in the corpus of training data.

Download Full-text

Significance of Artificial Intelligence and Machine Learning Techniques in Smart Cloud Computing: A Review

International Journal of Soft Computing and Engineering - Regular Issue ◽

10.35940/ijsce.c3265.099319 ◽

2019 ◽

Vol 9 (3) ◽

pp. 1-7

Keyword(s):

Artificial Intelligence ◽

Machine Learning ◽

Cloud Computing ◽

Reinforcement Learning ◽

Heuristic Algorithms ◽

Machine Learning Algorithms ◽

Training Data ◽

Machine Learning Techniques ◽

Data Set ◽

Learning Techniques

Realization of the tremendous features and facilities provided by Cloud Computing by the geniuses in the world of digital marketing increases its demand. As customer satisfaction is the manifest of this ever shining field, balancing its load becomes a major issue. Various heuristic and meta-heuristic algorithms were applied to get optimum solutions. The current era is much attracted with the provisioning of self-manageable, self-learnable, self-healable, and self-configurable smart systems. To get self-manageable Smart Cloud, various Artificial Intelligence and Machine Learning (AI-ML) techniques and algorithms are revived. In this review, recent trend in the utilization of AI-ML techniques, their applied areas, purpose, their merits and demerits are highlighted. These techniques are further categorized as instance-based machine learning algorithms and reinforcement learning techniques based on their ability of learning. Reinforcement learning is preferred when there is no training data set. It leads the system to learn by its own experience itself even in dynamic environment.

Download Full-text

Prediction of Survivors in the Titanic Cruise

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c4408.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 1268-1271

Keyword(s):

Machine Learning ◽

Data Science ◽

Machine Learning Techniques ◽

Gradient Boosting ◽

Support Vector ◽

Data Set ◽

Learning Techniques ◽

Gradient Boosting Machine ◽

Individual Survival ◽

Long Time

On the 15th of April, 1912 the titanic witnessed a disaster resulting in the sinking of her passengers on the maiden voyage near North Atlantic. Even though it is a very long time since this maritime disaster took place, the idea behind what impacts each individual survival is still a great research attracting researcher’s attention. The approach taken in this paper is to utilize the publically available data set from website called Kaggle. Kaggle is a popular data science webpage that put together information of people in the titanic into a data set for the data mining competition: “Titanic: Machine Learning from Disaster”. The research and comparisons in this paper uses a few machine learning techniques and algorithms to analyse the data for classification and prediction of survivors. The prediction and efficiency of these algorithms depend greatly on data analysis and model. The techniques used to do so are Random Forest, Support Vector Machine, Gradient Boosting Machine.

Download Full-text

A Comparative Analysis of Ensemble Based Machine Learning Techniques for Diabetes Identification

2021 2nd International Conference on Robotics, Electrical and Signal Processing Techniques (ICREST) ◽

10.1109/icrest51555.2021.9331036 ◽

2021 ◽

Author(s):

Nahid Hossain Taz ◽

Abrar Islam ◽

Ishrak Mahmud

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Machine Learning Techniques ◽

Learning Techniques

Download Full-text

Comparative Analysis of Machine Learning Techniques for Temperature Compensation in Microwave Sensors

IEEE Transactions on Microwave Theory and Techniques ◽

10.1109/tmtt.2021.3081119 ◽

2021 ◽

pp. 1-1

Author(s):

Nazli Kazemi ◽

Mohammad Abdolrazzaghi ◽

Petr Musilek

Keyword(s):

Machine Learning ◽

Comparative Analysis ◽

Temperature Compensation ◽

Machine Learning Techniques ◽

Learning Techniques ◽

Microwave Sensors

Download Full-text

Borderline and Depression: A Thin EEG Line

Clinical EEG and Neuroscience ◽

10.1177/15500594211060830 ◽

2021 ◽

pp. 155005942110608

Author(s):

Jakša Vukojević ◽

Damir Mulc ◽

Ivana Kinder ◽

Eda Jovičić ◽

Krešimir Friganović ◽

...

Keyword(s):

Machine Learning ◽

Borderline Personality ◽

Machine Learning Techniques ◽

Major Depressive ◽

Everyday Clinical Practice ◽

Data Set ◽

Learning Techniques ◽

Eeg Recordings ◽

The Given ◽

Close Interrelationship

In everyday clinical practice, there is an ongoing debate about the nature of major depressive disorder (MDD) in patients with borderline personality disorder (BPD). The underlying research does not give us a clear distinction between those 2 entities, although depression is among the most frequent comorbid diagnosis in borderline personality patients. The notion that depression can be a distinct disorder but also a symptom in other psychopathologies led our team to try and delineate those 2 entities using 146 EEG recordings and machine learning. The utilized algorithms, developed solely for this purpose, could not differentiate those 2 entities, meaning that patients suffering from MDD did not have significantly different EEG in terms of patients diagnosed with MDD and BPD respecting the given data and methods used. By increasing the data set and the spatiotemporal specificity, one could have a more sensitive diagnostic approach when using EEG recordings. To our knowledge, this is the first study that used EEG recordings and advanced machine learning techniques and further confirmed the close interrelationship between those 2 entities.

Download Full-text

Comparative Analysis of Machine Learning Techniques with Principal Component Analysis on Kidney and Heart Disease

10.1109/icesc51422.2021.9533011 ◽

2021 ◽

Author(s):

Reena Chandra ◽

Manoj Kapil ◽

Avinash Sharma

Keyword(s):

Machine Learning ◽

Principal Component Analysis ◽

Heart Disease ◽

Comparative Analysis ◽

Principal Component ◽

Component Analysis ◽

Machine Learning Techniques ◽

Learning Techniques

Download Full-text