Hybrid feature selection model based on machine learning and knowledge graph

Abstract Aiming at the problem that the current feature selection algorithm can not adapt to both supervised learning data and unsupervised learning data, and had poor feature interpretability, this paper proposed a hybrid feature selection model based on machine learning and knowledge graph. By the idea of hybridization, this model used supervised learning algorithms, unsupervised learning algorithms and knowledge graph technology to model from the perspective of data features and text features. Firstly, the data-based feature weights were obtained through the machine learning model, and then the text-based weights were obtained by using the knowledge graph technology, and the weight sets are combined to obtain a feature matrix with good explanatory properties that meets both the data and text features. Finally, the case analysis proves that the method proposed in this paper has good effects and interpretability.

Download Full-text

Big Data Mining Algorithms

Encyclopedia of Information Science and Technology, Fifth Edition - Advances in Information Quality and Management ◽

10.4018/978-1-7998-3479-3.ch052 ◽

2021 ◽

pp. 768-777

Author(s):

M. Govindarajan

Keyword(s):

Machine Learning ◽

Data Mining ◽

Big Data ◽

Unsupervised Learning ◽

Supervised Learning ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Data Sets ◽

Big Data Mining ◽

Supervised Learning Algorithms

Big data mining involves knowledge discovery from these large data sets. The purpose of this chapter is to provide an analysis of different machine learning algorithms available for performing big data analytics. The machine learning algorithms are categorized in three key categories, namely, supervised, unsupervised, and semi-supervised machine learning algorithm. The supervised learning algorithms are trained with a complete set of data, and thus, the supervised learning algorithms are used to predict/forecast. Example algorithms include logistic regression and the back propagation neural network. The unsupervised learning algorithms starts learning from scratch, and therefore, the unsupervised learning algorithms are used for clustering. Example algorithms include: the Apriori algorithm and K-Means. The semi-supervised learning combines both supervised and unsupervised learning algorithms. The semi-supervised algorithms are trained, and the algorithms also include non-trained learning.

Download Full-text

Machine Learning Algorithms for Indian Music Classification Based on Raga Framework

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.k7724.0991120 ◽

2020 ◽

Vol 9 (11) ◽

pp. 130-134

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

Classical Music ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Support Vector ◽

Learning Methods ◽

Distribution Features ◽

Indian Music

The supervised and unsupervised learning methods in Machine Learning are successfully applied to solve various real time problems in different domains. The Indian Music has a base of Raga structure. The Raga is melodious framework for composition and improvisation. The identification and indexing of Raga for Indian Music data will improve efficiency and accuracy of retrieval being expected by e-learners, composers and classical music listeners. The identification of Raga in Indian Music is very difficult task for naïve user. The application of machine learning algorithms will definitely be best key idea. The paper demonstrates K-means and Agglomerative clustering methods from unsupervised learning nonetheless K Nearest Neighbor, Decision Tree and Support Vector Machine and Naïve Bayes classifiers are implemented from supervised learning. The partition of 70:30 is done for training data and testing data. Pitch Class Distribution features are extracted by identifying Pitch for every frame in an audio signal using Autocorrelation method. The comparison of above algorithms is done and observed supervised learning methods outperformed.

Download Full-text

Geometric morphometrics and machine learning challenge currently accepted species limits of the land snail Placostylus (Pulmonata: Bothriembryontidae) on the Isle of Pines, New Caledonia

Journal of Molluscan Studies ◽

10.1093/mollus/eyz031 ◽

2020 ◽

Vol 86 (1) ◽

pp. 35-41

Author(s):

Mathieu Quenu ◽

Steven A Trewick ◽

Fabrice Brescia ◽

Mary Morgan-Richards

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

New Caledonia ◽

Learning Algorithm ◽

Learning Algorithms ◽

Land Snail ◽

Machine Learning Algorithms ◽

Snail Species ◽

Size And Shape

Abstract Size and shape variations of shells can be used to identify natural phenotypic clusters and thus delimit snail species. Here, we apply both supervised and unsupervised machine learning algorithms to a geometric morphometric dataset to investigate size and shape variations of the shells of the endemic land snail Placostylus from New Caledonia. We sampled eight populations of Placostylus from the Isle of Pines, where two species of this genus reportedly coexist. We used neural network analysis as a supervised learning algorithm and Gaussian mixture models as an unsupervised learning algorithm. Using a training dataset of individuals assigned to species using nuclear markers, we found that supervised learning algorithms could not unambiguously classify all individuals of our expanded dataset using shell size and shape. Unsupervised learning showed that the optimal division of our data consisted of three phenotypic clusters. Two of these clusters correspond to the established species Placostylus fibratus and P. porphyrostomus, while the third cluster was intermediate in both shape and size. Most of the individuals that were not clearly classified using supervised learning were classified to this intermediate phenotype by unsupervised learning, and most of these individuals came from previously unsampled populations. These results may indicate the presence of persistent putative-hybrid populations of Placostylus in the Isle of Pines.

Download Full-text

Sentiment Analysis of Movie Reviews: A Study of Machine Learning Algorithms with Various Feature Selection Methods

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v5i9.113121 ◽

2017 ◽

Vol 5 (9) ◽

Cited By ~ 1

Author(s):

Rajwinder Kaur

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Sentiment Analysis ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Selection Methods

Download Full-text

Application of Machine Learning in Animal Disease Analysis and Prediction

Current Bioinformatics ◽

10.2174/1574893615999200728195613 ◽

2020 ◽

Vol 15 ◽

Author(s):

Shuwen Zhang ◽

Qiang Su ◽

Qin Chen

Keyword(s):

Machine Learning ◽

Unsupervised Learning ◽

Supervised Learning ◽

Clustering Algorithm ◽

Principal Component ◽

Support Vector ◽

Animal Disease ◽

Human Beings ◽

Animal Diseases ◽

Disease Analysis

Abstract: Major animal diseases pose a great threat to animal husbandry and human beings. With the deepening of globalization and the abundance of data resources, the prediction and analysis of animal diseases by using big data are becoming more and more important. The focus of machine learning is to make computers learn how to learn from data and use the learned experience to analyze and predict. Firstly, this paper introduces the animal epidemic situation and machine learning. Then it briefly introduces the application of machine learning in animal disease analysis and prediction. Machine learning is mainly divided into supervised learning and unsupervised learning. Supervised learning includes support vector machines, naive bayes, decision trees, random forests, logistic regression, artificial neural networks, deep learning, and AdaBoost. Unsupervised learning has maximum expectation algorithm, principal component analysis hierarchical clustering algorithm and maxent. Through the discussion of this paper, people have a clearer concept of machine learning and understand its application prospect in animal diseases.

Download Full-text

A State of Art Techniques on Machine Learning Algorithms: A Perspective of Supervised Learning Approaches in Data Classification

2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS) ◽

10.1109/iccons.2018.8663155 ◽

2018 ◽

Cited By ~ 15

Author(s):

R. Saravanan ◽

Pothula Sujatha

Keyword(s):

Machine Learning ◽

Supervised Learning ◽

Learning Algorithms ◽

Data Classification ◽

Machine Learning Algorithms ◽

Learning Approaches ◽

State Of Art ◽

Art Techniques

Download Full-text

Investigating the performance of the supervised learning algorithms for estimating NPPs parameters in combination with the different feature selection techniques

Annals of Nuclear Energy ◽

10.1016/j.anucene.2021.108299 ◽

2021 ◽

Vol 158 ◽

pp. 108299

Author(s):

Khalil Moshkbar-Bakhshayesh

Keyword(s):

Feature Selection ◽

Supervised Learning ◽

Learning Algorithms ◽

Supervised Learning Algorithms ◽

Feature Selection Techniques

Download Full-text

Feature Selection with Fast Correlation-Based Filter for Breast Cancer Prediction and Classification Using Machine Learning Algorithms

2018 International Symposium on Advanced Electrical and Communication Technologies (ISAECT) ◽

10.1109/isaect.2018.8618688 ◽

2018 ◽

Author(s):

Youness Khourdifi ◽

Mohamed Bahaj

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Feature Selection ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Cancer Prediction

Download Full-text

Comparative study on total nitrogen prediction in wastewater treatment plant and effect of various feature selection methods on machine learning algorithms performance

Journal of Water Process Engineering ◽

10.1016/j.jwpe.2021.102033 ◽

2021 ◽

Vol 41 ◽

pp. 102033

Author(s):

Faramarz Bagherzadeh ◽

Mohamad-Javad Mehrani ◽

Milad Basirifard ◽

Javad Roostaei

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Wastewater Treatment ◽

Comparative Study ◽

Total Nitrogen ◽

Wastewater Treatment Plant ◽

Learning Algorithms ◽

Treatment Plant ◽

Machine Learning Algorithms ◽

Selection Methods

Download Full-text

Marketing customer response scoring model based on machine learning data analysis

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189484 ◽

2020 ◽

pp. 1-11

Author(s):

Tang Yan ◽

Li Pengfei

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Data Extraction ◽

Machine Learning Algorithms ◽

Customer Relationship ◽

Customer Data ◽

Modeling And Analysis ◽

Scoring Model ◽

Model Based ◽

Learning Data

In marketing, problems such as the increase in customer data, the increase in the difficulty of data extraction and access, the lack of reliability and accuracy of data analysis, the slow efficiency of data processing, and the inability to effectively transform massive amounts of data into valuable information have become increasingly prominent. In order to study the effect of customer response, based on machine learning algorithms, this paper constructs a marketing customer response scoring model based on machine learning data analysis. In the context of supplier customer relationship management, this article analyzes the supplier’s precision marketing status and existing problems and uses its own development and management characteristics to improve marketing strategies. Moreover, this article uses a combination of database and statistical modeling and analysis to try to establish a customer response scoring model suitable for supplier precision marketing. In addition, this article conducts research and analysis with examples. From the research results, it can be seen that the performance of the model constructed in this article is good.

Download Full-text