A Comparative Study of Feature Selection and Machine Learning Methods for Sentiment Classification on Movie Data Set

A Comparative Study of Machine Learning Methods for Persistence Diagrams

Frontiers in Artificial Intelligence ◽

10.3389/frai.2021.681174 ◽

2021 ◽

Vol 4 ◽

Author(s):

Danielle Barnes ◽

Luis Polanco ◽

Jose A. Perea

Keyword(s):

Machine Learning ◽

Comparative Study ◽

Shape Matching ◽

Data Sets ◽

Learning Methods ◽

Data Set ◽

Multi Scale ◽

Machine Learning Methods ◽

Real World Applications ◽

Persistence Diagrams

Many and varied methods currently exist for featurization, which is the process of mapping persistence diagrams to Euclidean space, with the goal of maximally preserving structure. However, and to our knowledge, there are presently no methodical comparisons of existing approaches, nor a standardized collection of test data sets. This paper provides a comparative study of several such methods. In particular, we review, evaluate, and compare the stable multi-scale kernel, persistence landscapes, persistence images, the ring of algebraic functions, template functions, and adaptive template systems. Using these approaches for feature extraction, we apply and compare popular machine learning methods on five data sets: MNIST, Shape retrieval of non-rigid 3D Human Models (SHREC14), extracts from the Protein Classification Benchmark Collection (Protein), MPEG7 shape matching, and HAM10000 skin lesion data set. These data sets are commonly used in the above methods for featurization, and we use them to evaluate predictive utility in real-world applications.

Download Full-text

Machine learning methods for cyber security intrusion detection: Datasets and comparative study

Computer Networks ◽

10.1016/j.comnet.2021.107840 ◽

2021 ◽

Vol 188 ◽

pp. 107840

Author(s):

Ilhan Firat Kilincer ◽

Fatih Ertam ◽

Abdulkadir Sengur

Keyword(s):

Machine Learning ◽

Intrusion Detection ◽

Comparative Study ◽

Cyber Security ◽

Learning Methods ◽

Machine Learning Methods

Download Full-text

Feature Selection and Machine Learning Methods for Optimal Identification and Prediction of Subtypes in Parkinson's Disease

Computer Methods and Programs in Biomedicine ◽

10.1016/j.cmpb.2021.106131 ◽

2021 ◽

pp. 106131

Author(s):

Mohammad R. Salmanpour ◽

Mojtaba Shamsaei ◽

Arman Rahmim

Keyword(s):

Machine Learning ◽

Parkinson’S Disease ◽

Parkinson's Disease ◽

Feature Selection ◽

Learning Methods ◽

Machine Learning Methods

Download Full-text

A comparative study of machine learning methods for ordinal classification with absolute and relative information

Knowledge-Based Systems ◽

10.1016/j.knosys.2021.107358 ◽

2021 ◽

pp. 107358

Author(s):

Mengzi Tang ◽

Raúl Pérez-Fernández ◽

Bernard De Baets

Keyword(s):

Machine Learning ◽

Comparative Study ◽

Learning Methods ◽

Ordinal Classification ◽

Machine Learning Methods ◽

Relative Information

Download Full-text

Improved Permeability Prediction of Porous Media by Feature Selection and Machine Learning Methods Comparison

Journal of Computing in Civil Engineering ◽

10.1061/(asce)cp.1943-5487.0000983 ◽

2022 ◽

Vol 36 (2) ◽

Author(s):

J. W. Tian ◽

Chongchong Qi ◽

Kang Peng ◽

Yingfeng Sun ◽

Zaher Mundher Yaseen

Keyword(s):

Machine Learning ◽

Porous Media ◽

Feature Selection ◽

Learning Methods ◽

Methods Comparison ◽

Machine Learning Methods ◽

Permeability Prediction

Download Full-text

Experiments on the Use of Feature Selection and Machine Learning Methods in Automatic Malay Text Categorization

Procedia Technology ◽

10.1016/j.protcy.2013.12.254 ◽

2013 ◽

Vol 11 ◽

pp. 748-754 ◽

Cited By ~ 6

Author(s):

Hamood Alshalabi ◽

Sabrina Tiun ◽

Nazlia Omar ◽

Mohammed Albared

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Text Categorization ◽

Learning Methods ◽

Machine Learning Methods

Download Full-text

Automatic Misinformation Detection About COVID-19 in Brazilian Portuguese WhatsApp Messages

10.5753/sbbd_estendido.2021.18173 ◽

2021 ◽

Author(s):

Antônio Diogo Forte Martins ◽

José Maria Monteiro ◽

Javam Machado

Keyword(s):

Machine Learning ◽

Social Networks ◽

Brazilian Portuguese ◽

Primary Sources ◽

Learning Methods ◽

Data Set ◽

Machine Learning Methods

During the coronavirus pandemic, the problem of misinformation arose once again, quite intensely, through social networks. In Brazil, one of the primary sources of misinformation is the messaging application WhatsApp. However, due to WhatsApp's private messaging nature, there still few methods of misinformation detection developed specifically for this platform. In this context, the automatic misinformation detection (MID) about COVID-19 in Brazilian Portuguese WhatsApp messages becomes a crucial challenge. In this work, we present the COVID-19.BR, a data set of WhatsApp messages about coronavirus in Brazilian Portuguese, collected from Brazilian public groups and manually labeled. Then, we are investigating different machine learning methods in order to build an efficient MID for WhatsApp messages. So far, our best result achieved an F1 score of 0.774 due to the predominance of short texts. However, when texts with less than 50 words are filtered, the F1 score rises to 0.85.

Download Full-text

Comparison of machine learning methods for crack localization

Acta et Commentationes Universitatis Tartuensis de Mathematica ◽

10.12697/acutm.2019.23.13 ◽

2019 ◽

Vol 23 (1) ◽

pp. 125-142

Author(s):

Helle Hein ◽

Ljubov Jaanuska

Keyword(s):

Machine Learning ◽

Random Forests ◽

Crack Depth ◽

Haar Wavelet ◽

Extensive Investigation ◽

Learning Methods ◽

Data Set ◽

Crack Location ◽

Machine Learning Methods ◽

Discrete Transform

In this paper, the Haar wavelet discrete transform, the artificial neural networks (ANNs), and the random forests (RFs) are applied to predict the location and severity of a crack in an Euler–Bernoulli cantilever subjected to the transverse free vibration. An extensive investigation into two data collection sets and machine learning methods showed that the depth of a crack is more difficult to predict than its location. The data set of eight natural frequency parameters produces more accurate predictions on the crack depth; meanwhile, the data set of eight Haar wavelet coefficients produces more precise predictions on the crack location. Furthermore, the analysis of the results showed that the ensemble of 50 ANN trained by Bayesian regularization and Levenberg–Marquardt algorithms slightly outperforms RF.

Download Full-text