Machine Learning from imbalanced data-sets: an application to the bike-sharing inventory problem

Mapping Intimacies ◽

10.1109/mt-its49943.2021.9529281 ◽

2021 ◽

Author(s):

Giovanni Ceccarelli ◽

Guido Cantelmo ◽

Marialisa Nigro ◽

Constantinos Antoniou

Keyword(s):

Machine Learning ◽

Imbalanced Data ◽

Data Sets ◽

Inventory Problem ◽

Imbalanced Data Sets ◽

Download Full-text

Machine learning from imbalanced data sets for astronomical object classification

2011 International Conference of Soft Computing and Pattern Recognition (SoCPaR) ◽

10.1109/socpar.2011.6089283 ◽

2011 ◽

Author(s):

Jorge de la Calleja ◽

Antonio Benitez ◽

Ma. Auxilio Medina ◽

Olac Fuentes

Keyword(s):

Machine Learning ◽

Imbalanced Data ◽

Object Classification ◽

Data Sets ◽

Imbalanced Data Sets ◽

Astronomical Object

Download Full-text

Application of parallel distributed genetics-based machine learning to imbalanced data sets

2012 IEEE International Conference on Fuzzy Systems ◽

10.1109/fuzz-ieee.2012.6251192 ◽

2012 ◽

Author(s):

Yusuke Nojima ◽

Shingo Mihara ◽

Hisao Ishibuchi

Keyword(s):

Machine Learning ◽

Imbalanced Data ◽

Data Sets ◽

Imbalanced Data Sets

Download Full-text

DTO-SMOTE: Delaunay Tessellation Oversampling for Imbalanced Data Sets

Information ◽

10.3390/info11120557 ◽

2020 ◽

Vol 11 (12) ◽

pp. 557

Author(s):

Alexandre M. de Carvalho ◽

Ronaldo C. Prati

Keyword(s):

Machine Learning ◽

Geometric Mean ◽

Imbalanced Data ◽

Sampling Technique ◽

Classification Algorithms ◽

Data Sets ◽

Delaunay Tessellation ◽

Minority Class ◽

Imbalanced Data Sets

One of the significant challenges in machine learning is the classification of imbalanced data. In many situations, standard classifiers cannot learn how to distinguish minority class examples from the others. Since many real problems are unbalanced, this problem has become very relevant and deeply studied today. This paper presents a new preprocessing method based on Delaunay tessellation and the preprocessing algorithm SMOTE (Synthetic Minority Over-sampling Technique), which we call DTO-SMOTE (Delaunay Tessellation Oversampling SMOTE). DTO-SMOTE constructs a mesh of simplices (in this paper, we use tetrahedrons) for creating synthetic examples. We compare results with five preprocessing algorithms (GEOMETRIC-SMOTE, SVM-SMOTE, SMOTE-BORDERLINE-1, SMOTE-BORDERLINE-2, and SMOTE), eight classification algorithms, and 61 binary-class data sets. For some classifiers, DTO-SMOTE has higher performance than others in terms of Area Under the ROC curve (AUC), Geometric Mean (GEO), and Generalized Index of Balanced Accuracy (IBA).

Download Full-text

Imbalanced Data Detection Kernel Method in Closed Systems

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.756-759.3652 ◽

2013 ◽

Vol 756-759 ◽

pp. 3652-3658

Author(s):

You Li Lu ◽

Jun Luo

Keyword(s):

Kernel Methods ◽

Kernel Method ◽

Imbalanced Data ◽

Data Detection ◽

Data Sets ◽

System Call ◽

Data Set ◽

Imbalanced Data Sets ◽

Lower Complexity ◽

Under the study of Kernel Methods, this paper put forward two improved algorithm which called R-SVM & I-SVDD in order to cope with the imbalanced data sets in closed systems. R-SVM used K-means algorithm clustering space samples while I-SVDD improved the performance of original SVDD by imbalanced sample training. Experiment of two sets of system call data set shows that these two algorithms are more effectively and R-SVM has a lower complexity.

Download Full-text

Automatic Annotation of Protein Functional Class from Sparse and Imbalanced Data Sets

Data Mining and Bioinformatics - Lecture Notes in Computer Science ◽

10.1007/11960669_7 ◽

2006 ◽

pp. 65-77 ◽

Author(s):

Jaehee Jung ◽

Michael R. Thon

Keyword(s):

Imbalanced Data ◽

Functional Class ◽

Data Sets ◽

Automatic Annotation ◽

Imbalanced Data Sets

Download Full-text

An Improved Algorithm for SVMs Classification of Imbalanced Data Sets

Engineering Applications of Neural Networks - Communications in Computer and Information Science ◽

10.1007/978-3-642-03969-0_11 ◽

2009 ◽

pp. 108-118 ◽

Author(s):

Cristiano Leite Castro ◽

Mateus Araujo Carvalho ◽

Antônio Padua Braga

Keyword(s):

Imbalanced Data ◽

Data Sets ◽

Imbalanced Data Sets ◽

Improved Algorithm

Download Full-text

Multi-class Imbalanced Data-Sets with Linguistic Fuzzy Rule Based Classification Systems Based on Pairwise Learning

Computational Intelligence for Knowledge-Based Systems Design - Lecture Notes in Computer Science ◽

10.1007/978-3-642-14049-5_10 ◽

2010 ◽

pp. 89-98 ◽

Author(s):

Alberto Fernández ◽

Mara José del Jesus ◽

Francisco Herrera

Keyword(s):

Imbalanced Data ◽

Classification Systems ◽

Data Sets ◽

Imbalanced Data Sets ◽

Pairwise Learning

Download Full-text

An Optimized Random Forest Classification Method for Processing Imbalanced Data Sets of Alzheimer's Disease

10.1109/ccdc52312.2021.9602177 ◽

2021 ◽

Author(s):

Haijing Sun ◽

Anna Wang ◽

Yun Feng ◽

Chen Liu

Keyword(s):

Alzheimer’S Disease ◽

Alzheimer's Disease ◽

Random Forest ◽

Imbalanced Data ◽

Classification Method ◽

Data Sets ◽

Imbalanced Data Sets ◽

Random Forest Classification ◽

Forest Classification

Download Full-text

Minority–Majority Mix mean Oversampling Technique: An Efficient Technique to Improve Classification of Imbalanced Data Sets

Advances in Intelligent Systems and Computing - Computing in Engineering and Technology ◽

10.1007/978-981-32-9515-5_48 ◽

2019 ◽

pp. 501-509

Author(s):

Sachin Patil ◽

Shefali Sonavane

Keyword(s):

Imbalanced Data ◽

Efficient Technique ◽

Data Sets ◽

Imbalanced Data Sets

Download Full-text

A Novel Clustering Based Undersampling Algorithm for Imbalanced Data Sets Using Artificial Bee Colony Algorithm

Advances in Intelligent Systems and Computing - Innovations in Bio-Inspired Computing and Applications ◽

10.1007/978-3-030-73603-3_3 ◽

2021 ◽

pp. 32-42

Author(s):

O. A. Ajilisa ◽

V. P. Jagathyraj ◽

M. K. Sabu

Keyword(s):

Artificial Bee Colony Algorithm ◽

Artificial Bee Colony ◽

Imbalanced Data ◽

Data Sets ◽

Imbalanced Data Sets ◽

Download Full-text