Adaptive Hybrid Sampling Algorithm Based on BIRCH Clustering

Mapping Intimacies ◽

10.1109/itnec52019.2021.9587242 ◽

2021 ◽

Author(s):

Xuanrui Xiong ◽

Yang Huang ◽

Yuan Zhang ◽

Fan Zhang ◽

Yumei Jia ◽

...

Keyword(s):

Sampling Algorithm ◽

Hybrid Sampling

Download Full-text

A Novel Hybrid Sampling Algorithm for Solving Class Imbalance Problem in Big Data

Advances in Data Science and Adaptive Analysis ◽

10.1142/s2424922x21500054 ◽

2021 ◽

pp. 2150005

Author(s):

Khyati Ahlawat ◽

Anuradha Chug ◽

Amit Prakash Singh

Keyword(s):

Big Data ◽

Class Imbalance ◽

Support Vector ◽

Efficiency Gain ◽

Learning Approaches ◽

K Nearest Neighbor ◽

Class Imbalance Problem ◽

Sampling Algorithm ◽

Imbalance Problem ◽

Hybrid Sampling

The uneven distribution of classes in any dataset poses a tendency of biasness toward the majority class when analyzed using any standard classifier. The instances of the significant class being deficient in numbers are generally ignored and their correct classification which is of paramount interest is often overlooked in calculating overall accuracy. Therefore, the conventional machine learning approaches are rigorously refined to address this class imbalance problem. This challenge of imbalanced classes is more prevalent in big data scenario due to its high volume. This study deals with acknowledging a sampling solution based on cluster computing in handling class imbalance problems in the case of big data. The newly proposed approach hybrid sampling algorithm (HSA) is assessed using three popular classification algorithms namely, support vector machine, decision tree and k-nearest neighbor based on balanced accuracy and elapsed time. The results obtained from the experiment are considered promising with an efficiency gain of 42% in comparison to the traditional sampling solution synthetic minority oversampling technique (SMOTE). This work proves the effectiveness of the distribution and clustering principle in imbalanced big data scenarios.

Download Full-text

Hybrid sampling Algorithm Based on Ant Colony Optimization and k-means Clustering

Indian Journal of Computer Science and Engineering ◽

10.21817/indjcse/2021/v12i2/211202131 ◽

2021 ◽

Vol 12 (2) ◽

pp. 445-455

Author(s):

Mrs. S. Santha Subbulaxmi ◽

G. Arumugam Dr.

Keyword(s):

Ant Colony Optimization ◽

Sampling Algorithm ◽

Hybrid Sampling

Download Full-text

An Unbalanced Data Hybrid-Sampling Algorithm Based on Multi-Information Fusion

GLOBECOM 2017 - 2017 IEEE Global Communications Conference ◽

10.1109/glocom.2017.8254481 ◽

2017 ◽

Author(s):

Sijia Chen ◽

Bin Song ◽

Jie Guo ◽

Xiaojiang Du

Keyword(s):

Information Fusion ◽

Unbalanced Data ◽

Sampling Algorithm ◽

Hybrid Sampling

Download Full-text

A hybrid sampling algorithm combining M-SMOTE and ENN based on Random forest for medical imbalanced data

Journal of Biomedical Informatics ◽

10.1016/j.jbi.2020.103465 ◽

2020 ◽

Vol 107 ◽

pp. 103465 ◽

Author(s):

Zhaozhao Xu ◽

Derong Shen ◽

Tiezheng Nie ◽

Yue Kou

Keyword(s):

Random Forest ◽

Imbalanced Data ◽

Sampling Algorithm ◽

Hybrid Sampling

Download Full-text

A depth-dependent derandomized sampling algorithm for lattice decoding

Automation, Mechanical and Electrical Engineering ◽

10.2495/amee141112 ◽

2014 ◽

Author(s):

X.S. Chen ◽

Y.H. Sun ◽

R.Z. Yang ◽

Y.H. Zhang

Keyword(s):

Sampling Algorithm ◽

Lattice Decoding

Download Full-text

Method for 3D modeling of radar coverage based on hybrid sampling

JOURNAL OF ELECTRONIC MEASUREMENT AND INSTRUMENT ◽

10.3724/sp.j.1187.2010.00010 ◽

2010 ◽

Vol 24 (1) ◽

pp. 10-16 ◽

Author(s):

Hang Qiu ◽

Leiting Chen ◽

Jim X Chen

Keyword(s):

3D Modeling ◽

Hybrid Sampling

Download Full-text

A General Importance Sampling Algorithm for Estimating Portfolio Loss Probabilities in Linear Factor Models

SSRN Electronic Journal ◽

10.2139/ssrn.2556527 ◽

2015 ◽

Author(s):

Alexandre Scott ◽

Adam Metzler

Keyword(s):

Importance Sampling ◽

Factor Models ◽

Sampling Algorithm ◽

Linear Factor ◽

General Importance ◽

Loss Probabilities ◽

Download Full-text

Sequential Gibbs Sampling Algorithm for Cognitive Diagnosis Models with Many Attributes

Multivariate Behavioral Research ◽

10.1080/00273171.2021.1896352 ◽

2021 ◽

pp. 1-37

Author(s):

Juntao Wang ◽

Ningzhong Shi ◽

Xue Zhang ◽

Gongjun Xu

Keyword(s):

Gibbs Sampling ◽

Cognitive Diagnosis ◽

Sampling Algorithm ◽

Cognitive Diagnosis Models ◽

Gibbs Sampling Algorithm

Download Full-text

An improved exact sampling algorithm for the standard normal distribution

Computational Statistics ◽

10.1007/s00180-021-01136-w ◽

2021 ◽

Author(s):

Yusong Du ◽

Baoying Fan ◽

Baodian Wei

Keyword(s):

Normal Distribution ◽

Standard Normal Distribution ◽

Sampling Algorithm ◽

Exact Sampling ◽

Standard Normal

Download Full-text

A Sequential Importance Sampling Algorithm for Counting Linear Extensions

Journal of Experimental Algorithmics ◽

10.1145/3385650 ◽

2020 ◽

Vol 25 ◽

pp. 1-14

Author(s):

Alathea Jensen ◽

Isabel Beichl

Keyword(s):

Importance Sampling ◽

Sequential Importance Sampling ◽

Linear Extensions ◽

Sampling Algorithm

Download Full-text