imbalanced classification
Recently Published Documents


TOTAL DOCUMENTS

283
(FIVE YEARS 148)

H-INDEX

21
(FIVE YEARS 5)

2022 ◽  
Vol 34 (4) ◽  
pp. 1-17
Author(s):  
Yunhong Xu ◽  
Guangyu Wu ◽  
Yu Chen

Online medical communities have revolutionized the way patients obtain medical-related information and services. Investigating what factors might influence patients’ satisfaction with doctors and predicting their satisfaction can help patients narrow down their choices and increase their loyalty towards online medical communities. Considering the imbalanced feature of dataset collected from Good Doctor, we integrated XGBoost and SMOTE algorithm to examine what factors and these factors can be used to predict patient satisfaction. SMOTE algorithm addresses the imbalanced issue by oversampling imbalanced classification datasets. And XGBoost algorithm is an ensemble of decision trees algorithm where new trees fix errors of existing trees. The experimental results demonstrate that SMOTE and XGBoost algorithm can achieve better performance. We further analyzed the role of features played in satisfaction prediction from two levels: individual feature level and feature combination level.


Author(s):  
Li-Ming Chen ◽  
Bao-Xin Xiu ◽  
Zhao-Yun Ding

AbstractFor short text classification, insufficient labeled data, data sparsity, and imbalanced classification have become three major challenges. For this, we proposed multiple weak supervision, which can label unlabeled data automatically. Different from prior work, the proposed method can generate probabilistic labels through conditional independent model. What’s more, experiments were conducted to verify the effectiveness of multiple weak supervision. According to experimental results on public dadasets, real datasets and synthetic datasets, unlabeled imbalanced short text classification problem can be solved effectively by multiple weak supervision. Notably, without reducing precision, recall, and F1-score can be improved by adding distant supervision clustering, which can be used to meet different application needs.


2021 ◽  
pp. 108511
Author(s):  
Sebastián Maldonado ◽  
Carla Vairetti ◽  
Alberto Fernandez ◽  
Francisco Herrera

2021 ◽  
Vol 178 ◽  
pp. 115011
Author(s):  
Nuno Moniz ◽  
Vitor Cerqueira

Sign in / Sign up

Export Citation Format

Share Document