Learning Fair Naive Bayes Classifiers by Discovering and Eliminating Discrimination Patterns

YooJung Choi; Golnoosh Farnadi; Behrouz Babaki; Guy Van den Broeck

doi:10.1609/aaai.v34i06.6565

Learning Fair Naive Bayes Classifiers by Discovering and Eliminating Discrimination Patterns

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i06.6565 ◽

2020 ◽

Vol 34 (06) ◽

pp. 10077-10084

Author(s):

YooJung Choi ◽

Golnoosh Farnadi ◽

Behrouz Babaki ◽

Guy Van den Broeck

Keyword(s):

Real World ◽

Naive Bayes ◽

Empirical Evaluation ◽

Naïve Bayes ◽

Test Time ◽

Bayes Classifier ◽

Partial Observations ◽

Fixed Set ◽

Fairness Constraints ◽

Real World Datasets

As machine learning is increasingly used to make real-world decisions, recent research efforts aim to define and ensure fairness in algorithmic decision making. Existing methods often assume a fixed set of observable features to define individuals, but lack a discussion of certain features not being observed at test time. In this paper, we study fairness of naive Bayes classifiers, which allow partial observations. In particular, we introduce the notion of a discrimination pattern, which refers to an individual receiving different classifications depending on whether some sensitive attributes were observed. Then a model is considered fair if it has no such pattern. We propose an algorithm to discover and mine for discrimination patterns in a naive Bayes classifier, and show how to learn maximum-likelihood parameters subject to these fairness constraints. Our approach iteratively discovers and eliminates discrimination patterns until a fair model is learned. An empirical evaluation on three real-world datasets demonstrates that we can remove exponentially many discrimination patterns by only adding a small fraction of them as constraints.

Download Full-text

Learning the naive Bayes classifier with optimization models

International Journal of Applied Mathematics and Computer Science ◽

10.2478/amcs-2013-0059 ◽

2013 ◽

Vol 23 (4) ◽

pp. 787-795 ◽

Cited By ~ 30

Author(s):

Sona Taheri ◽

Musa Mammadov

Keyword(s):

Real World ◽

Naive Bayes ◽

Optimization Problems ◽

Naïve Bayes ◽

Training Data ◽

Optimization Models ◽

Naive Bayes Classifier ◽

Conditional Probabilities ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Abstract Naive Bayes is among the simplest probabilistic classifiers. It often performs surprisingly well in many real world applications, despite the strong assumption that all features are conditionally independent given the class. In the learning process of this classifier with the known structure, class probabilities and conditional probabilities are calculated using training data, and then values of these probabilities are used to classify new observations. In this paper, we introduce three novel optimization models for the naive Bayes classifier where both class probabilities and conditional probabilities are considered as variables. The values of these variables are found by solving the corresponding optimization problems. Numerical experiments are conducted on several real world binary classification data sets, where continuous features are discretized by applying three different methods. The performances of these models are compared with the naive Bayes classifier, tree augmented naive Bayes, the SVM, C4.5 and the nearest neighbor classifier. The obtained results demonstrate that the proposed models can significantly improve the performance of the naive Bayes classifier, yet at the same time maintain its simple structure.

Download Full-text

Differentially Private Naïve Bayes Classifier Using Smooth Sensitivity

Proceedings on Privacy Enhancing Technologies ◽

10.2478/popets-2021-0077 ◽

2021 ◽

Vol 2021 (4) ◽

pp. 406-419

Author(s):

Farzad Zafarani ◽

Chris Clifton

Keyword(s):

Machine Learning ◽

Differential Privacy ◽

Naive Bayes ◽

Learning Algorithm ◽

Naïve Bayes ◽

Training Data ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Real World Datasets

Abstract There is increasing awareness of the need to protect individual privacy in the training data used to develop machine learning models. Differential Privacy is a strong concept of protecting individuals. Naïve Bayes is a popular machine learning algorithm, used as a baseline for many tasks. In this work, we have provided a differentially private Naïve Bayes classifier that adds noise proportional to the smooth sensitivity of its parameters. We compare our results to Vaidya, Shafiq, Basu, and Hong [1] which scales noise to the global sensitivity of the parameters. Our experimental results on real-world datasets show that smooth sensitivity significantly improves accuracy while still guaranteeing ɛ-differential privacy.

Download Full-text

CNB-MRF: Adapting Correlative Naive Bayes Classifier and MapReduce Framework for Big Data Classification

International Review on Computers and Software (IRECOS) ◽

10.15866/irecos.v11i11.10116 ◽

2016 ◽

Vol 11 (11) ◽

pp. 1007 ◽

Cited By ~ 3

Author(s):

Chitrakant Banchhor ◽

N. Srinivasu

Keyword(s):

Big Data ◽

Naive Bayes ◽

Data Classification ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Mapreduce Framework ◽

Big Data Classification

Download Full-text

An Approach for the Segmentation of Satellite Images Using Moving KFCM and Naive Bayes Classifier

i-manager’s Journal on Electronics Engineering ◽

10.26634/jele.3.2.2117 ◽

2013 ◽

Vol 3 (2) ◽

pp. 7-15 ◽

Cited By ~ 1

Author(s):

S. Praveena ◽

◽

S.P. Singh ◽

I.V. Muralikrishna ◽

◽

...

Keyword(s):

Satellite Images ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Download Full-text

Behavior recognition in rehabilitation training based on modified naive Bayes classifier

Journal of Computer Applications ◽

10.3724/sp.j.1087.2013.03187 ◽

2013 ◽

Vol 33 (11) ◽

pp. 3187-3189

Author(s):

Yi ZHANG ◽

Cong HUANG ◽

Yuan LUO

Keyword(s):

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Behavior Recognition ◽

Rehabilitation Training

Download Full-text

Retrieval Information Using Generalized Vector Space Models And Sentiment Analysis Using Naïve Bayes Classifier For Evaluation Of Lecturers By Students

2020 Fifth International Conference on Informatics and Computing (ICIC) ◽

10.1109/icic50835.2020.9288584 ◽

2020 ◽

Author(s):

Suprianto ◽

Muhammad Fadlan ◽

Muhammad ◽

Yusni Amaliah ◽

Mussallimah

Keyword(s):

Vector Space ◽

Sentiment Analysis ◽

Naive Bayes ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Vector Space Models

Download Full-text

Design of agricultural ontology based on levy flight distributed optimization and Naïve Bayes classifier

Sadhana ◽

10.1007/s12046-021-01652-x ◽

2021 ◽

Vol 46 (3) ◽

Author(s):

Deepa Rajendran ◽

S Vigneshwari

Keyword(s):

Naive Bayes ◽

Distributed Optimization ◽

Naïve Bayes ◽

Naive Bayes Classifier ◽

Lévy Flight ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Levy Flight

Download Full-text

Determination of near-fault impulsive signals with multivariate naïve Bayes method

Natural Hazards ◽

10.1007/s11069-021-04755-0 ◽

2021 ◽

Author(s):

Deniz Ertuncay ◽

Giovanni Costa

Keyword(s):

Naive Bayes ◽

Strong Motion ◽

Naïve Bayes ◽

Strike Slip ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Earthquake Physics ◽

Near Fault ◽

A Site

AbstractNear-fault ground motions may contain impulse behavior on velocity records. To calculate the probability of occurrence of the impulsive signals, a large dataset is collected from various national data providers and strong motion databases. The dataset has a large number of parameters which carry information on the earthquake physics, ruptured faults, ground motion parameters, distance between the station and several parts of the ruptured fault. Relation between the parameters and impulsive signals is calculated. It is found that fault type, moment magnitude, distance and azimuth between a site of interest and the surface projection of the ruptured fault are correlated with the impulsiveness of the signals. Separate models are created for strike-slip faults and non-strike-slip faults by using multivariate naïve Bayes classifier method. Naïve Bayes classifier allows us to have the probability of observing impulsive signals. The models have comparable accuracy rates, and they are more consistent on different fault types with respect to previous studies.

Download Full-text

Performance of SMOTE in a random forest and naive Bayes classifier for imbalanced Hepatitis-B vaccination status

Journal of Physics Conference Series ◽

10.1088/1742-6596/1863/1/012073 ◽

2021 ◽

Vol 1863 (1) ◽

pp. 012073

Author(s):

V M Putri ◽

M Masjkur ◽

C Suhaeni

Keyword(s):

Random Forest ◽

Hepatitis B ◽

Naive Bayes ◽

Naïve Bayes ◽

Vaccination Status ◽

Hepatitis B Vaccination ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier

Download Full-text

Efficient Jamming Identification in Wireless Communication: Using Small Sample Data Driven Naive Bayes Classifier

IEEE Wireless Communications Letters ◽

10.1109/lwc.2021.3064843 ◽

2021 ◽

pp. 1-1

Author(s):

Yuxin Shi ◽

Xinjin Lu ◽

Yingtao Niu ◽

Yusheng Li.

Keyword(s):

Wireless Communication ◽

Naive Bayes ◽

Naïve Bayes ◽

Small Sample ◽

Data Driven ◽

Naive Bayes Classifier ◽

Bayes Classifier ◽

Naïve Bayes Classifier ◽

Sample Data

Download Full-text