article

Three naive Bayes approaches for discrimination-free classification

Authors:

Toon Calders,

Sicco VerwerAuthors Info & Claims

Data Mining and Knowledge Discovery, Volume 21, Issue 2

Pages 277 - 292

https://doi.org/10.1007/s10618-010-0190-x

Published: 01 September 2010 Publication History

Abstract

In this paper, we investigate how to modify the naive Bayes classifier in order to perform classification that is restricted to be independent with respect to a given sensitive attribute. Such independency restrictions occur naturally when the decision process leading to the labels in the data-set was biased; e.g., due to gender or racial discrimination. This setting is motivated by many cases in which there exist laws that disallow a decision that is partly based on discrimination. Naive application of machine learning techniques would result in huge fines for companies. We present three approaches for making the naive Bayes classifier discrimination-free: (i) modifying the probability of the decision being positive, (ii) training one model for every sensitive attribute value and balancing them, and (iii) adding a latent variable to the Bayesian model that represents the unbiased label and optimizing the model parameters for likelihood using expectation maximization. We present experiments for the three approaches on both artificial and real-life data.

References

[1]

Calders T, Kamiran F, Pechenizkiy M (2009) Building classifiers with independency constraints. In: IEEE ICDM workshop on domain driven data mining. IEEE press.

Crossref

Google Scholar

[2]

Calders T, Kamiran F, Pechenizkiy M (2010) Constructing decision trees under independency constraints. Technical report, TU Eindhoven.

Google Scholar

[3]

Chan PK, Stolfo SJ (1998) Toward scalable learning with non-uniform class and cost distributions: a case study in credit card fraud detection. In: Proceedings of ACM SIGKDD, pp 164-168.

Google Scholar

[4]

Duivesteijn W, Feelders AJ (2008) Nearest neighbour classification with monotonicity constraints. In: Proceedings of ECML/PKDD'08. Springer, Berlin, pp 301-316.

Crossref

Google Scholar

[5]

Elkan C (2001) The foundations of cost-sensitive learning. In: Proceedings of IJCAI'01, pp 973-978.

Crossref

Google Scholar

[6]

Kamiran F, Calders T (2009) Classifying without discriminating. In: Proceedings of IC409. IEEE press.

Google Scholar

[7]

Kamiran F, Calders T (2010) Classification with no discrimination by preferential sampling. In: Proc. Benelearn.

Google Scholar

[8]

Kotlowski W, Dembczynski K, Greco S, Slowinski R (2007) Statistical model for rough set approach to multicriteria classification. In: Proceedings of ECML/PKDD'07. Springer, Berlin.

Crossref

Google Scholar

[9]

Margineantu DD, Dietterich TG (1999) Learning decision trees for loss minimization in multi-class problems. Technical report, Department Computer Science, Oregon State University.

Google Scholar

[10]

Nijssen S, Fromont E (2007) Mining optimal decision trees from itemset lattices. In: Proceedings of ACM SIGKDD.

Crossref

Google Scholar

[11]

Pedreschi D, Ruggieri S, Turini F (2008) Discrimination-aware data mining. In: Proceedings of ACM SIGKDD.

Crossref

Google Scholar

[12]

Pedreschi D, Ruggieri S, Turini F (2009) Measuring discrimination in socially-sensitive decision records. In: Proceedings of SIAM DM.

Google Scholar

Cited By

View all

Haider CClifton CYin M(2024)Do Crowdsourced Fairness Preferences Correlate with Risk Perceptions?Proceedings of the 29th International Conference on Intelligent User Interfaces10.1145/3640543.3645209(304-324)Online publication date: 18-Mar-2024
https://dl.acm.org/doi/10.1145/3640543.3645209
Pessach DTassa TShmueli E(2024)Fairness-Driven Private Collaborative Machine LearningACM Transactions on Intelligent Systems and Technology10.1145/363936815:2(1-30)Online publication date: 22-Feb-2024
https://dl.acm.org/doi/10.1145/3639368
Sarkar PLiem C(2024)"It's the most fair thing to do but it doesn't make any sense": Perceptions of Mathematical Fairness Notions by Hiring ProfessionalsProceedings of the ACM on Human-Computer Interaction10.1145/36373608:CSCW1(1-35)Online publication date: 26-Apr-2024
https://dl.acm.org/doi/10.1145/3637360
Show More Cited By

Index Terms

Three naive Bayes approaches for discrimination-free classification
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Classification and regression trees

Index terms have been assigned to the content through auto-classification.

Recommendations

Averaged Naive Bayes Trees: A New Extension of AODE
ACML '09: Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning

Naive Bayes (NB) is a simple Bayesian classifier that assumes the conditional independence and augmented NB (ANB) models are extensions of NB by relaxing the independence assumption. The averaged one-dependence estimators (AODE) is a classifier that ...
Naive Bayes for optimal ranking

It is well known that naive Bayes performs surprisingly well in classification, but its probability estimation is poor. AUC (the area under the receiver operating characteristics curve) is a measure different from classification accuracy and probability ...
A Novel Bayes Model: Hidden Naive Bayes

Because learning an optimal Bayesian network classifier is an NP-hard problem, learning-improved naive Bayes has attracted much attention from researchers. In this paper, we summarize the existing improved algorithms and propose a novel Bayes model: ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Data Mining and Knowledge Discovery

Data Mining and Knowledge Discovery Volume 21, Issue 2

September 2010

123 pages

ISSN:1384-5810

Issue’s Table of Contents

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 September 2010

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

212
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 04 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Haider CClifton CYin M(2024)Do Crowdsourced Fairness Preferences Correlate with Risk Perceptions?Proceedings of the 29th International Conference on Intelligent User Interfaces10.1145/3640543.3645209(304-324)Online publication date: 18-Mar-2024
https://dl.acm.org/doi/10.1145/3640543.3645209
Pessach DTassa TShmueli E(2024)Fairness-Driven Private Collaborative Machine LearningACM Transactions on Intelligent Systems and Technology10.1145/363936815:2(1-30)Online publication date: 22-Feb-2024
https://dl.acm.org/doi/10.1145/3639368
Sarkar PLiem C(2024)"It's the most fair thing to do but it doesn't make any sense": Perceptions of Mathematical Fairness Notions by Hiring ProfessionalsProceedings of the ACM on Human-Computer Interaction10.1145/36373608:CSCW1(1-35)Online publication date: 26-Apr-2024
https://dl.acm.org/doi/10.1145/3637360
Binkyte RGorla DPalamidessi C(2024)BaBE: Enhancing Fairness via Estimation of Explaining VariablesProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3659016(1917-1925)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3659016
Chan ELiu ZQiu RZhang YMaciejewski RTong H(2024)Group Fairness via Group ConsensusProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3659006(1788-1808)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3659006
Yeh MMetevier BHoag AThomas P(2024)Analyzing the Relationship Between Difference and Ratio-Based Fairness MetricsProceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency10.1145/3630106.3658922(518-528)Online publication date: 3-Jun-2024
https://dl.acm.org/doi/10.1145/3630106.3658922
Caton SHaas C(2024)Fairness in Machine Learning: A SurveyACM Computing Surveys10.1145/361686556:7(1-38)Online publication date: 9-Apr-2024
https://dl.acm.org/doi/10.1145/3616865
Yang MArai HYamashita NBaba Y(2024)Fair Machine Guidance to Enhance Fair Decision Making in Biased PeopleProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642627(1-18)Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3613904.3642627
Chen ZZhang JSarro FHarman MRoychoudhury APaiva AAbreu RStorey M(2024)Fairness Improvement with Multiple Protected Attributes: How Far Are We?Proceedings of the IEEE/ACM 46th International Conference on Software Engineering10.1145/3597503.3639083(1-13)Online publication date: 20-May-2024
https://dl.acm.org/doi/10.1145/3597503.3639083
Zheng WLin LWu XChen X(2024)An Empirical Study on Correlations Between Deep Neural Network Fairness and Neuron Coverage CriteriaIEEE Transactions on Software Engineering10.1109/TSE.2023.334900150:3(391-412)Online publication date: 1-Mar-2024
https://dl.acm.org/doi/10.1109/TSE.2023.3349001
Show More Cited By

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

References

Cited By

Index Terms

Recommendations

Averaged Naive Bayes Trees: A New Extension of AODE

Naive Bayes for optimal ranking

A Novel Bayes Model: Hidden Naive Bayes

Comments

Information

Published In

Publisher

Publication History

Author Tags

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations