Three naive Bayes approaches for discrimination-free classification

Published: 01 September 2010

Abstract

In this paper, we investigate how to modify the naive Bayes classifier in order to perform classification that is restricted to be independent with respect to a given sensitive attribute. Such independency restrictions occur naturally when the decision process leading to the labels in the data-set was biased; e.g., due to gender or racial discrimination. This setting is motivated by many cases in which there exist laws that disallow a decision that is partly based on discrimination. Naive application of machine learning techniques would result in huge fines for companies. We present three approaches for making the naive Bayes classifier discrimination-free: (i) modifying the probability of the decision being positive, (ii) training one model for every sensitive attribute value and balancing them, and (iii) adding a latent variable to the Bayesian model that represents the unbiased label and optimizing the model parameters for likelihood using expectation maximization. We present experiments for the three approaches on both artificial and real-life data.
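
To make the setting concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a binary decision (1 = positive), categorical features, and that discrimination is measured as the difference in positive-decision rates between the favored group and everyone else; that measure, and the names `discrimination_score`, `CategoricalNB`, and `fit_per_group`, are illustrative assumptions rather than details stated in the abstract. It covers only the first half of approach (ii), training one naive Bayes model per sensitive attribute value; the balancing step is left out because the abstract does not specify it.

```python
from collections import Counter, defaultdict


def discrimination_score(predictions, sensitive, favored):
    """Assumed measure: P(pred = 1 | S = favored) - P(pred = 1 | S != favored)."""
    fav = [p for p, s in zip(predictions, sensitive) if s == favored]
    rest = [p for p, s in zip(predictions, sensitive) if s != favored]
    return sum(fav) / len(fav) - sum(rest) / len(rest)


class CategoricalNB:
    """Small naive Bayes over categorical features with Laplace smoothing."""

    def fit(self, rows, labels):
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        self.n = len(labels)
        # feature_counts[(j, c)][v] = how often feature j takes value v in class c
        self.feature_counts = defaultdict(Counter)
        self.values = [set() for _ in rows[0]]
        for row, c in zip(rows, labels):
            for j, v in enumerate(row):
                self.feature_counts[(j, c)][v] += 1
                self.values[j].add(v)
        return self

    def predict_proba(self, row):
        scores = {}
        for c in self.classes:
            p = self.class_counts[c] / self.n  # class prior
            for j, v in enumerate(row):
                # Laplace-smoothed estimate of P(feature j = v | class c)
                num = self.feature_counts[(j, c)][v] + 1
                den = self.class_counts[c] + len(self.values[j])
                p *= num / den
            scores[c] = p
        total = sum(scores.values())
        return {c: p / total for c, p in scores.items()}


def fit_per_group(rows, labels, sensitive):
    """Train one model per sensitive attribute value (balancing step omitted)."""
    models = {}
    for s in set(sensitive):
        idx = [i for i, v in enumerate(sensitive) if v == s]
        models[s] = CategoricalNB().fit(
            [rows[i] for i in idx], [labels[i] for i in idx]
        )
    return models


# Toy usage: positive rate 3/4 for group "m" versus 1/4 for group "f".
preds = [1, 1, 1, 0, 1, 0, 0, 0]
groups = ["m", "m", "m", "m", "f", "f", "f", "f"]
score = discrimination_score(preds, groups, favored="m")  # 0.75 - 0.25 = 0.5
```

A score of zero under this assumed measure would mean that both groups receive positive decisions at the same rate, which is the kind of independence from the sensitive attribute that the abstract describes.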

      Information & Contributors

      Information

      Published In

      Data Mining and Knowledge Discovery  Volume 21, Issue 2
      September 2010
      123 pages

      Publisher

      Kluwer Academic Publishers

      United States

      Author Tags

      1. Discrimination-aware classification
      2. Expectation maximization
      3. Naive Bayes

      Qualifiers

      • Article

      Citations

      Cited By

      • (2024) Do Crowdsourced Fairness Preferences Correlate with Risk Perceptions? Proceedings of the 29th International Conference on Intelligent User Interfaces, pp. 304-324. doi:10.1145/3640543.3645209. Online publication date: 18-Mar-2024
      • (2024) Fairness-Driven Private Collaborative Machine Learning. ACM Transactions on Intelligent Systems and Technology 15:2, pp. 1-30. doi:10.1145/3639368. Online publication date: 22-Feb-2024
      • (2024) "It's the most fair thing to do but it doesn't make any sense": Perceptions of Mathematical Fairness Notions by Hiring Professionals. Proceedings of the ACM on Human-Computer Interaction 8:CSCW1, pp. 1-35. doi:10.1145/3637360. Online publication date: 26-Apr-2024
      • (2024) BaBE: Enhancing Fairness via Estimation of Explaining Variables. Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, pp. 1917-1925. doi:10.1145/3630106.3659016. Online publication date: 3-Jun-2024
      • (2024) Group Fairness via Group Consensus. Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, pp. 1788-1808. doi:10.1145/3630106.3659006. Online publication date: 3-Jun-2024
      • (2024) Analyzing the Relationship Between Difference and Ratio-Based Fairness Metrics. Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, pp. 518-528. doi:10.1145/3630106.3658922. Online publication date: 3-Jun-2024
      • (2024) Fairness in Machine Learning: A Survey. ACM Computing Surveys 56:7, pp. 1-38. doi:10.1145/3616865. Online publication date: 9-Apr-2024
      • (2024) Fair Machine Guidance to Enhance Fair Decision Making in Biased People. Proceedings of the CHI Conference on Human Factors in Computing Systems, pp. 1-18. doi:10.1145/3613904.3642627. Online publication date: 11-May-2024
      • (2024) Fairness Improvement with Multiple Protected Attributes: How Far Are We? Proceedings of the IEEE/ACM 46th International Conference on Software Engineering, pp. 1-13. doi:10.1145/3597503.3639083. Online publication date: 20-May-2024
      • (2024) An Empirical Study on Correlations Between Deep Neural Network Fairness and Neuron Coverage Criteria. IEEE Transactions on Software Engineering 50:3, pp. 391-412. doi:10.1109/TSE.2023.3349001. Online publication date: 1-Mar-2024
