Artificial Intelligence and Law

, Volume 22, Issue 2, pp 211–238 | Cite as

Combating discrimination using Bayesian networks

Article

Abstract

Discrimination in decision making is prohibited on many attributes (religion, gender, etc…), but often present in historical decisions. Use of such discriminatory historical decision making as training data can perpetuate discrimination, even if the protected attributes are not directly present in the data. This work focuses on discovering discrimination in instances and preventing discrimination in classification. First, we propose a discrimination discovery method based on modeling the probability distribution of a class using Bayesian networks. This measures the effect of a protected attribute (e.g., gender) in a subset of the dataset using the estimated probability distribution (via a Bayesian network). Second, we propose a classification method that corrects for the discovered discrimination without using protected attributes in the decision process. We evaluate the discrimination discovery and discrimination prevention approaches on two different datasets. The empirical results show that a substantial amount of discrimination identified in instances is prevented in future decisions.

Keywords

Discrimination discovery Discrimination prevention Bayesian network Data mining 

References

  1. Calders T, Verwer S (2010) Three naive Bayes approaches for discrimination-free classification. Data Min J (special issue with selected papers from ECML/PKDD)Google Scholar
  2. Cooper GF, Herskovits E (1991) A Bayesian method for the induction of probabilistic networks from data. Mach Learn BMIR-1991-0293Google Scholar
  3. Dwork C, Hardt M, Pitassi T, Reingold O, Zemel R (2012) Fairness through awareness. In: ITCS, pp 214–226Google Scholar
  4. Hajian S, Domingo-Ferrer J (2012) A study on the impact of data anonymization on anti-discrimination. In: IEEE ICDM international workshop on discrimination and privacy-aware data miningGoogle Scholar
  5. Hajian S, Monreale A, Pedreschi D, Domingo-Ferrer J, Giannotti F (2012) Injecting discrimination and privacy awareness into pattern discovery. In: IEEE ICDM international workshop on discrimination and privacy-aware data miningGoogle Scholar
  6. Kamiran F, Calders T (2009) Classifying without discriminating. IEEE Press, New YorkGoogle Scholar
  7. Kamiran F, Calders T (2011) Data preprocessing techniques for classification without discrimination. Knowl Inf SystGoogle Scholar
  8. Kamiran F, Calders T, Pechenizkiy M (2010) Discrimination aware decision tree learning. In: Proceedings IEEE ICDM international conference on data miningGoogle Scholar
  9. Kamiran F, Karim A, Zhang X (2012) Decision theory for discrimination-aware classification. In: IEEE international conference on data miningGoogle Scholar
  10. Luong BT, Ruggieri S, Turini F (2011) k-NN as an implementation of situation testing for discrimination discovery and prevention. In: 17th ACM international conference on knowledge discovery and data mining (KDD 2011). ACM, pp 502–510Google Scholar
  11. Mancuhan K, Clifton C (2012) Discriminatory decision policy aware classification. In: IEEE ICDM international workshop on discrimination and privacy-aware data miningGoogle Scholar
  12. Newman DJ, Hettich S, Blake CL, Merz CJ UCI repository of machine learning databases, http://archive.ics.uci.edu/ml/
  13. Pedreschi D, Ruggieri S, Turini F (2008) Discrimination-aware data mining. In: KDD conferenceGoogle Scholar
  14. Pedreschi D, Ruggieri S, Turini F (2009) Measuring discrimination in socially-sensitive decision records. In: 9th SIAM conference on data mining (SDM 2009). SIAM, pp 581–592Google Scholar
  15. Romei A, Ruggieri S (2013) A multidisciplinary survey on discrimination analysis. Knowl Eng Rev 1–57Google Scholar
  16. Ruggieri S, Pedreschi D, Turini F (2011) DCUBE: discrimination discovery in databases. In: ACM international conference on knowledge discovery and data mining (KDD 2011). ACM, pp 502–510Google Scholar
  17. Tan P.-N, Steinbach M, Kumar V (2006) Introduction to data mining. Addison-Wesley, Reading, MA, pp 227-246Google Scholar
  18. Witten IH, Frank E (2011) Data mining: practical machine learning tools and techniques. 3rd edn. Morgan Kaufmann, Los Altos, CAGoogle Scholar
  19. Zemel R, Wu Y, Swersky K, Pitassi T, Dwork C (2013) Learning fair representations. In: ICMLGoogle Scholar
  20. Zliobaite I, Kamiran F, Calders T (2011) Handling conditional discrimination. In: Proceedings IEEE ICDM international conference on data miningGoogle Scholar

Copyright information

© Springer Science+Business Media Dordrecht 2014

Authors and Affiliations

  1. 1.Purdue UniversityWest LafayetteUSA

Personalised recommendations