Abstract
An outlier detection method may be considered fair over specified sensitive attributes if the results of outlier detection are not skewed towards particular groups defined on such sensitive attributes. In this paper, we consider, for the first time to our best knowledge, the task of fair outlier detection. Our focus is on the task of fair outlier detection over multiple multi-valued sensitive attributes (e.g., gender, race, religion, nationality, marital status etc.), one that has broad applications across web data scenarios. We propose a fair outlier detection method, FairLOF, that is inspired by the popular LOF formulation for neighborhood-based outlier detection. We outline ways in which unfairness could be induced within LOF and develop three heuristic principles to enhance fairness, which form the basis of the FairLOF method. Being a novel task, we develop an evaluation framework for fair outlier detection, and use that to benchmark FairLOF on quality and fairness of results. Through an extensive empirical evaluation over real-world datasets, we illustrate that FairLOF is able to achieve significant improvements in fairness at sometimes marginal degradations on result quality as measured against the fairness-agnostic LOF method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Change history
16 December 2020
The original version of this chapter was revised. The first author’s name was corrected to Deepak P.
Notes
- 1.
- 2.
- 3.
\(|N_k(X)|\) could be greater than k in case there is a tie for the \(k^{th}\) place.
- 4.
- 5.
- 6.
References
Abraham, S.S., Deepak, P., Sundaram, S.S.: Fairness in clustering with multiple sensitive attributes. In: EDBT, pp. 287–298 (2020)
Asudeh, A., Jagadish, H., Stoyanovich, J., Das, G.: Designing fair ranking schemes. In: SIGMOD (2019)
Babaei, K., Chen, Z., Maul, T.: Detecting point outliers using prune-based outlier factor (PLOF). arXiv preprint arXiv:1911.01654 (2019)
Barocas, S., Selbst, A.D.: Big data’s disparate impact. Calif. Law. Rev. 104, 671 (2016)
Bei, X., Liu, S., Poon, C.K., Wang, H.: Candidate selections with proportional fairness constraints. In: AAMAS (2020)
Bera, S.K., Chakrabarty, D., Flores, N., Negahbani, M.: Fair algorithms for clustering. In: NeurIPS, pp. 4955–4966 (2019)
Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. In: SIGMOD, pp. 93–104 (2000)
Chandola, V., Banerjee, A., Kumar, V.: Outlier detection: a survey. ACM Comput. Surv. 14, 15 (2007)
Chawla, S., Sun, P.: SLOM: a new measure for local spatial outliers. Knowl. Inf. Syst. 9(4), 412–429 (2006). https://doi.org/10.1007/s10115-005-0200-2
Chen, J., Sathe, S., Aggarwal, C., Turaga, D.: Outlier detection with autoencoder ensembles. In: SDM (2017)
Chierichetti, F., Kumar, R., Lattanzi, S., Vassilvitskii, S.: Fair clustering through fairlets. In: NIPS (2017)
Chouldechova, A., Roth, A.: A snapshot of the frontiers of fairness in machine learning. Commun. ACM 63(5), 82–89 (2020)
Davidson, I., Ravi, S.: A framework for determining the fairness of outlier detection. In: ECAI (2020)
Domingues, R., Filippone, M., Michiardi, P., Zouaoui, J.: A comparative evaluation of outlier detection algorithms: experiments and analyses. Pattern Recogn. 74, 406–421 (2018)
Fan, W., Bouguila, N., Ziou, D.: Unsupervised anomaly intrusion detection via localized Bayesian feature selection. In: ICDM (2011)
Hawkins, D.M.: Identification of Outliers. MSAP, vol. 11. Springer, Dordrecht (1980). https://doi.org/10.1007/978-94-015-3994-4
Huang, L., Vishnoi, N.K.: Stable and fair classification. arXiv:1902.07823 (2019)
Jabez, J., Muthukumar, B.: Intrusion detection system (IDS): anomaly detection using outlier detection approach. Procedia Comput. Sci. 48, 338–346 (2015)
Knight, C.: Luck Egalitarianism: Equality, Responsibility, and Justice. EUP, Edinburgh (2009)
Kriegel, H.P., Kröger, P., Schubert, E., Zimek, A.: LoOP: local outlier probabilities. In: CIKM (2009)
Kumar, V., Kumar, D., Singh, R.: Outlier mining in medical databases: an application of data mining in health care management to detect abnormal values presented in medical databases. IJCSNS Int. J. Comput. Sci. Netw. Secur. 8, 272–277 (2008)
Olfat, M., Aswani, A.: Convex formulations for fair principal component analysis. In: AAAI, vol. 33, pp. 663–670 (2019)
Deepak, P.: Whither fair clustering? In: AI for Social Good Workshop (2020)
Patro, G.K., et al.: Incremental fairness in two-sided market platforms: on updating recommendations fairly. In: AAAI (2020)
Pawar, A.D., Kalavadekar, P.N., Tambe, S.N.: A survey on outlier detection techniques for credit card fraud detection. IOSR J. Comput. Eng. 16(2), 44–48 (2014)
Pearson, K.: Vii. Note on regression and inheritance in the case of two parents. Proc. R. Soc. London 58(347–352), 240–242 (1895)
Rawls, J.: A Theory of Justice. Harvard University Press, Cambridge (1971)
Schubert, E., Zimek, A., Kriegel, H.-P.: Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection. Data Min. Knowl. Disc. 28(1), 190–237 (2012). https://doi.org/10.1007/s10618-012-0300-z
Wang, B., Davidson, I.: Towards fair deep clustering with multi-state protected variables. arXiv preprint arXiv:1901.10053 (2019)
Yu, D., Sheikholeslami, G., Zhang, A.: FindOut: finding outliers in very large datasets. Knowl. Inf. Syst. 4(4), 387–412 (2002). https://doi.org/10.1007/s101150200013
Zafar, M.B., Valera, I., Rodriguez, M.G., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. arXiv preprint arXiv:1507.05259 (2015)
Zehlike, M., Bonchi, F., Castillo, C., Hajian, S., Megahed, M., Baeza-Yates, R.: FA*IR: a fair top-k ranking algorithm. In: CIKM, pp. 1569–1578 (2017)
Zhang, K., Hutter, M., Jin, H.: A new local distance-based outlier detection approach for scattered real-world data. In: PAKDD (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
P., D., Sam Abraham, S. (2020). Fair Outlier Detection. In: Huang, Z., Beek, W., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2020. WISE 2020. Lecture Notes in Computer Science(), vol 12343. Springer, Cham. https://doi.org/10.1007/978-3-030-62008-0_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-62008-0_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62007-3
Online ISBN: 978-3-030-62008-0
eBook Packages: Computer ScienceComputer Science (R0)