Fair Outlier Detection

P., Deepak; Sam Abraham, Savitha

doi:10.1007/978-3-030-62008-0_31

Deepak P.^13,14 &
Savitha Sam Abraham¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12343))

Included in the following conference series:

International Conference on Web Information Systems Engineering

1280 Accesses
3 Citations

The original version of this chapter was revised: the first author’s name was corrected to Deepak P. The correction to this chapter is available at https://doi.org/10.1007/978-3-030-62008-0_42

Abstract

An outlier detection method may be considered fair over specified sensitive attributes if the results of outlier detection are not skewed towards particular groups defined on such sensitive attributes. In this paper, we consider, for the first time to our best knowledge, the task of fair outlier detection. Our focus is on the task of fair outlier detection over multiple multi-valued sensitive attributes (e.g., gender, race, religion, nationality, marital status etc.), one that has broad applications across web data scenarios. We propose a fair outlier detection method, FairLOF, that is inspired by the popular LOF formulation for neighborhood-based outlier detection. We outline ways in which unfairness could be induced within LOF and develop three heuristic principles to enhance fairness, which form the basis of the FairLOF method. Being a novel task, we develop an evaluation framework for fair outlier detection, and use that to benchmark FairLOF on quality and fairness of results. Through an extensive empirical evaluation over real-world datasets, we illustrate that FairLOF is able to achieve significant improvements in fairness at sometimes marginal degradations on result quality as measured against the fairness-agnostic LOF method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

FairLOF: Fairness in Outlier Detection

Article Open access 29 August 2021

On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study

Article 16 January 2016

Automatic detection of discordant outliers via the Ueda’s method

Article Open access 30 September 2015

Change history

16 December 2020
The original version of this chapter was revised. The first author’s name was corrected to Deepak P.

Notes

1.
https://www.nyclu.org/en/stop-and-frisk-data.
2.
https://www.cnet.com/features/is-facebook-censoring-conservatives-or-is-moderating-just-too-hard/.
3.
\(|N_k(X)|\) could be greater than k in case there is a tie for the \(k^{th}\) place.
4.
https://www.statista.com/statistics/382525/share-of-police-officers-in-england-and-wales-gender-rank/.
5.
https://en.wikipedia.org/wiki/Reservation_in_India.
6.
https://en.wikipedia.org/wiki/AOL_search_data_leak.

References

Abraham, S.S., Deepak, P., Sundaram, S.S.: Fairness in clustering with multiple sensitive attributes. In: EDBT, pp. 287–298 (2020)
Google Scholar
Asudeh, A., Jagadish, H., Stoyanovich, J., Das, G.: Designing fair ranking schemes. In: SIGMOD (2019)
Google Scholar
Babaei, K., Chen, Z., Maul, T.: Detecting point outliers using prune-based outlier factor (PLOF). arXiv preprint arXiv:1911.01654 (2019)
Barocas, S., Selbst, A.D.: Big data’s disparate impact. Calif. Law. Rev. 104, 671 (2016)
Google Scholar
Bei, X., Liu, S., Poon, C.K., Wang, H.: Candidate selections with proportional fairness constraints. In: AAMAS (2020)
Google Scholar
Bera, S.K., Chakrabarty, D., Flores, N., Negahbani, M.: Fair algorithms for clustering. In: NeurIPS, pp. 4955–4966 (2019)
Google Scholar
Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. In: SIGMOD, pp. 93–104 (2000)
Google Scholar
Chandola, V., Banerjee, A., Kumar, V.: Outlier detection: a survey. ACM Comput. Surv. 14, 15 (2007)
Google Scholar
Chawla, S., Sun, P.: SLOM: a new measure for local spatial outliers. Knowl. Inf. Syst. 9(4), 412–429 (2006). https://doi.org/10.1007/s10115-005-0200-2
Article Google Scholar
Chen, J., Sathe, S., Aggarwal, C., Turaga, D.: Outlier detection with autoencoder ensembles. In: SDM (2017)
Google Scholar
Chierichetti, F., Kumar, R., Lattanzi, S., Vassilvitskii, S.: Fair clustering through fairlets. In: NIPS (2017)
Google Scholar
Chouldechova, A., Roth, A.: A snapshot of the frontiers of fairness in machine learning. Commun. ACM 63(5), 82–89 (2020)
Article Google Scholar
Davidson, I., Ravi, S.: A framework for determining the fairness of outlier detection. In: ECAI (2020)
Google Scholar
Domingues, R., Filippone, M., Michiardi, P., Zouaoui, J.: A comparative evaluation of outlier detection algorithms: experiments and analyses. Pattern Recogn. 74, 406–421 (2018)
Article Google Scholar
Fan, W., Bouguila, N., Ziou, D.: Unsupervised anomaly intrusion detection via localized Bayesian feature selection. In: ICDM (2011)
Google Scholar
Hawkins, D.M.: Identification of Outliers. MSAP, vol. 11. Springer, Dordrecht (1980). https://doi.org/10.1007/978-94-015-3994-4
Book MATH Google Scholar
Huang, L., Vishnoi, N.K.: Stable and fair classification. arXiv:1902.07823 (2019)
Jabez, J., Muthukumar, B.: Intrusion detection system (IDS): anomaly detection using outlier detection approach. Procedia Comput. Sci. 48, 338–346 (2015)
Article Google Scholar
Knight, C.: Luck Egalitarianism: Equality, Responsibility, and Justice. EUP, Edinburgh (2009)
Google Scholar
Kriegel, H.P., Kröger, P., Schubert, E., Zimek, A.: LoOP: local outlier probabilities. In: CIKM (2009)
Google Scholar
Kumar, V., Kumar, D., Singh, R.: Outlier mining in medical databases: an application of data mining in health care management to detect abnormal values presented in medical databases. IJCSNS Int. J. Comput. Sci. Netw. Secur. 8, 272–277 (2008)
Google Scholar
Olfat, M., Aswani, A.: Convex formulations for fair principal component analysis. In: AAAI, vol. 33, pp. 663–670 (2019)
Google Scholar
Deepak, P.: Whither fair clustering? In: AI for Social Good Workshop (2020)
Google Scholar
Patro, G.K., et al.: Incremental fairness in two-sided market platforms: on updating recommendations fairly. In: AAAI (2020)
Google Scholar
Pawar, A.D., Kalavadekar, P.N., Tambe, S.N.: A survey on outlier detection techniques for credit card fraud detection. IOSR J. Comput. Eng. 16(2), 44–48 (2014)
Article Google Scholar
Pearson, K.: Vii. Note on regression and inheritance in the case of two parents. Proc. R. Soc. London 58(347–352), 240–242 (1895)
Google Scholar
Rawls, J.: A Theory of Justice. Harvard University Press, Cambridge (1971)
Google Scholar
Schubert, E., Zimek, A., Kriegel, H.-P.: Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection. Data Min. Knowl. Disc. 28(1), 190–237 (2012). https://doi.org/10.1007/s10618-012-0300-z
Article MathSciNet MATH Google Scholar
Wang, B., Davidson, I.: Towards fair deep clustering with multi-state protected variables. arXiv preprint arXiv:1901.10053 (2019)
Yu, D., Sheikholeslami, G., Zhang, A.: FindOut: finding outliers in very large datasets. Knowl. Inf. Syst. 4(4), 387–412 (2002). https://doi.org/10.1007/s101150200013
Article Google Scholar
Zafar, M.B., Valera, I., Rodriguez, M.G., Gummadi, K.P.: Fairness constraints: mechanisms for fair classification. arXiv preprint arXiv:1507.05259 (2015)
Zehlike, M., Bonchi, F., Castillo, C., Hajian, S., Megahed, M., Baeza-Yates, R.: FA*IR: a fair top-k ranking algorithm. In: CIKM, pp. 1569–1578 (2017)
Google Scholar
Zhang, K., Hutter, M., Jin, H.: A new local distance-based outlier detection approach for scattered real-world data. In: PAKDD (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Queen’s University Belfast, Belfast, UK
Deepak P.
Indian Institute of Technology Madras, Chennai, India
Deepak P. & Savitha Sam Abraham

Authors

Deepak P.
View author publications
You can also search for this author in PubMed Google Scholar
Savitha Sam Abraham
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Deepak P. .

Editor information

Editors and Affiliations

VU Amsterdam, Amsterdam, The Netherlands
Zhisheng Huang
VU Amsterdam, Amsterdam, The Netherlands
Wouter Beek
Victoria University, Melbourne, VIC, Australia
Hua Wang
Swinburne University of Technology, Hawthorn, VIC, Australia
Rui Zhou
Victoria University, Melbourne, VIC, Australia
Yanchun Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

P., D., Sam Abraham, S. (2020). Fair Outlier Detection. In: Huang, Z., Beek, W., Wang, H., Zhou, R., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2020. WISE 2020. Lecture Notes in Computer Science(), vol 12343. Springer, Cham. https://doi.org/10.1007/978-3-030-62008-0_31

Download citation

DOI: https://doi.org/10.1007/978-3-030-62008-0_31
Published: 21 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-62007-3
Online ISBN: 978-3-030-62008-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fair Outlier Detection

Abstract

Access this chapter

Similar content being viewed by others

FairLOF: Fairness in Outlier Detection

On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study

Automatic detection of discordant outliers via the Ueda’s method

Change history

16 December 2020

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Fair Outlier Detection

Abstract

Access this chapter

Similar content being viewed by others

FairLOF: Fairness in Outlier Detection

On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study

Automatic detection of discordant outliers via the Ueda’s method

Change history

16 December 2020

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation