Annals of Operations Research

, Volume 195, Issue 1, pp 97–110

Choquet integral for record linkage

  • Daniel Abril
  • Guillermo Navarro-Arribas
  • Vicenç Torra
Article

DOI: 10.1007/s10479-011-0989-x

Cite this article as:
Abril, D., Navarro-Arribas, G. & Torra, V. Ann Oper Res (2012) 195: 97. doi:10.1007/s10479-011-0989-x

Abstract

Record linkage is used in data privacy to evaluate the disclosure risk of protected data. It models potential attacks, where an intruder attempts to link records from the protected data to the original data. In this paper we introduce a novel distance based record linkage, which uses the Choquet integral to compute the distance between records. We use a fuzzy measure to weight each subset of variables from each record. This allows us to improve standard record linkage and provide insightful information about the re-identification risk of each variable and their interaction. To do that, we use a supervised learning approach which determines the optimal fuzzy measure for the linkage.

Keywords

Data privacy Record linkage Choquet integral Optimization 

Copyright information

© Springer Science+Business Media, LLC 2011

Authors and Affiliations

  • Daniel Abril
    • 1
  • Guillermo Navarro-Arribas
    • 1
  • Vicenç Torra
    • 1
  1. 1.IIIA, Institut d’Investigació en Intel⋅ligència Artificial—CSICConsejo Superior de Investigaciones CientíficasBellaterraSpain