Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Record Linkage

  • Josep Domingo-Ferrer
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1504

Synonyms

Record matching; Re-identification

Definition

Record linkage is a computational procedure for linking each record a in file A (e.g., a file masked for disclosure protection) to a record b in file B (original file). The pair (a, b) is a match if b turns out to be the original record corresponding to a.

Key Points

Record linkage techniques were created for data fusion and to increase data quality. However, they have also found an application in measuring the risk of identity disclosure in statistical disclosure control. In the SDC context, it is assumed that an intruder has an external dataset sharing some (key or outcome) attributes with the released protected dataset and containing additionally some identifier attributes (e.g., passport number, full name, etc.). The intruder is assumed to attempt to link the protected dataset with the external dataset using the shared attributes. The number of matches gives an estimation of the number of protected records whose respondent can...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Fellegi IP, Sunter AB. A theory for record linkage. J Am Stat Assoc. 1969;64(328):1183–210.zbMATHCrossRefGoogle Scholar
  2. 2.
    Torra V, Domingo-Ferrer J. Record linkage methods for multidatabase data mining. In: Torra V, editor. Information fusion in data mining. Berlin: Springer; 2003. p. 101–32.zbMATHCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Universitat Rovira i VirgiliTarragonaSpain

Section editors and affiliations

  • Elena Ferrari
    • 1
  1. 1.DiSTAUniv. of InsubriaVareseItaly