Human Genetics, 130:383

Identifiability in biobanks: models, measures, and mitigation strategies

  • Bradley Malin
  • Grigorios Loukides
  • Kathleen Benitez
  • Ellen Wright Clayton
Original Investigation

DOI: 10.1007/s00439-011-1042-5

Cite this article as:
Malin, B., Loukides, G., Benitez, K. et al. Hum Genet (2011) 130: 383. doi:10.1007/s00439-011-1042-5

Abstract

The collection and sharing of person-specific biospecimens have raised significant questions regarding privacy. In particular, the question of identifiability, or the degree to which materials stored in biobanks can be linked to the names of the individuals from whom they were derived, is under scrutiny. The goal of this paper is to review the extent to which biospecimens and affiliated data can be designated as identifiable. To achieve this goal, we summarize recent research in identifiability assessment for DNA sequence data, as well as associated demographic and clinical data, shared via biobanks. We show how the degree of risk varies, which factors contribute to this variation, and how such risk can be mitigated and managed. Finally, we discuss the policy implications of these findings, particularly as they pertain to biobank security and access policies. We situate our review in the context of real data sharing scenarios and biorepositories.

Copyright information

© Springer-Verlag 2011

Authors and Affiliations

  • Bradley Malin (1, 2)
  • Grigorios Loukides (1)
  • Kathleen Benitez (1)
  • Ellen Wright Clayton (3, 4, 5)

  1. Department of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, USA
  2. Department of Electrical Engineering and Computer Science, School of Engineering, Vanderbilt University, Nashville, USA
  3. Department of Pediatrics, School of Medicine, Vanderbilt, USA
  4. Center for Biomedical Ethics and Society, School of Medicine, Vanderbilt University, Nashville, USA
  5. School of Law, Vanderbilt University, Nashville, USA