Deterministic and Probabilistic Record Linkage: an Application to Primary Care Data
- 71 Downloads
In the last decades, the availability of electronic records routinely collected in various health care settings has increased. The data sources include clinical databases, such as primary care databases, and administrative databases, such as electronic health record of hospital admissions, in-hospital procedures, and reimbursed medications. These data present opportunities for innovative research to improve patient care and to inform decisions in public health and clinical practice [1, 2].
In order to take advantage of the available data sources, linking procedures are important and consist of matching records of two or more datasets by means of common identifiers .
The difficulties of record linkage vary with the structure and the quality of the databases being linked. The linking variables may not uniquely identify an individual, are prone to errors and/or can be missing.
Two approaches for record linkage are possible, namely the deterministic and the probabilistic...
This study has been supported by the Italian College of General Practitioners and Primary Care.
Compliance with ethical standards
Conflict of interests
- 1.Morrato, E. H., Elias, M., and Gericke, C. A., Using population-based routine data for evidence-based health policy decisions: Lessons from three examples of setting and evaluating national health policy in Australia, the UK and the USA. J Public Health (Oxf) 29(4):463–471, 2007.CrossRefGoogle Scholar
- 6.Christen, P. Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection. Springer Science & Business Media. 2012.Google Scholar
- 7.Christen, P., and Goiser, K., Quality and complexity measures for data linkage and deduplication. Quality Measures in Data Mining 43:127–151, 2007.Google Scholar
- 8.Wasi, N., and Flaaen, A., Record linkage using STATA: Pre-processing, linking and reviewing utilities. The Stata Journal 15:672–697, 2014.Google Scholar
- 9.Sariyar, M., and Borg, A., The RecordLinkage package: Detecting errors in data. The R Journal 2:61–67, 2010.Google Scholar