Machine Learning

, Volume 95, Issue 1, pp 129–146

Tracking people over time in 19th century Canada for longitudinal analysis

  • Luiza Antonie
  • Kris Inwood
  • Daniel J. Lizotte
  • J. Andrew Ross
Article

DOI: 10.1007/s10994-013-5421-0

Cite this article as:
Antonie, L., Inwood, K., Lizotte, D.J. et al. Mach Learn (2014) 95: 129. doi:10.1007/s10994-013-5421-0

Abstract

Linking multiple databases to create longitudinal data is an important research problem with multiple applications. Longitudinal data allows analysts to perform studies that would be unfeasible otherwise. We have linked historical census databases to create longitudinal data that allow tracking people over time. These longitudinal data have already been used by social scientists and historians to investigate historical trends and to address questions about society, history and economy, and this comparative, systematic research would not be possible without the linked data. The goal of the linking is to identify the same person in multiple census collections. Data imprecision in historical census data and the lack of unique personal identifiers make this task a challenging one. In this paper we design and employ a record linkage system that incorporates a supervised learning module for classifying pairs of records as matches and non-matches. We show that our system performs large scale linkage producing high quality links and generating sufficient longitudinal data to allow meaningful social science studies. We demonstrate the impact of the longitudinal data through a study of the economic changes in 19th century Canada.

Keywords

Record linkage Classification Historical census 

Copyright information

© The Author(s) 2013

Authors and Affiliations

  • Luiza Antonie
    • 1
  • Kris Inwood
    • 2
  • Daniel J. Lizotte
    • 3
  • J. Andrew Ross
    • 4
  1. 1.Historical Data Research UnitUniversity of GuelphGuelphCanada
  2. 2.Department of Economics and FinanceUniversity of GuelphGuelphCanada
  3. 3.David R. Cheriton School of Computer ScienceUniversity of WaterlooWaterlooCanada
  4. 4.Department of HistoryUniversity of GuelphGuelphCanada