Knowledge and Information Systems

, Volume 45, Issue 2, pp 389–416

Effective record linkage for mining campaign contribution data

  • C. Giraud-Carrier
  • J. Goodliffe
  • B. M. Jones
  • S. Cueva
Regular Paper

DOI: 10.1007/s10115-014-0812-5

Cite this article as:
Giraud-Carrier, C., Goodliffe, J., Jones, B.M. et al. Knowl Inf Syst (2015) 45: 389. doi:10.1007/s10115-014-0812-5

Abstract

Up to now, most campaign contribution data have been reported at the level of the donation. While these are interesting, one often needs to have information at the level of the donor. Obtaining information at that level is difficult as there is neither a unique repository of donations nor any standard across existing repositories. In order to more meaningfully mine campaign contribution data, political scientists need an accurate way of grouping, or linking, together donations made by the same donor. In this paper, we describe a record linkage technique that is applicable to various sources and across large geographical areas. We show how it may be effectively applied in the context of nationwide donation data and report on new, previously unattainable results about campaign contributors in the 2007–2008 US election cycle.

Keywords

Record linkage Multiset distance Domain knowledge  Campaign contributions Political data 

Copyright information

© Springer-Verlag London 2014

Authors and Affiliations

  • C. Giraud-Carrier
    • 1
  • J. Goodliffe
    • 2
  • B. M. Jones
    • 2
  • S. Cueva
    • 1
  1. 1.Department of Computer ScienceBrigham Young UniversityProvoUSA
  2. 2.Department of Political ScienceBrigham Young UniversityProvoUSA