Residential Mobility and Lung Cancer Risk: Data-Driven Exploration Using Internet Sources

  • Hong-Jun Yoon
  • Georgia TourassiEmail author
  • Songhua Xu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9021)


Frequent relocation has been linked to health decline, particularly with respect to emotional and psychological wellbeing. In this paper we investigate whether there is an association between frequent relocation and lung cancer risk. For the initial investigation we used web crawling and tailored text mining to collect cancer and control subjects from online data sources. One data source includes online obituaries. The second data source includes augmented LinkedIn profiles. For each data source, the subjects’ spatiotemporal history is reconstructed from the available information provided in the obituaries and from the education and work experience provided in the LinkedIn profiles. The study shows that lung cancer subjects have higher mobility frequency than the control group. This trend is consistent for both data sources.


Residential mobility Lung cancer Social media Health data informatics 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Jelleyman, T., Spencer, N.: Residential Mobility in Childhood and Health Outcomes: A Systematic Review. J. Epidemiol Community Health 62, 584–592 (2007)CrossRefGoogle Scholar
  2. 2.
    Tønnessen, M., Telle, K., Syse, A.: Childhood Residential Mobility and Adult Outcomes. Statistics Norway Research Department. Discussion Papers (750) (2013)Google Scholar
  3. 3.
    Lin, K.C., Huang, H.C., Bai, Y.M., Kuo, P.C.: Lifetime residential mobility history and self-rated health at midlife. Journal of Epidemiology 22(2), 113–122 (2012)CrossRefGoogle Scholar
  4. 4.
    Warner, K.E., Mendez, D., Courant, P.N.: Toward a more realistic appraisal of the lung cancer risk from radon: the effects of residential mobility. American Journal of Public Health 86(9), 1222–1227 (1996)CrossRefGoogle Scholar
  5. 5.
    Krewski, D., et al.: Residential radon and risk of lung cancer: a combined analysis of 7 North American case-control studies. Epidemiology 16(2), 137–145 (2005)CrossRefGoogle Scholar
  6. 6.
    American Cancer Society: Cancer facts & figures (2014)Google Scholar
  7. 7.
    Xu, S., Yoon, H.J., Tourassi, G.D.: A user-oriented web crawler for selectively acquiring online content in e-health research. Bioinformatics 30(1), 104–114 (2014)CrossRefGoogle Scholar
  8. 8.
    Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S.J., McClosky, D.: The stanford corenlp natural language processing toolkit. In: Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55–60 (2014)Google Scholar
  9. 9.
    Determine the Gender of a First Name.

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.Health Data Sciences InstituteOak Ridge National LaboratoryOak RidgeUSA
  2. 2.Department of Information Systems, College of Computing SciencesNew Jersey Institute of Technology, University HeightsNewarkUSA

Personalised recommendations