Extracting Phenotypes from Patient Claim Records Using Nonnegative Tensor Factorization

  • Joyce C. Ho
  • Joydeep Ghosh
  • Jimeng Sun
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8609)

Abstract

Electronic health records (EHRs) are becoming an increasingly important source of patient information. Unfortunately, EHR data do not always directly and reliably map to medical concepts that clinical researchers need or use. Some recent studies have focused on EHR-derived phenotyping, which aims at mapping the EHR data to specific medical concepts; however, most of these approaches require labor intensive supervision from experienced clinical professionals.

In this paper, we use Limestone, a nonnegative tensor factorization method to derive phenotype candidates from claims data with virtually no human supervision. Limestone represents the interactions between diagnoses and procedures among patients naturally using tensors (a generalization of matrices). The resulting tensor factors are reported as phenotype candidates that automatically reveal patient clusters on specific diagnoses and procedures. To the best of our knowledge, this is the first study that successfully extracts useful phenotypes by applying sparse nonnegative tensor factorization to a large, public-domain EHR dataset covering a broad range of diseases. Our experiments demonstrate the interpretability and the promise of high-throughput phenotypes generated from tensor factorization.

Keywords

EHR phenotyping tensor factorization dimensionality reduction 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Joyce C. Ho
    • 1
  • Joydeep Ghosh
    • 1
  • Jimeng Sun
    • 2
  1. 1.Electrical and Computer Engineering DepartmentThe University of Texas at AustinAustinUSA
  2. 2.College of ComputingGeorgia Institute of TechnologyAtlantaUSA

Personalised recommendations