Integration of Spatial Distribution in Imaging-Genetics

  • Vaishnavi SubramanianEmail author
  • Weizhao Tang
  • Benjamin Chidester
  • Jian Ma
  • Minh N. Do
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11071)


To better understand diseases such as cancer, it is crucial for computational inference to quantify the spatial distribution of various cell types within a tumor. To this end, we used Ripley’s K-statistic, which captures the spatial distribution patterns at different scales of both individual point sets and interactions between multiple point sets. We propose to improve the expressivity of histopathology image features by incorporating this descriptor to capture potential cellular interactions, especially interactions between lymphocytes and epithelial cells. We demonstrate the utility of the Ripley’s K-statistic by analyzing digital slides from 710 TCGA breast invasive carcinoma (BRCA) patients. In particular, we consider its use in the context of imaging-genetics to understand correlations between gene expression and image features using canonical correlation analysis (CCA). Our analysis shows that including these spatial features leads to more significant associations between image features and gene expression.


  1. 1.
    Stanton, S.E., Disis, M.L.: Clinical significance of tumor-infiltrating lymphocytes in breast cancer. J. Immunother. Cancer 4(1), 59 (2016)CrossRefGoogle Scholar
  2. 2.
    Chidester, B., Do, M., Ma, J.: Discriminative bag-of-cells for imaging-genomics. In: Pacific Symposium on Biocomputing (2018)Google Scholar
  3. 3.
    Janowczyk, A., et al.: Deep learning for digital pathology image analysis: a comprehensive tutorial with selected use cases. J. Pathol. Inf. (2016)Google Scholar
  4. 4.
    Chang, Y.H., et al.: Quantitative analysis of histological tissue image based on cytological profiles and spatial statistics. In: Engineering in Medicine and Biology Society (EMBC). pp. 1175–1178. IEEE (2016)Google Scholar
  5. 5.
    Yuan, Y., et al.: Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling. Sci. Transl. Med. 4(157), 157ra143 (2012)CrossRefGoogle Scholar
  6. 6.
    Dixon, P.M.: Ripley’s K function. In: Encyclopedia of Environmetrics (2002)Google Scholar
  7. 7.
    Cancer Genome Atlas Network: Comprehensive molecular portraits of human breast tumours. Nature 490(7418), 61 (2012)Google Scholar
  8. 8.
    Subramanian, V., et al.: Correlating cellular features with gene expression using CCA. In: IEEE International Symposium on Biomedical Imaging, p. 805 (2018)Google Scholar
  9. 9.
    Carpenter, A., et al.: CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 7(10), R100 (2006)CrossRefGoogle Scholar
  10. 10.
    Hotelling, H.: Relations between two sets of variates. Biometrika 28, 321–377 (1936)CrossRefGoogle Scholar
  11. 11.
    Witten, D.M., et al.: A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 10(3), 515–534 (2009)CrossRefGoogle Scholar
  12. 12.
    Cerami, E., et al.: The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2(5) (2012)Google Scholar
  13. 13.
    Huang, D.W., et al.: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4(1), 44 (2008)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Vaishnavi Subramanian
    • 1
    Email author
  • Weizhao Tang
    • 1
  • Benjamin Chidester
    • 2
  • Jian Ma
    • 2
  • Minh N. Do
    • 1
  1. 1.Department of Electrical and Computer EngineeringUniversity of Illinois at Urbana-ChampaignUrbanaUSA
  2. 2.School of Computer ScienceCarnegie Mellon UniversityPittsburghUSA

Personalised recommendations