Information Bottleneck for Pathway-Centric Gene Expression Analysis

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8753)


While DNA microarrays enable us to conveniently measure expression profiles in the scope of thousands of genes, the subsequent association studies typically suffer from a tremendous imbalance between number of variables (genes) and observations (subjects). Even more so, each gene is heavily perturbed by noise which prevents any meaningful analysis on the single-gene level [6]. Hence, the focus shifted to pathways as groups of functionally related genes [4], in the hope that aggregation potentiates the underlying signal. Technically, this leads to a problem of feature extraction which was previously tackled by principal component analysis [5]. We reformulate the task using an extension of the Meta-Gaussian Information Bottleneck method as a means to compress a gene set while preserving information about a relevance variable. This opens up new possibilities, enabling us to make use of clinical side information in order to uncover hidden characteristics in the data.


Copula Model Gaussian Copula Irrelevance Variable Gaussian Random Vector Univariate Margin 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Baffert, F., Thurston, G., Rochon-Duck, M., Le, T., Brekken, R., McDonald, D.M.: Age-related changes in vascular endothelial growth factor dependency and angiopoietin-1-induced plasticity of adult blood vessels. Circ. Res. 94(984), 984–992 (2004)CrossRefGoogle Scholar
  2. 2.
    Castle, S.C.: Clinical relevance of age-related immune dysfunction. Clin. Infect. Dis. 31(2), 578–585 (2000)CrossRefGoogle Scholar
  3. 3.
    Chechik, G., Globerson, A., Tishby, N., Weiss, Y.: Information Bottleneck for Gaussian Variables. J. Mach. Learn. Res. 6, 165–188 (2005)MathSciNetzbMATHGoogle Scholar
  4. 4.
    Curtis, R.K., Oresic, M., Vidal-Puig, A.: Pathways to the analysis of microarray data. Trends Biotechnol. 23(8), 429–435 (2005)CrossRefGoogle Scholar
  5. 5.
    Drier, Y., Sheffer, M., Domany, E.: Pathway-based personalized analysis of cancer. Proc. Natl. Acad. Sci. 110(16), 6388–6393 (2013)CrossRefGoogle Scholar
  6. 6.
    Ein-Dor, L., Zuk, O., Domany, E.: Thousands of samples are needed to generate a robust gene list for predicting outcome in cancer. Proc. Natl. Acad. Sci. 103, 5923–5928 (2006)CrossRefGoogle Scholar
  7. 7.
    Elsaleh, H., Joseph, D., Grieu, F., Zeps, N., Spry, N., Iacopetta, B.: Association of tumour site and sex with survival benefit from adjuvant chemotherapy in colorectal cancer. Lancet 355(9217), 1745–1750 (2000)CrossRefGoogle Scholar
  8. 8.
    Globerson, A., Tishby, N.: On the Optimality of the Gaussian Information Bottleneck Curve. The Hebrew University of Jerusalem. Technical report (2004)Google Scholar
  9. 9.
    Hoff, P.D.: Extending the rank likelihood for semiparametric copula estimation. Ann. Appl. Stat. 1(1), 265–283 (2007)MathSciNetCrossRefzbMATHGoogle Scholar
  10. 10.
    Licastro, F., Candore, G., Lio, D., Porcellini, E., Colonna-Romano, G., Franceschi, C., Caruso, C.: Innate immunity and inflammation in ageing: a key for understanding age-related diseases. Immun. Ageing 2(1), 8 (2005)CrossRefGoogle Scholar
  11. 11.
    Migliore, L., Coppede, F.: Genetic and environmental factors in cancer and neurodegenerative diseases. Mutat. Res. 512(2–3), 135–153 (2002)CrossRefGoogle Scholar
  12. 12.
    Pal, S.K., Hurria, A.: Impact of age, sex, and comorbidity on cancer therapy and disease progression. J. Clin. Oncol. 28(26), 4086–4093 (2010)CrossRefGoogle Scholar
  13. 13.
    Rey, M., Roth, V.: Meta-gaussian information bottleneck. Adv. Neural Inf. Process. Syst. 25, 1925–1933 (2012)Google Scholar
  14. 14.
    Sheffer, M., Bacolod, M.D., Zuk, O., Giardina, S.F., Pincas, H., Barany, F., Paty, P.B., Gerald, W.L., Notterman, D.A., Domany, E.: Association of survival and disease progression with chromosomal instability: a genomic exploration of colorectal cancer. Proc. Natl. Acad. Sci. 106(17), 7131–7136 (2009)CrossRefGoogle Scholar
  15. 15.
    Sklar, A.: Fonctions de répartition à n dimensions et leurs marges. Université Paris (1959)Google Scholar
  16. 16.
    Söderlund, S., Granath, F., Broström, O., Karlen, P., Löfberg, R., Ekbom, A., Askling, J.: Inflammatory bowel disease confers a lower risk of colorectal cancer to females than to males. Gastroenterology 138(5), 1697–1703 (2010)CrossRefGoogle Scholar
  17. 17.
    Tishby, N., Pereira, F.C., Bialek, W.: The information bottleneck method. In: Proceedings of the 37th Annual Allerton Conference on Communication, Control and Computing, pp. 368–377 (1999)Google Scholar
  18. 18.
    Wullschleger, S., Loewith, R., Hall, M.N.: TOR signaling in growth and metabolism. Cell 124(3), 471–484 (2006)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. 1.Department of Mathematics and Computer ScienceUniversity of BaselBaselSwitzerland

Personalised recommendations