Skip to main content

Cluster Analytic Strategy for Identification of Metagenes Relevant for Prognosis of Node Negative Breast Cancer

  • Conference paper
  • First Online:
  • 2488 Accesses

Abstract

Worldwide, breast cancer is the second leading cause of cancer deaths in women. To gain insight into the processes related to the course of the disease, human genetic data can be used to identify associations between gene expression and prognosis. Moreover, the expression data of groups of genes may be aggregated to metagenes that may be used for investigating complex diseases like breast cancer. Here we introduce a cluster analytic approach for identification of potentially relevant metagenes. In a first step of our approach we used gene expression patterns over time of erbB2 breast cancer MCF7 cell lines to obtain promising sets of genes for a metagene calculation. For this purpose, two cluster analytic approaches for short time-series of gene expression data – DIB-C and STEM – were applied to identify gene clusters with similar expression patterns. Among these we next focussed on groups of genes with transcription factor (TF) binding site enrichment or associated with a GO group. These gene clusters were then used to calculate metagenes of the gene expression data of 766 breast cancer patients from three breast cancer studies. In the last step of our approach Cox models were applied to determine the effect of the metagenes on the prognosis. Using this strategy we identified new metagenes that were associated with metastasis-free survival patients.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  • Dortet-Bernadet JL, Wicker N (2008) Model-based clustering on the unit sphere with an illustration using gene expression profiles. Biostatistics 9(1):66–80

    Article  MATH  Google Scholar 

  • Elkon R, Linhart C, Sharan R, Shamir R, Shiloh Y (2003) Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. Genome Res 13:773–780

    Article  Google Scholar 

  • Ernst J, Nau GJ, Bar-Joseph Z (2005) Clustering short time series gene expression data. Bioinformatics 21(1):159–168

    Article  Google Scholar 

  • Freis E, Selinski S, Weibert B, Krahn U, Schmidt M, Gehrmann M, Hermes M, Maccoux L, West J, Schwender H, Rahnenfhrer J, Hengstler J, Ickstadt K (2009) Effects of metagene calculation on survival: An integrative approach using cluster and promoter analysis. In: Sixth International Workshop on Computational Systems Biology, Tampere, Finland, TICSP series 48, pp 47–50

    Google Scholar 

  • Glahn F, Schmidt-Heck W, Zellmer S, Guthke R, Wiese J, Golka K, Hergenroder R, Degen GH, Lehmann T, Hermes M, Schormann W, Brulport M, Bauer A, Bedawy E, Gebhardt R, Hengstler JG, Foth H (2008) Cadmium, cobalt and lead cause stress response, cell cycle deregulation and increased steroid as well as xenobiotic metabolism in primary normal human bronchial epithelial cells which is coordinated by at least nine transcription factors. Arch Toxicol 82:513–24

    Article  Google Scholar 

  • Hermes M (2007) Konditionale Expression von Her2/NeuT: Einfluss auf die Zell- und Tumorenentwicklung. PhD thesis, University of Leipzig

    Google Scholar 

  • Kim J, Kim JH (2007) Difference-based clustering of short time-course microarray data with replicates. Bioinformatics 8:253

    Google Scholar 

  • Krahn U (2008) Identifikation von Clustern in Gene-Expressions-Zeitreihen zur Analyse der Zellentwicklung. Diploma thesis, TU Dortmund

    Google Scholar 

  • Petry IB, Fieber E, Schmidt M, Gehrmann M, Gebhard S, Hermes M, Schormann W, Selinski S, Freis E, Schwender H, Brulport M, Ickstadt K, Rahnenfuhrer J, Maccoux L, West J, Kolbl H, Schuler M, Hengstler JG (2010) ERBB2 induces an antiapoptotic expression pattern of Bcl-2 family members in node-negative breast cancer. Clin Cancer Res 16(2):451–460

    Article  Google Scholar 

  • R Development Core Team (2010) R: A language and environment for statistical computing. Vienna, Austria, URL http://www.R-project.org

  • Rody A, Holtrich U, Pusztai L, Liedtke C, Gaetje R, Ruckhaeberle E, Solbach C, Hanker L, Ahr A, Metzler D, Engels K, Karn T, Kaufmann M (2009) T-cell metagene predicts a favourable prognosis in estrogen receptor negative and HER2 positive breast cancers. Breast Cancer Res 11:R15

    Article  Google Scholar 

  • Schmidt M, Bohm D, von Torne C, Steiner E, Puhl A, Pilch H, Lehr HA, Hengstler JG, Kolbl H, Gehrmann M (2008) The humoral immune system has a key prognostic impact in node-negative breast cancer. Cancer Res 68:5405–5413

    Article  Google Scholar 

  • Shamir R, Maron-Katz A, Tanay A, Linhart C, Steinfeld I, Sharan R, Shiloh Y, Elkon R (2005) EXPANDER - an integrative program suite for microarray data analysis. BMC Bioinformatics 6:232

    Article  Google Scholar 

  • Slamon DJ, Clark GM, Wong SG, Levin WJ, Ullrich A, McGuire WL (1987) Human breast cancer: Correlation of relapse and survival with amplification of the HER-2/neu oncogene. Science 235:177–182

    Article  Google Scholar 

  • Tanay A (2005) Computational analysis of transcriptional programs: Function and evolution. PhD thesis, Tel Aviv University

    Google Scholar 

  • Trost TM, Lausch EU, Fees SA, Schmitt S, Enklaar T, Reutzel D, Brixel LR, Schmidtke P, Maringer M, Schiffer IB, Heimerdinger CK, Hengstler JG, Fritz G, Bockamp EO, Prawitt D, Zabel BU, Spangenberg C (2005) Premature senescence is a primary fail-safe mechanism of ERBB2-driven tumorigenesis in breast carcinoma cells. Cancer Res 65:840–849

    Google Scholar 

  • Wang X, Wu M, Li Z, Chan C (2008) Short time-series microarray analysis: Methods and challenges. BMC Syst Biol 2:58

    Article  MATH  Google Scholar 

  • Winter SC, Buffa FM, Silva P, Miller C, Valentine HR, Turley H, Shah KA, Cox GJ, Corbridge RJ, Homer JJ, Musgrove B, Slevin N, Sloan P, Price P, West CM, Harris AL (2007) Relation of a hypoxia metagene derived from head and neck cancer to prognosis of multiple cancers. Cancer Res 67(7):3441–3449

    Article  Google Scholar 

Download references

Acknowledgements

We would like to thank Ulrike Krahn, Marcus Schmidt, Mathias Gehrmann, Matthias Hermes, Lindsey Maccoux, Jonathan West, and Holger Schwender for collaboration and numerous helpful discussions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Evgenia Freis .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Freis, E., Selinski, S., Hengstler, J.G., Ickstadt, K. (2012). Cluster Analytic Strategy for Identification of Metagenes Relevant for Prognosis of Node Negative Breast Cancer. In: Gaul, W., Geyer-Schulz, A., Schmidt-Thieme, L., Kunze, J. (eds) Challenges at the Interface of Data Analysis, Computer Science, and Optimization. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24466-7_48

Download citation

Publish with us

Policies and ethics