Breast Cancer Research and Treatment

, Volume 118, Issue 3, pp 433–441

Meta-analysis of gene expression profiles related to relapse-free survival in 1,079 breast cancer patients

Preclinical Study

DOI: 10.1007/s10549-008-0242-8

Cite this article as:
Györffy, B. & Schäfer, R. Breast Cancer Res Treat (2009) 118: 433. doi:10.1007/s10549-008-0242-8


The transcriptome of breast cancers have been extensively screened with microarrays and large sets of genes associated with clinical features have been established. The aim of this study was to validate original gene sets on a large cohort of raw breast cancer microarray data with known clinical follow-up. We recovered 20 publications and matched them to Affymetrix HGU133A annotations. Raw Affymetrix HGU133A microarray data were extracted from GEO and MAS5 normalized. For classifying patients using the selected gene sets, we applied prediction analysis of microarrays and constructed Kaplan–Meier plots. A new classification including all patients was generated using supervised principal components analysis. Seven studies including 1,470 patients were downloaded from GEO. Notably, we uncovered 641 microarrays representing 251 individual tumor specimens among them, which were repeatedly described under independent GEO identifiers. We excluded all redundant data and used the remaining 1,079 samples. Eight of the 20 gene sets were able to predict response at a significance of P < 0.05. The discrimination of good and poor prognosis groups exclusively relying on gene expression data resulted in high significance (P = 1.8E−12). A model including genes fitted by both gene expression and clinical covariates (lymph node status and grade) contains 44 genes and can predict response at P = 9.5E−7. The outcome provides a ranking of the gene lists regarding applicability on an independent dataset. We established a consensus predictor combining the available clinical and gene expression data. The database comprising expression profiles of 1,079 breast cancers can be used to classify individual patients.


Microarray Gene expression signature Breast cancer prognosis Bioinformatics 

Supplementary material

10549_2008_242_MOESM1_ESM.pdf (69 kb)
Supplemental Table 1 (PDF 69 kb)
10549_2008_242_MOESM2_ESM.txt (234.9 mb)
Supplemental Table 2 (TXT 240574 kb)
10549_2008_242_MOESM3_ESM.pdf (272 kb)
Supplemental Table 3 (PDF 273 kb)
10549_2008_242_MOESM4_ESM.pdf (138 kb)
Supplemental Table 4 (PDF 139 kb)
10549_2008_242_MOESM5_ESM.txt (22.3 mb)
Supplemental Table 5 (TXT 22795 kb)
10549_2008_242_MOESM6_ESM.txt (68 kb)
Supplemental Table 6 (TXT 69 kb)

Copyright information

© Springer Science+Business Media, LLC. 2008

Authors and Affiliations

  1. 1.Research Group for Pediatrics and NephrologyHungarian Academy of Sciences and Semmelweis UniversityBudapestHungary
  2. 2.Children’s Hospital Boston Informatics ProgramHarvard-MIT Health Sciences and TechnologyBostonUSA
  3. 3.Laboratory of Molecular Tumor Pathology and Laboratory of Functional GenomicsCharité, Universitätsmedizin BerlinBerlinGermany

Personalised recommendations