Microarray Gene Expression Data Integration: An Application to Brain Tumor Grade Determination

  • Eduardo ValenteEmail author
  • Miguel Rocha
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 375)


World Health Organization ranks brain tumors in four stages, being the fourth grade the most aggressive. Glioblastoma, a fourth grade tumor, is one of the most severe human diseases that almost inevitability leads to death. Physicians address the classification in grades through direct inspection. Indeed, there is a need for good automatic predictors of tumor grade, which are not affected by human misclassification errors and that can be made with less invasive diagnostic tools. This work address the stages involved in the process of selecting a good tumor grade predictor, based on microarray gene expression data. In this work, the information integration from heterogeneous platforms is highlighted, evidencing the particularities of choosing approaches working at gene, transcript or probeset levels. Distinct machine learning algorithms and integration methods are tested, analyzing their ability to produce a good set of predictors for tumor grade.


Gene expression Glioblastoma Microarrays data integration Filter methods Wrapper methods 



The work is partially funded by Project 23060, PEM - Technological Support Platform for Metabolic Engineering, co- funded by FEDER through Portuguese QREN under the scope of the Technological Research and Development Incentive system, North Operational.


  1. 1.
    Cho, S., Ryu, J.: Classifying gene expression data of cancer using classifier ensemble with mutually exclusive features. Proc. IEEE 90(11), 1744–1753 (2002)CrossRefGoogle Scholar
  2. 2.
    Lorena, A., Costa, I., Souto, M.: On the complexity of gene expression classification data sets. In: 8th International Conference on Hybrid Intelligent Systems, pp. 825–830 (2008)Google Scholar
  3. 3.
    Huang, J., Fang, H., Fan, X.: Decision forest for classification of gene expression data. Comput. Biol. Med. 40(8), 698–704 (2010)CrossRefGoogle Scholar
  4. 4.
    Kuçukural, A., Yeniterzi, R., Yeniterzi, S., Sezerman, O.: Evolutionary selection of minimum number of features for classification of gene expression data using genetic algorithms. In: Proceedings of the 9th Annual confernce, GECCO 9, pp. 401–406 (2007)Google Scholar
  5. 5.
    Ballester, B., Johnson, N., Proctor, G., Flicek, P.: Consistent annotation of gene expression arrays. BMC Genom. 11, 294–308 (2010)CrossRefGoogle Scholar
  6. 6.
    Rouchka, E., Phatak, A., Singh, A.: Effect of single nucleotide polymorphisms on Affymetrix match-mismatch probe pairs. Bioinformation 2(9), 405–411 (2008)CrossRefGoogle Scholar
  7. 7.
    Zahurak, M., Parmigiani, G., Yu, W., Scharpf, R., Berman, D., Schaeffe, E., Shabbeer, S., Cope, L.: Pre-processing Agilent microarray data. BMC Bioinform. 8(142) (2007)Google Scholar
  8. 8.
    Kostadinova, E.: Data Integration: an approach to improve the preprocessing and analysis of gene expression data. Union Sci Bulgaria 6, 120–133 (2013)Google Scholar
  9. 9.
    Meng, C., Kuster, B., Culhane, A., Gholami, A.: A multivariate approach to the integration of multi-omics datasets. BMC Bioinformatics 15, 162–175 (2014)CrossRefGoogle Scholar
  10. 10.
    Papiez, A., Finnon, P., Badie, C., Bouffler, S., Polanska, J.: Integrating expression data from different microarray platforms in search of biomarkers of radiosensitivity. IWBBIO, Granada 1, 484–493 (2014)Google Scholar
  11. 11.
    Willis, C., Whyte, K., Baker, M., Pestell, R.: Integrating transcriptomic data using metacore pathway analysis to identify novel biomarkers of bevacizumab target engagement. In: Reuters, T. (ed.) American Association for Cancer Research, San Diego, vol. 74 (2014)Google Scholar
  12. 12.
    Valente, E., Rocha, M.: (in press) Transcript-based reannotation for microarray probesets. In: ACM Symposium on Applied Computing, Salamanca (2015)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.Computer Science and Information SystemsIPCBCastelo BrancoPortugal
  2. 2.Centre of Biological EngineeringUniversity of MinhoBragaPortugal

Personalised recommendations