Chapter

Methods of Microarray Data Analysis

pp 147-162

Integration of Microarray Data for a Comparative Study of Classifiers and Identification of Marker Genes

  • Daniel BerrarAffiliated withSchool of Biomedical Sciences, University of Ulster
  • , Brian SturgeonAffiliated withSchool of Biomedical Sciences, University of Ulster
  • , Ian BradburyAffiliated withSchool of Biomedical Sciences, University of Ulster
  • , C. Stephen DownesAffiliated withSchool of Biomedical Sciences, University of Ulster
  • , Werner DubitzkyAffiliated withSchool of Biomedical Sciences, University of Ulster

* Final gross prices may vary according to local VAT.

Get Access

Abstract:

Novel diagnostic tools promise the development of patient-tailored cancer treatment. However, one major step towards individualized therapy is to use a combination of various data sources, e.g. transcriptomic, proteomic, and clinical data. We have integrated clinical data and lung cancer microarray data that were generated on two different oligonucleotide platforms. We were interested in the question whether the prediction of survival outcome benefits from the integration of clinical and transcriptomic data. In addition, we attempted to identify those genes whose expression profiles correlate with survival outcome. We applied five machine learning techniques to predict survival risk groups, and we compared the models with respect to their performance and general user acceptance. Based on quantitative and qualitative evaluation criteria, we chose decision trees as the most relevant technique for this type of analysis. Our in silico analysis corroborates the role of numerous marker genes already described in lung adenocarcinomas. In addition, our study reveals a set of highly interesting genes whose expression profiles correlate with genetic risk groups of unexpected survival outcomes.

Key words:

Microarray lung cancer survival analysis machine learning