Gene Selection and Classification Rule Generation for Microarray Dataset
Microarray is a useful technique for measuring expression data of thousands or more of genes simultaneously. One of challenges in classification of cancer using high-dimensional gene expression data is to select a minimal number of relevant genes which can maximize classification accuracy. Because of the distinct characteristics inherent to specific cancerous gene expression profiles, developing flexible and robust gene identification methods is extremely fundamental. Many gene selection methods as well as their corresponding classifiers have been proposed. In the proposed method, a single gene with high class-discrimination capability is selected and classification rules are generated for cancer based on gene expression profiles.
KeywordsMicroarray cancer data K-means algorithm Gene selection Classification Rule Cancer sample identification
Unable to display preview. Download preview PDF.
- 1.Aerman, D.A., Gish, K., Ybarra, S., Mack, D., Levine, A.J.: Expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci. 1, 6745–6750 (1999)Google Scholar
- 9.Wang, X., Gotoh, O.: Microarray-Based Cancer Prediction Using Soft Computing Approach. Cancer Informatics 7, 123–139 (2009)Google Scholar
- 10.Bradley, P.S., Bennett, K.P., Demiriz, A.: Constrained k-means clustering (Technical Report MSR-TR-2000-65), Microsoft Research, Redmond, WA (2000)Google Scholar
- 11.Pensa, R.G., Leschi, C., Besson, J., Boulicaut, J.: Assessment of discretization techniques for relevant pattern discovery from gene expression data. In: 4th Workshop on Data Mining in Bioinformatics (2004)Google Scholar