Abstract
Gene expression technology namely microarray, offers the ability to measure the expression levels of thousands of genes simultaneously in a biological organism. Microarray data are expected to be of significant help in the development of efficient cancer diagnosis and classification platform. The main problem that needs to be addressed is the selection of a small subset of genes that contributes to a disease from the thousands of genes measured on microarray that are inherently noisy. Most approaches from previous works have selected the numbers of genes manually and thus, have caused difficulty, especially for beginner biologists. Hence, this paper aims to automatically select a small subset of informative genes that is most relevant for the cancer classification. In order to achieve this aim, a recursive genetic algorithm has been proposed. Experimental results show that the gene subset is small in size and yield better classification accuracy as compared with other previous works as well as four methods experimented in this work. A list of informative genes in the best subsets is also presented for biological usage.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Mohamad, M.S., Omatu, S., Deris, S., et al.: A model for gene selection and classification of gene expression data. J. Artif. Life Robotics 11(2), 219–222 (2007)
Armstrong, S.A., Staunton, J.E., Silverman, L.B., et al.: MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nat Genetics (30), 41–47 (2002)
Gordon, G.J., Jensen, R.V., Hsiao, L.L., et al.: Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. Cancer Res. 62, 4963–4967 (2002)
Li, J., Liu, H., Ng, S.K., et al.: Discovery of significant rules for classifying cancer diagnosis data. Bioinformatics 19, 93–102 (2003)
Wang, Y., Makedon, F.S., Ford, J.C., et al.: HykGene: A hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data. Bioinformatics 21(8), 1530–1537 (2005)
Mohamad, M.S., Omatu, S., Deris, S., et al.: A multi-objective strategy in genetic algorithm for gene selection of gene expression data. J. Artif. Life Robotics (appear in press)
Shah, S., Kusiak, A.: Cancer gene search with data-mining and genetic algorithms. Computers Bio. Med. 37(2), 251–261 (2007)
Huang, H.L., Chang, F.L.: ESVM: Evolutionary support vector machine for automatic feature selection and classification of microarray data. Biosystems 90, 516–528 (2007)
Moon, H., Ahn, H., Kodell, R.L., et al.: Ensemble methods for classification of patients for personalized medicine with high-dimensional data. Artif. Intell. Med. 41, 197–207 (2007)
Mohamad, M.S., Deris, S., Illias, R.M.: A hybrid of genetic algorithm and support vector machine for features selection and classification of gene expression microarray. J. Comput. Intell. Appl. 5, 91–107 (2005)
Ambroise, C., McLachlan, G.J.: Selection bias in gene extraction on the basis of microarray gene-expression data. In: National Academy of Science of the USA, Washington, pp. 6562–6566 (2002)
Evans, C., Young, A.N., Brown, M.R., et al.: Novel patterns of gene expression in pituitary adenomas identified by complementary deoxyribonucleic acid microarrays and quantitative reverse transcription-polymerase chain reaction. J. Clin. Endocrinol Metab. 86(7), 3097–3107 (2001)
Li, X., Rao, S., Wang, Y., et al.: Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling. Nucleic Acids Res. 32(9), 2685–2694 (2004)
Guinn, B., Gilkes, A.F., Woodward, E., et al.: Microarray analysis of tumour antigen expression in presentation acute myeloid leukaemia. Biochem. Biophys. Res. Communications 333(3), 703–713 (2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mohamad, M.S., Omatu, S., Deris, S., Yoshioka, M. (2009). A Recursive Genetic Algorithm to Automatically Select Genes for Cancer Classification. In: Corchado, J.M., De Paz, J.F., Rocha, M.P., Fernández Riverola, F. (eds) 2nd International Workshop on Practical Applications of Computational Biology and Bioinformatics (IWPACBB 2008). Advances in Soft Computing, vol 49. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85861-4_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-85861-4_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85860-7
Online ISBN: 978-3-540-85861-4
eBook Packages: EngineeringEngineering (R0)