Abstract
In this paper, a novel gene selection method which was merging the relevance score (BW ratio) and the Flexible Neural Tree (FNT) together was proposed for the multi-class cancer data classification. Firstly, the BW ratio method was adopted to select some informative genes, and then the FNT method was used to extract more characteristic genes from the gene subsets. FNT is a tree-structured neural network with input variables selection, over-layer connections and different activation functions for different nodes. Based on the pre-defined instruction/operator sets, a flexible neural tree model can be created and evolved. The FNT structure is developed by using probabilistic incremental program evolution (PIPE) algorithm, and the free parameters embedded in neural trees are optimized by particle swarm optimization (PSO) algorithm. Experiment on two well-known cancer datasets shows that the proposed method achieved better results compared with other methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Saeys, Y., Abeel, T., Van de Peer, Y.: Robust Feature Selection Using Ensemble Feature Selection Techniques. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part II. LNCS (LNAI), vol. 5212, pp. 313–325. Springer, Heidelberg (2008)
Sandrine, D., Jane, F., Terence, P.S.: Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. The American Statistical Association 97(457) (2002)
Hong, C., Carlotta, D.: An Evaluation of Gene Selection Methods for Multi-class Microarray Data Classification. In: Proceedings of the Second European Workshop on Data Mining and Text Mining in Bioinformatics
Yang, C.S., Chuang, L.Y., Li, J.C., Yang, C.H.: A Novel BPSO Approach for Gene Selection and Classification of Microarray Data. IEEE (2008)
Hrishikesh, M., Nitya, S., Krishna, M., Tapobrata, L.: An ANN-GA model based promoter prediction in Arabidopsis thaliana using tilling microarray data. Bioinformation 6(6), 240–243 (2011)
Kohavi, R., John, G.: Wrappers for feature subset selection. Artif. Intell. 97(1-2), 273–324 (1997)
Liu, K.H., Xu, C.G.: A genetic programming-based approach to the classification of multiclass microarray datasets. Original Paper, Bioinformatics/btn644 25(3), 331–337 (2009)
Chen, Y., Peng, L., Abraham, A.: Gene Expression Profiling Using Flexible Neural Trees. In: Corchado, E., Yin, H., Botti, V., Fyfe, C. (eds.) IDEAL 2006. LNCS, vol. 4224, pp. 1121–1128. Springer, Heidelberg (2006)
Saeys, Y., Inza, I., Larranaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)
Salustowicz, R., Schmidhuber, J.: Probabilistic incremental program evolution. Evolutionary Computation 5(2), 123–141 (1997)
Yang, K., Cai, Z., Li, J., Lin, G.: A stable gene selection in microarray data analysis. BMC Bioinformatics 7(228) (2006)
Armstrong, S.A., Staunton, J.E., Silverman, L.B., Pieters, R., den Boer, M.L., Minden, M.D., Sallan, S.E., Lander, E.S., Golub, T.R., Korsmeyer, S.J.: MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nature Genetics 30, 41–47 (2002)
Nutt, C.L., Mani, D.R., Betensky, R.A., Tamayo, P., Cairncross, J.G., Ladd, C., Pohl, U., Hartmann, C., McLaughlin, M.E., Batchelor, T.T., Black, P.M., von Deimling, A., Pomeroy, S.L., Golub, T.R., Louis, D.N.: Gene Expression-based Classification of Malignant Gliomas Correlates Better with Survival than Histological Classification. Cancer Research 63, 1602–1607 (2003)
Zhang, B.-L.: Cancer Classification by Kernel Principal Component Self-regression. In: Sattar, A., Kang, B.-H. (eds.) AI 2006. LNCS (LNAI), vol. 4304, pp. 719–728. Springer, Heidelberg (2006)
Li, G.Z., Meng, H.H., Ni, J.: Embedded Gene Selection for Imbalanced Microarray Data Analysis. IEEE (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lei, X., Chen, Y., Zhao, Y. (2012). A Novel Gene Selection Method for Multi-catalog Cancer Data Classification. In: Huang, DS., Jiang, C., Bevilacqua, V., Figueroa, J.C. (eds) Intelligent Computing Technology. ICIC 2012. Lecture Notes in Computer Science, vol 7389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31588-6_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-31588-6_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31587-9
Online ISBN: 978-3-642-31588-6
eBook Packages: Computer ScienceComputer Science (R0)