Abstract
Gene ranking is widely employed in gene selection. Most criteria adopt a single quantitative value to rank genes. To some degree, they hardly provide comprehensive discriminative information of genes. In this paper, the supervised vector representation is proposed. The supervised vector reflecting the spatial distribution in the space expended by the gene is used as the gene representation. The possible problems of “bias” and “cumulative error”, which may be induced by the criteria based on a single value, can be avoided by the proposed criterion. An algorithm of gene selection based on supervised vector representation is also proposed to select gene subsets with the new criterion and its performance is compared with that of some existing gene selection algorithms. Experimental results demonstrate that the proposed algorithm is capable of generating the final gene subsets with higher predictive capability in most cases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bortoluzzi, S., Bisognin, A., Romualdi, C., et al.: Novel genes, possibly relevant for molecular diagnosis or therapy of human rhabdomyosarcoma, detected by genomic expression profiling. Gene 348, 65–71 (2005)
Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. Journal of Bioinformatics and Computational Biology 3(02), 185–205 (2005)
Easmon, J., Purstinger, G., Thies, K.S., et al.: Synthesis, structure-activity relationships, and antitumor studies of 2-benzoxazolyl hydrazones derived from alpha-(N)-acyl heteroaromatics. Journal of Medicinal Chemistry 49(21), 6343–6350 (2006)
Frank, E., Hall, M., Holmes, G., et al.: Weka-a machine learning workbench for data mining. In: Data Mining and Knowledge Discovery Handbook, pp. 1269–1277. Springer US (2010)
Gutlein, M., Frank, E., Hall, M., et al.: Large-scale attribute selection using wrappers. In: IEEE Symposium on Computational Intelligence and Data Mining, CIDM 2009, pp. 332–339. IEEE (2009)
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: Clustering validity checking methods: part II. ACM Sigmod Record 31(3), 19–27 (2002)
Hall, M.A.: Correlation-based feature selection for machine learning. The University of Waikato (1999)
Lee, S.Y., Kim, J.W., Jeong, M.H., et al.: Microtubule-associated Protein 1B Light Chain (MAP1B-LC1) negatively regulates the activity of tumor suppressor p53 in neuroblastoma cells. FEBS Letters 582(19), 2826–2832 (2008)
Li, J., Su, H., Chen, H., et al.: Optimal search-based gene subset selection for gene array cancer classification. IEEE Transactions on Information Technology in Biomedicine 11(4), 398–405 (2007)
Oh, I.S., Lee, J.S., Moon, B.R.: Hybrid genetic algorithms for feature selection. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(11), 1424–1437 (2004)
Pal, N.R., Aguan, K., Sharma, A., et al.: Discovering biomarkers from gene expression data for predicting cancer subgroups using neural networks and relational fuzzy clustering. BMC Bioinformatics 8(1), 5 (2007)
Platt, J.: Sequential minimal optimization: A fast algorithm for training support vector machines (1998)
Robnik-Sikonja, M., Kononenko, I.: Theoretical and empirical analysis of ReliefF and RReliefF. Machine Learning 53(1-2), 23–69 (2003)
Taube, J.H., Herschkowitz, J.I., Komurov, K., et al.: Core epithelial-to-mesenchymal transition interactome gene-expression signature is associated with claudin-low and metaplastic breast cancer subtypes. Proceedings of the National Academy of Sciences 107(35), 15449–15454 (2010)
Wei, J.M., Wang, S.Q., Yuan, X.J.: Ensemble rough hypercuboid approach for classifying cancers. IEEE Transactions on Knowledge and Data Engineering 22(3), 381–391 (2010)
Yang, F., Mao, K.Z.: Robust feature selection for microarray data based on multicriterion fusion. IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB) 8(4), 1080–1092 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Yu, T., Gao, F., Jin, H., Wei, J. (2014). Gene Selection Based on Supervised Vector Representation of Genes. In: Pham, DN., Park, SB. (eds) PRICAI 2014: Trends in Artificial Intelligence. PRICAI 2014. Lecture Notes in Computer Science(), vol 8862. Springer, Cham. https://doi.org/10.1007/978-3-319-13560-1_67
Download citation
DOI: https://doi.org/10.1007/978-3-319-13560-1_67
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13559-5
Online ISBN: 978-3-319-13560-1
eBook Packages: Computer ScienceComputer Science (R0)