Abstract
This paper proposes a gene selection approach based on clustering of DNA-microarray data. The proposal has been aimed at finding a boundary gene subset coming from gene groupings imposed by a clustering method applied to the case study: gene expression data in lung cancer. Thus, we assume that such a found gene subset represents informative genes, which can be used to train a classifier by learning tumor tissue samples. To do this, we compare the results of several methods of hierarchical clustering to select the best one and then choose the most suitable clustering based on visualization techniques. The latter is used to compute its boundary genes. The results achieved from the case study have shown the reliability of this approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Rothschild, S.I.: Advanced and metastatic lung cancer - what is new in the diagnosis and therapy. PRAXIS 104, 745–750 (2015)
Wang, K.J., Melani, A., Chen, K.H., Wang, K.M.: A hybrid classifier combining borderline-SMOTE with AIRS algorithm for estimating brain metastasis from lung cancer: A case study in taiwan. Computer Methods and Programs in Biomedicine 119, 63–76 (2015)
Berrar, D.P., Dubitzky, W., Granzow, M.: A Practical Approach to Microarray Data Analysis. Kluwer Academic Publishers, New York (2003)
Castellanos-Garzón, J.A., García, C.A., Novais, P., Díaz, F.: A visual analytics framework for cluster analysis of DNA microarray data. Expert Systems with Applications, Elsevier 40, 758–774 (2013)
Lazar, C., Taminau, J., Meganck, S., Steenhoff, D., Coletta, A., Molter, C., deSchaetzen, V., Duque, R., Bersini, H., Nowé, A.: A survey on filter techniques for feature selection in gene expression microarray analysis. IEEE/ACM Transactions On Computational Biology And Bioinformatics 9(4), 1106–1118 (2012)
Xia, C., Hsu, W., Lee, M.L., Ooi, B.C.: Border: Efficient computation of boundary points. IEEE Transactions on Knowledge and Data Engineering 18, 289–303 (2006)
Jain, A.K., Murty, N.M., Flynn, P.J.: Data clustering: A review. ACM Computing Surveys 31(3), 264–323 (1999)
Weiss, P.: Applications of generating functions in nonparametric tests. The Mathematica Journal 9(4), 803–823 (2005)
Eisen, M., Spellman, T., Brown, P., Botstein, D.: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences, USA 95, 14863–14868 (1998)
Chipman, H., Tibshirani, R.: Hybrid hierarchical clustering with applications to microarray data. Biostatistics 7, 302–317 (2006)
Castellanos-Garzón, J.A., Díaz, F.: An evolutionary computational model applied to cluster analysis of DNA microarray data. Expert Systems with Applications, Elsevier 40, 2575–2591 (2013)
Kuner, R., Muley, T., Meister, M., Ruschhaupt, M., Buness, A., Xu, E., Schnabel, P., Warth, A., Poustka, A., Sltmann, H., Hoffmann, H.: Global gene expression analysis reveals specific patterns of cell junctions in non-small cell lung cancer subtypes. Lung Cancer 63(1), 32–38 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Castellanos-Garzón, J.A., Ramos, J., González-Briones, A., de Paz, J.F. (2016). A Clustering-Based Method for Gene Selection to Classify Tissue Samples in Lung Cancer. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., Domínguez Mayo, F., De Paz, J. (eds) 10th International Conference on Practical Applications of Computational Biology & Bioinformatics. PACBB 2016. Advances in Intelligent Systems and Computing, vol 477. Springer, Cham. https://doi.org/10.1007/978-3-319-40126-3_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-40126-3_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40125-6
Online ISBN: 978-3-319-40126-3
eBook Packages: EngineeringEngineering (R0)