Abstract
The huge amount of gene expression profile data produced from DNA Microarray Technology has forced the analysis procedure to be applied in multiple biomedical fields. Analysis of cancer data for proper diagnosis of cancer is an important field where early detection of cancer or different levels of cancer helps in early recovery of cancer diseases. So, sample classification has become an evident task for this analysis. Clustering is an important process that can be applied in identification of new subtypes of cancer. Partition based clustering algorithms are popular due to their simplicity and ability to provide moderate results in most of the cases. In this regard here a comparative performance analysis has been performed to show the impact of partition based clustering algorithms to cluster samples in microarray data. In this paper, five classical and popular partition based algorithms are applied on eight gene expression datasets to illustrate the comparative performances. The results show the usefulness of the selected partition based algorithms clearly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Causton, H., Quackenbush, J., Brazma, A.: Microarray Gene Expression Data Analysis: A Beginner’s Guide. Wiley, New York (2003)
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531–537 (1999)
Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3, 185–205 (2005)
Shen, Q., Shi, W., Kong, W.: New gene selection method for multiclass tumor classification by class centroid. J. Biomed. Inform. 42, 59–65 (2009)
Liu, Q., Sung, A.H., Chen, Z., et al.: Gene selection and classification for cancer microarray data based on machine learning and similarity measures. BMC Genomics 12(5), Article S1 (2011)
Liao, J.G., Chin, K.-V.: Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Bioinformatics 23(15), 1945–1951 (2007)
Domany, E.: Cluster analysis of gene expression data. J. Stat. Phys. 110(3–6), 1117–1139 (2003)
Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16(11), 1370–1386 (2004)
de Souto, M.C.P., et al.: Clustering cancer gene expression data: a comparative study. BMC Bioinform. 9, 497 (2008)
Maji, P., Paul, S.: Rough-fuzzy clustering for grouping functionally similar genes from microarray data. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(2), 286–299 (2013)
Ng, R.T., Han, J.: CLARANS: a method for clustering objects for spatial data mining. IEEE Trans. Knowl. Data Eng. 14(5), 1003–1016 (2002)
Maji, P., Pal, S.K.: Rough set based generalized fuzzy C-means algorithm and quantitative indices. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 37(6), 1529–1540 (2007)
Maji, P., Das, C.: Relevant and significant supervised gene clusters for microarray cancer classification. IEEE Trans. Nanobiosci. 11(2), 161–168 (2012)
Wang, S.-L., et al.: Finding minimum gene subsets with heuristic breadth-first search algorithm for robust tumor classification. BMC Bioinform. 13, 178 (2012)
Maji, P.: Mutual information-based supervised attribute clustering for microarray sample classification. IEEE Trans. Knowl. Data Eng. 24(1), 127–140 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Das, C. et al. (2020). Impact of Partition Based Clustering Algorithms to Cluster Samples in Microarray Gene Expression Data. In: Dawn, S., Balas, V., Esposito, A., Gope, S. (eds) Intelligent Techniques and Applications in Science and Technology. ICIMSAT 2019. Learning and Analytics in Intelligent Systems, vol 12. Springer, Cham. https://doi.org/10.1007/978-3-030-42363-6_77
Download citation
DOI: https://doi.org/10.1007/978-3-030-42363-6_77
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-42362-9
Online ISBN: 978-3-030-42363-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)