Performance Efficiency and Effectiveness of Clustering Methods for Microarray Datasets

Chormunge, Smita; Jena, Sudarson

doi:10.1007/978-81-322-2529-4_58

Smita Chormunge⁶ &
Sudarson Jena⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 44))

874 Accesses

Abstract

Numerous of clustering methods are proficiently work for low dimensional data. However Clustering High dimensional data is still challenging related to time complexity and accuracy. This paper presents the performance efficiency and effectiveness of K-means and Agglomerative hierarchical clustering methods based on Euclidean distance function and quality measures Precision, Recall and F-measure for Microarray datasets by varying cluster values. Efficiency concerns about computational time required to build up dataset and effectiveness concerns about accuracy to cluster the data. Experimental results on Microarray datasets reveal that K-means clustering algorithm is favorable in terms of effectiveness where as efficiency of clustering algorithms depends on dataset used for empirical study.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16(11), 1370–1386 (November 2004). doi:ieeecomputersociety.org/10.1109/TKDE
Steinbach, M., Ertöz, L., Kumar, V.: The challenges of clustering high dimensional data. In: New Vistas in Statistical Physics—Applications in Econophysics, Bioinformatics, and Pattern Recognition. Springer (2004)
Google Scholar
Xu, R., Wunsch, D., Survey of clustering algorithms. IEEE Trans. Neural Netw. 16(3), 645–678 (May 2005)
Google Scholar
Steinbach, M., Karypis, G., Kumar, V.: A comparison of document clustering techniques. In: KDD Workshop on Text Mining (2000)
Google Scholar
Onoda, T., Sakai, M., Independent component analysis based seeding method for k-means clustering. In: IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology (2011). doi:10.1109/WI-IAT.2011.29
Dhillon, I., Modha, D.: Concept decompositi-on for large sparse text data using clustering. Mach. Learn. 42, 143–175 (2001)
Article MATH Google Scholar
Rokach, L., Maimon, O.: Clustering Methods Data Mining and Knowledge Discovery Handbook. Springer (2005)
Google Scholar
Achtert, E., Goldhofer, S., Kriegel,H.-P., Schubert, E., Zimek, A.: Evaluation of clusterings—metrics and visual support. In: Proceedings of the 28th International Conference on Data Engineering (ICDE), Washington, DC (2012)
Google Scholar
Bourennani, F., Pu, K.Q., Zhu, Y.: Visualization and integration of databases using self-organizing map. IEEE Int. Conf. Adv. Databases Knowl. Data Appl. 155–160 (2009). doi:10.1109/DBKDA.2009.30
Greenacre, M., Primicerio, R.: Measures of Distance between Samples: Euclidean, pp. 47–59. Fundacion BBVA Publication (December 2013). ISBN:978-84-92937-50-9
Google Scholar
http://csse.szu.edu.cn/staff/zhuzx/Datasets.html
Bouckaert, R.R., Frank, E., Hall, M., Kirkby, R., Reutemann, P., Seewald, A., Scuse, D.: WEKA Manual for Version 3-7-10, July 31, 2013
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, GITAM University, Hyderabad, India
Smita Chormunge
Department of Information Technology, GITAM University, Hyderabad, India
Sudarson Jena

Authors

Smita Chormunge
View author publications
You can also search for this author in PubMed Google Scholar
Sudarson Jena
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Smita Chormunge .

Editor information

Editors and Affiliations

Department of Computer Science, Liverpool Hope University, Liverpool, United Kingdom
Atulya Nagar
Department of Computer Science and Engineering, National Institute of Technology Rourkela, Rourkela, India
Durga Prasad Mohapatra
Computer Science & Engineering, University of Calcutta, Kolkata, West Bengal, India
Nabendu Chaki

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chormunge, S., Jena, S. (2016). Performance Efficiency and Effectiveness of Clustering Methods for Microarray Datasets. In: Nagar, A., Mohapatra, D., Chaki, N. (eds) Proceedings of 3rd International Conference on Advanced Computing, Networking and Informatics. Smart Innovation, Systems and Technologies, vol 44. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2529-4_58

Download citation

DOI: https://doi.org/10.1007/978-81-322-2529-4_58
Published: 03 September 2015
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2528-7
Online ISBN: 978-81-322-2529-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics