Impact of Partition Based Clustering Algorithms to Cluster Samples in Microarray Gene Expression Data

Das, Chandra; Bose, Shilpi; Karmakar, Debanjana; Roy, Agniswar; Ghosh, Natasha; Banerjee, Abhik; Chattopadhyay, Matangini

doi:10.1007/978-3-030-42363-6_77

Chandra Das⁸,
Shilpi Bose⁸,
Debanjana Karmakar⁸,
Agniswar Roy⁸,
Natasha Ghosh⁹,
Abhik Banerjee⁸ &
…
Matangini Chattopadhyay¹⁰

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 12))

Included in the following conference series:

International Conference on Innovation in Modern Science and Technology

861 Accesses

Abstract

The huge amount of gene expression profile data produced from DNA Microarray Technology has forced the analysis procedure to be applied in multiple biomedical fields. Analysis of cancer data for proper diagnosis of cancer is an important field where early detection of cancer or different levels of cancer helps in early recovery of cancer diseases. So, sample classification has become an evident task for this analysis. Clustering is an important process that can be applied in identification of new subtypes of cancer. Partition based clustering algorithms are popular due to their simplicity and ability to provide moderate results in most of the cases. In this regard here a comparative performance analysis has been performed to show the impact of partition based clustering algorithms to cluster samples in microarray data. In this paper, five classical and popular partition based algorithms are applied on eight gene expression datasets to illustrate the comparative performances. The results show the usefulness of the selected partition based algorithms clearly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Causton, H., Quackenbush, J., Brazma, A.: Microarray Gene Expression Data Analysis: A Beginner’s Guide. Wiley, New York (2003)
Google Scholar
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531–537 (1999)
Article Google Scholar
Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3, 185–205 (2005)
Article Google Scholar
Shen, Q., Shi, W., Kong, W.: New gene selection method for multiclass tumor classification by class centroid. J. Biomed. Inform. 42, 59–65 (2009)
Article Google Scholar
Liu, Q., Sung, A.H., Chen, Z., et al.: Gene selection and classification for cancer microarray data based on machine learning and similarity measures. BMC Genomics 12(5), Article S1 (2011)
Google Scholar
Liao, J.G., Chin, K.-V.: Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Bioinformatics 23(15), 1945–1951 (2007)
Article Google Scholar
Domany, E.: Cluster analysis of gene expression data. J. Stat. Phys. 110(3–6), 1117–1139 (2003)
Article Google Scholar
Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16(11), 1370–1386 (2004)
Article Google Scholar
de Souto, M.C.P., et al.: Clustering cancer gene expression data: a comparative study. BMC Bioinform. 9, 497 (2008)
Article Google Scholar
Maji, P., Paul, S.: Rough-fuzzy clustering for grouping functionally similar genes from microarray data. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(2), 286–299 (2013)
Article Google Scholar
Ng, R.T., Han, J.: CLARANS: a method for clustering objects for spatial data mining. IEEE Trans. Knowl. Data Eng. 14(5), 1003–1016 (2002)
Article Google Scholar
Maji, P., Pal, S.K.: Rough set based generalized fuzzy C-means algorithm and quantitative indices. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 37(6), 1529–1540 (2007)
Article Google Scholar
Maji, P., Das, C.: Relevant and significant supervised gene clusters for microarray cancer classification. IEEE Trans. Nanobiosci. 11(2), 161–168 (2012)
Article Google Scholar
Wang, S.-L., et al.: Finding minimum gene subsets with heuristic breadth-first search algorithm for robust tumor classification. BMC Bioinform. 13, 178 (2012)
Article Google Scholar
Maji, P.: Mutual information-based supervised attribute clustering for microarray sample classification. IEEE Trans. Knowl. Data Eng. 24(1), 127–140 (2010)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Netaji Subhash Engineering College, Kolkata, 700152, West Bengal, India
Chandra Das, Shilpi Bose, Debanjana Karmakar, Agniswar Roy & Abhik Banerjee
Department of Information Technology, Netaji Subhash Engineering College, Kolkata, 700152, West Bengal, India
Natasha Ghosh
School of Education Technology, Jadavpur University, Kolkata, West Bengal, India
Matangini Chattopadhyay

Authors

Chandra Das
View author publications
You can also search for this author in PubMed Google Scholar
Shilpi Bose
View author publications
You can also search for this author in PubMed Google Scholar
Debanjana Karmakar
View author publications
You can also search for this author in PubMed Google Scholar
Agniswar Roy
View author publications
You can also search for this author in PubMed Google Scholar
Natasha Ghosh
View author publications
You can also search for this author in PubMed Google Scholar
Abhik Banerjee
View author publications
You can also search for this author in PubMed Google Scholar
Matangini Chattopadhyay
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chandra Das .

Editor information

Editors and Affiliations

Department of Electrical Engineering, Siliguri Institute of Technology, Sukna, West Bengal, India
Subhojit Dawn
Department of Automation and Applied Informatics, Aurel Vlaicu University of Arad, Arad, Arad, Romania
Valentina Emilia Balas
Department of Psychology and IIASS, Università della Campania “Luigi Vanvitelli”, Caserta, Caserta, Italy
Anna Esposito
Mizoram University, Aizawl, Mizoram, India
Sadhan Gope

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Das, C. et al. (2020). Impact of Partition Based Clustering Algorithms to Cluster Samples in Microarray Gene Expression Data. In: Dawn, S., Balas, V., Esposito, A., Gope, S. (eds) Intelligent Techniques and Applications in Science and Technology. ICIMSAT 2019. Learning and Analytics in Intelligent Systems, vol 12. Springer, Cham. https://doi.org/10.1007/978-3-030-42363-6_77

Download citation

DOI: https://doi.org/10.1007/978-3-030-42363-6_77
Published: 03 March 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-42362-9
Online ISBN: 978-3-030-42363-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics