Skip to main content

Impact of Partition Based Clustering Algorithms to Cluster Samples in Microarray Gene Expression Data

  • Conference paper
  • First Online:
Intelligent Techniques and Applications in Science and Technology (ICIMSAT 2019)

Abstract

The huge amount of gene expression profile data produced from DNA Microarray Technology has forced the analysis procedure to be applied in multiple biomedical fields. Analysis of cancer data for proper diagnosis of cancer is an important field where early detection of cancer or different levels of cancer helps in early recovery of cancer diseases. So, sample classification has become an evident task for this analysis. Clustering is an important process that can be applied in identification of new subtypes of cancer. Partition based clustering algorithms are popular due to their simplicity and ability to provide moderate results in most of the cases. In this regard here a comparative performance analysis has been performed to show the impact of partition based clustering algorithms to cluster samples in microarray data. In this paper, five classical and popular partition based algorithms are applied on eight gene expression datasets to illustrate the comparative performances. The results show the usefulness of the selected partition based algorithms clearly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Causton, H., Quackenbush, J., Brazma, A.: Microarray Gene Expression Data Analysis: A Beginner’s Guide. Wiley, New York (2003)

    Google Scholar 

  2. Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531–537 (1999)

    Article  Google Scholar 

  3. Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3, 185–205 (2005)

    Article  Google Scholar 

  4. Shen, Q., Shi, W., Kong, W.: New gene selection method for multiclass tumor classification by class centroid. J. Biomed. Inform. 42, 59–65 (2009)

    Article  Google Scholar 

  5. Liu, Q., Sung, A.H., Chen, Z., et al.: Gene selection and classification for cancer microarray data based on machine learning and similarity measures. BMC Genomics 12(5), Article S1 (2011)

    Google Scholar 

  6. Liao, J.G., Chin, K.-V.: Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Bioinformatics 23(15), 1945–1951 (2007)

    Article  Google Scholar 

  7. Domany, E.: Cluster analysis of gene expression data. J. Stat. Phys. 110(3–6), 1117–1139 (2003)

    Article  Google Scholar 

  8. Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16(11), 1370–1386 (2004)

    Article  Google Scholar 

  9. de Souto, M.C.P., et al.: Clustering cancer gene expression data: a comparative study. BMC Bioinform. 9, 497 (2008)

    Article  Google Scholar 

  10. Maji, P., Paul, S.: Rough-fuzzy clustering for grouping functionally similar genes from microarray data. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(2), 286–299 (2013)

    Article  Google Scholar 

  11. Ng, R.T., Han, J.: CLARANS: a method for clustering objects for spatial data mining. IEEE Trans. Knowl. Data Eng. 14(5), 1003–1016 (2002)

    Article  Google Scholar 

  12. Maji, P., Pal, S.K.: Rough set based generalized fuzzy C-means algorithm and quantitative indices. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 37(6), 1529–1540 (2007)

    Article  Google Scholar 

  13. Maji, P., Das, C.: Relevant and significant supervised gene clusters for microarray cancer classification. IEEE Trans. Nanobiosci. 11(2), 161–168 (2012)

    Article  Google Scholar 

  14. Wang, S.-L., et al.: Finding minimum gene subsets with heuristic breadth-first search algorithm for robust tumor classification. BMC Bioinform. 13, 178 (2012)

    Article  Google Scholar 

  15. Maji, P.: Mutual information-based supervised attribute clustering for microarray sample classification. IEEE Trans. Knowl. Data Eng. 24(1), 127–140 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chandra Das .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Das, C. et al. (2020). Impact of Partition Based Clustering Algorithms to Cluster Samples in Microarray Gene Expression Data. In: Dawn, S., Balas, V., Esposito, A., Gope, S. (eds) Intelligent Techniques and Applications in Science and Technology. ICIMSAT 2019. Learning and Analytics in Intelligent Systems, vol 12. Springer, Cham. https://doi.org/10.1007/978-3-030-42363-6_77

Download citation

Publish with us

Policies and ethics