A Novel Approach to Gene Selection of Leukemia Dataset Using Different Clustering Methods

Prasath, P.; Perumal, K.; Thangavel, K.; Manavalan, R.

doi:10.1007/978-81-322-1680-3_7

P. Prasath⁸,
K. Perumal⁸,
K. Thangavel⁹ &
…
R. Manavalan¹⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 246))

1513 Accesses

Abstract

Gene datasets from microarray comprise large number of genes. Clustering is a widely used approach for grouping similar kind of genes. The main objective of this paper is to identify the optimal subset of genes from the leukemia dataset in order to classify the leukemia cancer. Different clustering approaches such as K-means (KM) clustering, fuzzy C-means (FCM) clustering, and modified K-means (MKM) clustering have been adopted in this research. The clusters obtained from these methods are further clustered using K-means sample-wise (by omitting class values), and the results are compared with ground truth value to evaluate the performance of the different clustering methods. The highly correlated genes are selected from the cluster that produces more accurate classification results. It is observed that the FCM (gene-wise clustering) with K-means (sample-wise clustering) produces better accuracy, and the resultant genes have been identified.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Unsupervised gene selection using biological knowledge : application in sample clustering

Article Open access 22 November 2017

Informative Gene Selection Using Clustering and Gene Ontology

Correlation Based Cluster Validity Index for Recognition of Leukemia Mediating Biomarkers

References

Stanislav Busygin, Gerrit Jacobsen, and Ewald Kramer. Double conjugated clustering applied to leukemia microarray data. In Proceedings of the 2nd SIAM International Conference on Data Mining, Workshop on Clustering High Dimensional Data, 2002.
Google Scholar
Aik Choon Tan and David Gilbert, Ensemble machine learning on gene expression data for cancer classification: Applied Bioinformatics 2003:2 (3 Suppl) S75–S83.
Google Scholar
Cherie H. Dunphy (2006) Gene Expression Profiling Data in Lymphoma and Leukemia: Review of the Literature and Extrapolation of Pertinent Clinical Applications. Archives of Pathology & Laboratory Medicine: April 2006, Vol. 130, No. 4, pp. 483–520.
Google Scholar
Yoo CK, Vanrolleghem PA. Interpreting patterns and analysis of acute leukemia gene expression data by multivariate statistical analysis. In: Barbosa Povoa A, Matos H, editors. Computer-Aided Chemical Engineering. Elsevier Science; 2004. pp. 1165–70.
Google Scholar
Wei Li, Modified K-means clustering algorithm, Congress on Image & Signal Processing, IEEE, 2008, pp. 618–621.
Google Scholar
T.R. Golub et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science, 1999, Vol. 286, pp. 531–537.
Google Scholar
Palanisamy, P.; Perumal; Thangavel, K.; Manavalan, R., “A novel approach to select significant genes of leukemia cancer data using K-Means clustering,” Pattern Recognition, Informatics and Medical Engineering (PRIME), 2013 International Conference on, pp. 104, 108, 21–22 Feb. 2013.
Google Scholar

Download references

Acknowledgments

The third author gratefully acknowledges the UGC, New Delhi, for partial financial assistance under UGC-SAP (DRS) Grant No. F3-50/2011.

Author information

Authors and Affiliations

Department of Biotechnology, Periyar University, Salem, 636 011, India
P. Prasath & K. Perumal
Department of Computer Science, Periyar University, Salem, 636 011, India
K. Thangavel
Department of Computer Science, K. S. Rangasamy College of Arts and Science, Thiruchengode, India
R. Manavalan

Authors

P. Prasath
View author publications
You can also search for this author in PubMed Google Scholar
K. Perumal
View author publications
You can also search for this author in PubMed Google Scholar
K. Thangavel
View author publications
You can also search for this author in PubMed Google Scholar
R. Manavalan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P. Prasath .

Editor information

Editors and Affiliations

Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
G. Sai Sundara Krishnan
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
R. Anitha
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
R. S. Lekshmi
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
M. Senthil Kumar
Department of Mathematics, Ryerson University, Toronto, Ontario, Canada
Anthony Bonato
University of Basque Country, Paseo Manuel De Lardizalbal 1, San Sebastian, Spain
Manuel Graña

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Prasath, P., Perumal, K., Thangavel, K., Manavalan, R. (2014). A Novel Approach to Gene Selection of Leukemia Dataset Using Different Clustering Methods. In: Krishnan, G., Anitha, R., Lekshmi, R., Kumar, M., Bonato, A., Graña, M. (eds) Computational Intelligence, Cyber Security and Computational Models. Advances in Intelligent Systems and Computing, vol 246. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1680-3_7

Download citation

DOI: https://doi.org/10.1007/978-81-322-1680-3_7
Published: 27 November 2013
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1679-7
Online ISBN: 978-81-322-1680-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

A Novel Approach to Gene Selection of Leukemia Dataset Using Different Clustering Methods

Abstract

Access this chapter

Similar content being viewed by others

Unsupervised gene selection using biological knowledge : application in sample clustering

Informative Gene Selection Using Clustering and Gene Ontology

Correlation Based Cluster Validity Index for Recognition of Leukemia Mediating Biomarkers

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A Novel Approach to Gene Selection of Leukemia Dataset Using Different Clustering Methods

Abstract

Access this chapter

Similar content being viewed by others

Unsupervised gene selection using biological knowledge : application in sample clustering

Informative Gene Selection Using Clustering and Gene Ontology

Correlation Based Cluster Validity Index for Recognition of Leukemia Mediating Biomarkers

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation