Gene Clustering Using Particle Swarm Optimizer Based Memetic Algorithm

  • Zhen Ji
  • Wenmin Liu
  • Zexuan Zhu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6728)


K-means is one of the most commonly used clustering methods for analyzing gene expression data, where it is sensitive to the choice of initial clustering centroids and tends to be trapped in local optima. To overcome these problems, a memetic K-means (MKMA) algorithm, which is a hybridization of particle swarm optimizer (PSO) based memetic algorithm (MA) and K-means, is proposed in this paper. In particular, the PSO based MA is used to minimize the within-cluster sum of squares and the K-means is used to iteratively fine-tune the locations of the centers. The experimental results on two gene expression datasets indicate that MKMA is capable of obtaining more compact clusters than K-means, Fuzzy K-means, and the other PSO based K-means namely PK-means. MKMA is also demonstrated to attain faster convergence rate and more robustness against the random choice of initial centroids.


Particle Swarm Optimizer Mean Square Error Gene Expression Data Global Search Memetic Algorithm 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Schena, M., Shalon, D., Davis, R.W., et al.: Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray. Science 270(5235), 467–470 (1995)CrossRefGoogle Scholar
  2. 2.
    Milligan, G.W., Cooper, M.C.: A Study of the Comparability of External Criteria for Hierarchical Cluster Analysis. Multivariate Behavioral Research 21, 441–458 (1986)CrossRefGoogle Scholar
  3. 3.
    Fritzke, B.: Growing Cell Structures-a Self-organizing Network for Unsupervised and Supervised Learning. Networks 7, 1141–1160 (1994)Google Scholar
  4. 4.
    MacQueen, J.B.: Some Methods for Classification and Analysis of Multivariate Observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 271–297. University of California Press, Berkeley (1967)Google Scholar
  5. 5.
    Thalamuthu, A., Mukhopadhyay, I., Tseng, G.C.: Evaluation and Comparison of Gene Clustering Methods in Microarray Analysis. Bioinformatics 22(8), 2405–2412 (2006)CrossRefGoogle Scholar
  6. 6.
    Gasch, A.P., Eisen, M.B.: Exploring the Conditional Coregulation of Yeast Gene Expression through Fuzzy K-means Clustering. Genome Biology 3(11), 1–22 (2002)CrossRefGoogle Scholar
  7. 7.
    Du, Z.H., Wang, Y.W., Ji, Z.: PK-means:a New Algorithm for Gene Clustering. Computatoinal Biology and Chemistry 32, 243–247 (2008)CrossRefzbMATHGoogle Scholar
  8. 8.
    Ji, Z., Liao, H.L., Xu, W., Jiang, L.: A Strategy of Particle-pair for Vector Quantization in Image Coding. Acta Electron. Sin. 35(7), 86–89 (2007)Google Scholar
  9. 9.
    Kennedy, J., Eberhart, R.C.: Particle Swarm Optimization. In: Proceedings of IEEE International Conference on Neural Network, pp. 1942–1948. IEEE Service Center, Piscataway (1995)CrossRefGoogle Scholar
  10. 10.
    Moscato, P.: Memetic Algorithm: a Short Introduction. McGraw-Hill, London (1999)Google Scholar
  11. 11.
    Liang, J.J., Qin, A.K., et al.: Comprehensive Learing Particle Swarm Optimizer for Global Optimization of Multimodal Functions. IEEE Transactions on Evolutionary Computation 10(3), 281–295 (2006)CrossRefGoogle Scholar
  12. 12.
    Spellman, P.T., et al.: Comprehensive Identification of Cell Cycle-regulated Genes of the Yeast Saccharomyces Cerevisine by Microarray Hybridization. Mol. Biol. Cell 9, 3273–3297 (1998)CrossRefGoogle Scholar
  13. 13.
    Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T., Tibshirani, R., Bostein, D., Altman, R.B.: Missing Value Estimation Methods for DNA Microarrays. Bioinformatics 17, 520–525 (2001)CrossRefGoogle Scholar
  14. 14.
    Chu, S., et al.: The Transcriptional Program of Sporulation in Budding Yeast. Science 282, 699–705 (1998)CrossRefGoogle Scholar
  15. 15.
    Alizadeh, A.A., et al.: Distinct Types of Diffuse Large B-cell Lymphoma Identified by Gene Expression Profiling. Nature 403, 503–511 (2000)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Zhen Ji
    • 1
  • Wenmin Liu
    • 1
  • Zexuan Zhu
    • 1
  1. 1.Shenzhen City Key Laboratory of Embedded System Design, College of Computer Science and Software EngineeringShenzhen UniversityShenzhenChina

Personalised recommendations