Efficient Clustering of Dataset Based on Differential Evolution

  • Anima Naik
  • Suresh Chandra Satapathy
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 247)


A novel approach to combining feature selection and clustering is presented. It uses selection of weighted Principal Components for features selection and automatic clustering based on Improved DE for clustering in order to reduce the complexity of high dimensional datasets and speed up the DE clustering process. We report significant improvements in total runtime. Moreover, the clustering accuracy of the dimensionality reduction DE clustering algorithm is comparable to the one that uses full dimensional datasets. The efficiency of this approach has been demonstrated with some real life datasets.


Clustering PCs Dimension DE 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Ben-Dor, A., Friedman, N., Yakhini, Z.: Class discovery in gene expression data. In: Procs. RECOMB, pp. 31–38 (2001)Google Scholar
  2. 2.
    Law, M.H., Jain, A.K., Figueiredo, M.A.T.: Feature selection in mixture-based clustering. In: Advances in Neural Information Processing Systems, vol. 15 (2003) (to appear)Google Scholar
  3. 3.
    Heydebreck, A.V., Huber, W., Poustka, A., Vingron, M.: Identifying splits with clear separation: A new class discovery method for gene expression data. Bioinformatics 17 (2001)Google Scholar
  4. 4.
    Kim, S.B., Rattakorn, P.: Unsupervised Feature Selection Using Weighted Principal Components (2010)Google Scholar
  5. 5.
    Das, S., Konar, A., Braham, A.: Automatic Clustering Using an Improved Differential Evolution Algorithm. IEEE Transactions on Systems, Man, and Cybernetics—Part a: Systems and Humans 38(1) (January 2008)Google Scholar
  6. 6.
    Boutsidis, C., Mahoney, M.W., Drineas: Unsupervised Feature Selection for Principal Components Analysis. In: KDD 2008, Las Vegas, Nevada, USA, August 24-27 (2008)Google Scholar
  7. 7.
    Jolliffe, I.T.: Principal Component Analysis. Springer, New York (2002)zbMATHGoogle Scholar
  8. 8.
    Vermaat, M.B., Ion, R.A., Does, R.J.M.M., Klaassen, C.A.J.: A comparison of Shewhart individuals control charts based on normal, non-parametric, and extreme-value theory. Quality and Reliability Engineering International 19, 337–353 (2003)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. 1.MITSGwaliorIndia
  2. 2.Dept of Computer Science and EngineeringANITSThagarapuvalasaIndia

Personalised recommendations