Single and Multiobjective Evolutionary Algorithms for Clustering Biomedical Information with Unknown Number of Clusters

  • María Eugenia Curi
  • Lucía Carozzi
  • Renzo Massobrio
  • Sergio NesmachnowEmail author
  • Grégoire Danoy
  • Marek Ostaszewski
  • Pascal Bouvry
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10835)


This article presents single and multiobjective evolutionary approaches for solving the clustering problem with unknown number of clusters. Simple and ad-hoc operators are proposed, aiming to keep the evolutionary search as simple as possible in order to scale up for solving large instances. The experimental evaluation is performed considering a set of real problem instances, including a real-life problem of analyzing biomedical information in the Parkinson’s disease map project. The main results demonstrate that the proposed evolutionary approaches are able to compute accurate trade-off solutions and efficiently handle the problem instance involving biomedical information.


Clustering Biomedical information Multiobjective 


  1. 1.
    Kaufman, L., Rousseeuw, P.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)CrossRefGoogle Scholar
  2. 2.
    Welch, W.: Algorithmic complexity: Three NP- hard problems in computational statistics. J. Stat. Comput. Simul. 15(1), 17–25 (1982)MathSciNetCrossRefGoogle Scholar
  3. 3.
    Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, New York (2009). Scholar
  4. 4.
    Nesmachnow, S.: An overview of metaheuristics: accurate and efficient methods for optimisation. Int. J. Metaheuristics 3(4), 320–347 (2014)CrossRefGoogle Scholar
  5. 5.
    Hruschka, E., Campello, R., Freitas, A., de Carvalho, A.: A survey of evolutionary algorithms for clustering. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 39(2), 133–155 (2009)CrossRefGoogle Scholar
  6. 6.
    Sheng, W., Liu, X.: A hybrid algorithm for k-medoid clustering of large data sets. In: IEEE Congress on Evolutionary Computation, pp. 77–82 (2004)Google Scholar
  7. 7.
    University of Luxembourg: Parkinson’s disease map project, November 2017
  8. 8.
    Fujita, K., et al.: Integrating pathways of Parkinson’s disease in a molecular interaction map. Mol. Neurobiol. 49(1), 88–102 (2014)CrossRefGoogle Scholar
  9. 9.
    Das, S., Abraham, A., Konar, A.: Metaheuristic Clustering. Studies in Computational Intelligence, vol. 178. Springer, Heidelberg (2009). Scholar
  10. 10.
    Deng, Y., Bard, J.: A reactive GRASP with path relinking for capacitated clustering. J. Heuristics 17(2), 119–152 (2011)CrossRefGoogle Scholar
  11. 11.
    Cowgill, M., Harvey, R., Watson, L.: A genetic algorithm approach to cluster analysis. Comput. Mathematics Appl. 37(7), 99–108 (1999)MathSciNetCrossRefGoogle Scholar
  12. 12.
    Bandyopadhyay, S., Maulik, U.: Nonparametric genetic clustering: comparison of validity indices. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 31(1), 120–125 (2001)CrossRefGoogle Scholar
  13. 13.
    Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.: Multiobjective Genetic Algorithms for Clustering. Springer, Heidelberg (2011). Scholar
  14. 14.
    Ripon, K., Tsang, C.H., Kwong, S., Ip, M.K.: Multi-objective evolutionary clustering using variable-length real jumping genes genetic algorithm. In: 18\(^{th}\) International Conference on Pattern Recognition, pp. 3609–3616 (2006)Google Scholar
  15. 15.
    Handl, J., Knowles, J.: An evolutionary approach to multiobjective clustering. IEEE Trans. Evol. Comput. 11(1), 56–76 (2007)CrossRefGoogle Scholar
  16. 16.
    Korkmaz, E., Du, J., Alhajj, R., Barker, K.: Combining advantages of new chromosome representation scheme and multi-objective genetic algorithms for better clustering. Intell. Data Anal. 10(2), 163–182 (2006)Google Scholar
  17. 17.
    Deaven, D., Ho, K.: Molecular geometry optimization with a genetic algorithm. Phys. Rev. Lett. 75, 288–291 (1995)CrossRefGoogle Scholar
  18. 18.
    Deb, K.: Multi-Objective Optimization Using Evolutionary Algorithms. Wiley, Hoboken (2001)zbMATHGoogle Scholar
  19. 19.
    Ministerio de Vivienda Ordenamiento Territorial y Medio Ambiente (Uruguay): Red de estaciones hidrométricas, November 2017
  20. 20.
    Alcalá, J., et al.: KEEL data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework. J. Multiple-valued Logic Soft Comput. 17(2–3), 255–287 (2010)Google Scholar
  21. 21.
    Luke, S., et al.: ECJ 23: A Java-based Evolutionary Computation Research System. Accessed March 2017
  22. 22.
    Kaufman, L., Rousseeuw, P.: Clustering by means of medoids. In: Statistical Data Analysis Based on the L1-Norm and Related Methods (1987)Google Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  • María Eugenia Curi
    • 1
  • Lucía Carozzi
    • 1
  • Renzo Massobrio
    • 1
  • Sergio Nesmachnow
    • 1
    Email author
  • Grégoire Danoy
    • 2
  • Marek Ostaszewski
    • 3
  • Pascal Bouvry
    • 2
  1. 1.Universidad de la RepúblicaMontevideoUruguay
  2. 2.FSTC/CSC-ILIASUniversity of LuxembourgLuxembourg CityLuxembourg
  3. 3.LCSBUniversity of LuxembourgLuxembourg CityLuxembourg

Personalised recommendations