A Grammar-Guided Genetic Programming Algorithm for Multi-Label Classification

  • Alberto Cano
  • Amelia Zafra
  • Eva L. Gibaja
  • Sebastián Ventura
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7831)


Multi-label classification is a challenging problem that demands new knowledge discovery methods. This paper presents a Grammar-Guided Genetic Programming algorithm, called G3P-ML, which solves multi-label classification problems using IF-THEN classification rules. G3P-ML is evaluated and compared to other multi-label classification techniques in different application domains. Computational experiments show that G3P-ML often obtains better results than the other algorithms while producing fewer rules.


Keywords: multi-label classification, grammar-guided genetic programming, rule learning





Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Alberto Cano (1)
  • Amelia Zafra (1)
  • Eva L. Gibaja (1)
  • Sebastián Ventura (1)
  1. Department of Computer Science and Numerical Analysis, University of Cordoba, Cordoba, Spain
