Adaptive Classifier Selection in Large-Scale Hierarchical Classification

  • Ioannis Partalas
  • Rohit Babbar
  • Eric Gaussier
  • Cecile Amblard
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7665)

Abstract

Going beyond the traditional text classification, involving a few tens of classes, there has been a surge of interest in automatic document categorization in large taxonomies where the number of classes range from hundreds of thousands to millions. Due to the complex nature of the learning problem posed in such scenarios, one needs to adapt the conventional classification schemes to suit this domain. This paper presents a novel approach for classifier selection in large hierarchies, which is based on exploiting training data heterogeneity across the hierarchy. We also present a meta-learning framework for further flexibility in classifier selection. The experimental results demonstrate the applicability of our approach, which achieves accuracy comparable to the state-of-the-art and is also significantly faster for prediction.

Keywords

Hierarchical Classification Classifier Selection Meta- learning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Babbar, R., Partalas, I., Gaussier, E., Amblard, C.: On empirical tradeoffs in large scale hierarchical classification. In: ACM CIKM (2012)Google Scholar
  2. 2.
    Bennett, N.P., Nguyen, N.: Refined experts: improving classification in large taxonomies. In: Int. ACM SIGIR Conference, pp. 11–18 (2009)Google Scholar
  3. 3.
    Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: CIKM, pp. 78–87 (2004)Google Scholar
  4. 4.
    Fan, E.R., Chang, W.K., Hsieh, J.C., Wang, R.X., Lin, J.C.P.: LIBLINEAR: A library for large linear classification. JMLR 9, 1871–1874 (2008)MATHGoogle Scholar
  5. 5.
    Liu, Y.T., Yang, Y., Wan, H., Zeng, J.H., Chen, Z., Ma, Y.W.: Support vector machines classification with a very large-scale taxonomy. SIGKDD Explor. Newsl., 36–43 (2005)Google Scholar
  6. 6.
    Ng, Y.A., Jordan, I.M.: On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. In: NIPS, pp. 841–848 (2001)Google Scholar
  7. 7.
    Schaul, T., Schmidhuber, J.: Metalearning. Scholarpedia 5, 4650 (2010)CrossRefGoogle Scholar
  8. 8.
    Secker, A., Davies, N.M., Freitas, A.A., Clark, B.E., Timmis, J., Flower, R.D.: Hierarchical classification of g-protein-coupled receptors with data-driven selection of attributes and classifiers. Int. J. Data Min. Bioinformatics, 91–210 (2010)Google Scholar
  9. 9.
    Xue, R.G., Xing, D., Yang, Q., Yu, Y.: Deep classification in large-scale text hierarchies. In: Int. ACM SIGIR Conference, pp. 619–626 (2008)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Ioannis Partalas
    • 1
  • Rohit Babbar
    • 1
  • Eric Gaussier
    • 1
  • Cecile Amblard
    • 1
  1. 1.LIGUniversité Joseph FourierGrenoble, cedex 9France

Personalised recommendations