Adaptive Classifier Selection in Large-Scale Hierarchical Classification
Going beyond the traditional text classification, involving a few tens of classes, there has been a surge of interest in automatic document categorization in large taxonomies where the number of classes range from hundreds of thousands to millions. Due to the complex nature of the learning problem posed in such scenarios, one needs to adapt the conventional classification schemes to suit this domain. This paper presents a novel approach for classifier selection in large hierarchies, which is based on exploiting training data heterogeneity across the hierarchy. We also present a meta-learning framework for further flexibility in classifier selection. The experimental results demonstrate the applicability of our approach, which achieves accuracy comparable to the state-of-the-art and is also significantly faster for prediction.
KeywordsHierarchical Classification Classifier Selection Meta- learning
Unable to display preview. Download preview PDF.
- 1.Babbar, R., Partalas, I., Gaussier, E., Amblard, C.: On empirical tradeoffs in large scale hierarchical classification. In: ACM CIKM (2012)Google Scholar
- 2.Bennett, N.P., Nguyen, N.: Refined experts: improving classification in large taxonomies. In: Int. ACM SIGIR Conference, pp. 11–18 (2009)Google Scholar
- 3.Cai, L., Hofmann, T.: Hierarchical document categorization with support vector machines. In: CIKM, pp. 78–87 (2004)Google Scholar
- 5.Liu, Y.T., Yang, Y., Wan, H., Zeng, J.H., Chen, Z., Ma, Y.W.: Support vector machines classification with a very large-scale taxonomy. SIGKDD Explor. Newsl., 36–43 (2005)Google Scholar
- 6.Ng, Y.A., Jordan, I.M.: On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. In: NIPS, pp. 841–848 (2001)Google Scholar
- 8.Secker, A., Davies, N.M., Freitas, A.A., Clark, B.E., Timmis, J., Flower, R.D.: Hierarchical classification of g-protein-coupled receptors with data-driven selection of attributes and classifiers. Int. J. Data Min. Bioinformatics, 91–210 (2010)Google Scholar
- 9.Xue, R.G., Xing, D., Yang, Q., Yu, Y.: Deep classification in large-scale text hierarchies. In: Int. ACM SIGIR Conference, pp. 619–626 (2008)Google Scholar