A confidence voting process for ranking problems based on support vector machines
In this paper, we deal with ranking problems arising from various data mining applications where the major task is to train a rank-prediction model to assign every instance a rank. We first discuss the merits and potential disadvantages of two existing popular approaches for ranking problems: the ‘Max-Wins’ voting process based on multi-class support vector machines (SVMs) and the model based on multi-criteria decision making. We then propose a confidence voting process for ranking problems based on SVMs, which can be viewed as a combination of the SVM approach and the multi-criteria decision making model. Promising numerical experiments based on the new model are reported.
KeywordsMulti-class classification Ranking “Max-Win” voting Fuzzy voting
Unable to display preview. Download preview PDF.
- Alexe, S., Hammer, P. L., Kogan, A., & Lejeune, M. A. (2003). A nonrecursive regression model for country risk rating (Technical Report). Rutgers University, Center for Operations Research, Piscataway, New Jersey. Google Scholar
- Bottou, L., Cortes, C., Denker, J. S., Drucker, H., Guyon, I., Jackel, L. D., LeCun, Y., Müller, U. A., Säckinger, E., Simard, P., & Vapnik, V. (1994). Comparison of classifier methods: a case study in handwritten digit recognition. In ICPR (pp. 77–87). Google Scholar
- Chang, C., & Lin, C. (2004). User manual for LIBSVM, http://www.csie.ntu.edu.tw/~cjlin/libsvm.
- Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20, 273–297. Google Scholar
- Crammer, K., & Singer, Y. (2001). Pranking with ranking. In Proceedings of the conference on neural information processing systems (NIPS2001). Google Scholar
- Doumpos, M., & Zopounidis, C. (2002). Multicriteria decision aid classification methods. Applied optimization (Vol. 73). Norwell: Kluwer Academic. Google Scholar
- Herbrich, R., Graepel, T., & Obermayer, K. (1999). Support vector learning for ordinal regression, In 9th international conference on artificial neural networks (ICANN 99) (pp. 97–102). Google Scholar
- Herbrich, R., Graepel, T., & Obermayer, K. (2000). Large margin rank boundaries for ordinal regression. In A. Smola, P. Bartlett, B. Schölkopf, & D. Schuurmans (Eds.) Advances in large margin classifiers (pp. 115–132). Cambridge: MIT Press. Google Scholar
- Kressel, U. (1999). Pairwise classification and support vector machines. In B. Scholkopf & C. J. C. Burges (Eds.) Advances in kernel methods: support vector learning (pp. 255–268). Cambridge: MIT Press. Google Scholar
- Newman, D. J., Hettich, S., Blake, C. L., & Merz, C. J. (1998). UCI Repository of Machine Learning Databases, Irvine, CA, University of California, Department of Information and Computer Science. See also http://www.ics.uci.edu/~mlearn/MLRepository.html.
- Peng, J. M., & Wang, X. J. (2005). Generalized utility additive discrimination and its application to country risk classification (Technical report). Department of CAS, McMaster University, Hamilton, Ontario, Canada. Google Scholar
- Platt, J. C., Cristianini, N., & Shawe-Taylor, J. (2000). Large margin DAGs for multiclass classification. In Advances in neural information processing systems (Vol. 12, pp. 547–553). Cambridge: MIT Press. Google Scholar
- Vapnik, V. (1998). Statistical learning theory. New York: Wiley. Google Scholar
- The World Bank (2002). World Development Indicators on CD-ROM, IBRD World Bank, Washington, D.C. Google Scholar