, Volume 22, Issue 1, pp 157–172

Classification Accuracy Based on Observed Margin

  • J. Shawe-Taylor

DOI: 10.1007/PL00013827

Cite this article as:
Shawe-Taylor, J. Algorithmica (1998) 22: 157. doi:10.1007/PL00013827


Following recent results [10] showing the importance of the fat-shattering dimension in explaining the beneficial effect of a large margin on generalization performance, the current paper investigates how the margin on a test example can be used to give greater certainty of correct classification in the distribution independent model. Hence, generalization analysis is possible at three distinct phases, a priori using a standard pac analysis, after training based on properties of the chosen hypothesis [10], and finally in this paper at testing based on properties of the test example. The results also show that even if the classifier does not classify all of the training examples correctly, the fact that a new example has a larger margin than that on the misclassified test examples, can be used to give very good estimates for the generalization performance in terms of the fat-shattering dimension measured at a scale proportional to the excess margin. The estimate relies on a sufficiently large number of the correctly classified training examples having a margin roughly equal to that used to estimate generalization, indicating that the corresponding output values need to be ``well sampled.''

Key words. Computational learning theory, Generalization, Fat-shattering, Large margin, Pac estimates, Agnostic learning.

Copyright information

© 1998 Springer-Verlag New York Inc.

Authors and Affiliations

  • J. Shawe-Taylor
    • 1
  1. 1.Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX, England.