Supermartingales in Prediction with Expert Advice

  • Alexey Chernov
  • Yuri Kalnishkan
  • Fedor Zhdanov
  • Vladimir Vovk
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5254)

Abstract

This paper compares two methods of prediction with expert advice, the Aggregating Algorithm and the Defensive Forecasting, in two different settings. The first setting is traditional, with a countable number of experts and a finite number of outcomes. Surprisingly, these two methods of fundamentally different origin lead to identical procedures. In the second setting the experts can give advice conditional on the learner’s future decision. Both methods can be used in the new setting and give the same performance guarantees as in the traditional setting. However, whereas defensive forecasting can be applied directly, the AA requires substantial modifications.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Agarwal, R., Meehan, M., O’Regan, D.: Fixed Point Theory and Applications. Cambridge Tracts in Mathematics, vol. 141. Cambridge University Press, Cambridge (2001)MATHGoogle Scholar
  2. 2.
    Blum, A., Mansour, Y.: From External to Internal Regre. J. Mach. Learn. Res. 8, 1307–1324 (2007)MathSciNetGoogle Scholar
  3. 3.
    Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, Cambridge (2006)MATHGoogle Scholar
  4. 4.
    Dawid, A.P.: The geometry of proper scoring rules. Annals of the Institute of Statistical Mathematics 59, 77–93 (2007)MATHCrossRefMathSciNetGoogle Scholar
  5. 5.
    Foster, D., Vohra, R.: Asymptotic calibration. Biometrika 85, 379–390 (1998)MATHCrossRefMathSciNetGoogle Scholar
  6. 6.
    Foster, D., Vohra, R.: Regret in the online decision problem. Games Econ. Behav. 29, 104–130 (1999)CrossRefMathSciNetGoogle Scholar
  7. 7.
    Gács, P.: Uniform test of algorithmic randomness over a general space. Theoretical Computer Science 341, 91–137 (2005)MATHCrossRefMathSciNetGoogle Scholar
  8. 8.
    Hoeffding, W.: Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association 58, 13–30 (1963)MATHCrossRefMathSciNetGoogle Scholar
  9. 9.
    Levin, L.: Uniform tests of randomness. Soviet Mathematics Doklady 17, 337–340 (1976)MATHGoogle Scholar
  10. 10.
    Li, M., Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications, 2nd edn. Springer, New York (1997)MATHGoogle Scholar
  11. 11.
    Rockafellar, R.: Convex Analysis. Princeton University Press, Princeton (1996)Google Scholar
  12. 12.
    Shafer, G., Vovk, V.: Probability and Finance: It’s Only a Game. Wiley, New York (2001)MATHGoogle Scholar
  13. 13.
    Stoltz, G., Lugosi, G.: Internal Regret in On-Line Portfolio Selection. Machine Learning 59, 125–159 (2005)MATHCrossRefGoogle Scholar
  14. 14.
    Stoltz, G., Lugosi, G.: Learning correlated equilibria in games with compact sets of strategies. Games and Economic Behavior 59, 187–209 (2007)MATHCrossRefMathSciNetGoogle Scholar
  15. 15.
    Vovk, V.: Aggregating Strategies. In: Fulk, M., Case, J. (eds.) Proceedings of the Third Annual Workshop on Computational Learning Theory, San Mateo, CA, pp. 371–383 (1990)Google Scholar
  16. 16.
    Vovk, V.: A game of prediction with expert advice. Journal of Computer and System Sciences 56, 153–173 (1998)MATHCrossRefMathSciNetGoogle Scholar
  17. 17.
    Vovk, V.: Competitive on-line learning with a convex loss function. Technical Report arXiv:cs/0506041v3 [cs.LG], arXiv.org e-Print archive (September 2005)Google Scholar
  18. 18.
    Vovk, V.: On-line regression competitive with reproducing kernel Hilbert spaces. Technical Report arXiv:cs/0511058v2 [cs.LG], arXiv.org e-Print archive (January 2006)Google Scholar
  19. 19.
    Vovk, V.: Metric entropy in competitive on-line prediction. Technical Report arXiv:cs/0609045v1 [cs.LG], arXiv.org e-Print archive (September 2006)Google Scholar
  20. 20.
    Vovk, V.: Continuous and randomized defensive forecasting: unified view. Technical Report arXiv:0708.2353v2 [cs.LG], arXiv.org e-Print archive (August 2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Alexey Chernov
    • 1
  • Yuri Kalnishkan
    • 1
  • Fedor Zhdanov
    • 1
  • Vladimir Vovk
    • 1
  1. 1.Computer Learning Research Centre, Department of Computer Science Royal HollowayUniversity of LondonEghamUK

Personalised recommendations