Improved second-order bounds for prediction with expert advice

Cesa-Bianchi, Nicolò; Mansour, Yishay; Stoltz, Gilles

doi:10.1007/s10994-006-5001-7

Improved second-order bounds for prediction with expert advice

Published: 27 October 2006

Volume 66, pages 321–352, (2007)
Cite this article

Download PDF

Machine Learning Aims and scope Submit manuscript

Improved second-order bounds for prediction with expert advice

Download PDF

Nicolò Cesa-Bianchi¹,
Yishay Mansour² &
Gilles Stoltz³

1501 Accesses
79 Citations
Explore all metrics

Abstract

This work studies external regret in sequential prediction games with both positive and negative payoffs. External regret measures the difference between the payoff obtained by the forecasting strategy and the payoff of the best action. In this setting, we derive new and sharper regret bounds for the well-known exponentially weighted average forecaster and for a second forecaster with a different multiplicative update rule. Our analysis has two main advantages: first, no preliminary knowledge about the payoff sequence is needed, not even its range; second, our bounds are expressed in terms of sums of squared payoffs, replacing larger first-order quantities appearing in previous bounds. In addition, our most refined bounds have the natural and desirable property of being stable under rescalings and general translations of the payoff sequence.

Article PDF

Prediction with Expert Advice: A PDE Perspective

Article 08 August 2019

Online Prediction Problems with Variation

Asymptotically Optimal Strategies for Online Prediction with History-Dependent Experts

Article 11 March 2021

References

Allenberg-Neeman, C., & Neeman B. (2004). Full information game with gains and losses. Algorithmic Learning Theory, 15th International Conference, ALT 2004, Padova, Italy, October 2004, In Proceedings, volume 3244 of Lecture Notes in Artificial Intelligence, pp. 264-278, Springer.
Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R.E. (2002). The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32, 48–77.
Article MATH MathSciNet Google Scholar
Auer, P., Cesa-Bianchi, N., & Gentile, C. (2002). Adaptive and self-confident on-line learning algorithms. Journal of Computer and System Sciences, 64, 48–75.
Article MATH MathSciNet Google Scholar
Cesa-Bianchi, N., Freund, Y., Helmbold, D.P., Haussler, D., Schapire, R., & Warmuth, M.K. (1997). How to use expert advice. Journal of the ACM, 3, 427–485.
Article MathSciNet Google Scholar
Cesa-Bianchi, N., & Lugosi, G. (2003). Potential-based algorithms in on-line prediction and game theory. Machine Learning, 51, 239–261.
Article MATH Google Scholar
Cesa-Bianchi, N., & Lugosi, G. (2006). Prediction, Learning, and Games. Cambridge University Press.
Cesa-Bianchi, N., Lugosi, G., & Stoltz, G. (2005). Minimizing regret with label efficient prediction. IEEE Transactions on Information Theory, 51, 2152–2162.
Article MathSciNet Google Scholar
Cesa-Bianchi, N., Lugosi, G., & Stoltz, G. (2006). Regret minimization under partial monitoring. Mathematics of Operations Research, 31(3), 562–580.
Article MathSciNet Google Scholar
Freedman, D.A. (1975). On tail probabilities for martingales. The Annals of Probability, 3, 100–118.
MATH Google Scholar
Freund, Y., & Schapire, R.E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139.
Article MATH MathSciNet Google Scholar
Helmbold, D.P., Schapire, R.E., Singer, Y., & Warmuth, M. K. (1998). On-line portfolio selection using multiplicative updates. Mathematical Finance, 8,325–344, 1998.
Google Scholar
Littlestone, N., & Warmuth, M.K. (1994). The weighted majority algorithm. Information and Computation, 108, 212–261.
Article MATH MathSciNet Google Scholar
Piccolboni, A., & Schindelhauer, C. (2001). Discrete prediction games with arbitrary feedback and loss. In Proceedings of the 14th Annual Conference on Computational Learning Theory (pp. 208–223).
Vovk, V.G. (1998). A game of prediction with expert advice. Journal of Computer and System Sciences, 56(2), 153–173.
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

DSI, Università di Milano, via Comelico 39, 20135, Milano, Italy
Nicolò Cesa-Bianchi
School of Computer Science, Tel-Aviv University, Tel Aviv, Israel
Yishay Mansour
CNRS and Département de Mathématiques et Applications, Ecole Normale Supérieure, 75005, Paris, France
Gilles Stoltz

Authors

Nicolò Cesa-Bianchi
View author publications
You can also search for this author in PubMed Google Scholar
Yishay Mansour
View author publications
You can also search for this author in PubMed Google Scholar
Gilles Stoltz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gilles Stoltz.

Additional information

Editor: Avrim Blum

An extended abstract appeared in the Proceedings of the 18th Annual Conference on Learning Theory, Springer, 2005. The work of all authors was supported in part by the IST Programme of the European Community, under the PASCAL Network of Excellence, IST-2002-506778.

The work was done while Yishay Mansour was a fellow in the Institute of Advance studies, Hebrew University. His work was also supported by a grant no. 1079/04 from the Israel Science Foundation and an IBM faculty award.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cesa-Bianchi, N., Mansour, Y. & Stoltz, G. Improved second-order bounds for prediction with expert advice. Mach Learn 66, 321–352 (2007). https://doi.org/10.1007/s10994-006-5001-7

Download citation

Received: 07 February 2006
Revised: 11 July 2006
Accepted: 22 September 2006
Published: 27 October 2006
Issue Date: March 2007
DOI: https://doi.org/10.1007/s10994-006-5001-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Improved second-order bounds for prediction with expert advice

Abstract

Article PDF

Similar content being viewed by others

Prediction with Expert Advice: A PDE Perspective

Online Prediction Problems with Variation

Asymptotically Optimal Strategies for Online Prediction with History-Dependent Experts

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Improved second-order bounds for prediction with expert advice

Abstract

Article PDF

Similar content being viewed by others

Prediction with Expert Advice: A PDE Perspective

Online Prediction Problems with Variation

Asymptotically Optimal Strategies for Online Prediction with History-Dependent Experts

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation