Boosting conditional probability estimators

Gutfreund, Dan; Kontorovich, Aryeh; Levy, Ran; Rosen-Zvi, Michal

doi:10.1007/s10472-015-9465-7

Boosting conditional probability estimators

Published: 13 June 2015

Volume 79, pages 129–144, (2017)
Cite this article

Annals of Mathematics and Artificial Intelligence Aims and scope Submit manuscript

Dan Gutfreund¹,
Aryeh Kontorovich²,
Ran Levy¹ &
…
Michal Rosen-Zvi¹

168 Accesses
1 Citation
Explore all metrics

Abstract

In the standard agnostic multiclass model, <instance, label > pairs are sampled independently from some underlying distribution. This distribution induces a conditional probability over the labels given an instance, and our goal in this paper is to learn this conditional distribution. Since even unconditional densities are quite challenging to learn, we give our learner access to <instance, conditional distribution > pairs. Assuming a base learner oracle in this model, we might seek a boosting algorithm for constructing a strong learner. Unfortunately, without further assumptions, this is provably impossible. However, we give a new boosting algorithm that succeeds in the following sense: given a base learner guaranteed to achieve some average accuracy (i.e., risk), we efficiently construct a learner that achieves the same level of accuracy with arbitrarily high probability. We give generalization guarantees of several different kinds, including distribution-free accuracy and risk bounds. None of our estimates depend on the number of boosting rounds and some of them admit dimension-free formulations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust Algorithms via PAC-Bayes and Laplace Distributions

Optimization by Gradient Boosting

Boosting as a kernel-based method

Article 17 May 2019

Aleksandr Y. Aravkin, Giulio Bottegal & Gianluigi Pillonetto

References

Alon, N., Ben-David, S., Cesa-Bianchi, N., Haussler, D.: Scale-sensitive dimensions, uniform convergence, and learnability. J. ACM 44(4), 615–631 (1997)
Article MathSciNet MATH Google Scholar
Bartlett, P., Shawe-Taylor, J.: Generalization performance of support vector machines and other pattern classifiers (1999)
Breiman, L.: Arcing classifier (with discussion and a rejoinder by the author). Ann. Statist. 26, 801–849 (1998)
Article MathSciNet MATH Google Scholar
Das, D., Petrov, S.: Unsupervised part-of-speech tagging with bilingual graph-based projections. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies - volume 1, HLT ’11, pp. 600–609. Association for Computational Linguistics, Stroudsburg (2011)
Google Scholar
Devroye, L., Lugosi, G.: Combinatorial methods in density estimation, springer series in statistics. Springer, New York (2001)
Book MATH Google Scholar
Duffy, N., Helmbold, D.: Boosting methods for regression. Mach. Learn. 47, 153–200 (2002)
Article MATH Google Scholar
Fan, W., Stolfo, S.J., Zhang, J., Chan, P.K.: Adacost: misclassification cost-sensitive boosting. In: ICML, pp. 97–105 (1999)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 55(1), 119–139 (1997)
Article MathSciNet MATH Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28, 337–374 (2000)
Article MathSciNet MATH Google Scholar
Gibbs, A.L., Su, F.E.: On choosing and bounding probability metrics. Int. Stat. Rev. 70(3), 419–435 (2002)
Article MATH Google Scholar
Gottlieb, L.A., Kontorovich, L., Krauthgamer, R.: Efficient classification for metric data. In: COLT, pp. 433–440 (2010)
Gupta, A., Krauthgamer, R., Lee, J.R.: Bounded geometries, fractals, and low-distortion embeddings. In: FOCS, pp. 534–543 (2003)
Kanamori, T.: Deformation of log-likelihood loss function for multiclass boosting. Neural Netw. 23(7), 843–864 (2010)
Article Google Scholar
Krauthgamer, R., Lee, J.R.: Navigating nets: Simple algorithms for proximity search. In: 15th annual ACM-SIAM Symposium on discrete algorithms, pp. 791–801 (2004)
McDiarmid, C.: On the method of bounded differences. In: Siemons, J. (ed.) Surveys in combinatorics of LMS lecture notes series, vol. 141, pp. 148–188. Morgan Kaufmann Publishers, San Mateo (1989)
Google Scholar
Mease, D., Wyner, A.J., Buja, A.: Boosted classification trees and class probability/quantile estimation. J. Mach. Learn. Res. 8, 409–439 (2007)
MATH Google Scholar
Schapire, R.E., Freund, Y., Bartlett, P., Lee, W.S.: Boosting the margin: a new explanation for the effectiveness of voting methods. Ann. Statist. 26(5), 1651–1686 (1998)
Article MathSciNet MATH Google Scholar
Talagrand, M.: New concentration inequalities in product spaces. Invent. Math. 126(3), 505–563 (1996)
Article MathSciNet MATH Google Scholar
Toutanova, K., Cherry, C.: A global model for joint lemmatization and part-of-speech prediction. In: Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP, ACL ’09, pp. 486–494 (2009)

Download references

Author information

Authors and Affiliations

IBM Research, Rueschlikon, Switzerland
Dan Gutfreund, Ran Levy & Michal Rosen-Zvi
Ben-Gurion University of the Negev, Beer Sheva, Israel
Aryeh Kontorovich

Authors

Dan Gutfreund
View author publications
You can also search for this author in PubMed Google Scholar
Aryeh Kontorovich
View author publications
You can also search for this author in PubMed Google Scholar
Ran Levy
View author publications
You can also search for this author in PubMed Google Scholar
Michal Rosen-Zvi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aryeh Kontorovich.

Additional information

A preliminary version was invited to ISAIM 2014. A.K. was partially supported by the Israel Science Foundation (grant No. 1141/12) and a Yahoo Faculty award.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gutfreund, D., Kontorovich, A., Levy, R. et al. Boosting conditional probability estimators. Ann Math Artif Intell 79, 129–144 (2017). https://doi.org/10.1007/s10472-015-9465-7

Download citation

Published: 13 June 2015
Issue Date: March 2017
DOI: https://doi.org/10.1007/s10472-015-9465-7

Keywords

Mathematics Subject Classification (2010)

65C50

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Boosting conditional probability estimators

Abstract

Access this article

Similar content being viewed by others

Robust Algorithms via PAC-Bayes and Laplace Distributions

Optimization by Gradient Boosting

Boosting as a kernel-based method

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification (2010)

Navigation

Boosting conditional probability estimators

Abstract

Access this article

Similar content being viewed by others

Robust Algorithms via PAC-Bayes and Laplace Distributions

Optimization by Gradient Boosting

Boosting as a kernel-based method

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2010)

Search

Navigation