On the Design of Cascades of Boosted Ensembles for Face Detection

Brubaker, S. Charles; Wu, Jianxin; Sun, Jie; Mullin, Matthew D.; Rehg, James M.

doi:10.1007/s11263-007-0060-1

On the Design of Cascades of Boosted Ensembles for Face Detection

Published: 18 September 2007

Volume 77, pages 65–86, (2008)
Cite this article

International Journal of Computer Vision Aims and scope Submit manuscript

S. Charles Brubaker¹,
Jianxin Wu¹,
Jie Sun¹,
Matthew D. Mullin¹ &
…
James M. Rehg¹

583 Accesses
104 Citations
3 Altmetric
Explore all metrics

Abstract

Cascades of boosted ensembles have become popular in the object detection community following their highly successful introduction in the face detector of Viola and Jones. Since then, researchers have sought to improve upon the original approach by incorporating new methods along a variety of axes (e.g. alternative boosting methods, feature sets, etc.). Nevertheless, key decisions about how many hypotheses to include in an ensemble and the appropriate balance of detection and false positive rates in the individual stages are often made by user intervention or by an automatic method that produces unnecessarily slow detectors. We propose a novel method for making these decisions, which exploits the shape of the stage ROC curves in ways that have been previously ignored. The result is a detector that is significantly faster than the one produced by the standard automatic method. When this algorithm is combined with a recycling method for reusing the outputs of early stages in later ones and with a retracing method that inserts new early rejection points in the cascade, the detection speed matches that of the best hand-crafted detector. We also exploit joint distributions over several features in weak learning to improve overall detector accuracy, and explore ways to improve training time by aggressively filtering features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Amit, Y., & Geman, D. (1999). A computational model for visual selection. Neural Computation, 11, 1691–1715.
Article Google Scholar
Anthony, M. (2004). Generalization error bounds for threshold decision lists. Journal of Machine Learning Research, 5, 189–217.
MathSciNet Google Scholar
Baker, S., & Nayar, S. (1996). Algorithms for pattern rejection. In Proceedings of ICPR (Vol. 2, pp. 869–874).
Bartlett, M., Littlewort, G., Fasel, I., & Movellan, J. (2003). Real time face detection and facial expression recognition: development and application to human-computer interaction.
Blanchard, G., & Blanchard, D. (June 2005). Sequential testing designs for pattern recognition. Annals of Statistics, 33, 1155–1202.
Article MATH MathSciNet Google Scholar
Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classification and regression trees. Monterey: Wadsworth and Brooks.
MATH Google Scholar
Brubaker, S. C., Mullin, M. D., & Rehg, J. M. (2006). Towards optimal training of cascaded detectors. In ECCV (1) (pp. 325–337).
Chen, X., & Yuille, A. L. (2005). A time-efficient cascade for real-time object detection: with applications for the visually impaired. In CVPR (3) (pp. 20–26).
Elad, M., Hel-Or, Y., & Keshet, R. (2002). Pattern detection using a maximal rejection classifier. Pattern Recognition Letters, 23(12), 1459–1471.
Article MATH Google Scholar
Fleuret, F. (2004). Fast binary feature selection with conditional mutual information. Journal of Machine Learning Research, 5, 1531–1555.
MathSciNet Google Scholar
Fleuret, F., & Geman, D. (2002). Fast face detection with precise pose estimation. In Proceedings of ICPR (Vol. 1, pp. 235–238).
Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139.
Article MATH MathSciNet Google Scholar
Froba, B., & Ernst, A. (2004). Face detection with the modified census transform. In The sixth IEEE international conference on automatic face and gesture recognition (pp. 91–96), May 2004.
Gangaputra, S., & Geman, D. (2006). A design principle for coarse-to-fine classification. In Proceedings of CVPR (Vol. 2, pp. 1877–1884).
Grossmann, E. (2004). Automatic design of cascaded classifiers. In International IAPR workshop on statistical pattern recognition, ICPR.
Grossmann, E., Kale, A., & Jaynes, C. (2005). Towards interactive generation of “ground-truth” in background subtraction from partially labeled examples. In Proceedings of ICCV VS-PETS workshop.
Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3, 1157–1182.
Article MATH Google Scholar
Heisele, B., Serre, T., Mukherjee, S., & Poggio, T. (2001). Feature reduction and hierarchy of classifiers for fast object detection in video images. In Proceedings of CVPR (Vol. 2, pp. 18–24).
Keren, D., Osadchy, M., & Gotsman, C. (2001). Antifaces: a novel, fast method for image detection. IEEE Transactions on PAMI, 23(7), 747–761.
Google Scholar
Kienzle, W., Bakir, G., Franz, M., & Schlkopf, B. (2005). Face detection—efficient and rank deficient. In Weiss, Y. (Ed.), NIPS (Vol. 17, pp. 673–680). Cambridge: MIT Press.
Google Scholar
Levi, K., & Weiss, Y. (2004). Learning object detection from a small number of examples: the importance of good features. In Proceedings of CVPR (Vol. 2).
Li, S. Z., & Zhang, Z. Q. (2004). Floatboost learning and statistical face detection. IEEE Transactions on PAMI, 26(9), 1112–1123.
Google Scholar
Lienhart, R., Kuranov, A., & Pisarevsky, V. (2002). Empirical analysis of detection cascades of boosted classifiers for rapid object detection (Technical report). MRL, Intel Labs.
Liu, C., & Shum, H. (2003). Kullback-Leibler boosting. In Proceedings of CVPR (Vol. I, pp. 587–594).
Luo, H. (2005). Optimization design of cascaded classifiers. In Proceedings of CVPR (Vol. 1, pp. 480–485).
Mas-Colell, A., Whinston, M. D., & Green, J. R. (1995). Microeconomic theory. Oxford: Oxford University Press.
Google Scholar
Opelt, A., Fussenegger, M., Pinz, A., & Auer, P. (2004). Weak hypotheses and boosting for generic object detection and recognition. In ECCV (2) (pp. 71–84).
Osadchy, R., Miller, M., & LeCun, Y. (2005). Synergistic face detection and pose estimation with energy-based model selection. In NIPS 17.
Rivest, R. (1987). Learning decision lists. Machine Learning, 2, 229–246.
MathSciNet Google Scholar
Romdhani, S., Torr, P., Schoelkopf, B., & Blake, A. (2001). Computationally efficient face detection. In Proceedings of ICCV (pp. 695–700).
Rowley, H. A., Baluja, S., & Kanade, T. (1998). Neural network-based face detection. IEEE Transactions on PAMI, 20(1), 23–38.
Google Scholar
Schapire, R. E., & Singer, Y. (1999). Improved boosting using confidence-rated predictions. Machine Learning, 37(3), 297–336.
Article MATH Google Scholar
Schneiderman, H. (2004). Feature-centric evaluation for efficient cascaded object detection. In Proceedings of CVPR (Vol. 2, pp. 29–36).
Šochman, J., & Matas, J. (2005). Waldboost-learning for time constrained sequential detection. In Proceedings of CVPR (Vol. 2, pp. 150–156).
Sun, J., Rehg, J. M., & Bobick, A. (2004). Automatic cascade training with perturbation bias. In Proceedings of CVPR (Vol. 2, pp. 276–283).
Sung, K., & Poggio, T. (1998). Example-based learning for view-based human face detection. IEEE Transactions on PAMI, 20(1), 39–51.
Google Scholar
Tu, Z. (2005). Probabilistic boosting-tree: learning discriminative models for classification, recognition, and clustering. In ICCV (pp. 1589–1596).
Vidal-Naquet, M., & Ullman, S. (2003). Object recognition with informative features and linear classification. In Proceedings of ICCV (Vol. 1, pp. 281–288).
Viola, P., & Jones, M. (2002). Fast and robust classification using asymmetric AdaBoost and a detector cascade. In NIPS 14.
Viola, P., & Jones, M. J. (2004). Robust real-time face detection. IJCV, 57(2), 137–154.
Article Google Scholar
Wu, J., Rehg, J. M., & Mullin, M. D. (2004). Learning a rare event detection cascade by direct feature selection. In NIPS 16.
Wu, J., Mullin, M. D., & Rehg, J. M. (2005). Linear asymmetric classifier for cascade detectors. In Proceedings of 22nd international conference on machine learning (pp. 993–1000).
Xiao, R., Zhu, L., & Zhang, H.-J. (2003) Boosting chain learning for object detection. In Proceedings of ICCV (Vol. I, pp. 709–715).

Download references

Author information

Authors and Affiliations

School of Interactive Computing, Georgia Institute of Technology, 85 5th Street NW, Suite 205, Atlanta, GA, 30332-0760, USA
S. Charles Brubaker, Jianxin Wu, Jie Sun, Matthew D. Mullin & James M. Rehg

Authors

S. Charles Brubaker
View author publications
You can also search for this author in PubMed Google Scholar
Jianxin Wu
View author publications
You can also search for this author in PubMed Google Scholar
Jie Sun
View author publications
You can also search for this author in PubMed Google Scholar
Matthew D. Mullin
View author publications
You can also search for this author in PubMed Google Scholar
James M. Rehg
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. Charles Brubaker.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brubaker, S.C., Wu, J., Sun, J. et al. On the Design of Cascades of Boosted Ensembles for Face Detection. Int J Comput Vis 77, 65–86 (2008). https://doi.org/10.1007/s11263-007-0060-1

Download citation

Received: 02 September 2005
Accepted: 18 April 2007
Published: 18 September 2007
Issue Date: May 2008
DOI: https://doi.org/10.1007/s11263-007-0060-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the Design of Cascades of Boosted Ensembles for Face Detection

Abstract

Access this article

Similar content being viewed by others

A Survey on Accelerating the Classifier Training Using Various Boosting Schemes Within Cascades of Boosted Ensembles

Generating highly accurate prediction hypotheses through collaborative ensemble learning

A Boosting-Based Decision Fusion Method for Learning from Large, Imbalanced Face Data Set

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

On the Design of Cascades of Boosted Ensembles for Face Detection

Abstract

Access this article

Similar content being viewed by others

A Survey on Accelerating the Classifier Training Using Various Boosting Schemes Within Cascades of Boosted Ensembles

Generating highly accurate prediction hypotheses through collaborative ensemble learning

A Boosting-Based Decision Fusion Method for Learning from Large, Imbalanced Face Data Set

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation