Face Association across Unconstrained Video Frames Using Conditional Random Fields

Du, Ming; Chellappa, Rama

doi:10.1007/978-3-642-33786-4_13

Ming Du²¹ &
Rama Chellappa²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7578))

Included in the following conference series:

European Conference on Computer Vision

Abstract

Automatic face association across unconstrained video frames has many practical applications. Recent advances in the area of object detection have made it possible to replace the traditional tracking-based association approaches with the more robust detection-based ones. However, it is still a very challenging task for real-world unconstrained videos, especially if the subjects are in a moving platform and at distances exceeding several tens of meters. In this paper, we present a novel solution based on a Conditional Random Field (CRF) framework. The CRF approach not only gives a probabilistic and systematic treatment of the problem, but also elegantly combines global and local features. When ambiguities in labels cannot be solved by using the face appearance alone, our method relies on multiple contextual features to provide further evidence for association. Our algorithm works in an on-line mode and is able to reliably handle real-world videos. Results of experiments using challenging video data and comparisons with other methods are provided to demonstrate the effectiveness of our method.

Download to read the full chapter text

Chapter PDF

Interactive Face Labeling System in Real-World Videos

Multi-face Tracking with Occlusion Recovery

Multi-Camera Tracking via Online Discriminative Feature and Multi-Cue MRF

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Sivic, J., Everingham, M., Zisserman, A.:“Who are you?” – learning person specific classifiers from video. In: CVPR, pp. 1145–1152 (2009)
Google Scholar
Everingham, M., Sivic, J., Zisserman, A.: “Hello! my name is.. buffy” – automatic naming of characters in tv video. In: BMVC, vol. 3, pp. 899–908 (2006)
Google Scholar
Ramanan, D., Baker, S., Kakade, S.: Leveraging archival video for building face datasets. In: ICCV, pp. 1–8 (2007)
Google Scholar
Zhang, L., Li, Y., Nevatia, R.: Global data association for multi-object tracking using network flows. In: Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Pirsiavash, H., Ramanan, D., Fowlkes, C.: Globally-optimal greedy algorithms for tracking a variable number of objects. In: CVPR, pp. 1201–1208 (2011)
Google Scholar
Yang, B., Huang, C., Nevatia, R.: Learning affinities and dependencies for multi-target tracking using a crf model. In: CVPR, pp. 1233–1240 (2011)
Google Scholar
Breitenstein, M.D., Reichlin, F., Leibe, B., Koller-Meier, E., Gool, L.J.V.: Robust tracking-by-detection using a detector confidence particle filter. In: ICCV (2009)
Google Scholar
Cai, Y., de Freitas, N., Little, J.J.: Robust Visual Tracking for Multiple Targets. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 107–118. Springer, Heidelberg (2006)
Chapter Google Scholar
Andriluka, M., Roth, S., Schiele, B.: People-tracking-by-detection and people-detection-by-tracking. In: CVPR, pp. 1–8 (2008)
Google Scholar
Huang, C., Wu, B., Nevatia, R.: Robust Object Tracking by Hierarchical Association of Detection Responses. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 788–801. Springer, Heidelberg (2008)
Chapter Google Scholar
Song, B., Jeng, T.-Y., Staudt, E., Roy-Chowdhury, A.K.: A Stochastic Graph Evolution Framework for Robust Multi-target Tracking. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 605–619. Springer, Heidelberg (2010)
Chapter Google Scholar
Yilmaz, A., Javed, O., Shah, M.: Object tracking: a survey. ACM Comput. Surv. 38 (2006)
Google Scholar
Zhao, T., Nevatia, R.: Tracking multiple humans in complex situations. PAMI 26, 1208–1221 (2004)
Article Google Scholar
Fitzgibbon, A.W., Zisserman, A.: On Affine Invariant Clustering and Automatic Cast Listing in Movies. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part III. LNCS, vol. 2352, pp. 304–320. Springer, Heidelberg (2002)
Chapter Google Scholar
Berg, T.L., Berg, A.C., Edwards, J., Maire, M., White, R., Teh, Y.W., Learned-Miller, E., Forsyth, D.A.: Names and faces in the news. In: CVPR, vol. 2, pp. 848–854 (2004)
Google Scholar
Sivic, J., Everingham, M., Zisserman, A.: Person Spotting: Video Shot Retrieval for Face Sets. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 226–236. Springer, Heidelberg (2005)
Chapter Google Scholar
Yang, B., Navatia, R.: An online learned crf model for multi-target tracking. In: CVPR (2012)
Google Scholar
Gallagher, A.C., Chen, T.: Using group prior to identify people in consumer images. In: CVPR, pp. 1–8 (2007)
Google Scholar
Anguelov, D., Lee, K.C., Gokturk, S.B., Sumengen, B.: Contextual identity recognition in personal photo albums. In: CVPR, pp. 1–7 (2007)
Google Scholar
Gallagher, A.C., Chen, T.: Using context to recognize people in consumer images. IPSJ Transactions on Computer Vision and Applications 1, 115–126 (2009)
Article Google Scholar
Jepson, A.D., Fleet, D.J., El-Maraghi, T.: Robust online appearance model for visual tracking. In: CVPR, vol. 1, pp. 415–422 (2001)
Google Scholar
Bourdev, L., Malik, J.: Poselets: body part detectors trained using 3d human pose annotations. In: International Conference on Computer Vision (2009)
Google Scholar
Viola, P., Jones, M.J.: Robust real-time face detection. IJCV 57, 137–154 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Automation Research, University of Maryland, College Park, MD, 20742, USA
Ming Du & Rama Chellappa

Authors

Ming Du
View author publications
You can also search for this author in PubMed Google Scholar
Rama Chellappa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Du, M., Chellappa, R. (2012). Face Association across Unconstrained Video Frames Using Conditional Random Fields. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33786-4_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-33786-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33785-7
Online ISBN: 978-3-642-33786-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Face Association across Unconstrained Video Frames Using Conditional Random Fields

Abstract

Chapter PDF

Similar content being viewed by others

Interactive Face Labeling System in Real-World Videos

Multi-face Tracking with Occlusion Recovery

Multi-Camera Tracking via Online Discriminative Feature and Multi-Cue MRF

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Face Association across Unconstrained Video Frames Using Conditional Random Fields

Abstract

Chapter PDF

Similar content being viewed by others

Interactive Face Labeling System in Real-World Videos

Multi-face Tracking with Occlusion Recovery

Multi-Camera Tracking via Online Discriminative Feature and Multi-Cue MRF

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation