Face Retrieval in Broadcasting News Video by Fusing Temporal and Intensity Information

  • Duy-Dinh Le
  • Shin’ichi Satoh
  • Michael E. Houle
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4071)


Human faces play an important role in efficiently indexing and accessing video contents, especially broadcasting news video. However, face appearance in real environments exhibits many variations such as pose changes, facial expressions, aging, illumination changes, low resolution and occlusion, making it difficult for current state of the art face recognition techniques to obtain reasonable retrieval results. To handle this problem, this paper proposes an efficient retrieval method by integrating temporal information into facial intensity information. First, representative faces are quickly generated by using facial intensities to organize the face dataset into clusters. Next, temporal information is introduced to reorganize cluster memberships so as to improve overall retrieval performance. For scalability and efficiency, the clustering is based on a recently-proposed model involving correlations among relevant sets (neighborhoods) of data items. Neighborhood queries are handled using an approximate search index. Experiments on the 2005 TRECVID dataset show promising results.


Face Recognition Temporal Information Face Appearance News Video Cluster Candidate 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Zhao, W., Chellappa, R., Phillips, P.J., Rosenfeld, A.: Face recognition: A literature survey. ACM Computing Surveys 35(4), 399–458 (2003)CrossRefGoogle Scholar
  2. 2.
    Yang, J., Chen, M., Hauptmann, A.: Finding person x: Correlating names with visual appearances. In: Proc. Int. Conf. on Image and Video Retrieval (CIVR), pp. 270–278 (2004)Google Scholar
  3. 3.
    Weber, R., Schek, H.J., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proc. Intl. Conf. on Very Large Data Bases (VLDB), pp. 194–205 (1998)Google Scholar
  4. 4.
    Fitzgibbon, A., Zisserman, A.: On Affine Invariant Clustering and Automatic Cast Listing in Movies. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 304–320. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  5. 5.
    Fitzgibbon, A.,, Z.: Joint manifold distance: a new approach to appearance based clustering. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 26–36 (2003)Google Scholar
  6. 6.
    Arandjelovic, O., Zisserman, A.: Automatic face recognition for film character retrieval in feature-length films. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 860–867 (2005)Google Scholar
  7. 7.
    Satoh, S., Kanade, T.: Name-it: Association of face and name in video. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 368–373 (1997)Google Scholar
  8. 8.
    Yang, J., Hauptmann, A.: Naming every individual in news video monologues. In: Proc. ACM International Conference on Multimedia (MM), pp. 580–587 (2004)Google Scholar
  9. 9.
    Yang, J., Yan, R., Hauptmann, A.: Multiple instance learning for labeling faces in broadcasting news video. In: Proc. ACM International Conference on Multimedia (MM), pp. 31–40 (2005)Google Scholar
  10. 10.
    Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, Chichester (1990)Google Scholar
  11. 11.
    Houle, M.E.: A generic query-based model for scalable clustering. Technical Report NII-2006-008E, National Institute of Informatics (2006)Google Scholar
  12. 12.
    Houle, M.E., Sakuma, J.: Fast approximate similarity search in extremely high-dimensional data sets. In: Proc. Int. Conf. on Data Engineering (ICDE), pp. 619–630 (2005)Google Scholar
  13. 13.
  14. 14.
    Le, D.D., Satoh, S.: Multi-stage approach to fast face detection. In: Proc. British Machine Vison Conf. (BMVC), vol. 2, pp. 769–778 (2005)Google Scholar
  15. 15.
    Le, D.D., Satoh, S.: Fusion of local and global features for efficient object detection. In: Proc. SPIE, Applications of Neural Networks and Machine Learning in Image Processing IX, vol. 5673, pp. 106–116 (2005)Google Scholar
  16. 16.
    Rowley, H., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(1), 23–38 (1998)CrossRefGoogle Scholar
  17. 17.
    Turk, M., Pentland, A.: Face recognition using eigenfaces. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR) (1991)Google Scholar
  18. 18.
    Phillips, P., Moon, H., Rizvi, S., Rauss, P.: The feret evaluation methodology for face recognition algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(10), 1094–1104 (2002)Google Scholar
  19. 19.
    Houle, M.E.: Navigating massive data sets via local clustering. In: Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (SIGKDD), pp. 547–552 (2003)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Duy-Dinh Le
    • 1
  • Shin’ichi Satoh
    • 1
    • 2
  • Michael E. Houle
    • 2
  1. 1.Department of InformaticsThe Graduate University for Advanced StudiesTokyoJapan
  2. 2.National Institute of InformaticsTokyoJapan

Personalised recommendations