Finding People Frequently Appearing in News

  • Derya Ozkan
  • Pınar Duygulu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4071)

Abstract

We propose a graph-based method to improve the performance of person queries in large news video collections. The method exploits the multi-modal structure of videos by integrating text and face information. Based on the observation that a person appears more frequently when his or her name is mentioned, we first use the speech transcript text to limit the search space for a query name. We then construct a similarity graph whose nodes correspond to all of the faces in the search space and whose edges correspond to the similarity between faces. Under the assumption that the faces of the queried person are more similar to each other than to other faces, the problem is transformed into finding the densest component in the graph, which corresponds to the images of the query name. The same graph algorithm is applied to detect and remove the faces of anchorpeople in an unsupervised way. The experiments are conducted on 229 news videos provided by NIST for TRECVID 2004. The results show that the proposed method outperforms text-only methods and provides cues for large-scale face recognition.
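The densest-component step can be illustrated with a minimal sketch. The abstract does not say which algorithm or similarity measure is used, so everything here is an assumption made for illustration: faces are compared through a precomputed distance matrix `dist`, an edge is drawn when the distance falls below a hypothetical `threshold`, and the dense component is found with a greedy peeling heuristic (a standard 2-approximation for the densest-subgraph problem with density defined as edges per node). The function names are invented for this example.

```python
from itertools import combinations


def similarity_edges(dist, threshold):
    """Return edges (i, j) between faces whose distance is below threshold.

    Assumption: dist is a symmetric square matrix (list of lists) of
    pairwise face distances; the threshold is a free parameter.
    """
    n = len(dist)
    return [(i, j) for i, j in combinations(range(n), 2)
            if dist[i][j] < threshold]


def densest_subgraph(nodes, edges):
    """Greedy peeling: repeatedly remove the minimum-degree node and keep
    the intermediate node set with the highest density |E| / |V|."""
    adj = {v: set() for v in nodes}
    for u, v in edges:
        if u != v:  # ignore self-loops
            adj[u].add(v)
            adj[v].add(u)
    num_edges = sum(len(nbrs) for nbrs in adj.values()) // 2

    best_nodes, best_density = set(), -1.0
    while adj:
        density = num_edges / len(adj)
        if density > best_density:
            best_nodes, best_density = set(adj), density
        # Pick the node with the fewest remaining neighbours
        # (ties broken by node label, so labels must be comparable).
        victim = min(adj, key=lambda v: (len(adj[v]), v))
        for nbr in adj[victim]:
            adj[nbr].discard(victim)
        num_edges -= len(adj[victim])
        del adj[victim]
    return best_nodes, best_density
```

Called as `densest_subgraph(range(len(dist)), similarity_edges(dist, threshold))`, the sketch returns the set of face indices in the densest group together with its density. Under the same assumptions, running it on faces pooled from many stories would tend to pick out the most frequently recurring face, which is the intuition behind the unsupervised anchorperson removal described above.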

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Derya Ozkan (1)
  • Pınar Duygulu (1)
  1. Department of Computer Engineering, Bilkent University, Ankara, Turkey
