Audio-Visual Identity Verification and Robustness to Imposture

Karam, Walid; Mokbel, Chafic; Greige, Hanna; Chollet, Gérard

doi:10.1007/978-3-642-01793-3_81

Walid Karam^18,19,
Chafic Mokbel¹⁸,
Hanna Greige¹⁸ &
…
Gérard Chollet¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5558))

Included in the following conference series:

International Conference on Biometrics

1735 Accesses
1 Citations

Abstract

The robustness of talking-face identity verification (IV) systems is best evaluated by monitoring their behavior under impostor attacks. We propose a scenario where the impostor uses a still face picture and a sample of speech of the genuine client to transform his/her speech and visual appearance into that of the target client. We propose MixTrans, an original text-independent technique for voice transformation in the cepstral domain, which allows a transformed audio signal to be estimated and reconstructed in the temporal domain. We also propose a face transformation technique that allows a frontal face image of a client to be animated, using principal warps to deform defined MPEG-4 facial feature points based on determined facial animation parameters. The robustness of the talking-face IV system is evaluated under these attacks. Results on the BANCA talking-face database clearly show that such attacks represent a serious challenge and a security threat to IV systems.

Download to read the full chapter text

Chapter PDF

Audiovisual Liveness Detection

Audiovisual synchrony assessment for replay attack detection in talking face biometrics

Article 18 August 2015

An Introduction to Digital Face Manipulation

Keywords

References

Reallusion crazytalk animation studio software, http://www.reallusion.com/crazytalk/
Blouet, R., Mokbel, C., Mokbel, H., Soto, E.S., Chollet, G., Greige, H.: Becars: A free software for speaker verification. In: Proc. ODYSSEY 2004, pp. 145–148 (2004)
Google Scholar
Bookstein, F.: Principal warps: Thin-plate splines and the decomposition of deformations. IEEE Transactions on Pattern Analysis and Machine Intelligence 11(6), 567–585 (1989)
Google Scholar
Bredin, H., Chollet, G.: Making talking-face authentication robust to deliberate imposture. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), pp. 1693–1696 (2008)
Google Scholar
Duchon, J.: Interpolation des fonctions de deux variables suivant le principe de la flexion des plaques minces. R.A.I.R.O. Analyse numérique 10, 5–12 (1976)
Google Scholar
Fauve, B., Bredin, H., Karam, W., Verdet, F., Mayoue, A., Chollet, G., Hennebert, J., Lewis, R., Mason, J., Mokbel, C., Petrovska, D.: Some results from the biosecure talking face evaluation campaign. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), vol. 1, pp. 4137–4140 (2008)
Google Scholar
Lienhart, R., Maydt, J.: An extended set of haar-like features for rapid object detection. In: Proceedings of the International Conference on Image Processing, vol. 1, pp. I–900–I–903(2002)
Google Scholar
Popovici, V., Thiran, J., Bailly-Bailliere, E., Bengio, S., Bimbot, F., Hamouz, M., Kittler, J., Mariethoz, J., Matas, J., Messer, K., Ruiz, B., Poiree, F.: The BANCA database and evaluation protocol. In: Kittler, J., Nixon, M.S. (eds.) AVBPA 2003. LNCS, vol. 2688, pp. 625–638. Springer, Heidelberg (2003)
Google Scholar
Sanderson, C., Paliwal, K.K.: Fast feature extraction method for robust face verification. IEE Electronics Letters 38(25), 1648–1650 (2002)
Google Scholar
Stylianou, Y., Cappe, O., Moulines, E.: Continuous probabilistic transform for voice conversion. IEEE Transactions on Speech and Audio Processing 15(6), 131–142 (1998)
Google Scholar
Tekalp, A., Ostermann, J.: Face and 2-d mesh animation in mpeg-4. Image Communication Journal 15(4-5), 387–421 (2000)
Google Scholar
Verdet, F., Hennebert, J.: Impostures of talking face systems using automatic face animation. In: Proceedings of the IEEE Conference on Biometrics: Theory, Applications and Systems (BTAS 2008) (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Balamand, Deir El-Balamand, Al-Kurah, Lebanon
Walid Karam, Chafic Mokbel & Hanna Greige
CNRS-LTCI, TELECOM ParisTech, 46 rue Barrault, 75634, Paris, France
Walid Karam & Gérard Chollet

Authors

Walid Karam
View author publications
You can also search for this author in PubMed Google Scholar
Chafic Mokbel
View author publications
You can also search for this author in PubMed Google Scholar
Hanna Greige
View author publications
You can also search for this author in PubMed Google Scholar
Gérard Chollet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Vision Laboratory, Facoltà di Architettura di Alghero, Dipartimento di Architettura e Pianificazione (DAP), Università di Sassari, Palazzo del Pou Salit, Piazza Duomo 6, 07041, Alghero (SS), Italy
Massimo Tistarelli
School of Electronics and Computer Science, University of Southampton, SO17 1BJ, Southampton, UK
Mark S. Nixon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Karam, W., Mokbel, C., Greige, H., Chollet, G. (2009). Audio-Visual Identity Verification and Robustness to Imposture. In: Tistarelli, M., Nixon, M.S. (eds) Advances in Biometrics. ICB 2009. Lecture Notes in Computer Science, vol 5558. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01793-3_81

Download citation

DOI: https://doi.org/10.1007/978-3-642-01793-3_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01792-6
Online ISBN: 978-3-642-01793-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Audio-Visual Identity Verification and Robustness to Imposture

Abstract

Chapter PDF