Examining Vulnerability of Voice Verification Systems to Spoofing Attacks by Means of a TTS System

  • Vadim Shchemelinin
  • Konstantin Simonchik
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8113)

Abstract

This paper examines the method of spoofing text-dependent voice verification systems based on the most popular TTS approaches: Unit Selection and HMM. Research of this method shows the possibility of achieving a false acceptance error of 98%-100% if the duration of the TTS database is sufficiently large. A distinctive feature of the method is that it can be fully automatical if used in conjunction with a speech recognition system.

Keywords

spoofing speech synthesis unit selection HMM speaker recognition 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Matveev, Y.N.: Biometric technologies of person identification by voice and other modalities, Vestnik MGTU. Priborostroenie. Biometric Technologies, Special Issue 3(3), 46–61 (2012) (Rus.)Google Scholar
  2. 2.
    Wu, Z., Kinnunen, T., Chng, E.S., Li, H., Ambikairajah, E.: A Study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. In: Proc. APSIPA ASC 2012, Hollywood, USA, pp. 1–5 (December 2012)Google Scholar
  3. 3.
    Kenny, P.: Bayesian speaker verification with heavy tailed priors. In: Proceedings of the Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic (June 2010)Google Scholar
  4. 4.
    Simonchik, K., Pekhovsky, T., Shulipa, A., Afanasyev, A.: Supervized Mixture of PLDA Models for Cross-Channel Speaker Verification. In: 13th Annual Conference of the International Speech Communication Association, Interspeech 2012, Portland, Oregon, September 9-13 (2012)Google Scholar
  5. 5.
    The NIST Year 2012 Speaker Recognition Evaluation Plan (2012), http://www.nist.gov/itl/iad/mig/upload/NIST_SRE12_evalplan-v17-r1.pdf
  6. 6.
    Villalba, J., Lleida, E.: Speaker verification performance degradation against spoofing and tampering attacks. In: FALA 2010 Workshop, pp. 131–134 (2010)Google Scholar
  7. 7.
    Aleinik, S., Matveev, Y., Raev, A.: Method of evaluation of speech signal clipping level. Scientific and Technical Journal of Information Technologies, Mechanics and Optics 79(3), 79–83 (2012) (Rus.) Google Scholar
  8. 8.
    Pelecanos, J., Sridharan, S.: Feature warping for robust speaker verification. In: Proc. Speaker Odyssey, the Speaker Recognition Workshop, Crete, Greece (2001)Google Scholar
  9. 9.
    Matveev, Y.N., Simonchik, K.K.: The speaker identification system for the NIST SRE 2010. In: The 20th International Conference on Computer Graphics and Vision, GraphiCon 2010, Conference Proceedings, St. Petersburg, Russia, September 20-24 (2010)Google Scholar
  10. 10.
    Kozlov, A.V., Kudashev, O.Y., Matveev, Y.N., Pekhovsky, T.S., Simonchik, K.K., Shulipa, A.K.: Speaker recognition system for the NIST SRE 2012. In: SPIIRAS Proceedings, vol. 25(2), pp. 350–370 (March 2013)Google Scholar
  11. 11.
    Chistikov, P.G., Korolkov, E.A.: Data-driven Speech Parameter Generation for Russian Text-to-Speech System. Computational Linguistics and Intellectual Technologies. Papers from the Annual International Conference “Dialogue”, 1 of 2(11(18)), 103–111 (2012)Google Scholar
  12. 12.
    Chistikov, P.G., Korolkov, E.A., Talanov, A.O.: Combining HMM and unit selection tech nologies to increase naturalness of synthesized speech. In: Dialog 2013 (2013)Google Scholar
  13. 13.
    Campbell, J., Higgins, A.: “YOHO Speaker Verification database”, http://www.ldc.upenn.edu/Catalog/catalogEntry.jsp?catalogId=LDC94S16
  14. 14.
    Wu, Z., Chng, E.S., Li, H.: Speaker verification system against two different voice conversion techniques in spoofing attacks, Technical report, http://www3.ntu.edu.sg/home/wuzz/
  15. 15.
    Kinnunen, T., Wu, Z.-Z., Lee, K.A., Sedlak, F., Chng, E.S., Li, H.: Vulnerability of Speaker Verification Systems Against Voice Conversion Spoofing Attacks: the Case of Telephone Speech. In: Proc. ICASSP 2012, Kyoto, Japan, pp. 4401–4404 (March 2012)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2013

Authors and Affiliations

  • Vadim Shchemelinin
    • 1
    • 2
  • Konstantin Simonchik
    • 2
  1. 1.Mechanics and OpticsNational Research University of Information TechnologiesSt. PetersburgRussia
  2. 2.Speech Technology Center LimitedSt. PetersburgRussia

Personalised recommendations