Skip to main content

Novel Interleaving Schemes for Speaker Recognition over Lossy Networks

  • Conference paper
  • 1302 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7143))

Abstract

Cases of cybercrime & terrorism on IP network is increasing day by day. In addition, there is a tendency to fraud phone-banking systems, and gain access to secure premises or accounts, which may be protected through the voice-based biometric system. To minimize these problems, we need a voice/speaker recognition system with utmost accuracy. Number of users of internet applications is also increasing, causes heavy traffic over IP channel almost round the clock. In this paper, the effect of packet loss on the performance of speaker recognition system is demonstrated and to alleviate this degradation we propose novel interleaving schemes. The proposed interleaving schemes help to spread the risk of burst loss in the network which is expected to improve speech quality and hence performance of the speaker recognition system.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mehta, P., Udani, S.: Voice over IP. IEEE Potentials 20, 36–40 (2001)

    Article  Google Scholar 

  2. Borah, D.K., DeLeon, D.: Speaker Identification in the Presence of Packet Loss. In: IEEE 11th Digital Signal Processing Workshop and IEEE Signal Processing Education Workshop, pp. 302–306 (2004)

    Google Scholar 

  3. Aggarwal, C., Olshefski, D., Saha, D., Shae, Z., Yu, P.: CSR: Speaker Recognition from Compressed VoIP Packet Stream. In: IEEE International Conference on Multimedia and Expo, ICME Amsterdam, Netherlands, pp. 970–973 (2005)

    Google Scholar 

  4. Wang, X., Lin, J.: Applying Speaker Recognition Over VoIP Auditing. In: Proceedings of the 6th International Conference on Machine Learning and Cybernetic, Hong Kong, pp. 3577–3581 (2007)

    Google Scholar 

  5. Davidson, J., Peters, J.: Voice Over IP Fundamentals. Cisco Press (2000)

    Google Scholar 

  6. McCree, A., Truong, K., Bryan, G., Barnwell, T.P., Vzswanathanl, V.: A 2.4 kbit/s MELP Coder Candidate for the New U.S. Federal Standard. In: Proceedings International Conference Acoust., Speech and Signal, ICASSP, Atlanta Georgia, pp. 200–203 (1996)

    Google Scholar 

  7. Mayorga, P., Besacier, L., Hernandez, A.: Packet Loss and Compression Effects on Vocal Recognition. In: Proceedings of CERMA (2006)

    Google Scholar 

  8. Hassan, M., Nayandoro, A., Atiquzzaman, M.: Internet Telephony: Services, Technical Challenges, and Products. IEEE Communications Magazine 38, 96–103 (2000)

    Article  Google Scholar 

  9. Sat, B., Wah, B.W.: Analysis and Evaluation of the Skype and Google-talk VoIP Systems. In: ICME, pp. 2153–2156 (2006)

    Google Scholar 

  10. Davis, S.B., Mermelstein, P.: Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Trans. Acoustic, Speech and Signal Processing 28(4), 357–366 (1980)

    Article  Google Scholar 

  11. Keller, H., George, S., Keith, T.: A Multilanguage Study of the Quality of Interleaved MELP Voice Traffice Over a Lossy Network. IEEE Signal Processing Letters 16, 565–568 (2009)

    Article  Google Scholar 

  12. Campbell, W.M., Assaleh, K.T., Broun, C.C.: Speaker Recognition with Polynomial Classifiers. IEEE Transactions on Speech and Audio Processing 10(4), 205–212 (2002)

    Article  Google Scholar 

  13. Martin, A.F., Doddington, G., Kamm, T., Ordowski, M., Przybocki, M.: The DET Curve in Assessment of Detection Task Performance. In: Proceeding Eurospeech 1997, Rhodes, Greece, vol. 4, pp. 1899–1903 (1997)

    Google Scholar 

  14. Wasem, O., Goodman, D., Dvorak, C., Page, H.: The Effect of Waveform Substitution on the Quality of PCM Packet Communications. IEEE Transactions on Acoustics, Speech, and Signal Processing 36, 342–348 (1988)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Patil, H.A., Goswami, P.A., Basu, T.K. (2012). Novel Interleaving Schemes for Speaker Recognition over Lossy Networks. In: Kundu, M.K., Mitra, S., Mazumdar, D., Pal, S.K. (eds) Perception and Machine Intelligence. PerMIn 2012. Lecture Notes in Computer Science, vol 7143. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27387-2_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27387-2_41

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27386-5

  • Online ISBN: 978-3-642-27387-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics