Changing the Voice of a Subscriber on the Example of an Implementation of the PSOLA Algorithm for the iOS and Android Mobile Platforms

  • Zbigniew Piotrowski
  • Michał Ciołek
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 368)

Abstract

This paper describes the implementation of the PSOLA algorithm for two mobile platforms with embedded operating systems: Android and iOS. In order to mask the voice identity of a telephony subscriber using a virtual voice, a modification of the time scale of the utterance and of the pitch of the speaker have been implemented, with the influence of these modifications on the recognition of the identity by listeners being studied. Mobile platforms were compared in terms of signal processing time, including the read and write times.

Keywords

voice impersonation voice morphing Android iOS smartphones PSOLA 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Hui, Y., Young, S.: Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Transactions on Audio, Speech, and Language Processing 14, 1301–1312 (2006)CrossRefGoogle Scholar
  2. 2.
    Hui, Y., Young, S.: High quality voice morphing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 9–12. IEEE Press, New York (2004)Google Scholar
  3. 3.
    Duxans, H., Bonafonte, A.: Residual Conversion Versus Prediction on Voice Morphing Systems. In: IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1. IEEE Press, New York (2006)Google Scholar
  4. 4.
    Ning, X., Xi, S., Zhen, Y.: A Novel Voice Morphing System Using Bi-GMM for High Quality Transformation. In: Ninth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, pp. 485–489 (2008)Google Scholar
  5. 5.
    Ning, X., Zhen, Y.: A precise estimation of vocal tract parameters for high quality voice morphing. In: 9th International Conference on Signal Processing, pp. 684–687 (2008)Google Scholar
  6. 6.
    Furuya, K., Moriyama, T., Ozawa, S.: Generation of Speaker Mixture Voice using Spectrum Morphing. In: IEEE International Conference on Multimedia and Expo, pp. 344–347. IEEE Press, New York (2007)Google Scholar
  7. 7.
    Drgas, S., Zamorski, D., Dabrowski, A.: Speaker verification using various prosodic kernels, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 1–5 (2011)Google Scholar
  8. 8.
    Drgas, S., Cetnarowicz, D., Dabrowski, A.: Speaker verification based on prosodic features, Signal Processing Algorithms, Architectures, Arrangements, and Applications (SPA), pp. 79–82 (2008)Google Scholar
  9. 9.
    Lopatka, K., Suchomski, P., Czyzewski, A.: Time-domain prosodic modifications for Text-To-Speech Synthesizer, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 73–77 (2010)Google Scholar
  10. 10.
    Kupryjanow, A., Czyzewski, A.: Time-scale modification of speech signals for supporting hearing impaired schoolchildren, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 159–162 (2009)Google Scholar
  11. 11.
    Yinqiu, G., Zhen, Y.: Pitch modification based on syllable units for voice morphing system. In: International Conference on Network and Parallel Computing Workshops, pp. 135–139 (2007)Google Scholar
  12. 12.
    Kumar, K., Jain, J.: Speech Pitch Shifting using Complex Continuous Wavelet Transform. In: Annual IEEE India Conference, pp. 1–4. IEEE Press, New York (2006)CrossRefGoogle Scholar
  13. 13.
    Abe, M.: Speech morphing by gradually changing spectrum parameter and fundamental frequency. In: Fourth International Conference on Spoken Language, vol. 4, pp. 2235–2238 (1996)Google Scholar
  14. 14.
    Yifeng, S., Jia, J., Lianhong, C.: Detection on PSOLA-modified voices by seeking out duplicated fragments. In: International Conference on Systems and Informatics, pp. 2177–2182 (2012)Google Scholar
  15. 15.
    Wang, Y., Yang, S.: Speech synthesis based on PSOLA algorithm and modified pitch parameters. In: International Conference on Computational Problem-Solving, pp. 296–299 (2010)Google Scholar
  16. 16.
    Valbret, H., Moulines, E., Tubach, J.P.: Voice transformation using PSOLA technique. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 145–148 (1992)Google Scholar
  17. 17.
    Celik, M., Sharma, G., Murat Tekalp, A.: Pitch and Duration Modification for Speech Watermarking. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2, pp. 17–20. IEEE Press, New York (2005)Google Scholar
  18. 18.
    Nakano, T., Goto, M.: Vocalistener2: A singing synthesis system able to mimic a user’s singing in terms of voice timbre changes as well as pitch and dynamics. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 453–456. IEEE Press, New York (2011)Google Scholar
  19. 19.
    Lisiecki, B., Meyer, A., Dabrowski, A.: Implementation of sound effects on DSP platform, Signal Processing Algorithms, Architectures, Arrangements, and Applications Conference Proceedings (SPA), pp. 1–3 (2011)Google Scholar
  20. 20.
    Key Global Telecom Indicators for the World Telecommunication Service Sector, http://www.itu.int/ITU-D/ict/statistics/at_glance/KeyTelecom.html (accessed January 30, 2013)
  21. 21.
    Smart phones overtake client PCs in 2011, http://www.canalys.com/newsroom/smart-phones-overtake-client-pcs-2011 (accessed January 30, 2013)
  22. 22.
    Piotrowski, Z., Grabiec, W.: Voice trust in public switched telephone networks. In: Dziech, A., Czyżewski, A. (eds.) MCSS 2012. CCIS, vol. 287, pp. 282–291. Springer, Heidelberg (2012)CrossRefGoogle Scholar
  23. 23.
    Moulines, E., Charpentier, F.: Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication 9(5/6), 453–467 (1990)CrossRefGoogle Scholar
  24. 24.
    Verteletskaya, E., Šimák, B.: Performance Evaluation of Pitch Detection Algorithms, http://access.feld.cvut.cz/view.php?cisloclanku=2009060001 (accessed January 30, 2013)
  25. 25.
    Zölzer, U.: DAFX: Digital Audio Effects. Wiley, New York (2012)Google Scholar
  26. 26.
    http://code.google.com/p/sipdroid/ (accessed January 30, 2013)
  27. 27.
  28. 28.
  29. 29.
    Tyler, J.: XDA Developers’ Android Hacker’s Toolkit: The Complete Guide to Rooting, ROMs and Theming. Wiley, New York (2012)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Zbigniew Piotrowski
    • 1
  • Michał Ciołek
    • 1
  1. 1.Faculty of ElectronicsMilitary University of TechnologyWarsawPoland

Personalised recommendations