Abstract
This paper describes initial experiments on voice conservation for patients with advanced-stage laryngeal cancer. The ultimate aim is to create a speech-aid device that is able to “speak” with the patients' former voices. Our initial work focuses on the applicability of speech data from patients with an impaired vocal tract for the purposes of speech synthesis. Preliminary results indicate that an appropriately selected synthesis method can successfully learn a new voice, even from speech data of lower quality.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Hanzlíček, Z., Romportl, J., Matoušek, J. (2013). Voice Conservation: Towards Creating a Speech-Aid System for Total Laryngectomees. In: Kelemen, J., Romportl, J., Zackova, E. (eds) Beyond Artificial Intelligence. Topics in Intelligent Engineering and Informatics, vol 4. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34422-0_14
DOI: https://doi.org/10.1007/978-3-642-34422-0_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34421-3
Online ISBN: 978-3-642-34422-0
eBook Packages: Engineering, Engineering (R0)