Analysis of Mouth Shape Deformation Rate for Generation of Japanese Utterance Images Automatically

Miyazaki, Tsuyoshi; Nakashima, Toyoshiro

doi:10.1007/978-3-319-11265-7_6

Part of the book series: Studies in Computational Intelligence (SCI, volume 578)

Abstract

We have been conducting research on machine lip-reading. In the course of that work, we proposed a method for reproducing utterance images without voice from Japanese kana. First, a sequence of codes called the “Mouth Shapes Sequence Code” is generated; it expresses the order of the mouth shapes formed when a Japanese word is uttered. The utterance images are then generated from the mouth images corresponding to the Mouth Shapes Sequence Code and from deformed mouth shape images produced by morphing. However, the deformation rate of the mouth shapes had been determined experimentally from real utterance images. As a result, some of the generated utterance images showed mouth shape deformation that looked unnatural. In this paper, we analyze the deformation rate of the mouth shapes using real utterance images captured by a high-speed camera, and we propose a method for generating utterance images based on the results. Finally, we report the mean opinion scores given by the subjects and evaluate the effectiveness of the proposed method.
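As a rough illustration of how a deformation rate could drive the in-between frames, the following Python sketch blends two key mouth images at a sequence of rates. It is not the authors' implementation: true morphing also warps shape, whereas this sketch uses only a plain cross-dissolve, and the evenly spaced rates are a placeholder for the rates the chapter measures with a high-speed camera. All names here (`cross_dissolve`, `linear_rates`, `transition_frames`, `mean_opinion_score`) are hypothetical.

```python
from typing import List

# A grayscale image as rows of pixel intensities (assumed representation).
Image = List[List[float]]


def cross_dissolve(src: Image, dst: Image, rate: float) -> Image:
    """Blend two same-sized images; rate 0 returns src, rate 1 returns dst."""
    if not 0.0 <= rate <= 1.0:
        raise ValueError("rate must be in [0, 1]")
    return [
        [(1.0 - rate) * s + rate * d for s, d in zip(row_s, row_d)]
        for row_s, row_d in zip(src, dst)
    ]


def linear_rates(n_frames: int) -> List[float]:
    """Evenly spaced rates from 0 to 1; a placeholder, not measured data."""
    if n_frames < 2:
        raise ValueError("need at least two frames")
    return [i / (n_frames - 1) for i in range(n_frames)]


def transition_frames(src: Image, dst: Image, rates: List[float]) -> List[Image]:
    """Generate one blended frame per deformation rate."""
    return [cross_dissolve(src, dst, r) for r in rates]


def mean_opinion_score(ratings: List[int]) -> float:
    """Arithmetic mean of the subjects' ratings."""
    if not ratings:
        raise ValueError("no ratings given")
    return sum(ratings) / len(ratings)


# Two tiny stand-in "mouth images" (1 x 2 pixels) for demonstration only.
closed_mouth: Image = [[0.0, 0.0]]
open_mouth: Image = [[1.0, 0.5]]
frames = transition_frames(closed_mouth, open_mouth, linear_rates(3))
```

Replacing `linear_rates` with a list of measured rates would change the timing of the transition without touching the blending step, which is the separation the abstract's analysis relies on.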

Author information

Authors and Affiliations

Department of Information and Computer Sciences, Kanagawa Institute of Technology, 1030 Shimo-ogino, Kanagawa, Atsugi, Japan
Tsuyoshi Miyazaki
School of Culture-Information Studies, Sugiyama Jogakuen University, 17-3 Hoshigaoka-motomachi, Chikusa, Nagoya, Aichi, Japan
Toyoshiro Nakashima

Corresponding author

Correspondence to Tsuyoshi Miyazaki.

Editor information

Editors and Affiliations

Software Engineering and Information Technology Institute, Central Michigan University, Mount Pleasant, Michigan, USA
Roger Lee

About this chapter

Cite this chapter

Miyazaki, T., Nakashima, T. (2015). Analysis of Mouth Shape Deformation Rate for Generation of Japanese Utterance Images Automatically. In: Lee, R. (eds) Software Engineering Research, Management and Applications. Studies in Computational Intelligence, vol 578. Springer, Cham. https://doi.org/10.1007/978-3-319-11265-7_6

DOI: https://doi.org/10.1007/978-3-319-11265-7_6
Published: 02 November 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11264-0
Online ISBN: 978-3-319-11265-7
eBook Packages: Engineering, Engineering (R0)
