Abstract
In this paper, we present models for predicting major phrase boundary location and pause insertion using a stochastic context-free grammar (SCFG) from an input part of speech (POS) sequence. These prediction models were made with similar ideas as both major phrase boundary location and pause insertion have similar characteristics. In these models, word attributes and left/right-branching probability parameters representing stochastic phrasing characteristics are used as input parameters of a feed-forward neural network for the prediction. To obtain the probabilities, first, major phrase characteristics and pause characteristics are learned through the SCFG training using the inside-outside algorithm. Then, the probabilities of each bracketing structure are computed using the SCFG. Experiments were carried out to confirm the effectiveness of these stochastic models for the prediction of major phrase boundary locations and pause locations. In a test predicting major phrase boundaries with unseen data, 92.9% of the major phrase boundaries were correctly predicted with a 16.9% false insertion rate. For pause prediction with unseen data, 85.2% of the pause boundaries were correctly predicted with a 9.1% false insertion rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
K. Hirose, H. Fujisaki, H. Kawai, and M. Yamaguchi. Manifestation of linguistic and para-linguistic information in the voice fundamental frequency contours of spoken Japanese. In Proc. ICSLP, pp. 485–488, 1990.
K. Hakota and H. Sato. Prosodic rules in connected speech synthesis. Trans. IECE Japan, J63-D:715–722, 1980 (in Japanese).
P. Haffner, H. Sawai, A. Waibel, and K. Shikano. Fast back-propagation learning methods for large phonemic neural networks. In Rec. Spring Meeting, Acoust. Soc. Jpn., pp. 27–28, Mar. 1989.
N. Kaiki and Y. Sagisaka. Pause characteristics and local phrase dependency structure in Japanese. In Proc. ICSLP, pp. 357–360, 1992
K. Lari and S. J. Young. The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language, 4:6–656, 1989.
F. Pereira and Y. Schabes. Inside-outside reestimation from partially bracketed corpora. In Proc. ACL, pp. 128–135, 1992.
Y. Sagisaka and N. Kaiki. Optimization of intonation control using statistical F0 resetting characteristics. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processes, pp. 49–52, 1992.
Y. Sagisaka and F. Pereira. Inductive learning of prosodic phrasing characteristics using stochastic context-free grammar. In Rec. Spring Meeting, Acoust. Soc. Jpn., pp. 225–226, Mar. 1994.
K. Suzuki and T. Saito. N-phrase parsing method for Japanese text-to-speech conversion and assignment of prosodic features based on N-phrase structures. Trans. IEICE Japan, J78-D- 11:177–187, Feb. 1995 (in Japanese).
Y. Sagisaka, K. Takeda, M. Abe, S. Katagiri, T. Umeda, and H. Kuwabara. A large-scale Japanese speech database. In Proceedings of the International Conference on Spoken Language Processing, Kobe, Japan, pp. 1089–1092, 1990.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer-Verlag New York, Inc.
About this chapter
Cite this chapter
Fujio, S., Sagisaka, Y., Higuchi, N. (1997). Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar. In: Sagisaka, Y., Campbell, N., Higuchi, N. (eds) Computing Prosody. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2258-3_17
Download citation
DOI: https://doi.org/10.1007/978-1-4612-2258-3_17
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4612-7476-6
Online ISBN: 978-1-4612-2258-3
eBook Packages: Springer Book Archive