Detection of Large Segmentation Errors with Score Predictive Model

Matura, Martin; Matoušek, Jindřich

doi:10.1007/978-3-319-24033-6_59

Martin Matura¹⁵ &
Jindřich Matoušek^15,16

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9302))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1808 Accesses

Abstract

This paper investigates a possibility of an utilization of regressive score predictive model (SPM) in a process of detection of large segmentation errors. SPM’s scores of automatically marked boundaries between all speech segments are examined and further elaborated in an effort to discover the best threshold to distinguish between small and large errors. It was shown that the suggested detection method with a proper threshold can be used to detect all large errors for a specific type of a boundary.

This research was supported by the Technology Agency of the Czech Republic, project No. TA01011264 and by the grant of the University of West Bohemia, project No. SGS-2013-032. The access to the MetaCentrum clusters provided under the programme LM2010005 is highly appreciated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Matoušek, J., Romportl, J.: Automatic pitch-synchronous phonetic segmentation. In: Proceedings of 9th Annual Conference of International Speech Communication Association, INTERSPEECH 2008, Brisbane, Australia, pp. 1626–1629. ISCA (2008)
Google Scholar
Matoušek, J., Tihelka, D., Šmídl, L.: On the impact of annotation errors on unit-selection speech synthesis. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds.) TSD 2012. LNCS, vol. 7499, pp. 456–463. Springer, Heidelberg (2012)
Chapter Google Scholar
Lin, C.Y., Jang, J.S.: Automatic phonetic segmentation by score predictive model for the corpora of Mandarin singing voices. IEEE Transactions on Audio, Speech, and Language Processing 15(7), 2151–2159 (2007)
Article Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 27:1–27:27 (2011). http://www.csie.ntu.edu.tw/~cjlin/libsvm
Young, S.J., Evermann, G., Gales, M.J.F., Hain, T., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., Woodland, P.C.: HTK Book (for HTK Version 3.4). The Cambridge University, Cambridge (2006)
Google Scholar
Legát, M., Matoušek, J., Tihelka, D.: On the detection of pitch marks using a robust multi-phase algorithm. Speech Communication 53(4), 552–566 (2011)
Article Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3), 273–297 (1995)
MATH Google Scholar
Matura, M.: Phonetic Segmentation of Speech and Possibilities of its Automatic Correction. Master’s thesis, The University of West Bohemia, Faculty of Applied Sciences, Pilsen (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia, Pilsen, Czech Republic
Martin Matura & Jindřich Matoušek
New Technologies for the Information Society, Faculty of Applied Sciences, University of West Bohemia, Pilsen, Czech Republic
Jindřich Matoušek

Authors

Martin Matura
View author publications
You can also search for this author in PubMed Google Scholar
Jindřich Matoušek
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Martin Matura .

Editor information

Editors and Affiliations

University of West Bohemia, Pilsen, Czech Republic
Pavel Král
University of West Bohemia, Pilsen, Czech Republic
Václav Matoušek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Matura, M., Matoušek, J. (2015). Detection of Large Segmentation Errors with Score Predictive Model. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_59

Download citation

DOI: https://doi.org/10.1007/978-3-319-24033-6_59
Published: 11 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24032-9
Online ISBN: 978-3-319-24033-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics