Pretreatment for Speech Machine Translation

Zhang, Xiaofei; Feng, Chong; Huang, Heyan

doi:10.1007/978-3-642-16732-4_13

Xiaofei Zhang²²,
Chong Feng²³ &
Heyan Huang²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6422))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1048 Accesses

Abstract

There are many meaningless modal particles and dittographes in natural spoken language, furthermore ASR (automatic speech recognition) often has some recognition errors and the ASR results have no punctuations. And thus the translation would be rather poor if the ASR results are directly translated by MT (machine translation). Therefore, it is necessary to transform the abnormal ASR results into normative texts to fit machine translation. In this paper, a pretreatment approach which based on conditional random field model was introduced to delete the meaningless modal particles and dittographes, correct the recognition errors, and punctuated the ASR results before machine translation. Experiments show that the MT BLEU of 0.2497 is obtained, that improved by 18.4% over the MT baseline without pretreatment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Xia, Y.Q., Wong, K.F., Li, W.J.: A Phonetic-based Approach to Chinese Chat Test Normalization. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pp. 993–1000 (2006)
Google Scholar
Liu, Z., Brasser, M., Zheng, T.F., Xu, M.: A New Implementation Approach of Grammar Generation for Text-based SDS. Computer science 11, 205–209 (2006)
Google Scholar
Despres, J., et al.: Modeling northern and southern varieties of Dutch for STT. In: Interspeech 2009, Brighton, UK, pp. 96–99 (September 2009)
Google Scholar
Acero, A., Bernstein, N., Chambers, R., Jui, Y.C., Li, X., Odell, J., Nguyen, P., Scholz, O., Zweig, G.: Live search for mobile: Web services by voice on the cellphone. In: Proc. of ICASSP (2008)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML (2001)
Google Scholar
Finkel, J.R., Kleeman, A., Manning, C.D.: Efficient, feature-based, conditional random field parsing. In: Proc. ACL/HLT (2008)
Google Scholar
Zhao, H., Huang, C.N., Li, M.: An improved Chinese word segmentation system with conditional random field. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language (2006)
Google Scholar
Hifny, Y., Renals, S.: Speech Recognition using Augmented Conditional Random Fields. IEEE Transactions on Audio, Speech, and Language Processing 17(2), 354–365 (2009)
Article Google Scholar
Blunsom, P., Cohn, T.: Discriminative word alignment with conditional random fields. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics (2006)
Google Scholar
Watanabe, Y., Asahara, M., Matsumoto, Y.: A Graph-based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields. In: Proc. of EMNLP-CoNLL (2007)
Google Scholar
Berger, A.L., Della Pietra, V.J., et al.: A maximum entropy approach to natural language processing. In: Computational linguistics (1996)
Google Scholar
Chen, S.F., Rosenfeld, R.: A Gaussian prior for smoothing maximum entropy models. In Technical Report CMUCS (1999)
Google Scholar
Zhang, X.-f., Zhan, L., Huang, H.-y.: The Application of CRFs in Part-of-Speech Tagging. In: Proceedings of the International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC 2009), Hangzhou, China, August 26-27, vol. 2, pp. 347–350 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Research Center of Computer and Language Information Engineering, Chinese Academy of Sciences, Beijing, China, 100097
Xiaofei Zhang
School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China, 100081
Chong Feng & Heyan Huang

Authors

Xiaofei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chong Feng
View author publications
You can also search for this author in PubMed Google Scholar
Heyan Huang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, 415 Chien-Kung Road, 807, Kaohsiung, Taiwan
Jeng-Shyang Pan
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, #43, Sec.4, Keelung Rd., 106, Taipei, Taiwan
Shyi-Ming Chen
Institute of Informatics, Wroclaw University of Technology, Str. Wyb. Wyspianskiego 27, 50-370, Wroclaw, Poland
Ngoc Thanh Nguyen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, X., Feng, C., Huang, H. (2010). Pretreatment for Speech Machine Translation. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6422. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16732-4_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-16732-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16731-7
Online ISBN: 978-3-642-16732-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics