Abstract
There are many meaningless modal particles and dittographes in natural spoken language, furthermore ASR (automatic speech recognition) often has some recognition errors and the ASR results have no punctuations. And thus the translation would be rather poor if the ASR results are directly translated by MT (machine translation). Therefore, it is necessary to transform the abnormal ASR results into normative texts to fit machine translation. In this paper, a pretreatment approach which based on conditional random field model was introduced to delete the meaningless modal particles and dittographes, correct the recognition errors, and punctuated the ASR results before machine translation. Experiments show that the MT BLEU of 0.2497 is obtained, that improved by 18.4% over the MT baseline without pretreatment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Xia, Y.Q., Wong, K.F., Li, W.J.: A Phonetic-based Approach to Chinese Chat Test Normalization. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pp. 993–1000 (2006)
Liu, Z., Brasser, M., Zheng, T.F., Xu, M.: A New Implementation Approach of Grammar Generation for Text-based SDS. Computer science 11, 205–209 (2006)
Despres, J., et al.: Modeling northern and southern varieties of Dutch for STT. In: Interspeech 2009, Brighton, UK, pp. 96–99 (September 2009)
Acero, A., Bernstein, N., Chambers, R., Jui, Y.C., Li, X., Odell, J., Nguyen, P., Scholz, O., Zweig, G.: Live search for mobile: Web services by voice on the cellphone. In: Proc. of ICASSP (2008)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proceedings of ICML (2001)
Finkel, J.R., Kleeman, A., Manning, C.D.: Efficient, feature-based, conditional random field parsing. In: Proc. ACL/HLT (2008)
Zhao, H., Huang, C.N., Li, M.: An improved Chinese word segmentation system with conditional random field. In: Proceedings of the Fifth SIGHAN Workshop on Chinese Language (2006)
Hifny, Y., Renals, S.: Speech Recognition using Augmented Conditional Random Fields. IEEE Transactions on Audio, Speech, and Language Processing 17(2), 354–365 (2009)
Blunsom, P., Cohn, T.: Discriminative word alignment with conditional random fields. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics (2006)
Watanabe, Y., Asahara, M., Matsumoto, Y.: A Graph-based Approach to Named Entity Categorization in Wikipedia Using Conditional Random Fields. In: Proc. of EMNLP-CoNLL (2007)
Berger, A.L., Della Pietra, V.J., et al.: A maximum entropy approach to natural language processing. In: Computational linguistics (1996)
Chen, S.F., Rosenfeld, R.: A Gaussian prior for smoothing maximum entropy models. In Technical Report CMUCS (1999)
Zhang, X.-f., Zhan, L., Huang, H.-y.: The Application of CRFs in Part-of-Speech Tagging. In: Proceedings of the International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC 2009), Hangzhou, China, August 26-27, vol. 2, pp. 347–350 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, X., Feng, C., Huang, H. (2010). Pretreatment for Speech Machine Translation. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2010. Lecture Notes in Computer Science(), vol 6422. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16732-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-16732-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16731-7
Online ISBN: 978-3-642-16732-4
eBook Packages: Computer ScienceComputer Science (R0)