Abstract
Misspelling and misconception resulting from similar pronunciation appears frequently in Chinese texts. Without double check-up, this situation is getting even worse with the help of Chinese input method editor. It is hoped that the quality of Chinese writing would be enhanced if an effective automatic error detection and correction mechanism embedded in text editor. Therefore, the burden of manpower to proofread shall be released. Until recently, researches on automatic error detection and correction of Chinese text have undergone many challenges and suffered from bad performance compared with that of Western text editor. In view of the prominent phenomenon in Chinese writing problem, this study proposes a learning model based on Chinese phonemic alphabet. The experimental results demonstrate this model is effective in finding out most of words spelled incorrectly, and furthermore this model improves detection and correction rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cao, F.F.: Instances of interaction between Taiwanese Japan and Taiwanese Mandarin in Taiwan across the span of the last one hundred year. Chinese Study 36(12), 273–297 (2000)
Chang, C.-H.: A New Approach for Automatic Chinese Spelling Correction. In: Proceedings of Natural Language Processing Pacific Rim Symposium 1995, Seoul, Korea, pp. 278–283 (1995)
Chen, K.J., Bai, M.H.: Unknown Word Detection for Chinese by a Corpus-based Learning Method. In: Computational Linguistics and Chinese Language Processing, pp. 27–44 (1998)
Chi, C.: You jian bie zi, 2nd edn. Ming Jen Publications, Inc., Taipei (1980)
Chiang, H.: tou shi:ti xiao jie fei di bu zhi shi cuo wu bai chu. In: Focus on China beijing: BBC CHINESE.com (2006)
Chuang, T.I., Chuang, S.Y.: Yi zi zhi cha. Jian Lin, Taipei (1991)
Fan, S.P.: Xiao yuan chang jian cuo bie zi shou ce. Chinese improvement working group, Hong Kong (1998)
Hsieh, K.P.: Ti wan di qu nian qing ren yu zh(?), ch(?), sh(?) with z(?), c(?), s(?) zhen di bu fen ma?Do young people in Taiwan really confuse zh(?), ch(?), sh(?) with z(?), c(?), s(?)? The World of Chinese Language 90(12), 1–7 (1998)
Hsieh, P.C.: Bie zai xie cuo zi liao. Business Weekly Publications, Inc., Taipei (2001)
Huang, C.N., Chang, H.F.: Zi ran yu yan chu li ji shu di san ge li cheng bei. Foreign Language Teaching and Research. 2005, 180–187 (2002)
Hung, F.L.: Bian zi ji jin. Fu Wen, Kaohsiung (1997)
Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. MIT Press, Cambridge (1999)
Papoulis, A.: Probability, Random Variables, and Stochastic Processes, 2nd edn. McGraw-Hill, New York (1984)
Ssu Ma, T.: Cuo bie zi chu lie, 1st edn. BusinessWeekly Publications, Inc., Taipei (2005)
Ssu Tu, A.J.: Hao wan cuo bie zi you xi. Singtao, Hong Kong (2005)
Tso, H.L.: Cuo bie zi bian zheng. The Commercial Press, Ltd, Taipei (1980)
Wagner, R.A.: Order-n correction for regular languages. Commun. ACM 17, 265–268 (1974)
Wang, H.J.: Gao zhong zhi xue sheng zuo wen cuo bie zi yan jiu-yi gao xiong shi gao zhong zhi xue sheng zuo wen wei li. In: Wang, H.J. (ed.) Junior high school material. vol. Graduate Student, vol. 220, National Kaohsiung Normal University, Kaohsiung (2003)
Witten, I.H., Bell, T.C.: The zero-frequency problem: estimating the probabilities of novel events in adaptive text compression.Information Theory. IEEE Transactions 37, 1085 (1991)
Yang, H.I.: Xue shi zhong wen cheng du qi ye zhu guan yao tou. China times express, Taipei Report (2005)
Zhang, L., Zhou, M., Huang, C., Lu, M.: Approach in automatic detection and correction of errors in Chinese text based on feature and learning. In: Proceedings of the 3rd world congress on Intelligent Control and Automation, Hefei, pp. 2744–2748 (2000)
Zhang, L., Zhou, M., Huang, C., Pan, H.: Automatic detecting/correcting errors in Chinese text by an approximate word-matching algorithm. In: The 38th Annual Meeting of the Association for Computational Linguistics, Hong Kong (2000)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, CM., Wu, MC., Chang, CC. (2007). Error Detection and Correction Based on Chinese Phonemic Alphabet in Chinese Text. In: Torra, V., Narukawa, Y., Yoshida, Y. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2007. Lecture Notes in Computer Science(), vol 4617. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73729-2_44
Download citation
DOI: https://doi.org/10.1007/978-3-540-73729-2_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73728-5
Online ISBN: 978-3-540-73729-2
eBook Packages: Computer ScienceComputer Science (R0)