Error Detection and Correction Based on Chinese Phonemic Alphabet in Chinese Text

  • Chuen-Min Huang
  • Mei-Chen Wu
  • Ching-Che Chang
Conference paper

DOI: 10.1007/978-3-540-73729-2_44

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4617)
Cite this paper as:
Huang CM., Wu MC., Chang CC. (2007) Error Detection and Correction Based on Chinese Phonemic Alphabet in Chinese Text. In: Torra V., Narukawa Y., Yoshida Y. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2007. Lecture Notes in Computer Science, vol 4617. Springer, Berlin, Heidelberg

Abstract

Misspelling and misconception resulting from similar pronunciation appears frequently in Chinese texts. Without double check-up, this situation is getting even worse with the help of Chinese input method editor. It is hoped that the quality of Chinese writing would be enhanced if an effective automatic error detection and correction mechanism embedded in text editor. Therefore, the burden of manpower to proofread shall be released. Until recently, researches on automatic error detection and correction of Chinese text have undergone many challenges and suffered from bad performance compared with that of Western text editor. In view of the prominent phenomenon in Chinese writing problem, this study proposes a learning model based on Chinese phonemic alphabet. The experimental results demonstrate this model is effective in finding out most of words spelled incorrectly, and furthermore this model improves detection and correction rate.

Keywords

Error detection of Chinese text Error correction of Chinese text language model Chinese phonemic alphabet 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Chuen-Min Huang
    • 1
  • Mei-Chen Wu
    • 1
  • Ching-Che Chang
    • 1
  1. 1.Department of Information Management, National Yunlin University of Science & Technology, TaiwanR.O.C.

Personalised recommendations