Skip to main content

Identifying Verb-Preposition Multi-Category Words in Chinese-English Patent Machine Translation

  • Conference paper
Artificial Life and Computational Intelligence (ACALCI 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8955))

  • 1648 Accesses

Abstract

Multi-category words are widely distributed in Chinese patent documents, and identification of them has been one of the difficulties in machine translation (MT). This paper proposes a rule-based method for identifying verb and preposition multi-category words in Chinese-English patent machine translation. Based on principles of boundary perception and according to syntactic and semantic information of multi-category words as well as context information, some reliable disambiguation rules are designed to help the MT system analyze proper categories of words, then proposes adverbial and predicate identification rules to determine and identify the words further. Related experiments and BLEU evaluations show that the method is efficient to recognize verbs and prepositions better, and is also helpful to improve final translation quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jiang, H., Jiang, T., Zhang, K., et al.: Some Key Issues in Chinese-to-English Patent Machine Translation. In: Proceedings of the 9th China Workshop on Machine Translation, CWMT 2013 (2013)

    Google Scholar 

  2. Wang, D.: Chinese to English automatic patent machine translation at SIPO. J. World Patent Information. 31(2), 137–139 (2009)

    Article  Google Scholar 

  3. Goto, I., Lu, B., Chow, K.P., Sumita, E., Tsou, B.K.: Overview of the patent machine translation task at the ntcir-9 workshop. In: Proceedings of 9th NTCIR Workshop Meeting, pp. 559–578 (2011)

    Google Scholar 

  4. Eneko, A., Edmonds, P.: Word sense disambiguation, algorithms and applications. Springer, Heidelberg (2006)

    Google Scholar 

  5. McCarthy, D.: Word Sense Disambiguation: An Overview. J. Language and Linguistics Compass (3/2), 537–558 (2009)

    Google Scholar 

  6. Xia, J.: Automatic Recognition Research on Syntactic Category of Common Words (2012) (in Chinese)

    Google Scholar 

  7. Zhang, L.: Multi-category words processing mechanism combined rule with statistical methods (2002) (in Chinese)

    Google Scholar 

  8. Gan, J., Huang, D.: Automatic Identification of Chinese Prepositional Phrase. J. Journal of Chinese Information Processing 19(4), 17–23 (2005) (in Chinese)

    Google Scholar 

  9. Wen, M., Wu, Y.: Feature-rich Prepositional Phrase Boundary Identification based on SVM. J. Journal of Chinese Information Processing. 23(5), 19–24 (2009) (in Chinese)

    Google Scholar 

  10. Zan, H., Zhang, T., Zhang, K.: Automatic recognition research on preposition’s usage based on combinations of rules and statistics. J. Computer Engineering and Design 34(6), 2153–2157 (2013) (in Chinese)

    Google Scholar 

  11. List, J.: Review of machine translation in patents–Implications for search. J. World Patent Information 34, 193–195 (2012)

    Article  Google Scholar 

  12. Isozaki, H., Hirao, T., Duh, K., et al.: Automatic Evaluation of Translation Quality for Distant Language Pairs. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 944–952. MIT, Massachusetts (2010)

    Google Scholar 

  13. Wei, X., Zhang, Q.: Study on the computer-proceeding rules of V+V in Chinese. J. Application Research of Computers. 1, 43–46 (2006) (in Chinese)

    Google Scholar 

  14. Zhu, Y., Jin, Y.: A Chinese-English patent machine translation system based on the theory of hierarchical network of concepts. J. The Journal of China Universities of Posts and Telecommunications 19(Suppl. 2), 140–146 (2012)

    Article  Google Scholar 

  15. Papineni, K., Roukos, S., Ward, T., et al.: BLEU: a Method for Automatic Evaluation of Machine Translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318 (2002)

    Google Scholar 

  16. Liu, Q., Qian, Y.: Summary on Chinese Information Processing Technology Evaluation. J. Communications of China Computer Federation 02, 11–18 (2008) (in Chinese)

    Google Scholar 

  17. Madnani, N.: iBLEU: Interactively Debugging & Scoring Statistical Machine Translation Systems. In: Proceedings of the Fifth IEEE International Conference on Semantic Computing, pp. 213–214. IEEE Press, USA (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Li, H., Zhu, Y., Jin, Y. (2015). Identifying Verb-Preposition Multi-Category Words in Chinese-English Patent Machine Translation. In: Chalup, S.K., Blair, A.D., Randall, M. (eds) Artificial Life and Computational Intelligence. ACALCI 2015. Lecture Notes in Computer Science(), vol 8955. Springer, Cham. https://doi.org/10.1007/978-3-319-14803-8_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-14803-8_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-14802-1

  • Online ISBN: 978-3-319-14803-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics