From Treebank Conversion to Automatic Dependency Parsing for Vietnamese

  • Dat Quoc Nguyen
  • Dai Quoc Nguyen
  • Son Bao Pham
  • Phuong-Thai Nguyen
  • Minh Le Nguyen
Conference paper

DOI: 10.1007/978-3-319-07983-7_26

Part of the Lecture Notes in Computer Science book series (LNCS, volume 8455)
Cite this paper as:
Nguyen D.Q., Nguyen D.Q., Pham S.B., Nguyen PT., Le Nguyen M. (2014) From Treebank Conversion to Automatic Dependency Parsing for Vietnamese. In: Métais E., Roche M., Teisseire M. (eds) Natural Language Processing and Information Systems. NLDB 2014. Lecture Notes in Computer Science, vol 8455. Springer, Cham

Abstract

This paper presents a new conversion method that automatically transforms a constituent-based Vietnamese treebank into dependency trees. On a dependency treebank created with this approach, we evaluate two state-of-the-art dependency parsers: MSTParser and MaltParser. Experiments show that MSTParser outperforms MaltParser. To the best of our knowledge, we report the highest results published to date for Vietnamese dependency parsing. In particular, with gold-standard POS tags, we obtain an unlabeled attachment score of 79.08% and a labeled attachment score of 71.66%.
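
The two metrics quoted above can be illustrated with a minimal sketch. It assumes the usual definitions: the unlabeled attachment score (UAS) is the percentage of tokens assigned the correct head, and the labeled attachment score (LAS) is the percentage assigned both the correct head and the correct dependency label. The function name `attachment_scores`, the toy sentence, and the label names are hypothetical and are not taken from the paper; the abstract does not state evaluation details such as punctuation handling, so none are modeled here.

```python
from typing import List, Tuple

# One token's analysis: (index of its head, dependency label).
# Head index 0 stands for the artificial root node.
Arc = Tuple[int, str]


def attachment_scores(gold: List[Arc], pred: List[Arc]) -> Tuple[float, float]:
    """Return (UAS, LAS) as percentages over all tokens."""
    if len(gold) != len(pred):
        raise ValueError("gold and predicted analyses must cover the same tokens")
    if not gold:
        raise ValueError("cannot score an empty sequence")

    # UAS counts tokens whose predicted head matches the gold head.
    head_hits = sum(1 for (g_head, _), (p_head, _) in zip(gold, pred)
                    if g_head == p_head)
    # LAS additionally requires the dependency label to match.
    full_hits = sum(1 for g, p in zip(gold, pred) if g == p)

    n = len(gold)
    return 100.0 * head_hits / n, 100.0 * full_hits / n


# Toy four-token sentence with illustrative (hypothetical) labels.
gold = [(2, "nsubj"), (0, "root"), (4, "det"), (2, "obj")]
pred = [(2, "nsubj"), (0, "root"), (2, "det"), (2, "iobj")]

uas, las = attachment_scores(gold, pred)
print(f"UAS = {uas:.2f}%, LAS = {las:.2f}%")  # UAS = 75.00%, LAS = 50.00%
```

In this toy case three of the four heads are correct, but only two tokens also carry the correct label. LAS can never exceed UAS, which is consistent with the gap between the two scores reported in the abstract.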


Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Dat Quoc Nguyen (1)
  • Dai Quoc Nguyen (1)
  • Son Bao Pham (1)
  • Phuong-Thai Nguyen (1)
  • Minh Le Nguyen (2)

  1. Faculty of Information Technology, University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam
  2. School of Information Science, Japan Advanced Institute of Science and Technology, Japan
