Abstract
Ambiguities in natural languages make processing (parsing) them a difficult task. Parsing is even more difficult when dealing with a structurally complex natural language such as Arabic. In this paper, we briefly highlight some of the complex structure of Arabic, and we identify different parsing approaches and briefly discuss their limitations. Our goal is to produce a hybrid parser, by combining different parsing approaches, which retains the advantages of data-driven approaches but is guided by a set of grammatical rules to produce more accurate results. We describe a novel technique for directly combining different parsing approaches. Results for our initial experiments that we have conducted in this work, and our plans for future work are also presented.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Aho, A.V., Ullman, J.D.: The Theory of Parsing, Translation, and Compiling, vol. 1. Prentice-Hall, Englewood Cliffs (1972)
Farghaly, A., Shaalan, K.: Arabic natural language processing: challenges and solutions. ACM Comput. Surveys 8(4), 1–22 (2009)
Collins, M.: Head-driven statistical models for natural language parsing. Comput. Linguist. 29(4), 589–637 (2003). http://dx.doi.org/10.1162/089120103322753356
Lee, C., Day, M., Sung, C., Lee, Y., Jiang, T., Wu, C., Shih, C., Chen, Y., Hsu, W.: Boosting Chinese question answering with two lightweight methods: ABSPs and SCO-QAT. ACM Trans. Asian Lang. Inf. Process. (TALIP) 7(4), 12:1–12:29 (2008)
Nivre, J., Hall, J., Nilsson, J., Eryigit, G., Svetoslav, M.: Labeled pseudo-projective dependency parsing with support vector machines, pp. 221–225. Association for Computational Linguistics (2006)
Nivre, J.: Inductive Dependency Parsing. Text, Speech and Language Technology. Springer, Netherlands (2006)
Kaplan, R.M., Riezler, S., King, T.H., Maxwell Iii, J.T., Vasserman, E., Crouch, R.: Speed and accuracy in shallow and deep stochastic parsing. In: Proceedings of Human Langauge Technology and the Conference of the North American Chapter of the Association for Computational Linguistics, HLT-NAACL, pp. 97–104 (2004)
Baptista, M.: On the nature of pro-drop in capeverdean creole. Harv. Working Pap. Linguist. 5, 3–17 (1995)
Daimi, K.: Identifying syntactic ambiguities in single-parse Arabic sentence. Comput. Humanit. 35(3), 333–349 (2001)
Attia, A.M.: Handling Arabic morphological and syntactic ambiguities within the LFG framework with a view to machine translation. Ph.D. Thesis, School of Languages, Linguistics and Cultures, Manchester University (2008)
Ramsay, A., Mansour, H.: Local constraints on Arabic word order. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds.) FinTAL 2006. LNCS (LNAI), vol. 4139, pp. 447–457. Springer, Heidelberg (2006)
Nelken, R., Shieber, S.M.: Arabic diacritization using weighted finite-state transducers. In: Proceedings of the Association for Computational Linguistics Workshop on Computational Approaches to Semitic Languages. Semitic 2005, pp. 79–86. Association for Computational Linguistics, Stroudsburg (2005)
Nivre, J., Hall, J., Nilsson, J.: MaltParser: A Data-Driven Parser-Generator for Dependency Parsing. Springer, Netherland (2006)
MacDonald, R.: Discriminative learning and spanning tree algorithms for dependency parsing. Ph.D. Thesis, Computer and Information Science, the University of Pennsylvania (2006). http://www.cis.upenn.edu/grad/documents/mcdonald.pdf
Øvrelid, L., Kuhn, J., Spreyer, K.: Improving data-driven dependency parsing using large-scale LFG grammars. In: Proceedings of the Association for Computational Linguistics-International Joint Conference on Natural Language Processing 2009 Conference Short Papers, pp. 37–40. Association for Computational Linguistics, Stroudsburg (2009). http://dl.acm.org/citation.cfm?id=1667583.1667597
Sagae, K., Miyao, Y.: HPSG parsing with shallow dependency constraints. In: Proceedings of ACL 2007 (2007)
Marton, Y., Habash, N., Rambow, O.: Improving Arabic dependency parsing with form-based and functional morphological features. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 1586–1596. Association for Computational Linguistics, Portland (2011). http://www.aclweb.org/anthology/P11-1159
Maamouri, M., Bies, A.: Developing an Arabic treebank: methods, guidelines, procedures, and tools. In: Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages, Geneva, pp. 2–9 (2004)
McDonald, R., Lerman, K., Pereira, F.: Multilingual dependency parsing with a two-stage discriminative parser. In: Tenth Conference on Computational Natural Language Learning (CoNLL-X), New York (2006)
Xia, F., Palmer, M.: Converting dependency structures to phrase structures. In: Proceedings of the 1st Human Language Technology Conference (HLT-2001), San Diego, pp. 1–5 (2001)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann Series in Data Management Systems, 2nd edn. Morgan Kaufmann, San Francisco (2005). http://www.amazon.ca/exec/obidos/redirect?tag=citeulike09-20&path=ASIN/0120884070
Ramsay, A.M.: Direct parsing with discontinuous phrases. Nat. Lang. Eng. 5(3), 271–300 (1999)
Acknowledgments
Sardar Jaf’s contribution to this work was supported by the Qatar National Research Fund (grant NPRP 09-046-6-001). Allan Ramsay’s contribution was partially supported from the same grant.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Jaf, S., Ramsay, A. (2016). A Hybrid Approach to Parsing Natural Languages. In: Vetulani, Z., Uszkoreit, H., Kubis, M. (eds) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2013. Lecture Notes in Computer Science(), vol 9561. Springer, Cham. https://doi.org/10.1007/978-3-319-43808-5_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-43808-5_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43807-8
Online ISBN: 978-3-319-43808-5
eBook Packages: Computer ScienceComputer Science (R0)