Multiobjective Genetic Programming for Natural Language Parsing and Tagging
Parsing and Tagging are very important tasks in Natural Language Processing. Parsing amounts to searching the correct combination of grammatical rules among those compatible with a given sentence. Tagging amounts to labeling each word in a sentence with its lexical category and, because many words belong to more than one lexical class, it turns out to be a disambiguation task. Because parsing and tagging are related tasks, its simultaneous resolution can improve the results of both of them. This work aims developing a multiobjective genetic program to perform simultaneously statistical parsing and tagging. It combines the statistical data about grammar rules and about tag sequences to guide the search of the best structure. Results show that any of the implemented multiobjective optimization models improve on the results obtained in the resolution of each problem separately.
KeywordsMultiobjective Optimization Parse Tree Syntactic Category Aggregative Function Grammar Rule
Unable to display preview. Download preview PDF.
- 3.Charniak, E.: Statistical Language Learning. MIT Press, Cambridge (1993)Google Scholar
- 5.Dalrymple, M.: How much can tagging help parsing? Technical report, Department of Computer Science, King’s College, London (2004)Google Scholar
- 8.Fonseca, C.M., Fleming, P.J.: Genetic algorithms for multiobjective optimization: Formulation, discussion and generalization. In: Genetic Algorithms: Proc. of the Fifth Int. Conf., pp. 416–423. Morgan Kaufmann, San Francisco (1993)Google Scholar
- 9.Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of english: The penn treebank. Computational Linguistics 19(2), 313–330 (1994)Google Scholar
- 10.Sampson, G.: English for the Computer. Clarendon Press, Oxford (1995)Google Scholar