Supporting the Construction of Spanish Legal Ontologies with Text2Onto
The IST project SEKT (Semantically Enabled Knowledge Technologies) aims at developing semantic technologies by integrating knowledge management, text mining, and human language technology. Tools and methodologies implemented in the SEKT project are employed and optimized in three case studies, one of them being concerned with intelligent integrated decision support for legal professionals. The main goal of this case study is to offer decision support to newly appointed judges in Spain by means of iFAQ, an intelligent Frequently Asked Questions system based on a complex ontology of the legal domain. Building this ontology is a tedious and time-consuming task requiring profound knowledge of legal documents and language. Therefore, any kind of automatic support can significantly increase the efficiency of the knowledge acquisition process. In this paper we present Text2Onto, an open-source tool for ontology learning, and our experiments with legal case study data. The previously existing English version of Text2Onto has been adapted to support the linguistic analysis of Spanish texts, including language-specific algorithms for the extraction of ontological concepts, instances and relations. Text2Onto greatly facilitated the automatic generation of the initial version of the Spanish legal ontology from a given collection of Spanish documents. In further iterative steps which included a mixture of learning and manual effort the ontology has been refined and applied to the real-world case study.
KeywordsNoun Phrase Spanish Language Context Vector Legal Domain Human Language Technology
Unable to display preview. Download preview PDF.
- 1.Casanovas, P., Vallbé, J.-J., Casellas, N., Poblet, M., Blázquez, J.C.M., Benjamins, V., Cobo, J.-M.L.: D10.4.1 SEKT legal case study: After analysis, with the collaboration of Z. Huang, and J. Völker (2006)Google Scholar
- 3.Cimiano, P., Völker, J.: Text2onto - a framework for ontology learning and data-driven change discovery. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 227–238. Springer, Heidelberg (2005)Google Scholar
- 4.Cimiano, P., Völker, J.: Towards large-scale, open-domain and ontology-based named entity classification. In: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2005), pp. 166–172 (September 2005)Google Scholar
- 5.Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A framework and graphical development environment for robust NLP tools and applications. In: Proceedings of the 40th Annual Meeting of the ACL (2002)Google Scholar
- 6.Fellbaum: WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press, Cambridge (1998)Google Scholar
- 8.Lenci, A., Montemagni, S., Pirrelli, V., Venturi, G.: NLP-based ontology learning from legal texts. a case study. In: Pompeu Casanovas, E.F.M.T.S., Biasiotti, M.A. (eds.) Proceedings of the Workshop on Legal Ontologies and Artificial Intelligence Techniques at ICAIL 2007, pp. 113–129 (June 2007)Google Scholar