Abstract
This paper deals with the issue of two-way traffic between on the one hand, language resources that have been conceived from a local perspective, i.e. from within a local project or institution, and on the other, a shared framework conceived from a global perspective that supplies such resources for local re-use or enhancement. We believe that a key enabler to such traffic is the choice of an appropriate sharing platform, and here we illustrate the point with respect to a constellation of EU projects that is attempting to enhance the quality and scope of shared resources, and a local project that has some already-developed local functionality. The paper first introduces the underlying projects, then goes on to discuss the proposed platform (U-Compare) whose use is then illustrated for a small module developed for a local project.
Keywords
- Language resources
- Sharing
- Sharing platform
This is a preview of subscription content, access via your institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
University of Malta and Malta Council for Science and Technology.
- 2.
- 3.
Apache UIMA is an Apache-licensed open source implementation of the UIMA specification, which is being developed concurrently by a technical committee within OASIS (http://www.oasis-open.org/committees/uima/).
- 4.
U-Compare is a joint project between the University of Tokyo, the Center for Computational Pharmacology (CCP) at the University of Colorado Health Science Center, and the National Centre for Text Mining (NaCTem) at the University of Manchester.
- 5.
A Java Integrated Development Environment (http://www.eclipse.org/).
- 6.
Navigate to http://u-compare.org/components/index.html for the full list of U-Compare components.
- 7.
See http://mlrs.research.um.edu.mt/index.php?page=6 for the latest updates.
References
Ananiadou, S., Thompson, P., Kano, Y., McNaught, J., Attwood, T.K., Day, P.J.R., Keane, J., Jackson, D., Pettifer, S.: Towards interoperability of European language resources. Ariadne 67 (2011)
Camilleri, J.J.: A computational grammar and lexicon for Maltese. M.Sc. thesis, Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg (2013)
Dannélls, D., Camilleri, J.J.: Verb morphology of Hebrew and Maltese - towards an open source type theoretical resource grammar in GF. In: Proceedings of the Workshop on Language Resources (LRs) and Human Language Technologies (HLT) for Semitic Languages: Status, Updates, and Prospects, LREC 2010, Malta, April 2010
Gatt, A., Borg, C.: Using the MLRS interface. Institute of Linguistics, University of Malta (2011). http://mlrs.research.um.edu.mt/corpusquery/cwb/doc/mlrs_userdoc.pdf
Gatt, A., Céplö, S.: Digital corpora and other electronic resources for Maltese. In: Proceedings of Corpus Linguistics Conference, University of Lancaster, July 2013
Hardie, A.: CQPweb - combining power, flexibility and usability in a corpus analysis tool. http://www.lancs.ac.uk/staff/hardiea/cqpweb-paper.pdf (draft) (forthcoming)
Kano, Y., McCrohon, L., Ananiadou, S., Tsujii, J.: Integrated NLP evaluation system for pluggable evaluation with extensive interoperable toolkit. In: Proceedings of the NAACL HLT Workshop on Software Engineering, Testing, and Quality Assurance for Natural Language Processing, Boulder, Colorado, Association for Computational Linguistics, pp. 22–30. Association for Computational Linguistics, Boulder (2009)
Kano, Y., Baumgartner Jr., W.A., McCrohon, L., Ananiadou, S., Bretonnel Cohen, K., Hunter, L., Tsujii, J.: U-Compare: share and compare text mining tools with UIMA. Bioinformatics 25(15), 1997–1998 (2009)
Kontonatsios, G., Thompson, P., Batista-Navarro, R.T., Mihăilă, C., Korkontzelos, I., Ananiadou, S.: Extending an interoperable platform to facilitate the creation of multilingual and multimodal NLP applications. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, 4–9 August 2013, pp. 43–48 (2013)
Krauwer, S.: The basic language resource kit as the first milestone for the language resources roadmap. Survey Lecture delivered at SPECOM, Moscow, October 2003. http://www.elsnet.org
Mihăilă, C., Kontonatsios, G., Batista-Navarro, R.T., Thompson, P., Korkontzelos, I., Ananiadou, S.: Towards a better understanding of discourse: integrating multiple discourse annotation perspectives using UIMA. In: Proceedings of the 7th Linguistic Annotation Workshop & Interoperability with Discourse, Sofia, Bulgaria, pp. 79–88, 8–9 August 2013 (2013)
Rosner, M., Caruana, J., Fabri, R.: Maltilex: a computational lexicon for Maltese. In: Rosner, M. (ed.) Computational Approaches to Semitic Languages: Proceedings of the Workshop COLING-ACL98, Université de Montréal, Canada, pp. 97–105 (1998)
Acknowledgment
The work described above has been carried out under the auspices of the METANET4U project jointly funded by the EU (grant number 270893) and the Universities of Malta and Manchester.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Rosner, M., Attard, A., Thompson, P., Gatt, A., Ananiadou, S. (2014). Extending a Tool Resource Framework with U-Compare. In: Vetulani, Z., Mariani, J. (eds) Human Language Technology Challenges for Computer Science and Linguistics. LTC 2011. Lecture Notes in Computer Science(), vol 8387. Springer, Cham. https://doi.org/10.1007/978-3-319-08958-4_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-08958-4_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08957-7
Online ISBN: 978-3-319-08958-4
eBook Packages: Computer ScienceComputer Science (R0)