Abstract
Free text botanical descriptions contained in printed floras can provide a wealth of valuable scientific information. In spite of this richness, these texts have seldom been analyzed on a large scale using NLP techniques. To fill this gap, we describe how we managed to extract a set of terminological resources by parsing a large corpus of botanical texts. The tools and techniques used are presented as well as the rationale for favoring a deep parsing approach coupled with error mining methods over a simple pattern matching approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kirkup, D., Malcolm, P., Christian, G., Paton, A.: Towards a digital african flora. Taxon 54(2), 457–466 (2005)
Rousse, G., de La Clergerie, É.V.: Analyse automatique de documents botaniques: le projet Biotim. In: Proc. of TIA 2005, Rouen, France, pp. 95–104 (April 2005)
Daille, B.: Terminology mining. In: Pazienza, M.T. (ed.) Information Extraction in the Web Era. Lectures Notes in Artifial Intelligence, pp. 29–44. Springer, Heidelberg (2003)
Faure, D., Nédellec, C.: ASIUM: learning subcategorization frames and restrictions of selection. In: Nédellec, C., Rouveirol, C. (eds.) Machine Learning: ECML-98. LNCS, vol. 1398, Springer, Heidelberg (1998)
Grefenstette, G.: Explorations in Automatic Thesaurus Construction. Kluwer Academic Publishers, Dordrecht (1994)
Cimiano, P., Staab, S., Hotho, A.: Clustering ontologies from text. In: Proceedings of LREC 2004, pp. 1721–1724 (2004)
de Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: Proc. of LREC 2006 (2006)
Lin, D., Pantel, P.: DIRT - discovery of inference rules from text. In: Proceedings of KDD-01, San Francisco, CA, pp. 323–328 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Role, F., Fernandez Gavilanes, M., Villemonte de la Clergerie, É. (2007). Large-Scale Knowledge Acquisition from Botanical Texts. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_36
Download citation
DOI: https://doi.org/10.1007/978-3-540-73351-5_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73350-8
Online ISBN: 978-3-540-73351-5
eBook Packages: Computer ScienceComputer Science (R0)