Large-Scale Knowledge Acquisition from Botanical Texts

Role, François; Fernandez Gavilanes, Milagros; Villemonte de la Clergerie, Éric

doi:10.1007/978-3-540-73351-5_36

François Role¹,
Milagros Fernandez Gavilanes² &
Éric Villemonte de la Clergerie³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4592))

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

962 Accesses
1 Citations

Abstract

Free text botanical descriptions contained in printed floras can provide a wealth of valuable scientific information. In spite of this richness, these texts have seldom been analyzed on a large scale using NLP techniques. To fill this gap, we describe how we managed to extract a set of terminological resources by parsing a large corpus of botanical texts. The tools and techniques used are presented as well as the rationale for favoring a deep parsing approach coupled with error mining methods over a simple pattern matching approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kirkup, D., Malcolm, P., Christian, G., Paton, A.: Towards a digital african flora. Taxon 54(2), 457–466 (2005)
Article Google Scholar
Rousse, G., de La Clergerie, É.V.: Analyse automatique de documents botaniques: le projet Biotim. In: Proc. of TIA 2005, Rouen, France, pp. 95–104 (April 2005)
Google Scholar
Daille, B.: Terminology mining. In: Pazienza, M.T. (ed.) Information Extraction in the Web Era. Lectures Notes in Artifial Intelligence, pp. 29–44. Springer, Heidelberg (2003)
Google Scholar
Faure, D., Nédellec, C.: ASIUM: learning subcategorization frames and restrictions of selection. In: Nédellec, C., Rouveirol, C. (eds.) Machine Learning: ECML-98. LNCS, vol. 1398, Springer, Heidelberg (1998)
Google Scholar
Grefenstette, G.: Explorations in Automatic Thesaurus Construction. Kluwer Academic Publishers, Dordrecht (1994)
Google Scholar
Cimiano, P., Staab, S., Hotho, A.: Clustering ontologies from text. In: Proceedings of LREC 2004, pp. 1721–1724 (2004)
Google Scholar
de Marneffe, M.-C., MacCartney, B., Manning, C.D.: Generating typed dependency parses from phrase structure parses. In: Proc. of LREC 2006 (2006)
Google Scholar
Lin, D., Pantel, P.: DIRT - discovery of inference rules from text. In: Proceedings of KDD-01, San Francisco, CA, pp. 323–328 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

L3i, Université de La Rochelle, France
François Role
University of Vigo, Spain
Milagros Fernandez Gavilanes
INRIA Rocquencourt, France
Éric Villemonte de la Clergerie

Authors

François Role
View author publications
You can also search for this author in PubMed Google Scholar
Milagros Fernandez Gavilanes
View author publications
You can also search for this author in PubMed Google Scholar
Éric Villemonte de la Clergerie
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Zoubida Kedad Nadira Lammari Elisabeth Métais Farid Meziane Yacine Rezgui

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Role, F., Fernandez Gavilanes, M., Villemonte de la Clergerie, É. (2007). Large-Scale Knowledge Acquisition from Botanical Texts. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_36

Download citation

DOI: https://doi.org/10.1007/978-3-540-73351-5_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73350-8
Online ISBN: 978-3-540-73351-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics