Abstract
Understanding texts written in natural language is a challenging task. Semantic Web technologies, in particular ontologies, can be used to represent knowledge from a specific domain and reason like a human. Ontology population from texts aims to transform textual contents into ontological assertions. This paper deals with an approach of automatic ontology population from French textual descriptions. This approach has been designed to be domain-independent, as long as a domain ontology is provided. It relies on text-based and knowledge-based analyses, which are fully explained. Experiments performed on French classified advertisements are discussed and provide encouraging results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
Experimental files (inputs, outputs for all tested approaches, and GS) are at https://doi.org/10.5281/zenodo.5776752. A zip file with a runnable jar for KOnPote with Aker’s lemmatizer is at https://alec.users.greyc.fr/research/konpote/.
References
Alani, H., et al.: Automatic ontology-based knowledge extraction and tailored biography generation from the web. IEEE Intell. Syst. 18, 14–21 (2003)
Ayadi, A., Samet, A., de Bertrand de Beuvron, F., Zanni-Merk, C.: Ontology population with deep learning-based NLP: a case study on the Biomolecular Network Ontology. Procedia Comput. Sci. 159, 572–581 (2019)
Castano, S., et al.: Multimedia interpretation for dynamic ontology evolution. J. Logic Comput. 19(5), 859–897 (2008)
Chasseray, Y., Barthe-Delanoë, A.M., Négny, S., Le Lann, J.M.: A generic metamodel for data extraction and generic ontology population. J. Inf. Sci. 48(6), 838–856 (2022)
Faria, C., Serra, I., Girardi, R.: A domain-independent process for automatic ontology population from text. Sci. Comput. Program. 95, 26–43 (2014)
Gasmi, H., Laval, J., Bouras, A.: Cold-start cybersecurity ontology population using information extraction with LSTM. In: CSET, Doha, Qatar, pp. 1–6 (2019)
Horridge, M., Bechhofer, S.: The OWL API: a Java API for working with OWL 2 ontologies. In: OWLED, Aachen, DEU, pp. 49–58 (2009)
Horrocks, I., et al.: SWRL: A Semantic Web Rule Language Combining OWL and RuleML. Technical report, World Wide Web Consortium (2004)
Jayawardana, V., et al.: Semi-supervised instance population of an ontology using word vector embedding. In: ICTer, September 2017. IEEE (2017)
Korger, A., Baumeister, J.: Rule-based semantic relation extraction in regulatory documents. In: LWDA. CEUR Workshop Proceedings, September 2021, vol. 2993, pp. 26–37 (2021)
Lubani, M., Noah, S.A.M., Mahmud, R.: Ontology population: approaches and design aspects. J. Inf. Sci. 45, 502–515 (2019)
Makki, J., Alquier, A.M., Prince, V.: Ontology population via NLP techniques in risk management. Int. J. Humanit. Soc. Sci. 3, 212–217 (2009)
Manning, C.D., et al.: The Stanford CoreNLP natural language processing toolkit. In: ACL System Demonstrations, pp. 55–60 (2014)
Oramas, S., Sordo, M., Espinosa-Anke, L.: A rule-based approach to extracting relations from music Tidbits. In: WWW, Florence, Italy, pp. 661–666 (2015)
Reyes-Ortiz, J.A.: Criminal event ontology population and enrichment using patterns recognition from text. IJPRAI 33(11), 1940014 (2019)
Schmid, H.: Probabilistic part-of-speech tagging using decision trees (1994)
Staab, S., Studer, R.: Handbook on Ontologies. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-540-92673-3
Suchanek, F., Ifrim, G., Weikum, G.: LEILA\(:\) learning to extract information by linguistic analysis. In: Workshop on Ontology Learning and Population, Sydney, Australia, pp. 18–25 (2006)
Acknowledgements
We thank Quentin Leroy and Jean-Philippe Kotowicz for their participation in the ontology design, and Enor-Anaïs Carré and Morgan Gueret for the corpus.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Alec, C. (2023). Ontology Population from French Classified Ads. In: Ojeda-Aciego, M., Sauerwald, K., Jäschke, R. (eds) Graph-Based Representation and Reasoning. ICCS 2023. Lecture Notes in Computer Science(). Springer, Cham. https://doi.org/10.1007/978-3-031-40960-8_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-40960-8_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-40959-2
Online ISBN: 978-3-031-40960-8
eBook Packages: Computer ScienceComputer Science (R0)