Abstract
Search performance can be greatly improved by describing data with Natural Language Processing (NLP) tools that create new metadata and domain ontologies. We present a methodology that uses domain-specific knowledge to improve user requests. This knowledge is based on concepts extracted from the document itself, which are used as “semantic metadata tags” to annotate XML documents. We describe the process followed to define and add new XML semantic metadata to a digital library of scientific theses. Using these new metadata, an ontology is also constructed by following a methodology. Effective information retrieval is obtained by using an intelligent system based on XML semantic metadata and the domain ontology.
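The annotation idea can be illustrated with a minimal sketch. It is not the authors' implementation: the element names (`thesis`, `section`, `semanticMetadata`, `concept`), the sample document, and the concept list are all hypothetical, and plain substring matching stands in for the NLP concept-extraction tools the abstract refers to.

```python
import xml.etree.ElementTree as ET

# Hypothetical concept list; the paper obtains concepts with NLP extraction tools.
CONCEPTS = ["ontology", "information retrieval", "metadata"]

# Hypothetical thesis fragment; the real document schema is not given in the abstract.
SAMPLE = """<thesis>
  <section id="s1"><p>We build an ontology from extracted metadata.</p></section>
  <section id="s2"><p>Information retrieval benefits from annotation.</p></section>
  <section id="s3"><p>Acknowledgements.</p></section>
</thesis>"""


def annotate(xml_text, concepts):
    """Attach a <semanticMetadata> child listing each concept found in a section."""
    root = ET.fromstring(xml_text)
    # Materialise the list first so the tree is not modified while iterating.
    for section in list(root.iter("section")):
        text = " ".join(section.itertext()).lower()
        found = [c for c in concepts if c in text]
        if found:
            meta = ET.SubElement(section, "semanticMetadata")
            for concept in found:
                ET.SubElement(meta, "concept").text = concept
    return root


def sections_for(root, concept):
    """Return the ids of sections tagged with the given concept."""
    return [
        s.get("id")
        for s in root.iter("section")
        if any(c.text == concept for c in s.iter("concept"))
    ]


annotated = annotate(SAMPLE, CONCEPTS)
print(sections_for(annotated, "ontology"))
```

Once sections carry concept tags, a request can be answered by matching against the tags rather than the raw text. A domain ontology could then broaden the request to related concepts, which this sketch does not attempt.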
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abascal, R., Rumpler, B., Pinon, JM. (2004). Information Retrieval in Digital Theses Based on Natural Language Processing Tools. In: Vicedo, J.L., Martínez-Barco, P., Muñoz, R., Saiz Noeda, M. (eds) Advances in Natural Language Processing. EsTAL 2004. Lecture Notes in Computer Science, vol 3230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30228-5_16
DOI: https://doi.org/10.1007/978-3-540-30228-5_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23498-2
Online ISBN: 978-3-540-30228-5
eBook Packages: Springer Book Archive