Skip to main content

Information Retrieval in Digital Theses Based on Natural Language Processing Tools

  • Conference paper
  • First Online:
Advances in Natural Language Processing (EsTAL 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3230))

Included in the following conference series:

  • 678 Accesses

Abstract

Search performance can be greatly improved by describing data using Natural Language Processing (NLP) tools to create new metadata and domain ontologies. A methodology is presented to use domain specific knowledge to improve user request. This knowledge is based on concepts, extracted from the document itself, used as “semantic metadata tags” in order to annotate XML documents. We present the process followed to define and to add new XML semantic metadata into the digital library of scientific theses. Using these new metadata, an ontology is also constructed by following a methodology. Effective retrieval information is obtained by using an intelligent system based on XML semantic metadata and domain ontology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abascal, R., Rumpler, B., Pinon, J.-M.: An analysis of tools for an automatic extraction of concept in documents for a better knowledge management. In: Proceedings of 2003 IRMA International Conference, Philadelphia Pennsylvania (May 2003)

    Google Scholar 

  2. Blazquez, M., Fernandez, M., Garcia-Pinar, J.-M., Gomez-Perez, A.: Building Ontologies at the Knowledge Level using the Ontology Design Environment. In: Gaines, B.R., Musen, M.A. (eds.) Proceedings of the 11th Banff Knowledge Acquisition for Knowledge-based Systems workshop, KAW 1998, Department of Computer Science, University of Calgary, SRDG Publications (1998)

    Google Scholar 

  3. Condamines, A., Rebeyrolle, J.: Construction d’une Base de Connaissances Terminologiques à partir de Textes: Expérimentation et Definition d’une Méthode. In: Journées Ingénierie des Connaissances et Apprentissage Automatique (JICAA 1997), Roscoff, France, pp. 191–206 (1997)

    Google Scholar 

  4. Ding, Y., Foo, S.: Ontology Research and Development. Part 1 – A Review of Ontology Generation. Journal of Information Science 28(2) (2002)

    Google Scholar 

  5. Eriksson, H., Berglund, E., Nevalainen, P.: Using Knowledge Engineering Support for a Java Documentation Viewer. In: ACM Press (ed.) Proceedings of the 14th International Conference on Software Engineering and Knowledge Engineering, Ischia, Italy, pp. 57–64. ACM Press, New York (2002)

    Google Scholar 

  6. Frantzi, K., Ananiadou, S.: Automatic term recognition using contextual clues. In: Third DELOS Workshop. Cross-Language Information Retrieval, Zurich, March 5-7 (1997)

    Google Scholar 

  7. Gauch, S., Wang, J., Rachakonda, S.M.: A corpus analysis approach for automatic query expansion and its extension to multiple databases. ACM Transactions on Information Systems 17(3), 250 (1999)

    Article  Google Scholar 

  8. Golebiowska, J.: SAMOVAR - Knowledge Capitalization in the Automobile Industry Aided by Ontologies. In: Proceedings of the 12th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2000), Juan-les-Pins, France (October 2000)

    Google Scholar 

  9. Grefenstette, G.: Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, Dordrecht (1994)

    Book  Google Scholar 

  10. Grosso, W., Eriksson, H., Fergerson, R., Gennari, J., Tu, S., Musen, M.: Knowledge Modeling at the Millennium: The Design and Evolution of Protégé. In: Proceedings of the 12th International Workshop on Knowledge Acquisition, Modeling and Management (KAW 1999) (1999)

    Google Scholar 

  11. Gruber, T.R.: A translation approach to portable ontology specifications. Knowledge Acquisition 5(2), 199–220 (1993)

    Article  Google Scholar 

  12. Knowledge Based Systems Inc. The IDEF5 Ontology Description Capture Method Overview. Technical report, KBSI, Texas (1994)

    Google Scholar 

  13. Motta, E.: Reusable Components for Knowledge Modelling. IOS Press, Amsterdam (1999)

    MATH  Google Scholar 

  14. Musen, M.A., Fergerson, R.W., Grosso, W., Noy, N.F., Crubezy, M., Gennari, J.H.: Component-Based Support for Building Knowledge-Acquisition Systems. In: Proceedings of the Conference on Intelligent Information Processing IPP 2000 of the International Federation for Information Processing World Computer Congress (WCC 2000), Beijing (2000)

    Google Scholar 

  15. Noy, N., Fergerson, R., Musen, M.: The knowledge model of Protégé-2000: Combining interoperability and flexibility. In: Dieng, R., Corby, O. (eds.) EKAW 2000. LNCS, vol. 1937, pp. 17–32. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  16. Plante, P., Dumas, L., Plante, A.: Nomino version 4.2.22 (2001), http://www.nominotechnologies.com

  17. Rousselot, F., Frath, P.: Terminologie et Intelligence Artificielle. In: Kleiber, G., Le Queler, N. (dir.) Traits d’Union, pp. 181–192. Presses Universitaires de Caen (2002)

    Google Scholar 

  18. Schuler, W., Smith, J.: Author’s Argumentation Assistant (AAA): A Hypertext-Based Authoring Tool for Argumentative Texts. In: Proceedings of the ECHT 1990: European Conference on Hypertext: Argumentation, Design and Knowledge Acquisition, pp. 137–151. Cambridge University Press, Cambridge (1990)

    Google Scholar 

  19. Shum, S.B., Domingue, J., Motta, E.: Scholarly Discourse as Computable Structure. In: OHS-6/SC-2, pp. 120–128 (2000)

    Google Scholar 

  20. Shum, S.B., Motta, E., Domingue, J.: ScholOnto: an ontology-based digital library server for research documents and discourse. International Journal on Digital Libraries 3(3), 237–248 (2000)

    Article  Google Scholar 

  21. Uschold, M., Gruninger, M.: Ontologies: Principles, Methods and Applications. Knowledge Engineering Review 11(2) (1996)

    Google Scholar 

  22. Uschold, M., King, M.: Towards a Methodology for Building Ontologies. In: Workshop on Basic Ontological Issues in Knowledge Sharing held in conjunction with IJCAI 1995 (1995)

    Google Scholar 

  23. Weinstein, P.: Seed Ontologies: Growing Digital Libraries as Distributed, Intelligent Systems. In: Proceedings of the Second International ACM Digital Library Conference, Philadelphia, PA, USA (July 1997)

    Google Scholar 

  24. Ian Witten, H., Paynter, G.W., Frank, E., Gutwin, C., Nevill-Manning, C.G.: KEA: Practical Automatic Keyphrase Extraction. In: ACM DL, pp. 254–255 (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abascal, R., Rumpler, B., Pinon, JM. (2004). Information Retrieval in Digital Theses Based on Natural Language Processing Tools. In: Vicedo, J.L., Martínez-Barco, P., Muńoz, R., Saiz Noeda, M. (eds) Advances in Natural Language Processing. EsTAL 2004. Lecture Notes in Computer Science(), vol 3230. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30228-5_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30228-5_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23498-2

  • Online ISBN: 978-3-540-30228-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics