User-System Cooperation in Document Annotation Based on Information Extraction

  • Conference paper
Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web (EKAW 2002)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2473)

Abstract

The process of document annotation for the Semantic Web is complex and time-consuming, as it requires a great deal of manual annotation. Information extraction from texts (IE) is a technology used by some very recent systems to reduce the burden of annotation. The integration of IE systems into annotation tools is quite a new development, and the impact of the IE system on the whole annotation process still needs to be thought through. In this paper we first discuss a number of requirements for the use of IE as support for annotation. We then present and discuss a model of interaction that addresses these issues, together with Melita, an annotation framework that implements a methodology for active annotation for the Semantic Web based on IE. Finally, we present an experiment that quantifies the gain from using IE to support human annotators.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ciravegna, F., Dingli, A., Petrelli, D., Wilks, Y. (2002). User-System Cooperation in Document Annotation Based on Information Extraction. In: Gómez-Pérez, A., Benjamins, V.R. (eds) Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web. EKAW 2002. Lecture Notes in Computer Science, vol 2473. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45810-7_15

  • DOI: https://doi.org/10.1007/3-540-45810-7_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44268-4

  • Online ISBN: 978-3-540-45810-4

  • eBook Packages: Springer Book Archive
