S-CREAM — Semi-automatic CREAtion of Metadata

Handschuh, Siegfried; Staab, Steffen; Ciravegna, Fabio

doi:10.1007/3-540-45810-7_32

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2473)

Included in the following conference series:

International Conference on Knowledge Engineering and Knowledge Management

Abstract

Richly interlinked, machine-understandable data constitute the basis for the Semantic Web. We provide a framework, S-CREAM, that allows for the creation of metadata and is trainable for a specific domain. Annotating web documents is one of the major techniques for creating metadata on the web. OntoMat-Annotizer, the implementation of S-CREAM, now supports the semi-automatic annotation of web pages. This semi-automatic annotation is based on the information extraction component Amilcare. With the help of Amilcare, OntoMat-Annotizer extracts knowledge structures from web pages through the use of knowledge extraction rules. These rules are the result of a learning cycle based on already annotated pages.
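
The learning cycle described in the abstract can be illustrated with a toy sketch. It is not Amilcare's algorithm and not the OntoMat-Annotizer API: the function names, the concept labels (`hotel_name`, `city`), the sample pages, and the single-word left-context rule format are all assumptions made for illustration. The sketch learns, for each concept, the word that most often precedes an annotated value in already annotated pages, then uses those rules to suggest annotations on a new page for a human annotator to confirm.

```python
from collections import Counter, defaultdict


def learn_rules(annotated_pages):
    """Learn one trigger word per concept from already annotated pages.

    annotated_pages: list of (text, [(concept, value), ...]) pairs.
    Returns {concept: trigger}, where trigger is the word that most
    often appears directly before an annotated value of that concept.
    """
    contexts = defaultdict(Counter)
    for text, annotations in annotated_pages:
        tokens = text.split()
        for concept, value in annotations:
            for i, token in enumerate(tokens):
                if token == value and i > 0:
                    contexts[concept][tokens[i - 1]] += 1
    return {c: counts.most_common(1)[0][0] for c, counts in contexts.items()}


def suggest_annotations(text, rules):
    """Apply learned rules to an unannotated page.

    Returns (concept, value) suggestions; in a semi-automatic setting
    a human would accept or reject each one.
    """
    tokens = text.split()
    suggestions = []
    for concept, trigger in rules.items():
        for i in range(len(tokens) - 1):
            if tokens[i] == trigger:
                suggestions.append((concept, tokens[i + 1]))
    return suggestions


# Hypothetical training pages with manual annotations.
training_pages = [
    ("Hotel Zwei located in Rostock offers rooms",
     [("hotel_name", "Zwei"), ("city", "Rostock")]),
    ("Hotel Adler located in Karlsruhe is small",
     [("hotel_name", "Adler"), ("city", "Karlsruhe")]),
]

rules = learn_rules(training_pages)
suggested = suggest_annotations("Hotel Krone located in Freiburg", rules)
```

Accepted suggestions could be added back to the training pages and the rules relearned, which is the sense in which the cycle is iterative. A real extraction component would induce far richer rules than a single preceding word.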

Author information

Authors and Affiliations

AIFB, University of Karlsruhe, Germany
Siegfried Handschuh & Steffen Staab
Department of Computer Science, University of Sheffield, UK
Fabio Ciravegna

Editor information

Editors and Affiliations

Universidad Politécnica de Madrid, Campus de Montegancedo, s/n, 28660, Boadilla del Monte, Madrid, Spain
Asunción Gómez-Pérez
Intelligent Software Components, S.A. (iSOCO), Francisca Delgado, 11 - 2º, 28108, Alcobendas, Madrid, Spain
V. Richard Benjamins

About this paper

Cite this paper

Handschuh, S., Staab, S., Ciravegna, F. (2002). S-CREAM — Semi-automatic CREAtion of Metadata. In: Gómez-Pérez, A., Benjamins, V.R. (eds) Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web. EKAW 2002. Lecture Notes in Computer Science, vol 2473. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45810-7_32

DOI: https://doi.org/10.1007/3-540-45810-7_32
Published: 13 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44268-4
Online ISBN: 978-3-540-45810-4
eBook Packages: Springer Book Archive
