Abstract
Ontologies can play an important role in information systems: they support various information system processes, particularly information acquisition and integration. Ontologies themselves must be designed, built and maintained, and an important part of the ontology engineering cycle is keeping a handcrafted ontology up to date. We have therefore developed MnM, a tool that supports the ontology maintenance process. MnM extracts information from texts and uses it to populate an ontology, drawing on Natural Language Processing (NLP), Information Extraction and Machine Learning technologies. MnM was tested on an electronic newsletter consisting of news articles that describe events at the Knowledge Media Institute (KMi). With its integrated web-based ontology editor and its open APIs for linking to ontology servers and integrating with information extraction tools, MnM could constitute an important part of an ontology-driven information system.
Copyright information
© 2007 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Vargas-Vera, M., Moreale, E., Stutt, A., Motta, E., Ciravegna, F. (2007). MnM: Semi-Automatic Ontology Population from Text. In: Sharman, R., Kishore, R., Ramesh, R. (eds) Ontologies. Integrated Series in Information Systems, vol 14. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-37022-4_13
DOI: https://doi.org/10.1007/978-0-387-37022-4_13
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-37019-4
Online ISBN: 978-0-387-37022-4
eBook Packages: Computer Science, Computer Science (R0)