Abstract
Information extraction approaches are heavily used to gather product information on the Web, especially focusing on technical product specifications. If requesting different sources for retrieving such specifications, the outcome is of varying formats (different languages, units, etc.). The problem of how to bring such information sets into a unique, interchangeable format is not considered in many extraction systems. We develop a generic process for semantically integrating heterogeneous product specifications with the help of a product information ontology. The approach is based on a number of measures for detecting the right product attributes in the ontology to be matched with the extracted specifications and finally normalizing the specifications’ values (e.g., concerning units). The feasibility of our approach is proven in a federated product search prototype called Fedseeko.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Euzenat, J., Shvaiko, P.: Ontology matching, 1st edn. Springer, Heidelberg (2007)
Peukert, E., Berthold, H., Rahm, E.: Rewrite techniques for performance optimization of schema matching processes. In: Proceedings of the 13th EDBT, pp. 453–464. ACM, New York (2010)
Shvaiko, P., Euzenat, J.: A survey of schema-based matching approaches. Journal on Data Semantics 4, 146–171 (2005)
Do, H.H., Rahm, E.: Coma - a system for flexible combination of schema matching approaches. In: Proceedings of the 28th VLDB, VLDB Endowment, pp. 610–621 (2002)
Madhavan, J., Bernstein, P.A., Rahm, E.: Generic schema matching with cupid. In: Proceedings of the 27th VLDB, pp. 49–58. Morgan Kaufmann Publishers Inc., San Francisco (2001)
Giunchiglia, F., Shvaiko, P., Yatskevich, M.: S-match: an algorithm and an implementation of semantic matching. In: Bussler, C.J., Davies, J., Fensel, D., Studer, R. (eds.) ESWS 2004. LNCS, vol. 3053, pp. 61–75. Springer, Heidelberg (2004)
Euzenat, J., Valtchev, P.: Similarity-based ontology alignment in owl-lite. In: Proceedings of the 16th ECAI, pp. 333–337. IOS Press, Amsterdam (2004)
Ehrig, M., Sure, Y.: Ontology mapping - an integrated approach. In: Bussler, C., Davies, J., Fensel, D., Studer, R. (eds.) ESWS 2004. LNCS, vol. 3053, pp. 76–91. Springer, Heidelberg (2004)
Ehrig, M., Staab, S.: QOM – quick ontology mapping. In: McIlraith, S.A., Plexousakis, D., van Harmelen, F. (eds.) ISWC 2004. LNCS, vol. 3298, pp. 683–697. Springer, Heidelberg (2004)
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity flooding: A versatile graph matching algorithm and its application to schema matching. In: Proceedings of the 18th ICDE, pp. 117–128 (2002)
Bozovic, N., Vassalos, V.: Two-phase schema matching in real world relational databases. In: Proceedings of the ICDE Workshops, pp. 290–296 (2008)
Berlin, J., Motro, A.: Database schema matching using machine learning with feature selection. In: Pidduck, A.B., Mylopoulos, J., Woo, C.C., Ozsu, M.T. (eds.) CAiSE 2002. LNCS, vol. 2348, pp. 452–466. Springer, Heidelberg (2002)
Doan, A., Madhavan, J., Dhamankar, R., Domingos, P., Halevy, A.: Learning to match ontologies on the semantic web. The VLDB Journal 12(4), 303–319 (2003)
Walther, M., Hähne, L., Schuster, D., Schill, A.: Locating and extracting product specifications from producer websites. In: Proceedings of the 12th ICEIS, INSTICC (2010)
Walther, M., Schuster, D., Juchheim, T., Schill, A.: Category-based ranking of federated product offers. In: Proceedings of the 8th WWW/Internet. IADIS Press (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Walther, M., Jäckel, N., Schuster, D., Schill, A. (2011). Enabling Product Comparisons on Unstructured Information Using Ontology Matching. In: Mugellini, E., Szczepaniak, P.S., Pettenati, M.C., Sokhn, M. (eds) Advances in Intelligent Web Mastering – 3. Advances in Intelligent and Soft Computing, vol 86. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18029-3_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-18029-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-18028-6
Online ISBN: 978-3-642-18029-3
eBook Packages: EngineeringEngineering (R0)