Abstract
Valuable user-generated information about locations (points of interest, POIs) is stored in various online social media platforms. Merging the data associated with one POI is hard because the platforms lack common identifiers. In addition, user-generated data is commonly faulty or contradictory. Here we present an approach matching POIs from Qype and Facebook Places to their counterparts in OpenStreetMap. The algorithm uses different similarity measures taking the geographic distance of POIs into account as well as the string similarity of selected metadata fields, showing good results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cohen, W.W.: Data integration using similarity joins and a word-based information representation language. ACM Trans. Inf. Syst. 18, 288–321 (2000)
Dozier, C., Molina-Salgado, H., Thomas, M., Veeramachaneni, S.: Concord - a tool that automates the construction of record resolution systems. In: Proceedings of the Entity 2010 Workshop at LREC 2010, Valetta, Malta (2010)
Elmagarmid, A., Ipeirotis, P., Verykios, V.: Duplicate record detection: A survey. IEEE Transactions on Knowledge and Data Engineering 19(1), 1–16 (2007)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval, online edn. Cambridge University Press (April 2009)
Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation 28(1), 11–21 (1972)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Scheffler, T., Schirru, R., Lehmann, P. (2012). Matching Points of Interest from Different Social Networking Sites. In: Glimm, B., Krüger, A. (eds) KI 2012: Advances in Artificial Intelligence. KI 2012. Lecture Notes in Computer Science(), vol 7526. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33347-7_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-33347-7_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33346-0
Online ISBN: 978-3-642-33347-7
eBook Packages: Computer ScienceComputer Science (R0)