Abstract
Person disambiguation monitors web appearances of a person by disambiguating information belonging to different people who share the same name. In this paper, we extend person disambiguation to incorporate the abstract notion of identity. This extension utilises semantic web technologies to represent both the identity of the person to be found and the web resources to be disambiguated as semantic graphs. Our approach extracts a complete semantic social graph from distributed Web 2.0 services. Web resources containing possible person references are converted into semantic graphs describing the available identity features. We disambiguate these web resources to identify correct identity references by performing random walks through the graph space, measuring the distances between the social graph and the web resource graphs, and clustering similar web resources. We present a new distance measure called “Optimum Transitions” and evaluate the accuracy of our approach using the F-measure from information retrieval.
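As an illustration of the evaluation measure named above, the following is a minimal sketch of the standard F-measure, computed over the set of web resources that a system labels as referring to the target person. The abstract does not describe the evaluation setup, so the function name `f_measure`, the resource identifiers, and the choice of the balanced variant (beta = 1) are assumptions made here for illustration, not the paper's implementation.

```python
def f_measure(predicted, relevant, beta=1.0):
    """Return the F-measure of a predicted set against a gold-standard set.

    predicted: resources the system labelled as referring to the person
    relevant:  resources that truly refer to the person
    beta:      weight of recall relative to precision (1.0 gives balanced F1)
    """
    predicted, relevant = set(predicted), set(relevant)
    true_positives = len(predicted & relevant)
    if true_positives == 0:
        # No correct predictions: precision and recall are both zero.
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(relevant)
    beta_sq = beta * beta
    return (1 + beta_sq) * precision * recall / (beta_sq * precision + recall)


# Hypothetical example: three resources predicted, four truly relevant.
predicted_resources = {"page_a", "page_b", "page_c"}
relevant_resources = {"page_a", "page_b", "page_d", "page_e"}
score = f_measure(predicted_resources, relevant_resources)
print(round(score, 4))
```

In this hypothetical example, two of the three predicted resources are correct (precision 2/3) and two of the four relevant resources are found (recall 1/2), so the balanced score is their harmonic mean, 4/7.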
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rowe, M. (2009). Applying Semantic Social Graphs to Disambiguate Identity References. In: Aroyo, L., et al. The Semantic Web: Research and Applications. ESWC 2009. Lecture Notes in Computer Science, vol 5554. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02121-3_35
DOI: https://doi.org/10.1007/978-3-642-02121-3_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02120-6
Online ISBN: 978-3-642-02121-3
eBook Packages: Computer Science, Computer Science (R0)