Advertisement

Applying Semantic Social Graphs to Disambiguate Identity References

  • Matthew Rowe
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5554)

Abstract

Person disambiguation monitors web appearances of a person by disambiguating information belonging to different people sharing the same name. In this paper we extend person disambiguation to incorporate the abstract notion of identity. This extension utilises semantic web technologies to represent the identity of the person to be found and the web resources to be disambiguated as semantic graphs. Our approach extracts a complete semantic social graph from distributed Web 2.0 services. Web resources containing possible person references are converted into semantic graphs describing available identity features. We disambiguate these web resources to identify correct identity references by performing random walks through the graph space, measuring the distances between the social graph and web resource graphs, and clustering similar web resources. We present a new distance measure called “Optimum Transitions” and evaluate the accuracy of our approach using the information retrieval measure f-measure.

Keywords

Social Networking Site Resource Description Framework Identity Node Social Graph Identity Reference 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Adida, B., Birbeck, M.: RDFa Primer: Bridging the Human and Data Webs. World Wide Web Consortium (2008), http://www.w3.org/TR/xhtml-rdfa-primer/
  2. 2.
    Akhtar, W., Kopecky, J., Krennwallner, T., Polleres, A.: XSPARQL: Travelling between the XML and RDF Worlds – and Avoiding the XSLT Pilgrimage. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 432–447. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  3. 3.
    Andrejevic, M.: The Discipline of Watching: Detection, Risk, and Lateral Surveillance. Critical Studies in Media Communication 23(5), 392–407 (2006)CrossRefGoogle Scholar
  4. 4.
    Bekkerman, R., McCallum, A.: Disambiguating Web Appearances of People in a Social Network. In: Proc. 14th international conference on World Wide Web, Chiba, Japan, pp. 463–470 (2005)Google Scholar
  5. 5.
    Breslin, J., Harth, A., Bojars, U., Decker, S.: Towards Semantically Interlinked Online Communities. In: Gómez-Pérez, A., Euzenat, J. (eds.) ESWC 2005. LNCS, vol. 3532, pp. 500–514. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  6. 6.
    Brickley, D., Miller, L.: FOAF Vocabulary Specification (OpenID Edition). Creative Commons (2007), http://xmlns.com/foaf/spec/
  7. 7.
    Brickley, D.: Basic Geo Vocabulary. World Wide Web Consortium (2006), http://www.w3.org/2003/01/geo
  8. 8.
    Chapman, S., Norton, B., Ciravegna, F.: Armadillo: Integrating Knowledge for the Semantic Web. In: Proc. Dagstuhl Seminar in Machine Learning for the Semantic Web, Dagstuhl, Germany (2005)Google Scholar
  9. 9.
    Clark, J.: XSL Transformations. World Wide Web Consortium (1999), http://www.w3.org/TR/xslt
  10. 10.
    Connolly, D.: Gleaning Resource Descriptions from Dialects of Language. World Wide Web Consortium (2007), http://www.w3.org/TR/grddl/
  11. 11.
    Finin, T., Ding, L., Zhou, L., Joshi, A.: Social Networking on the Semantic Web. The Learning Organisation 1(5), 418–435 (2005)CrossRefGoogle Scholar
  12. 12.
    Fleischman, M., Hovy, E.: Multi-Document Person Name Resolution. In: Proc. Workshop on Reference Resolution and its Applications: ACL 2004, Barcelona (2004)Google Scholar
  13. 13.
    Iria, J., Xia, L., Zhang, Z.: WIT: Web People Search Disambiguation using Random Walks. In: Proc. of the 4th International Workshop on Semantic Evaluations (Semeval 2007), Prague, Czech Republic (2007)Google Scholar
  14. 14.
    Kalashnikov, D., Chen, Z., Mehrotra, S., Nuray, R.: Web People Search via Connection Analysis. IEEE Transactions on Knowledge and Data Engineering 20(11), 1550–1565 (2008)CrossRefGoogle Scholar
  15. 15.
    Khare, K.: Microformats: the next (small) thing on the Semantic Web? IEEE Internet Computing 10, 68–75 (2006)CrossRefGoogle Scholar
  16. 16.
    Malin, B.: Unsupervised Name Disambiguation via Social Network Similarity. In: Proc. Workshop on Link Analysis, Counterterrorism, and Security, SIAM International Conference on Data Mining, Newport Beach, CA (2005)Google Scholar
  17. 17.
    McBride, B.: Jena: a Semantic Web toolkit. IEEE Internet Computing 6, 55–59 (2002)CrossRefGoogle Scholar
  18. 18.
    Meyn, S., Tweedie, R.: Markov chains and stochastic stability. Springer, London (1993)CrossRefzbMATHGoogle Scholar
  19. 19.
    Mika, P.: Bootstrapping the FOAF-Web: An Experiment in Social Network Mining. In: Proc. Workshop on Friend of a Friend, Social Networking and the Semantic Web, Galway, Ireland (2004)Google Scholar
  20. 20.
    Nottingham, M.: Atom Syndication Specification. The Internet Society (2005), http://www.atomenabled.org/developers/syndication/atom-format-spec.php
  21. 21.
    Passant, A.: RDF Export of Flickr Profiles with FOAF and SIOC (2007), http://apassant.net/blog/2007/12/18/
  22. 22.
    RDF in HTML. Talis (2006), http://research.talis.com/2005/erdf/
  23. 23.
    Rowe, M., Ciravegna, F.: Getting to Me - Exporting Semantic Social Network Information from Facebook. In: Proc. Social Data on the Web Workshop, ISWC 2008, Karlsruhe, Germany (2008)Google Scholar
  24. 24.
    Rowe, M.: Interlinking distributed Social Graphs. In: Proc. Linked Data on the Web Workshop, WWW 2009, Madrid Spain (2009)Google Scholar
  25. 25.
    Saerens, M., Fouss, F., Yen, L., Dupont, P.: The principal components analysis of a graph, and its relationships to spectral clustering. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS, vol. 3201, pp. 371–383. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  26. 26.
    Tarjan, R.: Depth-first search and linear graph algorithms. SIAM Journal on Computing 1(2), 146–160 (1972)MathSciNetCrossRefzbMATHGoogle Scholar
  27. 27.
    Wan, X., Gao, J., Li, M., Ding, B.: Person resolution in Person Search Results: WebHawk. In: Proc. of the 14th ACM international conference on Information and knowledge management, pp. 163–170 (2005)Google Scholar
  28. 28.
    Winer, D.: RSS 2.0 Specification. Creative Commons (2007), http://www.rssboard.org/rss-specification

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Matthew Rowe
    • 1
  1. 1.OAK Group, Department of Computer ScienceUniversity of SheffieldSheffieldUnited Kingdom

Personalised recommendations