Mining Information for Instance Unification

  • Niraj Aswani
  • Kalina Bontcheva
  • Hamish Cunningham
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4273)


Instance unification determines whether two instances in an ontology refer to the same object in the real world. More specifically, this paper addresses the instance unification problem for person names. The approach combines the use of citation information (i.e., abstract, initials, titles and co-authorship information) with web mining, in order to gather additional evidence for the instance unification algorithm. The method is evaluated on two datasets – one from the BT digital library and one used in previous work on name disambiguation. The results show that the information mined from the web contributes substantially towards the successful handling of highly ambiguous cases which lowered the performance of previous methods.


Knowledge Management Digital Library Paper Title Author Citation Mining Information 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Alani, H., Dasmahapatra, S., Gibbins, N., Glaser, H., Harris, S., Kalfoglou, Y., O’Hara, K., Shadbolt, N.: Managing Reference: Ensuring Referential Integrity of Ontologies for the Semantic Web. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS, vol. 2473, pp. 317–334. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  2. 2.
    Bruijn, J., Polleres, A.: Towards An Ontology Mapping Specification Language For the Semantic Web. Technical report, Digital Enterprise Research Institute (2004)Google Scholar
  3. 3.
    Ciravegna, F., Chapman, S., Dingli, A., Wilks, Y.: Learning to Harvest Information for the Semantic Web. In: Bussler, C.J., Davies, J., Fensel, D., Studer, R. (eds.) ESWS 2004. LNCS, vol. 3053, pp. 312–326. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  4. 4.
    Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL 2002) (2002)Google Scholar
  5. 5.
    Feitelson, D.G.: On identifying name equivalences in digital libraries. Information Research 9(4) (2004)Google Scholar
  6. 6.
    Guha, R.V., Garg, A.: Disambiguating People in Search. In: Proceedings of the 13th World Wide Web Conference (WWW 2004). ACM Press, New York (2004)Google Scholar
  7. 7.
    Han, H., Giles, C.L., Zha, H.: A model-based k-means algorithm for name disambiguation. In: Proceedings of the 2nd International Semantic Web Technologies for Searching and Retrieving Scientific Data, Florida, USA (2003)Google Scholar
  8. 8.
    Han, H., Giles, L., Zha, H., Li, C., Tsioutsiouliklis, K.: Two supervised learning approaches for name disambiguation in author citations. In: Proceedings of the 4th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2004) (2004)Google Scholar
  9. 9.
    Mann, G.S., Yarowsky, D.: Unsupervised personal name disambiguation. In: Daelemans, W., Osborne, M. (eds.) Proceedings of the 7th Conference on Natural Language Learning (CoNLL 2003), Edmonton, Canada, May 2003, pp. 33–40 (2003)Google Scholar
  10. 10.
    Wellner, B., McCallum, A., Peng, F., Hay, M.: An integrated, conditional model of information extraction and coreference with application to citation matching. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, Banff, Canada, pp. 593–601 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Niraj Aswani
    • 1
  • Kalina Bontcheva
    • 1
  • Hamish Cunningham
    • 1
  1. 1.Department of Computer ScienceUniversity of Sheffield, Regent CourtSheffieldUK

Personalised recommendations