Skip to main content
Log in

Interactive context-aware user-driven metadata correction in digital libraries

  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

Personal name variants are a common problem in digital libraries, reducing the precision of searches and complicating browsing-based interaction. The book-centric approach of name authority control has not scaled to match the growth and diversity of digital repositories. In this paper, we present a novel system for user-driven integration of name variants when interacting with web-based information—in particular digital library—systems. We approach these issues via a client-side JavaScript browser extension that can reorganize web content and also integrate remote data sources. Designed to be agnostic towards the web sites it is applied to, we illustrate the developed proof-of-concept system through worked examples using three different digital libraries. We discuss the extensibility of the approach in the context of other user-driven information systems and the growth of the Semantic Web.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Adrian, B., Hees, J., Herman, I., Sintek, M., Dengel, A.: Epiphany: adaptable RDFa generation linking the web of documents to the web of data. In: Cimiano, A., Pinto, H. (eds.) Knowledge Engineering and Management by the Masses, pp. 178–192. Springer, Berlin (2010)

    Chapter  Google Scholar 

  2. Bainbridge D., Novak B.J.: Seamless web editing for curated content. In: Proceedings of the 14th European Conference on Research and Advanced Technology for Digital Libraries (ECDL’10). pp. 168–175. Springer, Berlin (2010)

  3. Bainbridge, D., Twidale, M.B., Nichols, D.M.: That’s ‘é’ not ‘þ’ or ‘\(\square \)’ a user-driven context-aware approach to erroneous metadata in digital libraries. In: Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital libraries (JCDL’11), pp. 39–48. ACM, New York (2011)

  4. Beall, J.: Metadata for name disambiguation and collocation. Future Internet 2(1), 1–15 (2010)

    Article  Google Scholar 

  5. Bennett, R., Hengel-Dittrich, C., O’Neill, E., Tillett B.: VIAF (Virtual International Athority File): Linking Die Deutsche Bibliothek and Library of Congress name authority files. In: World Library and Information Congress: 72nd IFLA General Conference and Council. http://www.ifla.org/IV/ifla72/papers/123-Bennett-en.pdf (2006)

  6. Bizer, C., Heath, T., Berners-Lee, T.: Linked data—the story so far. Int. J. Semant. Web Inf. Syst. 4(2), 1–22 (2009)

    Google Scholar 

  7. Burrows, T.: Identity parade: building web portals about people. OCLC Syst. Serv Int. Digit. Libr. Perspect. 23(4), 329–331 (2003)

    Google Scholar 

  8. de Laat, P.B.: How can contributors to open-source communities be trusted? On the assumption, inference, and substitution of trust. Ethics Inf. Technol. 12(4), 327–341 (2010)

    Article  Google Scholar 

  9. Feitelson, D.: On identifying name equivalences in digital libraries. Inf. Res. 9(4), paper 192. http://InformationR.net/ir/9-4/paper192.html (2004)

  10. Fenner, M.: ORCID: unique identifiers for authors and contributors. Inf. Stand. Q. 23(3), 10–13 (2011)

    MathSciNet  Google Scholar 

  11. Hill, A.: What’s in a name? Prototyping a name authority service for UK repositories. In: Culture and Identity in Knowledge Organization: Proceedings of the Tenth International ISKO Conference (ISKO 2008), pp. 196–202. Ergon, Würzburg (2008)

  12. Kaiser, M., Lieder, H.-J., Majcen, K., Vallant, H.: New ways of sharing and using authority information: the LEAF project. D-Lib Mag. 9(11). http://www.dlib.org/dlib/november03/lieder/11lieder.html (2003)

  13. Laender, A.H., Gonçalves, M.A., Cota, R.G., Ferreira, A.A., Santos, R.L., Silva, A.J. : Keeping a digital library clean: new solutions to old problems. In: Proceedings of the Eighth ACM Symposium on Document Engineering (DocEng’08), pp. 257–262. ACM, New York (2008)

  14. Lagoze, C., Krafft, D., Cornwell, T., Dushay, N., Eckstrom, D., Saylor, J.: Metadata aggregation and “automated digital libraries”: a retrospective on the NSDL experience. In: Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital libraries (JCDL’06). pp. 230–239. ACM, New York (2006)

  15. Li, Y., Wen, A., Lin, Q., Li, R., Lu, Z.: Incorporating user feedback into name disambiguation of scientific cooperation network. In: Wang, H., Li, S., Oyama, S., Hu, X., Qian, T. (eds.) Web-Age Information Management, pp. 454–466. Springer, Berlin (2011)

    Chapter  Google Scholar 

  16. McKay, D., Sanchez, S., Parker, R.: What’s my name again? Sociotechnical considerations for author name management in research databases. In: Proceedings of the 22nd Annual Conference of the Australian Computer–Human Interaction Special Interest Group (OZCHI 2010), pp. 240–247. CHISIG, Australia (2010)

  17. Pereira, D.A., Ribeiro-Neto, B., Ziviani, N., Laender, A.H., Gonçalves, M.A., Ferreira, A.A.: Using web information for author name disambiguation. In: Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL’09), pp. 49–58. ACM, New York (2009)

  18. Pilgrim, M.: Greasemonkey Hacks: Tips& Tools for Remixing the Web with Firefox. O’Reilly Media, Inc., Cambridge (2005)

  19. Qiu, J.: Scientific publishing: identity crisis. Nature 451(7180), 766–767 (2008)

    Article  Google Scholar 

  20. Rotenberg, E., Kushmerick, A.: The author challenge: identification of self in the scholarly literature. Cataloging Classif. Q. 49(6), 503–520 (2011)

    Article  Google Scholar 

  21. Salo, D.: Name authority control in institutional repositories. Cataloging Classif. Q. 47(3), 249–261 (2009)

    Article  Google Scholar 

  22. Smalheiser, N., Torvik, V.: Author name disambiguation. Annual Review of Information Science and Technology, vol. 43, pp. 287–313. Information Today, Medford (2009)

  23. Tillett, B.: Authority control: state of the art and new perspectives. Cataloging Classif. Q. 38(3), 23–41 (2004)

    Article  Google Scholar 

  24. Torvik, V.I., Smalheiser, N.R.: Author name disambiguation in MEDLINE. ACM Trans. Knowl. Discov. Data 3(3), Article 11 (2009)

    Google Scholar 

  25. van Raan, A.: Fatal attraction: conceptual and methodological problems in the ranking of universities by bibliometric methods. Scientometrics 62(1), 133–143 (2005)

    Article  Google Scholar 

  26. Xia, J.: Personal name identification in the practice of digital repositories. Progr. Electron. Libr. Inf. Syst. 40(3), 256–267 (2006)

    Google Scholar 

  27. Yang, Y., Singh, P., Yao, J., AuYeung, C.M., AuYeung, A., Wang, X., Cai, Z., Salvadores, M., Gibbins, N., Hall, W., Shadbolt, N.: Distributed human computation framework for linked data co-reference resolution. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) The Semantic Web: Research and Applications, pp. 32–46. Springer, Berlin (2011)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Bainbridge.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bainbridge, D., Twidale, M.B. & Nichols, D.M. Interactive context-aware user-driven metadata correction in digital libraries . Int J Digit Libr 13, 17–32 (2012). https://doi.org/10.1007/s00799-012-0100-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-012-0100-5

Keywords

Navigation