Skip to main content

SudocAD: A Knowledge-Based System for the Author Linkage Problem

  • Conference paper
Knowledge and Systems Engineering

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 244))

Abstract

SudocAD is a system concerning the author linkage problem in a bibliographic database context. Having a bibliographic database \(\mathcal E\) and a (new) bibliographic notice d, r being an identifier of an author in \(\mathcal E\) and r′ being an identifier of an author in d: is that r and r′ refer to the same author ? The system, which is a prototype, has been evaluated in a real situation. Compared to results given by expert librarians, the results of SudocAD are interesting enough to plan a transformation of the prototype into a production system. SudocAD is based on a method combining numerical and knowledge based techniques. This method is abstractly defined and even though SudocAD is devoted to the author linkage problem the method could be adapted for other kinds of linkage problems especially in the semantic web context.

This work benefited from the support of ANR, the French Research National Agency (ANR-12-CORD-0012).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. ABES, http://www.abes.fr/

  2. Arasu, A., Christopher, R., Suciu, D.: Large-scale deduplication with constraints using dedupalog. In: Proceedings of the 25th International Conference on Data Engineering (ICDE), pp. 952–963 (2009)

    Google Scholar 

  3. Benjelloun, O., Garcia-Molina, H., Menestrina, D., Su, Q., Euijong Whang, S., Widom, J.: Swoosh: a generic approach to entity resolution. The VLDB Journal 18(9-10), 255–276 (2009)

    Article  Google Scholar 

  4. Baget, J.-F., Leclère, M., Mugnier, M.-L., Salvat, E.: On rules with existential variables: Walking the decidability line. Artif. Intell. 175(9-10), 1620–1654 (2011)

    Article  MATH  Google Scholar 

  5. The CIDOC CRM, http://www.cidoc-crm.org/

  6. Chein, M., Mugnier, M.-L.: Graph-based Knowledge Representation. Springer, London (2009)

    MATH  Google Scholar 

  7. COGUI, http://www.lirmm.fr/cogui/

  8. de Carvalho, M.G., Laender, A.H.F., Goncalves, M.A., da Silva, A.S.: Genetic programming approach to record deduplication. IEEE Transactions on Knowledge and Data Engineering 24(3), 399–412 (2012)

    Article  Google Scholar 

  9. Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: A survey. Transactions on Knowledge and Data Engineering, p. 2007 (2007)

    Google Scholar 

  10. COGUI examples, http://www.lirmm.fr/cogui/examples.php#sudocad

  11. Functional Requirements for Bibliographic Records, http://www.ifla.org/publications/functional-requirements-for-bibliographic-records

  12. Object Formulation of FRBR, http://www.cidoc-crm.org/frbr_inro.html

  13. Fellegi, I.P., Sunter, A.B.: A theory for record linkage. Journal of the American Statistical Association (1969)

    Google Scholar 

  14. Fatiha Sais, F., Pernelle, N., Rousset, M.-C.: Combining a logical and a numerical method for reference reconciliation. Journal of Data Semantics, 66–94 (2009)

    Google Scholar 

  15. Gu, L., Baxter, R., Vickers, D., Rainsford, C.: Record linkage: current practice and future directions. Technical Report 03/83, CSIRO Mathematical and Information Sciences (2003)

    Google Scholar 

  16. Gomatam, S.: An empirical comparison of record linkage procedures. Statist. Med. 21(1), 1485–1496 (2002)

    Article  MathSciNet  Google Scholar 

  17. Hernández, M.A., Stolfo, S.J.: Real-world data is dirty: data cleansing and the merge/purge problem. Data Min. Knowl. Discov. 20(2(1)), 9–37 (1998)

    Google Scholar 

  18. IdRef:authority files of the Sudoc database, http://en.abes.fr/Other-services/IdRef

  19. Suchanek, F.M., Abiteboul, S., Senellart, P.: Paris: Probabilistic alignment of relations, instances, and schema. Proceedings of the VLDB Endowment 5(3), 157–168 (2012)

    Google Scholar 

  20. Newcombe, H.B., Kennedy, J.M., Axford, S.J., James, A.P.: Automatic linkage of vital records. Science (1959)

    Google Scholar 

  21. Neil, N.R., Smalheiser, R., Torvik, V.I.: Author name disambiguation. Annual Review of Information Science and Technology (ARIST) 43 (2009)

    Google Scholar 

  22. PerseeD, http://www.persee.fr/web/guest/home

  23. RDA: Resource Description and Access, http://www.rda-jsc.org/rda.html

  24. Singla, P., Domingos, P.: Object identification with attribute-mediated dependences. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 297–308. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  25. Shvaiko, P., Euzenat, J.: Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng. 25(1), 158–176 (2013)

    Article  Google Scholar 

  26. Sais, F., Pernelle, N., Rousset, M.-C.: L2r: a logical method for reference reconciliation. In: Proc. of AAAI 2007, pp. 329–334 (2007)

    Google Scholar 

  27. SUDOC, http://www.abes.fr/Sudoc/Sudoc-public

  28. sudocAD, http://www.abes.fr/Sudoc/Projets-en-cours/SudocAD

  29. Winkler, W.E.: Overview of record linkage and current research directions. Technical report, U.S. Census Bureau (2006)

    Google Scholar 

  30. Winkler, W.E.: Record linkage references. Technical report, U.S. Census Bureau (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Chein, M., Leclère, M., Nicolas, Y. (2014). SudocAD: A Knowledge-Based System for the Author Linkage Problem. In: Huynh, V., Denoeux, T., Tran, D., Le, A., Pham, S. (eds) Knowledge and Systems Engineering. Advances in Intelligent Systems and Computing, vol 244. Springer, Cham. https://doi.org/10.1007/978-3-319-02741-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-02741-8_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-02740-1

  • Online ISBN: 978-3-319-02741-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics