Abstract
SudocAD is a system concerning the author linkage problem in a bibliographic database context. Having a bibliographic database \(\mathcal E\) and a (new) bibliographic notice d, r being an identifier of an author in \(\mathcal E\) and r′ being an identifier of an author in d: is that r and r′ refer to the same author ? The system, which is a prototype, has been evaluated in a real situation. Compared to results given by expert librarians, the results of SudocAD are interesting enough to plan a transformation of the prototype into a production system. SudocAD is based on a method combining numerical and knowledge based techniques. This method is abstractly defined and even though SudocAD is devoted to the author linkage problem the method could be adapted for other kinds of linkage problems especially in the semantic web context.
This work benefited from the support of ANR, the French Research National Agency (ANR-12-CORD-0012).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ABES, http://www.abes.fr/
Arasu, A., Christopher, R., Suciu, D.: Large-scale deduplication with constraints using dedupalog. In: Proceedings of the 25th International Conference on Data Engineering (ICDE), pp. 952–963 (2009)
Benjelloun, O., Garcia-Molina, H., Menestrina, D., Su, Q., Euijong Whang, S., Widom, J.: Swoosh: a generic approach to entity resolution. The VLDB Journal 18(9-10), 255–276 (2009)
Baget, J.-F., Leclère, M., Mugnier, M.-L., Salvat, E.: On rules with existential variables: Walking the decidability line. Artif. Intell. 175(9-10), 1620–1654 (2011)
The CIDOC CRM, http://www.cidoc-crm.org/
Chein, M., Mugnier, M.-L.: Graph-based Knowledge Representation. Springer, London (2009)
COGUI, http://www.lirmm.fr/cogui/
de Carvalho, M.G., Laender, A.H.F., Goncalves, M.A., da Silva, A.S.: Genetic programming approach to record deduplication. IEEE Transactions on Knowledge and Data Engineering 24(3), 399–412 (2012)
Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: A survey. Transactions on Knowledge and Data Engineering, p. 2007 (2007)
COGUI examples, http://www.lirmm.fr/cogui/examples.php#sudocad
Functional Requirements for Bibliographic Records, http://www.ifla.org/publications/functional-requirements-for-bibliographic-records
Object Formulation of FRBR, http://www.cidoc-crm.org/frbr_inro.html
Fellegi, I.P., Sunter, A.B.: A theory for record linkage. Journal of the American Statistical Association (1969)
Fatiha Sais, F., Pernelle, N., Rousset, M.-C.: Combining a logical and a numerical method for reference reconciliation. Journal of Data Semantics, 66–94 (2009)
Gu, L., Baxter, R., Vickers, D., Rainsford, C.: Record linkage: current practice and future directions. Technical Report 03/83, CSIRO Mathematical and Information Sciences (2003)
Gomatam, S.: An empirical comparison of record linkage procedures. Statist. Med. 21(1), 1485–1496 (2002)
Hernández, M.A., Stolfo, S.J.: Real-world data is dirty: data cleansing and the merge/purge problem. Data Min. Knowl. Discov. 20(2(1)), 9–37 (1998)
IdRef:authority files of the Sudoc database, http://en.abes.fr/Other-services/IdRef
Suchanek, F.M., Abiteboul, S., Senellart, P.: Paris: Probabilistic alignment of relations, instances, and schema. Proceedings of the VLDB Endowment 5(3), 157–168 (2012)
Newcombe, H.B., Kennedy, J.M., Axford, S.J., James, A.P.: Automatic linkage of vital records. Science (1959)
Neil, N.R., Smalheiser, R., Torvik, V.I.: Author name disambiguation. Annual Review of Information Science and Technology (ARIST) 43 (2009)
PerseeD, http://www.persee.fr/web/guest/home
RDA: Resource Description and Access, http://www.rda-jsc.org/rda.html
Singla, P., Domingos, P.: Object identification with attribute-mediated dependences. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 297–308. Springer, Heidelberg (2005)
Shvaiko, P., Euzenat, J.: Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng. 25(1), 158–176 (2013)
Sais, F., Pernelle, N., Rousset, M.-C.: L2r: a logical method for reference reconciliation. In: Proc. of AAAI 2007, pp. 329–334 (2007)
Winkler, W.E.: Overview of record linkage and current research directions. Technical report, U.S. Census Bureau (2006)
Winkler, W.E.: Record linkage references. Technical report, U.S. Census Bureau (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Chein, M., Leclère, M., Nicolas, Y. (2014). SudocAD: A Knowledge-Based System for the Author Linkage Problem. In: Huynh, V., Denoeux, T., Tran, D., Le, A., Pham, S. (eds) Knowledge and Systems Engineering. Advances in Intelligent Systems and Computing, vol 244. Springer, Cham. https://doi.org/10.1007/978-3-319-02741-8_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-02741-8_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-02740-1
Online ISBN: 978-3-319-02741-8
eBook Packages: EngineeringEngineering (R0)