Advertisement

EPEMS: An Entity Matching System for E-Commerce Products

  • Lei Gao
  • Pengpeng Zhao
  • Victor S. Sheng
  • Zhixu Li
  • An Liu
  • Jian Wu
  • Zhiming Cui
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9313)

Abstract

Entity Matching is used to identify records representing the same entities in the real world. As e-commerce is developing rapidly, online products grow explosively in both amount and variety. Applying entity matching to e-commerce data and finding records representing the same products make customers convenient to compare prices. This paper proposes an entity matching system for e-commerce data, called EPEMS. Compared with existing systems, we improve an existing sorted neighborhood blocking method, which is used to reduce the number of comparisons. At the same time the similarity of product pictures is used to improve matching results.

Keywords

Entity Matching E-commerce Data Blocking Picture Similarity 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Christen, P.: A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering 24(9), 1537–1555 (2012)CrossRefGoogle Scholar
  2. 2.
    Hernández, M.A., Stolfo, S.J.: The merge/purge problem for large databases. ACM SIGMOD Record 24, 127–138 (1995)CrossRefGoogle Scholar
  3. 3.
    Warshall, S.: A theorem on boolean matrices. Journal of the ACM (JACM) 9(1), 11–12 (1962)MathSciNetCrossRefzbMATHGoogle Scholar
  4. 4.
    Draisbach, U., Naumann, F., Szott, S., Wonneberg, O.: Adaptive windows for duplicate detection. In: 2012 IEEE 28th International Conference on Data Engineering (ICDE), pp. 1073–1083. IEEE (2012)Google Scholar
  5. 5.
    Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2161–2168. IEEE (2006)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Lei Gao
    • 1
  • Pengpeng Zhao
    • 1
  • Victor S. Sheng
    • 2
  • Zhixu Li
    • 1
  • An Liu
    • 1
  • Jian Wu
    • 1
  • Zhiming Cui
    • 1
  1. 1.School of Computer Science and TechnologySoochow UniversitySuzhouP.R. China
  2. 2.Computer Science DepartmentUniversity of Central ArkansasConwayUSA

Personalised recommendations