EPEMS: An Entity Matching System for E-Commerce Products
Entity matching identifies records that represent the same real-world entity. As e-commerce develops rapidly, online products are growing explosively in both number and variety. Applying entity matching to e-commerce data to find records that represent the same product makes it convenient for customers to compare prices. This paper proposes an entity matching system for e-commerce data, called EPEMS. Compared with existing systems, EPEMS improves on an existing sorted neighborhood blocking method, which is used to reduce the number of record comparisons, and it uses the similarity of product pictures to improve matching results.
Keywords: Entity Matching, E-commerce Data, Blocking, Picture Similarity
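To illustrate why blocking reduces the number of comparisons, the following is a minimal sketch of the basic fixed-window sorted neighborhood method, not the improved variant that EPEMS proposes (the abstract does not describe it). Records are sorted by a key and each record is compared only with the next few records in that order. The function name, the `title` field, and the window size are assumptions chosen for illustration.

```python
from typing import Callable, Iterator, Sequence, Tuple


def sorted_neighborhood_pairs(
    records: Sequence[dict],
    key: Callable[[dict], str],
    window: int = 3,
) -> Iterator[Tuple[int, int]]:
    """Yield candidate index pairs using a fixed-size sliding window.

    Hypothetical helper: records are sorted by `key`, and each record is
    paired only with the (window - 1) records that follow it in sorted order.
    """
    if window < 2:
        raise ValueError("window must be at least 2")
    # Sort record indices by their blocking key.
    order = sorted(range(len(records)), key=lambda i: key(records[i]))
    for pos, i in enumerate(order):
        # Pair the current record with its successors inside the window.
        for j in order[pos + 1 : pos + window]:
            yield (min(i, j), max(i, j))


# Made-up product records, for illustration only.
products = [
    {"title": "apple iphone 5 16gb black"},
    {"title": "samsung galaxy s3 blue"},
    {"title": "apple iphone 5 black 16gb"},
    {"title": "nokia lumia 920 red"},
    {"title": "samsung galaxy s3 16gb blue"},
]

candidates = list(
    sorted_neighborhood_pairs(products, key=lambda r: r["title"], window=2)
)
```

With n records and window size w, this produces at most n(w - 1) candidate pairs instead of the n(n - 1)/2 pairs of an exhaustive comparison. Only the candidate pairs would then be passed to a more expensive similarity measure, such as a comparison of titles or product pictures.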