Pattern Analysis & Applications

, Volume 6, Issue 2, pp 122–133 | Cite as

Faster algorithm of string comparison

  • Q. X. Yang
  • S. S. Yuan
  • L. Zhao
  • L. Chun
  • S. Peng
Original Paper

Abstract

In many applications, it is necessary to determine the field similarity. Our paper introduces a package of substring-based new algorithms to determine Field Similarity. Combined together, our new algorithms not only achieves higher accuracy, but also gains the time complexity O(knm) (k<0.75) for the worst case, O( β*n) where β<6 for the average case and O(1) for the best case. Throughout the paper, we use the approach of comparative examples to show the higher accuracy of our algorithms compared to that proposed in Lee et al. [1]. Theoretical analysis, concrete examples and experimental results show that our algorithms can significantly improve the accuracy and time complexity of the calculation of field similarity.

Data cleaning Data mining Field similarity Pattern recognition Record similarity String similarity 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag London Limited 2003

Authors and Affiliations

  • Q. X. Yang
    • 1
  • S. S. Yuan
    • 1
  • L. Zhao
    • 1
  • L. Chun
    • 1
  • S. Peng
    • 1
  1. 1.Institute of High Performance Computing#01-01 The Capricorn Singapore Science Park IISingapore

Personalised recommendations