When patterns are represented as histograms, the earth mover's distance (EMD) is regarded as an excellent metric between two distributions. EMD is formulated as a transportation problem, which is a computationally expensive optimization problem. In similarity-based pattern retrieval, computing the EMD between a query histogram and every histogram in the database takes too long for users to wait for the output. Hence, a candidate selection technique is presented to speed up EMD-based retrieval of multivariate ordinal-type histograms. It is guaranteed to find all similar histograms while achieving a significant speedup. Theoretical relationships among other metrics for multivariate histograms, and their uses, are presented as well.
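The general idea of candidate selection can be illustrated with a minimal filter-and-refine sketch. This is not the paper's algorithm: it assumes the simplest setting of univariate ordinal histograms with equal total mass and ground distance |i - j|, where the unnormalized EMD equals the city block distance between cumulative histograms. As a cheap filter it uses the centroid-based lower bound, a standard bound swapped in here for illustration. The function names `emd_1d`, `centroid_lower_bound`, and `range_query` are hypothetical.

```python
from itertools import accumulate


def emd_1d(a, b):
    """Unnormalized EMD (total transport cost) between two 1-D ordinal
    histograms with integer counts, equal total mass, and ground
    distance |i - j|. Assumption: univariate case only."""
    if len(a) != len(b) or sum(a) != sum(b):
        raise ValueError("histograms must have equal length and equal mass")
    # In 1-D the optimal cost is the L1 distance between cumulative sums.
    return sum(abs(x - y) for x, y in zip(accumulate(a), accumulate(b)))


def centroid_lower_bound(a, b):
    """Cheap lower bound on emd_1d(a, b): the absolute difference of the
    first moments, |sum_i i * (a_i - b_i)|. It never exceeds the EMD."""
    return abs(sum(i * (x - y) for i, (x, y) in enumerate(zip(a, b))))


def range_query(query, database, threshold):
    """Return (index, distance) pairs for every histogram whose EMD to the
    query is at most `threshold`, plus the number of exact EMD evaluations.
    Because the filter is a lower bound, no true match is discarded."""
    results = []
    exact_calls = 0
    for idx, hist in enumerate(database):
        # Filter step: if even the lower bound exceeds the threshold,
        # the exact distance must exceed it too, so skip this histogram.
        if centroid_lower_bound(query, hist) > threshold:
            continue
        # Refine step: compute the exact distance for surviving candidates.
        exact_calls += 1
        dist = emd_1d(query, hist)
        if dist <= threshold:
            results.append((idx, dist))
    return results, exact_calls


if __name__ == "__main__":
    query = [2, 1, 1, 0]
    database = [
        [1, 2, 1, 0],
        [0, 0, 1, 3],
        [2, 1, 0, 1],
        [0, 1, 1, 2],
        [0, 4, 0, 0],
    ]
    matches, exact_calls = range_query(query, database, threshold=2)
    print(matches, exact_calls)
```

In this toy database only three of the five histograms pass the filter, and one of those is rejected by the exact computation. The design relies only on the filter never overestimating the true distance, which is what lets a candidate selection scheme skip work without missing similar histograms.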


Keywords: Transportation Problem; Edit Distance; Cost Matrix; Retrieval Problem; City Block Distance


  Sung-Hyuk Cha
  Department of Computer Science, Pace University, Pleasantville

