High-Dimensional Simplexes for Supermetric Search

  • Conference paper
Similarity Search and Applications (SISAP 2017)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10609)

Abstract

In a metric space, the triangle inequality implies that, for any three objects, a triangle can be formed whose edge lengths correspond to their pairwise distances. The n-point property generalises this: for any \((n+1)\) objects in the space, there exists an n-dimensional simplex whose edge lengths correspond to the distances among the objects. In general, metric spaces do not have this property; however, in 1953, Blumenthal showed that any semi-metric space which is isometrically embeddable in a Hilbert space also has the n-point property.
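
As a small worked illustration of how the property can fail in a general metric space (our own example, not one taken from the paper), consider the shortest-path metric on a star graph with centre \(c\) and three leaves \(x_1, x_2, x_3\), so that \(d(c, x_i) = 1\) and \(d(x_i, x_j) = 2\) for \(i \ne j\). Every triple satisfies the triangle inequality. By Schoenberg's criterion, the four points can be realised as a tetrahedron in Euclidean space only if the matrix \(G\) below is positive semi-definite:

\[
G_{ij} = \tfrac{1}{2}\left(d(c,x_i)^2 + d(c,x_j)^2 - d(x_i,x_j)^2\right),
\qquad
G = \begin{pmatrix} 1 & -1 & -1 \\ -1 & 1 & -1 \\ -1 & -1 & 1 \end{pmatrix},
\qquad
u^{\top} G\, u = 3 - 6 = -3 < 0 \quad \text{for } u = (1,1,1)^{\top}.
\]

Since \(G\) is not positive semi-definite, no such tetrahedron exists: this space is metric but lacks the four-point property. Geometrically, the leaves form an equilateral triangle of side 2 whose circumradius \(2/\sqrt{3}\) exceeds 1, so no point can lie at distance 1 from all three.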

We have previously called such spaces supermetric spaces, and have shown that many metric spaces are also supermetric, including Euclidean, Cosine, Jensen-Shannon and Triangular spaces of any dimension.

Here we show how such simplexes can be constructed from only their edge lengths, and we show how the geometry of the simplexes can be used to determine lower and upper bounds on unknown distances within the original space. As the number of dimensions increases, these bounds converge to the true distance.
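
The construction described above can be sketched as follows. This is a minimal illustration under our own assumptions, not the authors' implementation: the function names (`apex`, `build_base`, `project`, `bounds`) are hypothetical, Euclidean distance stands in for an arbitrary supermetric, and the reference objects are assumed to be in general position so that no altitude is zero. Each vertex is placed by solving one linear equation per earlier vertex, with the final coordinate taken as the altitude above the base. The bounds come from comparing two apexes placed on the same side of the base (lower) or on opposite sides (upper).

```python
import math

def euclid(a, b):
    """Euclidean distance, standing in here for any supermetric."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def apex(base, dists):
    """Place a new vertex given its distances to the k base vertices.

    base[0] is the origin (an empty list) and base[j] holds the j
    non-zero coordinates of vertex j. Returns k coordinates; the last
    one is the non-negative altitude above the base.
    """
    d0_sq = dists[0] ** 2
    y = []
    for j in range(1, len(base)):
        v = base[j]
        # From |y - v|^2 = dists[j]^2 and |y|^2 = dists[0]^2:
        # y . v = (dists[0]^2 + |v|^2 - dists[j]^2) / 2
        rhs = (d0_sq + sum(c * c for c in v) - dists[j] ** 2) / 2
        partial = sum(y[m] * v[m] for m in range(j - 1))
        y.append((rhs - partial) / v[j - 1])  # assumes v[j-1] != 0
    altitude_sq = d0_sq - sum(c * c for c in y)
    y.append(math.sqrt(max(0.0, altitude_sq)))  # clamp rounding error
    return y

def build_base(refs, dist):
    """Build the base simplex from the reference objects, one vertex at a time."""
    base = [[]]
    for i in range(1, len(refs)):
        base.append(apex(base, [dist(refs[i], r) for r in refs[:i]]))
    return base

def project(obj, refs, base, dist):
    """Map an object to its apex using only its distances to the references."""
    return apex(base, [dist(obj, r) for r in refs])

def bounds(x, y):
    """Lower and upper bounds on the original distance between two apexes."""
    head = sum((a - b) ** 2 for a, b in zip(x[:-1], y[:-1]))
    lower = math.sqrt(head + (x[-1] - y[-1]) ** 2)
    upper = math.sqrt(head + (x[-1] + y[-1]) ** 2)
    return lower, upper

# Illustrative data (made up): three reference points and two objects in 4-D.
refs = [(0.0, 0.0, 0.0, 0.0), (1.0, 0.0, 0.0, 0.0), (0.5, 1.0, 0.0, 0.0)]
a = (0.2, 0.3, 0.5, 0.1)
b = (0.7, 0.1, -0.4, 0.6)

base = build_base(refs, euclid)
lower, upper = bounds(project(a, refs, base, euclid),
                      project(b, refs, base, euclid))
print(lower, euclid(a, b), upper)
```

In this sketch each object is reduced to as many coordinates as there are reference objects, and the two bounds are computed from those coordinates alone. Adding reference objects adds coordinates and tightens the interval between the bounds.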

Finally, we show that for any Hilbert-embeddable space, it is possible to construct Euclidean spaces of arbitrary dimensions, from which these lower and upper bounds on distances in the original space can be determined. These spaces may be much cheaper to query than the original. For similarity search, the engineering tradeoffs are good: we show significant reductions in data size and metric cost with little loss of accuracy, leading to a significant overall improvement in exact search performance.
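
One way such bounds might be used in an exact threshold (range) query is sketched below. This is an assumption about usage rather than the paper's stated mechanism; the function name `classify` and the candidate values are hypothetical. The idea is that the original, expensive metric only needs to be evaluated when the bounds straddle the threshold.

```python
def classify(lower, upper, threshold):
    """Decide what to do with a candidate from its distance bounds alone."""
    if lower > threshold:
        return "exclude"   # true distance >= lower > threshold
    if upper <= threshold:
        return "include"   # true distance <= upper <= threshold
    return "verify"        # bounds straddle the threshold: compute the real distance

# Made-up (lower, upper) pairs for three candidates and a query threshold of 1.0.
candidates = {"p": (1.2, 1.9), "q": (0.3, 0.8), "r": (0.7, 1.4)}
decisions = {name: classify(lo, up, 1.0) for name, (lo, up) in candidates.items()}
print(decisions)
```

Because candidates are only discarded when the lower bound rules them out, and only accepted without checking when the upper bound guarantees a match, the result set is the same as for a search in the original space.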

Notes

  1. In [9] the authors note it works better for some metrics than for others; in our understanding, it will work well only for spaces with the n-point property.

  2. For precise definitions of the non-Euclidean metrics used, see [5].

  3. https://richardconnor@bitbucket.org/richardconnor/metric-space-framework.git.

References

  1. Blumenthal, L.M.: A note on the four-point property. Bull. Amer. Math. Soc. 39(6), 423–426 (1933)

  2. Blumenthal, L.M.: Theory and Applications of Distance Geometry. Clarendon Press, London (1953)

  3. Chávez, E., Navarro, G.: Metric databases. In: Rivero, L.C., Doorn, J.H., Ferraggine, V.E. (eds.) Encyclopedia of Database Technologies and Applications, pp. 366–371. Idea Group (2005)

  4. Chávez, E., Navarro, G., Baeza-Yates, R., Marroquín, J.L.: Searching in metric spaces. ACM Comput. Surv. 33(3), 273–321 (2001)

  5. Connor, R., Cardillo, F.A., Vadicamo, L., Rabitti, F.: Hilbert exclusion: improved metric search through finite isometric embeddings. ACM Trans. Inform. Syst. 35(3), 17:1–17:27 (2016)

  6. Connor, R., Vadicamo, L., Rabitti, F.: High-Dimensional Simplexes for Supermetric Search. arXiv e-prints, July 2017

  7. Connor, R.: A tale of four metrics. In: Amsaleg, L., Houle, M.E., Schubert, E. (eds.) SISAP 2016. LNCS, vol. 9939, pp. 210–217. Springer, Cham (2016). doi:10.1007/978-3-319-46759-7_16

  8. Cox, M.A.A., Cox, T.F.: Multidimensional Scaling. Springer, Heidelberg (2008). pp. 315–347

  9. De Silva, V., Tenenbaum, J.B.: Sparse multidimensional scaling using landmark points. Technical report (2004)

  10. Figueroa, K., Navarro, G., Chávez, E.: Metric spaces library (2007). http://www.sisap.org

  11. Fodor, I.K.: A survey of dimension reduction techniques. Technical report, Center for Applied Scientific Computing, Lawrence Livermore National Laboratory (2002)

  12. Jolliffe, I.: Principal Component Analysis. Wiley, New York (2014)

  13. Mao, R., Miranker, W.L., Miranker, D.P.: Dimension reduction for distance-based indexing. In: SISAP 2010, pp. 25–32. ACM (2010)

  14. Matoušek, J.: Lectures on Discrete Geometry, Graduate Texts in Mathematics. Springer, New York (2013)

  15. Menger, K.: Untersuchungen über allgemeine Metrik. Math. Ann. 100, 75–163 (1928)

  16. Micó, M.L., Oncina, J., Vidal, E.: A new version of the nearest-neighbour approximating and eliminating search algorithm (AESA) with linear preprocessing time and memory requirements. Patt. Recogn. Lett. 15(1), 9–17 (1994)

  17. Wilson, W.A.: A relation between metric and Euclidean spaces. Am. J. Math. 54(3), 505–517 (1932)

  18. Yang, L.: Distance Metric Learning: A Comprehensive Survey (2006)

  19. Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search: The Metric Space Approach. Advances in Database Systems, vol. 32. Springer, Heidelberg (2006)

Acknowledgements

The work was partially funded by Smart News, “Social sensing for breaking news”, co-funded by the Tuscany region under the FAR-FAS 2014 program, CUP CIPE D58C15000270008.

Author information

Corresponding author

Correspondence to Richard Connor.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Connor, R., Vadicamo, L., Rabitti, F. (2017). High-Dimensional Simplexes for Supermetric Search. In: Beecks, C., Borutta, F., Kröger, P., Seidl, T. (eds) Similarity Search and Applications. SISAP 2017. Lecture Notes in Computer Science, vol 10609. Springer, Cham. https://doi.org/10.1007/978-3-319-68474-1_7

  • DOI: https://doi.org/10.1007/978-3-319-68474-1_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-68473-4

  • Online ISBN: 978-3-319-68474-1

  • eBook Packages: Computer Science, Computer Science (R0)
