Efficient Nearest-Neighbor Query and Clustering of Planar Curves

  • Conference paper
Algorithms and Data Structures (WADS 2019)

Abstract

We study two fundamental problems dealing with curves in the plane, namely, the nearest-neighbor problem and the center problem. Let \(\mathcal {C}\) be a set of n polygonal curves, each of size m. In the nearest-neighbor problem, the goal is to construct a compact data structure over \(\mathcal {C}\), such that, given a query curve Q, one can efficiently find the curve in \(\mathcal {C}\) closest to Q. In the center problem, the goal is to find a curve Q, such that the maximum distance between Q and the curves in \(\mathcal {C}\) is minimized. We use the well-known discrete Fréchet distance function, both under \(L_\infty \) and under \(L_2\), to measure the distance between two curves.
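
As a point of reference, the discrete Fréchet distance between two point sequences can be computed with the classical quadratic-time dynamic program (in the spirit of Eiter and Mannila [18]). The sketch below is only an illustration of the distance function itself, not of the data structures developed in the paper; the function names `discrete_frechet`, `dist_linf`, and `dist_l2` are hypothetical, and curves are assumed to be non-empty lists of `(x, y)` tuples.

```python
from math import hypot


def dist_linf(p, q):
    """L_infinity distance between two points in the plane."""
    return max(abs(p[0] - q[0]), abs(p[1] - q[1]))


def dist_l2(p, q):
    """Euclidean (L_2) distance between two points in the plane."""
    return hypot(p[0] - q[0], p[1] - q[1])


def discrete_frechet(a, b, dist=dist_l2):
    """Discrete Frechet distance between point sequences a and b.

    Runs in O(len(a) * len(b)) time and keeps only one previous row.
    """
    if not a or not b:
        raise ValueError("curves must be non-empty")
    prev = []
    for i, p in enumerate(a):
        row = []
        for j, q in enumerate(b):
            d = dist(p, q)
            if i == 0 and j == 0:
                best = d
            elif i == 0:
                best = max(row[j - 1], d)   # only b has advanced so far
            elif j == 0:
                best = max(prev[0], d)      # only a has advanced so far
            else:
                # cheapest way to reach (i, j): advance a, both, or b
                best = max(min(prev[j], prev[j - 1], row[j - 1]), d)
            row.append(best)
        prev = row
    return prev[-1]
```

Passing `dist_linf` or `dist_l2` as the `dist` argument switches between the two underlying norms considered in the abstract.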

For the nearest-neighbor problem, despite discouraging previous results, we identify two important cases for which it is possible to obtain practical bounds, even when m and n are large. In these cases, either Q is a line segment or \(\mathcal {C}\) consists of line segments, and the bounds on the size of the data structure and query time are nearly linear in the size of the input and query curve, respectively. The returned answer is either exact under \(L_\infty \), or approximated to within a factor of \(1+\varepsilon \) under \(L_2\). We also consider the variants in which the location of the input curves is only fixed up to translation, and obtain similar bounds, under \(L_\infty \).
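
To see why the segment case is structurally simpler, assume the query segment is treated as the two-point sequence of its endpoints \((s, t)\). Any discrete Fréchet coupling then matches a prefix of the input curve to \(s\) and the remaining suffix to \(t\), so the distance can be evaluated in linear time with prefix and suffix maxima. The sketch below uses this observation together with a plain linear scan over \(\mathcal {C}\); the scan is a naive baseline, not the data structure of the paper, and the names `frechet_to_segment` and `nearest_curve` are hypothetical.

```python
def dist_linf(p, q):
    """L_infinity distance between two points in the plane."""
    return max(abs(p[0] - q[0]), abs(p[1] - q[1]))


def frechet_to_segment(curve, seg, dist=dist_linf):
    """Discrete Frechet distance between a curve and the two-point sequence seg.

    Assumes the segment is represented only by its endpoints (s, t).
    A prefix of the curve is matched to s and the rest to t.
    """
    s, t = seg
    m = len(curve)
    if m == 0:
        raise ValueError("curve must be non-empty")
    if m == 1:
        # the single vertex has to be matched to both endpoints
        return max(dist(curve[0], s), dist(curve[0], t))
    # suffix_max[i] = largest distance from any of curve[i:] to t
    suffix_max = [0.0] * (m + 1)
    for i in range(m - 1, -1, -1):
        suffix_max[i] = max(suffix_max[i + 1], dist(curve[i], t))
    best = float("inf")
    prefix_max = 0.0
    for i in range(m - 1):
        # curve[:i + 1] is matched to s, curve[i + 1:] to t
        prefix_max = max(prefix_max, dist(curve[i], s))
        best = min(best, max(prefix_max, suffix_max[i + 1]))
    return best


def nearest_curve(curves, seg, dist=dist_linf):
    """Index of the curve closest to seg, found by a naive linear scan."""
    return min(range(len(curves)),
               key=lambda k: frechet_to_segment(curves[k], seg, dist))
```

The scan costs time linear in the total input size per query, which is exactly the cost a dedicated data structure aims to avoid; it is useful mainly as a correctness check for small inputs.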

As for the center problem, we study the case where the center is a line segment, i.e., we seek the line segment that represents the given set as well as possible. We present near-linear time exact algorithms under \(L_\infty \), even when the location of the input curves is only fixed up to translation. Under \(L_2\), we present a roughly \(O(n^2m^3)\)-time exact algorithm.
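
The objective can be written out explicitly under the assumption that the center segment is represented by its two endpoints \(s, t\) and that each input curve is \(C = (c_1, \ldots , c_m)\) with \(m \ge 2\). The notation \(d_{dF}\), \(s\), \(t\), \(c_j = (x_j, y_j)\), and \(r\) is introduced here for illustration only:

\[
\min _{s,t \in \mathbb {R}^2} \ \max _{C \in \mathcal {C}} \ d_{dF}\bigl ((s,t), C\bigr ),
\qquad
d_{dF}\bigl ((s,t), C\bigr ) = \min _{1 \le i < m} \ \max \Bigl \{ \max _{1 \le j \le i} \Vert c_j - s \Vert ,\ \max _{i < j \le m} \Vert c_j - t \Vert \Bigr \} .
\]

Under \(L_\infty \), the condition that \(s\) is within distance \(r\) of every point of a prefix \(c_1, \ldots , c_i\) is equivalent to \(s\) lying in an axis-parallel rectangle:

\[
\max _{1 \le j \le i} \Vert c_j - s \Vert _\infty \le r
\iff
s \in \Bigl [ \max _{j \le i} x_j - r,\ \min _{j \le i} x_j + r \Bigr ] \times \Bigl [ \max _{j \le i} y_j - r,\ \min _{j \le i} y_j + r \Bigr ] ,
\]

and symmetrically for \(t\) and the suffix. This is one elementary observation suggesting why the \(L_\infty \) case is more tractable; it is not necessarily the route taken in the paper.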

A more complete version of this paper is available on arXiv [7].

References

  1. Abraham, C., Cornillon, P.A., Matzner-Lober, E., Molinari, N.: Unsupervised curve clustering using b-splines. Scand. J. Stat. 30(3), 581–595 (2003). https://doi.org/10.1111/1467-9469.00350

  2. Afshani, P., Driemel, A.: On the complexity of range searching among curves. In: Proceedings of the 29th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 898–917. SIAM (2018)

  3. Agarwal, P.K., Procopiuc, C.M.: Exact and approximation algorithms for clustering. Algorithmica 33(2), 201–226 (2002). https://doi.org/10.1007/s00453-001-0110-y

  4. Agarwal, P.K., Avraham, R.B., Kaplan, H., Sharir, M.: Computing the discrete Fréchet distance in subquadratic time. SIAM J. Comput. 43(2), 429–449 (2014). https://doi.org/10.1137/130920526

  5. Alewijnse, S.P.A., Buchin, K., Buchin, M., Kölzsch, A., Kruckenberg, H., Westenberg, M.A.: A framework for trajectory segmentation by stable criteria. In: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM Press, Dallas, November 2014. https://doi.org/10.1145/2666310.2666415

  6. Alt, H., Godau, M.: Computing the Fréchet distance between two polygonal curves. Intern. J. Comput. Geom. Appl. 05(01n02), 75–91 (1995). https://doi.org/10.1142/S0218195995000064

  7. Aronov, B., Filtser, O., Horton, M., Katz, M.J., Sheikhan, K.: Efficient nearest-neighbor query and clustering of planar curves. arXiv preprint arXiv:1904.11026 (2019)

  8. de Berg, M., Cook, A.F., Gudmundsson, J.: Fast Fréchet queries. Comput. Geom. 46(6), 747–755 (2013). https://doi.org/10.1016/j.comgeo.2012.11.006

  9. de Berg, M., Gudmundsson, J., Mehrabi, A.D.: A dynamic data structure for approximate proximity queries in trajectory data. In: Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, p. 48. ACM (2017)

  10. Berndt, D.J., Clifford, J.: Using dynamic time warping to find patterns in timeseries. In: Papers from the AAAI Knowledge Discovery in Databases Workshop: Technical report WS-94-03, pp. 359–370. AAAI Press, Seattle, July 1994

  11. Bringmann, K.: Why walking the dog takes time: Fréchet distance has no strongly subquadratic algorithms unless SETH fails. In: Proceedings of the 55th IEEE Symposium Foundations of Computer Science. IEEE, Philadelphia, October 2014. https://doi.org/10.1109/focs.2014.76

  12. Bringmann, K., Mulzer, W.: Approximability of the discrete Fréchet distance. J. Comput. Geom. 7(2), 46–76 (2016). http://jocg.org/index.php/jocg/article/view/261

  13. Buchin, K., et al.: Approximating \((k, l)\)-center clustering for curves. In: Proceedings of the 30th Annual ACM-SIAM Symposium on Discrete Algorithms, San Diego, California, USA, 6–9 January 2019, pp. 2922–2938 (2019). https://doi.org/10.1137/1.9781611975482.181

  14. Chiou, J.M., Li, P.L.: Functional clustering and identifying substructures of longitudinal data. J. Roy. Stat. Soc.: Ser. B (Stat. Methodol.) 69(4), 679–699 (2007). https://doi.org/10.1111/j.1467-9868.2007.00605.x

  15. Driemel, A., Har-Peled, S.: Jaywalking your dog—computing the Fréchet distance with shortcuts. In: Proceedings of the 23rd ACM-SIAM Symposium on Discrete Algorithms, pp. 318–355. Society for Industrial and Applied Mathematics, Kyoto, January 2012. https://doi.org/10.1137/1.9781611973099.30

  16. Driemel, A., Krivošija, A., Sohler, C.: Clustering time series under the Fréchet distance. In: Proceedings of the 27th ACM-SIAM Symposium on Discrete Algorithms, pp. 766–785. SIAM, January 2016. https://doi.org/10.1137/1.9781611974331.ch55

  17. Driemel, A., Silvestri, F.: Locality-sensitive hashing of curves. In: Proceedings of the 33rd International Symposium on Computational Geometry, SoCG 2017, Brisbane, Australia, pp. 37:1–37:16 (2017). http://drops.dagstuhl.de/opus/volltexte/2017/7203

  18. Eiter, T., Mannila, H.: Computing discrete Fréchet distance. Technical report CD-TR 94/64, Christian Doppler Labor. für Expertensysteme, Technische Uni. Wien (1994)

  19. Emiris, I.Z., Psarros, I.: Products of Euclidean metrics and applications to proximity questions among curves. In: Proceedings of the 34th International Symposium on Computational Geometry, SoCG 2018, 11–14 June 2018, Budapest, Hungary, pp. 37:1–37:13 (2018). https://doi.org/10.4230/LIPIcs.SoCG.2018.37. arXiv:1712.06471

  20. Fréchet, M.M.: Sur quelques points du calcul fonctionnel. Rendiconti del Circolo Matematico di Palermo 22(1), 1–72 (1906). https://doi.org/10.1007/BF03018603

  21. Gonzalez, T.F.: Clustering to minimize the maximum intercluster distance. Theor. Comput. Sci. 38, 293–306 (1985). https://doi.org/10.1016/0304-3975(85)90224-5

  22. Gudmundsson, J., Horton, M.: Spatio-temporal analysis of team sports. ACM Comput. Surv. 50(2), 1–34 (2017). https://doi.org/10.1145/3054132

  23. Hausdorff, F.: Mengenlehre. Walter de Gruyter, Berlin (1927)

  24. Hsu, W.L., Nemhauser, G.L.: Easy and hard bottleneck location problems. Discr. Appl. Math. 1(3), 209–215 (1979). https://doi.org/10.1016/0166-218x(79)90044-1

  25. Indyk, P.: Approximate nearest neighbor algorithms for Fréchet distance via product metrics. In: Proceedings of the 18th Symposium on Computational Geometry, pp. 102–106. ACM Press, Barcelona, June 2002. https://doi.org/10.1145/513400.513414

  26. Indyk, P., Matoušek, J.: Low-distortion embeddings of finite metric spaces. In: Handbook of Discrete and Computational Geometry, 2nd edn. Chapman and Hall/CRC, April 2004. https://doi.org/10.1201/9781420035315.ch8

  27. Niu, H., Wang, J.: Volatility clustering and long memory of financial time series and financial price model. Digit. Signal Process. 23(2), 489–498 (2013). https://doi.org/10.1016/j.dsp.2012.11.004

  28. Willard, D.E., Lueker, G.S.: Adding range restriction capability to dynamic data structures. J. ACM 32(3), 597–617 (1985). https://doi.org/10.1145/3828.3839

Acknowledgements

B. Aronov was supported by NSF grants CCF-12-18791 and CCF-15-40656, and by grant 2014/170 from the US-Israel Binational Science Foundation. O. Filtser was supported by the Israeli Ministry of Science, Technology & Space, and by grant 2014/170 from the US-Israel Binational Science Foundation. Most of the work on this project by M. Horton was performed while visiting the Department of Computer Science and Engineering at the Tandon School of Engineering, New York University in the spring/summer of 2018, partially supported by NSF grant CCF-12-18791. M. Katz was supported by grant 1884/16 from the Israel Science Foundation and by grant 2014/170 from the US-Israel Binational Science Foundation. Part of the work on this project by M. Katz was performed while visiting the Department of Computer Science and Engineering at the Tandon School of Engineering, New York University in the spring of 2018, partially supported by NSF grants CCF-12-18791 and CCF-15-40656. Work of K. Sheikhan on this paper was performed while at the Tandon School of Engineering, New York University, supported by NSF grant CCF-12-18791.

Author information

Corresponding author

Correspondence to Matthew J. Katz.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Aronov, B., Filtser, O., Horton, M., Katz, M.J., Sheikhan, K. (2019). Efficient Nearest-Neighbor Query and Clustering of Planar Curves. In: Friggstad, Z., Sack, JR., Salavatipour, M. (eds) Algorithms and Data Structures. WADS 2019. Lecture Notes in Computer Science, vol 11646. Springer, Cham. https://doi.org/10.1007/978-3-030-24766-9_3

  • DOI: https://doi.org/10.1007/978-3-030-24766-9_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-24765-2

  • Online ISBN: 978-3-030-24766-9

  • eBook Packages: Computer Science (R0)