Skip to main content

A cluster-driven classification approach to truck stop location identification using passive GPS data

Abstract

Classifying the type of truck stops is vital in transportation planning and goods movement strategies. Truck stops could be classified into primary or secondary. While the latter entail stopping to re-fuel or rest, the former takes place to deliver or pick up merchandize. The availability of GPS transponders on board moving trucks and the ability to access such information in recent years has made it possible to analyze various freight aspects including movement trajectories and stopped locations. This paper utilizes machine learning methods and proposes a two-step cluster-based classification approach to classify truck stop locations into either primary or secondary. The DBSCAN clustering technique is applied on the GPS dataset to obtain stop locations. Next, several features per location are derived to classify the stops using well-known classification models. The generated information is then used to evaluate the approach using a large truck GPS dataset for the year 2016. The Random Forest classifier is chosen as it can identify primary stop locations with an accuracy of 97%. The overall accuracy of the classifier for correctly identifying both types of stops is 83%. Further, the prediction accuracy for primary stops is 92%.

This is a preview of subscription content, access via your institution.

Fig. 1

(Source: Adopted from Gingerich et al. (2016))

Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

References

  • Aziz R, Kedia M, Dan S et al (2016) Identifying and characterizing truck stops from GPS data. In: Perner P (ed) Advances in data mining. Applications and theoretical aspects. Springer International Publishing, Cham, pp 168–182

    Chapter  Google Scholar 

  • Birant D, Kut A (2007) ST-DBSCAN: an algorithm for clustering spatial–temporal data. Data Knowl Eng 60:208–221. https://doi.org/10.1016/j.datak.2006.01.013

    Article  Google Scholar 

  • Bohte W, Maat K (2009) Deriving and validating trip purposes and travel modes for multi-day GPS-based travel surveys: a large-scale application in the Netherlands. Transp Res Part C Emerg Technol 17:285–297. https://doi.org/10.1016/j.trc.2008.11.004

    Article  Google Scholar 

  • Borah B, Bhattacharyya DK (2007) A clustering technique using density difference. In: 2007 international conference on signal processing, communications and networking. pp 585–588

  • Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324

    Article  Google Scholar 

  • Camargo P, Hong S, Livshits V (2017) Expanding the uses of truck GPS data in freight modeling and planning activities. Transp Res Rec 2646:68–76. https://doi.org/10.3141/2646-08

    Article  Google Scholar 

  • Chen W, Ji M, Wang J (2014) T-DBSCAN: a spatiotemporal density clustering for GPS trajectory segmentation. Int J Onl Eng 10:19. https://doi.org/10.3991/ijoe.v10i6.3881

    Article  Google Scholar 

  • Diana M, Pirra M, Woodcock A (2020) Freight distribution in urban areas: a method to select the most important loading and unloading areas and a survey tool to investigate related demand patterns. Eur Transp Res Rev 12:40. https://doi.org/10.1186/s12544-020-00430-w

    Article  Google Scholar 

  • Elbatta MTH, Ashour WM (2013) A dynamic method for discovering density varied clusters

  • Ester M, Kriegel H-P, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of 2nd international conference on knowledge discovery and data mining, pp 226–231

  • Gaonkar MN, Sawant K (2013) AutoEpsDBSCAN: DBSCAN with Eps automatic for large dataset. Int J Adv Comput Theory Eng 2(2):11–16

    Google Scholar 

  • Gingerich K, Maoh H, Anderson W (2016) Classifying the purpose of stopped truck events: an application of entropy to GPS data. Transp Res Part C Emerg Technol 64:17–27. https://doi.org/10.1016/j.trc.2016.01.002

    Article  Google Scholar 

  • Gong L, Sato H, Yamamoto T et al (2015) Identification of activity stop locations in GPS trajectories by density-based clustering method combined with support vector machines. J Mod Transp 23:202–213. https://doi.org/10.1007/s40534-015-0079-x

    Article  Google Scholar 

  • Gong L, Yamamoto T, Morikawa T (2018) Identification of activity stop locations in GPS trajectories by DBSCAN-TE method combined with support vector machines. Transp Res Procedia 32:146–154. https://doi.org/10.1016/j.trpro.2018.10.028

    Article  Google Scholar 

  • Hess S, Quddus M, Rieser-Schüssler N, Daly A (2015) Developing advanced route choice models for heavy goods vehicles using GPS data. Transp Res Part E Logist Transp Rev 77:29–44. https://doi.org/10.1016/j.tre.2015.01.010

    Article  Google Scholar 

  • Holguín-Veras J, Encarnación T, Pérez-Guzmán S, Yang X (2020) Mechanistic identification of freight activity stops from global positioning system data. Transp Res Rec 2674:235–246. https://doi.org/10.1177/0361198120911922

    Article  Google Scholar 

  • Hunt JD, Stefan KJ (2007) Tour-based microsimulation of urban commercial movements. Transp Res Part B Methodol 41:981–1013. https://doi.org/10.1016/j.trb.2007.04.009

    Article  Google Scholar 

  • Jiang K, Yin H, Wang P, Yu N (2013) Learning from contextual information of geo-tagged web photos to rank personalized tourism attractions. Neurocomputing 119:17–25. https://doi.org/10.1016/j.neucom.2012.02.049

    Article  Google Scholar 

  • Khetarpaul S, Chauhan R, Gupta SK et al (2011) Mining GPS data to determine interesting locations. In: Proceedings of the 8th international workshop on information integration on the web in conjunction with WWW 2011 - IIWeb ’11. ACM Press, Hyderabad, India, pp 1–6

  • Laranjeiro PF, Merchán D, Godoy LA et al (2019) Using GPS data to explore speed patterns and temporal fluctuations in urban logistics: the case of São Paulo, Brazil. J Transp Geogr 76:114–129. https://doi.org/10.1016/j.jtrangeo.2019.03.003

    Article  Google Scholar 

  • Li Q, Zheng Y, Xie X et al (2008) Mining user similarity based on location history. In: proceedings of the 16th ACM SIGSPATIAL international conference on advances in geographic information systems - GIS ’08. ACM Press, Irvine, California, p 1

  • Liu P, Zhou D, Wu N (2007) VDBSCAN: varied density based spatial clustering of applications with noise. In: 2007 international conference on service systems and service management. pp 1–4

  • Liu B, de Souza EN, Matwin S, Sydow M (2014) Knowledge-based clustering of ship trajectories using density-based approach. In: 2014 IEEE international conference on big data (Big Data). pp 603–608

  • Luo T, Zheng X, Xu G et al (2017) An improved DBSCAN algorithm to detect stops in individual trajectories. ISPRS Int J Geo Inf 6:63. https://doi.org/10.3390/ijgi6030063

    Article  Google Scholar 

  • Luong C, Do S, Hoang T (2015) A method for detecting significant places from GPS trajectory data. J Adv Inf Technol. https://doi.org/10.12720/jait.6.1.44-48

    Article  Google Scholar 

  • Rodrigue J-P, Comtois C, Slack B (2009) The geography of transport systems, 2nd edn. Routledge, New York, pp 156–157

    Google Scholar 

  • Rosalina E, Salim FD, Sellis T (2017) Automated density-based clustering of spatial urban data for interactive data exploration. In: 2017 IEEE conference on computer communications workshops (INFOCOM WKSHPS). pp 295–300

  • Sarti L, Bravi L, Sambo F et al (2017) Stop purpose classification from GPS data of commercial vehicle fleets. In: 2017 IEEE international conference on data mining workshops (ICDMW). pp 280–287

  • Soni N, Ganatra DA (2016) AGED (Automatic generation of Eps for DBSCAN). Int J Comput Sci Inf Secur 14(5):536–559

    Google Scholar 

  • Sowjanya MV, Padmaja TM (2017) Varied density based graph clustering algorithm for social networks. In: 2017 international conference on I-SMAC (IoT in social, mobile, analytics and cloud) (I-SMAC). pp 520–524

  • Sun D, Leurent F, Xie X (2021) Mining vehicle trajectories to discover individual significant places: case study using floating car data in the Paris region. Transp Res Rec 2675:1–9. https://doi.org/10.1177/0361198121995500

    Article  Google Scholar 

  • Taghavi M, Irannezhad E, Prato CG (2019) Identifying truck stops from a large stream of GPS data via a hidden Markov Chain Model. In: 2019 IEEE intelligent transportation systems conference (ITSC). pp 2265–2271

  • Thierry B, Chaix B, Kestens Y (2013) Detecting activity locations from raw GPS data: a novel kernel-based algorithm. Int J Health Geogr 12:14. https://doi.org/10.1186/1476-072X-12-14

    Article  Google Scholar 

  • Wolf J, Guensler R, Bachman W (2001) Elimination of the travel diary: experiment to derive trip purpose from global positioning system travel data. Transp Res Rec. https://doi.org/10.3141/1768-15

    Article  Google Scholar 

  • Wu T, Shen H, Qin J, Xiang L (2021) Extracting Stops from spatio-temporal trajectories within dynamic contextual features. Sustainability 13:690. https://doi.org/10.3390/su13020690

    Article  Google Scholar 

  • Yang X, Sun Z, Ban XJ, Holguín-Veras J (2014) Urban freight delivery stop identification with GPS data. Transp Res Rec 2411:55–61. https://doi.org/10.3141/2411-07

    Article  Google Scholar 

  • Yu X, Zhou D, Zhou Y (2005) A new clustering algorithm based on distance and density. In: Proceedings of ICSSSM ’05. 2005 International conference on services systems and services management, 2005, vol 2. pp 1016–1021

  • Zheng Y, Zhang L, Xie X, Ma W-Y (2009) Mining interesting locations and travel sequences from GPS trajectories. In: Proceedings of the 18th international conference on World wide web—WWW ’09. ACM Press, Madrid, Spain, p 791

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hanna Maoh.

Ethics declarations

Conflict of interest

The author(s) declare(s) that there is no conflict of interest regarding the publication of this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Patel, V., Maleki, M., Kargar, M. et al. A cluster-driven classification approach to truck stop location identification using passive GPS data. J Geogr Syst (2022). https://doi.org/10.1007/s10109-022-00380-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10109-022-00380-y

Keywords

  • GPS
  • DBSCAN clustering
  • Trucks
  • Stop location
  • Specialization-index
  • Classification

JEL Classification

  • C02