Abstract
Accurate estimation of running and dwell times is important for all levels of planning and control of railway traffic. The availability of historical track occupation data with a high degree of granularity inspired a data-driven approach for estimating these process times. In this paper we present and compare the accuracy of several approaches to model running and dwell times in railway traffic. Three global predictive model approaches are presented based on advanced statistical learning techniques: LTS robust linear regression, regression trees and random forests. Also local models are presented for a particular train line, station or block section, based on LTS robust linear regression with some refinements. The models are validated and compared using a test set independent from the training set. The applicability of the proposed data-driven approach for real-time applications is proved by the accuracy of the obtained estimates and the low computation times. Overall, the local models perform best both in accuracy and computation time.
Similar content being viewed by others
References
Abril M, Barber F, Ingolotti L, Salido M, Tormos P, Lova A (2008) An assessment of railway capacity. Transp Res Part E Logist Transp Rev 44(5):774–806
Berger A, Gebhardt A, Müller-Hannemann M, Ostrowski M (2011) Stochastic delay prediction in large train networks. In: Caprara A, Kontogiannis S (eds) 11th Workshop on algorithmic approaches for transportation modelling, optimization, and systems, Dagstuhl, pp 100–111
Bešinović N, Quaglietta E, Goverde RMP (2013) A simulation-based optimization approach for the calibration of dynamic train speed profiles. J Rail Transp Plan Manag 3(4):126–136
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Breiman L, Friedman J, Ohlsen R, Stone C (1984) Classification and regression trees. Wadsworth, New York
Brünger O, Dahlhaus E (2014) Running time estimation. In: Hansen IA, Pachl J (eds) Railway timetable and operations—analysis. Modelling, optimisation, simulation, performance evaluation. Eurailpress, Hamburg, pp 65–89
Buchmueller S, Weidmann U, Nash A (2008) Development of a dwell time calculation model for timetable planning. In: Allan J, Brebbia CA, Rumsey AF, Sciutto G, Sone S, Goodman CJ (eds) Computers in railways XI. WIT Press, Southampton, pp 525–534
Büker T, Seybold B (2012) Stochastic modelling of delay propagation in large networks. J Rail Transp Plan Manag 2(1–2):34–50
D’Ariano A, Pranzo M, Hansen IA (2007) Conflict resolution and train speed coordination for solving real-time timetable perturbations. IEEE Trans Intell Transp Syst 8(2):208–222
Dolder U, Krista M, Voelcker M (2009) RCS—rail control system—realtime train run simulation and conflict detection on a net wide scale based on updated train positions. In: Proceedings of the 3rd international seminar on railway operations modelling and analysis (RailZurich2009), Zurich, pp 1–15
Goverde RMP (2007) Railway timetable stability analysis using max-plus system theory. Transp Res Part B Methodol 41(2):179–201
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning. Springer Science+Business Media, New York
Hooghiemstra JS (1996) Design of regular interval timetables for strategic and tactical railway planning. In: Allan J, Brebbia CA, Hill RJ, Sciutto G, S S (eds) Computers in railways V, Computational Mechanics Publications. WIT Press, Southempton, pp 393–402
Kecman P, Goverde RMP (2012) Process mining of train describer event data and automatic conflict identification. In: Brebbia CA, Tomii N, Mera JM (eds) Computers in railways XIII, WIT transactions on the built environment, vol 127. WIT Press, Southampton, pp 227–238
Kecman P, Goverde RMP (2014) Online data-driven adaptive prediction of train event times. IEEE Trans Intell Transp Syst 16(1):465–474
Lee YC, Daamen W, Wiggenraad PBL (2007) Dwell times of public transport vehicles: a state-of-the-art report. In: Transportation Research Board 86th Annual Meeting, Washington, pp 1–14
Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18–22
Longo G, Medeossi G (2013) An approach for calibrating and validating the simulation of complex rail networks. In: Transportation Research Board 92nd Annual Meeting, Washington, pp 1–19
Longo G, Medeossi G, Nash A (2012) Estimating train motion using detailed sensor data. In: Transportation Research Board 91st Annual Meeting, Washington, pp 1–15
Lüthi M (2009) Improving the efficiency of heavily used railway networks through integrated real-time rescheduling. Ph.D. thesis, ETH Zurich, Zurich
Medeossi G, Longo G, de Fabris S (2011) A method for using stochastic blocking times to improve timetable planning. J Rail Transp Plan Manag 1(1):1–13
Nash A, Huerlimann D (2004) Railroad simulation using OpenTrack. In: Allan J, Brebbia CA, Hill RJ, Sciutto G, Sone S (eds) Computers in railways IX. WIT Press, Southampton, pp 45–54
R Core Team (2013) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. http://www.R-project.org/
Rousseeuw PJ (2005) Robust regression and outlier detection. Wiley, New York
Rousseeuw PJ, Driessen K (2006) Computing LTS regression for large data sets. Data Min Knowl Discov 12(1):29–45
Rousseeuw PJ, Croux C, Todorov V, Ruckstuhl A, Salibian-Barrera M, Verbeke T, Koller M, Maechler M (2014) robustbase: basic robust statistics. R package version 0.90-2
Schöbel A, Schwarze S (2013) Finding delay-resistant line concepts using a game-theoretic approach. Netnomics 14(3):95–117
Stam-Van den Berg BWV, Weeda VA (2007) VTL-tool: detailed analysis of dutch railway traffic. In: Proceedings of the 3rd international seminar on railway operations modelling and analysis (RailHanover2007), Hanover, pp 1–10
Therneau T, Atkinson B, Ripley B (2014) rpart: recursive partitioning and regression trees. R package version 4.1-5
Van der Meer DJ, Goverde RMP, Hansen IA (2010) Prediction of train running times and conflicts using track occupation data. In: Proceedings of the 12th world conference on transport research (WCTR 2013), Lisbon
Wende D (ed) (2003) Fahrdynamik des Schienenverkehrs. Teubner Verlag, Wiesbaden, B.G (in German)
Wiggenraad PBL (2001) Alighting and boarding times of passengers at Dutch railway stations—analysis of data collected at 7 stations in October 2000. In: Papers of the TRAIL workshop train delay at stations and network stability, TRAIL Research Scool, Delft
Acknowledgments
This work is partially funded by the Dutch Technology Foundation STW, research project: Model-Predictive Railway Traffic Management (Project No. 11025).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kecman, P., Goverde, R.M.P. Predictive modelling of running and dwell times in railway traffic. Public Transp 7, 295–319 (2015). https://doi.org/10.1007/s12469-015-0106-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12469-015-0106-7