Skip to main content
Log in

Global synchromodal shipment matching problem with dynamic and stochastic travel times: a reinforcement learning approach

  • Original Research
  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

Global synchromodal transportation involves the movement of container shipments between inland terminals located in different continents using ships, barges, trains, trucks, or any combination among them through integrated planning at a network level. One of the challenges faced by global operators is the matching of accepted shipments with services in an integrated global synchromodal transport network with dynamic and stochastic travel times. The travel times of services are unknown and revealed dynamically during the execution of transport plans, but the stochastic information of travel times are assumed available. Matching decisions can be updated before shipments arrive at their destination terminals. The objective of the problem is to maximize the total profits that are expressed in terms of a combination of revenues, travel costs, transfer costs, storage costs, delay costs, and carbon tax over a given planning horizon. We propose a sequential decision process model to describe the problem. In order to address the curse of dimensionality, we develop a reinforcement learning approach to learn the value of matching a shipment with a service through simulations. Specifically, we adopt the Q-learning algorithm to update value function estimations and use the \(\epsilon \)-greedy strategy to balance exploitation and exploration. Online decisions are created based on the estimated value functions. The performance of the reinforcement learning approach is evaluated in comparison to a myopic approach that does not consider uncertainties and a stochastic approach that sets chance constraints on feasible transshipment under a rolling horizon framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

Download references

Acknowledgements

This research is financially supported by the National Key R&D program of China under Grant 2019YFB1600400, by the project “Complexity Methods for Pre dictive Synchromodality” (Project 439.16.120) of the Netherlands Organisation for Scientic Research (NWO), as well as by the Natural Sciences and Engineering Council of Canada (NSERC) through its Cooperative Research and Development Grants Program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to W. Guo.

Ethics declarations

Conflicts of interest

The authors declare that they have no conict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guo, W., Atasoy, B. & Negenborn, R.R. Global synchromodal shipment matching problem with dynamic and stochastic travel times: a reinforcement learning approach. Ann Oper Res (2022). https://doi.org/10.1007/s10479-021-04489-z

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10479-021-04489-z

Keywords

Navigation