Intelligent Inventory Control: Is Bootstrapping Worth Implementing?

Katanyukul, Tatpong; Chong, Edwin K. P.; Duff, William S.

doi:10.1007/978-3-642-32891-6_10

Tatpong Katanyukul⁴,
Edwin K. P. Chong⁵ &
William S. Duff⁶

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 385))

Included in the following conference series:

International Conference on Intelligent Information Processing

1418 Accesses
1 Citations

Abstract

The common belief is that using Reinforcement Learning methods (RL) with bootstrapping gives better results than without. However, inclusion of bootstrapping increases the complexity of the RL implementation and requires significant effort. This study investigates whether inclusion of bootstrapping is worth the effort when applying RL to inventory problems. Specifically, we investigate bootstrapping of the temporal difference learning method by using eligibility trace. In addition, we develop a new bootstrapping extension to the Residual Gradient method to supplement our investigation. The results show questionable benefit of bootstrapping when applied to inventory problems. Significance tests could not confirm that bootstrapping had statistically significantly reduced costs of inventory controlled by a RL agent. Our empirical results are based on a variety of problem settings, including demand correlations, demand variances, and cost structures.

Download to read the full chapter text

Chapter PDF

A Reinforcement Learning Approach to Inventory Management

Hybrid algorithm based on reinforcement learning for smart inventory management

Article Open access 03 August 2022

Learning Inventory Control Rules for Perishable Items by Simulation-Based Optimization

Keywords

References

Baird, L.: Residual Algorithms: Reinforcement Learning with Function Approximation. In: Proceedings of the 12th International Conference on Machine Learning, pp. 30–37. Morgan Kaufmann (1995)
Google Scholar
Barreto, A.M.S., Anderson, C.W.: Restricted gradient-descent algorithm for value-function approximation in reinforcement learning. Artificial Intelligence 172(4-5), 454–482 (2008)
Article MathSciNet MATH Google Scholar
Jiang, C., Sheng, Z.: Case-based reinforcement learning for dynamic inventory control in a multi-agent supply chain system. Expert Systems with Applications 36(3), 6520–6526 (2009)
Article Google Scholar
Katanyukul, T., Duff, W.S., Chong, E.K.P.: Approximate dynamic programming for an inventory problem: Empirical comparison. Computers & Industrial Engineering 60(4), 719–743 (2011)
Article Google Scholar
Kim, C.O., Jun, J., Baek, J.K., Smith, R.L., Kim, Y.D.: Adaptive inventory control models for supply chain management. International Journal of Advanced Manufacturing Technology 26(9-10), 1184–1192 (2005)
Article Google Scholar
Kim, C.O., Kwon, I.H., Baek, J.G.: Asynchronous action-reward learning for nonstationary serial supply chain inventory control. Applied Intelligence 28(1), 1–16 (2008)
Article Google Scholar
Kwon, I.H., Kim, C.O., Jun, J., Lee, J.H.: Case-based myopic reinforcement learning for satisfying target service level in supply chain. Expert Systems with Applications 35(1-2), 389–397 (2008)
Article Google Scholar
Leng, J., Jain, L., Fyfe, C.: Experimental analysis of eligibility traces strategies in temporal difference learning. International Journal of Knowledge Engineering and Soft Data Paradigms 1(1), 26–39 (2009)
Article Google Scholar
Maei, H.R., Szepesvari, C., Bhatnagar, S., Precup, D., Silver, D., Sutton, R.S.: Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. In: Advances in Neural Information Processing Systems. MIT Press, Vancouver (2009)
Google Scholar
Prestwich, S.D., Tarim, S.A., Rossi, R., Hnich, B.: A Cultural Algorithm for POMDPs from Stochastic Inventory Control. In: Blesa, M.J., Blum, C., Cotta, C., Fernández, A.J., Gallardo, J.E., Roli, A., Sampels, M. (eds.) HM 2008. LNCS, vol. 5296, pp. 16–28. Springer, Heidelberg (2008)
Chapter Google Scholar
Reynolds, R.G.: An Introduction to Cultural Algorithms. In: Proceedings of the 3rd Annual Conference on Evolutionary Programming, pp. 131–139. World Scientific Publishing (1994)
Google Scholar
Shervais, S., Shannon, T.T., Lendaris, G.G.: Intelligent Supply Chain Management Using Adaptive Critic Learning. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans 33(2), 235–244 (2003)
Article Google Scholar
Singh, S.P., Sutton, R.S.: Reinforcement Learning with Replacing Eligibility Traces. Machine Learning 22(1-3), 123–158 (1996)
Article MATH Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning. MIT Press (1998)
Google Scholar
Tesauro, G.J.: TD-Gammon, a self-teaching backgammon program, achieves master level play. Neural Computation 6(2), 215–219 (1994)
Article Google Scholar
Van Roy, B., Bertsekas, D.P., Lee, Y., Tsitsiklis, J.N.: A Neuro-Dynamic Programming Approach to Retailer Inventory Management. In: Proceedings of the IEEE Conference on Decision and Control (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Embedded System Research Group and Department of Computer Engineering, Faculty of Engineering, Khon Kaen University, Khon Kaen, Thailand
Tatpong Katanyukul
Department of Electrical and Computer Engineering, College of Engineering, Colorado State University, Fort Collins, CO, USA
Edwin K. P. Chong
Department of Mechanical Engineering, College of Engineering, Colorado State University, Fort Collins, CO, USA
William S. Duff

Authors

Tatpong Katanyukul
View author publications
You can also search for this author in PubMed Google Scholar
Edwin K. P. Chong
View author publications
You can also search for this author in PubMed Google Scholar
William S. Duff
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, China
Zhongzhi Shi
Computer Science Department, Indiana University, 47405, Bloomington, IN, USA
David Leake
School of Computing Science and Engineering, University of Salford, M5 4WT, Salford, UK
Sunil Vadera

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Katanyukul, T., Chong, E.K.P., Duff, W.S. (2012). Intelligent Inventory Control: Is Bootstrapping Worth Implementing?. In: Shi, Z., Leake, D., Vadera, S. (eds) Intelligent Information Processing VI. IIP 2012. IFIP Advances in Information and Communication Technology, vol 385. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32891-6_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-32891-6_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32890-9
Online ISBN: 978-3-642-32891-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Intelligent Inventory Control: Is Bootstrapping Worth Implementing?

Abstract

Chapter PDF

Similar content being viewed by others

A Reinforcement Learning Approach to Inventory Management

Hybrid algorithm based on reinforcement learning for smart inventory management

Learning Inventory Control Rules for Perishable Items by Simulation-Based Optimization

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Intelligent Inventory Control: Is Bootstrapping Worth Implementing?

Abstract

Chapter PDF

Similar content being viewed by others

A Reinforcement Learning Approach to Inventory Management

Hybrid algorithm based on reinforcement learning for smart inventory management

Learning Inventory Control Rules for Perishable Items by Simulation-Based Optimization

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation