Abstract
In order to mitigate food waste resulting from irrational pricing and concurrently augment overall business revenue, this paper introduces a finite-shelf-life food dynamic pricing framework denoted as W-DQN, founded upon the principles of deep reinforcement learning theory. Initially, the dynamic pricing problem for perishable goods is formulated as a Markov decision process. Subsequently, a dynamic pricing algorithm model and corresponding reward function are devised to bolster business revenue while curtailing food waste. Experimental findings unequivocally demonstrate that, relative to tabular-based dynamic pricing algorithm models, W-DQN attains commendable returns. Furthermore, the proposed reward function effectively reduces waste. In comparison to conventional pricing approaches, W-DQN significantly diminishes food waste, thereby enhancing overall business revenue.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhou, H., Chen, K., Wang, S.: Two-period pricing and inventory decisions of perishable products with partial lost sales. Eur. J. Oper. Res. 310(2), 611–626 (2023). https://doi.org/10.1016/j.ejor.2023.03.010
Lu, R., Hong, S.H., Zhang, X.: A dynamic pricing demand response algorithm for smart grid: reinforcement learning approach. Appl. Energy 2018(220), 220–230 (2018)
Zhang, L., Gao, Y., Zhu, H., et al.: Bi-level stochastic real-time pricing model in multi-energy generation system: a reinforcement learning approach. Energy 239, 121926 (2022)
Wang, J., Yang, D., Chen, K., et al.: Cruise dynamic pricing based on SARSA algorithm. Marit. Policy Manage. 48(2), 259–282 (2021)
Yang, C., Feng, Y., Whinston, A.: Dynamic pricing and information disclosure for fresh produce: an artificial intelligence approach. Prod. Oper. Manage. 31(1), 155–171 (2022)
Wang, K., Long, C., Ong, D.J., et al.: Single-site perishable inventory management under uncertainties: a deep reinforcement learning approach. IEEE Trans. Knowl. Data Eng. 2023 (2013)
Wang, X., Li, D.: A dynamic product quality evaluation based pricing model for perishable food supply chains. Omega 40(6), 906–917 (2012). https://doi.org/10.1016/j.omega.2012.02.001
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015). https://doi.org/10.1038/nature14236
Zhang, Y., Wang, Z.: Integrated ordering and pricing policy for perishable products with inventory inaccuracy. In: 2018 IEEE 14th International Conference on Automation Science and Engineering (CASE), pp. 1230–1236. IEEE (2018)
Rana, R., Oliveira, F.S.: Dynamic pricing policies for interdependent perishable products or services using reinforcement learning. Expert Syst. Appl. 42(1), 426–436 (2015). https://doi.org/10.1016/j.eswa.2014.07.007
Acknowledgements
This research is supported by the Liaoning Provincial Department of Education Basic Research Project, the National Natural Science Foundation of China and the Doctoral Scientific Foundation of Liaoning Province of China under Grant (LJKQZ2021120, 61802041 and 2020-BS-210).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Huang, H., Wang, C. (2024). Research on Food Dynamic Pricing Algorithm Based on Deep Reinforcement Learning. In: Song, H., Xu, M., Yang, L., Zhang, L., Yan, S. (eds) Innovative Technologies for Printing, Packaging and Digital Media. CACPP 2023. Lecture Notes in Electrical Engineering, vol 1144. Springer, Singapore. https://doi.org/10.1007/978-981-99-9955-2_63
Download citation
DOI: https://doi.org/10.1007/978-981-99-9955-2_63
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9954-5
Online ISBN: 978-981-99-9955-2
eBook Packages: EngineeringEngineering (R0)