An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor

Xu, Siying; Zhang, Gaoyu; Yuan, Xianzhi

doi:10.1007/s10614-023-10524-x

An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor

Published: 28 December 2023

(2023)
Cite this article

Computational Economics Aims and scope Submit manuscript

Siying Xu^1,2,
Gaoyu Zhang² &
Xianzhi Yuan³

122 Accesses
Explore all metrics

Abstract

In recent years, the study of enterprise survival development and cooperation in the whole economic market has been rapidly developed. However, in most literature studies, the traditional enterprise multi-agent cannot effectively simulate the process of enterprise survival and development since the fundamental characteristics used to describe enterprises in social networks, such as the values of enterprise multi-agent attributes, cannot be changed in process of the simulation. To address this problem, an enterprise multi-agent model based on game Q- learning to simulate enterprise decision making which aims to maximize the benefits of enterprises and optimize the effect of inter-firm cooperation is proposed in this article. The Firm Q Learning algorithm is used to dynamically change the attribute values of the enterprise multi-agent to optimize the game results in the evolutionary game model and thus effectively simulate the dynamic cooperation among the enterprise agents. The simulation result shows that the evolution of the enterprise multi-agent model based on game Q-learning can more realistically reflect the process of real enterprise survival and development than the multi-agent simulation with fixed attribute values.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Monte Carlo Tree Search: a review of recent modifications and applications

Article Open access 19 July 2022

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Game-theoretic multi-agent motion planning in a mixed environment

Article 15 March 2024

References

Ahmedova, S. (2015). Factors for increasing the competitiveness of small and medium- sized enterprises (SMEs) in Bulgaria. Procedia - Social and Behavioral Sciences, 195, 1104–1112. https://doi.org/10.1016/j.sbspro.2015.06.155
Article Google Scholar
Cappiello, G., Giordani, F., & Visentin, M. (2020). Social capital and its effect on networked firm innovation and competitiveness. Industrial Marketing Management, 89, 422–430. https://doi.org/10.1016/j.indmarman.2020.03.007
Article Google Scholar
Chang, F., Zhou, G., Cheng, W., Zhang, C., & Tian, C. (2019). A service-oriented multi-player maintenance grouping strategy for complex multi-component system based on game theory. Advanced Engineering Informatics, 42, 100970. https://doi.org/10.1016/j.aei.2019.100970
Article Google Scholar
Chen, M.-H., Wei, H., Wei, M., Huang, H., & Ching-Hui Joan, Su. (2021). Modeling a green supply chain in the hotel industry: An evolutionary game theory approach. International Journal of Hospitality Management, 92, 102716. https://doi.org/10.1016/j.ijhm.2020.102716
Article Google Scholar
Clifton, J., & Laber, E. (2020). Q-learning: Theory and applications. In Reid, N., Stigler, S. M. & Louis, T. A. (Eds.), Annual Review of Statistics and Its Application, Vol 7, 2020 (Vol. 7, pp. 279–301). https://doi.org/10.1146/annurev-statistics-031219-041220.
Coninx, K., Deconinck, G., & Holvoet, T. (2018). Who gets my flex? An evolutionary game theory analysis of flexibility market dynamics. Applied Energy, 218, 104–113. https://doi.org/10.1016/j.apenergy.2018.02.098
Article Google Scholar
Ding, H., Guo, B., & Liu, Z. (2011). Information sharing and profit allotment based on supply chain cooperation. International Journal of Production Economics, 133(1), 70–79. https://doi.org/10.1016/j.ijpe.2010.06.015
Article Google Scholar
Escorcia-Caballero, J. P., Henríquez, L. M., & Chams-Anturi, O. (2022). The effect of internal and external sources on product innovation: Colombian context. Procedia Computer Science, 203, 486–490. https://doi.org/10.1016/j.procs.2022.07.067
Article Google Scholar
Friedman, D., Sinervo, B. (2016). Evolutionary games in natural, social, and virtual worlds, Oxford University Press
Golub, B., & Jackson, M. O. (2012). How homophily affects the speed of learning and best-response dynamics. Quarterly Journal of Economics, 127(3), 1287–1338.
Article Google Scholar
Grimm, V., Berger, U., Bastiansen, F., Eliassen, S., Ginot, V., Giske, J., Goss-Custard, J., Grand, T., Heinz, S. K., Huse, G., Huth, A., Jepsen, J. U., Jørgensen, C., Mooij, W. M., Müller, B., Pe’er, G., Piou, C., Railsback, S. F., Robbins, A. M., & DeAngelis, D. L. (2006). A standard protocol for describing individual-based and agent-based models. Ecological Modelling, 198(1–2), 115–126. https://doi.org/10.1016/j.ecolmodel.2006.04.023
Article Google Scholar
Grimm, V. (2020). The ODD protocol: An update with guidance to support wider and more consistent use. Ecological Modelling, 428, 109105. https://doi.org/10.1016/j.ecolmodel.2020.109105
Article Google Scholar
Grimm, V., Berger, U., Deangelis, D. L., Polhill, J. G., Giske, J., & Railsback, S. F. (2010). The ODD protocol: A review and first update. Ecological Modelling, 221(23), 2760–2768. https://doi.org/10.1016/j.ecolmodel.2010.08.019
Article Google Scholar
Han, L., Zhou, M., Jia, W., Dalil, Z., & Xingbo, Xu. (2019). Intrusion detection model of wireless sensor networks based on game theory and an autoregressive model. Information Sciences, 476, 491–504. https://doi.org/10.1016/j.ins.2018.06.017
Article Google Scholar
He, Yi., Wang, H., Guo, Q., & Qingyun, Xu. (2019). Coordination through cooperative advertising in a two-period consumer electronics supply chain. Journal of Retailing and Consumer Services, 50, 179–188. https://doi.org/10.1016/j.jretconser.2019.05.010
Article Google Scholar
Hsiao, Y.-C., & Hsu, Z.-X. (2018). Firm-specific advantages-product innovation capability complementarities and innovation success: A core competency approach. Technology in Society, 55, 78–84. https://doi.org/10.1016/j.techsoc.2018.06.009
Article Google Scholar
Ingenbleek, P. T. M., Frambach, R. T., & Verhallen, T. M. M. (2013). Best practices for new product pricing: Impact on market performance and price level under different conditions. Journal of Product Innovation Management, 30(3), 560–573. https://doi.org/10.1111/jpim.12008
Article Google Scholar
Ji, A., Xiaolong Xue, Q. P., Ha, X. L., & Zhang, M. (2021). Game theory–based bilevel model for multiplayer pavement maintenance management. Automation in Construction, 129, 103763. https://doi.org/10.1016/j.autcon.2021.103763
Article Google Scholar
Kaya, O. (2022). Determinants and consequences of SME insolvency risk during the pandemic. Economic Modelling, 115, 105958. https://doi.org/10.1016/j.econmod.2022.105958
Article Google Scholar
Kuncoro, W., & Suriani, W. O. (2018). Achieving sustainable competitive advantage through product innovation and market driving. Asia Pacific Management Review, 23(3), 186–192. https://doi.org/10.1016/j.apmrv.2017.07.006
Article Google Scholar
Le, T. T., & Ikram, M. (2022). Do sustainability innovation and firm competitiveness help improve firm performance? Evidence from the SME sector in vietnam. Sustainable Production and Consumption, 29, 588–599. https://doi.org/10.1016/j.spc.2021.11.008
Article Google Scholar
Li, G., Wang, X., Shibin, Su., & Yuan, Su. (2019). How green technological innovation ability influences enterprise competitiveness. Technology in Society, 59, 101136. https://doi.org/10.1016/j.techsoc.2019.04.012
Article Google Scholar
Li, H., Chai, J., Qian, Z., & Chen, H. (2022). Cooperation strategies when leading firms compete with small and medium-sized enterprises in a potentially competitive market. Journal of Management Science and Engineering, 7(3), 489–509. https://doi.org/10.1016/j.jmse.2022.02.003
Article Google Scholar
Liu, H., Yang, N., Yang, Z., Lin, J., & Zhang, Y. (2020). The impact of firm heterogeneity and awareness in modeling risk propagation on multiplex networks. Physica a: Statistical Mechanics and Its Applications, 539, 122919. https://doi.org/10.1016/j.physa.2019.122919
Article Google Scholar
Liu, J.-C., Sheu, J.-B., Li, D.-F., & Dai, Y.-W. (2021a). Collaborative profit allocation schemes for logistics enterprise coalitions with incomplete information. Omega, 101, 102237. https://doi.org/10.1016/j.omega.2020.102237
Article Google Scholar
Liu, K., Liu, Y., Zhou, H., Kou, Y., Ji, Q., & Li, D. (2021b). Evolutionary game and numerical simulation of participants’ collaborative behavior in integrated project delivery project. Alexandria Engineering Journal, 60(1), 373–385. https://doi.org/10.1016/j.aej.2020.08.038
Article Google Scholar
Mahdiraji, H. A., Hatami-Marbini, A., Moazed, N. M., Ansari, M., & Kamardi, A. A. A. (2021). Differential game approach to pricing and advertising decisions. Operations Research Letters, 49(5), 688–695. https://doi.org/10.1016/j.orl.2021.07.002
Article Google Scholar
Marino, M., Parrotta, P., & Pozzoli, D. (2016). Educational diversity and knowledge transfers via inter-firm labor mobility. Journal of Economic Behavior & Organization, 123, 168–183. https://doi.org/10.1016/j.jebo.2015.10.019
Article Google Scholar
Mondal, C., & Giri, B. C. (2020). Pricing and used product collection strategies in a two-period closed-loop supply chain under greening level and effort dependent demand. Journal of Cleaner Production, 265, 121335. https://doi.org/10.1016/j.jclepro.2020.121335
Article Google Scholar
Monghasemi, S., Nikoo, M. R., & Adamowski, J. (2016). Sequential ordering of crane service requests considering the pending times of the requests: An approach based on game theory and optimization techniques. Automation in Construction, 70, 62–76. https://doi.org/10.1016/j.autcon.2016.06.006
Article Google Scholar
Ndubisi, N. O., Zhai, X. A., & Lai, K. H. (2021). Small and medium manufacturing enterprises and Asia’s sustainable economic development. International Journal of Production Economics, 233, 107971. https://doi.org/10.1016/j.ijpe.2020.107971
Article Google Scholar
Nojavan, S., Zare, K., & Mohammadi-Ivatloo, B. (2017). Optimal stochastic energy management of retailer based on selling price determination under smart grid environment in the presence of demand response program. Applied Energy, 187, 449–464. https://doi.org/10.1016/j.apenergy.2016.11.024
Article Google Scholar
Nowak, M. A., & Sigmund, K. (2004). Evolutionary dynamics of biological games. Science, 303(5659), 793–799.
Article Google Scholar
Panetto, H., Iung, B., Ivanov, D., Weichhart, G., & Wang, X. (2019). Challenges for the cyber-physical manufacturing enterprises of the future. Annual Reviews in Control, 47, 200–213. https://doi.org/10.1016/j.arcontrol.2019.02.002
Article Google Scholar
Prashar, A., & Vijaya Sunder, M. (2020). A bibliometric and content analysis of sustainable development in small and medium-sized enterprises. Journal of Cleaner Production, 245, 118665. https://doi.org/10.1016/j.jclepro.2019.118665
Article Google Scholar
Rana, R., & Oliveira, F. S. (2015). Dynamic pricing policies for interdependent perishable products or services using reinforcement learning. Expert Systems with Applications, 42(1), 426–436. https://doi.org/10.1016/j.eswa.2014.07.007
Article Google Scholar
Schneckenreither, M., Haeussler, S., & Peiró, J. (2022). Average reward adjusted deep reinforcement learning for order release planning in manufacturing. Knowledge-Based Systems, 247, 108765. https://doi.org/10.1016/j.knosys.2022.108765
Article Google Scholar
Sutton R. S., Barto A. G. (2018). Reinforcement learning: An introduction, MIT press.
Urieli, D., & Stone, P. (2016), An MDP-Based Winning Approach to Autonomous Power Trading: Formalization and Empirical Analysis Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, Singapore, Singapore.
Vernadat, F. (2020). Enterprise modelling: Research review and outlook. Computers in Industry, 122, 103265. https://doi.org/10.1016/j.compind.2020.103265
Article Google Scholar
Wei, S., Zhang, Z., Ke, G. Y., & Chen, X. (2019). The more cooperation, the better? optimizing enterprise cooperative strategy in collaborative innovation networks. Physica a: Statistical Mechanics and Its Applications, 534, 120810. https://doi.org/10.1016/j.physa.2019.04.046
Article Google Scholar
Yan, Xu., Bin, Hu., Jiang, Wu., & Zhang, J. (2014). Nonlinear analysis of the cooperation of strategic alliances through stochastic catastrophe theory. Physica a: Statistical Mechanics and Its Applications, 400, 100–108. https://doi.org/10.1016/j.physa.2014.01.004
Article Google Scholar
Yan, Y., Chow, A. H. F., Ho, C. P., Kuo, Y.-H., Wu, Q., & Ying, C. (2022). Reinforcement learning for logistics and supply chain management: Methodologies, state of the art, and future opportunities. Transportation Research Part E: Logistics and Transportation Review, 162, 102712. https://doi.org/10.1016/j.tre.2022.102712
Article Google Scholar
Yang, R., Tang, W., & Zhang, J. (2021). Technology improvement strategy for green products under competition: The role of government subsidy. European Journal of Operational Research, 289(2), 553–568. https://doi.org/10.1016/j.ejor.2020.07.030
Article Google Scholar
Zhang, H., Akuamoah, S. W., & Georgescu, P. (2020a). A flow model of corporate activities with quality assurance. Physica a: Statistical Mechanics and Its Applications, 549, 124447. https://doi.org/10.1016/j.physa.2020.124447
Article Google Scholar
Zhang, Y., Chen, W., & Mi, Y. (2020b). Third-party remanufacturing mode selection for competitive closed-loop supply chain based on evolutionary game theory. Journal of Cleaner Production, 263, 121305. https://doi.org/10.1016/j.jclepro.2020.121305
Article Google Scholar
Zhang, Y., Yang, Q., Li, D., & An, D. (2022). A reinforcement and imitation learning method for pricing strategy of electricity retailer with customers’ flexibility. Applied Energy, 323, 119543. https://doi.org/10.1016/j.apenergy.2022.119543
Article Google Scholar
Zheng, Z. L., Gao, X., & Ruan, X. L. (2019). Does economic financialization lead to the alienation of enterprise investment behavior? Evidence from China, Physica a: Statistical Mechanics and Its Applications, 536, 120858–120916. https://doi.org/10.1016/j.physa.2019.04.094
Article Google Scholar

Download references

Acknowledgements

This research is supported by the National Natural Science Foundation of China with Nos. U1811462 and 71971031.

Funding

This research is supported by the National Natural Science Foundation of China with Nos. U1811462 and 71971031.

Author information

Authors and Affiliations

Shanghai University of Finance and Economics, Shanghai, 200433, People’s Republic of China
Siying Xu
Shanghai Lixin University of Accounting and Finance, Shanghai, 201209, People’s Republic of China
Siying Xu & Gaoyu Zhang
Chengdu University, Chengdu, 610106, People’s Republic of China
Xianzhi Yuan

Authors

Siying Xu
View author publications
You can also search for this author in PubMed Google Scholar
Gaoyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xianzhi Yuan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

SX Conceptualization, Methodology, Resources, Software, Writing. GZ Conceptualization, Methodology, Software, Investigation, Resources, Writing, Supervision. XY Conceptualization, Methodology, Writing, Supervision.

Corresponding author

Correspondence to Gaoyu Zhang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with animals performed by any of the authors. Informed consent was obtained from all individual participants included in the study.

Informed consent

For all the above contents and statements, all authors in this manuscript have informed consent.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Xu, S., Zhang, G. & Yuan, X. An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor. Comput Econ (2023). https://doi.org/10.1007/s10614-023-10524-x

Download citation

Accepted: 28 November 2023
Published: 28 December 2023
DOI: https://doi.org/10.1007/s10614-023-10524-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor

Abstract

Access this article

Similar content being viewed by others

Monte Carlo Tree Search: a review of recent modifications and applications

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Game-theoretic multi-agent motion planning in a mixed environment

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Enterprise Multi-agent Model with Game Q-Learning Based on a Single Decision Factor

Abstract

Access this article

Similar content being viewed by others

Monte Carlo Tree Search: a review of recent modifications and applications

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Game-theoretic multi-agent motion planning in a mixed environment

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation