Multi-agent Reinforcement Learning for Decentralized Stable Matching

Taywade, Kshitija; Goldsmith, Judy; Harrison, Brent

doi:10.1007/978-3-030-87756-9_24

Kshitija Taywade¹⁰,
Judy Goldsmith¹⁰ &
Brent Harrison¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13023))

Included in the following conference series:

International Conference on Algorithmic Decision Theory

716 Accesses
1 Citations

Abstract

In the real world, people/entities usually find matches independently and autonomously, such as finding jobs, partners, roommates, etc. It is possible that this search for matches starts with no initial knowledge of the environment. We propose the use of a multi-agent reinforcement learning (MARL) paradigm for a spatially formulated decentralized two-sided matching market with independent and autonomous agents. Having autonomous agents acting independently makes our environment very dynamic and uncertain. Moreover, agents lack the knowledge of preferences of other agents and have to explore the environment and interact with other agents to discover their own preferences through noisy rewards. We think such a setting better approximates the real world and we study the usefulness of our MARL approach for it. Along with conventional stable matching case where agents have strictly ordered preferences, we check the applicability of our approach for stable matching with incomplete lists and ties. We investigate our results for stability, level of instability (for unstable results), and fairness. Our MARL approach mostly yields stable and fair outcomes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bachrach, Y., et al.: Negotiating team formation using deep reinforcement learning (2018)
Google Scholar
Chalkiadakis, G., Boutilier, C.: Bayesian reinforcement learning for coalition formation under uncertainty. In: Proceeding of AAMAS 2004, pp. 1090–1097 (2004)
Google Scholar
Comola, M., Fafchamps, M.: An experimental study on decentralized networked markets. J. Econ. Behav. Organ. 145, 567–591 (2018)
Article Google Scholar
Diamantoudi, E., Miyagawa, E., Xue, L.: Decentralized matching: the role of commitment. Games Econ. Behav. 92, 1–17 (2015)
Article MathSciNet Google Scholar
Echenique, F., Yariv, L.: An experimental study of decentralized matching (2012)
Google Scholar
Eriksson, K., Häggström, O.: Instability of matchings in decentralized markets with various preference structures. Int. J. Game Theor. 36(3–4), 409–420 (2008)
Article MathSciNet Google Scholar
Gale, D., Shapley, L.S.: College admissions and the stability of marriage. Am. Math. Monthly 69(1), 9–15 (1962)
Article MathSciNet Google Scholar
Gusfield, D.: Three fast algorithms for four problems in stable marriage. SIAM J. Comput. 16(1), 111–128 (1987)
Article MathSciNet Google Scholar
Gusfield, D., Irving, R.W.: The Stable Marriage Problem: Structure and Algorithms. MIT Press, Cambridge (1989)
Google Scholar
Haeringer, G., Wooders, M.: Decentralized job matching. Int. J. Game Theor. 40(1), 1–28 (2011)
Article MathSciNet Google Scholar
Hoepman, J.H.: Simple distributed weighted matchings. arXiv cs/0410047 (2004)
Google Scholar
Irving, R.W.: Stable marriage and indifference. Discrete Appl. Math. 48(3), 261–272 (1994)
Article MathSciNet Google Scholar
Irving, R.W., Leather, P., Gusfield, D.: An efficient algorithm for the optimal stable marriage. J. ACM (JACM) 34(3), 532–543 (1987)
Article MathSciNet Google Scholar
Iwama, K., Miyazaki, S.: A survey of the stable marriage problem and its variants. In: International Conference on Informatics Education and Research for Knowledge-Circulating Society, pp. 131–136. IEEE Computer Society (January 2008)
Google Scholar
Khan, A., et al.: Efficient approximation algorithms for weighted b-matching. SIAM J. Sci. Comput. 38(5), S593–S619 (2016)
Article MathSciNet Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Machine Learning Proceedings 1994, pp. 157–163. Elsevier (1994)
Google Scholar
Matthews, T., Ramchurn, S.D., Chalkiadakis, G.: Competing with humans at fantasy football: Team formation in large partially-observable domains. In: Twenty-Sixth AAAI Conference on Artificial Intelligence, aaai.org (2012)
Google Scholar
Niederle, M., Roth, A.E.: Making markets thick: How norms governing exploding offers affect market performance. preprint (2006)
Google Scholar
Niederle, M., Yariv, L.: Matching through decentralized markets. Stanford University, Discussion Paper (2007)
Google Scholar
Niederle, M., Yariv, L.: Decentralized matching with aligned preferences. Technical report, National Bureau of Economic Research (2009)
Google Scholar
Pais, J., Pintér, A., Veszteg, R.F.: Decentralized matching markets: a laboratory experiment (2012)
Google Scholar
Pais, J., Pintér, Á., Veszteg, R.F.: Decentralized matching markets with (out) frictions: a laboratory experiment. Exp. Econ. 1–28 (2017)
Google Scholar
Pini, M.S., Rossi, F., Venable, K.B., Walsh, T.: Stability and optimality in matching problems with weighted preferences. In: Filipe, J., Fred, A. (eds.) ICAART 2011. CCIS, vol. 271, pp. 319–333. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-29966-7_21
Chapter Google Scholar
Preis, R.: Linear time 1/2-approximation algorithm for maximum weighted matching in general graphs. In: Meinel, C., Tison, S. (eds.) STACS 1999. LNCS, vol. 1563, pp. 259–269. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-49116-3_24
Chapter Google Scholar
Roth, A.E.: A natural experiment in the organization of entry-level labor markets: regional markets for new physicians and surgeons in the United Kingdom. Am. Econ. Rev. 415–440 (1991)
Google Scholar
Roth, A.E., Xing, X.: Turnaround time and bottlenecks in market clearing: decentralized matching in the market for clinical psychologists. J. Political Econ. 105(2), 284–329 (1997)
Article Google Scholar
Rummery, G.A., Niranjan, M.: On-Line Q-Learning Using Connectionist Systems, vol. 37. University of Cambridge, Department of Engineering England (1994)
Google Scholar
Satterthwaite, M., Shneyerov, A.: Dynamic matching, two-sided incomplete information, and participation costs: existence and convergence to perfect competition. Econometrica 75(1), 155–200 (2007)
Article MathSciNet Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
Google Scholar
Ünver, M.U.: On the survival of some unstable two-sided matching mechanisms. Int. J. Game Theor. 33(2), 239–254 (2005)
Article MathSciNet Google Scholar
Viet, H.H., Trang, L.H., Lee, S., Chung, T.: A bidirectional local search for the stable marriage problem. In: 2016 International Conference on Advanced Computing and Applications (ACOMP), pp. 18–24. ieeexplore.ieee.org (November 2016)
Google Scholar
Wattenhofer, M., Wattenhofer, R.: Distributed weighted matching. In: Guerraoui, R. (ed.) DISC 2004. LNCS, vol. 3274, pp. 335–348. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-30186-8_24
Chapter MATH Google Scholar
Zhao, D., Wang, H., Shao, K., Zhu, Y.: Deep reinforcement learning with experience replay based on SARSA. In: 2016 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 1–6. IEEE (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Kentucky, Lexington, KY, USA
Kshitija Taywade, Judy Goldsmith & Brent Harrison

Authors

Kshitija Taywade
View author publications
You can also search for this author in PubMed Google Scholar
Judy Goldsmith
View author publications
You can also search for this author in PubMed Google Scholar
Brent Harrison
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kshitija Taywade .

Editor information

Editors and Affiliations

National Technical University of Athens, Athens, Greece
Dimitris Fotakis
Consejo Superior de Investigaciones Cientificas, Madrid, Madrid, Spain
David Ríos Insua

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Taywade, K., Goldsmith, J., Harrison, B. (2021). Multi-agent Reinforcement Learning for Decentralized Stable Matching. In: Fotakis, D., Ríos Insua, D. (eds) Algorithmic Decision Theory. ADT 2021. Lecture Notes in Computer Science(), vol 13023. Springer, Cham. https://doi.org/10.1007/978-3-030-87756-9_24

Download citation

DOI: https://doi.org/10.1007/978-3-030-87756-9_24
Published: 27 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87755-2
Online ISBN: 978-3-030-87756-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Multi-agent Reinforcement Learning for Decentralized Stable Matching