An Adversarial Algorithm for Delegation

Afanador, Juan; Baptista, Murilo; Oren, Nir

doi:10.1007/978-3-030-17294-7_10

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11327)

Included in the following conference series:

International Conference on Agreement Technologies

Abstract

Task delegation lies at the heart of the service economy and is a fundamental aspect of many agent marketplaces. Research in computational trust considers which agent a task should be delegated to for execution, given the agent’s past behaviour. However, such work does not consider the effects of that agent delegating the task onwards, forming a chain of delegations before the task is finally executed (as occurs in many human outsourcing scenarios). In this paper we consider such delegation chains and empirically demonstrate that existing trust-based approaches do not handle these situations well. We then introduce a new algorithm based on quitting games to cater for recursive delegation.

Author information

Authors and Affiliations

University of Aberdeen, Aberdeen, AB24 3UE, Scotland
Juan Afanador, Murilo Baptista & Nir Oren

Corresponding author

Correspondence to Juan Afanador.

Editor information

Editors and Affiliations

IMT Lille Douai, Douai, France
Marin Lujak

About this paper

Cite this paper

Afanador, J., Baptista, M., Oren, N. (2019). An Adversarial Algorithm for Delegation. In: Lujak, M. (eds) Agreement Technologies. AT 2018. Lecture Notes in Computer Science, vol 11327. Springer, Cham. https://doi.org/10.1007/978-3-030-17294-7_10

DOI: https://doi.org/10.1007/978-3-030-17294-7_10
Published: 04 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17293-0
Online ISBN: 978-3-030-17294-7
eBook Packages: Computer Science, Computer Science (R0)
