On the optimality of the Gittins index rule for multi-armed bandits with multiple plays

Pandelis, Dimitrios G.; Teneketzis, Demosthenis

doi:10.1007/s001860050080

On the optimality of the Gittins index rule for multi-armed bandits with multiple plays

Published: December 1999

Volume 50, pages 449–461, (1999)
Cite this article

Mathematical Methods of Operations Research Aims and scope Submit manuscript

Dimitrios G. Pandelis¹ &
Demosthenis Teneketzis²

265 Accesses
19 Citations
Explore all metrics

Abstract.

We investigate the general multi-armed bandit problem with multiple servers. We determine a condition on the reward processes sufficient to guarantee the optimality of the strategy that operates at each instant of time the projects with the highest Gittins indices. We call this strategy the Gittins index rule for multi-armed bandits with multiple plays, or briefly the Gittins index rule. We show by examples that: (i) the aforementioned sufficient condition is not necessary for the optimality of the Gittins index rule; and (ii) when the sufficient condition is relaxed the Gittins index rule is not necessarily optimal. Finally, we present an application of the general results to the multiserver scheduling of parallel queues without arrivals.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Author information

Authors and Affiliations

ERIM International, Inc., P.O. Box 134001, Ann Arbor, MI 48113-4001, USA (e-mail: pandelis@erim-int.com), , , , , , US
Dimitrios G. Pandelis
Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109-2122, USA (e-mail:teneket@eecs.umich.edu), , , , , , US
Demosthenis Teneketzis

Authors

Dimitrios G. Pandelis
View author publications
You can also search for this author in PubMed Google Scholar
Demosthenis Teneketzis
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Manuscript received: March 1999/final version received: July 1999

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pandelis, D., Teneketzis, D. On the optimality of the Gittins index rule for multi-armed bandits with multiple plays. Mathematical Methods of OR 50, 449–461 (1999). https://doi.org/10.1007/s001860050080

Download citation

Issue Date: December 1999
DOI: https://doi.org/10.1007/s001860050080

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the optimality of the Gittins index rule for multi-armed bandits with multiple plays

Abstract.

Access this article

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation