Measuring the price of anarchy in critical care unit interactions

Knight, Vincent; Komenda, Izabela; Griffiths, Jeff

doi:10.1057/s41274-016-0100-8

Open access
Published: 17 November 2016

Volume 68, pages 630–642, (2017)

Journal of the Operational Research Society


Vincent Knight¹,
Izabela Komenda² &
Jeff Griffiths¹


Abstract

Hospital throughput is often studied and optimised in isolation, ignoring the interactions between hospitals. In this paper, critical care unit (CCU) interaction is placed within a game theoretic framework. The methodology involves the use of a normal form game underpinned by a two-dimensional continuous Markov chain. A theorem is given that proves that a Nash Equilibrium exists in pure strategies for the games considered. In the United Kingdom, a variety of utilisation targets are often discussed: aiming to ensure that wards/hospitals operate at a given utilisation value. The effect of these target policies is investigated justifying their use to align the interests of individual hospitals and social welfare. In particular, we identify the lowest value of a utilisation target that aligns these.


1 Introduction

The effect of state dependent service rates in healthcare has been well studied in isolated hospitals. In Batt and Terwiesch (2012), an empirical study is undertaken and service rate slow down is identified. Furthermore, it is shown that modelling whilst ignoring these state dependent rates leads to errors. This effect is further identified in Chan et al (2014), Kc and Terwiesch (2012) where for example the negative effect on patient outcome is revealed. In Kim et al (2013), Shmueli et al (2003), this is expanded to consider a variety of admission policies in two CCUs; in particular the effect of rejection (due to too high occupancy) is measured. These policies are studied in the setting of a single hospital, and thus, from a game theoretic point of view correspond to rational behaviour (a good game theory reference text is Maschler et al (2013)). In practice, a simplification of rational strategies is managed by policy makers by setting utilisation targets that ensure that hospitals do not run at a level likely to have a high rejection rate (example of these can be seen in Bevan and Hood (2006), Kesavan (2013)).

The aim of this paper is to further investigate the effect of rational policies employed by hospitals. In particular, the goal is to place this in a game theoretic setting so as to identify whether or not target policies result in uncoordinated behaviour that is damaging for patients.

A critical care unit (CCU), also sometimes referred to as an Intensive Therapy Unit or Intensive Therapy Department, is a special ward that is found in most acute hospitals. It provides intensive care (treatment and monitoring) for people who are critically ill or are in an unstable condition. CCUs face major challenges: on average, 8% of patients are refused admission to a CCU because the Unit is full (Audit Commission, 1999). The CCU occupancy rates for some hospitals are reportedly very high (Mitchell and Grounds, 1995; Smith et al, 1995) and a shortage of beds has been identified throughout the United Kingdom.

Many previous researchers have developed queueing models to help manage bed capacities in hospitals: Cooper and Corcoran (1974), Dumas (1984), Gallivan and Utley (2011), Gorunescu et al (2002), Griffiths et al (2013), Harper and Shahani (2002). Also, a vast amount of literature has been devoted to the simulation of CCUs; Cahill and Render (1999), Costa et al (2003), Griffiths et al (2005), Kim et al (1999), Litvak et al (2008), Shahani et al (2008).

This paper describes part of a project undertaken with managers from the Aneurin Bevan University Health Board (ABUHB), which is an NHS Wales organisation in South Wales, that serves 21% of the total population of Wales (Board, 2014). Critical care is delivered on two sites, at the Nevill Hall hospital in Abergavenny and at the Royal Gwent hospital in Newport. For the remainder of this paper, the Nevill Hall hospital will be referred to as NH and the Royal Gwent hospital as RG.

The main proposition of the work requested by the ABUHB was to develop a mathematical model of bed occupancies at the CCUs at RG and NH. After an initial analysis of the data, behavioural aspects became apparent; for example, delaying patients' discharge if there was no pressure on CCU beds, or admitting fewer patients if bed occupancy levels were high. As a result of this, a state-dependent queueing model has been developed, which includes the dependency of admission rate on actual occupancy (Williams et al 2015). This state dependent model was applied to both NH and RG separately. It is however clear that the actions of one CCU impact on the other CCU, as diversion of patients from one CCU to the other sometimes occurs. A pictorial representation of the situation is given in Figure 1.

Most research applying game theory in healthcare has concentrated on emergency departments (EDs) and how to deal with diversions of patients and ambulances. In Hagtvedt and Ferguson (2009), cooperative strategies for hospitals are considered, in order to reduce occurrences when ambulances are turned away due to the ED being full. In Deo and Gurvich (2011), a queueing network model of two EDs is proposed to study the network effect of ambulance diversion. Each ED aims to minimise the expected waiting time of its patients (walk-ins and ambulances) and chooses its diversion threshold based on the number of patients at its location. Decentralised decision making in the network is modelled as a non-cooperative game.

Some other work that has not concentrated on EDs, but has been applied to healthcare includes: Knight and Harper (2013), where results concerning the congestion related implications of decisions made by patients when choosing between healthcare facilities were presented. In Howard (2002), a model of the accept/reject decision for transplant organs is developed.

In the wider intersection of game theory and queueing theory (where this work lies), papers that consider price and/or capacity include Allon and Federgruen (2007), Cachon and Harker (2002), Cachon and Zhang (2007), Kalai et al (1992), Levhari and Luski (1978). In these models, the choice of price/or capacity determines the arrival rate for each firm; this is similar to the approach taken in this paper.

The work presented in this paper contributes to the growing body of literature by applying state-dependent queueing models in a game theoretical context to CCU interaction. In particular, this consideration allows for the investigation of targets imposed by central control (Bevan and Hood, 2006). The findings of this work justify and identify a choice of targets that align the interests of the individual hospitals with social welfare.

The data used for the work presented in this paper were provided by the Intensive Care National Audit and Research Centre (ICNARC) and refer to patients admitted to CCUs in NH and RG, and cover a period of three years, from the 1st January 2009 till the 31st December 2011. The dataset contains information about a patient’s source of admission, date and time of admission, date and time of discharge, CCU outcome and delay to discharge. The parameters obtained from the data used in this work are shown in Table 1.

Table 1 Parameter values used in the model


Note that the inter-arrival and service rates are not state dependent. These will serve as a base level for the state dependent rates used throughout the game theoretic models.

The paper is organised as follows: Section 2 presents the general methodology as well as a theoretical existence condition for Nash Equilibrium. Section 3 presents the findings from two models. Finally, conclusions and further ideas for progression are given in Section 4.

2 Queueing and game theoretic models

Throughout this paper, it is assumed that both CCUs (NH and RG) act selfishly and rationally. The strategies of each CCU are capacity thresholds at which they declare being in “diversion.” When in “diversion” the arrival rates of patients are modified. Given the proximity of the two CCUs, one CCU could for example divert their patients to the other. Figure 2 shows a diagrammatic representation of this where $\lambda _H^{r}$ for $H\in \{\text {NH}, \text {RG}\}$ and $r\in \{(l,l), (l,h), (h,l), (h,h)\}$ simply denote arrival rates that will be defined for both models considered in Section 3, where r denotes regions with boundaries defined by the capacity thresholds. For example (l, h) denotes a region for which $\text {NH}$ experiences low demand and $\text {RG}$ experiences high demand. It is also assumed that diverted patients will be treated under the length of stay profile of the CCU they are admitted to. The capacity thresholds are denoted as $K_{H}\in \mathbb {Z}$ for $H\in \{\text {NH},\text {RG}\}$. Note that $0\le K_H\le c_H$.

To formally investigate the impact of decentralised decision making, the interaction between two CCUs is placed within a non-cooperative game framework. The interaction will be modelled through a two dimensional continuous Markov chain that will now be described.

2.1 Queueing model

The state space for the Markov chain is given by (1).

$$\begin{aligned} S=S(c_{\text {NH}}, c_{\text {RG}})= \{(u,v)\in \mathbb {Z}^2\ |\ 0\le u\le c_{\text {NH}},\ 0\le v\le c_{\text {RG}}\} \end{aligned}$$

(1)

For given $K_{\text {NH}}, K_{\text {RG}}$ and using the notation of Figure 2, the generic Markov chain used to model the interactive queueing system in this paper is shown in Figure 3.

In total there are $(c_{\text {NH}}+1)\times (c_{\text {RG}}+1)$ states and they are indexed lexicographically: $(0,0), (0,1), (0,2), \dots$.
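The lexicographic ordering can be made concrete with a minimal sketch. The helper names `state_index` and `index_state` are illustrative only (they do not come from the paper), and the capacities are those of the example accompanying Figure 4.

```python
def state_index(u, v, c_rg):
    """Lexicographic position of state (u, v): v varies fastest."""
    return u * (c_rg + 1) + v


def index_state(i, c_rg):
    """Inverse of state_index: recover (u, v) from the position i."""
    return divmod(i, c_rg + 1)


c_nh, c_rg = 8, 16  # capacities of the example accompanying Figure 4
states = [(u, v) for u in range(c_nh + 1) for v in range(c_rg + 1)]
print(len(states), states[:3])
```

With these capacities there are $(8+1)\times (16+1)=153$ states, and the list starts with $(0,0), (0,1), (0,2)$.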

The stochastic transition rate matrix $Q=Q(c_{NH},c_{RG})$ of the continuous-time Markov chain (Stewart 2009) has entries $q_{ij}=q_{(u_i,v_i),(u_j,v_j)}$ which is the rate at which a transition from state i to state j occurs. The transition rates are given by (2) and are illustrated diagrammatically in Figure 3.

$$\begin{aligned} q_{(u_i,v_i),(u_j,v_j)}={\left\{ \begin{array}{ll} u_i\mu _{\text {NH}} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(1,0),\\ v_i\mu _{\text {RG}} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,1),\\ \lambda _{\text {NH}}^{(l,l)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i< K_{\text {NH}} \, \text { and } \, v_i< K_{\text {RG}},\\ \lambda _{\text {RG}}^{(l,l)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i< K_{\text {NH}} \, \text { and } \, v_i< K_{\text {RG}},\\ \lambda _{\text {NH}}^{(l,h)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i< K_{\text {NH}} \, \text { and } \, v_i \ge K_{\text {RG}},\\ \lambda _{\text {RG}}^{(l,h)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i< K_{\text {NH}} \, \text { and } \, v_i \ge K_{\text {RG}},\\ \lambda _{\text {NH}}^{(h,l)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i \ge K_{\text {NH}} \, \text { and } \, v_i< K_{\text {RG}},\\ \lambda _{\text {RG}}^{(h,l)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i \ge K_{\text {NH}} \, \text { and } \, v_i < K_{\text {RG}},\\ \lambda _{\text {NH}}^{(h,h)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i \ge K_{\text {NH}} \, \text { and } \, v_i \ge K_{\text {RG}},\\ \lambda _{\text {RG}}^{(h,h)} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i \ge K_{\text {NH}} \, \text { and } \, v_i \ge K_{\text {RG}},\\ 0 &{} \text { otherwise}. \end{array}\right. } \end{aligned}$$

(2)
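As a rough illustration of (2), the following sketch assembles the rate matrix as a plain list of rows. Several things here are assumptions rather than details from the paper: the function names, the dictionary keyed by region, the standard convention that each diagonal entry is minus the sum of the other entries in its row (the diagonal is not listed in (2)), and the service rates `mu_nh=0.25` and `mu_rg=0.2`, which are placeholders because the values of Table 1 are not reproduced in this text.

```python
def region(u, v, k_nh, k_rg):
    """Region label ('l' or 'h' for NH, then for RG) of state (u, v)."""
    return ("l" if u < k_nh else "h", "l" if v < k_rg else "h")


def build_generator(c_nh, c_rg, k_nh, k_rg, lam_nh, lam_rg, mu_nh, mu_rg):
    """Return Q as a list of rows, with states ordered lexicographically.

    lam_nh and lam_rg map a region such as ('l', 'h') to an arrival rate.
    """
    n_cols = c_rg + 1
    size = (c_nh + 1) * n_cols
    Q = [[0.0] * size for _ in range(size)]
    for u in range(c_nh + 1):
        for v in range(c_rg + 1):
            i = u * n_cols + v
            r = region(u, v, k_nh, k_rg)
            if u > 0:              # discharge at NH: (u, v) -> (u - 1, v)
                Q[i][i - n_cols] = u * mu_nh
            if v > 0:              # discharge at RG: (u, v) -> (u, v - 1)
                Q[i][i - 1] = v * mu_rg
            if u < c_nh:           # admission at NH: (u, v) -> (u + 1, v)
                Q[i][i + n_cols] = lam_nh[r]
            if v < c_rg:           # admission at RG: (u, v) -> (u, v + 1)
                Q[i][i + 1] = lam_rg[r]
            Q[i][i] = -sum(Q[i])   # assumed convention: rows sum to zero
    return Q


# Arrival rates of the example accompanying Figure 4; the service rates
# are hypothetical placeholders, not values from the paper.
lam_nh = {("l", "l"): 1.5, ("l", "h"): 3.74, ("h", "l"): 0.0, ("h", "h"): 0.0}
lam_rg = {("l", "l"): 2.24, ("l", "h"): 0.0, ("h", "l"): 3.74, ("h", "h"): 0.0}
Q = build_generator(8, 16, 6, 12, lam_nh, lam_rg, mu_nh=0.25, mu_rg=0.2)
```

Admissions out of a full unit are skipped because the target state would lie outside the state space $S$.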

Utilities are needed once this queueing model is embedded in the game theoretic model. Throughput of patients is a natural choice of utility given that most hospitals are financially rewarded per served patient (Pate, 2009). For each threshold pair $(K_{\text {NH}},K_{\text {RG}})$, the utilisation rate $U_H$ and throughput $T_H$ can easily be obtained for each CCU, $H\in \{\text {NH},\text {RG}\}$, using the following formulas:

$$\begin{aligned} U_{H}={{\sum _{n=0}^{c_H} nP^{(H)}(n)}\over {c_{H}}} \end{aligned}$$

$$\begin{aligned} T_{H}=\mu _H \sum _{n=0}^{c_H} nP^{(H)}(n) \end{aligned}$$

where $P^{(H)}=P^{(H)}(K_{\text {NH}},K_{\text {RG}})$ is the steady-state probability distribution function (obtained from the corresponding transition matrix $Q=Q(K_{\text {NH}},K_{\text {RG}})$) for $H\in \{\text {NH},\text {RG}\}$.
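The two formulas can be sketched in a few lines once a joint steady-state distribution over the states $(u,v)$ is available. The function names and the tiny uniform distribution below are purely illustrative assumptions; they are not results from the paper.

```python
def marginals(pi, c_nh, c_rg):
    """Marginal occupancy distributions P^(NH)(n) and P^(RG)(n)."""
    p_nh = [sum(pi[(u, v)] for v in range(c_rg + 1)) for u in range(c_nh + 1)]
    p_rg = [sum(pi[(u, v)] for u in range(c_nh + 1)) for v in range(c_rg + 1)]
    return p_nh, p_rg


def utilisation(p, c):
    """U_H: mean number of occupied beds divided by the capacity c_H."""
    return sum(n * p_n for n, p_n in enumerate(p)) / c


def throughput(p, mu):
    """T_H: service rate mu_H times the mean number of occupied beds."""
    return mu * sum(n * p_n for n, p_n in enumerate(p))


# Hypothetical joint distribution: uniform over a 2 x 3 state space.
c_nh, c_rg = 1, 2
pi = {(u, v): 1 / 6 for u in range(c_nh + 1) for v in range(c_rg + 1)}
p_nh, p_rg = marginals(pi, c_nh, c_rg)
print(utilisation(p_nh, c_nh), utilisation(p_rg, c_rg), throughput(p_rg, 0.2))
```

Note that the two quantities are tied together by $T_H=\mu _H c_H U_H$, so for fixed capacity and service rate a higher utilisation always means a higher throughput.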

For $c_{\text {NH}}=8$, $c_{\text {RG}}=16$, $\lambda _{\text {NH}}=(\lambda _{\text {NH}}^{(l,l)},\lambda _{\text {NH}}^{(l,h)},\lambda _{\text {NH}}^{(h,l)},\lambda _{\text {NH}}^{(h,h)})=(1.5,3.74,0,0)$, $\lambda _{\text {RG}}=(\lambda _{\text {RG}}^{(l,l)},\lambda _{\text {RG}}^{(l,h)},\lambda _{\text {RG}}^{(h,l)},\lambda _{\text {RG}}^{(h,h)})=(2.24,0,3.74,0)$ and $(K_{\text {NH}},K_{\text {RG}})=(6,12)$, the steady-state probabilities for each CCU are given in Figure 4.

For the parameters of Figure 4, the utilisation rates are 59% at NH and 62% at RG, and the throughputs are 1.23 at NH and 1.98 at RG (patients per day).

For a different threshold pair of $(K_{\text {NH}},K_{\text {RG}})=(1,12)$ the steady-state probabilities are given in Figure 5. The utilisation rates are 11% at NH and 67% at RG and throughput of 0.23 at NH and 2.13 at RG. We see that the RG is now busier as a result of NH having a lower diversion threshold. A model of this interaction will be given in the next section.

2.2 Game theoretic model

Based on the discussion above, the game theoretic model is presented as the following synchronous optimisation problem.

Optimisation problem.

For all $H\in \{\text {NH}, \text {RG}\}$ minimise

$$\begin{aligned} \left( U_{H}-t\right) ^2 \end{aligned}$$

Subject to

$$\begin{aligned} 0\le K_H \le c_{H},\qquad K_H \in \mathbb {Z}. \end{aligned}$$

(3)

This game is equivalent to a bimatrix game with restriction to pure strategies where both players aim to get their utilisation as close as possible to a certain target $t$. The existence of a Nash Equilibrium in pure strategies is not guaranteed by traditional game theoretical results (Nash, 1950), which only guarantee the existence of equilibria in mixed strategies. Based on discussions with ABUHB, long-term threshold policies are a realistic consideration and so a pure strategy space is used.
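For a fixed threshold of the other player, each player's problem in (3) reduces to a search over a finite list. A minimal sketch, assuming the utilisations for each candidate threshold have already been computed (the function name and the numbers are hypothetical):

```python
def best_response(utilisations, t):
    """Lowest threshold k that minimises (U_H(k) - t)**2.

    utilisations[k] is the player's utilisation when it plays threshold k
    while the other player's threshold is held fixed.
    """
    costs = [(u - t) ** 2 for u in utilisations]
    return costs.index(min(costs))  # index() returns the first minimiser


# Hypothetical utilisations for thresholds k = 0, ..., 4.
utilisations = [0.0, 0.35, 0.55, 0.7, 0.78]
print(best_response(utilisations, 0.6), best_response(utilisations, 0.8))
```

Returning the first minimiser means that ties are resolved towards the lowest threshold.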

The following result is a sufficient condition for the existence of an equilibrium:

Theorem

Let $f_{H}(k):[1,c_{\bar{H}}]\rightarrow [1,c_H]$ be the best response of player $H\in \{\text {NH}, \text {RG}\}$ to the diversion threshold of $\bar{H}\ne H$ ($\bar{H}\in \{\text {NH}, \text {RG}\}$).

If $f_{H}(k)$ is a non-decreasing function in k then the game of (3) has at least one Nash Equilibrium.

Proof

The function $f_H$ is well defined as it minimises a function over a finite discrete set. In case of multiple values that minimise $(U_H-t)^2$, it is assumed that $f_H$ returns the lowest such value; this is consistent with the Price of Anarchy (PoA) being a theoretical upper bound of the effect of uncoordinated behaviour (Roughgarden, 2005).

As such if $f_H$ is non-decreasing, then it is in fact a stepwise non-decreasing function. If we consider $f_{\text {NH}}$ and $f_{\text {RG}}$ plotted on the same axis (so that the domain of $f_{\text {NH}}$ is the x-axis and the domain of $f_{\text {RG}}$ is the y-axis), it is clear that the functions must intersect at some point as shown in Figure 6.

This point of intersection corresponds to a Nash Equilibrium of (3). $\square$

This Theorem is in itself not that useful as the properties of $f_{H}$ are difficult to ascertain, although the methodology alluded to (exhaustive investigation of best response functions) is how the equilibria are found for the work presented here. The following Lemma will however be of more use in Section 3.
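The exhaustive investigation of best responses can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the utilisation tables are small hypothetical examples, chosen so that $U_{\text {NH}}$ increases with $K_{\text {NH}}$ and decreases with $K_{\text {RG}}$ (and symmetrically for RG), and the function name is invented for this sketch.

```python
def pure_nash_equilibria(u_nh, u_rg, t):
    """All threshold pairs (k_nh, k_rg) that are mutual best responses.

    u_nh[k_nh][k_rg] and u_rg[k_nh][k_rg] are utilisation tables.
    Ties are broken towards the lowest threshold.
    """
    n_nh, n_rg = len(u_nh), len(u_nh[0])

    def argmin_first(costs):
        return costs.index(min(costs))

    # f_nh[k_rg]: best response of NH to RG playing k_rg.
    f_nh = [argmin_first([(u_nh[k][k_rg] - t) ** 2 for k in range(n_nh)])
            for k_rg in range(n_rg)]
    # f_rg[k_nh]: best response of RG to NH playing k_nh.
    f_rg = [argmin_first([(u_rg[k_nh][k] - t) ** 2 for k in range(n_rg)])
            for k_nh in range(n_nh)]
    return [(k_nh, k_rg) for k_nh in range(n_nh) for k_rg in range(n_rg)
            if f_nh[k_rg] == k_nh and f_rg[k_nh] == k_rg]


# Hypothetical 3 x 3 utilisation tables (rows: k_nh, columns: k_rg).
u_nh = [[0.0, 0.0, 0.0], [0.5, 0.4, 0.3], [0.9, 0.7, 0.6]]
u_rg = [[0.0, 0.5, 0.9], [0.0, 0.4, 0.7], [0.0, 0.3, 0.6]]
print(pure_nash_equilibria(u_nh, u_rg, 0.8))
print(pure_nash_equilibria(u_nh, u_rg, 0.5))
```

In this toy example both best response functions are non-decreasing, and the second call returns two equilibria, which illustrates why a rule for choosing among multiple equilibria is needed.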

Lemma

Using the convention of Figure 2:

If $\lambda _{\text {NH}}^{(h,l)}\le \lambda _{\text {NH}}^{(h,h)},\lambda _{\text {NH}}^{(l,l)}\le \lambda _{\text {NH}}^{(l,h)}$ then $f_{\text {NH}}(k)$ is a non-decreasing function in k.
If $\lambda _{\text {RG}}^{(l,h)}\le \lambda _{\text {RG}}^{(h,h)},\lambda _{\text {RG}}^{(l,l)}\le \lambda _{\text {RG}}^{(h,l)}$ then $f_{\text {RG}}(k)$ is a non-decreasing function in k.

Observation

The utilisation $U_H=U_H(\lambda )$ is an increasing function in $\lambda$. As the traffic intensity at H increases: H gets busier.

Proof

A proof for the first part of the Lemma is given (the proof for the second part is analogous).

Let $\bar{\lambda }_{\text {NH}} = \bar{\lambda }_{\text {NH}}(K_{\text {RG}})$ be the effective arrival rate at NH. If $\lambda _{\text {NH}}^{(h,l)}\le \lambda _{\text {NH}}^{(h,h)},\lambda _{\text {NH}}^{(l,l)}\le \lambda _{\text {NH}}^{(l,h)}$ then this implies that $\bar{\lambda }_{\text {NH}}(K_{\text {RG}}) \ge \bar{\lambda }_{\text {NH}}(K_{\text {RG}} + 1)$ as shown in Figure 7.

Based on the previous observation this in turn implies: $U_{\text {NH}}(K_{\text {RG}}) = U_{\text {NH}}(\bar{\lambda }_{\text {NH}}(K_{\text {RG}})) \ge U_{\text {NH}}(\bar{\lambda }_{\text {NH}}(K_{\text {RG}}+1)) = U_{\text {NH}}(K_{\text {RG}}+1)$ giving:

$$\begin{aligned} U_{\text {NH}}(K_{\text {RG}}) \ge U_{\text {NH}}(K_{\text {RG}}+1) \end{aligned}$$

(4)

In the same way (illustrated by Figure 8), this gives: $U_{\text {NH}}(K_{\text {NH}}) = U_{\text {NH}}(\bar{\lambda }_{\text {NH}}(K_{\text {NH}})) \le U_{\text {NH}}(\bar{\lambda }_{\text {NH}}(K_{\text {NH}}+1)) = U_{\text {NH}}(K_{\text {NH}}+1)$ giving:

$$\begin{aligned} U_{\text {NH}}(K_{\text {NH}}) \le U_{\text {NH}}(K_{\text {NH}}+1) \end{aligned}$$

(5)

Using (4) and (5), the general inequalities associated with $U_{\text {NH}}(K_{\text {NH}}, K_{\text {RG}})$ are summarised in Figure 9.

As $U_{\text {NH}}$ increases from the lowest value (top left in Figure 9) to the highest (bottom right in Figure 9), the value of $U_{\text {NH}}-t$ will change sign (from negative to positive). Let

$$\begin{aligned} f^+(K_{\text {RG}})={{\mathrm{arg\!min}}}_{\{0\le k \le c_{\text {NH}}|U_{\text {NH}}(k, K_{\text {RG}})\ge t\}}(U_{\text {NH}}(k, K_{\text {RG}})) \end{aligned}$$

(6)

and

$$\begin{aligned} f^-(K_{\text {RG}})={{\mathrm{arg\!max}}}_{\{0\le k \le c_{\text {NH}}|U_{\text {NH}}(k, K_{\text {RG}})\le t\}}(U_{\text {NH}}(k, K_{\text {RG}})). \end{aligned}$$

(7)

Thus, $f^{\pm }({K_{\text {RG}}})$ is the value of $K_{\text {NH}}$ that gives the value of $U_{\text {NH}}(K_{\text {NH}}, K_{\text {RG}})$ that is closest to $t$ such that

$$\begin{aligned} U_{\text {NH}}(f^-(K_{\text {RG}}),K_{\text {RG}})\le t \le U_{\text {NH}}(f^+(K_{\text {RG}}),K_{\text {RG}}). \end{aligned}$$

(8)

This immediately (from Figure 9) gives

$$\begin{aligned} f^{\pm }(K_{\text {RG}}) \le f^{\pm }(K_{\text {RG}} + 1). \end{aligned}$$

(9)

Let

$$\begin{aligned} S(K_{\text {RG}}) = \{f^{-}(K_{\text {RG}})\}\cup \{f^{+}(K_{\text {RG}})\}. \end{aligned}$$

(10)

Thus, $f(K_{\text {RG}})\in S(K_{\text {RG}})$. Note that from (5) $\max S(K_{\text {RG}})=f^{+}(K_{\text {RG}})$ and $\min S(K_{\text {RG}})=f^{-}(K_{\text {RG}})$. In essence $S(K_{\text {RG}})$ corresponds to the set of two values of $K_{\text {NH}}$ that give a utilisation just below and just above $t$.

There are two possibilities that will now be considered:

1. $S(K_{\text {RG}})\ne S(K_{\text {RG}}+1)$
2. $S(K_{\text {RG}})= S(K_{\text {RG}}+1)$

If $S(K_{\text {RG}})\ne S(K_{\text {RG}}+1)$ this implies that $f^{+}(K_{\text {RG}})\le f^{-}(K_{\text {RG}}+1)$ which gives
$$\begin{aligned} f(K_{\text {RG}}) \le f^{+}(K_{\text {RG}}) \le f^{-}(K_{\text {RG}}+1) \le f(K_{\text {RG}}+1) \end{aligned}$$
(11)
which is the required result.

To finish the proof consider $S(K_{\text {RG}})=S(K_{\text {RG}}+1)$ which is equivalent to $f^{\pm }(K_{\text {RG}})=f^{\pm }(K_{\text {RG}}+1)$ and assume

$$\begin{aligned} f(K_{\text {RG}})=f^{+}(K_{\text {RG}}) \text { but } \, f(K_{\text {RG}}+1)=f^{-}(K_{\text {RG}}+1) \end{aligned}$$

(12)

which, because it has been assumed that $f^{-}(K_{\text {RG}}+1)=f^{-}(K_{\text {RG}})$, contradicts the required result (as this would imply that $f(K_{\text {RG}})> f(K_{\text {RG}}+1)$).

For simplicity of notation let

$$\begin{aligned} a^{\pm }=(f^{\pm }(K_{\text {RG}}), K_{\text {RG}})\,\, \text { and } \,\, b^{\pm }=(f^{\pm }(K_{\text {RG}}+1), K_{\text {RG}}+1)=(f^{\pm }(K_{\text {RG}}), K_{\text {RG}}+1). \end{aligned}$$

Recalling equations (4–5) and Figure 9 gives

$$\begin{aligned} U_{\text {NH}}(b^-)&\le U_{\text {NH}}(a^-)\end{aligned}$$

(13)

$$\begin{aligned} U_{\text {NH}}(b^+)&\le U_{\text {NH}}(a^+) \end{aligned}$$

(14)

which implies

$$\begin{aligned} U_{\text {NH}}(b^-) + U_{\text {NH}}(b^+) \le U_{\text {NH}}(a^-) + U_{\text {NH}}(a^+). \end{aligned}$$

(15)

As $f(K_{\text {RG}})=f^{+}(K_{\text {RG}})$ (this is the assumption made):

$$\begin{aligned} |U_{\text {NH}}(a^+)-t|<|U_{\text {NH}}(a^-)-t| \end{aligned}$$

(16)

which implies

$$\begin{aligned} U_{\text {NH}}(a^+)+U_{\text {NH}}(a^-)<2t. \end{aligned}$$

(17)
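To spell out the step from (16) to (17): by (8), $U_{\text {NH}}(a^-)\le t\le U_{\text {NH}}(a^+)$, so the absolute values can be removed as

$$\begin{aligned} |U_{\text {NH}}(a^+)-t|=U_{\text {NH}}(a^+)-t \quad \text { and } \quad |U_{\text {NH}}(a^-)-t|=t-U_{\text {NH}}(a^-). \end{aligned}$$

Inequality (16) then reads $U_{\text {NH}}(a^+)-t<t-U_{\text {NH}}(a^-)$, and moving $U_{\text {NH}}(a^-)$ and $t$ across gives (17).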

Similarly as $f(K_{\text {RG}}+1)=f^{-}(K_{\text {RG}}+1)$ (this is the assumption made):

$$\begin{aligned} |U_{\text {NH}}(b^+)-t|>|U_{\text {NH}}(b^-)-t| \end{aligned}$$

(18)

which implies

$$\begin{aligned} U_{\text {NH}}(b^+)+U_{\text {NH}}(b^-)>2t. \end{aligned}$$

(19)

Combining (17) and (19) contradicts (15) implying that the original assumption was incorrect thus proving the required result. $\square$

The aim of the work presented is to measure the inefficiency created by the removal of central control between CCUs.

We let $\widetilde{T}$ denote the sum of throughputs at the Nash Equilibrium obtained by solving the game of (3) (in case of multiple equilibria we take $\widetilde{T}$ to be the lowest throughput) and let $T^*=\max _{K_{\text {NH}}, K_{\text {RG}}}\left( T_{\text {NH}}+T_{\text {RG}}\right)$. This optimal throughput $T^*$ is independent of $t$.

Note that intuitively it could be thought that $T^*$ is obtained when $K_{\text {NH}}=c_{\text {NH}}$ and $K_{\text {RG}}=c_{\text {RG}};$ however, this is not always the case (numerical experiments have been carried out to verify this).

The measure used to quantify inefficiency is the Price of Anarchy (PoA) (Koutsoupias and Papadimitriou, 1999; Roughgarden, 2005), which is the ratio of the social optimum welfare to the welfare of the worst Nash Equilibrium. That is, the ratio of the largest social welfare, $T^*$ , to the smallest social welfare, $\widetilde{T}$, achieved at any Nash Equilibrium. Thus,

$$\begin{aligned} \text {PoA}={{T^*}\over {\widetilde{T}}}. \end{aligned}$$

Note that the classic definition of PoA has been modified here to allow for a maximisation problem. Social welfare is here considered to be a maximisation of throughput. An immediate alignment of interests can be obtained by setting $t=1$. This however would not be in the interest of the hospital (nor necessarily in the interests of patients) as it would imply aiming to run at 100% utilisation, which would imply a large quantity of patients being rejected. A sensible value of t is the lowest value of t that ensures a low PoA.
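The ratio can be sketched directly from its definition. The function names and the small throughput table below are hypothetical and serve only to illustrate the calculation; `relative_loss` is an added convenience that converts a PoA into the fraction of optimal throughput lost, $1-\widetilde{T}/T^*=1-1/\text {PoA}$.

```python
def price_of_anarchy(total_throughput, equilibria):
    """PoA = T* / T~ for a throughput maximisation problem.

    total_throughput maps (k_nh, k_rg) to T_NH + T_RG;
    equilibria lists the Nash Equilibrium threshold pairs.
    """
    t_star = max(total_throughput.values())                  # social optimum
    t_tilde = min(total_throughput[e] for e in equilibria)   # worst equilibrium
    return t_star / t_tilde


def relative_loss(poa):
    """Fraction of the optimal throughput lost at the worst equilibrium."""
    return 1 - 1 / poa


# Hypothetical total throughputs for a 2 x 2 strategy space.
total_throughput = {(0, 0): 1.0, (0, 1): 2.0, (1, 0): 2.5, (1, 1): 4.0}
poa = price_of_anarchy(total_throughput, [(1, 0), (1, 1)])
print(poa, relative_loss(poa))
```

For instance, a PoA of 1.25 corresponds to losing 20% of the optimal throughput, and a PoA of 1 to losing none.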

3 Results

The game theoretic model of (3) is solved using exhaustive consideration of best responses whilst taking advantage of the structure identified by the Lemma of Section 2. For any given pair of threshold strategies, the matrix equation $\pi Q=0$ is solved by obtaining a basis for the kernel of Q. For the purpose of this paper, this is implemented in Sagemath (Stein et al, 2013).
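The paper's implementation computes a basis of the kernel of $Q$ in Sagemath. As an alternative illustration in plain Python, the sketch below swaps in a different but equivalent route: it replaces one balance equation by the normalisation $\sum _i \pi _i=1$ and solves the resulting linear system by Gauss-Jordan elimination. It assumes the chain has a single recurrent class, so that the stationary distribution is unique; the function name is invented for this sketch.

```python
def steady_state(Q):
    """Solve pi Q = 0 with sum(pi) = 1 by Gauss-Jordan elimination.

    Assumes a single recurrent class, so the solution is unique.
    """
    n = len(Q)
    # Augmented system: the transpose of Q, with the last balance equation
    # replaced by the normalisation sum(pi) = 1.
    A = [[Q[j][i] for j in range(n)] + [0.0] for i in range(n)]
    A[-1] = [1.0] * n + [1.0]
    for col in range(n):
        # Partial pivoting for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(n):
            if r != col and A[r][col] != 0.0:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][n] / A[i][i] for i in range(n)]


# Check on a single two-bed unit with arrival rate 1 and service rate 1
# (an Erlang loss system), whose distribution is proportional to (1, 1, 1/2).
Q = [[-1.0, 1.0, 0.0],
     [1.0, -2.0, 1.0],
     [0.0, 2.0, -2.0]]
pi = steady_state(Q)
print(pi)
```

Dropping one balance equation loses no information because the rows of a generator sum to zero, which makes the balance equations linearly dependent.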

3.1 Model 1: Strict diversion

This model assumes that if the bed occupancy level at both CCUs exceeds a predetermined threshold, then the admission to the CCU is cancelled. This cancellation could correspond to sending the patient to a completely different CCU (outside of the model), moving one of the current CCU patients (ready to be discharged) to another ward and/or using the post-anaesthesia care unit as a temporary overflow measure. This model corresponds to the first possibility: the patient is lost (from the point of view of this model).

Recalling Figure 2 this implies

$$\begin{aligned} \lambda _{\text {NH}}^{(r)}={\left\{ \begin{array}{ll} \lambda _{\text {NH}}, &{}\text {if }r=(l,l)\\ \lambda _{\text {NH}}+\lambda _{\text {RG}}, &{}\text {if }r=(l,h)\\ 0, &{}\text {if }r\in \{(h,l),(h,h)\}\\ \end{array}\right. } \qquad \qquad \lambda _{\text {RG}}^{(r)}={\left\{ \begin{array}{ll} \lambda _{\text {RG}}, &{}\text {if }r=(l,l)\\ \lambda _{\text {NH}}+\lambda _{\text {RG}}, &{}\text {if }r=(h,l)\\ 0, &{}\text {if }r\in \{(l,h),(h,h)\}\\ \end{array}\right. } \end{aligned}$$

We immediately see that the Lemma of Section 2 holds and so a Nash Equilibrium for our model exists.

If either CCU chooses their threshold at zero, patients are not admitted at all and, consequently, both Units are closed.

Therefore, the matrix Q has entries $q_{ij}$ as follows:

$$\begin{aligned} q_{(u_i,v_i),(u_j,v_j)}={\left\{ \begin{array}{ll} u_i\mu _{NH} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(1,0),\\ v_i\mu _{RG} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,1),\\ \lambda _{\text {NH}} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i<K_{\text {NH}} \, \text { and } \, v_i<K_{\text {RG}},\\ \lambda _{\text {RG}} &{} \text { if } (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i<K_{\text {NH}} \, \text { and } \, v_i< K_{\text {RG}},\\ \lambda _{\text {NH}}+\lambda _{\text {RG}} &{} \text { if } \left\{ \begin{array}{l} (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i<K_{\text {NH}} \, \text { and } \, v_i \ge K_{\text {RG}} \text { or}\\ (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i \ge K_{\text {NH}} \, \text { and } \, v_i < K_{\text {RG}},\\ \end{array}\right. \\ 0 &{} \text { otherwise}. \end{array}\right. } \end{aligned}$$

(20)

For the parameters of Table 1, the best responses are shown in Figure 10. For example, in Figure 10a if RG chooses $K_{\text {RG}}=6$, NH has best response $K_{\text {NH}}=8$. Similarly, if $K_{\text {NH}}=2$, RG has best response $K_{\text {RG}}=15$. A Nash Equilibrium for our game is a point at which the two best response functions intersect.

For this model, the Nash Equilibrium is at (8, 16), which gives $\widetilde{T}=3.65$ and we obtain $T^*=3.65$. Importantly, a PoA of 1 is not guaranteed for this problem. For example in Figure 10b similar best response behaviour is shown for $t=0.6$ for which the $\text {PoA}=1.18$ (the optimal throughput is again at (8, 16)).

Whilst removing central control, a certain influence can be exerted by a choice of t. Figure 11 (note the non-linear scale) and Table 2 show the effect of t and overall demand. We modify the demand rate from Table 1 by taking $\lambda _H \leftarrow \lambda _H(1+x)$ for $-0.9\le x\le 2$.

Table 2 Numerical values of PoA for different target and demand rates


We see that an extremely large PoA is obtained for $t<0.2$. For values of $t>0.5$ the PoA is still high: a PoA of 2 corresponds to the equilibrium throughput being half of the optimal throughput. These findings seem to give some backing to the targets implemented throughout the NHS (Bevan and Hood, 2006).

In particular, it can be seen that a value of $t>0.8$ becomes imperative for high demand. The lowest value of t which gives PoA $=1$ for the actual demand levels ($x=0$) is in fact $t=0.72$. It is also noted that as demand increases the effect of uncoordinated behaviour increases (and the recommended target also increases) as shown in Figure 12. This is potentially due to the fact that as demand increases there is the scope for larger discrepancy between optimal and suboptimal behaviours.
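Finding the lowest aligning target amounts to scanning candidate values of $t$ in increasing order and stopping at the first one with a PoA of 1. A minimal sketch, assuming a callable `poa` that returns the PoA for a given target; the toy function used here is a stand-in and does not reproduce the paper's results.

```python
def lowest_aligning_target(poa, targets, tol=1e-9):
    """Smallest t in targets with PoA(t) equal to 1 (within tol), else None."""
    for t in sorted(targets):
        if poa(t) <= 1 + tol:
            return t
    return None


def toy_poa(t):
    """Hypothetical PoA profile: decreases linearly until it reaches 1."""
    return max(1.0, 1.5 - t)


targets = [i / 10 for i in range(1, 11)]
print(lowest_aligning_target(toy_poa, targets))
```

The scan makes no monotonicity assumption on the PoA, so it reports the first grid value that aligns interests even if larger targets were to misalign them again.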

In this model, there is the potential for both CCUs to divert patients at the same time, and so patients are lost to the entire system. The model of the next section will investigate the effect of not allowing total rejections.

3.2 Model 2: Soft diversion

Recalling Figure 2, this model assumes

$$\begin{aligned} \lambda _{NH}^{(r)}={\left\{ \begin{array}{ll} \lambda _{NH}, &{}\text {if }r\in \{(l,l),(h,h)\}\\ \lambda _{NH}+\lambda _{RG}, &{}\text {if }r=(l,h)\\ 0, &{}\text {if }r=(h,l)\\ \end{array}\right. } \end{aligned}$$

$$\begin{aligned} \lambda _{RG}^{(r)}={\left\{ \begin{array}{ll} \lambda _{RG}, &{}\text {if }r\in \{(l,l), (h,h)\}\\ \lambda _{NH}+\lambda _{RG}, &{}\text {if }r=(h,l)\\ 0, &{}\text {if }r=(l,h)\\ \end{array}\right. } \end{aligned}$$

We immediately see that the Lemma of Section 2 holds and so a Nash Equilibrium for our model exists.
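The Lemma's conditions are simple inequalities between region-dependent rates, so the check can be written out explicitly for both models. In this sketch the function names and the dictionary keyed by region are illustrative choices, and the base rates 1.5 and 2.24 are those of the example accompanying Figure 4.

```python
def strict_diversion_rates(lam_nh, lam_rg):
    """Model 1 arrival rates, keyed by the (NH, RG) region."""
    total = lam_nh + lam_rg
    nh = {("l", "l"): lam_nh, ("l", "h"): total, ("h", "l"): 0.0, ("h", "h"): 0.0}
    rg = {("l", "l"): lam_rg, ("l", "h"): 0.0, ("h", "l"): total, ("h", "h"): 0.0}
    return nh, rg


def soft_diversion_rates(lam_nh, lam_rg):
    """Model 2: as Model 1, except each CCU keeps its own arrivals in (h, h)."""
    nh, rg = strict_diversion_rates(lam_nh, lam_rg)
    nh[("h", "h")] = lam_nh
    rg[("h", "h")] = lam_rg
    return nh, rg


def lemma_conditions_hold(nh, rg):
    """Check the two sufficient conditions of the Lemma of Section 2."""
    nh_ok = nh[("h", "l")] <= nh[("h", "h")] and nh[("l", "l")] <= nh[("l", "h")]
    rg_ok = rg[("l", "h")] <= rg[("h", "h")] and rg[("l", "l")] <= rg[("h", "l")]
    return nh_ok and rg_ok


for model in (strict_diversion_rates, soft_diversion_rates):
    print(model.__name__, lemma_conditions_hold(*model(1.5, 2.24)))
```

Both models satisfy the conditions for any non-negative base rates, since every inequality compares either 0 with a non-negative rate or a base rate with the sum of both base rates.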

This means that if bed occupancy levels at both Units exceed a pre-determined threshold, then diversions do not occur and each CCU has to accommodate their own patients. In effect we are modelling a certain level of cooperation in this case where CCUs only divert if the other CCU is not busy.

Therefore, the transition matrix Q is obtained from the following transition rates $q_{ij}$:

$$\begin{aligned} q_{(u_i,v_i), (u_j, v_j)}={\left\{ \begin{array}{ll} u_i\mu _{NH} &{} \text { if } \, (u_i,v_i)-(u_j,v_j)=(1,0),\\ v_i\mu _{RG} &{} \text { if } \, (u_i,v_i)-(u_j,v_j)=(0,1),\\ \lambda _{NH} &{} \text { if } \, (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, \left\{ \begin{array}{l} u_i<K_{NH} \, \text { and } \, v_i<K_{RG} \text { or } \\ u_i \ge K_{NH} \, \text { and } \, v_i \ge K_{RG},\end{array}\right. \\ \lambda _{RG} &{} \text { if } \, (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, \left\{ \begin{array}{l}u_i< K_{NH} \, \text { and } \, v_i<K_{RG}\text { or }\\ u_i\ge K_{NH} \, \text { and } \, v_i\ge K_{RG}, \\ \end{array}\right. \\ \\ \lambda _{NH}+\lambda _{RG} &{} \text { if } \, \left\{ \begin{array}{l} (u_i,v_i)-(u_j,v_j)=(-1,0) \, \text { and } \, u_i<K_{NH} \, \text { and } \, v_i \ge K_{RG} \text { or }\\ (u_i,v_i)-(u_j,v_j)=(0,-1) \, \text { and } \, u_i \ge K_{NH} \, \text { and } \, v_i < K_{RG}, \\ \end{array}\right. \\ 0 &{} \text { otherwise}. \end{array}\right. } \end{aligned}$$

(21)

As before, Figure 13 and Table 3 present the PoA for different target values and demand rate changes.

Table 3 Numerical values of PoA for different target and demand rates


We immediately note that the underlying cooperation that is now being forced on our players (divert only if the other player can accommodate the patients) has reduced the PoA. Note that PoA $=1.02$ still implies a reduced throughput of 2% which has very large cost implications for a national health service. A tipping point is now visible as demand increases; this is similar to the profiles shown in Knight and Harper (2013) and can be explained as follows:

When the demand is low, there is no scope for uncoordinated behaviour to be damaging. When the demand is very high, the system is saturated and once again uncoordinated behaviour has no negative effect in comparison to optimal behaviour. There is however a region of demand for which there is a high PoA.

For example, for $t=0.8$ the PoA starts to rapidly increase for demand changes higher than 0.1, and starts to decrease for a demand change of 0.6; this region will be investigated closely. Table 4 presents results for $t=0.8$ and a demand change from 0.1 to 0.6. For a 50% increase in demand, without a matching increase in capacity, rational behaviour of CCUs would incur 6% less patient throughput.

Table 4 Soft diversion results for $t=0.8$

Clearly, as the demand change increases, the Nash Equilibrium thresholds decrease. This is due to the fact that both CCUs are attempting to divert their patients in less busy states as these states become rarer. If one CCU diverts early, the other will follow suit (both CCUs incrementally reacting to each other). As a result the Nash Equilibrium for $x=0.6$ is at (0, 0), meaning that each CCU takes care of their own patients. As the demand increases even further the Nash Equilibrium remains at (0, 0) and the PoA decreases.
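The mutual reaction described above can be illustrated on a small bimatrix game. The following sketch enumerates pure strategy Nash equilibria by checking that neither player can gain from a unilateral deviation. The function name and the utility values are purely hypothetical (they are not taken from Table 4 or from the queueing model) and are chosen so that a lower threshold is always the better reply.

```python
def pure_nash_equilibria(A, B):
    """Return all (i, j) where i is a best response to j and vice versa.

    A[i][j] is the row player's utility and B[i][j] the column player's
    utility when the row player picks i and the column player picks j.
    """
    n_rows, n_cols = len(A), len(A[0])
    equilibria = []
    for i in range(n_rows):
        for j in range(n_cols):
            # The row player cannot do better by changing i while j is fixed.
            row_best = all(A[i][j] >= A[k][j] for k in range(n_rows))
            # The column player cannot do better by changing j while i is fixed.
            col_best = all(B[i][j] >= B[i][l] for l in range(n_cols))
            if row_best and col_best:
                equilibria.append((i, j))
    return equilibria


# Hypothetical utilities for thresholds 0, 1, 2 (index = threshold):
# each player gains from the other's higher threshold but prefers a lower one.
A = [[1, 3, 5], [0, 2, 4], [-1, 1, 3]]
B = [[1, 0, -1], [3, 2, 1], [5, 4, 3]]

print(pure_nash_equilibria(A, B))  # prints [(0, 0)]
```

In this toy game the only pure equilibrium is (0, 0), even though both players would receive a higher utility at (2, 2).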

Figure 14 shows the lowest values of t which give a PoA of 1. We see that as demand increases this value also increases. Also, for very low values of demand, cooperation can be obtained with no target. For the actual demand ($x=0$) a target value of $t=0.72$ is once again recommended.

4 Conclusions

In this work, a generic game theoretical model has been presented that accounts for the rational actions of two CCUs. This game theoretic model is underpinned by a queueing model that takes into account the stochastic nature of queueing systems. This work extends the application of game theoretic models already present in the literature to healthcare (Li et al, 2002; Xie and Ai, 2006).

A result is proved that allows for the assertion of existence of a Nash Equilibrium. This result is then applied to two particular models that are influenced by discussions with a local health board:

Strict diversion: patients can be lost to the system if both CCUs declare being in diversion;
Soft diversion: if both CCUs are in diversion then they cannot divert their own patients.

An analysis of the effect of rational behaviour is given for both of these models in the form of PoA calculations. The PoA is calculated so as to measure the effect of rational behaviour on overall patient throughput. The PoA represents a theoretical lower bound for the potential damages caused by uncoordinated behaviour. Higher PoAs are found in the case of strict diversion, which is to be expected as soft diversion implies a certain level of cooperation. Importantly, a non-negligible effect of rational behaviour is calculated for certain policy target values. A recommendation of setting $t=0.72$ is found across both models. This gives some evidence for a particular target value of maximal utilisation in a two CCU ward setting.

This value of t is investigated against increasing demand and is shown to be increasing in overall demand across the system. Investigating demand is akin to investigating the capacity of the CCUs and as shown in Figure 15: if capacity is not sufficient, rational behaviour can have a very damaging effect on overall patient throughput.

It is vital to acknowledge the limitations of the work presented:

The assumptions as to the strategy space of our players are restrictive: a single threshold policy might not be optimal (although it is present in various pieces of literature on optimal control of queueing systems: Naor, 1969; Shone et al, 2013). Indeed, in reality critical care managers could have far more complex boundaries for their heuristic decision making;
This model only assumes the presence of two players; however, in reality the system has a variety of stakeholders. Multiplayer systems could be worth considering. This would reflect health boards/hospitals in a concentrated area so that interactions are not just between two hospitals but between many;
The restriction to pure strategies is influenced by discussions with ABUHB and also does not detract from the results presented thanks to the Theorem and Lemma of Section 2. However, allowing for mixed strategies could also be of interest, corresponding to decision making that is not constant over time. Managers could alternate between a variety of behaviours: at times accepting patients despite being busy and at other times not;
Patient length of stay is assumed to be dependent on the CCU at which they receive service. A further extension of the work would be to use the service rate from the original CCU (prior to diversion). This would require a Markov chain with a higher dimensional state space. In practice this corresponds to patient morbidity being tied to the original CCU.

Despite these mathematical limitations, the work presented here gives strong analytical evidence as to the use of policies in a decentralised healthcare environment.

As discussed above, reducing the decision making of critical care managers to rational reactions to capacity targets is not without limitations. However, this quantitative model of behaviour was described as insightful and informative by ABUHB. In practice, stakeholders describe a target of 80% capacity. Not only is this at times impossible, it is also not evidence based and in particular does not take into account interactions between CCUs. This is a common theme in practice and in the literature, which this manuscript aims to address.

Finally, congestion and throughput are not the only concern of a healthcare system. Further work could involve the investigation of patient survival instead of throughput as utility. This would be similar to work such as Erkut et al (2008), Knight et al (2012).

The code used in this work can be found here: https://gist.github.com/anonymous/81effc06eea70a9e4e2f. The graphics for this paper were obtained using software described in Hunter (2007), Stein et al (2013); a worksheet with the data and code used for the plots can be found here: https://cloud.sagemath.com/projects/c293aefd-1fdf-4b9c-95f4-75bb77035e42/files/MeasuringThePriceOfAnarchyInCCUInteractions.sagews.

References

Allon G and Federgruen A (2007). Competition in service industries. Operations Research 55(1):37–55.
Audit Commission (1999). The place of efficient and effective critical care services within the acute hospital. Technical report.
Batt RJ and Terwiesch C (2012). Doctors under load: An empirical study of state-dependent service times in emergency care. Technical report, Working paper.
Bevan G and Hood C (2006). Have targets improved performance in the English NHS? British Medical Journal (Clinical research ed.) 332(7538):419–422.
Board ABH (2014) Aneurin Bevan Health Board—An Official NHS Wales website.
Cachon GP and Harker PT (2002). Competition and outsourcing with scale economies. Management Science 48(10):1314–1333.
Cachon GP and Zhang F (2007). Obtaining fast service in a queueing system via performance-based allocation of demand. Management Science 53(3):408–420.
Cahill W and Render M (1999) Dynamic simulation modeling of ICU bed availability. Simulation Conference Proceedings, pp. 1573–1576.
Chan CW, Yom-Tov G and Escobar G (2014). When to use speedup: An examination of service systems with returns. Operations Research 62(2):462–482.
Cooper JK and Corcoran TM (1974). Estimating bed needs by means of queuing theory. New England Journal of Medicine 291(8):404–405.
Costa AX, Ridley SA, Shahani AK, Harper PR, De Senna V and Nielsen MS (2003). Mathematical modelling and simulation for planning critical care capacity. Anaesthesia 58(4):320–327.
Deo S and Gurvich I (2011) Centralized vs. decentralized ambulance diversion: A network perspective. Management Science 57(7):1300–1319.
Dumas MB (1984). Simulation modeling for hospital bed planning. Simulation 43(2):69–78.
Erkut E, Ingolfsson A and Erdoğan G (2008). Ambulance location for maximum survival. Naval Research Logistics 55(1):42–58.
Gallivan S and Utley M (2011). A technical note concerning emergency bed demand. Health Care Management Science 14(3):250–252.
Gorunescu F, McClean SI and Millard PH (2002). Using a queueing model to help plan bed allocation in a department of geriatric medicine. Health Care Management Science 5(4):307–312.
Griffiths JD, Knight V and Komenda I (2013). Bed management in a critical care unit. IMA Journal of Management Mathematics 24(2):137–153.
Griffiths JD, Price-Lloyd N, Smithies M, Williams JE (2005). Modelling the requirement for supplementary nurses in an intensive care unit. Journal of the Operational Research Society 56(2):126–133.
Hagtvedt R and Ferguson M (2009). Cooperative strategies to reduce ambulance diversion. In Winter Simulation Conference, pp. 1861–1874.
Harper PR and Shahani AK (2002). Modelling for the planning and management of bed capacities in hospitals. Journal of the Operational Research Society 53(1):11–18.
Howard DH (2002) Why do transplant surgeons turn down organs? A model of the accept/reject decision. Journal of Health Economics 21(6):957–969.
Hunter JD (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering 9(3):90–95.
Kalai E, Kamien MI and Rubinovitch M (1992). Optimal service speeds in a competitive environment. Management Science 38(8):1154–1163.
Kc DS and Terwiesch C (2012) An econometric analysis of patient flows in the cardiac intensive care unit. Manufacturing & Service Operations Management 14(1):50–65.
Kesavan (2013). Is there an optimal elective operating theatre utilisation target? Health Services and Outcomes Research.
Kim SC, Horowitz I and Young KK (1999). Analysis of capacity management of the intensive care unit in a hospital. European Journal of Operational Research 115(1):36–46.
Kim S-H, Chan C, Olivares M and Escobar GJ (2013). ICU admission control: An empirical study of capacity allocation and its implication on patient outcomes. Columbia Business School Research Paper (12/34).
Knight VA and Harper PR (2013) Selfish routing in public services. European Journal of Operational Research 230(1):122–132.
Knight VA, Harper PR and Smith L (2012). Ambulance allocation for maximal survival with heterogeneous outcome measures. Omega 40(6):918–926.
Koutsoupias E and Papadimitriou C (1999). Worst-case equilibria. In Proceedings of the 16th Annual Symposium on Theoretical Aspects of Computer Science, pp. 404–413.
Levhari D and Luski I (1978). Duopoly pricing and waiting lines. European Economic Review 11(1):17–35.
Li SX, Huang Z, Zhu J and Chau PY (2002). Cooperative advertising, game theory and manufacturer–retailer supply chains. Omega 30(5):347–357.
Litvak N, Vanrijsbergen M, Boucherie R and Vanhoudenhoven M (2008). Managing the overflow of intensive care patients. European Journal of Operational Research 185(3):998–1010.
Maschler M, Solan E, and Zamir S (2013). Game Theory. Cambridge University Press.
Mitchell I and Grounds M (1995). Intensive care in the ailing UK health care system. Lancet 1970–1970.
Naor P (1969). The regulation of queue size by levying tolls. Econometrica 37(1):15–24.
Nash JF (1950) Equilibrium Points in N-Person Games, in Proceedings of the National Academy of Sciences of the United States of America, pp 48–49.
Pate R (2009). What is Payment by Results? Technical Report May.
Roughgarden T (2005). Selfish Routing and the Price of Anarchy. MIT Press.
Shahani AK, Ridley SA and Nielsen MS (2008). Modelling patient flows as an aid to decision making for critical care capacities and organisation. Anaesthesia 63(10):1074–1080.
Shmueli A, Sprung CL and Kaplan EH (2003). Optimizing admissions to an intensive care unit. Health Care Management Science 6(3):131–136.
Shone R, Knight VA and Williams JE (2013). Comparisons between observable and unobservable ${M}/{M}/1$ queues with respect to optimal customer behavior. European Journal of Operational Research 227(1):133–141.
Smith GB, Taylor BL, McQuillan PJ and Nials E (1995). Rationing intensive care. Intensive care provision varies widely in Britain. British Medical Journal 310(5):1412–1413.
Stein W et al (2013). Sage Mathematics Software (Version 6.0.0), The Sage Development Team. http://www.sagemath.org.
Stewart WJ (2009). Probability, Markov Chains, Queues, and Simulation, 1st edn. Princeton University Press.
Williams J, Dumont S, Parry-Jones J, Komenda I, Griffiths J and Knight V (2015). Mathematical modelling of patient flows to predict critical care capacity required following the merger of two district general hospitals into one. Anaesthesia 70(1):32–40.
Xie J and Ai S (2006). A note on cooperative advertising, game theory and manufacturer–retailer supply chains. Omega 34(5):501–504.

Author information

Authors and Affiliations

Cardiff School of Mathematics, Cardiff University, Senghennydd Road, Cardiff, CF24 4AG, UK
Vincent Knight & Jeff Griffiths
ABUHB Health Board, ABCi, Cwmbran, UK
Izabela Komenda

Corresponding author

Correspondence to Vincent Knight.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Knight, V., Komenda, I. & Griffiths, J. Measuring the price of anarchy in critical care unit interactions. J Oper Res Soc 68, 630–642 (2017). https://doi.org/10.1057/s41274-016-0100-8

Received: 23 July 2015
Accepted: 18 July 2016
Published: 17 November 2016
Issue Date: June 2017
DOI: https://doi.org/10.1057/s41274-016-0100-8
