FormalPara Highlights
  • We consider an operational problem that deals with the effective allocation of capacity among different types of appointments.

  • We integrate the operational decisions (i.e., capacity allocation) with the clinical operations (i.e., treatment and diagnosis of patients, disease progression).

  • We show how the expected number of patients at different appointments changes with respect to some of the model parameters.

  • We find that although virtual appointments are not as effective as office appointments, they can have equal importance to office appointments due to their lower costs.

  • We propose efficient and practical solutions that would help clinics in their capacity allocation decisions.

1 Introduction

Chronic care involves the treatment and monitoring of pre-existing and long-term diseases such as diabetes, high blood pressure, asthma, Alzheimer’s disease, and cardiovascular disease [10]. In the U.S., 45% of the population has at least one chronic disease, and the cost of chronic care constitutes over 75% of the entire health care spending in the U.S. [25, 48]. Given that the population is increasing and aging, the need for chronic care in the future will increase faster. Current care processes are insufficient to address this coming mismatch in supply and demand [24]. To improve patient access to chronic care and to reduce their burden, health care providers increasingly rely on virtual appointments as a new alternative way to provide effective and consistent long-term care. Virtual appointments, consisting of e-mail, phone, and online consultations, can improve patient access and ensure continuity of care and, consequently, better outcomes [12, 42].

Virtual appointments can be used as a substitute for, or be complementary to office appointments, and they can take many different forms. For example, virtual appointments can be used for diagnosis only, for treatment only, and for both treatment and diagnosis, similar to office appointments [7]. More specifically, through virtual appointments that provide diagnosis only, chronic-care patients can be monitored remotely in real time and updates regarding the patients’ status can be obtained [3, 37]. Through virtual appointments that provide treatment only, educational support and reliable resources can be provided to patients without diagnosing their status [22]. Finally, virtual appointments can also be used to provide both diagnosis and treatment, in which both the patients’ health status are diagnosed and proper treatments are provided [7]. Since virtual appointments are provided remotely, they can enhance the delivery of health care to geographically-disadvantaged and medically-underserved populations [1]. In addition, patients who are unable or unwilling to leave their homes to seek medical treatments or are in poor physical condition can also benefit from virtual appointments [4, 8]. Virtual appointments have the potential to enhance primary care delivery by enabling cost reductions for both health delivery and travel and larger panel sizes without sacrifices in the quality of health care [5, 44]. Parallel to its benefits, more patients are willing to receive care through this convenient method. Thus, the demand for virtual appointments is increasing quickly. The total number of virtual consultations is growing by around 10% a year, with growth projected to reach around 25 million in 2020 [49].

Despite the increased usage of virtual appointments and their observed benefits in chronic care, the integration of virtual appointments with office appointments can be operationally challenging for the clinics. One of the reasons for this challenge is that office and virtual appointments can have differences in their treatment/diagnosis effectiveness and in their costs. More specifically, although virtual appointments can provide cost-effective treatments, they can result in similar [16] or worse patient-related outcomes [33, 38] compared to office appointments, which makes it harder to decide how to allocate the available capacity among appointments with different effectiveness. Moreover, with the integration of virtual appointments, the patient flow dynamics become complex and it gets difficult to identify the expected number of patients that can be scheduled for office and virtual appointments. Indeed, faced with rising costs and patient populations, managers of health facilities like clinics strive to determine an appropriate capacity to meet the needs of the patients and avoid the opportunity cost and over-utilization cost as much as possible. Thus, it is important to develop strategies to determine the expected number of patients and allocate available capacity efficiently by considering the patient flow dynamics. To address the need for capacity allocation policies, we study in this paper a chronic care setting in which patients are scheduled for virtual or office appointments. We consider that, similar to office appointments, virtual appointments can also provide both treatment and diagnosis, and parallel to previous studies [7, 33, 38] we assume that virtual appointments can be less effective than office appointments. We develop a modeling framework to determine the optimal allocation of the capacity for both office and virtual appointments and aim to answer the following operational questions:

  1. 1.

    What is the expected number of patients scheduled for office and virtual appointments for the given follow-up, service, arrival, and departure rates?

  2. 2.

    How should the available capacity be allocated among office and virtual appointments to maximize the long-run average earnings of a health clinic?

To address these questions, we develop a migration network model to analyze patient flow and disease progressions. Using the migration network model, we first analytically investigate the number of patients in the steady state who are scheduled for office and virtual appointments. Second, we analyze how the expected number of patients at each node of the migration network would change with respect to some model parameters. Third, we develop a newsvendor-type model to maximize the long-run average earnings of a health clinic. We further propose heuristics to find the capacity allocations among office and virtual appointments. Fourth, we analytically investigate how limited capacity impacts the proposed heuristics and the optimal capacity allocation decisions. Finally, through our numerical studies, we analyze the effect of model parameters on the allocation of the capacity of the office and virtual appointments by analyzing different scenarios.

The remainder of this paper is organized as follows: In Section 2, we review the related literature. Section 3 presents the migration network model and characterizes the number of patients in steady-state conditions. In Section 4, we develop capacity allocation models for different settings and propose heuristics to identify capacity allocations among office and virtual appointments. In Section 5, we perform numerical experiments and sensitivity analysis to illustrate the application of our models. Finally, our conclusions are outlined in Section 6.

2 Relevant literature and contributions

In this section, we discuss separately the relevant literature and the contribution of our study.

2.1 Relevant literature

Our study builds on the literature of decision models in community-based chronic care delivery. Related to this area, [31] present and analyze three representative examples of prevailing quantitative decision models for managing community-based chronic care (i.e., [18, 30]). For each example, they analyze the background of the problem, present the methodology, and show their findings and implications. Among these examples, [30] propose a Markov decision process to model multiple care-provider visit patterns for stroke patients, while [18] combine a Markovian disease progression model with a capacity allocation model to determine revisit intervals for childhood asthma care. A major difference of our paper from the listed literature is that we consider different types of appointments (i.e., office and virtual appointments) and investigate the capacity allocation decisions among the different types of appointments.

Related to the virtual appointment setting, studies that investigate the management of office and virtual appointments are limited. In a relevant study, [36] build an optimization model to design effective checkup plans (i.e., phone calls, office visits) for monitoring individual patients after hospital discharge. Their study considers only the diagnosis impacts of the virtual appointments, whereas we include both the treatment and the diagnosis impact of the virtual appointments. Among the studies considering the impact of virtual appointments on both treatment and diagnosis, [6] develop a Markovian model to determine the patient revisit intervals in primary care by incorporating virtual appointments into an office appointment setting. In another study, [7] develop a stochastic dynamic programming model to determine the follow-up rates for office and virtual appointments, and they investigate the value of virtual appointments in patients’ health outcomes. In these papers, the capacity of the appointments is assumed to be given. In contrast to these studies, we investigate the capacity allocation of office and virtual appointments for different settings.

Another stream of literature that is relevant to our study is on the capacity planning problem in health care, which addresses the issue of allocating limited resources to satisfy the demand of the patients. There are several studies in this area, and [26] provide a comprehensive review of resource allocation and capacity planning in health care. Among this literature, the following papers are more relevant to our methodology. [11] develop an optimization/queuing network model for optimal planning of resource allocations (e.g., beds and nurses) and apply it to a blood bank and a health maintenance organization. [32] develop a multi-class migration network model as an optimization model to determine the optimal capacity that maximizes the overall profit of a dialysis clinic. [34] present a long-term care network model to determine the optimal capacity for nursing homes and community-based services. Distinct from the above literature, we consider both patient flows and patients’ disease progression to determine optimal capacity allocations. Our differences in the modeling structure are detailed in the following section.

2.2 Contributions

In this paper, we consider an operational problem that deals with the effective allocation of capacity among different types of appointments (i.e., office and virtual appointments) to maximize the long-run average earnings of the clinics. The paper contributes to the capacity allocation literature on the basis of the model structure, since we consider two types of appointments with different effectiveness and patient interactions in the chronic care setting. More specifically, different from the above literature, we consider (i) the integration of operational decisions (i.e., how to allocate capacity among different appointments) with the clinical operations (i.e., treatment and diagnosis of patients), (ii) two types of appointments having different diagnosis and treatment effectiveness, (iii) disease progression due to the chronic nature of the condition, and (iv) patient dynamics and different patient groups rather than homogenous patients, as we categorize patients as controlled vs. uncontrolled and returning vs. new. The differences in the modeling approaches lead us to reach different and unique conclusions. For example, we derive the expected number of patients scheduled for office and virtual appointments and the expected number of patients in the controlled and uncontrolled health states by considering patients’ disease progressions. Through our results, we present how office and virtual follow-up rates impact the optimal office and virtual appointment capacity allocations and present how limited total capacity and time impact the allocated capacity and average clinic earnings. We further propose efficient and practical solutions that would help clinics in their capacity allocation decisions and bring them higher average earnings. We find that although virtual appointments are not as effective as office appointments, they can have equal importance to office appointments due to their lower costs. Different from the cited literature, we also show how the expected number of patients at different nodes in the migration network model changes with respect to some of the model parameters.

3 Migration network model for office and virtual appointments

In this section, we consider a cohort of patients receiving chronic care via both office and virtual appointments. In this network, two types of patients are served (i.e., new patients and returning patients), and physicians provide both office and virtual appointments. We use a continuous-time open migration network [27, p.48-p.57] to simulate the population dynamics (i.e., patient flows and disease progression) in which patients’ arrivals are considered as Poisson process and the time intervals between patient transitions are independently and exponentially distributed. We further consider an infinite population so that the node capacities of the migration network are unlimited.

We illustrate our migration network model in Fig. 1, and we describe the nodes and flows of the network in this section. We use \(i \in {\mathcal {I}=}\{o,v\}\), where “o” corresponds to office appointments and “v” corresponds to virtual appointments, to denote the set of appointment types. New patients with office and virtual appointments arrive with Poisson arrival rate \(\lambda _{i}, {\forall i \in \mathcal {I}}\). We define the “service” as the diagnosis and treatment of a patient, and we consider that office and virtual appointments provide both diagnosis and treatment during the appointment. More specifically, service time corresponds to the duration of an appointment, and service times of patients are exponentially and independently distributed. We use \({\mu _{i}}, {\forall i \in \mathcal {I}}\) to denote the service rate of office and virtual appointments, respectively. We define follow-up time (i.e., revisit interval) as the time between the current visit and the next time the patient initiates an appointment. We consider that after each appointment, the physician recommends to the patient the type and the time of the next visit. Hence, patients are scheduled for appointments based on the physician’s recommendation. Patient follow-up times are assumed to be independently and exponentially distributed, with a rate of \({\sigma _{i}}, {\forall i \in \mathcal {I}}\). Some patients may depart from the physicians’ panel before scheduling another appointment (i.e., change the physician or clinic). Patients’ departure times are independently and exponentially distributed with a rate of δ.

We use the “control” measure to characterize the patient health status. The “control” measure helps with understanding how well chronic-care related symptoms are currently controlled in a patient. Depending on the types of chronic diseases, these categorizations may differ. For example, for asthma, four categories can be used as follows: (i) controlled, (ii) improved, (iii) unchanged, and (iv) worsened, and the last three are classified as an uncontrolled health state [18]. For the sake of simplification, in our model, we consider two health states as controlled and uncontrolled to characterize the patients’ health states. Then, let \(j {\in \mathcal {J}=} \{c,u\}\) represent the set of health states, where “c” corresponds to the controlled health state and “u” corresponds to the uncontrolled health state.

We assume that patients in the network may not be scheduled for an appointment (i.e., may not receive any care) and they may be in the controlled and in the uncontrolled health states. Hence, we define wc as patients who are in the controlled health state and not scheduled for an appointment, while we use wu to denote patients who are in the uncontrolled health state and not scheduled for an appointment. We assume that there is no transition from the uncontrolled state to the controlled state without treatment. However, due to disease progression, some of the patients in the controlled health state and not receiving care (i.e., wc) may transition into the uncontrolled health state (i.e., wu) within the unit time. The time for a controlled patient to progress into the uncontrolled state is assumed to follow an exponential distribution with a rate of γ.

We also consider that patients in the network may be scheduled for office and virtual appointments and receive care. At each type of appointment, the health state of the patient is diagnosed and the patient is treated. We assume that office appointments can be more effective than virtual appointments [7, 33, 38], and the treatment and the diagnosis in the office appointments are perfect, while those of the virtual appointments are imperfect. Perfect treatment means that a patient’s health state recovers to the best health state after treatment, while perfect diagnosis means that a patient’s health state is revealed accurately during the diagnosis. On the other hand, imperfect treatment means that a patient’s health state can transit into a different health state with some probability, while imperfect diagnosis means that a patient’s health state may be revealed inaccurately during the diagnosis. The perfect diagnosis/treatment assumption is similar to the ones in the machine maintenance and repair literature as well [9, 43]. Moreover, in the healthcare literature, the perfect diagnosis and treatment assumption is also used by [6, 7, 18]. More specifically, patients in each health state are assumed to be always diagnosed accurately if they are scheduled for an office appointment, and they will be in the controlled health state after the office appointment regardless of their initial health state before the appointment. On the other hand, patients scheduled for a virtual appointment may be diagnosed inaccurately, since virtual appointments are expected to be less precise than office appointments [7, 33, 38].

Thus, we first define so to represent patients who are scheduled for office appointments and receiving care (i.e., diagnosis and treatment). Next, to capture the imperfect diagnosis of virtual appointments, we define sv,j to denote patients who are scheduled for virtual appointments and diagnosed in health state \(j {\in \mathcal {J}}\) at the virtual appointment. We use conditional probability to define the imperfect diagnosis probability for the virtual appointments. Let \(p_{j|j^{\prime }}, {\forall j,j^{\prime } \in \mathcal {J}}\) denote the probability that the patient in health state \(j^{\prime }\) is diagnosed in health state j at the virtual appointment, where \(p_{c|j^{\prime }} + p_{u|j^{\prime }} = 1, { \forall j^{\prime }\in \mathcal {J}}\). We also capture the imperfect treatment in virtual appointments. More specifically, a patient diagnosed in the uncontrolled health state at the virtual appointment may remain in the uncontrolled health state with probability (1 − pu) or may transition into the controlled health state with probability pu. Similarly, a patient diagnosed in the controlled health state at the virtual appointment may remain in the controlled health state with probability pc after the virtual appointment or may be in the uncontrolled health state with probability (1 − pc) after the virtual appointment (since not all patients in the controlled health state may be diagnosed accurately). Hence, the outcomes of nodes sv,c and sv,u are the patients who are being diagnosed and treated during the appointment, and the patients’ health statuses may remain the same, may improve, or may get worse after the virtual appointment (i.e., after being diagnosed and treated). We further assume that the new patients scheduled for virtual appointments will be diagnosed in the controlled health state with probability ph.

Overall, we consider five nodes in the network, and we use \(k \in {\mathcal {K}=} \{{w_{c}}; {w_{u}}; {s_{o}}; {s_{v,c}}; {s_{v,u}}\}\) to represent the set of nodes in the migration network. In Fig. 1, we illustrate the described flow of patients between each node through arcs. The arcs between nodes represent the process of a patient that flows from one node to another. For example, the arc from node “so” to “wc” represents the flow of patients from an office appointment to their homes after they have their appointment. We also show the inflow and outflow for each node next to each arc. For example, there are two outflows from node “sv,u” where uncontrolled patients can improve to the controlled health state or can remain in the uncontrolled health state after receiving a virtual appointment (i.e., being diagnosed and treated).

Fig. 1
figure 1

Migration network model

We define αk to denote the expected number of patients at node \(k \in {\mathcal {K}}\) in the steady-state condition. The number of patients at node k satisfy the following balance equations, which are derived from Fig. 1 [27, p.49]:

$$ \begin{array}{@{}rcl@{}} \mu_{v}{\alpha_{s_{v,c}}} - \sigma_{v}{p_{c|c}}\alpha_{{w_{c}}} -\sigma_{v}{p_{c|u}}\alpha_{{w_{u}}} &=& p_{h}\lambda_{v} \end{array} $$
(1)
$$ \begin{array}{@{}rcl@{}} \mu_{v}{\alpha_{s_{v,u}}} - \sigma_{v}(1-{p_{c|c}})\alpha_{{w_{c}}} -\sigma_{v}(1-{p_{c|u}})\alpha_{{w_{u}}} &=& (1-p_{h})\lambda_{v} \end{array} $$
(2)
$$ \begin{array}{@{}rcl@{}} -\mu_{v}{p_{c}}{\alpha_{s_{v,c}}} -\mu_{v}{p_{u}}{\alpha_{s_{v,u}}} + (\sigma_{v}+\sigma_{o}+\delta+\gamma)\alpha_{{w_{c}}} -\mu_{o}{\alpha_{s_{o}}} &=& 0 \end{array} $$
(3)
$$ \begin{array}{@{}rcl@{}} -\mu_{v}(1-{p_{c}}){\alpha_{s_{v,c}}} -\mu_{v}(1-{p_{u}}){\alpha_{s_{v,u}}} - \gamma\alpha_{{w_{c}}} +(\sigma_{v}+\sigma_{o}+\delta)\alpha_{{w_{u}}} &=& 0 \end{array} $$
(4)
$$ \begin{array}{@{}rcl@{}} - \sigma_{o}\alpha_{{w_{c}}} - \sigma_{o}\alpha_{{w_{u}}} +\mu_{o}{\alpha_{s_{o}}} &=& \lambda_{o} \end{array} $$
(5)

These equations represent that the total inflow to node \(k \in {\mathcal {K}}\) must be equal to the total outflow from that node. Equations 15 are the balance equations with five unknowns (i.e., \({\alpha _{s_{v,c}}}\); \({\alpha _{s_{o}}}\); \({\alpha _{s_{v,u}}}\); \(\alpha _{{w_{c}}}\); \(\alpha _{{w_{u}}}\)), and we solve these balance equations to obtain the average number of patients in each node at steady state. The result for each αk, \({\forall k \in \mathcal {K}}\) is included in the Appendix B. By using these equations, we characterize some of the structural properties to show the relationship between the model parameters and the number of patients at each node through Theorem 1.Footnote 1

Theorem 1

In the migration network model, the number of patients at each specific node presents the following structural properties:

  1. (a)

    Expected number of patients at virtual appointments (i.e., \({\alpha _{s_{v,c}}} + {\alpha _{s_{v,u}}}\)) is a linearly increasing function of virtual follow-up rate (σv), while it is not dependent on the office follow-up rate (σo), and expected number of patients at office appointments (i.e., αo) is a linearly increasing function of office follow-up rate (σo), while it is not dependent on the virtual follow-up rate (σv).

  2. (b)

    Expected number of patients who are not scheduled for an appointment and in the controlled health state (i.e., \(\alpha _{w_{c}}\)) is an increasing concave function of office follow-up rate (σo), and expected number of patients who are not scheduled for an appointment and in the uncontrolled health state (i.e., \(\alpha _{w_{u}}\)) is a decreasing convex function of office follow-up rate (σo) if pu > pc.

  3. (c)

    Expected number of patients who are not scheduled for an appointment and in the controlled health state (i.e., \(\alpha _{w_{c}}\)) is an increasing convex function of pc, while it is an increasing concave function of pu. On the other hand, the expected number of patients who are not scheduled for an appointment and in the uncontrolled health state (i.e., \(\alpha _{w_{u}}\)) is a decreasing concave function of pc, while it is a decreasing convex function of pu.

  4. (d)

    Expected number of patients who are not scheduled for an appointment and in the controlled health state (i.e., \(\alpha _{w_{c}}\)) is a decreasing convex function of pc|c, while it is a decreasing concave function of pc|u if pu > pc.

  5. (e)

    Expected numbers of patients who are not scheduled for an appointment and in both controlled and uncontrolled health states (i.e., \(\alpha _{w_{c}}\), and \(\alpha _{w_{u}}\)) are not dependent on the service rate of office and virtual appointments (μo and μv).

Theorem 1 (a) shows that as the virtual follow-up rate σv (resp. office follow-up rate σo) increases (i.e., as patients have virtual visits (resp. office visits) more frequently), the expected number of patients scheduled for virtual appointments (resp. office appointments) increases linearly. On the other hand, office follow-up rate σo (resp. virtual follow-up rate σv) does not impact the expected number of patients scheduled for virtual appointments (resp. office appointments). We further investigate how the office follow-up rate impacts the expected number of non-scheduled patients in the controlled and uncontrolled health states. Theorem 1 (b) states that if the condition of pu > pc is satisfied, as the office follow-up rate σo increases, patients’ health statuses are more controlled and the expected number of non-scheduled patients in the controlled health state increases with a concave structure, while the expected number of patients in the uncontrolled health state decreases with a convex structure. Moreover, we investigate the treatment impact of virtual appointments on the expected number of non-scheduled patients in different health states. More specifically, as described in Theorem 1 (c), if the treatment effectiveness of virtual appointments in the controlled and uncontrolled health states (i.e., pc and pu) increase, the expected number of non-scheduled patients in the controlled health state (i.e., \(\alpha _{w_{c}}\)) will increase with a convex and a concave structure, respectively. Note that both pc and pu can be at most 1, and hence the expected number of non-scheduled patients in the controlled health state will take its highest value when pc = pu = 1. On the other hand, the increase in the treatment effectiveness of virtual appointments in the controlled and uncontrolled health states (i.e., pc and pu) results in a decrease in the expected number of non-scheduled patients in the uncontrolled health state (i.e., \(\alpha _{w_{u}}\)) with a concave and a convex structure, respectively. We also investigate the impact of the diagnosis effectiveness of virtual appointments on non-scheduled patients. In Theorem 1 (d), it is described that as pc|c and pc|u increase, the expected number of non-scheduled patients in the controlled health state (i.e., \(\alpha _{w_{c}}\)) decreases with a convex and a concave structure, respectively. Finally, through Theorem 1 (e) we find that office and virtual service rates (i.e., μv and μo) do not impact the expected number of non-scheduled patients in both controlled and uncontrolled health states. Overall, through Theorem 1, we show how patient dynamics and flows change at each node with respect to the model parameters.

Next, we use our results from the migration network model to define the capacity allocation model. Hence, we use αk, \(\forall k \in {\mathcal {K}}\) to define the steady-state distribution πk, \(\forall k \in {\mathcal {K}}\). Hence, let xk denote the number of patients at node \(k \in {\mathcal {K}}\). Kelly ([27], p.53) shows that in steady state, the nodes states are independent, and the steady-state distribution for each node is a Poisson distribution with parameter αk and given by Eq. 6 as follows:

$$ \begin{array}{@{}rcl@{}} \left. \pi_{k}(x_{k}=x) = \frac{{\alpha_{k}^{x}}}{x!} \middle/ {\sum}_{n=0}^{\infty} \frac{{\alpha_{k}^{n}}}{n!} \right. = e^{-\alpha_{k}} \frac{{\alpha_{k}^{x}}}{x!}, \quad { \forall k\in {\mathcal{K}}} \end{array} $$
(6)

The steady-state distribution defines the probability of having xk number of patients at each node k. We use these probabilities to define the probabilistic capacity allocation model in Section 4.

4 Capacity allocation model

In this section, we build newsvendor-type capacity allocation models to find the optimal capacity for office and virtual appointments to maximize the long-run average earnings of a clinic. As described in the previous section, we consider that the node capacities of the migration network are unlimited, where the number of patients at each node is unlimited. However, we consider a threshold capacity for office and virtual appointments [34]. The threshold capacity that we assign describes the number of patients that can be served under the regular cost, and the actual number of patients in office and virtual appointments can exceed this threshold capacity. When the number of patients in office and virtual appointments exceeds this threshold capacity, we consider that a penalty cost due to patient overflow occurs. Hence, we aim to find the optimal threshold capacity for office and virtual appointments for the clinics under the assumption that node capacities are unlimited. In the following sections, we first introduce the capacity allocation model without constraints. Then, we modify the unconstrained model by adding constraints on the optimal office and virtual appointment threshold capacities.

4.1 Unconstrained capacity allocation model

We consider a clinic that provides both office and virtual appointments with a threshold capacity of \(M_{s_{o}}\) for office appointments, a threshold capacity of \(M_{s_{v}}\) for virtual appointments, and the total threshold capacity of \(M={M_{s_{o}}}+ {M_{s_{v}}}\). Since in our migration network we split the virtual appointments into two parts to reflect the imperfect diagnosis and treatment, we define \(M_{{s_{v,c}}}\) and \(M_{{s_{v,u}}}\), which denote the threshold capacity for virtual appointments in the controlled and uncontrolled health states, respectively (i.e., \({M_{s_{v}}} = M_{{s_{v,c}}} + M_{{s_{v,u}}}\)). Defining different types of capacities for virtual appointments ensures more flexibility in the model definition and our findings can impact the following two areas in practice: (i) capacity splitting decisions and (ii) patient scheduling decisions. In practice, although it is not common to split the capacity of virtual appointments according to the patients’ health statuses, there are different implementations of virtual appointments and especially after COVID-19, they have been used in practice more often. More specifically, (i) some virtual appointments can be provided asynchronously for patients in controlled health states [39], (ii) some of them can be provided synchronously for both controlled and uncontrolled patients [28], and (iii) some of them can be provided synchronously for emergency patients (i.e., tele-emergency) [20]. Hence, clinics can consider splitting their virtual appointment capacities based on patient needs. On the other hand, clinics do not need to split their virtual appointment capacity, but they can implement the proposed policies in their patient scheduling decisions. More specifically, during the appointments, physicians diagnose patients, provide treatments, and by considering the patient’s overall health status and progress, the provider recommends the next appointment time to the patient. Thus, when physicians are making their scheduling recommendations for the next appointment they can take into account the capacity allocation decisions (i.e., the ratio of \({M_{s_{v,c}}/M_{s_{v,u}}}\)).

Let \(\mathcal {K}^{\prime }=\{{s_{o}}; {s_{v,c}}; {s_{v,u}}\}\) be the set of updated migration network nodes. Then, we first define \(r_{k} \forall k \in {\mathcal {K}^{\prime }}\) as the marginal profit of office and virtual appointments. We note that there is one type of virtual appointment and \({r_{s_{v}}}=r_{{s_{v,c}}}=r_{{s_{v,u}}}\) (i.e., the difference between the revenue and variable costFootnote 2 is the same for all virtual appointments). Similarly, we define ηk, \(\forall k \in {\mathcal {K}^{\prime }}\) to denote the unit threshold capacity cost for office and virtual appointments, where unit threshold capacity cost is the fixed cost of allocating capacity, which can be employee salaries, building-related costs, and equipment. We assume that rk > ηk [32]. By assuming rk > ηk, we ensure that the optimal threshold capacity Mk is greater than 0. More specifically, if the unit threshold capacity cost is larger than or equal to the marginal profit, it will be optimal to provide no service and Mk = 0. Since there is one type of virtual appointment, we consider that \({{\eta _{s_{v}}}}={\eta _{{s_{v,c}}}}={\eta _{{s_{v,u}}}}\). We further assume that the number of patients at the office and virtual appointments can exceed the allocated threshold capacity, and in this case, the clinic provides the corresponding appointment but at a higher total cost. To reflect the cost of patient overflow, we define fk > 0, \(\forall k \in {\mathcal {K}^{\prime }}\), which represents the unit net penalty cost of the overflow, where \({f_{s_{v}}}=f_{{s_{v,c}}}=f_{{s_{v,u}}}\). The definition is similar to the definition of the overbooking cost used by [32]. It is the net cost of meeting the overflow demand, which is the difference between the total variable cost of meeting the extra demand and the revenue earned for that appointment. The clinic still earns the marginal profit rk for the overflow patients, but the extra variable cost of meeting this excess demand is more than the marginal profit. Let xk(t) denote the current number of patients at node \(k, \forall k \in {\mathcal {K}^{\prime }}\) at time t. Then, our unconstrained capacity allocation model can be defined as follows:

$$ \begin{array}{@{}rcl@{}} max \quad A(M) &=& \lim_{T\rightarrow\infty} \frac{1}{T} \left\{ {\sum}_{k\in{\mathcal{K}^{\prime}}} {{\int}_{0}^{T}} r_{k} min \left[ x_{k}(t), M_{k} \right] dt \right.\\&&\qquad\qquad\left.- {\sum}_{k\in{\mathcal{K}^{\prime}}} {{\int}_{0}^{T}} {\eta_{k}} M_{k} dt \right. \\ &&- \left.{\sum}_{k\in{\mathcal{K}^{\prime}}} {{\int}_{0}^{T}} f_{k} \left( x_{k}(t) - M_{k} \right)^{+} dt \right\} \end{array} $$
(7)
$$ \begin{array}{@{}rcl@{}} &&s.t. \\ && M_{k} \geq 0 \quad \quad \quad \quad \quad \quad \quad \quad \forall k \in {\mathcal{K}^{\prime}} \end{array} $$
(8)

As noted before, the objective function is defined as the function of threshold capacity and the number of patients at each node. Since the number of patients at each node is uncertain, we use the steady-state probabilities πk defined in Section 3. In objective function (7), the first term represents the marginal profit generated from office and virtual appointments, the second term represents the fixed threshold capacity cost, and the third term represents the penalty costs associated with a capacity shortage. Equation 8 defines the non-negativity setting.

Let \(\mathbb {E}_{\pi _{k}} (x_{k}),k \in {\mathcal {K}^{\prime }}\) be the expected number of patients at node k under the steady-state distribution \(\pi _{k},k \in {\mathcal {K}^{\prime }}\). Due to the ergodicity of the open migration network [27, p.49], we can define the following equations:

$$ \begin{array}{@{}rcl@{}} &&\lim_{T\rightarrow\infty} \frac{1}{T} {{\int}_{0}^{T}} x_{k} (t) dt = \mathbb{E}_{\pi_{k}} (x_{k}) = \alpha_{k} \end{array} $$
(9)
$$ \begin{array}{@{}rcl@{}} &&\lim_{T\rightarrow\infty} \frac{1}{T} {{\int}_{0}^{T}} \left[M_{k} - x_{k} (t)\right]^{+} dt = \mathbb{E}_{\pi_{k}} \big[(M_{k} - x_{k})^{+}\big] \end{array} $$
(10)
$$ \begin{array}{@{}rcl@{}} &&\lim_{T\rightarrow\infty} \frac{1}{T} {{\int}_{0}^{T}} \left[x_{k} (t) - M_{k} \right]^{+} dt = \mathbb{E}_{\pi_{k}} \big[(x_{k} - M_{k})^{+} \big] \end{array} $$
(11)

Then, we reformulate the objective function (7) with the following equation, and we include the detailed steps of the reformulation in the Appendix B.

$$ \begin{array}{@{}rcl@{}} A(M) &=& {\sum}_{k\in{\mathcal{K}^{\prime}}} \Big[ (r_{k} - {\eta_{k}})\alpha_{k} - {\eta_{k}} \mathbb{E}_{\pi_{k}} \big[(M_{k} - x_{k})^{+}\big] \\&&- (f_{k} + r_{k} - {\eta_{k}}) \mathbb{E}_{\pi_{k}} \big[(x_{k} - M_{k})^{+} \big] \Big] \end{array} $$
(12)

Similar to Eq. 7, in Eq. 12, the first term is the difference between the marginal profit and the fixed cost. The second term is the opportunity cost for unutilized capacity, and the last term represents the cost due to patient overflow.

We define Ak(Mk) as the individual objective function for appointment \(k \in {\mathcal {K}^{\prime }}\), and it can be defined as follows:

$$ \begin{array}{@{}rcl@{}} A_{k} (M_{k}) \!&=&\! (r_{k} - {\eta_{k}})\alpha_{k} - {\eta_{k}} \mathbb{E}_{\pi_{k}} \big[(M_{k} - x_{k})^{+}\big] \\&&\!- (f_{k} + r_{k} - {\eta_{k}}) \mathbb{E}_{\pi_{k}} \big[(x_{k} - M_{k})^{+} \big], \qquad \forall k \in {\mathcal{K}^{\prime}} \end{array} $$
(13)

To maximize the objective A(M), each sub-objective Ak(Mk) can be maximized separately. We derive the optimal threshold capacity \(M_{k}, \forall k \in \mathcal {K}^{\prime }\) for this unconstrained capacity planning model through Proposition 1.

Proposition 1

The optimal solution of the unconstrained capacity allocation model, denoted by \(M^{*} = ({M^{min}_{s_{o}}}, M^{min}_{{s_{v,c}}}, M^{min}_{{s_{v,u}}})\), is given by

$$ \begin{array}{@{}rcl@{}} M_{k}^{min} &=& min \left\{ M_{k} \geq 0: \pi_{k} (x_{k} \le M_{k}) \ge \frac{f_{k} + r_{k} - {\eta_{k}}}{f_{k} + r_{k}}\right\} \\&& \forall k\in {\mathcal{K}^{\prime}} \end{array} $$
(14)

where πk(xkMk) is the cumulative probability that xi is less than or equal to Mk, \(\forall k \in \mathcal {K}^{\prime }\).

According to Eq. 14, the optimal threshold capacity depends on both Poisson distribution parameter αk and the ratio of \(\frac {f_{k} + r_{k} - {\eta _{k}}}{f_{k} + r_{k}}\). For a fixed Poisson parameter αk, the cumulative probability function is a non-decreasing function of Mk. Hence, as the ratio \(\frac {f_{k} + r_{k} - {\eta _{k}}}{f_{k} + r_{k}}\) increases, the optimal threshold capacity will also increase.

4.2 Capacity allocation model with capacity constraint

In practice, due to the limited resources of the clinic, the number of regular office and virtual appointments may be limited. In this section, we extend the unconstrained capacity allocation model presented in Section 4.1 and investigate the impact of adding an upper bound on the optimal threshold capacity decisions. This change does not impact the balance equations of the migration network and patient flows, and the objective function A(M) remains the same as with the unconstrained model. More specifically, node capacities in the migration network are unlimited, and it is still allowed to have more than the Mk number of patients [34]. Hence, Eqs. 16 still hold when the constraint (16) is added. We use TC to denote the limited total threshold capacity. Then, the capacity allocation model can be updated as follows:

$$ \begin{array}{@{}rcl@{}} max \quad A(M) &=& {\sum}_{k\in{\mathcal{K}^{\prime}}} \left[ (r_{k} - {\eta_{k}})\alpha_{k} - {\eta_{k}} \mathbb{E}_{\pi_{k}} \big[(M_{k} - x_{k})^{+}\big] \right.\\&&\left.- (f_{k} + r_{k} - {\eta_{k}}) \mathbb{E}_{\pi_{k}} \big[(x_{k} - M_{k})^{+} \big] \right] \end{array} $$
(15)
$$ \begin{array}{@{}rcl@{}} &&s.t. {\sum}_{k \in {\mathcal{K}^{\prime}}} M_{k} \leq TC \end{array} $$
(16)
$$ \begin{array}{@{}rcl@{}} && M_{k} \geq 0 \quad \quad \quad \quad \quad \quad \quad \quad \forall k \in {\mathcal{K}^{\prime}} \end{array} $$
(17)

In the model, Eq. 16 states that the allocated threshold capacity should be less than or equal to the total available threshold capacity TC, and Eq. 17 defines non-negativity constraints. Let \(M^{}_{TC}\) be the allocated threshold capacity when the total threshold capacity is limited. Recall that \(M^{*} = ({M^{min}_{s_{o}}}, M^{min}_{{s_{v,c}}}, M^{min}_{{s_{v,u}}})\) is the optimal threshold capacity for the unconstrained capacity allocation model given in Proposition 1. It is clear that if \({\sum }_{k \in {\mathcal {K}^{\prime }}} M_{k}^{min} \leq TC\), then \(M^{}_{TC} = M^{*}\). This means that the clinic has enough resources, which maximizes its overall average earnings, and the clinic may be considered not to have excess capacity. If \({\sum }_{k \in {\mathcal {K}^{\prime }}} M_{k}^{min} \geq TC\), this means that the clinic has scarce resources, and the optimal solution provided in Proposition 1 may not hold. Hence, we propose Heuristic 1 to determine the capacity allocation decisions. Let \({M_{k}^{t}}, k \in {\mathcal {K}^{\prime }}\) be the threshold capacity of node k at iteration t. Heuristic 1 uses the partial derivation calculation of the objective function, so we define the partial derivative of the objective function for each \(k \in {\mathcal {K}^{\prime }}\) as follows:

$$ \begin{array}{@{}rcl@{}} A^{\prime} (M_{k}) &=& \frac{{\Delta A(M)}}{{\Delta M_{k}}} = A(M_{k} +1) - A(M_{k}) = f_{k} + r_{k} \\&&- {\eta_{k}} - (f_{k} + r_{k}) \pi_{k} (x_{k} \leq M_{k}) \quad \forall k \in {\mathcal{K}^{\prime}} \end{array} $$
(18)

Then, Heuristic 1 can be stated as follows:

figure a

In Heuristic 1, we calculate the marginal gain of having one more unit of the threshold capacity for the office and virtual appointments. At each step, we compare the marginal gain of having one office and one virtual appointment and increase the threshold capacity of the appointment with the highest gain by one. The heuristic stops when the allocated capacity reaches the available total threshold capacity or when adding one more unit of threshold capacity for all appointments yields a negative profit gain.

4.3 Capacity allocation model with time constraint

In this section, we take into account the total time required for providing each type of appointment. Section 4.2 assumes that office and virtual appointments both take an equal amount of time. However, virtual appointments are expected to be shorter than office appointments. Hence, we update (16) by considering the total available time and service times of office and virtual appointments. Let Tw denote the average total available time for the clinic. As defined in Section 4.1, \({\mu _{i}}, {\forall i \in \mathcal {I}}\) represents the service rate of an office and a virtual appointment, respectively. We assume that virtual appointments for controlled and uncontrolled patients have the same service rate, where \(\mu _{{s_{v,c}}} = \mu _{{s_{v,u}}} = \mu _{v}\). Hence, \(\frac {1}{{\mu _{i}}}, {\forall i \in \mathcal {I}}\) represents the average service time of appointment type i. Then the capacity allocation model with the time constraint can be defined as follows:

$$ \begin{array}{@{}rcl@{}} \mathit{max}\!\! \quad A(M) \!&=&\! {\sum}_{k\in{\mathcal{K}^{\prime}}} \left[ (r_{k} - {\eta_{k}})\alpha_{k} - {\eta_{k}} \mathbb{E}_{\pi_{k}} \big[(M_{k} - x_{k})^{+}\big] \right.\\&&\left.\!- (f_{k} + r_{k} - {\eta_{k}}) \mathbb{E}_{\pi_{k}} \big[(x_{k} - M_{k})^{+} \big] \right] \end{array} $$
(19)
$$ \begin{array}{@{}rcl@{}} &&\!s.t. \!\quad \frac{1}{\mu_{o}} {M_{s_{o}}} + \frac{1}{\mu_{v}} (M_{{s_{v,c}}} + M_{{s_{v,u}}}) \!\leq\! T_{w} \end{array} $$
(20)

where the objective function remains the same. Let \(M^{}_{T_{w}}\) be the threshold capacity allocation decision for the model with limited time. Recall that \(M^{*} = ({M^{min}_{s_{o}}}, M^{min}_{{s_{v,c}}}, M^{min}_{{s_{v,u}}})\) is the optimal threshold capacity for the unconstrained capacity allocation model given in Proposition 1. Similar to Section 4.2, if \(\frac {1}{\mu _{o}} {M_{s_{o}}}^{min} + \frac {1}{\mu _{v}} (M_{{s_{v,c}}}^{min} + M_{{s_{v,u}}}^{min}) \leq T_{w}\), then \(M^{}_{T_{w}} = M^{*}\). On the other hand, if the resources are scarce, Proposition 1 does not hold. Hence, we define Heuristic 2 to determine the capacity allocation decisions when the total time is limited. Let \({M_{k}^{t}}, k \in {\mathcal {K}^{\prime }}\) be the threshold capacity of node k at the tth iteration. We further define \(Z_{k} (M_{k}) = A^{\prime } (M_{k}) / \frac {1}{{\mu _{k}}} = {\mu _{k}} A^{\prime } (M_{k})\), which is the marginal profit gain of node \(k \in {\mathcal {K}^{\prime }}\) when total available time is limited.

Similar to Heuristic 1, Heuristic 2 also compares the marginal gains of different types of appointments at each iteration and increases the threshold capacity of the appointment with the highest positive gain by 1. Neither Heuristic 1 nor Heuristic 2 guarantees the optimal solution. Hence, we analyze the relative errors of the solutions of the heuristics from the optimal solution in the following proposition:

Proposition 2

The relative errors of the solutions of Heuristic 1 (i.e., MTC) and Heuristic 2 (i.e., \(M_{T_{w}}\)), which are the approximations of \(M_{TC}^{*}\) and \(M_{T_{w}}^{*}\), respectively, are no greater than the relative error terms of \(\frac {max_{k} A^{\prime } ({M_{k}^{t}})}{A(M_{TC})}\) and \(\frac {max_{k} A^{\prime } ({M_{k}^{t}})}{A(M_{T_{w}})}\), respectively. We state the corresponding equations as follows:

$$ \begin{array}{@{}rcl@{}} \frac{A(M_{TC}^{*}) - A(M_{TC})}{A(M_{TC}^{*})} &\leq& \frac{A(M_{TC}^{*}) - A(M_{TC})}{A(M_{TC})} \!<\! \frac{max_{k} A^{\prime} ({M_{k}^{t}})}{A(M_{TC})} \end{array} $$
(21)
$$ \begin{array}{@{}rcl@{}} \frac{A(M_{T_{w}}^{*}) - A(M_{T_{w}})}{A(M_{T_{w}}^{*})} &\leq& \frac{A(M_{T_{w}}^{*}) - A(M_{T_{w}})}{A(M_{T_{w}})} \!<\! \frac{max_{k} A^{\prime} ({M_{k}^{t}})}{A(M_{T_{w}})} \end{array} $$
(22)

where \({M_{k}^{t}}\) represents the threshold capacity in the final iteration of the heuristics. Also, \(A^{\prime } ({M_{k}^{t}})\) is the marginal gain of appointment type \(k \in {\mathcal {K}^{\prime }}\) in the final iteration. Proposition 2 ensures that the percent profit gap between the optimal solution and the proposed heuristic solutions is not greater than the percent marginal profit gain in the final iteration.

figure b

5 Numerical studies

In this section, we perform numerical experiments to analyze how the optimal capacity allocation decision varies under different scenarios. To this end, we first describe the model parameter estimation process. Then, we investigate the change in the optimal threshold capacity with respect to the follow-up rate and capacity constraint. Finally, we compare the proposed solutions with some common policies in practice.

5.1 Parameters estimation

In this section, we describe how the model parameters are obtained. We note that our data are based on the literature, and we use several sources to find the parameter values. The parameter values we obtain represent different characteristics. Due to the variation in the parameters’ characteristics, fluctuation in the results can be expected. The parameters that we present in this section is our initial setting, and the results that are obtained based on a single setting may not be generalized. To overcome these issues, in Sections 5.2 and 5.3, we define several scenarios and investigate the changes in the proposed office and virtual appointment capacities by considering possible fluctuations in the parameter values. We present minimum, average, and maximum values of the proposed threshold capacity values to provide a range for the decision-makers.

Flow parameters

Based on a survey of American physicians [45], doctors see 20.2 patients per day on average, and physicians work on average 51.40 hours per week (including all clinical and non-clinical duties). Of these, physicians work on average 11.37 hours per week on non-clinical (paperwork) duties only. Hence, we calculate the average service time for each patient as (51.4 − 11.37)/(5 × 20.2) = 0.396 hours and the service rate of the office appointment as μo = 1/0.396 = 2.525/hour. In addition, the average appointment time of the virtual appointments is less than that of the office appointments, and it is reported as around 12 minutes [47]. Thus, the virtual service rate is estimated as \(\mu _{v}=\mu _{{s_{v,c}}}=\mu _{{s_{v,u}}} = 60 / 12 = 5\)/hour. To calculate the new patient arrival rate, we consider the state of Michigan. The population of Michigan was 9.976 million in 2017, and 47.9% of them suffered from chronic diseases [41], while in 2018, the Michigan population increased to 9.996 million, and 48.1% of them suffered from chronic diseases [46]. Thus, the number of new chronic patients in Michigan can be calculated as (9.996 × 48.1% − 9.976 × 47.9%) × 106 = 29,572. There are 278 clinics in the state of Michigan [15], from which number the total monthly new arrival rate may be estimated as λo + λv = 29572/(278 × 12) = 8.865 patients/month. Also, around 10.4% percent of the visits occur through virtual appointments [45]. Thus, λo = 8.865 × (1 − 10.4%) = 7.943 patients/month, λv = 8.865 − 7.943 = 0.922 patients/month. According to the regulations on chronic care management (CCM) [19], a patient should receive at least 20 minutes of clinical care by a physician or other qualified health care professional per calendar month. Considering the service rates of office and virtual appointments, the follow-up rate can be estimated as \(\sigma _{o} = \frac {20 minutes/month}{0.396 hour} = 0.842\)/month, and \(\sigma _{v} = \frac {20 minutes/month}{12 minutes} = 1.667\)/month. Based on a CDC report [14], generally incurable and ongoing chronic diseases affected approximately 133 million Americans in 2009, representing more than 40% of the total population in this country. In 2009, 7 out of 10 deaths in the U.S. were due to chronic diseases, and the death rate due to chronic diseases was around 1.706 million [29]. Thus, the monthly departure (i.e., death) rate can be calculated as δ = 1.706/(133 × 12) = 0.00107/month. In addition, the disease progression (i.e., transferring from a controlled health state to an uncontrolled health state) is estimated as γ = 0.5/week [13]. To estimate the new patients’ health status, we consider a study that considers diabetes patients. [17] present that 43% of new patients among their analytic sample had indications of uncontrolled diabetes. Hence, we assume ph = 1 − 43% = 57%, which is the probability that the newly arrived patient in a virtual appointment is in the controlled health state. We note that for the remaining parameters (pc,pu,pc|c, and pc|u) we perform sensitivity analysis to investigate their effects on the capacity allocation.

Revenue and cost

We assume that the workday for a clinic is 20 days per month. For the revenue and cost parameters of the office appointment, we refer to the study of [32]. Then, the marginal profit of an office appointment is estimated as \(r_{{s_{o}}} = \$131\) /day × 20 days/month= $2620/month, the fixed threshold capacity cost of an office appointment is estimated as \({\eta _{s_{o}}} = \$ 84.6/\textit {day} \times 20 \)days/month = $1692/month, and the penalty cost of an office appointment is estimated as \({f_{s_{o}}} = \$ 50\)/day × 20 days/month = $1000/month [32]. For the virtual appointments, there is no direct historical data, but the total cost of virtual appointments is estimated as 32% less than that of traditional office appointments [2]. Hence, the fixed threshold capacity cost for virtual appointments is estimated as \({\eta _{{s_{v,c}}}} = {\eta _{{s_{v,u}}}} = \$1692\)/month × (1 − 32%) = $1150.56/month. For the marginal profit and the penalty cost of the virtual appointments, we use the same parameter values as with the office appointments (i.e, \(r_{{s_{v,c}}} = r_{{s_{v,u}}} = r_{{s_{o}}}\), and \(f_{{s_{v,c}}} = f_{{s_{v,u}}} = {f_{s_{o}}}\)). Table 1 summarizes the value of the patient-flow and profit-related parameters together with the sources from which they are estimated. We note that the parameter values listed in Table 1 are used in one of the scenarios. Then, we analyze several scenarios by considering the possible fluctuations in the parameter values. Hence, through our scenarios we also consider the cases where \({r_{s_{v,c}}}={r_{s_{v,u}}}>r_{{s_{o}}}\) or \({r_{s_{v,c}}}={r_{s_{v,u}}}<r_{{s_{o}}}\) (resp. where \({f_{s_{v,c}}}={f_{s_{v,u}}}>{f_{s_{o}}}\) or \({f_{s_{v,c}}}={f_{s_{v,u}}}<{f_{s_{o}}}\)).

Table 1 List of model parameters

5.2 Impact of the model parameters on the threshold capacity

In this section, we investigate the changes in the optimal threshold capacity and average earnings as some of the key parameters in the model change. The parameters we use in the model are obtained from the literature and are not specific to any healthcare organization. For that reason, we perform sensitivity analysis to investigate the optimal threshold capacity for varying parameter values to ensure that the changes in the parameter values due to the different clinics’ characteristics can be addressed. We show how the optimal threshold capacity can change with respect to a change in other parameter values. Through our results, we show not only how the optimal threshold capacity changes but also the range of the change in optimal threshold capacity and average earnings. Our sensitivity analysis would also help us to generalize our results for a varying set of parameters and data sets.

Follow-up rate

We first study the impact of follow-up rate (i.e., σo,σv) on the optimal threshold capacity. It has important relevance since reducing or increasing the follow-up rate implies a lower or greater frequency of patients’ visits. We vary the follow-up rate in a range between 0.5σ and 1.5σ and present the corresponding optimal capacities in Fig. 2. The results show that as we increase the follow-up rate of office appointments, the optimal threshold capacity of office appointments increases monotonically, while the total optimal threshold capacity for virtual appointments does not change. This is reasonable, because the increase in the office follow-up rate would result in an increase in the expected number of patients in the office appointments at the steady state but not in the expected number of patients in the virtual appointments. It is also observed that the expected number of patients in virtual appointments is not a function of the office follow-up rate as stated in Eq. 39 in the Appendix B. There occurs a slight increase in the optimal threshold capacity of controlled patients, the reason for which is that the increasing follow-up rate of office appointments transfers more patients from the uncontrolled health state into the controlled health state. Similarly, as the follow-up rate of virtual appointments increases, it is observed that the optimal threshold capacity for controlled health state increases more than the optimal threshold capacity for uncontrolled health state. Although in practice there may be no distinction between capacity for uncontrolled and controlled conditions, this an indication that the increasing follow-up rate of virtual appointments relatively reduces the number of patients in the uncontrolled health state. Thus, increasing the follow-up rate of either office or virtual appointments can improve the health condition of the patients, but it simultaneously increases the demand for office and virtual appointments, which is a challenge for the clinic. This result is consistent with the practice that as the average follow-up rate of the patients increases, the panel size of one physician would decrease. Thus, to serve the same number of patients, more physicians are needed for the clinic.

Fig. 2
figure 2

The impact of follow-up rate on the optimal threshold capacity

Limited capacity

Next, we analyze the impact of limited total threshold capacity (TC) on the optimal threshold capacity and the average earnings of the clinic. To this end, we first determine M, which is the optimal threshold capacity allocation vector for the unconstrained model. For the given model parameters in the unconstrained model, the optimal threshold capacity for office appointments, for controlled virtual appointments, and for uncontrolled virtual appointments is 17, 12, and 8, respectively, and the optimal total threshold capacity of the clinic is 37. Hence, we change the range of limited total threshold capacity (TC) from 25 to 40. As shown in Fig. 3, as the limited total threshold capacity increases, the office and virtual capacities increase until they reach the optimal. We can see that when the limited total threshold capacity (TC) is greater than 37 (i.e., \({\sum }_{k \in {\mathcal {K}^{\prime }}} M_{k}^{min} \leq TC\)), the threshold capacity allocation vector for the model with the equal-weighted capacity constraint (i.e., \(M_{TC}^{}\)) is equal to the optimal result of the unconstrained model (i.e.,M), which is consistent with the analysis in Section 4.2.

Fig. 3
figure 3

The impact of limited threshold capacity (TC) on the optimal threshold capacity and average earnings

In addition, as we increase the limited total threshold capacity, the average long-run earnings of office and virtual appointments increase and the marginal profit gain decreases, which is consistent with Eq. 87, that the second-order derivation of the long-run average earnings is a monotonic decreasing function.

Limited time

We also study the impact of limited working time (Tw) on the threshold capacity and the average earnings of the clinic. Recall that the optimal threshold capacity for the unconstrained model is \(M^{*}=\{{M_{s_{o}}}^{min}=17, M_{{s_{v,c}}}^{min}=12,M_{{s_{v,u}}}^{min}=8\}\). By considering the service rates (i.e., μo = 2.525/hour and μv = 5/hour), the optimal threshold capacity for the unconstrained model is around 11 hours. Hence, we vary the range of the limited working time from 8 to 12 hours. Then, we use Heuristic 2 to obtain the allocation decision for this problem. As shown in Fig. 4, when Tw = 8hours, the allocated capacities for office and virtual appointments for controlled and uncontrolled patients are 11, 11, and 7, and when Tw = 10hours, the allocated capacities for office and virtual appointments for controlled and uncontrolled patients are 16, 11, and 7.

Fig. 4
figure 4

The impact of limited working time (Tw) on capacity allocation and average earnings

We observe that the change in the limited time affects the threshold capacity for office appointments more than the threshold capacity for virtual appointments. This is because the average service time of office appointments is nearly twice that of virtual appointments. If the limited time decreases, it becomes more profitable to reduce the threshold capacity for office appointments by one unit rather than reducing the threshold capacity for virtual appointments by two units. As shown, if the limited time is greater than 11 hours, the actual working time remains constant, which is consistent with the analysis in Section 4.2. Since \(\frac {1}{\mu _{o}} {M_{s_{o}}}^{min} + \frac {1}{\mu _{v}} (M_{{s_{v,c}}}^{min} + M_{{s_{v,u}}}^{min}) \leq T_{w}\), \(M_{T_{w}} = M^{*}\).

5.3 Comparison of policies

In this section, we compare the total profits of some common benchmark policies with those of our proposed policies (i.e., optimal policy, Heuristics 1 and 2). As benchmark policies, we consider three varying ratios of office appointment threshold capacity to virtual appointment threshold capacity (i.e., \({M_{s_{o}}}/{M_{s_{v}}}\)): (i) Policy-1: \({M_{s_{o}}}/{M_{s_{v}}}=2\), (ii) Policy-2: \({M_{s_{o}}}/{M_{s_{v}}}=1\), and (iii) Policy-3: \({M_{s_{o}}}/{M_{s_{v}}}=1/2\). Initially, we consider that the virtual appointment capacities allocated for controlled and uncontrolled patients are equal to each other. For comparison, we analyze several scenarios by varying the parameter values. As the number of varying parameters increases, the number of scenarios and the complexity of the analyses increase. Hence, considering that the impact of the δ and γ variables on the capacity allocation decisions can be small and that preserving the relationship of μoμv is important, we keep these variables constant. For all 16 remaining parameters, we use two possible values (i.e., low, high). We use the following formulas to calculate the low and high levels for each parameter:

$$ \begin{array}{@{}rcl@{}} &&\text{Low Value of a Parameter} = (1-\text{Fluctuation Rate})\\&&\times \text{Original Parameter Value} \end{array} $$
(23)
$$ \begin{array}{@{}rcl@{}} &&\text{High Value of a Parameter} = (1+\text{Fluctuation Rate})\\&&\times \text{Original Parameter Value} \end{array} $$
(24)

By considering all possible combinations, we evaluate 215 = 32,768 scenarios for the unconstrained model, and for the model with the capacity and time constraint, we analyze 216 = 65,536 scenarios as we also change the parameter TC and Tw. In Tables 2 and 3, we present the solutions obtained from the proposed heuristics (i.e., the ratio of \({M_{s_{o}}}/{M_{s_{v}}}\), \(M_{{s_{v,c}}}/M_{{s_{v,u}}}\), and A(M)) and the comparison results of the policies (i.e., Policy-1, Policy-2, and Policy-3) with respect to the proposed heuristics for the unconstrained model, the capacity-constrained model, and the time-constrained model. Table 2 shows the results for a fluctuation rate of 5%, while Table 3 shows the results for a fluctuation rate of 10%. To calculate the percent gap between the profit function of the policies and of the proposed heuristics, we use the following formula:

Table 2 Comparison of Policy-1, Policy-2, and Policy-3 with benchmark policies when the fluctuation rate is 5%
Table 3 Comparison of Policy-1, Policy-2, and Policy-3 with benchmark policies when the fluctuation rate is 10%
$$ \begin{array}{@{}rcl@{}} &&\!\!\!\!\text{Percent Profit Gap} \\&&\!\!\!\!= \frac{\text{Profit of Proposed {Heuristic} - Profit of Corresponding Policy}}{\text{Profit of Proposed {Heuristic}}} \\&&\!\!\!\!\times 100<percent> \end{array} $$
(25)

In Tables 2 and 3, we present the average, maximum, and minimum values obtained over all scenarios, and we observe that the allocated capacities according to the optimal policy and heuristics for office and virtual appointments fluctuate as parameters change. When the three common policies are compared, we can see that Policy-2 (i.e., \({M_{s_{o}}}/{M_{s_{v}}}=1\)) is the best, even though the results become worse as fluctuation increases. For the unconstrained model, the optimal threshold capacity allocation ratio (i.e., \({M_{s_{o}}}/{M_{s_{v}}}\)) varies between 0.71 and 1.11 when the fluctuation rate is 5%, while it varies between 0.62 and 1.27 when the fluctuation rate is 10%. As expected, when the uncertainty in the parameter values increases, the optimal threshold capacity allocation ratio varies more. It also shows that even if the fluctuation rate is high, it is not reasonable to use a threshold capacity allocation ratio of less than 0.62 or more than 1.27. Similar to the unconstrained model, in the model with the capacity and time constraints, Policy-2 performs the closest to the proposed solutions, but the variation is more compared to the unconstrained model where the model with time constraint has the highest variability. In the model with capacity constraint, the suggested \({M_{s_{o}}}/{M_{s_{v}}}\) ratio varies between 0.7 and 1.14 when the fluctuation rate is 5%, and it varies between 0.5 and 1.5 when the fluctuation rate is 10%. Finally, in the time-constrained model, the proposed \({M_{s_{o}}}/{M_{s_{v}}}\) ratio changes between 0.64 and 1.11 when the fluctuation rate is 5%, while it changes between 0.54 and 1.29 when the fluctuation rate is 10%. According to the results of the proposed policies (i.e., optimal, Heuristics 1 and 2), the average threshold capacity allocation ratio \({M_{s_{o}}}/{M_{s_{v}}}\) should be 0.89 for the unconstrained model, 0.92 for the capacity-constrained model, and 0.86 for the time-constrained model.

We also compare the change in the total threshold capacity split for virtual appointments and for all models in Tables 2 and 3. For the unconstrained model, the optimal \(M_{{s_{v,c}}}/M_{{s_{v,u}}}\) ratio varies from 1.11 to 1.86 when the fluctuation is 5%, and it varies between 0.88 and 2.43 when the fluctuation is 10%. For the model with capacity constraint, the suggested \(M_{{s_{v,c}}}/M_{{s_{v,u}}}\) ratio ranges between 1.11 and 2.2 when the fluctuation rate is 5%, while this ratio ranges between 0.88 and 3.33 when the fluctuation rate is 10%. Finally, for the time-constrained model, the suggested \(M_{{s_{v,c}}}/M_{{s_{v,u}}}\) ratio changes between 1.1 and 2 for the fluctuation rate of 5%, while this change is between 0.88 and 2.6 for the fluctuation rate of 10%. For all models, we observe that the average suggested \(M_{{s_{v,c}}}/M_{{s_{v,u}}}\) ratio is greater than 1, which means that the threshold capacity allocated for patients diagnosed in the controlled health state at virtual appointments is greater than the threshold capacity allocated for patients who are diagnosed in the uncontrolled health state at the virtual appointments. In practice, physicians can take into account this ratio while making their recommendation to the patient for the next appointment.

We next analyze the impact of the fluctuation rate on the optimal policy for the unconstrained model. We vary the fluctuation rate between 2.5% and 20%. In Fig. 5, we present the impact of the fluctuation rate on the average, maximum, minimum, and standard deviation of the optimal \({M_{s_{o}}}/{M_{s_{v}}}\) ratio for the unconstrained model. As the fluctuation rate increases, the average increases slightly, but the range of the ratio of office appointment threshold capacity to virtual appointment threshold capacity and the fluctuation of this ratio become larger. This indicates that the increasing uncertainty in parameter values makes the allocation decision harder for the policymakers.

Fig. 5
figure 5

The impact of fluctuation rate on the optimal threshold capacity allocation ratio, \({M_{s_{o}}}/{M_{s_{v}}}\) for the unconstrained model

6 Conclusion

Virtual appointments, consisting of e-mail, phone, and online consultations, are increasingly changing our point of view of traditional office appointments, which makes the integration of virtual appointments and office appointments critical for healthcare providers. Nowadays, many health clinics and hospitals are transitioning in virtual health services, which brings several operational challenges. Hence, the capacity planning for these two kinds of appointments has become an important but complex problem in the field of healthcare.

In this study, we use a migration network to model the flow of chronic patients. Our model further reflects the varying effectiveness of office and virtual appointments in treatment and diagnosis. We build a newsvendor optimization model to determine how to allocate the capacity of the office and virtual appointments to maximize the network’s long-run average earnings. We present an unconstrained model and two extended models with capacity and time constraints to demonstrate the potential use of this model in the clinic network under different scenarios. Through numerical studies, we present one clinic network with parameters estimated from the state of Michigan. We study the use of our optimization models under different scenarios and perform sensitivity analysis for the comparison of different allocation policies. Our theoretical and numerical studies bring us several insights into the clinic system and the application of virtual appointments. We propose efficient and practical solutions that bring higher average earnings compared to the common policies in practice. Our results also suggest that virtual appointments should be used more for following up with controlled patients than for treating uncontrolled patients. Also, although virtual appointments are not as effective as office appointments, they have equal importance to office appointments due to their lower costs.

There are several limitations that can be extended in several directions. First, our study focuses on capacity allocation decisions for given arrival and follow-up rates. As an extension, the optimal follow-up rates and arrival rates can be investigated for a given capacity of office and virtual appointments. Considering the arrival rate as a decision variable refers to the physician’s panel size decision, where the physician can decide on the rate of new patients to accept into her/his panel [23, 35, 40]. Second, our data are based on the literature, which limits the generalization of the model. In the future, the model can be verified by using a specific clinic’s or hospital’s data. Finally, we assume in our study that all patients show up for their appointments. However, no-shows are common in practice, and no-shows in office and virtual appointments can be different. In a future study, the impact of no-shows on capacity allocation decisions can be investigated.