1 Highlights

  • Proposing an analytical framework for estimating waiting times in outpatient departments.

  • Applying Shapley additive explanation (SHAP) to interpret the relationship between waiting times and operational features.

  • Leveraging asymmetric loss functions to prevent underestimation of waiting times in the framework.

  • Introducing a dissatisfaction-aware asymmetric error score (DAES) to balance the trade-off between underestimation and accuracy.

  • Demonstrating the framework’s effectiveness through a case study in a hospital’s endocrinology metabolism department and neurosurgery department.

2 Introduction

Due to the large demand for outpatient services, overcrowding in outpatient departments is common in hospitals (e.g., in the United States [1] and Asian countries [2]). The long waiting times resulting from overcrowding are the main cause of patient dissatisfaction, which in turn leads to a decline in perceived service quality [3, 4]. Therefore, managing waiting times is necessary to improve the quality of outpatient services, and it has been considered an extremely important issue in hospital operations management [5, 6].

Given this need, previous studies have attempted to measure the importance of service features affecting outpatient waiting times using linear models (e.g., linear regression (LR)) that statistically estimate the extent to which different features affect service quality based on questionnaire data collected from outpatients [7,8,9]. However, this approach has at least two limitations. First, although the traditional technique can represent an intuitive relationship between waiting times and the considered features, it exhibits low predictive accuracy because linear assumptions may not accommodate the complex patterns in hospital operation systems [10, 11]. Second, measurement using outpatient questionnaires is expensive and time-consuming [8]. Moreover, this approach mainly captures outpatient perspectives at isolated temporal snapshots, whereas actual service improvement requires the continuous analysis of service operations to allow for real-time inferences [12].

To address these limitations, technologies using data collected from hospital information systems (HIS) and machine learning (ML) have been studied to improve hospital services [13,14,15,16]. As a result, waiting time management has benefited substantially from the use of prediction models trained with rich data to estimate outpatient waiting times [10, 17, 18]. However, although this data-driven ML approach achieves high accuracy, the inability to interpret the predictions makes it challenging to estimate the importance of service features that affect waiting times. To address the lack of interpretability of ML models, several studies have suggested and employed interpretable machine learning (IML) approaches in healthcare. For instance, Ahmad et al. [19] and Stiglic et al. [20] emphasized the significance of IML and its applicability to address challenges and requirements within healthcare. Gao et al. [21] utilized a two-step extracted regression tree approach for hospital readmission prediction, achieving a balance between accuracy and interpretability. Okay et al. [22] applied IML methods in conjunction with random forest (RF) and gradient boosting algorithms to diagnose diabetes, enhancing interpretability without sacrificing accuracy. Finally, Hu et al. [23] employed IML to understand the reasoning behind the outcome of an optimal model for the early prediction of prognosis in sepsis. However, specific efforts to apply IML for predicting patient waiting times in healthcare are still lacking despite the importance of the prediction model’s interpretability from an operational perspective. Without an analytical interpretation of the relationship between service features and waiting times, hospital managers face difficulty in prioritizing which features to adjust to reduce waiting times and enhance overall patient satisfaction.

Meanwhile, given that outpatients are notified of their expected waiting times, a significant research gap in the literature is the lack of ML models that reflect patients’ perspectives in the training phase. Providing patients with individual expected waiting times improves patient satisfaction, as it enables them to manage their time effectively instead of passively waiting for their consultation in the waiting room. From the hospital’s perspective, a prediction model can be useful for allocating workloads to medical workers through simulation analysis, such that operations can be improved to alleviate patient dissatisfaction. However, this plausible scenario rests on an important assumption, namely, that the model accurately predicts the waiting time for each patient. In the case of overestimation, that is, when the actual waiting time is shorter than the predicted waiting time, the error can be acceptable and may even have a positive effect on patient satisfaction [24]. However, a serious concern of hospital staff regarding the introduction of predictive models is underestimation by the models. If a patient cannot enter the consultation room even after the notified waiting time has elapsed, then dissatisfaction may increase with the patient’s perceived waiting time [25]. This situation occurs frequently in practice. In addition, if the perceived waiting time based on the patient’s situational observation of the waiting room differs from the predicted waiting time, then patient dissatisfaction can increase [26, 27]. In summary, to reduce patient dissatisfaction when introducing a prediction model for waiting time notification, it is important that the ML model does not underestimate patients’ waiting times. Additionally, it is crucial to identify and explain the specific reasons for long waiting times to the patients.

Fig. 1 Overall process of the proposed approach

To address these issues, we propose a framework for waiting time prediction that considers patient dissatisfaction using the IML method and asymmetric loss functions. First, IML can explain the mechanisms of ML models trained as black boxes [28], thereby addressing the trade-off between predictability and interpretability. Second, the loss functions employed in existing studies to train ML models are symmetric; they impose equivalent penalties on overestimation and underestimation. By contrast, the asymmetric loss functions used in our study calculate different penalties depending on the sign of the error value (or the specific interval in which it falls), even when the absolute error values are equal. Third, given that the introduction of an asymmetric loss function adversely affects the accuracy of ML models, the dissatisfaction-aware asymmetric error score (DAES) introduced in this study allows for the selection of a suitable model by considering the trade-off between accuracy and underestimation. Hence, our framework allows the accurate prediction of outpatient waiting times and the estimation of the importance of service features affecting the waiting times from a service-oriented perspective. The objective is to reduce patients’ dissatisfaction caused by the underestimation of waiting times. Furthermore, our framework alleviates the underestimation problem of the prediction model through the asymmetric loss function and provides an explanation of the reason for each waiting time prediction. Finally, with our framework, decision makers can interpret the prediction model to identify directions for improving outpatient service operations.

We validate the proposed framework through case studies in one of the largest hospitals in South Korea. We analyze continuously collected operations data archived in the HIS and define the variables to construct datasets for developing prediction models. Then, we investigate the importance of features in waiting time prediction and discuss the theoretical and practical implications of the analysis results. The results of the case study confirm that the proposed analytical framework is a useful tool for waiting time management and serves as a starting point for gaining a deeper understanding of the relationships between the factors in outpatient service and waiting time. Furthermore, the model trained with asymmetric loss functions considering patient dissatisfaction can be implemented in a system that provides stakeholders with the expected waiting time and the corresponding explanation, thereby enhancing patient satisfaction and reducing the perceived waiting time. In practice, outpatient waiting time prediction should be used both for patient notification and for improving outpatient operations. In conclusion, our framework goes beyond mere prediction; it offers practical solutions to the challenges faced in outpatient healthcare settings. By providing real-time patient notifications, enabling operational improvements, and minimizing patient dissatisfaction, our work contributes significantly to the field, ultimately benefiting both patients and healthcare providers.

To further illustrate the contribution of our work, the remainder of this paper is organized as follows. Section 3 describes the overall framework and methodological background, including Shapley additive explanations (SHAP), the asymmetric loss function, and DAES. Section 4 describes the results of applying the proposed framework to an actual hospital in South Korea. Finally, the conclusions of this study are presented in Section 5.

3 Methodology

3.1 Proposed framework

The overall process of the proposed approach is shown in Fig. 1 and comprises four steps: (1) data preparation, (2) model development, (3) performance evaluation, and (4) model interpretation. The blue line in the figure represents the process for understanding the features that affect waiting times for service improvement, and the red line represents the process for developing a model for the waiting time prediction service. The data preparation step involves data collection from the HIS, feature engineering through literature review and interviews, and data preprocessing. In the model development and performance evaluation steps, ML models are trained to accurately estimate the waiting time through hyperparameter optimization, and the performance of the trained ML models is examined using evaluation metrics to determine the best model. Finally, SHAP (Section 3.2) is applied to perform an analytical interpretation of the relationship between waiting times and the considered features. For the waiting time prediction service, the ML models are trained with asymmetric loss functions (Section 3.3) and evaluated with DAES (Section 3.4) to consider patient dissatisfaction.

3.2 SHAP

SHAP is a model-agnostic IML method based on coalition game theory; it calculates the Shapley value representing the contribution of each player [29, 30]. SHAP has been shown to satisfy the properties of local accuracy, consistency, and missingness [29]. Among various IML methods, such as Local Interpretable Model-agnostic Explanations [31] and Anchors [32], we employ SHAP for the following reasons. First, SHAP is based on the rigorous theoretical background of Shapley values used in game theory [30] and offers explanations of the model’s results without sacrificing its accuracy, ensuring a solid scientific basis for our interpretability framework. Second, SHAP allows for sample-wise interpretability, offering patient-centric insights tailored to individual patients. Each patient can be provided with an explanation of the expected waiting time, increasing their satisfaction and awareness of the waiting situation. Third, from a healthcare operations perspective, SHAP helps us understand global feature importance and analyze the significance of specific data subsets, facilitating optimized resource allocation and operational efficiency. Finally, this method is model-agnostic and universally applicable without the need for custom parameter tuning. It excels in considering complex interplays between variables, making it valuable for our analysis of factors affecting outpatient service waiting times in hospitals. Moreover, SHAP assigns importance to each feature through additive feature attribution, which is calculated as follows:

$$\begin{aligned} g(z') = \varphi _{0} + \sum ^{M}_{i=1} \varphi _{i}z'_{i} \end{aligned}$$
(1)

where g is the explanation model; \(z'\in \{0,1\}^{M}\) is the coalition vector that represents whether the feature is present (=1) or absent (=0); M is the number of features; \(\varphi _{0}\) is the intercept value; and \(\varphi _{i} \in \mathbb {R}\) is the feature attribution for feature i, the SHAP value.

Specifically, the SHAP value of feature i is calculated using the expected marginal contribution. This is the difference between the importance value without feature i and the importance of the entire subset, as shown in Eq. (2).

$$\begin{aligned} \small \varphi _{i} = \sum _{S \subseteq S_{all} \backslash \{ i \}} { \vert S\vert !(M-\vert S \vert -1)! \over M! } \left( f_{x} (S \cup \{ i \}) - f_{x}(S)\right) \end{aligned}$$
(2)

where \(\varphi _{i}\) denotes the SHAP value of feature i, S denotes the subset of features, and \(S_{all}\) denotes the set of all features. \(f_{x}(S \cup \{ i \})\) and \(f_{x}(S)\) are the conditional expectations of model f with and without feature i, respectively, at an observed variable x belonging to S.

The SHAP value is useful in determining the local importance of individual instances. However, calculating the average absolute SHAP value across all instances enables us to determine the global importance, as follows:

$$\begin{aligned} GI_{i} = {1 \over N} \sum ^{N}_{j=1} \vert \varphi _{i}^{(j)} \vert \end{aligned}$$
(3)

where \(GI_{i}\) is the global importance of feature i, N is the number of data instances, and \(\varphi _{i}^{(j)}\) is the SHAP value of feature i for the \(j^{th}\) instance.

To reduce time complexity, SHAP values can be approximated by various methods, such as KernelSHAP (applicable to all ML models), DeepSHAP (applicable to neural network models), and TreeSHAP (applicable to decision-tree-based ensemble learning models) [29, 33].
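To make Eq. (2) concrete, the following minimal Python sketch computes exact Shapley values by brute force over all feature subsets. The toy model `f`, the instance, and the baseline used for "absent" features are illustrative assumptions, not part of the original study; practical implementations rely on the approximations above (e.g., TreeSHAP) because the exact sum has \(2^{M}\) terms.

```python
from itertools import combinations
from math import factorial

def shapley_values(f, x, baseline):
    """Exact Shapley values per Eq. (2), enumerating all feature subsets.

    f        : model taking a full feature vector
    x        : instance to explain
    baseline : values substituted for 'absent' features
    """
    M = len(x)

    def f_masked(S):
        # Features in S keep their observed value; the rest use the baseline.
        return f([x[i] if i in S else baseline[i] for i in range(M)])

    phi = []
    for i in range(M):
        others = [j for j in range(M) if j != i]
        total = 0.0
        for size in range(M):
            for S in combinations(others, size):
                # Weight |S|!(M - |S| - 1)! / M! from Eq. (2).
                w = factorial(size) * factorial(M - size - 1) / factorial(M)
                total += w * (f_masked(set(S) | {i}) - f_masked(set(S)))
        phi.append(total)
    return phi

# Hypothetical additive model: each feature's Shapley value is its own term.
f = lambda z: 3.0 * z[0] + 2.0 * z[1] - 1.0 * z[2]
phi = shapley_values(f, [1.0, 2.0, 3.0], [0.0, 0.0, 0.0])
print([round(v, 6) for v in phi])  # [3.0, 4.0, -3.0]
```

The additive model serves as a sanity check: the values sum to \(f(x)-f(baseline)\) (local accuracy), and averaging \(\vert \varphi _{i} \vert \) across instances would yield the global importance of Eq. (3).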

Fig. 2 Examples of Quad-Quad and LINEX

3.3 Asymmetric loss function

As mentioned in the Introduction, avoiding underestimation is crucial in practical applications, in addition to predictive ability, because underestimation is more detrimental to patient satisfaction than overestimation. Therefore, we introduce asymmetric loss functions to train the waiting time prediction model. Generally, loss functions, such as the mean square error (MSE), are symmetric and assign the same penalty regardless of the sign of the error value. Unlike symmetric loss functions, asymmetric loss functions can assign different penalties to overestimation and underestimation. In our study, we employ two popular asymmetric loss functions. These functions contain a quadratic or exponential component that makes them smooth and differentiable, thereby allowing the model to be optimized using gradient-based methods. Figure 2 illustrates the shapes of these loss functions when the parameters are set to penalize underestimation more heavily. This adjustment makes a predictive model robust against underestimation.

The double quadratic loss function (Quad-Quad) increases quadratically on each side of the origin, but its penalty differs depending on the sign of the error. According to [34], Quad-Quad can be formulated as follows:

$$\begin{aligned} L_{quad}(\alpha ) = 2 \cdot [\alpha + (1 - 2\alpha ) \cdot 1_{\{\epsilon <0\}}] \cdot {|\epsilon |}^{2} \end{aligned}$$
(4)

where \(\epsilon = y - \hat{y}\) is the error between the target value (y) and the prediction (\(\hat{y}\)). \(1_{\{\epsilon <0\}}\) is an indicator equal to 1 if \(\epsilon < 0\), and 0 otherwise. \(\alpha \in (0,1) \) is a parameter that adjusts the degree of asymmetry. When \(\alpha \) is 0.5, the loss is symmetric. If \(\alpha > 0.5\), the penalty for underestimation (i.e., \(\epsilon > 0\)) is larger than that for overestimation (i.e., \(\epsilon < 0\)).

The linear exponential loss function (LINEX) [35] increases linearly on one side of the origin and exponentially on the other side. This function is convex and can handle asymmetry smoothly [36, 37]. LINEX is defined as follows:

$$\begin{aligned} L_{linex}(\beta ) = {2 \over {\beta }^{2}} [\exp ({\beta \epsilon }) - \beta \epsilon -1] \end{aligned}$$
(5)

where \(\beta \ne 0\) is a parameter that adjusts the degree of asymmetry, and \(\exp (\cdot )\) denotes the exponential function. When \(\beta > 0\), positive errors (underestimation) incur larger penalties than negative errors.
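As a sketch, both loss functions can be written in a few lines of NumPy; the parameter values below are illustrative defaults, not those used in the study. To train a gradient-boosting model with them, one would additionally supply the first and second derivatives as a custom objective.

```python
import numpy as np

def quad_quad(y_true, y_pred, alpha=0.7):
    """Quad-Quad loss, Eq. (4): quadratic on both sides of the origin.
    With alpha > 0.5, underestimation (eps > 0) is penalized more heavily."""
    eps = np.asarray(y_true) - np.asarray(y_pred)
    weight = np.where(eps < 0, 1.0 - alpha, alpha)  # 1_{eps<0} switches the weight
    return 2.0 * weight * eps ** 2

def linex(y_true, y_pred, beta=0.1):
    """LINEX loss, Eq. (5): exponential on one side, roughly linear on the other.
    With beta > 0, positive errors (underestimation) incur larger penalties."""
    eps = np.asarray(y_true) - np.asarray(y_pred)
    return (2.0 / beta ** 2) * (np.exp(beta * eps) - beta * eps - 1.0)
```

For example, with \(\alpha = 0.7\), an underestimation of 1 min costs \(2 \cdot 0.7 = 1.4\), whereas an overestimation of the same magnitude costs only \(2 \cdot 0.3 = 0.6\).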

3.4 Asymmetric error score for dissatisfaction-aware waiting time prediction

We leverage asymmetric loss functions (i.e., Quad-Quad and LINEX) so that the prediction model is encouraged to avoid underestimation. A model trained with an asymmetric loss function suitably increases the overall predicted values; however, this causes excessive errors among the overestimated predictions. Such errors lead to another dissatisfaction issue when patients miss their turn for consultation, especially patients who trust an excessively overestimated waiting time and leave the waiting room. To address this issue, we propose DAES to determine an appropriate deployment model. It assigns high penalties not only to underestimated errors but also to overestimated errors that satisfy certain conditions. Therefore, the model with the lowest DAES is the most suitable. DAES is formulated as follows:

$$\begin{aligned} DAES(\gamma ) = \rho _{(\epsilon> 0)} \cdot e(\epsilon > 0) + \rho _{(\epsilon<-\gamma )} \cdot e(\epsilon <-\gamma ) \end{aligned}$$
(6)

where \(\epsilon >0\) and \(\epsilon <-\gamma \) represent the underestimated errors and the overestimated errors larger than \(\gamma \) min, respectively; \(\rho \) denotes the ratio of instances satisfying each condition; and \(e(\cdot )\) denotes a performance metric, such as the root mean square error (RMSE), computed over the instances satisfying the condition.
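A direct NumPy implementation of Eq. (6) might look as follows; RMSE is used for \(e(\cdot )\), and the default \(\gamma \) of 10 min is only an illustrative placeholder.

```python
import numpy as np

def daes(y_true, y_pred, gamma=10.0):
    """Dissatisfaction-aware asymmetric error score, Eq. (6).

    Sums, over the two dissatisfying conditions (underestimation, eps > 0,
    and overestimation beyond gamma minutes, eps < -gamma), the RMSE of the
    errors meeting the condition weighted by that condition's ratio rho.
    """
    eps = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    score = 0.0
    for mask in (eps > 0, eps < -gamma):
        if mask.any():
            rho = mask.mean()                        # ratio of the condition
            rmse = np.sqrt(np.mean(eps[mask] ** 2))  # e(.) as RMSE
            score += rho * rmse
    return score
```

For instance, with true waiting times [10, 10, 10, 10] min and predictions [5, 25, 10, 10], one prediction underestimates by 5 min and one overestimates by 15 min, giving \(DAES(10) = 0.25 \cdot 5 + 0.25 \cdot 15 = 5.0\).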

Fig. 3 Outpatient visiting process and operational process in the hospital

Table 1 Example of the operational event log dataset

4 Application results

In this section, we present case studies of an endocrinology metabolism (EM) department and a neurosurgery (NS) department in one of the largest hospitals in South Korea. Our target hospital is located in a metropolitan city with more than 3 million residents. In addition, more than 2 million people living in nearby cities visit this hospital. As a result, approximately 900,000 outpatients visit this hospital annually. We initially applied our framework to the EM department to illustrate the use of the proposed approach. The EM department is one of the most crowded departments in this hospital, with approximately 200 patients visiting each day. Patients experience a relatively long waiting time for consultation, averaging approximately 40 min, which has been identified as the main cause of dissatisfaction with EM outpatient services. Therefore, we applied our framework to develop a satisfaction-oriented and accurate waiting time prediction model and analyzed the effects of various features on waiting time.

Table 2 Description of features

4.1 Data collection

As medical workers (e.g., nurses and physicians) provide services to patients, operational data such as the status of patients and log histories of workers are automatically recorded in the HIS. Specifically, the HIS of the EM department in our target hospital includes a text-to-speech-based electronic calling system that allows workers to call patients into the consultation room. Thus, related operational event log records (e.g., call times and call events) were collected.

Figure 3 shows the outpatient process and corresponding operational processes recorded in the HIS of the EM department. When a patient at the front desk requests to register for medical consultation after arriving at the hospital, a nurse collects the patient’s personal information and checks his/her status. According to the patient’s appointment status and necessity of any preliminary examination, the nurse determines when to add the patient to the queue after registration. Typically, most patients are queued immediately after registration; however, patients who require a preliminary examination are queued after an examination. Therefore, we define the start of waiting (T1) as the moment at which patients were queued and recorded in the HIS. The moment when a physician calls the patient for consultation is regarded as the start of consultation (T2). Moreover, the moment when the physician calls the next patient after completing the consultation is regarded as the end of the consultation (T3) and the start of the consultation of the next patient (T2’). Table 1 presents a dummy dataset for describing the operational event log data stored in the HIS server.

Waiting for outpatient services occurs before consultation; thus, the waiting time is defined as the time difference between T2 and T1. The consultation time is the period in which the patient consults with a physician; hence, it is defined as the time difference between T3 and T2. For this research, we extracted the sample operational event log data from 06/01/2020 to 09/30/2020 for 7,709 outpatients who visited the EM department of our target hospital. Data collection for this period was approved by the hospital’s Institutional Review Board (IRB) for research purposes only, and the data did not include patient identification information.
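Using the timestamp definitions above, the two target quantities can be derived directly from the event log. The timestamps below are invented for illustration, mirroring the dummy rows of Table 1.

```python
from datetime import datetime

fmt = "%Y-%m-%d %H:%M:%S"
t1 = datetime.strptime("2020-06-01 09:05:00", fmt)  # T1: patient enters the queue
t2 = datetime.strptime("2020-06-01 09:47:00", fmt)  # T2: physician calls the patient
t3 = datetime.strptime("2020-06-01 09:55:00", fmt)  # T3: next patient is called

waiting_min = (t2 - t1).total_seconds() / 60   # waiting time = T2 - T1
consult_min = (t3 - t2).total_seconds() / 60   # consultation time = T3 - T2
print(waiting_min, consult_min)  # 42.0 8.0
```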

4.2 Feature engineering

Through literature reviews and discussions with nurses and physicians, we identified four factors that represent the context of waiting and affect patients’ waiting times: queue, patient, time, and physician [7, 18, 38, 39]. Moreover, we defined the features that represent the properties of these factors, which could be used to provide an explanation to the patients. Notably, the features were measured using only information collectible when the patient arrived at the front desk. Table 2 introduces the factors, their corresponding feature labels, and their descriptions. A detailed description of the feature engineering process is provided in the following subsections. The explanation and interpretability of the model depend on the input features. Thus, the features must be carefully selected to be suitable and relevant, such that medical workers can be convinced of the interpretation results and patients can understand the explanations given by the workers. In view of this issue, we determined the features after interviews with medical workers and observations of patient flow in outpatient services.

4.2.1 Characteristics of the queue

We defined the features representing the characteristics of the queue as the length of the queue and proportion of each patient type in the queue. The length of the queue has generally been considered a queue feature that is most relevant for waiting time prediction [7, 10, 40, 41]. Moreover, according to queuing theory [38], the average waiting time is positively correlated with queue length; thus, queue length is regarded as a necessary feature. We confirmed through discussions with nurses that patient type could affect the consultation time. For example, a returning patient often has a relatively short consultation time compared with a newly visiting patient. However, a patient’s waiting time could also be affected by the sum of the consultation times of the patients in the queue of the physician to whom they were assigned. Therefore, the proportions of each patient type in the queue were added as features related to the queue.

4.2.2 Characteristics of the patient

Patient characteristics influenced the order in which the nurses assigned patients to the outpatient care service of our target hospital. The nurses at the hospital had patients with appointments preferentially queued ahead of patients without an appointment. In other words, patients who made appointments were placed in the queue immediately after registration, whereas patients without an appointment were delayed in entering the queue. These delays could lead to an increase in waiting time. Therefore, whether a patient had an appointment was selected as a feature to represent the patient’s characteristics.

Fig. 4 Trend in waiting time distribution by registration time

Table 3 Descriptive statistics of the dataset

4.2.3 Characteristics of time

Our target hospital operates the outpatient service from 09:00 to 17:30, excluding lunchtime from 12:00 to 13:00. The service is organized into morning and afternoon consultations, with registration opening at 08:00. Several observations of the service flow revealed that the front desk tended to become busier over time. Specifically, patient intake flowed smoothly when each consultation period began, that is, in the early morning (between 08:00 and 10:00) and right after lunchtime (between 13:00 and 15:00). However, toward the end of each consultation period, patient intake intensified and became the most congested. This tendency oversaturated the nurses’ workload because it required them to process patients’ registrations immediately. Such an abnormal process at certain times could delay nurses’ work, such as registering patients and adding them to the queue, which could in turn increase patient waiting times. Thus, time zone features were utilized as proxy variables for this phenomenon. Figure 4 illustrates the trend in the distribution of waiting time by registration time. The waiting time is nonstationary, demonstrating its dependence on the registration time, which aligns with our observations. As a result, we set the periods between 08:00 and 10:00 and between 13:00 and 15:00 as the smooth flow time zone.

4.2.4 Characteristics of the physician

As described previously, the sum of the consultation times of previous patients consulted by the same physician could affect the waiting time of a patient. These consultation times would be influenced by the speed of the physician’s consultation because the distribution of consultation times could differ even among physicians who diagnose and treat the same disease in the same department. In addition, the interval between consecutive consultations affects the waiting time. This interval is inevitable, but it can sometimes be long because of personal factors, such as the physician taking phone calls or attending to other tasks. Long intervals increase waiting times for subsequent patients and decrease the efficiency of the entire consultation service process. Therefore, we defined two features as the characteristics of a physician: the speed and efficiency of each physician’s consultation. Speed is the average consultation time of a physician within \(\tau \) min prior to patient registration, whereas efficiency is the number of consulted patients within \(\tau \) min prior to patient registration. In this study, we set \(\tau \) to 60. Speed and efficiency would be exactly inversely related only if the physician consulted without any break time; however, this situation is rare in practice due to the inevitable intervals.
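One plausible reading of these two definitions is sketched below; the event-log format and the rule that a consultation counts if it ended inside the \(\tau \)-minute window are our assumptions for illustration, not the paper's exact implementation.

```python
from datetime import datetime, timedelta

def physician_features(consult_log, reg_time, tau_min=60):
    """Speed and efficiency of a physician in the tau minutes before registration.

    consult_log : list of (start, end) datetimes of completed consultations
    Returns (speed = average consultation minutes, efficiency = patient count),
    counting consultations that ended within the window (an assumption).
    """
    window_start = reg_time - timedelta(minutes=tau_min)
    recent = [(s, e) for s, e in consult_log if window_start <= e <= reg_time]
    efficiency = len(recent)
    if efficiency == 0:
        return 0.0, 0
    speed = sum((e - s).total_seconds() / 60 for s, e in recent) / efficiency
    return speed, efficiency

log = [
    (datetime(2020, 6, 1, 8, 0),  datetime(2020, 6, 1, 8, 30)),   # outside window
    (datetime(2020, 6, 1, 9, 10), datetime(2020, 6, 1, 9, 20)),   # 10 min
    (datetime(2020, 6, 1, 9, 30), datetime(2020, 6, 1, 9, 45)),   # 15 min
]
speed, efficiency = physician_features(log, datetime(2020, 6, 1, 10, 0))
print(speed, efficiency)  # 12.5 2
```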

4.3 Data preprocessing

After feature engineering, we constructed a dataset comprising nine observed variables and a target variable (i.e., waiting time). We divided the dataset into training and test sets at a ratio of 8:2 while maintaining a consistent ratio of instances per physician. Given that the waiting time showed a right-skewed distribution, we applied a square root transformation. The interquartile range method was then used to remove outliers. Furthermore, we applied min-max normalization to the observed variables to prevent the effect of certain large-scale variables. Preprocessing statistics were computed on the training dataset, and the resulting thresholds were applied to the test dataset. After preprocessing, 8,970 instances of the training dataset and 2,250 instances of the test dataset remained, which were used for the training and evaluation of the ML models, respectively. Table 3 presents the statistics for each variable in the dataset.
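The preprocessing chain above (square-root transform, IQR outlier removal, and min-max normalization, all fitted on the training split only) can be sketched as follows; the synthetic data merely stands in for the real HIS dataset.

```python
import numpy as np

rng = np.random.default_rng(0)
y = rng.gamma(shape=2.0, scale=20.0, size=1000)  # right-skewed waiting times (min)
X = rng.random((1000, 9))                        # nine observed variables

# 8:2 split; every threshold below is fitted on the training part only.
n_train = int(0.8 * len(y))
X_tr, X_te, y_tr, y_te = X[:n_train], X[n_train:], y[:n_train], y[n_train:]

# 1) Square-root transform to reduce the right skew of the target.
y_tr_s, y_te_s = np.sqrt(y_tr), np.sqrt(y_te)

# 2) Interquartile-range outlier rule, computed on the training target.
q1, q3 = np.percentile(y_tr_s, [25, 75])
lo, hi = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
keep_tr = (y_tr_s >= lo) & (y_tr_s <= hi)
keep_te = (y_te_s >= lo) & (y_te_s <= hi)

# 3) Min-max normalization with training-set minima and maxima.
x_min, x_max = X_tr.min(axis=0), X_tr.max(axis=0)
X_tr_n = (X_tr[keep_tr] - x_min) / (x_max - x_min)
X_te_n = (X_te[keep_te] - x_min) / (x_max - x_min)
```

Fitting the IQR bounds and normalization ranges on the training split alone avoids leaking test-set information into the model, which is why the thresholds are reused rather than recomputed for the test set.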

4.4 Prediction of waiting time for deriving an accurate relationship

We compared the prediction performance of different ML models to identify the model that best represented the relationship between waiting time and observed features. Linear models included LR and LR with regularization terms: elastic net (Elastic). Nonlinear models included support vector machines (SVM), RF [42], light gradient boosting machines (LightGBM) [43], extreme gradient boosting (XGBoost) [44], and multilayer perceptron (MLP). We also employed advanced neural network-based models to represent nonlinear relationships well. These models included TabNet [45] and FT-Transformer [46], which have recently demonstrated state-of-the-art prediction performance on several tabular datasets. For hyperparameter tuning, Bayesian optimization with a five-fold cross-validation was conducted. The R-squared (\(R^2\)), mean absolute error (MAE), and RMSE were computed for each model. The root mean square log error (RMSLE), which incurs larger penalties for underestimation, was also computed.
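These four metrics are standard and can be computed without any ML library; the sketch below shows one way. Note that RMSLE penalizes an underestimation more than an equally sized overestimation, which is why it is reported alongside the symmetric metrics.

```python
import numpy as np

def regression_metrics(y_true, y_pred):
    """R^2, MAE, RMSE, and RMSLE for model comparison."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_true - y_pred
    ss_res = np.sum(err ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return {
        "R2": 1.0 - ss_res / ss_tot,
        "MAE": np.mean(np.abs(err)),
        "RMSE": np.sqrt(np.mean(err ** 2)),
        # log1p keeps the metric defined for zero-minute waits.
        "RMSLE": np.sqrt(np.mean((np.log1p(y_true) - np.log1p(y_pred)) ** 2)),
    }
```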

Table 4 presents a comparison of the ML models, with the average results on the test datasets across five training runs with random seeds. LightGBM had the highest prediction accuracy and also the lowest RMSLE. Notably, the LR model, which assumes linearity between service features and waiting times, showed the lowest accuracy compared with the other models, which can represent nonlinear relationships. Owing to its interpretability, the linear model has mainly been used to understand and identify healthcare service operational situations. However, these results indicate that the assumption of linearity fails to accurately capture the relationship, suggesting that a nonlinear assumption is necessary for more precise estimation. As a result, LightGBM was chosen as the model that best represented the relationship between the service features and waiting times. Conversely, the predictive performance of advanced neural network-based models, such as TabNet and FT-Transformer, fell short of the best results achieved by the other ML methods. This result suggests the inherent difficulty of the waiting time prediction problem due to the complex relationship between the features and waiting times, as reported in previous studies [7, 10, 17].

Table 4 Performance comparison of ML models
Fig. 5 Feature importance at the global level

4.5 Understanding the importance of features on waiting time

We used TreeSHAP [47] to interpret the relationship between features and waiting times because LightGBM, based on the tree model, was the best model in our experiments. Figure 5a shows the global importance of each feature that affected the prediction of waiting times based on the SHAP values. The influence of \(Q_L\) was the most significant, followed by \(T_S\) and \(P_{NP}\). However, simple quantitative values could not determine whether these features have a positive or negative effect on waiting times. To identify the overall local importance according to the feature value, a SHAP beeswarm plot sorted by global importance is shown in Fig. 5b. Each point represents a SHAP value for each feature and is colored depending on the feature value. This plot also represents the distribution of values based on the line thickness. According to these results, a longer \(Q_{L}\) results in higher SHAP values, which corresponds to an increase in the predicted waiting times. Through correlation analysis, we found that \(Q_L\) and its corresponding SHAP values had a strong linear relationship (\(r = .972, p < .0001\))Footnote 2. By contrast, other features belonging to the queue factor had a relatively low influence on waiting time. \(Q_N\), \(Q_F\), and \(Q_D\) were distributed at low values and had relatively less influence on decreasing the waiting time. However, waiting times may increase when these values are high. Particularly, \(Q_N\) exhibited the most consistent tendency among the three. Interestingly, this result was consistent with the nurses’ statements that waiting time could increase when patients visiting the hospital for the first time or being consulted in a new department accounted for a large percentage of the queue. When \(A_S\) was 1, its importance values were negative and close to 0, and when \(A_S\) was 0, its importance values were mostly positive. 
Moreover, if the patient registration time fell within the smooth flow time zone (i.e., \(T_S=1\)), the SHAP values were negative; in the opposite case (i.e., \(T_S=0\)), the SHAP values were positive and close to 0. That is, registration in the smooth flow time zone significantly reduced waiting times, whereas registration outside it had a relatively insignificant effect. Finally, \(P_{NP}\) had a significantly negative or positive effect on waiting times when its value was high or low, respectively. Meanwhile, \(P_{ACT}\) had less influence than \(P_{NP}\) but exhibited positive or negative importance as it increased or decreased, respectively.

Fig. 6

Examples of feature dependency

To analyze the effects of single features on the prediction of waiting times and their interactions with other features, feature dependency plots are presented in Fig. 6. The x- and y-axes represent the values of a feature and its corresponding SHAP values, respectively, and the colors represent the values of an interacting feature. We conducted this analysis for all feature combinations; three combinations showed distinct relationships with the largest interaction effects. Figure 6a presents the effects of \(P_{NP}\) and \(Q_L\) on the prediction. As shown in the figure, \(P_{NP}\) was negatively correlated with its corresponding SHAP values (\(r = -.880, p < .0001\)) and had a negative effect on waiting times when its value exceeded 0.421 (8.0 patients). Moreover, the SHAP values for \(P_{NP}\) were higher at higher \(Q_L\) values, regardless of the \(P_{NP}\) values. This result suggests that the effect of \(P_{NP}\) on decreasing waiting time was offset by an increase in \(Q_L\). Figure 6b shows the effects of \(A_S\) and \(Q_L\) on the model output. For most patients who visited with appointments, the SHAP value for \(A_S\) was unaffected, whereas variations in the SHAP value were observed for patients who visited without an appointment. Interestingly, this variation tended to increase with \(Q_L\). Finally, we explored the effects of \(T_{S}\) and \(P_{NP}\). As shown in Fig. 6c, as \(P_{NP}\) increased, the SHAP value for \(T_S\) approached 0; that is, \(P_{NP}\) tended to offset the overall effect of \(T_{S}\) on the predicted waiting times.
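The near-linear dependence reported in these plots (e.g., \(r = .972\) for \(Q_L\) and \(r = -.880\) for \(P_{NP}\)) can be checked by correlating a feature's values with its SHAP column. A minimal sketch with a synthetic SHAP column (the slope and noise level are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
q_l = rng.random(500)  # stand-in for a feature column such as Q_L

# Synthetic SHAP column: roughly linear in the feature plus small noise,
# mimicking the dependence pattern seen in the plots (illustrative only)
shap_q_l = 12.0 * (q_l - q_l.mean()) + rng.normal(0.0, 0.5, 500)

# Pearson correlation between feature values and their SHAP values
r = np.corrcoef(q_l, shap_q_l)[0, 1]
```

A strong \(r\) indicates that the feature's marginal effect on the prediction is close to linear, even though the underlying model is not.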

4.6 Determining the model for waiting time prediction service

We compared the performance of models trained using asymmetric loss functions. For this, we selected LightGBM, which showed the best performance in the above experiment, as the base model. The other ML models trained using asymmetric loss functions could also be utilized for prediction after performance comparisons among the models. The candidates of \(\alpha \) for Quad-Quad and \(\beta \) for LINEX were set as \(\{\)0.5, 0.6, 0.7, 0.8, 0.9, 1.0\(\}\) and \(\{0.1, 0.2, \dots , 0.9, 1.0\}\), respectively. The other hyperparameter tuning settings were the same as those in Section 3.4. Here, we introduced DAES as a score function to identify the optimal set of hyperparameters, including the selection of an asymmetric loss function. DAES evaluates a trained model’s performance by considering the patient dissatisfaction resulting from underestimation and excessive overestimation. We set the \(\gamma \) of DAES to 30 min, reflecting the nurses’ concerns raised in the interview, and \(e(\cdot )\) as the RMSE. Five-fold cross-validation was used to improve reliability with respect to sampling variation. To confirm the robustness of the model’s performance with respect to hyperparameters, we performed five iterations of cross-validation with random seeds and report the average results in Table 5. According to the results, the underestimation ratio \(\rho _{(\epsilon > 0)}\) gradually decreased and RMSE increased as the parameters \(\alpha \) and \(\beta \) increased. Specifically, when a model was trained using Quad-Quad with \(\alpha =1.0\), its accuracy was significantly reduced, although it showed the lowest underestimation ratio. Although a model trained using LINEX with \(\beta =0.1\) achieved high accuracy, it could not prevent underestimation. This finding implies a trade-off between underestimation and accuracy, and neither of these models received a favorable DAES evaluation. 
Conversely, a model trained using Quad-Quad with \(\alpha =0.8\) outperformed the other models in terms of DAES, indicating a proper balance between underestimation and accuracy. These results demonstrate that DAES allowed for the evaluation of goodness of fit by considering this trade-off.
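The two asymmetric losses can be sketched as follows, taking \(\epsilon = y_{\text{true}} - y_{\text{pred}}\) so that \(\epsilon > 0\) is underestimation. The parameterization below (weight \(\alpha\) on underestimation in Quad-Quad, asymmetry parameter \(\beta\) in LINEX) is one common form and is assumed here rather than taken from the paper's definitions; the LightGBM-style custom objective is likewise a sketch:

```python
import numpy as np

def quad_quad(eps, alpha=0.8):
    """Asymmetric quadratic loss: weight alpha on underestimation (eps > 0),
    1 - alpha on overestimation (eps <= 0). Assumed parameterization."""
    w = np.where(eps > 0, alpha, 1.0 - alpha)
    return w * eps ** 2

def linex(eps, beta=0.5):
    """LINear-EXponential loss: exponential penalty for underestimation,
    roughly linear penalty for overestimation. Assumed parameterization."""
    return np.exp(beta * eps) - beta * eps - 1.0

def quad_quad_objective(y_true, y_pred, alpha=0.8):
    """Gradient/Hessian w.r.t. y_pred, in the shape LightGBM's sklearn API
    expects from a custom objective (nondifferentiability at eps=0 ignored)."""
    eps = y_true - y_pred
    w = np.where(eps > 0, alpha, 1.0 - alpha)
    grad = -2.0 * w * eps   # dL/dy_pred = -dL/d(eps)
    hess = 2.0 * w
    return grad, hess

# At alpha=0.8, underestimating by 2 min costs 4x more than overestimating by 2
print(quad_quad(np.array([2.0, -2.0]), alpha=0.8))  # [3.2 0.8]
```

Under this parameterization, \(\alpha = 0.5\) (or small \(\beta\)) recovers a nearly symmetric loss, matching the trend in Table 5 where larger \(\alpha\) and \(\beta\) trade accuracy for a lower underestimation ratio.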

Table 5 Cross-validation results with different loss functions
Fig. 7

Comparative results across the degrees of asymmetry by score functions

In general, RMSE is utilized as a score function for prediction models, with a primary emphasis on accuracy. To ascertain the effectiveness of DAES as a score function for waiting time prediction considering dissatisfaction, we conducted a comparative analysis of the performance of the best models determined by DAES and RMSE across the degrees of asymmetry. The comparative results are presented graphically in Fig. 7. For both score functions, an increase in the degree of asymmetry consistently reduced the underestimation ratio, accompanied by an increase in RMSE (i.e., a decrease in accuracy). This observation reaffirms the inherent trade-off between underestimation and accuracy. Notably, in the case of DAES, the reduction in the underestimation ratio was more pronounced than in the case of RMSE. Given that RMSE emphasizes accuracy alone and disregards the harm of underestimation, it may not be the most suitable choice for this task. By contrast, DAES demonstrated its effectiveness in preventing underestimation. In other words, utilizing DAES as a score function facilitates the identification of suitable models that treat the prevention of underestimation as a desirable attribute.
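The quantities this comparison rests on, the underestimation ratio \(\rho _{(\epsilon > 0)}\), the base error \(e(\cdot)\) (RMSE here), and the ratio of overestimations exceeding \(\gamma\), can be computed as below. The exact formula combining them into DAES is defined in the paper's methods section; the combined score in this sketch is only an illustrative stand-in and is labeled as such:

```python
import numpy as np

def dissatisfaction_diagnostics(y_true, y_pred, gamma=30.0):
    """Error diagnostics on waiting times (minutes).
    eps > 0: underestimation; eps < -gamma: excessive overestimation."""
    eps = y_true - y_pred
    rmse = float(np.sqrt(np.mean(eps ** 2)))
    rho_under = float(np.mean(eps > 0))          # rho_(eps > 0)
    rho_over_excess = float(np.mean(eps < -gamma))
    # Illustrative combined score (an assumption, NOT the paper's DAES
    # formula): inflate the base error by both dissatisfaction ratios
    score = rmse * (1.0 + rho_under + rho_over_excess)
    return rmse, rho_under, rho_over_excess, score

y_true = np.array([30.0, 45.0, 20.0, 60.0])
y_pred = np.array([25.0, 50.0, 55.0, 58.0])  # mix of under-/overestimation
print(dissatisfaction_diagnostics(y_true, y_pred, gamma=30.0))
```

Any score built this way is minimized only when the model is both accurate and asymmetric enough to avoid underestimation, without drifting into overestimations beyond \(\gamma\), which is the balance Table 5 and Fig. 7 evaluate.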

Table 6 Evaluation results of the best models determined by the parameters of DAES
Fig. 8

Examples of feature importance at the local level

DAES penalizes not only underestimation but also excessive overestimation. The penalty for excessive overestimation is applied only when the overestimation exceeds a predefined time threshold \(\gamma \), which practitioners can adjust as a parameter. To confirm the ability of DAES to control excessive overestimation, we evaluated the best models on the test datasets. These best models were trained with asymmetric loss functions and tuned using different DAES parameters. The candidate values for \(\gamma \) were set as {10, 20, 30, 40, 50, 60}. We also evaluated the performance of a model trained with MSE across the DAES parameter settings to examine the compatibility between score and loss functions. The results are reported in Table 6. As the time threshold \(\gamma \) increased, a loss function with greater asymmetry was selected, leading to a decrease in the underestimation rate and a shift in the model’s prediction distribution toward overestimation. In other words, by adjusting the permissible range of overestimation errors, the model’s prediction distribution could be kept from leaning considerably toward overestimation. Hence, with the parameter \(\gamma \), practitioners can control the degree of the model’s excessive overestimation. However, when \(\gamma \) exceeded 40 min, no further changes in the prediction distribution were observed. The reason is that the time threshold \(\gamma \) exceeded the model’s prediction error range, and the penalty for overestimation became negligible. Therefore, practitioners should consider the model’s prediction range when determining the value of \(\gamma \). Furthermore, when employing MSE, which is a symmetric loss function, the performance remained consistent regardless of the DAES parameter variations. This result suggests that utilizing DAES and an asymmetric loss function concurrently can be effective for a predictive model considering outpatient dissatisfaction.

4.7 Providing the expected waiting time and its explanation

SHAP values explain how the predicted waiting times were affected by each feature. Thus, we used TreeSHAP to interpret the LightGBM model trained using Quad-Quad with \(\alpha =0.8\), which showed the best performance in terms of DAES(30). Figure 8 shows examples of feature importance at the local level. The square-root-transformed waiting time is presented in colored plots, where red and blue represent features that increase or decrease the prediction value, respectively, and the length of each feature’s bar corresponds to the extent of its effect on the SHAP value. As shown in Fig. 8a, the model predicted a value of 6.90 (47.61 min) for an instance whose actual value was 5.88 (34.52 min). The most important feature in this case was \(P_{NP}\), with a value of 0.684 (13 patients), which decreased the prediction value. Moreover, the \(P_{ACT}\) value of 0.241 (approximately 3.6 min) decreased the waiting time. By contrast, \(Q_L\) and \(T_S\), with values of 0.259 (7 patients) and 0, respectively, contributed to an increase in waiting times. Therefore, the physician’s high efficiency and short consultation time reduced the expected waiting time despite the long queue and registration outside the smooth flow time zone (\(T_S=0\)). As shown in Fig. 8b, the predicted value was 7.81, corresponding to 61.00 min, whereas the actual waiting time for this instance was 53.52 min. \(Q_L\), with a value of 0.296 (8 patients), had the greatest effect on increasing the prediction. In addition to registration outside the smooth flow time zone, a newly visiting patient in the queue (i.e., \(Q_N=0.125\); one patient) and the long consultation times of the assigned physician (i.e., \(P_{ACT}=0.435\); 6.48 min) contributed to the increase. Although \(P_{NP}\) contributed to decreasing the prediction value, the effect was somewhat insignificant because its value (0.368; 7 patients per hour) was relatively small compared with that in Fig. 8a. Lastly, Fig. 8c shows an instance for which the model predicted a value of 5.19, corresponding to 26.94 min, against an actual waiting time of 26.18 min. \(Q_L\) and \(T_S\), with values of 0.111 (3 patients) and 1, respectively, contributed to a decrease in waiting times. In the queue, newly visiting and returning patients had an equal ratio (i.e., \(Q_{N}=Q_{R}=0.333\)); however, the importance of the newly visiting patient ratio was marginally larger than that of the returning patient ratio. Specifically, the ratio of newly visiting patients contributed to an increase in waiting time, whereas the ratio of returning patients had the opposite effect, contributing to a decrease. Similarly, the variables related to the physician factor had effects of similar absolute magnitude on waiting time: the physician’s slightly fast consultation speed (i.e., \(P_{ACT}=0.373\); 5.56 min) contributed to a decrease, whereas low consultation efficiency (i.e., \(P_{NP}=0.105\); 2 patients per hour) led to an increase.
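The arithmetic behind these local explanations can be sketched as follows: SHAP contributions are additive on the square-root scale, and squaring the result recovers minutes. The base value and per-feature contributions below are hypothetical; only the resulting prediction (7.81, i.e., about 61 min, as in Fig. 8b) comes from the text:

```python
# Local SHAP decomposition: prediction = base value + sum of per-feature
# SHAP values, all on the square-root-of-minutes target scale.
base_value = 6.00                  # hypothetical expected model output
contributions = {                  # hypothetical per-feature SHAP values
    "Q_L": 1.20, "P_ACT": 0.45, "Q_N": 0.30, "P_NP": -0.14,
}
pred_sqrt = base_value + sum(contributions.values())
pred_minutes = pred_sqrt ** 2      # undo the square-root transform
print(round(pred_sqrt, 2), round(pred_minutes, 2))
```

Because the contributions are on the transformed scale, the same SHAP value shifts the waiting time in minutes by more when the baseline prediction is larger.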

Table 7 Performance comparison of ML models for the NS department

Providing this type of information to nurses will enable them to inform patients of the expected waiting time and explain the reasons behind it. For example, a nurse can tell a patient: "Your waiting time could have been longer because 13 patients are currently waiting and you registered in a busy time zone, but your doctor has been consulting quickly for the last hour, so your expected waiting time is about 50 minutes", or "Your waiting time is predicted to be about 27 minutes. Although there are only three patients ahead of you, an increase in waiting time is expected because a newly visiting patient is in the queue. Also, your attending physician is conducting consultations quickly, but they seem to be slightly delayed due to unforeseen personal reasons, so a slight increase in waiting time is expected". With this, we expect patients to understand their predicted waiting time and the factors that they could not observe in the waiting room.

Fig. 9

Feature importance at the global level for the NS department

Fig. 10

Examples of feature dependency for the NS department

4.8 Additional case study on the NS department

We applied our framework to the NS department of the same hospital to validate its applicability and compare the analysis results between different departments. For the NS department, the mean waiting time was 29.31 min with a standard deviation of 20.55 min, whereas the mean and standard deviation of consultation time were 5.48 min and 3.31 min, respectively. We extracted event logs from the HIS for the same period. After discussion with medical workers and observations of patient flow, we set the parameters for data construction: specifically, \(\tau \) was set to 60, and the periods for \(T_{S}\) were set as the start times of the consultation sessions (i.e., between 08:00 and 10:00 and between 13:00 and 15:00). After data preprocessing, 3,773 data instances from 2,741 patients were utilized to train and evaluate the ML models.

The results of the performance comparison are presented in Table 7. LightGBM exhibited superior performance in the NS department as well. Accordingly, to understand the importance of features on waiting time in the NS department, we applied TreeSHAP to the LightGBM model; the bar plot and beeswarm plot are displayed in Fig. 9. As shown in Fig. 9a, the queue length had the most significant effect on waiting times, aligning with the result for the EM department. In the NS department, a doctor’s consultation efficiency had a greater influence on waiting times than the smooth flow time zone did. Notably, patients’ appointment status had almost no effect on waiting times. In addition, the effects of the newly visiting patient ratio and the department first-time outpatient ratio within the waiting queue factor were more pronounced than in the EM department. These results suggest that the importance of each feature related to waiting time can differ based on the unique clinical settings of each medical department. Furthermore, as illustrated in Fig. 9b, the trend of changes in the SHAP values for each feature value in the NS department aligned with the EM department’s analysis results. These consistent results confirm the applicability of our analytical approach using IML.

Figure 10 displays the feature dependency plots specific to the NS department. As illustrated in Fig. 10a, the relationship between the SHAP values for \(P_{NP}\) and \(Q_{L}\) was similar to the pattern observed in the EM department. However, unlike in the EM department, the effect of \(P_{NP}\) decreased as \(Q_{L}\) increased when \(P_{NP}\) was equal to 0. Thus, in the NS department, the increase in \(Q_{L}\) offset the effect of \(P_{NP}\) on waiting times. Figures 10b and c present the dependency plots of the combinations that showed distinct relationships in the EM department; these relationships were absent in the NS department. Conversely, a different relationship, not found in the EM department, was observed in the NS department. Figure 10d reveals that as \(Q_{L}\) increased, the effect of \(T_{S}\) on waiting times became more pronounced. Hence, when \(Q_{L}\) was short, patient registration within the smooth flow time zone did not significantly affect the waiting time; however, when \(Q_{L}\) was long, such registration had a greater effect on reducing waiting times.

Table 8 Evaluation results of the best models determined by the parameters of DAES for the NS department

To identify the most suitable model for a waiting time prediction service in the NS department considering patient dissatisfaction, we applied our approach using asymmetric loss functions and DAES to train the models and identify the optimal combination of hyperparameters. Table 8 shows the evaluation results of models trained with asymmetric loss functions across different DAES parameters. Given that the waiting times in the NS department were relatively shorter than those in the EM department, we added {15, 25} to the candidates for \(\gamma \). The results showed that, as the time threshold \(\gamma \) increased, a loss function with more pronounced asymmetry was chosen, thereby decreasing the model’s underestimation ratio and increasing its overestimation ratio. This trend aligned with the results from the EM department. For the NS department, models with \(\gamma \) values beyond 25 min showed no further changes in the prediction distribution as \(\gamma \) increased. Thus, practitioners in the NS department of this hospital should set the DAES threshold \(\gamma \) within 25 min.

5 Concluding remarks

In practical settings, the use of IML and an asymmetric loss function for outpatient waiting time prediction offers several advantages. First, the use of IML enabled us to identify the effects of different service operational features on waiting times. The quantitative output of IML can provide valuable insights for decision makers to understand the underlying features of hospital systems and make informed improvements to reduce waiting times. In practice, on the basis of the analysis conducted with IML, we recommended that the EM department encourage patients to make appointments; we also suggested an effective allocation of physicians during the smooth flow time zone and an increase in the number of medical workers. For the NS department, we recommended inducing newly visiting and first-visiting patients to schedule their consultations within a smooth flow time zone and assigning physicians effectively in consideration of their consultation styles. Second, ML-based waiting time prediction models must provide predictions that align with patients’ perception and satisfaction. This study addressed this requirement by developing a method that incorporates asymmetric loss functions and DAES to predict dissatisfaction-aware waiting times. The use of asymmetric loss functions kept the model from underestimating the expected waiting times, which, in turn, reduced the likelihood of patients experiencing dissatisfaction when actual waits exceed the predicted times. In addition, the use of DAES ensures that the model makes accurate predictions with suitable asymmetry, reducing the risk of excessive overestimation inherent in asymmetric loss functions. It also provides practitioners with the flexibility to adjust the DAES parameters, allowing for tailored asymmetry in consideration of outpatient dissatisfaction.

To the best of our knowledge, this study is the first attempt to use IML to accurately estimate the extent to which focal operational features affect outpatient waiting times. The majority of previous studies aiming to understand operational situations and identify service improvements in healthcare were limited in accurately capturing the complex relationship between features and waiting times, primarily due to the linearity assumption. Our study empirically reveals the necessity of considering nonlinear relationships between waiting times and operational features in outpatient care services. By employing IML, we overcome the limitations of previous research, enabling a more precise estimation and interpretation of nonlinear relationships. Thus, our study contributes to filling this research gap, which has not yet been addressed in healthcare service management. Moreover, this study represents a novel approach to employing an asymmetric loss function and DAES for training a waiting time prediction model, specifically aimed at reducing both underestimation and excessive overestimation. Thus, our study also contributes to bridging the research gap by adopting patient-centered philosophies, which have been previously overlooked in the literature on outpatient waiting time prediction.

The methodological contribution and practical applicability of this study were validated through a case study estimating the importance of operational features affecting waiting times in the EM and NS departments’ outpatient services of one of the largest hospitals in South Korea. To highlight the practical contributions of our framework and provide concrete examples of its application, we present several valuable applications in the healthcare setting as follows.

(Real-time patient notifications) One key application is the provision of real-time patient notifications. Many patients have expressed frustration with not knowing how long they may need to wait for their consultations. Our case studies demonstrate that our methodology accurately predicts waiting times in 1-min units, providing a higher level of granularity than traditional 15- and 30-min units, and it also offers explanations for its predictions. Providing patients with precise updates about their consultation can enhance the overall patient experience, reducing anxiety and dissatisfaction.

(Operations improvement) From the perspective of outpatient operations, our framework allows healthcare providers to better understand the factors contributing to longer waiting times. On the basis of the advantages of IML discussed earlier, we can identify the specific reasons behind extended waiting times. This knowledge allows for strategic interventions, such as adding extra nurses or adjusting appointment schedules, to reduce waiting times. The ability to determine the causes of longer waiting times is a significant contribution to operational efficiency.

(Minimizing patient dissatisfaction) Our research emphasizes the importance of minimizing potential patient dissatisfaction due to inaccurate predictions. By incorporating an asymmetric loss function and DAES into our model, we can reduce the effects of inaccurate predictions on patients. This not only enhances the patient experience but also reduces the risk of dissatisfaction, which can negatively affect the reputation of healthcare service providers.

In conclusion, we believe the proposed framework will serve as a valuable tool for monitoring, managing, and enhancing hospital outpatient services.