INTRODUCTION

Laboratory testing is an important contributor to health care expenditure,1,2,3 and yet up to 42% of laboratory testing could be considered wasteful.4,5,6,7 Redundant testing has been estimated to waste up to 5 billion USD annually in the USA8,9,10 and Canadians receive over 1 million unnecessary tests each year.4 Unnecessary diagnostic testing performed without appropriate consideration of pretest probability can generate false-positive results, which drives further unnecessary tests and wasted health care dollars.3 Therefore, redundant laboratory testing could result in substantial downstream costs even if the individual tests themselves are relatively inexpensive. The motivation to improve testing practices extends beyond cost savings. Excessive blood work may result in additional unintended consequences which include patient discomfort, hospital-acquired anemia, unnecessary transfusions, prolonged hospitalizations, over-investigation of false positives, and increased mortality for patients with cardiopulmonary diseases.3,11,12 Efforts to reduce the frequency of laboratory tests can improve patient satisfaction and reduce costs without worsening patient outcomes, readmission rates, critical care utilization, or mortality.3,13,14

Several organizations attempt to address unnecessary laboratory testing. Choosing Wisely is a campaign to help clinicians and patients engage in conversations about reducing unnecessary tests, treatments, and procedures.15 Several of its recommendations focus on appropriate utilization of laboratory testing. The American Association of Blood Banks recommends against performing serial blood counts on clinically stable patients.16 The Society for the Advancement of Blood Management recommends against performing laboratory blood testing unless clinically indicated or necessary for diagnosis/management in order to avoid iatrogenic anemia.17 Choosing Wisely Canada recommends internists to avoid ordering repeated complete blood count and chemistry testing in the face of clinical and lab stability in the inpatient setting.18 Although there is some evidence to guide this in the perioperative setting, there is minimal existing evidence in the medical inpatient population to guide either indications or frequencies for use of common laboratory tests.19

Without clear guidance on how frequently to order common laboratory tests and under what circumstances, optimization of inpatient laboratory testing is difficult. This is particularly relevant for complex patients admitted under internal medicine who are often responsible for significant resource use in teaching hospitals.20 In addition, there is substantial inter-physician variability with respect to practices and patterns of testing even within the same practice setting.21 The aim of this study was to develop consensus-based frequency recommendations for the use of common laboratory tests in routinely encountered clinical scenarios on general medical units. This contributes to filling the current gap of evidence-based guidelines for laboratory testing.

METHODS

Study Design

To reach consensus on frequency recommendations on common laboratory tests, we used the modified Delphi22 approach. Consensus was defined as > 80% of agreement by experts on the same frequency choice. This cutoff is in keeping with current recommendations for consensus-based studies.23 Consensus was considered weak when it was reached by combining two adjacent frequency categories. We determined a priori to conduct no more than three rounds of voting.24 All rounds of this closed survey were conducted using an online survey tool25 between November 2017 and March 2018. Each expert who completed all rounds of the survey received $50 CAD in honorarium.

Participants

An expert considered for inclusion in this study was an internist who had been in independent practice in a Canadian medical unit for at least 5 years and who had made significant contributions to the fields of quality improvement, medical education, or both. This included serving as an examiner at the Royal College of Physicians and Surgeons of Canada, authoring peer-reviewed publications in either area, and/or holding educational or quality improvement leadership roles within their hospitals. Medical units considered for this study are Canadian clinical teaching units, which are non-critical care general internal medicine inpatient teaching wards.26 Our expert panel was limited to this setting because similar types of patients are cared for in a similar fashion across the country. This homogeneity assists with consensus building given the shared mental model held by experts across the country.

As only a small subset of academic internists in Canada have known expertise in the fields of quality improvement and/or medical education, we used a non-probabilistic snowball technique to form our expert panel, by first targeting experts known to our authorship team.27 Targeting a panel size of at least 10, consistent with guidelines on consensus methods,21 and assuming a response rate of 25%, we sought to invite a minimum of 40 experts to participate in this panel through an e-mail invitation.

Laboratory Test Selection and Survey Development

We performed a review of laboratory test use in medical units in four adult tertiary care hospitals in Western Canada to identify the highest cost contributors to laboratory test expenditure (Appendix Table 4). We decided to focus on the top contributors, i.e., complete blood count and differential (CBC), electrolytes, renal studies (creatinine and urea), extended electrolytes (calcium, magnesium, phosphate), and coagulation studies [international normalized ratio (INR) and partial thromboplastin time (PTT)]. Although not a high-cost contributor, we also included creatine kinase in our survey because its narrow range of utilization makes it a good candidate for an attempt to derive consensus-based recommendations for use. Research team members who were general internists with expertise in quality improvement and survey design/consensus methods (A.A. and I.M. respectively), with input from local internists, compiled an initial survey draft of commonly encountered clinical scenarios (total 123) where the above laboratory tests may be ordered on medical units. The scenarios referred only to general medical units in Canada and did not include scenarios which might require an intensive care unit admission.

We piloted the initial draft survey on 12 internists who were not part of the expert panel and solicited feedback on survey usability and technical functionality. Based on their input regarding common clinical scenarios, we reduced the number of scenarios to 45 and the number of laboratory tests to six in the final survey to optimize survey length while still focusing on common scenarios for the most utilized tests. These six tests were CBC (13 scenarios), electrolytes (14 scenarios), creatinine (7 scenarios), urea (3 scenarios), INR (5 scenarios), and PTT (3 scenarios). For each of the 45 clinical scenarios, we asked experts how frequently they would recommend ordering the associated laboratory test on a time scale that included the following selections: every 2–4 h, every 6–8 h, twice a day, daily, every 2–3 days, weekly, once for diagnostic workup, or not indicated.

Participants were provided space for written feedback in the survey. Scenarios where expert comments demonstrated a requirement for more contextual clarity were modified and included in round 2. For each scenario, frequency range choices that had received no votes in round 1 were removed for round 2. We provided statistical group response feedback to participants between rounds including quantitative results (% agreement in prior round for each scenario).24

RESULTS

Sixty-four experts were invited to participate in this panel. A total of 36 members participated representing 13 of 17 (76%) Canadian academic institutions and 8 of the 10 Canadian provinces (Table 1). The majority (n = 31, 86%) of the experts were specialists in internal medicine or general internal medicine and the remaining (n = 5, 14%) had additional training in other medicine subspecialties (Table 1).

Table 1 Demographic of the 36 Members of the Expert Panel Convened for the Study

Round One

Of the 45 clinical scenarios included, consensus was reached in nine clinical scenarios, weak consensus was reached for 20 scenarios, and no consensus was reached for the remaining 16 scenarios (Fig. 1, Appendix Table 5).

Figure 1
figure 1

Flow diagram of the consensus process summary.

Round Two

All 36 members participated in this round. A total of 18 scenarios were included (Fig. 1, Appendix Table 6). Of the 20 scenarios that had reached weak consensus in round 1, 6 were modified for round 2. Of the 16 scenarios that had not reached consensus in round 1, 4 were modified and an additional 4 split into two each for round 2. For this round, the frequency options were the following: every 2–4 h, every 6–8 h, twice a day, daily for 3 days followed by reassessment, every 2–3 days, weekly, and none unless diagnostic suspicion.

Of the six scenarios that had been modified from the “weak consensus” pool from round 1, four scenarios now reached consensus4,8,25,28 while the other two remained in the weak consensus category.3,5 Of the total of 12 scenarios that had entered round 2 from the no consensus pool from round 1, we arrived at consensus for four, weak consensus for three, and no consensus for the remaining five scenarios (Fig. 1).

Summary of Consensus

We started with 45 clinical scenarios and for round 2 split up scenarios 1, 6, 14, and 28 leading to a total of 49 surveyed scenarios. Of these, we arrived at consensus on frequency recommendations for 17 scenarios (Table 2), weak consensus for 19 scenarios (Table 3), and no consensus for 13 scenarios (Table 3).

Table 2 Consensus-Based Frequency Recommendation for Scenarios That Reached Consensus or Weak Consensus (over Two Adjacent Frequency Categories)
Table 3 Survey Results on Scenarios Where No Consensus Was Reached

DISCUSSION

In this study, for ordering laboratory investigations on the medical inpatient unit, frequency recommendations were reached for 17 common clinical scenarios and weak consensus was reached for 19 clinical scenarios. Our experts seemed to agree on scenarios that require urgent, daily, or no blood work at all. For example, they agreed that patients with diabetic ketoacidosis should get electrolytes tested every 2–4 h, but for several other conditions daily testing sufficed (e.g., daily creatinine for patients with sepsis, acute kidney injury, or those on nephrotoxic agents; daily electrolytes when abnormalities were anticipated and daily CBC for workup for severely abnormal cell counts). For stable inpatients who are awaiting rehabilitation/transition/placement, our experts agreed that regular blood work was not indicated. In addition, urea was generally thought to be unnecessary for most patients. Lastly, for diagnostic purposes, our experts felt that testing of coagulation parameters once during the hospital stay was sufficient, to be repeated only in the setting of anticipated invasive procedure.

Our experts were less able to agree in the scenarios where testing needs to be done either more than daily or less than daily. For instance, in patients with gastrointestinal bleeding or acute electrolyte abnormalities (sodium/potassium), experts generally agreed that testing should be done more frequently than daily. However, a number of experts pointed out that the exact frequency would depend on numerous contextual factors that the scenario description cannot capture. In scenario 2, looking at the use of CBC for monitoring an actively bleeding patient requiring hemodynamic support, our experts indicated that the exact frequency would depend on the starting hemoglobin, the rate of bleeding, requirements for and response to transfusion, and timing and success of planned interventions. Thus, the suggested frequency may range between every 2–8 h. Similarly in scenarios involving patients with resolving issues, those with less severe or chronic anemia, less severe or chronic thrombocytopenia, in patients on warfarin, and those with stable chronic kidney disease, experts agree that testing should generally be less than daily. Again the exact frequency would depend on other factors. Often, the scenarios where the experts desired more clinical information and context were the ones where there was either weak or no consensus.

Our expert panel, comprised predominantly of general internists, was not able to agree on the utility of urea in diagnosis/monitoring of gastrointestinal bleeding and uremic encephalopathy.

Although we only arrived at consensus in 17 scenarios, the results from the other scenarios (including weak and no consensus) still provide insight into expert recommendations of laboratory test utilization. For instance, in scenario 1b (Appendix Table 6), for a stable inpatient with resolving medical issues, while experts did not agree on how frequently CBC should be performed, all agreed that it should be done no more frequently than every 2–3 days. Hence, even though there was no consensus on a specific frequency window for several scenarios, the frequency range can still be helpful to guide optimization efforts.

Unnecessary blood work is often ordered daily in many institutions.29,30 Several studies acknowledge the paucity of consensus on what comprises appropriate laboratory testing in this population and recognize a need for guideline development.28,31,32,33,34,35 However, little has been done to establish appropriate testing frequencies in medical inpatients possibly because there are too many variables that make the task onerous. We used a geographically diverse expert panel of internists in Canada to provide frequency recommendations for the utilization of six laboratory among medical inpatients. These six tests are known to be highly utilized, both from local data (Appendix Table 4) and in the test utilization literature.13,14,20,29 Even though we arrived at consensus in only 17 scenarios, we believe that knowledge of the range of frequency selections by our expert group can still help with professional development and guide quality improvement efforts to standardize practices. Intervention bundles used to optimize laboratory testing often include an educational component.3 These recommendations could be incorporated into the Choosing Wisely Toolkit and help standardize the educational component of these bundles and be used to set benchmarks for audit and feedback.

There are several limitations to our study. First, our group is composed entirely of internists who work on Canadian medical units. This limits the generalizability of our recommendations outside the Canadian medical teaching unit context. However, medical units serve as teaching units for Canadian internists. We also know that spending habits picked up during residency can persist for years.36 Hence, optimization of laboratory test use in this population and setting may impact the practice pattern of current residents and future internists. Second, a disadvantage of using non-probabilistic sampling strategy is that it may be difficult to assess how representative our sample is compared to all possible relevant experts in the field. Third, the scope of our study was such that we were able to focus only on the common scenarios for highly utilized tests. We did not attempt to comprehensively define all possible indications for each laboratory test. The description of clinical scenarios was general and could not possibly capture all relevant contextual features. Thus, any ensuing recommendations cannot replace nuanced clinical judgment. Fourth, our recommendations are only based on expert opinion-based consensus. We did not grade the strength of our recommendations nor conduct a systematic review on all applications. However, given the lack of evidence on the ideal testing frequency for most of these scenarios, we believe that consensus-based recommendations from experts serve as an important starting point. As additional evidence becomes available on appropriate use of laboratory tests, the current recommendations will need to be updated. Fifth, the frequency recommendations are limited to scenarios commonly seen on general medical units and may not be applicable to specific scenarios encountered on specialized services. Sixth is the issue of representation; we deliberately sought to seek general internists as experts in the field of laboratory test management in the inpatient setting. However, we notice the ambiguity in certain areas (e.g., use of urea for diagnosis of gastrointestinal bleeding or uremic encephalopathy) where additional medical subspecialty representation may have been useful. Lastly, our recommended test order frequency do not take into account automatic test bundling that may be in place for a variety of reasons in specific institutions. For example, it may not be possible to order an INR without a PTT or a sodium and potassium without extended electrolytes. Individual physicians will need to take these practice constraints into consideration.

In conclusion, our expert panel consensus-based recommendations highlight considerate and indication-driven utilization of laboratory testing in the inpatient setting. They are not intended to replace clinical judgment. In the setting of limited evidence in this area, consensus-based recommendations are an important intermediate step as we move towards evidence-based guidelines directing appropriate use of laboratory tests. These recommendations can guide future clinical trials of restrictive versus liberal frequency of laboratory testing to assess their impact on patient-oriented outcomes.