Introduction

As COVID-19 spreads around the world, it causes an influx in the emergency departments of the hospitals [1, 2]. In this context, quick triage of patients towards the COVID+ and COVID− hospital units is essential.

There is a growing interest in the role and appropriateness of chest computed tomography (CT) for the screening, diagnosis, and management of patients with suspected COVID-19 infection. This interest is reinforced taking into account the limited availability of virological testing kits to date, the delays concerning the output of the virological test, the concern for test sensitivity from earlier reports in China, and the growing number of publications describing the CT appearance in the setting of known or suspected COVID-19 infection [3,4,5,6,7].

In our institution, we opted to carry out chest CT as well as virological tests in any patient in multiple emergency departments, for which a hospitalization was planned whatever the reasons, i.e., COVID- or non-COVID-related, in order to orient those patients in COVID+ or COVID− units.

The goal of this study was to analyze the diagnostic performances of the chest CT for COVID-19 compared with reverse transcription polymerase chain reaction (RT-PCR) as method of reference, and to compare the delay in reporting results between the two methods, before orienting the patients towards the appropriate hospital unit.

Material and methods

Population

This multi-departmental monocentric retrospective study was conducted in a large university hospital (Hospices Civils de Lyon (HCL), Lyon, France) from March 3 to April 4, 2020. All CT examinations requested by three general emergency departments of the HCL (Hospital Croix-Rousse, Hospital Edouard Herriot, Hospital Lyon Sud) during this period were extracted from our Radiology Electronic Requesting system. Patients under 18 years old were excluded. A total of 1945 CTs were requested, including 720 chest CTs among which 694 were indicated for COVID-19 detection. All these 694 patients (sex-ratio M/F = 363/331, mean age = 66.4 ± 18.63 years) required a hospitalization based on their clinical symptoms at admission. They all underwent a virological test. The main reasons for admission at the emergency department were classified as follows: pulmonary and otolaryngology, abdominal, syncopal episode, trauma, deterioration of general condition, confusion-coma, stroke, and psychiatric. Presence of dyspnea, caught, rhinitis, throat, and/or chest pain, fever (defined as a temperature above 38 °C), and anosmia were recorded. Patients with none of these signs were classified as “asymptomatic.” Data usage policy of our institution in terms of confidentiality, anonymization, and security was applied for each study and approval was obtained from the ethical committee.

CT protocols and analysis

Chest CTs were performed on different helical CT systems, including Revolution GSI® (GE Healthcare), iCT 256® (Philips Healthcare), Ingenuity CT® (Philips Healthcare), SOMATOM Definition AS® (Siemens Healthineers), SOMATOM Definition AS+® (Siemens Healthineers), and Aquilion Lightning® (Canon Medical Systems). The scanning parameters were as follows: mean tube voltage of 119.3 ± 11.9 kVp (range: 80–140 kVp), mean slice thickness of 2.3 ± 0.8 mm (range: 0.6–3 mm). Scan length, mAs, and field of view were adapted for each patient and clinical indication, including exploration of the abdomen when required. One hundred and twenty-two (122) out of the six hundred and ninety-four (694) examinations (17.6%) were obtained after intravenous administration of iodinated contrast material. The purpose for injection was the detection of a pulmonary embolism in 105 patients and thoracic-abdominal and pelvic exploration in 17 patients. All acquisitions were performed at the end of a deep inspiration.

All the chest CTs were analyzed as part of the clinical workflow by senior radiologists from the HCL, including during night shift. The presence of typical features of COVID disease were identified, including ground glass opacity (GGO), crazy paving, sub-pleural bands of consolidations, reversed halo sign, and lung consolidations as described previously [3, 8,9,10,11,12]. An example of these signs is shown in Fig. 1. On the final report, patients were rated as “Surely COVID+” when presenting with peripheral, bilateral, or multifocal GGO of rounded morphology ± consolidation or crazy paving, reversed halo sign, or sub-pleural bands of consolidations. Patients were rated as “Possible COVID+” when presenting with multifocal, diffuse, peripheral, or unilateral GGO ± consolidation lacking a specific distribution and non-rounded or non-peripheral or with only few very small GGO with a non-rounded and non-peripheral distribution or with atypical findings: large pleural effusion, major lymph node size increase, or bronchiolitis pattern. Patients were rated as “COVID−” when the chest CT was normal or demonstrating another pathology.

Fig. 1
figure 1

Typical images from five different patients COVID-19 positive at virology test: ground glass opacity (a), crazy paving (b), posterior lung consolidations (c) and sub-pleural bands of consolidations (d), reversed halo sign (e)

In order to assess the inter-reader reproducibility of CT scan analysis, 70 randomly selected patients (10% of the total dataset) were re-analyzed by an experienced senior radiologist (15 years of experience in the thoracic imaging field).

The delay between the chest CT request by the emergency department and the validation of the report by the radiologist was recorded for all patients. We also recorded whether the CT was performed during day or night shift.

Virological tests

All patients were tested for SARS-CoV-2 nucleic acid detection by RT-PCR assay on upper and/or lower respiratory samples. Clinical specimens for 2019-nCoV diagnostic testing were obtained in accordance with the World Health Organization guidelines (WHO/COVID-19/laboratory/2020.5). The RT-PCR assays were performed using the RdRp IP2-IP4 primers and probes of the Institut Pasteur protocol, which is used in France for SARS-CoV-2 detection by the National Reference Center for Respiratory Virus Infection and French hospital laboratories [13]. This protocol, detecting two targets in RdRp gene, was adapted on the Panther Fusion® (Hologic) molecular system in the Laboratory of Virology (Institut des Agents Infectieux) of the HLC, for high throughput diagnostic. Two hundred microliters of sample was transferred in the Hologic Panther lysis buffer tube (710 μL) in BSL2 biosafety conditions and then loaded in the machine for nucleic acid extraction and molecular amplification according to the manufacturer’s protocol. The quality effectiveness of the process was checked using an internal control tested for each sample. All amplification curves were checked before validation. RT-PCR tests were all performed independently of the CT findings.

The result of the RT-PCR testing of the first sample was recorded for all patients and was classified as “COVID+” or “COVID−” initial virology diagnostic. In case of multiple samples, in the days following the initial test, the final virological diagnostic was rated as COVID+ if one of the samples was positive.

The delay between the RT-PCR test request by the emergency department and the validation of the report by the virologist was recorded for all patients. It takes in account, the delivery time of the sample from three different hospitals to one lab by shuttle every 1.5 h, the record and the treatment of the sample before analysis. The sample molecular analysis is 2.5-h long and was done, at the time of the study, from 8:30 am to 9:00 pm but not all night long.

Statistical analysis

All statistics were performed using R Statistical Software (Foundation for Statistical Computing, version 3.6.1). Quantitative variables were described as mean ± standard deviation and ranges. Accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were estimated for “Surely COVID+” class as positive CT and “Possible COVID+” and “COVID−” CT classes as negative CT on one hand and “Surely COVID+” and “Possible COVID+” classes as positive CT and “COVID−” class as negative CT on the other hand, both compared with the final RT-PCR results considered as the gold standard. Ninety-five percent confidence intervals were built using the Wilson’s method. Agreement between the CT and the virological diagnostic was measured using kappa coefficient. Inter-reader reproducibility between the initial CT analysis and the analysis from the experienced radiologist on the 70 randomly selected patients was measured using the kappa coefficient.

Differences in the delays between CT examination and virology report were compared with the paired Student t test.

Results

Chest CT and initial virology test results were available for the 694 patients. A flowchart of the study is shown in Fig. 2. Main reasons for admittance together with the COVID-19 relevant symptoms are provided in Table 1. Pulmonary and otolaryngology symptomatology was the main reason for admission at the emergency department for 67% of the patients.

Fig. 2
figure 2

Flowchart of the study

Table 1 Main reasons for admittance at the emergency department by clinical category and COVID-19 relevant symptoms per category

Rate of RT-PCR positive tests varied with regard to the reason for admission and was relatively high even for non-COVID-19-related symptoms such as trauma (17.6% which corresponds to 9 patients among which 5 were completely asymptomatic with a positive RT-PCR) or psychiatric disorders (16.6%). It is noticeable that among the 694 patients, 124 patients (17.9%) were fully asymptomatic to COVID common signs from which 21 (17%) had RT-PCR positive tests.

Regarding initial RT-PCR, sampling site was the upper respiratory tract (nasopharyngeal swabs or aspirates) in 685 patients and the lower respiratory tract (endotracheal aspirate or bronchoalveolar lavage) in 9 patients (positive in 3 cases). Additionally, 20 patients underwent multiple RT-PCR during their hospital stay (2 tests in 19 patients and 4 tests in one patient). Sampling site was the upper respiratory tract in 18 patients and the lower respiratory tract in two patients (both negative). For these 20 patients, RT-PCR has been repeated as the patient was negative at the first test with an imperative need of knowing his COVID status for optimal care adaptation or hospital organizational reasons (14 patients were in intensive care unit, 5 had surgery, and one was transferred to a dialysis center). All the patients were initially scheduled to be hospitalized but 54 patients (7.8%) were finally discharged.

On the initial RT-PCR test, 278 patients were tested positive to COVID-19 and 287 were positive on the final RT-PCR test, leading to a final prevalence of the disease of 41.4%.

Chest CTs were rated as “Surely COVID+” in 308 cases (44.4%), “Possible COVID+” in 34 cases (4.9%), and “COVID−” in 352 cases (50.7%), including 283 (40.8%) normal chest CTs and 69 (9.9%) chest CTs demonstrating another pathology than COVID-19.

Using the final virological diagnosis as standard of reference, when considering the “Surely COVID+” CT class as positive CT and “Possible COVID+” and “COVID−” CT classes as negative CT, chest CT accuracy was calculated at 88.9% (95% CI 86.4–90.0%), sensitivity at 90.2% (95% CI 87.3–93.2%), specificity at 88% (95% CI 84.4–90.8%), PPV at 84.1% (95% CI 79.6–87.8%), and NPV at 92.7% (95% CI 89.7–94.9%) (Table 2). Agreement between the two methods was good with a kappa of 0.77 (95% CI 0.73–0.82).

Table 2 Diagnostic performance of the chest CT for the “Surely COVID+” rated CT examinations (column 1) and the “Surely COVID+” and “Possible COVID+” rated CT examinations together (column 2) with final RT-PCR results as gold standard. 95% confidence intervals are provided in brackets

Similarly, when considering the “Surely COVID+” and “Possible COVID+” classes as positive CT and “COVID−” class as negative CT, chest CT accuracy was calculated at 86.6% (95% CI 83.9–88.9%), sensitivity at 93.4% (95% CI 89.9–95.7%), specificity at 81.8% (95% CI 77.8–85.3%), PPV at 78.4% (95% CI 73.7–82.4%), and NPV at 94.6% (95% CI 91.7–96.5%) (Table 2). Agreement between the two methods was good with a kappa of 0.73 (95% CI 0.68–0.78).

Radiologists were aware of the results of the RT-PCR when doing the report in 18 patients including 6 patients positive to COVID-19 and 12 negative. Three hundred and six (44%) of the CT scans were performed during night shift (6:30 PM to 7:30 AM). Inter-reader reproducibility was very good with a kappa of 0.86. Mean volumetric Computed Tomography Dose Index (CTDIvol) and mean Dose Length Product (DLP) were 9.71 ± 4.39 mGy (range: 2.8–33.9 mGy) and 387.4 ± 223.6 mGy.cm (range: 104.9–1903 mGy.cm) respectively.

The mean delay between the examination request by the emergency department and the validation of the report was 187 ± 148 min (range: 21–1267 min, median: 154 min, interquartile range: 138 min) and 573 ± 327 min (range: 189–2812 min, median: 444 min, interquartile range: 403 min) for chest CT and RT-PCR respectively (p < 0.0001).

Discussion

Since the beginning of the COVID-19 epidemic phase, finding ways to control the congestion in emergency departments has been of major importance. As such, it is imperative to have a reliable and rapid tool for correctly referring patients to COVID+ or COVID− hospital units. Virological tests by nasopharyngeal sampling have been widely used for this task, but the relative reliability of upper respiratory virus detection and the delays in obtaining results prompt to find alternative solutions, mainly by using chest CT.

In our study, when considering only the “Surely COVID+” chest CT, accuracy, when compared with the final results of the virological tests, reached 88.9% with a sensitivity and specificity of 90.2% and 88% respectively. In previously published results, sensitivity and specificity varied from 60 to 98% and 25 to 53% respectively [4,5,6,7, 14, 15]. These variabilities concerning the values of sensitivity and specificity are most likely due to the retrospective nature of these studies, including ours, but also to differences in virological test accuracy. In our study, specificity is particularly higher than other studies, probably because alternative diagnosis to COVID-19 was relatively infrequent mainly since flu and other respiratory virus epidemic were ending in France which may not be the case for the other studies that occurred earlier in the year. Indeed, even if some differences have been reported between the CT presentation of flu and COVID, the differences remain small and difficult to assess in clinical routine [16]. Furthermore, specificity is probably underestimated in most of the studies, including ours, since the sensitivity of virological tests is questionable which leads to erroneous false positives in CT. Indeed, besides the performance of the detection kits themselves, which is high and comparable between the different tests available on the market (including ours) [17, 18], several external factors may affect RT-PCR detectability including sampling operations, specimens source (upper or lower respiratory tract), sampling timing (delay after the illness onset), and performance of detection kits [19]. Upper respiratory tract sample, representing the majority of the samples in our study (98.5%), has been reported to have a detection rate comprising between 16.4% [20] and 63% [16] for the nasal swabs, lowermost than lower respiratory tract samples such as sputum (66.4% [20] to 72% [16]) or bronchoalveolar lavage (93%) [16]. We should also take into account that the detectable viral load depends on the days after the illness onset and the site sampled. In the first 5–6 days after onset, SARS-CoV-2 could most reliably be detected in the nasopharyngeal tract [20, 21]. The peak of viral shedding is reported at 5–6 days after onset [19, 20]. Seven to 9 days after onset, sputum is more reliable than nasopharyngeal swabs for viral detection [20, 22]. Other studies have also shown that the detection of the virus from a sputum is of a significantly longer period of time compared with the nasopharyngeal swab (29 days versus 11 days) [19]. Given the variability in the viral loads according to the time and the site of sampling, a negative test result from respiratory samples does not rule out the disease [23]. In contrary to the CT specificity which suffers from the imperfect RT-PCR sensitivity as mentioned above, the CT sensitivity is surely more reliable as specificity of RT-PCR is reported to be very high [13].

Regarding the PPV and NPV, Wen et al [7] reported a PPV and NPV of 92% and 42% respectively. In our study, we found a PPV of 84.1% and a NPV of 92.7%. Same as for the calculated specificity, the false negatives of the virological tests probably decreased the apparent PPV of chest CT. High NPV can be explained by the population taking part in this study, mostly patients with severe illness since they were scheduled for hospitalization and thus with a low probability of having a false negative chest CT. Furthermore, the difference in prevalence of the disease between our study and the study of Wen et al [7] (41.4% versus 85%) partially explains the difference in NPV.

Considering the “Possible COVID+” CT results as positive CT increased the sensitivity (from 90.2 to 93.4%) and the NPV (from 92.7 to 94.6%) but impaired the overall accuracy (from 88.9 to 86.6%), the specificity (from 88% to 81.8%) and the PPV (from 84.1 to 78.4%) of chest CT, due to the inclusion of atypical CT patterns. In our study, the clinical objective was to refer quickly the positive patients to the COVID+ units. As a result, the inclusion of “Possible COVID+” diagnosis was not suitable since the PPV was decreased. Conversely, after the pandemic phase, the better NPV might lead to include these “Possible COVID+” diagnosis in order to benefit from the good NPV to safely address the negative patients to the COVID− units. When this pandemic phase will be over, the management of CT COVID+ patients will have to integrate the prevalence of other pulmonary diseases with CT findings equivalent to those of COVID-19 such as influenza. This may lead to isolate these patients in the emergency department until the RT-PCR results are available.

Regarding the delay to obtain the results for a given patient, RT-PCR results, obtained with high throughput diagnostic system, were available after in average 9.5 h (median: 7.4 h). Even if real time RT-PCR is the fastest available viral detection method, chest CT clearly outperforms the RT-PCR in terms of delay to report the presence of COVID-19 infection. In our study, the delay between the request by the emergency department and the validation of the report was three times longer for the virological test than for chest CT.

Taking into account the difference in delays and the NPV and PPV aforementioned values, we can consider the possibility to make use of the chest CT to orient patients suspected to be positive towards the COVID+ unit to decrease congestion in the emergency departments. In the current epidemic phase, suspected positive patients can then wait, in an isolated room, in the COVID+ unit for the final virological tests. As the NPV is not equal to 100%, we consider that the probability of orienting a positive patient towards the COVID− unit is high and the risk of contaminating other patients is important. As a result, for patients that are negative according to chest CT, we suggest that they wait in the emergency department until the results of the virological test are available and then be oriented towards to the COVID− or COVID+ units accordingly. The interest of this systematic strategy is reinforced by the fact that, during the epidemic phase, a high rate of RT-PCR COVID-19 positive patients was observed in patients with non-COVID-19-related symptoms such as trauma (17.6%) or psychiatric disorder (16.6%).

There are several limitations in the present study. First as discussed before, by using RT-PCR assays with relatively low positive rate in upper respiratory samples as reference, the PPV and the specificity of chest CT for COVID-19 may be underestimated. A second limitation is that access to serology data that might help to reclassify RT-PCR false negatives was not possible at the moment of the study and is difficult to perform retrospectively. Furthermore, the relative sensitivity and specificity of these tests might not allow fully solving the problem [24]. A third one is that in the current pandemic context, a positive RT-PCR does not necessarily imply that there is SARS-CoV-2 pneumonia. A fourth one is related to the difficulties in generalizing our results in areas or periods of time with lower disease prevalence. Indeed, PPV would certainly be decreased in these settings. Nevertheless, the overload of the emergency departments that we observed during the epidemic peak would also be less important and as a result the strategy of triage should be adapted accordingly. An additional limitation concerns the retrospective nature of the study with the common bias related to this kind of studies. Nevertheless, we are confident that this bias risk is limited as the information system of our institution allowed to take into account all the patients going into the emergency department and all the requested CT scans. Finally, we should mention that we did not perform a sample size calculation prior to our study. Indeed, our study is a retrospective study in which we took into account all the patients matching the inclusion criteria since beginning of March 2020.

Conclusion

During the COVID-19 epidemic phase, in a high prevalence environment of the disease, chest CT is a rapid and most probably an adequately reliable triage tool to refer patients requiring hospitalization to the COVID+ or COVID− hospital units, when response times for virological tests are too long, in order to decrease congestion in the emergency departments.