FormalPara Key Summary Points

Age, comorbidity, CD4+ T cell count, CRP, and corticosteroid use were independent risk factors associated with prolonged SARS-CoV-2 RNA shedding.

A prediction model was established, called the CCCCA score, which scores from 5 (age ≤ 50 years, absence of comorbidity, CD4+ T cell > 410 cells/ul, CRP ≤ 10 mg/L, no corticosteroid use) to 14 points (age > 50 years, presence of comorbidity, CD4+ T cell ≤ 410 cells/ul, CRP > 10 mg/L, corticosteroid use).

Using the CCCCA model, clinicians can identify patients with prolonged SARS-CoV-2 RNA shedding, which is of importance for determining an appropriate period of isolation and planning strategies to reduce COVID-19 transmission.

Digital Features

This article is published with digital features, including a summary slide, to facilitate understanding of the article. To view digital features for this article go to https://doi.org/10.6084/m9.figshare.14222525.

Introduction

In December 2019, a novel coronavirus, called severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), caused a global outbreak of coronavirus disease-19 (COVID-19) [1]. Phylogenetic analysis of the complete viral genome revealed that SARS-CoV-2 was most closely related (89.1% similarity) to severe acute respiratory syndrome coronavirus (SARS-CoV) that is the infectious agent responsible for the outbreak of severe acute respiratory syndrome (SARS) in 2003 [2]. At present, the COVID-19 pandemic has become a global public health issue. As of 12 January 2021, the overall upward trend of COVID-19 is continuing, which brings the cumulative numbers to over 88 million reported cases and over 1.9 million deaths globally since the start of the pandemic [3].

Although a negative SARS-CoV-2 ribose nucleic acid (RNA) test is not required in all countries for non-immunocompromised individuals, the main importance of SARS-CoV-2 RNA is the public health implications for planning strategies to reduce transmission. In a previous study, long-term virus shedding was documented in patients with mild symptoms and in asymptomatic patients, and this should be considered when making decisions regarding clinical and public health, and strategies for the prevention and control of SARS-CoV-2 infection [4]. In addition, the duration of viral shedding is often considered in determining an appropriate period of isolation, as it is often used as a marker of infectivity. In China, one of the release criteria for hospitalized patients with COVID-19 is SARS-CoV-2 RNA shedding cessation, which is defined as the occurrence of two consecutive negative results of nasopharyngeal or throat swab specimens with at least 24-h intervals. However, patients with prolonged SARS-CoV-2 RNA shedding presented difficulties in terms of their safe discharge from isolation treatment centers. SARS-CoV-2 RNA shedding from respiratory tract specimens has been found to persist for up to 63 days after symptom onset among patients with COVID-19 [5, 6].

A number of papers have demonstrated prolonged viral shedding in severe illness. In the study by Liu et al., patients with severe COVID-19 tended to have a high viral load and a long virus-shedding period [7]. Zheng et al. also found that the virus persisted longer with higher load and peaked later in the respiratory tissue of patients with severe COVID-19 [8]. Li et al. identified nine (50%) asymptomatic patients and nine (50%) patients with mild symptoms of COVID-19 at admission, and found that the median duration of virus shedding was 11.5, 28, and 31 days for pre-symptomatic, asymptomatic, and mildly symptomatic patients, respectively [4]. In conclusion, there have been several studies evaluating risk factors for prolonged shedding of SARS-CoV-2 RNA. However, specifically in non-severe but not asymptomatic patients, the data are scarce and no prediction score has so far been provided. Therefore, we aimed to clarify the high-risk factors associated with long-term SARS-CoV-2 RNA shedding, and to establish an accurate prediction model for the prediction of long-term SARS-CoV-2 RNA shedding.

Methods

Study Population

A total of 1332 consecutive COVID-19 patients with age ≥ 18 years old, who were admitted to Shanghai Public Health Clinical Center, between January 20, 2020 and November 30, 2020, were included in our study. Shanghai Public Health Clinical Center, located in Shanghai, China, is a tertiary teaching hospital, and the only designated hospital for adult patients with COVID-19 (≥ 18 years old). Exclusion criteria: (1) patients presenting with severe COVID-19 (n = 38); (2) patients who were still being treated up to November 30, 2020 (n = 83). Finally, 1211 non-severe adult hospitalized patients with COVID-19 were enrolled. The 796 patients admitted to hospital between January 20 and August 31, 2020 constituted the training set, and 415 patients admitted to hospital between September 1 and November 30, 2020 constituted the validation set. The flow chart of the study population is shown in Fig. 1.

Fig. 1
figure 1

Flow diagram of the study population. Finally, 1211 non-severe patients with COVID-19 were enrolled. The 796 patients admitted to hospital between January 20th 2020 and August 31st 2020 constituted the training set, and 415 patients admitted to hospital between September 1st 2020 and November 30st 2020 constituted the validation set

The study was approved by the Clinical Research Ethics Committee of the Shanghai Public Health Clinical Center (No. 2020-S219-01). Although this is a retrospective study, at hospital admission, all patients were provided verbal consent of the possibility of their clinical data being used for a medical study. The procedures followed were in accordance with the ethical standards of the Helsinki Declaration (1964, amended most recently in 2008) of the World Medical Association.

Diagnostic Criteria

Patients with COVID-19 were confirmed according to the positive results of SARS-CoV-2 RNA tests in nasopharyngeal or throat swab specimens using the real-time reverse transcription polymerase chain reaction (RT-PCR) method, which is the gold standard for detecting SARS-CoV-2 infection based on the World Health Organization interim guidance [9]. Patients with severe COVID-19 were diagnosed according to at least one of the following standards [10]: (1) respiratory distress, respiratory rates ≥ 30 breaths/min; (2) resting oxygen saturation ≤ 93%; (3) oxygenation index ≤ 300 mmHg; (4) requirement of mechanical ventilation; (5) shock; and (6) combined with other organ failures. Patients with non-severe COVID-19 were defined as patients without any of the above six standards. SARS-CoV-2 RNA shedding cessation was defined as the occurrence of two consecutive RT-PCR negative results of nasopharyngeal or throat swab specimens with at least 24-h intervals. SARS-CoV-2 RNA shedding duration was defined as the time from illness onset (symptom onset for symptomatic patients, first positive RT-PCR tests for asymptomatic patients) to SARS-CoV-2 RNA shedding cessation. At present, there are no unified standards for the cut-off value of prolonged SARS-CoV-2 RNA shedding. Based on previous studies [4, 8, 11,12,13], which reported that the median duration of viral shedding was 12–21 days, in this study, prolonged or long-term SARS-CoV-2 RNA shedding was defined as SARS-CoV-2 RNA shedding duration of more than 21 days. Patients were divided into two groups: the long-term SARS-CoV-2 RNA shedding group (> 21 days), and the short-term SARS-CoV-2 RNA shedding group (≤ 21 days).

Data Collection

The demographic characteristics, epidemiologic histories, comorbidity, laboratory parameters, and clinical outcomes were extracted from the electronic medical records. Comorbidity included at least one of the following: hypertension, diabetes, chronic cardiovascular disease, chronic liver disease, chronic lung disease, chronic kidney disease, acquired immunodeficiency syndrome, malignancy, and other immunocompromised factors including corticosteroid use and autoimmune diseases. For patients in this study, the vital signs were monitored daily; laboratory parameters were examined every 3–7 days, as were chest CT scans. The laboratory parameters including white blood cell (WBC) count, lymphocyte count, CD4-positive T-lymphocytes count (CD4 + T cell count), C-reactive protein (CRP), lactate dehydrogenase (LDH), D-dimer, and erythrocyte sedimentation rate (ESR) were collected both at baseline and at all points during hospitalization.

Statistical Analysis

We used the Kolmogorov–Smirnov test to verify the normality of continuous variables. Normal distribution variables were presented using mean and standard deviations, non-normal distribution continuous variables were presented using median and interquartile ranges (IQRs), and qualitative data were presented using frequency distribution. For the comparison of data between groups, we used the Student’s t test for normal distribution variables, the non-parametric Mann–Whitney-test for non-normal distribution continuous variables, and the chi-squared test for qualitative variables. The significance of each variable was assessed by univariate and multivariate Cox analysis for investigating the independent risk factors of long-term SARS-CoV-2 RNA shedding. All the variables at a statistically significant level after multivariate Cox analysis were candidates for formulation of a scoring model, and their relevant points were determined by the hazard ratio (HR) values to reflect their weights of impact on the SARS-CoV-2 RNA shedding duration. The predictive performance of the scoring model was assessed using the receiver operating characteristic (ROC) curve and the area under the ROC curve (AUROC). The cut-off value was obtained by maximizing the Youden index (sensitivity + specificity-1). Diagnostic accuracy was evaluated by sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). The statistical analyses were performed using the SPSS software, v.15.0 (SPSS; Chicago, IL, USA), MedCalc software, v.16.1 (MedCalc Software, Ostend, Belgium), and R software, v.3.6.1 ® Foundation for Statistical Computing, Vienna, Austria). All significance tests were two-tailed, and p < 0.05 was considered as statistically significant.

Results

Clinical Characteristics of Patients in the Training Set

Clinical characteristics of patients in the training set are shown in Table 1. Overall, 796 patients with COVID-19 were enrolled. The median age was 36 years (IQR 25–51), 456 patients (57.3%) were male, 203 patients (25.5%) were older than 50 years, 205 patients (25.8%) had obesity, 138 patients (17.3%) had a smoking history, 134 patients (16.8%) had at least one comorbidity, and the median SARS-CoV-2 RNA shedding duration was 13 days (IQR 9–18). The 796 patients were divided into the long-term group (n = 116, 14.6%) and the short-term group (n = 680, 85.4%) based on their SARS-CoV-2 RNA shedding duration. Age, comorbidity, fever, WBC, lymphocyte count, CD4 + T cell count, CRP, and corticosteroid use were significantly different between the two groups (Table 1).

Table 1 Clinical characteristics of patients in the training set

Independent Risk Factors Associated With Long-Term SARS-CoV-2 RNA Shedding

Cox regression analysis of prolonged viral RNA shedding duration is shown in Table 2. Multivariate Cox analysis identified that age > 50 years (HR 2.67, 95% CI 1.54–3.69, p = 0.001), comorbidity (HR 1.83, 95% CI 1.03–3.25, p = 0.039), CD4 + T cell count ≤ 410 cells/ul (HR 1.77, 95% CI 1.05–2.89, p = 0.042), CRP > 10 mg/L (HR 3.39, 95% CI 2.06–5.61, p < 0.001), and the corticosteroid use (HR 3.91, 95% CI 2.02–7.56, p < 0.001) were independent risk factors associated with long-term SARS-CoV-2 RNA shedding.

Table 2 Cox regression analysis of prolonged viral RNA shedding duration

Construction of a Scoring Model

In order to facilitate clinical use and further assessment, a scoring model was established according to the results of multivariate Cox analysis, called the CCCCA score, which scores from 5 (age ≤ 50 years, absence of comorbidity, CD4 + T cell > 410 cells/ul, CRP ≤ 10 mg/L, no corticosteroid use) to 14 points (age > 50 years, presence of comorbidity, CD4 + T cell ≤ 410 cells/ul, CRP > 10 mg/L, corticosteroid use). Using Shanghai Public Health Clinical Center laboratory reference range standards, we chose the lower limit of normal (≤ 410 cells/ul) as cut-off for CD4 + T cell count, and the upper limit of normal (> 10 mg/L) as cut-off for CRP. The calculator of the CCCCA score is shown in Table 3. ROC analysis was used to assess the performance of the CCCCA score in the training set (Fig. 2a). For the prediction of long-term SARS-CoV-2 RNA shedding, the AUROC of the CCCCA score was 0.87 (95% CI 0.85–0.89). Using a cut-off value of 8 points, the sensitivity was 75.9%, the specificity was 87.7%, the PPV was 51.2%, and the NPV was 95.5% (Table 4).

Fig. 2
figure 2

ROC curves of the CCCCA score in the training set (a) and the validation set (b). For the prediction of prolonged viral RNA shedding duration, the AUROC of the CCCCA score was 0.87 (95% CI 0.85 to 0.89) in the training set (a) and 0.83 (95% CI 0.79 to 0.87) in the validation set (b), respectively

Table 3 Calculator of the CCCCA score

Clinical Characteristics of Patients in the Validation Set

Clinical characteristics of patients in the validation set are shown in Table 5. Of 415 patients in the validation set, 81 (19.5%) were over 50 years, 64 (15.4%) had comorbidity, 53 (12.8%) had CD4 + T cell ≤ 410 cells/ul, 17 (4.1%) had CRP > 10 mg/L, 6 (1.4%) received corticosteroid therapy, and 29 (7.0%) met criteria for long-term SARS-CoV-2 RNA shedding. The median CCCCA score was higher in long-term group compared with short-term group (9 vs. 5, p < 0.001).

Table 4 Accuracy of the CCCCA score for the prediction of prolonged viral RNA shedding
Table 5 Clinical characteristics of patients in the validation set

Predictive Power of the CCCCA Score in the Validation Set

In the validation set, the predictive power of the CCCCA score was also good, with an AUROC of 0.83 (95% CI 0.79–0.87) (Table 4; Fig. 2b). Using a cut-off value of 8 points, we found sensitivity, specificity, PPV, and NPV of 51.7%, 92.2%, 33.3%, and 96.2%, respectively. Long-term SARS-CoV-2 RNA shedding duration increased from 14/370 (3.8%) in patients with CCCCA < 8 points to 15/45 (33.3%) in patients with CCCCA ≥ 8 points.

Discussion

Defining viral shedding of SARS-CoV-2 is of utmost importance for determining an appropriate period of isolation. Negative SARS-CoV-2 RNA tests may be useful in providing certainty about non-infectiousness before ending isolation in COVID-19 patient groups with known risk factors for prolonged viral shedding. In this study, we derived and validated a score model, called the CCCCA score, based on five variables (comorbidity, CRP, CD4 + T cell, corticosteroid use, age), aimed at predicting prolonged SARS-CoV-2 RNA shedding. For the prediction of long-term SARS-CoV-2 RNA shedding, the CCCCA score was excellent, with an AUROC of 0.87 (95% CI 0.85–0.89) in the training set and an AUROC of 0.83 (95% CI 0.79–0.87) in the validation set.

In this study, age > 50 years, comorbidity, CD4 + T cell ≤ 410 cells/ul, CRP > 10 mg/L, and corticosteroid use were identified as independent high-risk factors associated with long-term SARS-CoV-2 RNA shedding. Previous studies have found several factors to be associated with prolonged shedding. These include older age, male sex, concomitant hypertension or diabetes, delayed admission to hospital after illness onset, severe illness at admission, invasive mechanical ventilation, administration of corticosteroid, and so on [12, 14,15,16,17]. In the study by Zhou et al., multivariable regression showed prolonged viral shedding independently associated with older age (odds ratio 1.02, p = 0.003) [12]. Another study reported time from onset to admission (HR = 0.829, p < 0.001), and administration of corticosteroid (HR = 0.496, p = 0.002), arbidol (HR = 2.605, p = 0.008) and oseltamivir (HR = 0.416, p < 0.001) were independently associated with duration of viral shedding [14]. In a retrospective study, risk factors associated with SARS-CoV-2 RNA shedding were evaluated in a cohort of 113 symptomatic patients from two hospitals outside Wuhan, and the results showed that prolonged SARS-CoV-2 RNA shedding was associated with male sex, old age, concomitant hypertension, delayed admission to hospital after illness onset, severe illness at admission, invasive mechanical ventilation, and corticosteroid treatment [15]. In a study by Chen et al., older age (HR 0.99, 95% CI 0.98–1.00; p = 0.04), time lag from illness onset to hospital admission (HR 0.91, 95% CI 0.88–0.94; p < 0.001), and corticosteroid treatment (HR 0.60, 95% CI 0.39–0.94; p = 0.024 were significantly and independently associated with prolonged viral RNA shedding [16]. In order to identify risk factors for prolonged viral shedding, Buetti et al. used univariate and multivariate Cox proportional hazards models, and found that type two diabetes was associated with a prolonged viral RNA shedding (HR: 0.31, 95% CI 0.11–0.89, p = 0.029) [17].

Although previous studies have investigated the risk factors associated with the duration of SARS-CoV-2 RNA shedding, they did not establish a prediction model of long-term SARS-CoV-2 RNA shedding. There are currently considerable gaps regarding a scoring model for the prediction of prolonged SARS-CoV-2 RNA shedding. In this study, we derived a risk factors scoring system (the CCCCA score), which could identify a group of patients with la ow risk of long-term SARS-CoV-2 RNA shedding. Over 95% of patients with CCCCA score of 5–8 points will not experience long-term SARS-CoV-2 RNA shedding. Long-term SARS-CoV-2 RNA shedding increased from 3.8% in patients with CCCCA score < 8 points to 33.3% in patients with a CALL score ≥ 8 points.

In a previous retrospective study, the median duration of SARS-CoV-2 RNA detection was 17 (13–22) days as measured from illness onset [15]. Liu et al. reported the median duration of viral shedding was 25.0 days (IQR 20.0–30.0) in 140 medical workers with COVID-19 in Wuhan, China [18]. In a retrospective, multicenter cohort study, Zhou et al. included all adult inpatients with COVID-19 from Jinyintan Hospital and Wuhan Pulmonary Hospital, and found that the median duration of viral shedding was 20.0 days (IQR 17.0–24.0) in survivors, but SARS-CoV-2 was detectable until death in non-survivors [12]. In this study, the median duration of SARS-CoV-2 RNA shedding was 13 (9–18) days. In contrast, the median duration of viral shedding in this study was shorter than that reported by previous studies. There are several possible explanations for the differences. First, the statistical difference may have been caused by the size of the sample. We provided a large sample size (1211 patients) compared with previous studies (113 patients, 140 patients, and 384 patients, respectively). Second, there are some differences in the study populations regarding age, comorbidity, administration of corticosteroid, and the degree of severity, which are the confirmed risk factors related to SARS-CoV-2 RNA shedding.

One previous study reported that fever (temperature > 38.5 °C) (OR 5.1, 95% CI 1.5–18.1) was associated with increased odds of prolonged duration of viral shedding [19]. Another study also reported that the highest temperature at admission (OR 5.200; 95% CI 1.190–22.726; p = 0.028) was one of the risk factors for prolonged duration of viral shedding [20]. In this study, multivariate Cox analysis showed that fever (HR 1.28, 95% CI 0.83–1.98, p = 0.271) was not an independent risk factor associated with long-term SARS-CoV-2 RNA shedding. There are several possible explanations for the differences. First, the two previously mentioned studies included severe patients with COVID-19 (14–30.3%) [19, 20], while this study only included non-severe patients with COVID-19. Second, the prevalence of patients with fever is different (42.2%, 51.6%, and 78%, respectively). Third, the previous two studies had small sample sizes (66 and 147 patients, respectively). The statistical bias might be caused by the small size of the sample. In this study, we provided a much larger sample, including 796 patients in the training set and 415 patients in the validation set.

In this study, multivariate Cox analysis identified that CD4 + T cell ≤ 410 cells/ul (HR 1.77, p = 0.042), rather than lymphocyte ≤ 1.0 × 109/L (HR 1.07, p = 0.825), was one of the independent risk factors associated with long-term SARS-CoV-2 RNA shedding. Although lymphocyte count is a more practical variable to use, CD4 + T cell count is a more accurate marker of host's immunity, which is related with the SARS-CoV-2 RNA shedding.

Our study has several limitations. First, SARS-CoV-2 RNA shedding in nasopharyngeal or throat swab specimens does not necessarily represent true viral shedding, since recent research has indicated the prolonged presence of SARS-CoV-2 viral RNA in fecal samples [21]. In addition, the definition of starting point varies between studies (symptom onset and/or first positive RT-PCR). Second, we detected SARS-CoV-2 by RT-PCR but not by virus isolation. According to a recent report, RNA can be detected long after the disappearance of infectious virus. The immune system can neutralize viruses but not eliminate RNA, degrading slowly [22]. The presence of viral RNA does not always correlate with viability and transmissibility of virus, unless the virus is isolated and cultured from the specimens. However, the practical approach of discharging patients based on shedding cessation usually utilizes the widely available PCR tests rather than viral culture. Third, this study is a retrospective study, and might have the Type II error. Therefore, the results in this study should be interpreted with caution. Prospective, multicenter, large-scale studies are needed to further assess the performance of the CCCCA score for the prediction of long-term SARS-CoV-2 RNA shedding. Four, in this study, pediatric patients were not seen. In Shanghai, China, under the regulations of the Shanghai Health Committee, all pediatric patients with COVID-19 were hospitalized at another designated hospital, the Children's Hospital of Fudan University.

Conclusions

In conclusion, the median time for SARS-CoV-2 RNA shedding was 13 days (IQR 9–18) in non-severe hospitalized adult patients with COVID-19. The CCCCA score seems to be a good prognosticator for long-term SARS-CoV-2 RNA shedding. Prospective, multicenter, large-scale studies are needed to further assess the performance of the CCCCA score for the prediction of long-term SARS-CoV-2 RNA shedding.