Background

Despite the pandemic’s immense human cost, enormous economic toll, and extensive research response, the precise clinical characteristics of COVID-19 remain unclear [1]. At the start of the outbreak, COVID-19 was broadly characterised as a severe respiratory illness presenting with fever, cough and an atypical pneumonia [2,3,4,5]. Altered sense of taste and smell have since been found to be strongly associated with the disease [6, 7]. However, a review of 77 observational studies found substantial proportions of patients presenting with less typical symptoms [8].

Different sets of clinical criteria for suspected COVID-19 have been produced by the World Health Organization (WHO) [9], Centers for Disease Control and Prevention in the United States [10], Public Health England [11], and the European Centre for Disease Prevention and Control [12] (Table 1). Defining presenting symptoms of COVID-19 is further complicated by clinical experience suggesting that patients frequently present with atypical symptoms other than cough, fever and shortness of breath. This variation in the clinical characterisation of COVID-19 is problematic, as case definitions are used to guide clinical diagnosis, disease surveillance, and public health interventions.

Table 1 Clinical case descriptions in use

The International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) cohort study is an international collaboration to gather reliable observational data about patients admitted to hospital with COVID-19 [13]. Here, we present an analysis of how symptoms of patients admitted to hospital with confirmed COVID-19 vary by age and sex. Secondarily, we investigated how sensitivity of clinical case definitions varied among these populations, and explored heterogeneity among countries.

Methods

Study design and setting

This analysis used international observational data of clinical features of patients admitted to hospital with COVID-19 between 30 January and 3 August 2020. The ISARIC/WHO Clinical Characterisation Protocol for Severe Emerging Infections is a standardized protocol for investigation of severe acute infections by pathogens of public health interest [14]. ISARIC case report forms [15] allow standardized data collection from the start of an outbreak and rapid dissemination of clinical information [3, 16,17,18,19,20]. Different tiers of data collection exist to allow sites to collect data to the highest possible standards while recognising varying levels of resource for data collection during epidemics. Details of symptoms at admission were included on the case report forms for all tiers. Data were collected via electronic ‘Core’ and ‘Rapid’ forms, and through aligned forms by ISARIC-4C Coronavirus Clinical Characterisation Consortium in the United Kingdom [21] and the COVID-19 Critical Care Consortium [22]. Investigators from 41 countries are using Research Electronic Data Capture (REDCap, version 8.11.11, Vanderbilt University, Nashville, Tenn.) to contribute their data to a central database hosted by the University of Oxford. Additional data were submitted by investigators from Malaysia, Russia [23] and Spain, who had used alternative data collection forms and databases. This observational study required no change to clinical management and permitted enrolment in other research. The study was approved by the World Health Organization Ethics Review Committee (RPC571 and RPC572). Local ethics approval was obtained for each participating country and site according to local requirements. All investigators retained full rights to their data.

Study population

Patients of any age admitted to hospital with suspected or confirmed COVID-19 were eligible for recruitment. Some versions of the case report form included specific criteria such as fever and cough, but investigators were able to include patients with a clinical suspicion of COVID-19 even if these criteria were not met. This analysis was limited to patients who were admitted to hospital with symptomatic laboratory-confirmed COVID-19: laboratory confirmation was classified according to sites’ local diagnostic methods. We excluded asymptomatic patients who had been admitted to hospital for isolation and patients admitted for other conditions who subsequently developed COVID-19 symptoms.

Variables and measurement

Variables used in this analysis were age, sex at birth, symptoms, date of symptom onset, SARS-CoV-2 confirmation, and country of recruitment. To allow proportions to be calculated with a reliable denominator, only symptoms specified on the case report forms [15, 21, 22] were included in this analysis. The list of symptoms collected is presented with the results.

Statistical methods

Data were converted to Study Data Tabulation Model (version 1.7, Clinical Data Interchange Standards Consortium, Austin, Tex.) to allow inclusion of data submitted not using the ISARIC case report forms. We excluded patients with all symptoms recorded as missing or unknown, and those with missing age, sex or country.

Continuous variables were expressed as median with interquartile range (IQR), and categorical variables as counts with percentages. We tested for differences between female and male patients using Wilcoxon rank-sum tests for continuous variables and chi-square tests for categorical variables. We grouped patients into ten-year age bands (with a single group for age ≥ 90 years). We plotted the most frequently reported symptoms by age group, presenting missing data as a third category. We collated symptoms according to four sets of clinical criteria. These were based on published criteria [9,10,11,12] but modified to omit symptoms with large numbers of missing data due to being added to the case report form part way through the data collection period (altered sense of taste and smell) or not being collected in all datasets (anorexia):

  1. 1.

    Fever plus cough; or any three of fever, cough, fatigue, headache, myalgia, sore throat, rhinorrhoea, shortness of breath, nausea and vomiting, diarrhoea, and confusion;

  2. 2.

    Cough or shortness of breath; or any two of fever, myalgia, headache, and sore throat;

  3. 3.

    Cough or fever;

  4. 4.

    At least one of cough, fever, and shortness of breath.

Patients with missing details of cough, fever or shortness of breath were omitted from the composite groups; patients missing details of symptoms included in the lists of criteria 1 and 2 were included, and were classified according to the non-missing symptoms. We plotted proportions of patients meeting each set of criteria by ten-year age group, with 95% confidence intervals (CI) calculated using the Clopper–Pearson method.

We used logistic regression to identify associations of age and sex with the twelve most prevalent symptoms. Age group and sex were included as fixed effects, with country as a random intercept. To display heterogeneity between countries on the same scale as the fixed effects, we calculated median odds ratios (MOR) [24]. This quantifies variation between countries by comparing odds of an outcome between randomly chosen persons in different clusters who share the same covariates [24]. MORs are defined as a comparison of the group with greater propensity to the group with lower propensity, so lie in the range 1 to infinity [24]. We plotted the MOR to show the magnitude of the effect of heterogeneity and allow comparison with the fixed effects in our data.

79% of patients were recruited in a single country (United Kingdom). As a sensitivity analysis, we repeated the analysis excluding patients from this country to see if results substantially changed. Finally, we plotted age-stratified symptom frequencies for each country with at least 250 patients.

No minimum sample size was calculated. All significance tests were two-tailed. Analyses were performed using R (version 3.6.2, R Foundation for Statistical Computing, Vienna, Austria) with packages including binom, Epi, ggplot2, lme4, sjstats, tableone, and tidyverse.

Results

Data were available for 99 623 patients. We excluded 24 336 who did not have documented SARS-CoV-2 confirmation, 3290 with missing data, and 5794 who developed COVID-19 after admission to hospital. 6094 patients were admitted to hospital with asymptomatic COVID-19, with the greatest proportion in the age band 10–20 years (46% of patients admitted in that age group). The greatest proportion of asymptomatic patients was in Malaysia (49%), where hospitalization was compulsory throughout the data collection period for people with COVID-19. We included 60 109 patients in the analysis (Fig. 1), recruited from 394 sites in 43 countries (Supplementary Table 1), in this analysis. The median age of included patients was 70 years (IQR 54–82; Table 2). 929 (1.5%) were 18 years old or younger. Age distribution of patients varied among countries, between a median of 10 years in Poland, and 73 years in the United Kingdom (Supplementary Figure 1). 34 641 (58%) patients were male.

Fig. 1
figure 1

Flow of participants in this analysis. ISARIC Core and ISARIC Rapid represent data collected internationally via two sets of case report forms; ISARIC 4C and COVID-19 CCC data were collected on separate databases using aligned case report forms; Non-REDCap data were submitted from additional sites and were collected with different case report forms. 4C, Coronavirus Clinical Characterisation Consortium; CCC, Critical Care Consortium; ISARIC, International Severe Acute Respiratory and emerging Infection Consortium; REDCap, Research Electronic Data Capture; SARS-CoV-2, severe acute respiratory syndrome coronavirus-2.

Table 2 Patient demographics

The most frequently reported symptoms were fever, cough and shortness of breath (Table 3). These symptoms were each more prevalent in male patients, whereas less typical symptoms such as confusion, nausea and vomiting, diarrhoea, chest pain, headache and abdominal pain were more prevalent in female patients. The greatest sex-related difference was for nausea and vomiting, reported by 23% of female patients but only 16% of male patients. For most symptoms, the greatest prevalence was reported in adults aged between 30 and 60 years, decreasing toward extremities of age (Fig. 2). Frequency of confusion increased with age. Large numbers of patients had missing data for anorexia, severe dehydration, altered sense of taste and smell, and inability to walk, as these were not included on all case report forms. Altered sense of taste and smell, which we had excluded from the composite criteria, were experienced by only 7.4 and 6.2% respectively of patients with non-missing data.

Table 3 Symptoms at presentation to hospital with COVID-19
Fig. 2
figure 2

Age-specific prevalence of symptoms at hospital admission. Dark blue bars show symptom present, maroon bars show symptom absent, pale grey bars show missing data

Data on cough, fever or shortness of breath were missing for 3446 patients. The composite clinical criteria were calculated for the remaining 56 663 patients. Each set of criteria was met by a greater proportion of patients aged 30–60 years than those toward either extreme of age (Fig. 3). The criteria based on WHO’s clinical criteria [9] (fever plus cough; or any three of fever, cough, fatigue, headache, myalgia, sore throat, rhinorrhoea, shortness of breath, nausea and vomiting, diarrhoea, and confusion) were met by 40 911 (72%) patients, but only 51% of those aged 18 years and under, and 67% of those aged 70 years or over. The most sensitive criteria were at least one of cough, fever and shortness of breath, met by 52 041 (92%) participants. These criteria were met by 85% aged 18 years and under, and 90% of those aged 70 years or over. Each set of criteria were met by a greater proportion of male than female patients (Table 3).

Fig. 3
figure 3

Proportions meeting clinical criteria at hospital admission stratified by 10-year age band. Black boxes show the proportion of individuals, with error bars showing 95% confidence intervals calculated using the Clopper–Pearson method. The size of each box is inversely proportional to the variance, so larger boxes indicate greater certainty. Grey boxes with 95% confidence intervals show the proportions in the sensitivity analysis excluding patients recruited in the United Kingdom. In panel A, the three symptoms are from the list of fever, cough, fatigue, headache, myalgia, sore throat, rhinorrhoea, shortness of breath, nausea and vomiting, diarrhoea, and confusion; in panel B the two symptoms are from the list fever, myalgia, headache, and sore throat. Patients with missing data for cough, fever or shortness of breath are excluded from all four plots

For the 4622 patients whose symptoms did not meet any assessed case definitions, the most frequent symptom was confusion (47%; Table 4). This increased with age to 66% of those aged 90 years or older. Nausea and vomiting, and abdominal pain were the most common symptoms for people less than 60 years old who had not met any of the case definitions.

Table 4 Symptoms reported for patients meeting none of the clinical case definitions

In the logistic regression models (Fig. 4 and 5), similar associations between age and symptoms were seen after adjustment for sex as in the unadjusted bar charts. Confusion increased with age. Nausea and vomiting, headache, abdominal pain, and sore throat were each more frequent in younger age groups, decreasing with age. Male patients had greater odds of fever, cough and shortness of breath, and lower odds of gastrointestinal symptoms of nausea and vomiting, diarrhoea, abdominal pain, chest pain, headache and sore throat. However, in each case except diarrhoea, the magnitude of the difference according to sex was much less than the effect of age. The MOR for heterogeneity between countries was greater than the relationship with sex in all symptoms (Fig. 4). It was of similar magnitude to the relationship with age for most symptoms. For each symptom, heterogeneity between countries was of a similar magnitude to the effect of age, and a greater magnitude than sex.

Fig. 4
figure 4

Odds of symptoms among patients admitted to hospital with COVID-19, stratified by age and sex. Each plot is the result of a logistic regression with a symptom as an outcome. Fixed effects of age in ten-year bands (baseline group 50–60 years) and sex are shown in black boxes with 95% confidence intervals. The size of each square is inversely proportional to the variance of the log odds ratio, so larger boxes indicate greater certainty. Clustering by country is included as a random intercept and heterogeneity is depicted by circles showing the median odds ratio

Fig. 5
figure 5

Age- and sex-specific odds of meeting clinical definitions among patients admitted to hospital with COVID-19, stratified by age and sex. Each plot is the result of a logistic regression with a composite group of symptoms as an outcome. Fixed effects of age in ten-year bands (baseline group 50–60 years) and sex are shown in black boxes with 95% confidence intervals. The size of each square is inversely proportional to the variance of the log odds ratio, so larger boxes indicate greater certainty. Clustering by country is included as a random intercept and heterogeneity is depicted by circles showing the median odds ratio. In panel A, the three symptoms are from the list of fever, cough, fatigue, headache, myalgia, sore throat, rhinorrhoea, shortness of breath, nausea and vomiting, diarrhoea, and confusion; in panel B the two symptoms are from the list fever, myalgia, headache, and sore throat. Patients with missing data for cough, fever or shortness of breath are excluded from all four plots

47 280 (79%) patients were included from the United Kingdom. Excluding these patients, the patterns of symptoms were similar to the main analysis (Supplementary Figure 2). The peak prevalence of fever, cough and shortness of breath was in 70- to 80-year olds, and fatigue increased with age. Below the age of 50 years, the clinical case definitions tended to be less sensitive in the analysis excluding the United Kingdom than in the analysis including it; above the age of 70 years each tended to be more sensitive (grey lines in Fig. 3). Within countries, the baseline prevalence of each symptom varied but patterns within countries were broadly similar to the overall results (Supplementary Figures 3–14).

Discussion

The ISARIC prospective multinational cohort study is the largest cohort of patients admitted to hospital with COVID-19 to date. In this report, we confirmed a relationship between patients’ symptoms and their age and sex. The ‘typical’ COVID-19 symptoms occur most frequently in adults aged 30–60 years. Commonly used case definitions can miss up to half of children and a third of adults over 70 years who are admitted to hospital with COVID-19.

Our results support the findings of smaller cohort studies that atypical symptoms are more common in older adults [26], and correlate with similar findings of atypical presentations for pneumonia, bacteraemia and coronary artery disease [27, 28]. A lower prevalence of symptoms in children and young people has previously been suggested [29, 30], but this is the first large international cohort to collect data prospectively from both adults and children.

Separate analyses of the ISARIC-4C data have identified fever, cough and shortness of breath as frequently co-occurring clusters of symptoms [31, 32]. In children, the next most frequent cluster consisted of systemic, enteric and mucocutaneous symptoms [31]. For adults, other clusters consisted of non-specific viral symptoms, gastrointestinal symptoms, upper respiratory symptoms, neurological symptoms, and symptoms of bronchospasm [32]. Those data were included in this global dataset so the results of these analyses are not independent of our results.

We found that differences in symptoms by sex were statistically significant but generally of smaller magnitude. Typical symptoms of fever, cough and shortness of breath were more common in men than in women; all other symptoms were equal or more common in female patients. A cohort of non-hospitalized patients with COVID-19 in Poland found greater differences in symptoms of lack of appetite (55% of women, 36% of men) and taste disorder (53% women, 40% men) [33]. We are unable to determine from our data whether these differences reflect differences in health-seeking behaviour between men and women, or physiological differences in their response to the infection. Elaboration of this difference should be a goal of future research.

Our results suggest considerable heterogeneity among countries. We have not attempted to elicit reasons for heterogeneity. Potential reasons include cultural idiosyncrasies in reporting symptoms, and hospitals’ criteria for admission and testing. It might also reflect differences in local patient recruitment. Researchers in some countries may be unwilling to recruit confused patients due to requirements for consent, whereas in others, the requirement for consent has been waived or could be obtained from a proxy. We explored the effect of using the country of recruitment as a random effect in regression models and by repeating the analysis excluding the largest country. Each analysis suggested an underlying pattern of lower frequencies of typical symptoms in children and older adults. Therefore, although the prevalence of each symptom reported in this study may not apply to all settings, we have evidence to support the possibility of age-dependent differences internationally.

The size of this cohort is a strength. To our knowledge, it is the largest cohort of hospitalized COVID-19 patients in the world. However, the study has several limitations. First, almost 80% of patients were recruited in a single country. Moreover, less than 1% of patients were recruited from low- or lower–middle-income countries. Second, the cohort overwhelmingly includes older adults, with only 1.8% of the cohort aged 18 years or younger. Third, our analysis includes only patients who were hospitalized with COVID-19 and who had a laboratory-confirmed diagnosis. This patient population is more likely to be severely unwell and more likely to exhibit symptoms typically associated with COVID-19 than people who were managed in the community or whose disease has not been recognized. Accordingly, the reporting of ‘typical’ COVID-19 symptoms in this cohort is likely to be an overestimate of the population prevalence. Symptoms are subjective and cannot be externally verified. Some differences for children may reflect that symptoms could only be recorded if a caregiver recognised the symptom or the child had the appropriate vocabulary to describe it. Similarly, some symptoms may be under-reported in elderly patients if there are difficulties in communication, for example due to delirium. As such, the generalizability of estimates of our symptom prevalence is limited. Similarly, there is a shortage of studies conducted outside of high-income countries: a recent scoping review of clinical characteristics of COVID-19 identified no large cohorts in non-high-income countries except China [8].

The absence of a control group of patients without COVID-19 in this dataset prevented estimation of specificity or positive and negative predictive values. We are therefore prevented from advocating changes to clinical case definitions, as such decisions inevitably require a balance of false-positive and false-negative rates. However, given the prevalence of atypical symptoms in our cohort, we can confidently suggest that reliance on clinical case definitions may result in missing cases of COVID-19, especially among children and older adults. Non-healthcare professionals making decisions regarding isolation may be especially vulnerable to missing cases of COVID-19 by adhering to a clinical case definition too strictly.

The reported prevalence of COVID-19 may also rely on a strict interpretation of case definitions. In settings where comprehensive contact tracing is planned, or there is easy access to microbiological testing, a highly sensitive case definition is desirable. However, where decisions are based on clinical diagnoses, it is important to recognize other pathogens that can cause similar constellations of symptoms. The addition of symptoms such as confusion or gastrointestinal symptoms to the COVID-19 case definition could increase sensitivity, but at the cost of reduced specificity. Changes in the senses of taste and smell have recently been added to case definitions. Our data suggest that these criteria would detect only a small proportion of patients admitted to hospital with COVID-19 who were omitted by other definitions.

These results highlight the need to consider COVID-19 even if individuals do not display typical symptoms of the disease. This is especially the case in children and older adults. Given that our results are likely to overestimate the sensitivity of the clinical criteria currently used to identify patients for testing, our results suggest a lower limit to the proportion of people in the community with COVID-19 who would not be identified. Addition of confusion as a symptom would increase the sensitivity of case definitions for older adults; and inclusion of nausea and vomiting or abdominal pain would increase sensitivity for children and young adults. The high proportion of asymptomatic patients identified in patients aged 10–20 years suggests that universal screening in these ages could be beneficial when there is widespread community circulation of COVID-19. Ongoing data collection outside high-income countries is needed to establish whether alternative case definitions are needed in different settings. Work is also ongoing to determine whether some constellations of symptoms are associated with better or poorer outcomes than others.