Background

Multiple studies have attempted to determine the incidence and risk factors associated with SARS-CoV-2 sequelae in children and young people (CYP) [1]. However, these studies were often impeded by methodological heterogeneity, biases, and inconsistent definitions of COVID-19 consequences [2, 3]. The difficulty of appropriate reference or control group selection has subjected available studies to criticism [4].

One possible approach to obtaining a reference group is to recruit uninfected CYP whose negative status has been previously verified by PCR testing. In the CLoCk study, PCR-positive CYP were matched to test-negative CYP using the national SARS-CoV-2 testing dataset [5]. Similarly, a nationwide cohort study of CYP in Denmark included an exposed group with SARS-CoV-2 infection, verified by RT-PCR, and a reference group of randomly selected individuals who have never been test-positive for SARS-CoV-2 [6]. In a mobile app-based study, Molteni and co-authors recruited a reference group of CYP whose self-reported test results were negative [7]. In other studies, the exposed cohorts were formed from randomly selected schools based on RT-PCR test results, and reference cohorts were recruited from CYP who visited their doctor routinely, with both negative RT-PCR and antibody-based SARS-CoV-2 analysis [8, 9].

Previous efforts suffer from blending symptomatic and asymptomatic CYP in the infected groups, which does not address the possible confounding effect of severity or confounding by factors predisposing to more severe symptoms. Additionally, misdiagnosis may have occurred, as acknowledged by some authors [5]. One approach is to limit the scope of the infected group to those CYP who are symptomatic. However, the results of such studies are challenging to interpret, as previously ill CYP are compared to a generally healthy reference, and the incremental effect cannot be evaluated in the context of other previously known illnesses [10]. Previous expert statements [11, 12] as well as recent systematic review suggested that future studies would benefit from control group and adjustment of the study results for health and environmental factors, including SARS-CoV-2 variant [1].

A potential solution to this problem could be using a reference with comparably severe disease, yet without SARS-CoV-2 infection. To the best of our knowledge, a single study followed this consideration, recruiting reference cohort from both hospitalised and non-hospitalised CYP with other non-SARS-CoV-2 community-acquired infections, clinically and laboratory-confirmed, whereas cases were defined as CYP with previous acute phase of COVID-19 [13]. Given the relatively rare occurrence of highly severe SARS-CoV-2 infection [14], filling the gaps in understanding long consequences of SARS-CoV-2 on health and well-being of CYP in comparison to other known infection is needed. Additionally, there is a paucity of paediatric studies evaluating the long-term consequences of COVID-19 related to the most prevalent circulating variants of SARS-CoV-2 in the population during a specific time period [15].

Another limitation pertains to the broad outcome definition employed by most of the observational studies examining the consequences of COVID-19. Although the World Health Organization (WHO) has provided a definition for the post-COVID-19 Condition [16], a disappointingly small number of studies have chosen to incorporate this definition into their research methodology.

This prospective exposure-stratified cohort study aimed to investigate the incidence of post-COVID-19 condition among hospitalised CYP with COVID-19 infection compared with post-infection sequalae in a reference cohort of previously hospitalised CYP with non-COVID-19 infectious diseases. The study used standardised follow-up data collection protocols developed by the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) Global Paediatric COVID-19 follow-up working group. The study included assessment of post-COVID-19 condition corresponding to different waves of the pandemic in Moscow, Russia, as a proxy for infection with different SARS-CoV-2 variants.

Methods

The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) checklist for cohort studies (https://www.strobe-statement.org/) was utilised to report the present study.

Study design, setting, and participants

The present study is an exposure-stratified prospective cohort study of CYP under the age of 18 carried out in Bashlyaeva Children’s City Clinical Hospital and G.N. Speransky Children’s City Clinical Hospital No. 9. These hospitals were the primary hospitals treating COVID-19 in CYP residing in Moscow at different times of pandemic. Additional file 1: Table S1 provides a detailed description of the criteria for hospital admission according to the local clinical guidelines.

Exposed cohorts for this study included paediatric patients hospitalised to Bashlyaeva Children’s City Clinical Hospital with a laboratory-confirmed diagnosis of COVID-19 infection, spanning from April 2, 2020, to December 11, 2020 (Wuhan variant cohort), and to G.N. Speransky Children’s City Clinical Hospital No. 9 between January 12, 2022, and February 19, 2022 (Omicron variant cohort). The dates of patient hospital admission were matched with the data on variant predominance in Moscow [17] (Additional file 1: Figure S1).

CYP who were hospitalised to G.N. Speransky Children’s City Clinical Hospital No. 9 with confirmed respiratory and gastrointestinal infections in conjunction with negative lateral flow rapid diagnostic test and PCR-test results for SARS-CoV-2, between January 12, 2022, and February 19, 2022, served as the unexposed reference cohort for this study. They represent typical patients who are routinely admitted to the hospital with respiratory and gastrointestinal infections during seasonal outbreaks. Recruiting a reference cohort during the Wuhan wave proved to be a formidable challenge. The dearth of admissions of CYP with respiratory infections other than COVID-19, coupled with the high rate of false negatives in COVID-19 testing during the initial months of the pandemic and the scarcity of resources further compounded the difficulty.

Data management

Baseline data at admission consisted of information pertaining to patient demographics, symptoms, and comorbidities, documented at the time of admission. In addition, the dataset also encompassed results of clinical investigations, supportive care requirements during hospitalisation, and crucial clinical outcomes upon discharge.

Follow-up data have been collected via telephone interviews with parents conducted by research team members who had received standardised training in interview administration, REDCap data entry, and data security [18,19,20]. The interviews took place between January 31 and February 27, June 2 and August 1, 2021, and August 1 and September 15, 2022. The long COVID paediatric protocol and survey, which was developed by the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) and translated into Russian, was utilised to collect data from both exposed and the reference cohorts. The data included questions regarding symptoms and their onset and persistence at the time of the follow-up as well as the physical, psychosocial, and behavioural well-being of CYP (all proxy-reported by parents).

Data collection, storage, and management were performed using REDCap electronic data capture tools (Vanderbilt University, Nashville, TN, USA) hosted at Sechenov University and Microsoft Excel (Microsoft Corp, Redmond, WA, USA).

Definitions

The present study adopts the WHO case definition to categorise post-COVID-19 condition as the presence of any symptom that emerges no later than 3 months post-hospital discharge and persists for at least 2 months. In the absence of reliable objective medical record data pertaining to the date of first symptom appearance, symptom duration was calculated from the time of hospital discharge.

Post-COVID-19 condition/post-infection sequalae symptoms were categorised into nine manifestations, which encompassed cardiovascular, dermatological, fatigue, gastrointestinal, musculoskeletal, neurocognitive, respiratory, sensory, and sleep-related symptoms. This classification was formulated by drawing upon relevant prior literature and through ISARIC’s working group deliberations [20].

Emotional and behavioural changes, which were initially recorded using a five-point scale ranging from ‘much less’ to ‘much more’, have been converted into binary variables. We considered worsening of symptoms for excessive fatigue when parents were reporting ‘more’ and ‘much more’ with regard to this outcome during telephone interview.

Recovery was assessed using a Likert scale ranging from ‘1’ (no recovery) to ‘10’ (complete recovery). Responses were transformed into a binary variable, with a ‘1–6’ considered ‘incomplete recovery’ and ‘7–10’ complete recovery from the infection.

Patients in all the cohorts were defined as severe if required non-invasive ventilation, invasive ventilation, or intensive care unit (ICU) care during acute phase of COVID-19.

Statistical analysis

Baseline characteristics were analysed through descriptive statistics. We summarised continuous variables as medians (interquartile range, IQR), while categorical variables were presented as frequencies (percentages).

We utilised forest plots to depict the incidence and event rates associated with post-COVID-19 conditions and their varying manifestations. The incidence was calculated as the ratio of the total number of new cases to the total time at risk. In line with the World Health Organization’s definition of post-COVID-19 condition (PCC), an individual’s time at risk is capped at three months from the onset of symptoms (or from hospital admission if symptom onset is unknown). For manifestations of symptoms, we selected the shortest time at risk across all symptoms for each individual within a group. We calculated the rate of events as the ratio of the number of cases to the number of people at risk.

We conducted a comparison between the ‘exposed cohorts’ (Wuhan variant cohort and Omicron variant cohort) and ‘reference cohort’ using 1:1 matching by age (± 1 year) and sex. In a sensitivity analysis, two additional matching parameters severity and length of hospitalisation (± 2 days) were included (Additional file 1: Table S2, Figures S2 and S3). We regarded age and sex as confounders, while severity (defined as the need for non-invasive or invasive ventilation, or intensive care unit admission) and length of hospitalisation may function as proxies for true severity.

We employed bootstrap methodology (30,000 iterations) to obtain 95% confidence intervals (CIs) for the estimates of post-COVID-19 condition event rates and incidence. This involved resampling from a variety of potentially matched cohorts. We have presented the demographic characteristics and comorbidities of the matched cohorts, alongside the features of the initial sample.

We considered differences in event rates and incidence of manifestations between groups as significant if the median p-value for rate ratios, acquired from the bootstrap procedure via the exact Fisher’s test, was less than 5%.

Statistical analysis was conducted using R version 4.0.2, employing the dplyr, foreign, ggraph, and ggforce libraries.

To minimise recall bias, we limited the period from the onset date of the first symptoms (or the date of hospital admission if symptom onset was missing or inconsistent) to the follow-up date to a maximum of 8 months.

Results

Study participant characteristics

Out of the 2595 eligible CYP with laboratory-confirmed COVID-19 who were discharged between March 18, 2020, and February 20, 2022, 2520 (97.1%) had contact information available. Of these, 1707 CYP (65.7% of those discharged, 67.7% of those with contact information) participated in follow-up interviews. Upon matching, 1183 out of 1707 CYP (69%) were included in the final analysis (Fig. 1).

Fig. 1
figure 1

Study flow chart

Table 1 presents the demographic and clinical characteristics of the study participants. The median time elapsed after hospital discharge was 6.7 months, ranging from 6.2 months for Omicron variant and control cohorts to 7.5 months in the Wuhan variant cohort. Before the matching process, the median age was 10.3 years (IQR 2.8–14.7) in the Wuhan variant cohort, 2.6 years (IQR 0.8–7.0) in the Omicron stain cohort, and 4.2 years (IQR 1.9–7.8) in the reference cohort. The share of severe patients ranged from 2.4% in the Wuhan variant cohorts to 6.2% in the Omicron cohort and 3.2% in the reference cohort.

Table 1 Demographic characteristics of study participants

The most common comorbidities in the pooled initial cohorts were food allergy (12.0%), intestinal (9.3%) and neurological (8.7%) problems, and atopic dermatitis/eczema (7.9%). However, in the matched sets, the prevalence of these comorbidities varied significantly. Higher prevalence of neurological problems (5.8%), food allergy (4.7%), and lower prevalence of malnutrition (4.6%) were observed in Wuhan variant cohort when compared with the reference cohort. Higher prevalence of atopic dermatitis/eczema (9.7%) and neurological problems (+ 5.4%) was found in Omicron variant cohort when compared with the reference cohort.

Post-COVID-19 condition/post-infection sequalae incidence

Figure 2 presents the incidence rates of PCC/post-infection sequalae and manifestations of symptoms in matched cohorts, while Fig. 3 depicts the well-being assessment results and incomplete recovery in matched cohorts. Detailed information is available in Table S3.

Fig. 2
figure 2

Incidence of post-COVID-19 condition manifestations in Wuhan variant cohort, Omicron variant cohort, and post-infection sequalae manifestations in reference cohort

Fig. 3
figure 3

Event rate for incomplete recovery and emotional and behavioural changes in Wuhan variant cohort, Omicron variant cohort, and post-infection sequalae manifestations in reference cohort. VAS, visual analogue scale

For the Wuhan variant cohort, the incidence of PCC was 89.6 cases per 1000 person-months (95% CI 64.4–120.4), compared to 11.0 cases of post-infection sequalae per 1000 person-months (95% CI 4.8–19.7) in the reference cohort (p < 0.001). Conversely, the difference between the matched Omicron variant cohort (14.1 cases per 1000 person-months, 95% CI 6.8–23.7) and the reference cohort (10.1 cases per 1000 person-months, 95% CI 3.4–18.2) was not statistically significant (p = 0.45).

When comparing the Wuhan variant cohort (87.0 cases per 1000 person-months, 95% CI 61.0–118.9) to the Omicron variant cohort (15.7 cases per 1000 person-months, 95% CI 6.5–26.7), a significant difference in the PCC event rate was noted (p < 0.001).

Post-COVID-19 condition/post-infection sequalae manifestations

The Wuhan variant cohort displayed a significantly higher incidence of dermatological, fatigue, gastrointestinal, sensory, and sleep manifestations of PCC than similar post-infection sequalae manifestations in the reference cohort. The largest difference in manifestation rates was observed for fatigue (31.4 cases per 1000 person-months, 95% CI 18.7–48.0 vs. 2.4 cases per 1000 person-months, 95% CI 0.0–6.0) and sleep problems (24.5 cases per 1000 person-months, 95% CI 13.6–38.8 vs. 3.6 cases per 1000 person-months, 95% CI 0.0–8.4).

For the Omicron variant cohort, no statistically significant differences in any symptom manifestations were found when compared with the reference cohort.

Comparing the Wuhan and Omicron variant cohorts, significant differences in incidence were observed for dermatological, fatigue, gastrointestinal, sensory and sleep manifestations. The largest differences were observed in incidence of dermatological manifestations in Wuhan vs Omicron variant cohorts (21.9 cases per 1000 person-months, 95% CI 10.7–35.8 vs. 0.0 cases per 1,000 person-months, 95% CI 0.0–3.8) and fatigue (28.9 cases per 1,000 person-months, 95% CI 16.1–45.3 vs. 2.5 cases per 1,000 person-months, 95% CI 0.0–6.5).

The rate of events for incomplete recovery was significantly higher in the Wuhan variant cohort (13.9%, 95% CI 9.6–18.7%) than in the reference cohort (3.6%, 95% CI 1.4–6.3%). For the Omicron variant cohort, this rate was not significantly different from the reference cohort, but it was significantly lower than the Wuhan variant cohort (7.0% (95% CI 3.8–10.7%) vs. 13.3% (95% CI 8.8–18.2%)) (Additional file 1: Table S3).

Post-COVID-19 condition/post-infection sequalae emotional and behavioural status

Emotional and behavioural changes also differed significantly between the Wuhan variant and reference cohorts, with higher rates observed in the Wuhan variant cohort for behavioural changes (10.4%, 95% CI 6.7–14.9 vs. 4.2%, 95% CI 1.6–7.1%), emotional problems (12.9%, 95% CI 8.7–17.5% vs. 5.8%, 95% CI 3.1–9.2%), fatigue (16.0%, 95% CI 11.4–21.1% vs. 7.1%, 95% CI 4.1–10.6%), communicating with friends personally (5.4%, 95% CI 2.2–9.4% vs 1.4%, 95% CI 0.0–3.3%), worsening relationships (5.7%, 95% CI 2.7–9.2% vs. 0.8%, 95% CI 0.0–2.5%), decline in school attendance (11.5%, 95% CI 6.9–16.8% vs. 4.5%, 95% CI 1.6–8.1%), and sleep problems (12.2%, 95% CI 8.1–16.8% vs. 5.4%, 95% CI 2.9–8.5%).

In contrast, the Omicron variant cohort only showed a significant difference from the reference cohort with regard to lower school attendance (9%, 95% CI 5.1–13.6% vs. 3.8%, 95% CI 1.4–7.2%).

When the Wuhan and Omicron variant cohorts were compared, significant differences in rates of events for emotional behavioural changes were observed for fatigue (14.9%, 95% CI 10.4–19.8% vs. 7.4%, 95% CI 4.1–11.2%), decreased physical activity (9.3%, 95% CI 5.5–13.8% vs. 2.7%, 95% CI 0.8–5.2%), and deterioration of relationships (4.9%, 95% CI 2.4–8.4% vs. 0.9%, 95% CI 0.0–2.4%).

Incidence (per 1000 person-months (95% confidence interval)) of post COVID-19 condition and different manifestations in exposure and reference groups were matched by age and sex. Statistically insignificant results are presented in white; statistically significant difference of p < 0.05 between the groups is highlighted in light green; statistically significant difference of p < 0.01 between the groups is highlighted in light yellow; statistically significant difference of p < 0.001 between the groups is highlighted in light grey.

Prevalence of incomplete recovery is presented as event rate (95% confidence interval). Statistically insignificant results are presented in white; statistically significant difference of p < 0.05 between the groups is highlighted in light green; statistically significant difference of p < 0.01 between the groups is highlighted in light yellow; statistically significant difference of p < 0.001 between the groups is highlighted in light grey.

Discussion

The importance of investigating the impact of COVID-19 on CYP has been previously emphasised, particularly in light of new variants of the virus circulating in the population [12]. However, current research on the long-term effects of COVID-19 in CYP is limited and is associated with methodological limitations, making it difficult to distinguish between the effects of the virus and the impact of social restrictions [21]. There is also a lack of studies that compare the effects of COVID-19 in CYP with a control group of individuals following other viral infections. Experts previously highlighted the importance of a control group in studies following SARS-CoV-2 infection in CYP, using standardised case definitions [4]. This study aimed to address these gaps by estimating the incidence of PCC and the impact on well-being among CYP and young people, comparing COVID-19 from different variants with a control cohort of CYP hospitalised with non-COVID-19 infections. We found that CYP hospitalised during Wuhan variant wave had a significantly higher incidence of PCC and incomplete recovery than incidence of post-infection sequalae in the controls. Emotional and behavioural changes were also observed in Wuhan cases, including behaviour changes, emotional problems, excessive fatigue, worsening of relationships, decline in school attendance, and problems with sleep. In contrast, CYP admitted to the hospital with COVID-19 during the Omicron variant wave did not differ significantly from CYP hospitalised with respiratory or gastrointestinal infections in terms of long-lasting outcomes, symptom manifestation, and recovery. However, they were shown to have distinctively more problems with school attendance.

The incidence of PCC was significantly higher in the cohort infected by the Wuhan variant compared to the post-infection sequalae incidence in reference cohort, with a disparity of 77.5 cases per 1000 person-months. Conversely, the incidence rate in the Omicron variant cohort did not show a statistically significant difference from the reference cohort. The striking difference between the Wuhan and Omicron cohorts indicates the potential difference in virulence and resultant clinical outcomes associated with the distinct viral variants. Results are somehow reassuring as Wuhan variant is very uncommon in population now, and findings allow to hypothesise that Omicron variant as well as SARS-CoV-2 circulating in the population, originating from Omicron [22], is unlikely to differ in terms of consequences compared with common seasonal viruses significantly. Such discrepancy in long-term outcomes between different variants might be attributable to the variations in the viral structure, particularly in the spike protein [23], which is known to affect the virus’s ability to bind to and enter host cells. Future studies focusing on the specific biological mechanisms behind these differences could provide more definitive answers.

When assessing symptom manifestations of PCC, the Wuhan cohort presented a markedly higher incidence of dermatological, fatigue, gastrointestinal, sensory, and sleep disturbances than the post-infection sequalae manifestations in reference cohort. Among these, fatigue and sleep problems demonstrated the largest difference in manifestation rates. Such findings corroborate the existing literature, where fatigue and sleep issues have been consistently reported as prominent manifestations of PCC, particularly in the context of the Wuhan variant. In contrast, the Omicron cohort showed no significant difference in symptom manifestations compared to the reference cohort, further highlighting the potential differential pathogenicity of the viral variants. The largest differences between the Wuhan and Omicron cohorts were found for dermatological manifestations and fatigue, again underscoring the burden of these symptoms in CYP infected with the Wuhan variant. In contrast, the Omicron cohort showed no significant difference in symptom manifestations compared to the reference cohort, further highlighting the potential differential pathogenicity of the viral variants. The largest differences between the Wuhan and Omicron cohorts were found for dermatological manifestations and fatigue, again underscoring the burden of these symptoms in CYP infected with the Wuhan variant.

Our findings also revealed that CYP infected with the Wuhan variant had significantly higher rates of incomplete recovery and emotional-behavioural changes, such as behavioural changes, emotional problems, fatigue, worsening relationships, decline in school attendance, and sleep problems, as compared to the reference cohort. These results suggest the long-lasting and pervasive impact of COVID-19 on health and well-being of CYP, especially those infected by the Wuhan variant. It is crucial for healthcare providers to recognise these long-term sequelae and to develop appropriate strategies for ongoing support and management.

The Omicron cohort showed a significant difference from the reference cohort only in terms of lower school attendance. This difference could be attributed to factors such as more conservative return-to-school policies during the Omicron wave. Additionally, parental concerns about their child’s health after infection could have contributed to lower school attendance. However, it does imply a potential impact on the children’s educational attainment and warrants further exploration.

One of the major strengths of this study is the inclusion of a control group of CYP who were admitted to the hospital with non-COVID-19 infections. This allows us to estimate the relative burden of long COVID-19 and to compare the outcomes of CYP with COVID-19 infection to those of CYP with other infections. Both cases and controls in this study were exposed to infection-related hospitalisations, which reduces the potential for bias due to differences in health-seeking behaviour. Another strength of the study is the distinction made between the early Wuhan variant and the later Omicron variant of the virus. By comparing the outcomes of CYP infected with these two variants, we are able to explore the differences in the long-term effects of the two variants of the virus on children’s health. The study utilised the International Severe Acute Respiratory and Emerging Infection Consortium (ISARIC) long-term follow-up study case report forms (CRFs) for CYP. This standardised data collection method ensures that the data collected are consistent and comparable across different study sites. In addition, the study used the WHO post-COVID-19 condition definition, which provides a standardised definition of the condition that allows for more accurate and consistent identification of cases.

However, the study also has several limitations. First, all the cohorts represent hospitalised population which do not allow to extrapolate findings on all CYP. The follow-up period has been restricted to 8 months after hospital discharge. This may not be sufficient to capture the full extent of long-term effects of COVID-19 in CYP. Second, the study is limited in its ability to control for extrinsic factors and residual confounding, such as vaccinations and other viral infections, that may have affected the study outcomes. Third, the criteria for hospital admission may have changed over time, and cases across waves can have different acute severity. Fourth, the chronological association of the waves of study with COVID-19 variants was established without laboratory confirmation, and relied upon available open data sources, which may have introduced bias into the study. No patients were recruited during Delta variant wave. Fifth, the study relied on proxy-reported responses from the parents for the follow-up interviews, which may be affected by recall bias and non-blindness of respondents. Additionally, parental bias towards the health of their CYP may have influenced the accuracy of the data collected. Finally, there is a risk of potential selection bias, with those with symptoms more likely to agree to the survey and thus potentially overestimating the prevalence of post-COVID-19 condition. At the onset of the COVID-19 pandemic, when the Wuhan variant was prevalent in Russia, the only facility in Moscow equipped for the hospitalisation of infected children was the Bashlyaeva Children’s City Clinical Hospital. As the pandemic evolved and other variants, including Omicron, became dominant, the G.N. Speransky Children’s City Clinical Hospital No. 9 COVID-19 wards were set-up in response to increasingly large number of patients expected as a part of this wave. Consequently, it was not possible and feasible to gather data from just one hospital. However, given that both hospitals used identical data collection methodologies, the batch effect in the data is unlikely.

Conclusions

Overall, the findings of this study underscore the importance of continued monitoring and support for CYP who have been hospitalised with COVID-19 infection as well as for those who may be experiencing long-term effects of the virus. Future studies should aim to elucidate the underlying biological mechanisms behind the observed differences in clinical outcomes associated with different viral variants. The results offer some reassurance, given that the Wuhan variant is now quite rare in the population. These findings permit us to conjecture that the Omicron variant, as well as the SARS-CoV-2 variants currently circulating, is unlikely to diverge significantly from common seasonal viruses in terms of their consequences. Policymakers, healthcare providers, and families must work together to ensure that CYP are provided with the resources and support they need to recover fully and to continue to thrive despite the challenges posed by the COVID-19 pandemic.