Background

The COVID-19 pandemic affects many public health (PH) fields. Besides disease rates, persistent symptoms (Long-COVID) and death, impacts on mental health aspects are essential with regard to short-term and long-term well-being [1, 2]. To keep incidence rates as low as possible, governments used various combinations of social isolation strategies [3,4,5,6]. However, compared to adults, children and adolescents (CA) represent a particularly vulnerable group and tend to be affected differently by the pandemic and social distancing policies such as school closures. On the one hand, the short-term health effects of COVID-19 infections on CA without comorbidities seemed to generally be mild, e.g. clinically mild disease or asymptomatically infection [7,8,9]. On the other hand, however, a growing number of studies point to a high mental health burden among the youth during the pandemic; particularly regarding anxiety [7, 10,11,12] as the most prevalent mental health disorder among young people in Europe and the leading cause of years lived with disability among mental health conditions [13, 14].

In studies of earlier pandemics and disease-related quarantine, associations between loneliness and isolation with mental health problems such as anxiety are already well described in CA [15, 16]. Many of the exposed children began using mental health services [17, 18]. Hence, CA seemed to be particularly vulnerable to isolation or loneliness which could lead to an increase in mental health impacts through COVID-19 containment measures [11, 19]. According to UNICEF pre-pandemic estimates, the prevalence of mental disorders for boys and girls aged 10–19 in Europe was 16.3% in 2019 [16]. Further, the World Health Organization (WHO) described anxiety disorders as the most prevalent in this age group with profound consequences on physical and mental health in later life [20].

Within the ongoing COVID-19 pandemic, the number of primary studies regarding the effects of the pandemic on anxiety among CA is rapidly increasing. However, the existing studies provide partially contradictory findings [21, 22], have different results regarding the magnitude of anxiety and used heterogeneous diagnostic instruments [10, 12]. Up to now existing systematic reviews primarily focus on the general population [23] or the global prevalence of anxiety among CA [8, 10]. Since the COVID-19 pandemic confronted populations in Europe with several waves and national state governments reacted with heterogenous contact restrictions like lockdowns, school/kindergarten closures, quarantine orders, decreased peer interactions etc., a summary within a European context could allow a differential view on a potential increase of anxiety symptoms during the course of the pandemic. A recent published meta-analysis of our research group regarding the changes of depression symptoms in CA in Europe during the COVID-19 pandemic highlights an overall increase as well as a dose–response relationship of restriction measures and general depression symptoms [24]. At present, no systematic review specifically addresses the changes during COVID-19 pandemic for anxiety among CA on the European continent. Therefore, the aim of this systematic review (SR) is to critically synthesise the evidence regarding the impact of the COVID-19 pandemic on anxiety symptoms among CA in Europe compared to a pre-pandemic baseline. To this end, the proposed systematic review will address the following research objectives:

  1. 1.

    Estimation the change of general anxiety symptoms and clinically relevant anxiety rates among CA in Europe before and during the COVID-19 pandemic in total and pre-defined subgroups (in particular regarding age and gender);

  2. 2.

    Evaluation the impact of COVID-19-related measures stringency on general anxiety symptoms and clinically relevant anxiety rates, using the validated Oxford COVID-19 Stringency index [6];

  3. 3.

    Identification of vulnerable groups among CA;

  4. 4.

    Outline the clinical relevance of the available results.

In this protocol of the planned SR, the used methods will be described.

Methods

The SR was registered on the International Prospective Register of SR (PROSPERO; CRD42022303714) [25]. This protocol is prepared in accordance with the Preferred Reporting Items for Systematic review and Meta-Analysis Protocols (PRISMA-P) statement [26] (Appendix 1); the PROSPERO record will be updated regularly. Any deviations from the protocol will be noted in the final SR. The final SR will be conducted according to updated PRISMA statement [27] and will follow the guidelines of the actual Cochrane Handbook for SR [28] and the Joanna Briggs Institute (JBI) Manual for Evidence Synthesis [29], as far as possible.

Eligibility criteria and information

Based on the examination of an environmental exposure, namely the COVID-19 pandemic, the research question was formulated within a Population-Exposure-Comparison-Outcome (PECO) scheme [30], see Table 1.

Table 1 Research question according to the Population-Exposure-Comparison-Outcome (PECO) scheme

The eligibility criteria, divided into inclusion and exclusion criteria, were conducted in accordance to the PECO scheme and are presented in Table 2 with further categories.

Table 2 Inclusion and exclusion criteria according to the Population-Exposure-Comparison-Outcome (PECO) scheme

Search strategy

The search strategy includes the following databases: MEDLINE (PubMed), Embase, PsycINFO, Cochrane Central Register of Controlled Trials (CENTRAL), Web of Science and WHO COVID-19 database (also including pre-prints). Also, study registries (e.g. PROSPERO), relevant grey literature (e.g. government reports), related articles, congress submissions, websites of key organisations, reference lists of included articles and previous published reviews will be screened.

Translating the research question into a search string was performed in accordance with the guideline for Peer Review of Electronic Search Strategies (PRESS) [34]. Development of the database specified search strings occurred using validated or recommended search filters where possible (e.g. for identifying pediatric studies in PubMed [35], search strings for COVID-19 records in PubMed [33, 36], search filters offered by the InterTASC Information Specialists’ Subgroup Search Filter Resource [37]; in parts adapted). Both free-text and subject headings (e.g. Medical Subject Headings [MeSH], Emtree) will be used in combination with the adequate use of the Boolean operators ‘AND’ and ‘OR’. The search strategy was peer reviewed by an expert in conducting SR in health sciences according to the evidence-based checklist PRESS Evidence-Based Checklist [34] before the searches will be run to ensure a high-quality search strategy (search submission and peer review assessment are attached in Appendix 2 and 3). The draft search strategy for PubMed is presented in Appendix 4.

Study records

Study selection, in accordance to the inclusion and exclusion criteria in Table 2, will be conducted in three steps: (1) duplicates removal; (2) screening at title and abstract level; and (3) screening the full text. Duplicate removal will be conducted with assistance of the recommended EPPI-Reviewer Web software [38]. Two independent reviewers (HLW, ID) will screen the studies in step (2) and (3); any discrepancies will be discussed and, if necessary, resolved by a third author (MB). Several publications with an equal or similar study population and equal measurement points during the pandemic will be considered once; studies with, e.g. smaller sample sizes will be excluded. Studies of the same study population with various pandemic measurement points will be considered individually. The reasons for study exclusion in step 3 will be reported in the Appendix of the final study. All screening procedures will be presented using the PRISMA flow diagram [27]. Data management will be organised by the software Citavi 6.

Data collection

Data extraction will be conducted by two review authors (HLW, ID) using specially developed tabular data collection forms (‘Characteristics of included studies’ and ‘Summary of effect estimates’ tables are planned) [28]. These forms will be pilot tested with about one third of the included studies by both authors transferring the data independently from the studies and discussing possible discrepancies. Remaining data extraction will be completed by one reviewer (HLW) and verified by the other (ID). Any discrepancies between the two reviewers will be discussed extensively and, if necessary, resolved by a third author (MB). Study authors will be contacted in case of uncertainties regarding the published data. Further, several authors will be contacted to provide additional unpublished study data to expand the data basis and, if possible, to be able to perform (more detailed) subgroup analyses (e.g. gender- or age-stratified data).

For each included study the information of five categories (study information, population and setting, COVID-19 determinants, pre-pandemic baseline, outcomes) will be extracted for an overview of the study characteristics in a ‘Characteristics of included studies’ table, as shown in Table 3.

Table 3 Planned data for extraction

The primary outcomes are

  • General anxiety symptoms

  • Clinically relevant anxiety rates

General symptoms are defined as the general measurement of anxiety symptoms (mostly continuous measurements). Clinically relevant rates are defined as measurements with a clinical cut-off or in a medical setting (ICD-reports). Changes will be calculated as differences between scores with standard deviation (general symptoms) or ratios (clinically relevant anxiety symptoms; see “Data synthesis” section for further information). We assume that definitions and diagnostic instruments will vary across studies, so a wide range of definitions, measurement instruments and symptom reporter will be accepted. No restrictions will be set on the number of measurements during the COVID-19 pandemic. If data of pre-pandemic measurements will be available at multiple time points, only data of the latest possible time point will be used for effect estimate calculation. If present, both unadjusted and adjusted effects estimates will be extracted, whereby adjusted values will be preferred in the case of pooling (see “Data synthesis” section). The effect estimate will be provided with a 95% confidence interval (CI). No second outcomes will be considered.

We will put a special focus on the impact of pandemic-related restrictions on anxiety at CA (research objective 2) by using the validated Oxford COVID-19 Stringency Index [6]. The index is calculated as a score from nine categories: school closures, workplace closures, cancellation of public events, restrictions on public gatherings, closures of public transport, stay-at-home requirements, public information campaigns, restrictions on internal movements and international travel controls; it ranges from 0 (no restrictions) to 100 (most stringent restrictions). We will calculate for each study measurement period a mean score. Further, we will define three cut-off points in accordance with the COVIDSurg Collaborative [39]: light restrictions (index < 20), moderate lockdowns (index 20–60) and full lockdowns (index > 60). In addition, we plan to consider specifically the School Closure Index (also included in the Oxford COVID-19 Stringency Index) which records closings of schools. The range of the School Closure Index comprises 0 to 3: 0 for no measures, 1 for recommended closings or changes in school operations, 2 for partially school closures and 3 for closing of all school levels [6, 40]. Therefore, we will define the following cut-offs points: no or few alterations compared to a pre-COVID-19 situation (index < 2) and partial or full school closure (index ≥ 2) [24].

Risk of bias assessment

Based on preliminary searches and previously published systematic reviews [8, 10, 24], we expected mainly observational studies. Therefore, two review authors (HLW, LMP) will independently assess the risk of bias applying the current launched instrument ‘Risk Of Bias In Non-randomised Studies of Exposure’ (ROBINS-E) [41]. The ROBINS-E development followed the standards of the ‘Risk Of Bias In Non-Randomised Studies of Interventions’ (ROBINS-I) tool, in which RoB assessments are made within a set of ‘signalling questions’ within seven bias domains, including (1) risk of bias due to confounding, (2) risk of bias arising from measurement of the exposure, (3) risk of bias in selection of participants into the study, 4) risk of bias due to post-exposure interventions, 5) risk of bias due to missing data, 6) risk of bias arising from measurement of the outcome, and 7) risk of bias in selection of the reported result [42]. Judgments for each RoB item could be ‘low’, ‘some concerns’, ‘high RoB’, or ‘very high RoB’. Also, an overall judgment regarding the total RoB quality will occur. For further analysis we aim to differentiate between studies classified as having low/some concerns (= low) RoB and high RoB/very high RoB (= high) RoB (see “Data synthesis” section for further information). RoB assessments will be visualised as ‘traffic light’ plots of the domain-level judgements for each individual result and ‘weighted bar’ plots of the distribution of risk-of-bias judgements within each bias domain, using the tool robvis [43].

Data synthesis

The ‘Summary effect estimates’ tables will be presented and described for each study and outcome estimate, grouped by country and RoB (see ‘Data collection’ section). We will conduct the decision to combine (or not) the results of the individual studies (meta-analysis) in accordance with the assessment of clinical and methodological heterogeneity by considering gender, age, pandemic-related restrictions and RoB [44]. Where data will be pooled using meta-analysis, we will assess the degree of statistical heterogeneity by visual inspection of forest plots and applying chi2 test and I2 statistic. A low p value within chi2 test (or a large chi2 statistic relative to its degree of freedom) will be considered as an indication of heterogeneity but will be interpreted with caution when only few studies can be included or the studies have small sample sizes [45]. We will consider an I2 value of 50% or more to represent substantial levels of heterogeneity, but will interpret this value in light of the size and direction of effects and the strength of the evidence for heterogeneity. Where heterogeneity will be found in pooled effect estimates an explanation of the source of heterogeneity will be pursued by subgroup analyses, sensitivity analyses and/or meta-regression analyses [45]. To conduct a meta-regression analysis a minimum of 10 studies should be available per examined covariate [45].

Based on preliminary analysis, it is anticipated that general anxiety symptoms will be reported as continuous outcomes. If studies will use different outcome measures to assess general anxiety symptoms, the standardised mean difference (SMD) with a 95% CI will be used as a summary statistic (recommend by the Cochrane Handbook [46]). It can be further assumed that clinically relevant anxiety rates will be reported as dichotomous outcomes (odds ratio or risk ratio). The meta-analyses of both the continuous and dichotomous data will be performed based on the random-effect model (due to anticipated between-study heterogeneity) using the inverse-variance method with the ‘DerSimonian and Laird’ approach (to minimise the imprecision of the pooled effect estimate). If standard deviations are missing, we will calculate them from p values, CIs or standard errors, if available or contact the study authors [28]. If data for general anxiety symptoms will be reported as dichotomous data, it is planned to homogenise these data [47]. For the expression of dichotomous data as SMD the recommended formula by Chinn [45, 48] will be used. Results from adjusted analysis will get preference in the meta-analyses to provide more careful estimates. When both parent and self-rated data will be provided the self-rated data will be selected for meta-analysis [49]. Results of the meta-analysis will be illustrated using forest plots.

Study data will be categorised (e.g. in an excel-matrix) to decide which data sets are similar enough for a quantitative pooling, which subgroups are feasible and which data types (continuous or dichotomous data) are available (considering clinical and methodological heterogeneity). RoB will be given special attention by performing separate calculations, if possible, for low and high RoB studies as well as by calculating the overall summarising effect. The following subgroup analyses are planned, assuming that sufficient data from low RoB studies are available:

  • Outcome: general anxiety symptoms and clinically relevant anxiety rates;

  • Demographic: gender and age;

  • Contextual: full versus moderate lockdown (Oxford Stringency Index > 60 versus ≤ 60), school closures versus no school closures (School Closure Index ≥ 2 versus < 2), social status (high versus low social status) and education (high versus low educational level).

For research objective number no. 4 a medical interpretation of the change of clinically relevant anxiety rates will occur considering further relevant clinical aspects (ensured by JMF and MB).

The analyses will be conducted with Review Manager 5.4.1 [50] and/or R Studio 4.2.1 [51].

If a statistical pooling (meta-analysis) appears to be inappropriate, e.g. if study designs differ considerably, a tabular, graphical or narrative synthesis will be provided [52].

Sensitivity analysis

To determine whether the pooled results are robust, sensitive analyses will be conducted. This includes the repetition of the meta-analysis with different comparison categories [28, 53], planned are comparisons between low and high RoB studies, different study types, converted/unconverted and adjusted/unadjusted effect estimates (removing those studies with converted/unadjusted effect estimates).

Publication bias

The systematic review will also address RoB due to missing results in a synthesis. Graphical and statistical methods will be used to provide information about the extent of missing results. Funnel plots will be generated and visually interpreted for signs of asymmetry, which could indicate that publication bias is present [54, 55]. When at least 10 studies of different sample size will be included in a meta-analysis, the Eggers’ test will be used to test for funnel plot asymmetry[56].

Certainty of evidence

The certainty of evidence for each outcome will be evaluated by using the ‘Grading of Recommendations Assessment, Development and Evaluation’ (GRADE) approach adapted to the use of non-randomised studies [57]. Five domains for downgrading the certainty of evidence are considered in GRADE: RoB, inconsistency, indirectness, imprecision, and publication bias. Also, an upgrading is possible through consideration of three further domains: large effects, dose response, and opposing plausible residual bias and confounding [58]. The use of the RoB instrument for non-randomised studies will allow to start at ‘high’ initial certainty of evidence within GRADE [53]. For all domains, we will follow a transparent approach of applying detailed criteria for downgrading or upgrading. GRADE finally specifies four levels of the certainty for a body of evidence for each outcome: high (further research is very unlikely to change our confidence in the estimate of effect), moderate (further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate), low (further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate), or very low (very uncertain about the estimate of effect). The certainty of evidence will be report for each outcome in a ‘Summary of findings’ table, supported by evidence profiles with more detailed explanations [58].

Discussion

This protocol aims to provide a description of the research design and used methods of the SR addressing the real impact of the COVID-19 pandemic on anxiety among CA in Europe in contrast to many clinical and epidemiological observations without pre-pandemic baseline. The results of the SR will provide relevant evidence in order to address the gap in the literature with a high-quality methodological approach.

As a strength of the systematic review it can be pointed out that only methods and instruments that have already been tested and approved will be used. Although the study design will be not limited, it can be assumed that in particular observational studies will be included in the final systematic review; this might restrict the certainty of evidence. At present, there is no applied guideline for the preparation of a systematic review for observational studies or other study designs besides RCTs; however, some guidelines are in preparation [59,60,61]. The Cochrane Handbook [28] is often cited as the ‘gold standard’ for preparing SR. It contains important information on the preparation of clinically relevant search strategies (e.g. PRESS review), the synthesis of the results and the subsequent assessment of the RoB and the certainty of evidence (GRADE), but offers only few descriptions of how to prepare and conduct a SR with observational studies. In addition, the JBI provides comprehensive guides for conducting a variety of review types, including SR, scoping reviews, and umbrella reviews to address health-related questions [62].

Potential limitations of this SR could be the heterogeneity of studies, methodological approaches, and probable reduced number of studies due to urgency of a pre-pandemic baseline (see Table 2). Nonetheless, given a disparate and partly contradictory state of research, based on heterogeneous diagnostic instruments, age groups, pandemic situation, and country, the SR will provide a systematic assessment of the impact of the COVID-19 pandemic and its social distancing policies on anxiety among CA in Europe.