Introduction

Mental Health (MH) problems constitute the largest health concern for children and youth worldwide in the twenty-first century [1]. Representing MH states are two psychological responses: negative, which consist of unpleasant feelings or emotions and symptoms related to clinically diagnosed ill-being [2, 3]; and positive, which consist of affective states and psychological well-being beneficial to one’s life [2, 4,5,6,7,8]. MH is influenced by many factors [9], including physical activity (PA), referred to as any type of physical movement that increases energy expenditure [10]. It has been documented that regular PA participation is protective for MH by preventing and managing negative psychological responses [10, 11]. Engaging in an adequate level of PA regularly is important for promoting people’s physical and mental health, especially for children and youth [12]. However, the majority of young people are not meeting WHO’s PA guideline (i.e., an average of 60 min per day of moderate-to-vigorous PA intensity), which has resulted in a worldwide critical public health issue [13]. Moreover, a number of studies have demonstrated that physical inactivity deteriorates as age increases [14, 15].

PA and MH are theoretically and empirically associated, and the relation is bidirectional [16]. For example, Sampasa-Kanyinga [17] reported that a low level of PA is closely related to high levels of negative psychosocial responses (e.g., anxiety, depression, stress, negative affect, and distress) among children and youth. Not surprisingly, other studies found that a high level of PA is related to positive outcomes, such as well-being, self-esteem, self-concept, and resilience [16,17,18]. As early as 2011, a meta-analysis based on randomized controlled trials (RCTs) revealed that increased levels of PA are significantly associated with improved MH among children [19]. In 2019, another meta-analysis reported that the effects of PA on psychological ill-being (effect size = 0.130, p = 0.007) and psychological well-being (effect size = 0.189, p = 0.001) among children and youth are small but significant [16]. Thus, as improvements in MH reflect fewer negative psychological responses and more positive ones [20, 21], both positive and negative psychological responses should be considered when facilitating a comprehensive understanding of the relation between PA and MH.

The coronavirus disease (COVID-19) is an ongoing pandemic caused by a novel coronavirus, the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) [22]. Many countries implemented physical distancing measures, national lockdowns, and travel restrictions to control the spread of COVID-19 during the first two years of the outbreak [23]. As a protection for children and youth, restrictions to physically attend a majority of schools and universities were implemented worldwide, with an estimated 1.5 billion students transitioning to online learning [24]. These school restrictions and other social behavioral adaptations (e.g., social distancing, isolation) severely impacted the 24 h lifestyle of many children and youth [25], resulting in decreased opportunities for PA and increased sedentary behavior [26].

Moreover, the outbreak of COVID-19 has been accompanied by significant global MH challenges [27]. For example, an increasing number of studies have reported higher levels of anxiety, depression, and stress among children (ages 6–12 years) who experienced family isolation and school closures during COVID-19 [28,29,30]. Other studies have shown that people who were able to maintain more total time in moderate to vigorous PA were 12–32% less likely to experience depressive symptoms and 15–34% less likely to experience anxiety during COVID-19 [28, 31,32,33]. Although several existing reviews published in 2021 and 2022 demonstrated the relation between PA and MH among children during COVID-19 [28, 31, 33,34,35,36], they focused mainly on specific negative psychological responses (i.e., anxiety, depression and stress) with only a limited number of eligible studies being included. Additionally, the impacts of COVID-19 on PA and MH appear to be greater among children and youth with disabilities or chronic conditions (DCC) than their peers without DCC [28, 37]. However, within the COVID-19 context, a comprehensive understanding of the relationship between PA and MH among children and youth, including those with DCC, is still lacking. The relationship between PA and MH may be stronger in children and youth during COVID-19 than before COVID-19; and the pattern and extent of this relationship may vary by age range and disability status.

According to the theory of positive psychology, an adversity can enhance the ability of certain populations to cope positively and creatively with stress and distress [38]. During the COVID-19 pandemic, some people with an optimistic mindset counteracted negative impacts by making the most of limited resources and by being physically active and exercising at home [38, 39]. It is important to encourage children and youth to overcome the negative impacts of the COVID-19 pandemic by using methods, such as PA participation, that foster positive emotions and optimism [40]. Therefore, the purpose of this systematic review and meta-analysis was to evaluate the association of PA with MH in terms of both negative and positive psychological responses among children and youth within the context of COVID-19. Specifically, we asked two questions: (1) Did close relationships between PA and MH outcomes occur among children and youth during COVID-19?; and (2) Did the pattern and magnitude of such relationships vary by age range and disability status? We hypothesized that: (1) there would be close relationships between PA and MH outcomes in terms of both negative and positive psychological responses among children and youth during COVID-19; and (2) the pattern and magnitude of the relationships would vary in different age ranges and health conditions (i.e., with and without DCC). Information gained in this systematic review and meta-analysis will not only facilitate a better understanding of the relation between PA and MH, but also inform new research on how to promote PA and MH among children and youth during any future COVID-19-type pandemic.

Method

The conduct and reporting of this review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines [41]. Registration for this protocol was completed on the Prospero database (reference number: CRD42022303342).

Search strategy

Four electronic databases (i.e., Embase, PsycINFO, PubMed, and Web of Science) were systematically searched to identify relevant studies published between January 2020 and December 2021. This end date was chosen since the impact of COVID-19 on daily life, including isolation policies, gradually diminished in many countries starting in the second half of 2021 [42,43,44,45]. The search items were grouped into four components: (1) physical activity (physical activity OR exercise OR sport), (2) mental health (mental health OR mental problem OR mental illness OR mental disorder OR well-being OR depression OR anxiety OR stress OR happiness), (3) age group of interest (child OR adolescent OR youth), and (4) COVID-19 (COVID-19 OR SARS-CoV-2). An example of the search strategy can be found in Additional file 1: Table S1.

Eligibility criteria

Inclusion criteria were: (1) participants were children and youth aged 2 to 24 years with and without DCC; (2) studies reported a potential relationship between any type of PA and at least one MH outcome in the context of COVID-19, which enabled evidence for specific correlates to be determined; (3) studies used a cross-sectional, longitudinal, or experimental design; and (4) articles were published in a peer-reviewed journal in English. Qualitative studies, reviews, books, dissertations, conference proceedings, commentaries, and studies without full-text were excluded.

Study selection

After conducting the initial search and removing all duplicates, two reviewers (BL, JY) were trained to independently perform title/abstract and full-text screening for the inclusion of records with a yes, unsure, or no approach. Inter-rater reliability between the two reviewers at the two screening phases was calculated using the κ statistic. κ values between 0.60 and 0.74 were deemed as having good agreement, and values > 0.75 were deemed as having excellent agreement [46]. Any discrepancy between the two reviewers at any stage was jointly reviewed and discussed with a third reviewer (JJY) until a consensus was achieved.

Quality assessment

Two reviewers (BL, JY) independently rated the methodological quality of all the included studies. Inter-rater reliability was calculated using the intraclass correlation coefficient. To assess observational studies employing cross-sectional and longitudinal designs, we used the modified Newcastle–Ottawa Scale [47], which included seven items consisting of three quality components: selection (4 items, maximum 5 points), comparability (1 item, maximum 2 points), and outcome (2 items, maximum 3 points). Each criterion received zero to two points and summed to a final score (maximum 10 points). The methodological quality of a specific study was considered high if it was scored as 9 to 10, medium if scored as 5 to 8, and low if scored as 4 or less. To assess studies involving RCTs, we used the Effective Public Health Practice Project (EPHPP) Quality Assessment Tool [18], which included six quality components: selection bias, design, confounders, blinding, data collection, withdrawal and dropout. An overall rating was determined based on the ratings of the above constructs. RCT studies were categorized as high quality if no weak ratings were present, medium if there was only one weak rating, and low if there were two or more weak ratings.

Data extraction

All data from the included records were extracted by one reviewer (BL) and double checked by a second reviewer (JY). For each study, we coded the following bibliographic information: (1) first author’s name, (2) publication year, and (3) study location. We then coded the following variables from each study: (1) study design type (observational or experimental), (2) participant characteristics (age, sample size, and the number of girls and boys), (3) measures and outcomes of PA (e.g., custom questionnaire, the accelerometer) and MH, (4) main findings (relationship between PA and MH). Follow-up time was extracted for longitudinal studies, and experimental conditions and intervention components, were extracted for experimental studies.

Evidence synthesis

The relationship between PA and MH in different age ranges was determined by examining the percentage of studies that reported a statistically significant relationship [48]. Referring to previous studies [16, 50] and the definition of age ranges of youth by United Nations [49], participants were categorized as follows: age group 1 (2–5 years), age group 2 (6–12 years), age group 3 (13–18 years), and age group 4 (19–24 years). The relationship between PA and MH in participants with DCC was synthesized separately.

The coding rules were: MH outcomes with different terms but the same concepts were combined into a single identification factor. If a study examined the relationship between PA and one or more sub-dimensions of an MH outcome and most of the sub-dimensions had consistent associations with PA, this could be summarized as a general result of the association of PA with that MH outcome. Various statistical techniques (e.g., t-test, analysis of variance, linear regression, and logistic regression) were used in the included studies to estimate the associations between MH and PA outcomes. For each article, a statistically significant relationship between MH and PA was coded as “relevant” while a statistically insignificant one was coded as “irrelevant” [48]. A summary code of the relationship between each MH outcome and PA was obtained by dividing the number of findings supporting a specific MH outcome associated with PA by the total number of studies that examined the relationship between PA and that particular MH outcome. If 0–33% of the studies reported a statistically significant relationship between PA and MH, the result was categorized as “no association” (0). If 34–59% of the studies reported a statistically significant relationship between PA and MH, the result was categorized as “inconsistent” or “uncertain” (?). If 60–100% of the studies reported a positive or negative relationship between PA and MH, the result was coded as a “positive association” or “negative association” (+)/(–). Double summary codes were indicated as (00), (??), (+ +), or (– –) when three or more studies consistently supported no association, inconsistent, or positive or negative association, respectively. Evidence for the relationship between a specific MH outcome and PA was considered as sufficient only if such relationship obtained a double summary code.

Meta-analysis

We selected a minimum of ten studies investigating the same MH dimension or outcome for the meta-analysis. All values of the correlations between PA and MH were transformed into Fisher’s z scores and eventually used in the meta-analysis. The Fisher transformation of the correlation coefficient was chosen because the assumption of normality of the results obtained after the transformation would be more plausible [51]. The Fisher’s z transformation was a two-step process that first converted the relevant data describing the relationship between PA and MH (e.g., regression coefficient β, odds ratio, or other effect sizes) into a correlation coefficient r using Psychometrica (https://www.psychometrica.de/effect_size.html). This step was not necessary if the correlation coefficient r was reported in the study. Using the online platform Practical Meta-Analysis Effect Size Calculator, we transformed all correlation coefficient r values to Fisher’s z scores to obtain standardized data for conducting the subsequent meta-analysis (http://www.campbellcollaboration.org/escalc/html/EffectSizeCalculatorHome.php). When an article reported multiple quantitative values of the relationship between PA and MH (e.g., the correlation coefficient between PA and MH for boys and girls was reported separately), we calculated Fisher’s z scores for boys and girls separately based on the reported values and the corresponding number of participants. Fisher’s z scores of 0.12, 0.24, and 0.41 were interpreted as a small, medium, and large effect, respectively [36].

The Stata software version 16.0 (Stata Corp, College Station, TX, USA) was used to perform the meta-analysis. Owing to the anticipated heterogeneity across studies, we conducted a random-effects model. Heterogeneity was quantified with Q statistic and I2. Q test (p < 0.10) and an I2 > 75% indicated a significant high-level heterogeneity [52]. When a high-level heterogeneity appeared, subgroup analyses were performed. If the subgroup analyses could not resolve the high heterogeneity, appropriate univariate meta-regression analyses were performed to explore potential influencing factors, including age of participants, report of total and dimensions (duration, frequency, intensity) of PA, study design (interventional, observational), and disability population group [52, 53]. Additionally, funnel plots and Egger’s tests were performed to assess the risk of publication bias. Funnel plots provided a visual representation of the symmetric distribution of the studies. When the funnel plot was asymmetric, Egger’s tests were conducted to further assess the risk of publication bias. If the Egger’s test was significant (p < 0.05), the trim-and-fill method was used to adjust for the suspected publication bias and recalculate the pooled effect size [54, 55].

Results

Study selection

The search yielded a total of 3953 records. After removing duplications and screening for titles, abstracts, and full-texts, 58 records were included in this systematic review for categorization of variables, with 32 of them applicable for the meta-analysis. The inter-rater reliability between the two reviewers was good for the title and abstract screening (κ = 0.88) and full-text screening (κ = 0.82). The flowchart for the selection process is shown in Fig. 1.

Fig. 1
figure 1

Flow diagram of selection process. MH mental health, PA physical activity

Overview of studies

This systematic review included 58 articles conducted in 18 countries: China (n = 23), The United States (n = 7), Canada (n = 6), Italy (n = 3), Korea (n = 3), Brazil (n = 2), Germany (n = 2), The United Kingdom (n = 2), Bangladesh (n = 1), The Czech Republic (n = 1), Greece (n = 1), Hungary (n = 1), Iran (n = 1), Israel (n = 1), Lithuania (n = 1), Pakistan (n = 1), Poland (n = 1), and Saudi Arabia (n = 1). Of the included studies, 49 (84%) were published in 2021 (first quarter, n = 6; second quarter, n = 11; third quarter, n = 12; fourth quarter, n = 20), 51 were cross-sectional, four were experimental, and three were longitudinal. The number of participants ranged from 64 to 1,199,320, and their ages ranged from 2 to 24 years old. The most frequently studied age range was 13–18 years (n = 38), followed by 6–12 years (n = 22), 19–24 years (n = 16), and 2–5 years (n = 2). Additionally, 13 of the 58 included studies involved individuals with DCC (e.g., obesity, ADHD, mental illness).

Regarding the measurement of PA, 29 (50%) of the included studies used custom questionnaires, 27 (47%) used known questionnaires (e.g., International Physical Activity Questionnaire) with psychometric properties reported, and 4 (7%) used device based measures. The majority (n = 52, 90%) of included studies used validated questionnaires or scales to evaluate MH outcomes. The top five most frequently used measurements of MH included Patient Health Questionnaire-9 (n = 6), The Depression Anxiety Stress Scale (n = 6), Generalized Anxiety Disorder Scale-7 (n = 5), Center for Epidemiologic Studies Depression Scale (n = 5), and Strengths and Difficulties Questionnaire (n = 4). Additionally, eight included studies either directly selected sub-scale(s) from a validated measurement or developed a new scale based on an existing measurement with adaptations. For example, the COVID-19 Fear Scale was developed in one study as an adaptation of the SARS Fear Scale [59].

A variety of MH outcomes consisting of 36 negative responses and 16 positive responses were investigated in the included studies. Of the negative responses, depression (n = 23), anxiety (n = 21), and stress (n = 11) were investigated most frequently, followed by insomnia (n = 6), COVID-19 stress (n = 3), fatigue (n = 3), mental health problems (n = 3), negative affect (n = 3), anger (n = 2), boredom (n = 2), confusion (n = 2), distress (n = 2), emotional and behavioral problems (n = 2), loneliness (n = 2), tension (n = 2), tiredness (n = 2), aggressiveness (n = 1), being more stressed (n = 1), COVID-19 fear (n = 1), event-specific distress (n = 1), exercise dependence (n = 1), feeling more horrified (n = 1), feeling more apprehensive (n = 1), feeling more helpless (n = 1), feelings of loss of control (n = 1), having greater study pressure (n = 1), hyperactive-impulsive (n = 1), irritability (n = 1), inattention (n = 1), internalizing and functioning problems (n = 1), negative arousal (n = 1), pessimism (n = 1), perceived vulnerability (n = 1), psychosocial and behavioral problems (n = 1), post-traumatic stress disorder (PTSD) (n = 1), and sadness (n = 1). In contrast, general well-being (n = 13) was the most frequently studied positive response, followed by positive affect (n = 3), self-esteem (n = 3), vigor (n = 3), health-related quality of life (n = 2), life satisfaction (n = 2), resilience (n = 2), happiness (n = 1), mental health performance (n = 1), mental health importance (n = 1), optimism (n = 1), positive energy (n = 1), positive outlook (n = 1), prosocial behavior (n = 1), perceived health (n = 1), and relaxation (n = 1). The details of each included observational and experimental study are summarized in, Tables 1, 2, respectively.

Table 1 Summary table for the characteristics and methodological quality of observational studies included in the review
Table 2 Summary table for the characteristics and methodological quality of experimental studies included in the review

Quality assessment

Of the 58 included studies, 53 (91%) were rated as medium in the quality assessment, three were rated as high, and two were rated as low. For the methodological quality ratings (see Tables 1, 2), the inter-rater reliability between the two reviewers was good (intraclass correlation coefficient = 0.77).

Data syntheses

Changes in physical activity and mental health of children and youth during COVID-19

In the present review, 11 included studies reported a significant decrease in PA among children and youth during COVID-19. For example, one study revealed that only 3.6% of children (5–11 years) and 2.6% of youth (12–17 years) in Canada participated in moderate to vigorous PA for at least 60 min per day during COVID-19 [114]. Another study showed that Spanish children spent 91 min per day less on PA during COVID-19 than before [25]. Such PA reductions were reflected in various components, including duration, frequency, and intensity [57, 68, 78, 80, 83, 84, 115,116,117].

Regarding the impact of COVID-19 on MH, 25 included studies showed that the MH of children and youth deteriorated during the pandemic. For example, one study found that COVID-19 caused significantly elevated levels of anxiety in youth [118], while another showed an increase in depressive symptoms in the 6–12- and 13–18-year-old groups [91]. Additional studies reported a significant increase in psychological distress [119] and an indirect increase in stress levels [107] among youth. As mentioned earlier, COVID-19 not only impacted common negative responses such as depression, anxiety, and stress among children and youth, but also led to increased occurrences of less common mental problems. One study, for instance, showed that during COVID-19, children and youth became more attached, inattentive, and irritable, and preschoolers were more likely to manifest signs of clinginess and fear [120]. Positive responses were also affected, including significant decreases in levels of well-being and overall mental health among children and youth [68, 78, 80, 121]. It is worth noting that children and youth with DCC exhibited a higher incidence of severe psychosocial dysfunction and a lower level of PA compared to their peers without DCC [56, 57, 76, 88, 108].

The relationship between physical activity and mental health among children and youth during COVID-19

Overall findings

As shown in Table 3, PA was strongly and negatively correlated with depression (21 of 23 studies, 91.3%), anxiety (14 of 21 studies, 66.7%), stress (7 of 11 studies, 63.6%), insomnia (4 of 6 studies, 66.7%), fatigue (3 of 3 studies, 100%), and mental health problems (3 of 3 studies, 100%). Although PA was negatively related to COVID-19 stress, negative affect, anger, confusion, distress, emotional and behavioral problems, COVID-19 fear, exercise dependence, feelings of loss of control, irritability, negative arousal, pessimism, psychosocial and behavioral problems, and sadness, and positively related to hyperactive/impulsive and perceived vulnerability, the evidence was considered insufficient since less than three studies consistently supported a specific association. PA’s relationship with each of the remaining negative response outcomes was either inconsistent or no association. Regarding positive psychological responses, PA showed strong and positive associations with general well-being (11 of 13 studies, 84.6%) and vigor (3 of 3 studies, 100%). Although PA was positively related to positive affect, self-esteem, health-related quality of life, life satisfaction, resilience, happiness, mental health performance, mental health importance, optimism, positive energy, positive outlook, prosocial behavior, perceived health, and relaxation, the evidence was considered insufficient.

Table 3 Summary of the relationships between PA and MH outcomes
Findings across age ranges

The present systematic review showed PA was strongly and negatively correlated with depression and strongly and positively correlated with general well-being in all age groups. However, different correlations between PA and negative responses occurred in different age groups. In the 6–12-year-old group, PA was not associated with stress, but was strongly and negatively correlated with anxiety and mental health problems. In the 13–18-year-old group, PA was strongly and negatively correlated with anxiety, stress, insomnia, and fatigue. In the 19–24-year-old group, findings on PA’s associations with anxiety, stress, and insomnia were inconsistent. Furthermore, PA was negatively correlated with fatigue though evidence was insufficient in the 13–18- and 19–24-year-old groups. Among the positive responses, PA was strongly and positively correlated with vigor in the 13–18-year-old group, and positively correlated in the 6–12- and 19–24-year-old groups though the evidence was insufficient. As with negative responses, we found that the 13–18-year-old group demonstrated the greatest variety of positive responses, indicating that researchers were most concerned about the responses of this age group during the COVID-19 pandemic.

Findings among children and youth with disabilities or chronic conditions (DCC)

Only eight negative psychological responses and four positive ones were investigated regarding the association of PA and MH among children and youth with DCC. The synthesized results showed that PA was strongly and negatively correlated with depression and anxiety in this population group (see Table 3). For example, one included study showed that children with ADHD who engaged in less than one hour of exercise per day were more likely to exhibit increased depressive symptoms during COVID-19 [94]. However, the associations of PA with all positive psychological responses were considered insufficient.

Meta-analytic results

The relationship between physical activity and negative psychological responses

The meta-analysis results showed PA was weakly and negatively correlated with negative responses (Fisher’s z = − 0.170, 95% CI [− 0.22, − 0.12], p < 0.001, I2 = 92.42%) (see Fig. 2). Because Egger’s test for publication bias was significant (t = -2.50, p < 0.05), the trim-and-fill method was performed. After eliminating publication bias, there was a potential moderate and negative relationship between PA and negative responses (Fisher’s z = − 0.198, 95% CI [− 0.25, − 0.15], p < 0.001). Specifically, PA showed significant and negative associations with anxiety (Fisher’s z = − 0.180, 95% CI [− 0.27, − 0.09]; p < 0.001), depression (Fisher’s z = − 0.160, 95% CI [− 0.23, − 0.09], p < 0.001), and stress (Fisher’s z = − 0.170, 95% CI [− 0.24, − 0.10], p < 0.001) (see Fig. 2).

Fig. 2
figure 2

Forest plots of the relationship between physical activity and negative psychological responses during COVID-19

Furthermore, the meta-regression analyses results showed that age (p = 0.001), report of total (p = 0.001) and specific dimensions (duration and intensity) of PA (p = 0.001), and study quality (p = 0.001) were the primary origins of heterogeneity in the included studies for the relationship between PA and negative responses (see Table 4).

Table 4 Meta-regression analyses on influencing factors for the heterogeneity of included studies

The relationship between physical activity and positive psychological responses

As illustrated in Fig. 3, PA was weakly and positively correlated with positive responses (Fisher’s z = 0.170, 95% CI [0.08, 0.25], p < 0.001, I2 = 92.42%; see Fig. 3). No significant publication bias was detected (p = 0.465 ). Meta-regression results showed that age (p = 0.002), report of total PA (p = 0.028), and study quality (p = 0.008) were the primary origins of heterogeneity in the included studies for the relationship between PA and positive responses (see Table 4).

Fig. 3
figure 3

Forest plots of the relationship between physical activity and positive psychological responses during COVID-19

Discussion

This systematic review and meta-analysis examined the relationship between PA and MH among children and youth during the COVID-19 pandemic. The impacts of age group and other factors on this relationship are further discussed.

An overview of physical activity and mental health among children and youth during COVID-19

During COVID-19, children and youth worldwide demonstrated a decreased level in PA participation that was significantly larger than before COVID-19 (36–61% of US children and youth showed declining PA levels, 50% in Ireland, 39% in Poland, 39% in Finland, 38% in The Czech Republic, and 33% in China) [39, 68, 84, 92, 109, 122,123,124,125,126]. Additional studies showed that only 3.6% of children (5–11 years) and 2.6% of youth (12–17 years) in Canada participated in at least 60 min of moderate to vigorous PA per day, with 49–64% of children and youth spending less PA time [80, 83, 94, 114, 127]. Another study showed that 18.9% fewer youth in Hungary engaged in moderate-intensity PA [78]. People with disabilities were also at risk for low levels of PA due to a lack of opportunities, physical education, and interventions [128]. We found studies in some countries that reported the same or even increased levels of PA among children and youth during COVID-19 compared to pre COVID-19 [102, 129,130,131]. However, in view of the prevalence of physical inactivity among children and youth before COVID-19 [132], PA deterioration during the pandemic and its potential health consequences deserve more attention [13].

Although COVID-19 has a significant negative impact on MH among children and youth globally, the extent of this impact varies slightly by region. For example, evidence has shown that 74% of American parents reported that their child’s MH deteriorated during the COVID-19 restriction period, and that the MH-related emergency department visiting rate of American children and youth increased by 7% [68, 133]. Similarly, it has been reported that youth with poor MH in China increased by 12% [89, 109], while in Canada, 31–44% of children felt their overall MH had declined [80, 83]. This MH deterioration among children and youth may have resulted from alienation due to physical isolation, exposure to negative news, fear that they or their loved ones would be infected by the virus, and even fear of death [30, 124].

The relationship between physical activity and negative psychological responses

The present systematic review found that depression, anxiety, stress, insomnia, fatigue, and mental health problems were the most common negative response variables investigated during COVID-19, and all of them were significantly and negatively correlated with PA. Specifically, increases in depression and anxiety were closely related to the lack of PA, which is consistent with findings before COVID-19 [16, 17, 33]. Our meta-analysis results showed a weak and negative relationship between PA and negative psychological responses during COVID-19 (Fisher’s z = − 0.198), a finding in line with a meta-analysis published in 2019 that reported the beneficial effects of PA on negative psychological responses among children and youth [16]. Before COVID-19, the lack of PA among children and youth might have been compensated to some extent by daily commuting and school activities [33, 134]. However, COVID-19-related lockdowns and school closures resulted in at-home/indoor physical inactivity, fear, and loneliness among children and youth that ultimately amplified the impact of PA on MH [28], which in turn may have led to a further loss of interest in PA [33, 135, 136].

Additionally, our results of variable categorization showed that the correlations of PA with anxiety, stress, and insomnia among children and youth during COVID-19 was mixed in the 19–24-year-old group rather than other age groups. One reason for this may be due to that the 19–24-year-olds may have better emotional self-management ability (e.g., managing stress) with a relatively matured cognitive control system being developed in this older group than minors [137, 138]. Another possible reason is that the influencing factors on MH in the 19–24-year-old group could be more complex than that in other age ranges, thus the impact of PA on the above negative psychological responses might be diminished [139]. These findings indicate that COVID-19 not only deepened the link between PA and negative psychological responses, but also led to diverse negative response symptoms among children and youth.

COVID-19 engendered a plethora of uncommon negative psychological responses, including COVID-19 stress, fatigue, mental health problems, and negative affect [58, 61, 68, 76, 90, 105]. COVID-19 stress, as a new negative psychological response that emerged during the pandemic [65], may have been for most youth an adaptive response that prompted them to take precautions (e.g., frequent hand washing and social distancing) to protect themselves from the virus. However, some youth almost certainly experienced excessive COVID-19 stress [65], which, if not taken seriously, may have developed into post-traumatic stress when the outbreak subsided [140]. This problem, unlike depression and anxiety, can be easily overlooked and cause serious consequences to children and youth [61, 76, 89, 105, 141]. Thus, future research should pay more attention to the causes of negative psychological responses and explore further how to prevent and treat them.

Another finding from our review is that, during COVID-19, PA was significantly and negatively associated with depression, regardless of age group. We argue that school closures and physical alienation increased loneliness and decreased PA for all children and youth, which resulted in the further prevalence of depression and indirectly strengthened the connection between PA and MH [61, 112]. In addition, we found that the relationships between PA and anxiety, stress, and insomnia varied across age ranges but were strongest among 13–18-year-olds. One possible explanation is that this age group followed news about COVID-19 via social media more frequently than the other groups, which may have negatively impacted their MH [136]. Furthermore, students between 13 and 18 years of age are generally under considerable academic pressure, which may contribute further to anxiety, stress, and insomnia [61, 89, 95, 109, 142]. We also found that 13–18-year-olds presented the greatest variety of negative psychological responses during the COVID-19 pandemic. One possible reason for this was the disruption of routine due to the long-term absence of a structured school framework, which is a more important adaptation mechanism for this age group than for others [143]. The above results suggest that 13–18-year-olds were most at risk for MH problems during COVID-19. Thus, researchers need to continuously monitor members of this group since the negative psychological responses triggered by COVID-19 (e.g., COVID-19 stress and COVID-19 fear) may become long-term post-traumatic stress responses persistently affecting their MH in the post-pandemic period [144].

After experiencing COVID-19, children and youth should be informed about the prevalence of negative psychological responses during adversity, how to minimize their impact, and how to quickly achieve “closure” and move on from them [40]. Thus, instead of focusing primarily on negative psychological responses, children and youth should learn to view the loss and pain of a difficult situation with detached values.

The relationship between physical activity and positive psychological responses

Our results demonstrated a weak and positive correlation between PA and positive psychological responses during COVID-19 (Fisher’s z = 0.17). Results of variable categorization also showed that PA was positively correlated with positive psychological responses. Specifically, PA was significantly and positively correlated with general well-being and vigor. For youth, sufficient PA can relieve psychological stress, increase mental stability, and further enhance general well-being and vigor [56]. For children, general well-being is more likely to be influenced by factors other than PA, including family cohesion, social connections with peers, and parents’ emotional states [143]. Given that fewer studies have addressed positive, compared to negative, psychological responses during COVID-19, it seems that the important relationship between PA and positive psychological responses has been to some extent overlooked. The few studies that examined this issue have demonstrated that positive psychological responses are not only closely related to PA, but also help to alleviate negative emotions [101, 145]. Thus, future research should investigate further the relationships between PA and positive psychological responses among children and youth.

The relationship between physical activity and mental health among children and youth with disabilities or chronic conditions (DCC)

In this review, eight negative and four positive psychological response outcomes were investigated for children and youth with DCC. Of the 12 outcomes, a significant relationship between PA and MH was evident in only two negative psychological responses (i.e., anxiety and depression). The rest lacked sufficient evidence to determine a relationship with PA. Research has suggested that, due to their failure during COVID-19 to receive adequate health screenings and usual interventions, children and youth with DCC experienced more negative MH effects than their peers without DCC [56, 146]. Moreover, individuals with DCC appear to have more barriers that impede their accrual of sufficient PA, which might cause deterioration of their MH status [57, 76, 88, 147]. Although previous studies have indicated that the pandemic’s negative impacts on PA and MH were more acute for children and youth with DCC than those without [88, 108], the exact relationship between PA and MH among children and youth with DCC cannot be determined due to the limited number of studies focusing on this population group in our systematic review. Therefore, in the post COVID-19 era, researchers should focus more on PA and MH issues of children and youth with DCC.

Limitations and recommendations for future research

Because most of the studies included in this review focused on typically developing children and youth, a meta-analysis on the relationship between PA and MH among children and youth with DCC could not be performed. Therefore, more studies on individuals with DCC should be conducted when investigating the relationship between PA and MH among children and youth. Furthermore, this review shows that studies did not include details of PA could have an impact on the relationship between PA and MH. Future studies should report PA details, such as total amount, intensity, frequency, and duration, as much as possible to avoid a one-dimensional assessment of PA.

We only included studies published up until December 2021, it was thus impossible to include research that have been conducted in the first two years of the outbreak but not yet published by the end of 2021. It is worth noting that this review included few studies with high methodological quality, which may impact the robustness of our findings. Furthermore, as only a limited number of experimental studies were eligible for this review, our meta-analyses on the magnitude of the relationship between PA and MH outcomes were based primarily on findings reported in observational studies. Thus, a cause-effect relationship between PA and MH could not be determined. Future research investigating the relationship between PA and MH among children and youth should include more studies with an experimental design and high methodological quality.

Finally, our review found that age impacted the relationship between MH and PA during COVID-19. Specifically, the most common MH problems are not the same at different ages. Future research should focus on age-specific MH problems (e.g., anxiety, stress, insomnia, and fatigue) and not limit their studies to the most common negative psychological responses, such as depression. It is important to note that limited research was conducted on preschoolers during COVID-19, which should also be addressed in future studies. Additionally, most of the existing studies were conducted in China, The United States, and Canada. In view of the potential influence of social cultural environment on the relationship between PA and MH in children and youth, future research from more countries worldwide is warranted.

Conclusion

The present systematic review and meta-analysis confirms a close relationship between PA and MH among children and youth in terms of both negative and positive psychological responses during the COVID-19 pandemic. Negative psychological responses received disproportionately more attention than positive ones. Specifically, PA’s associations with anxiety, depression, stress, insomnia, fatigue, mental health problems, general well-being, and vigor appeared to be stronger than its associations with other MH outcomes among children and youth. In addition, the pattern and strength of relations between PA and MH outcomes varied across age ranges and health conditions, with preschoolers and those with DCC receiving less attention in the existing research. Given the importance of PA and MH for healthy development of children and youth, future research with high methodological quality should focus on (1) age-range specific relationships between PA and MH outcomes from a comprehensive perspective during the post COVID-19 era, and (2) children and youth with DCC.