Introduction

Despite worldwide vaccination efforts against the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), the ongoing coronavirus disease 2019 (COVID-19) pandemic continues to be a global public health problem [1]. Since the pandemic began, several acute and chronic neurologic and psychiatric manifestations of COVID-19 have been described [2,3,4]. As the long-term consequences of SARS-CoV-2 infection are better understood, it has become evident that a high proportion of survivors may develop multiple long-term neuropsychiatric sequelae that may co-exist and reduce the quality of life for these patients, including cognitive dysfunction, depression, anxiety, and post-traumatic stress disorder (PTSD) [5,6,7].

Cognitive dysfunction is the most frequently reported long-term mental health disorder affecting up to half of the patients with severe COVID-19 6 months after hospital discharge [8]; despite this high frequency and the growing evidence of persistent cognitive deficits after SARS-CoV-2 infection, the pathogenetic mechanisms involved in their development are yet to be elucidated. Multiple studies addressing the underlying factors associated with the development of this sequela have shown conflicting results that may be partially related to different clinical and statistical approaches trying to understand the role of COVID-19-associated inflammatory response biomarkers or illness-related variables such as hypoxemia, invasive mechanical ventilation (IMV) requirements, delirium, and psychiatric sequelae as independent phenomena and not part of the same cascade of events [9,10,11,12,13]. In this regard, the building of multivariate causal models may contribute to clarify the pathogenic drivers of cognitive impairments after COVID-19.

Therefore, the present study aimed to develop a causal model by path analysis exposing the links between age, comorbidities, COVID-19-associated thrombo-inflammatory response biomarkers, hypoxemia severity at hospital admission, IMV requirements, in-hospital delirium, and their effect on cognitive performance among severe COVID-19 pneumonia survivors 6 months after hospital discharge.

Materials and methods

Study design, setting, and patient selection

For this prospective study, we recruited adult patients (age ≥ 18 years) who were hospitalized for severe COVID-19 pneumonia according to the World Health Organization case definition [14], between March 15 and June 15, 2020, at the Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, a tertiary-care hospital in Mexico City converted into a COVID-19 referral center. Following conversion, COVID-19-specific diagnostic and care protocols were implemented. Due to the potential risk of viral aerosolization, all patients under IMV were admitted to the intensive care unit (ICU) or ICU-adapted areas, while non-IMV patients were treated in general medical wards; when ICU beds or adapted areas were not available, patients requiring IMV were transferred to other hospitals [15]. The study protocol was revised and approved by our Institutional Research and Ethics Committees (reference: NER-3497–20-21–1). Signed informed consent was obtained from all participants and the procedures about human research were according to the Helsinki declaration.

As part of an ongoing study on long-term neuropsychiatric complications of COVID-19, cognitive and psychiatric assessments were performed at a follow-up visit scheduled 6 months after hospital discharge by a neuropsychologist, a psychiatrist, and a neurologist. We included patients with confirmed SARS-CoV-2 infection by real-time reverse transcription-polymerase chain reaction in respiratory specimens from nasopharyngeal swabs and COVID-19-compatible findings on chest computed tomography scan. None of the included patients received corticosteroids because they were treated before the preliminary report of their benefits in severe COVID-19 [16]. For this analysis, we excluded patients with a history of pre-existing cognitive impairment or other diseases associated with cognitive decline, as well as those who developed an in-hospital neurologic complication potentially associated with long-term cognitive impairment.

Cognitive and psychiatric assessment

Cognitive and psychiatric evaluations were performed using Spanish versions of screening instruments validated for the Mexican population. Cognitive assessment was performed using the Montreal Cognitive Assessment (MoCA), a screening instrument evaluating the following domains: visuospatial/executive (5 points), naming (3 points), memory (5 points), attention (6 points), language (3 points), abstraction (2 points), and orientation (6 points), whit a total score of 30 points. After correcting for years of education by adding an extra point to the total score for those cases with ≤ 12 years of schooling, a score of ≤ 26 points was considered a positive screening [17]. In addition, the Short Form of the Informant Questionnaire on Cognitive Decline in the Elderly (S-IQCODE), a 16-item screening tool for pre-existing cognitive impairment, was used in patients aged ≥ 50 years; an adjusted score ≥ 3.4 was considered positive [18].

Anxiety and depression assessments were performed using the Hospital Anxiety and Depression Scale (HADS), a 14-item scale with two subscales (seven items each), one for depression (HADS-D) and another for anxiety (HADS-A), scored on a Likert scale ranging from 0 (lowest) to 3 (highest). A cutoff of ≥ 11 points for each disorder subscale was considered a positive screening [19, 20]. PTSD was assessed with the PTSD Checklist (PCL-5), a 20-item screening scale, scored on a Likert scale ranging from 0 (lowest) to 4 (highest); a cutoff of > 33 points subscale was considered positive [21].

Data collection and definitions

Retrospective data from hospitalization were extracted from electronic medical records using a standardized case report format and entered into a secure online database, including demographics (age and sex), number of comorbidities (diabetes, hypertension, cardiovascular disease, pulmonary disease, current smoking, and obesity), and self-reported COVID-19-associated symptoms; the following blood workup results at admission: white blood cell count, arterial blood gas (ABG) analysis, lactate dehydrogenase, thrombotic, and inflammatory response biomarkers (D-dimer, fibrinogen, ferritin, and C-reactive protein); all results were interpreted according to our local laboratory reference limits. Treatment with vasopressors, opioids, benzodiazepines, or propofol as ever used; IMV requirements, its duration in days, and hospital length of stay; the development of delirium during hospitalization as either present or absent, delirium diagnostic protocols during the pandemic for each hospital setting in our center have been published elsewhere [15].

For each patient, we calculated the Charlson Comorbidity Index [22]. The partial pressure of arterial oxygen/fraction of inspired oxygen (PaO2/FiO2) ratio on admission (before IMV) was calculated using the estimated FiO2 provided by each oxygen delivery device (nasal cannula or simple facemask) when the first blood samples for ABG analysis were obtained. Hypoxemia was defined as a PaO2/FiO2 ratio ≤ 300 mmHg, and its severity was classified as mild (PaO2/FiO2 between 300 and 201 mmHg), moderate (PaO2/FiO2 between 200 and 101 mmHg), and severe (PaO2/FiO2 ratio ≤ 100 mmHg) according to the Berlin definition cutoff values for acute respiratory distress syndrome (ARDS) [23]. On admission, the mortality risk was estimated with the Sequential Organ Failure Assessment (SOFA) score [24]. At least two researchers reviewed all data, and a third researcher adjudicated any difference in interpretation between the primary reviewers.

Statistical analysis

Categorical variables are reported as frequencies with proportions, and continuous variables as either median with interquartile range (IQR) or mean with standard deviation (SD). We compared the demographic characteristics, comorbidities, COVID-19-associated symptoms, and in-hospital events between patients with a positive screening for cognitive impairment and those with a negative assessment. After testing normality with the Kolmogorov–Smirnov test, continuous variables were analyzed with the Mann–Whitney U test or Student’s t test, and as for categorical variables with the χ2 or Fisher’s exact tests; the effect size for each test was evaluated by Cohen’s d, r scores, or Cramer’s V, as appropriate.

One-sample t tests were conducted to examine if mean measurements from laboratory findings were higher than their upper reference limit and if mean MoCA scores were lower than their clinical cutoff value. Bootstrapping with 10,000 samples was performed to avoid the assumption of normality. The effect size was evaluated by Cohen’s d. Bivariate correlation analyses were conducted with Pearson or Spearman’s rho correlation coefficient, as appropriate. MoCA total scores were compared among patients with and without hypoxemia using the Mann–Whitney U test. Comparisons between hypoxemia severity regarding total MoCA scores were conducted with the Kruskal–Wallis test, and the effect size was calculated with ε2. These analyses were performed using IBM SPSS Statistics, version 25 (IBM Corp., Armonk, NY, USA). Multiple comparison/correlation problem was addressed by False Discovery Rate corrections (q value = 0.05).

To explore the dimension underlying the laboratory findings, we conducted a principal component analysis (PCA) using MATLAB, version 2018b (MathWorks Inc., Natick, MA, USA). Before PCA, C-reactive protein, D-dimer, LDH, ferritin, fibrinogen, neutrophils, and lymphocytes measurements were converted to z-scores based on the SD of the cohort. Considering the pathophysiology of COVID-19, lymphocytes measures were inverted, ensuring that lower values would indicate worse disease severity [25]. Components with eigenvalues ≥ 1 were extracted and varimax-rotated, yielding orthogonal results. Scores from each principal component were identified. Each component was interpreted according to the current understanding of the COVID-19 pathophysiology [26, 27].

To simultaneously test the relationship between the components obtained from the PCA, age, number of comorbidities, PaO2/FiO2 ratio, IMV duration, in-hospital delirium, and cognitive performance, a causation structure between these variables was modeled by path analysis using IBM SPSS Amos 24 (IBM Corp., Armonk, NY, USA). Before model testing, Mardia’s test was conducted to assess multivariate normality according to its skewness and kurtosis [28]. Due to multivariate normality violations, bootstrapping was performed (10,000 samples) to correct the P values from the χ2 tests and estimate direct and indirect effects (95% confidence intervals) using the maximum likelihood method [29]. Each observed variable had a measurement error associated.

Based on the first hypothetical model results, non-significant paths with low standardized beta values were removed to promote parsimony and a good model fit evaluated with the following indexes: comparative fit index (CFI), root mean square error approximation (RMSEA), goodness of fit index (GFI), parsimony-adjusted normed fit index (PNFI), χ2 (degrees of freedom [df]), and χ2/df ratio. After identifying significant direct effects from the final model, we calculated indirect effects. For all analyses, P values were two-tailed and considered significant when < 0.05.

Results

Demographic and clinical data

In total, 113 patients completed the cognitive assessment; of them, we excluded 21 (18.6%) from the final analysis: 18 (85.7%) due to a history of the following diseases associated with cognitive decline: hypothyroidism in six; epilepsy in three (one of them an HIV-positive patient); major depressive disorder in three; and chemotherapy-treated oncologic disease, ischemic stroke, and pre-existing cognitive impairment two cases each; three (14.3%) due to the following in-hospital events: acute ischemic stroke in two and new-onset seizures requiring long-term antiepileptic drugs in one. None of the included patients tested positive for pre-existing cognitive impairment.

We studied 92 patients, 54 (58.7%) males and 38 (41.3%) females, with a median age of 50 (IQR 42–55) years; 50 (54.4%) tested positive for cognitive impairment; patient’s baseline characteristics are reported in Table 1. One-sample t tests comparing the laboratory findings with their respective upper reference limits showed that our patients had higher values in all tests except for neutrophil counts (Fig. 1A); lymphocyte values were lower than their inferior reference limit. The overall mean MoCA total scores were lower than its clinical cutoff value (≤ 26 points). There were no differences in age, comorbidities, self-reported symptoms, or in-hospital events between patients who tested positive for cognitive impairment and those who did not (Table 1). Patients with a positive MoCA exhibited significantly lower PaO2/FiO2 ratio values (U = 794.5, P = 0.045, r =  − 0.21). The differences between MoCA total and subset scores between groups are reported in Table 2.

Table 1 Patients’ baseline characteristics, self-reported COVID-19-associated symptoms, and events during hospitalization
Fig. 1
figure 1

Laboratory findings in COVID-19 and correlations between illness-related variables. A Comparative analyses between the measurements derived from the blood workups and their upper reference limit (lower reference limit for lymphocytes); and comparison between MoCA total score and its clinical cutoff point. The shadows in the violin plots show the probability density of the data smoothed by a kernel density estimator. The boxes inside the shadows represent the first and third quartile and the median in a dotted line. The superior and inferior notch in the boxes point out the lower and upper bounds of the 95% confidence interval for the median. The black points represent the mean, and the bars show the standard error of the mean. *P < 0.05 after False Discovery Rate corrections (q = 0.05). B Loadings of the components derived from the PCA. *Biomarkers loading highest (> 0.4) at each component. C Correlations among study variables included in the path analysis. *P < 0.05, **P < 0.01. MoCA, Montreal Cognitive Assessment; Pb, P bootstrapped (10,000 iterations); TIC, thrombo-inflammatory component; IMV, invasive mechanical ventilation; PaO2/FiO2, partial pressure of arterial oxygen/fraction of inspired oxygen ratio; TV, test value; RV, reference value

Table 2 Montreal Cognitive Assessment total and subset scores according to positivity

Due to protocol amendments, only 67 (72.7%) patients completed the psychiatric evaluations; three (4.5%) tested positive for depression, 15 (22.4%) for anxiety, and 12 (17.9%) for both, and 13 (19.4%) for PTSD, of whom only one patient had a negative HADS screening. There were no statistical differences between patients with cognitive impairment (n = 38, 56.7%) and those without (n = 29, 43.3%) in both positivity (Table S1) or scores (Table S2) for each of these psychiatric assessments. In this regard, neither HADS (anxiety: ρ =  − 0.02, P = 0.863; depression: ρ =  − 0.11, P = 0.386; total score: ρ =  − 0.08, P = 0.514) nor PCL5 (ρ =  − 0.18, P = 0.149) scores showed significant correlations with total MoCA scores.

Principal component analysis of COVID-19-associated biomarkers

PCA exploring the studied biomarkers produced a three orthogonal component solution accounting for 70.13% of the variance (Kaiser–Meyer–Olkin index = 0.74). Measurements loading highest on each component (> 0.4) are shown in Fig. 1B. The first component (accounting for 39.73% of the variance) was interpreted as “thrombo-inflammatory” which included the following biomarkers: C-reactive protein, fibrinogen, and neutrophils. The second component (accounting for 16.43% of the variance) loaded heavily to D-dimer and LDH; hence, this component was interpreted as “thrombosis-related tissue damage.” Finally, the third component (accounting for 13.97% of the variance) was interpreted as “acute inflammation,” as lymphocytes and ferritin loaded heavily.

Causal model for cognitive impairment

Correlations between the “thrombo-inflammatory” component, hypoxemia severity, days of IMV, in-hospital delirium, age, and cognitive impairment are shown in Fig. 1C. Neither scores from the “thrombosis-related tissue damage” component nor the “acute inflammation” component showed significant correlations with the variables included in the path model (Table S3). Regarding path analysis, the hypothesized model provided an adequate fit for the data, but a poor parsimony (Fig. 2A). The model fit statistics were as follows: CFI = 0.99, TLI = 0.98, RMSEA = 0.03, GFI = 0.97, and PNFI = 0.33. χ2 (df) value was 14.20(13) with P = 0.360 and Pbootstrapped = 0.609. χ2/df ratio was 1.09. To improve parsimony, non-significant paths with low standardized β values were removed. These adjustments resulted in a final model well fitted to our data (Fig. 2B) according to the following fit indexes: CFI = 0.98, TLI = 0.97, RMSEA = 0.05, GFI = 0.95, and PNFI = 0.64. χ2 (df) value was 17.99(15) with P = 0.263 and Pbootstrapped = 0.202. χ2/df ratio was 1.20.

Fig. 2
figure 2

Results of the path analysis on factors associated with cognitive performance. A Hypothetical path model. B Final path model presents significant causal relationships. The arrows represent an association between variables. Standardized coefficients (β), 95% confidence intervals, and associated P values are shown above each arrow. The boxes represent observable variables and their measurement errors (e). Boxes with thick solid lines are variables acquired at hospital admission, boxes with thin solid lines are in-hospital events, and the box with dotted lines represents cognitive performance measured by the Montreal Cognitive Assessment 6 months after hospital discharge. The proportion of variance explained (R2), 95% confidence intervals, and associated P values are shown above and below endogenous variables. The number of comorbidities included diabetes, hypertension, cardiovascular disease, pulmonary disease, smoking, and obesity. PaO2/FiO2, partial pressure of arterial oxygen/fraction of inspired oxygen ratio; IMV, invasive mechanical ventilation

Cognitive performance

A positive direct relationship was found between PaO2/FiO2 values at hospital admission and cognitive performance (β = 0.24, P = 0.010). Also, age negatively impacted cognitive performance (β =  − 0.18, P = 0.052). Neither days of IMV (β = 0.16, P = 0.403) nor in-hospital delirium (β = 0.03, P = 0.954) had direct effects on MoCA’s total score. On our hypothesized model, we did not find that MoCA’s total score had a direct impact from the thrombo-inflammatory component (β = 0.03, P = 0.720), the thrombosis-related tissue damage component (β = 0.03, P = 0.655), the acute inflammation component (β =  − 0.08, P = 0.726), or the number of comorbidities (β = 0.01, P = 0.850). Nevertheless, the final model showed a significant indirect effect from the thrombo-inflammatory component to cognitive performance through PaO2/FiO2 values (β =  − 0.10, P = 0.007) (Table 3).

Table 3 Indirect effects for the final path model

In-hospital delirium

The number of days under IMV was positively and significantly associated with in-hospital delirium (β = 0.84, P = 0.001), indicating that patients with longer days under IMV were more likely to develop delirium during their hospital stay. We found no direct relationship between PaO2/FiO2 values and delirium (β =  − 0.01, P = 0.740). Table 3 shows a significant indirect effect from the thrombo-inflammatory component to in-hospital delirium through PaO2/FiO2 values and IMV duration (β = 0.10, P = 0.005). IMV duration was also a significant mediator variable between PaO2/FiO2 values and in-hospital delirium (β =  − 0.24, P = 0.007).

Invasive mechanical ventilation duration and PaO2/FiO2 ratio

PaO2/FiO2 values were directly and negatively related with the number of days on IMV (β =  − 0.29, P = 0.007). Also, we found a significant indirect effect from the thrombo-inflammatory component to days of IMV through PaO2/FiO2 values (β = 0.12, P = 0.005; Table 3). The thrombo-inflammatory component was negatively associated with PaO2/FiO2 values (β =  − 0.43, P < 0.001). Levels of C-reactive protein, neutrophils, and fibrinogen, which loaded heavily on this component, showed significant negative correlations with PaO2/FiO2 values (Fig. 3A, B, and C). In contrast, neither the thrombosis-related tissue damage component (β =  − 0.15, P = 0.196) nor the acute inflammation component (β = 0.05, P = 0.490) was directly associated with PaO2/FiO2 values.

Fig. 3
figure 3

Association of hypoxemia and cognitive impairment. Significant relationships between PaO2/FiO2 ratio values and A C-reactive protein, B neutrophils, and C fibrinogen. D Semipartial correlation between PaO2/FiO2 values at hospital admission and MoCA’s total scores adjusted by age. The Y-axis is expressed as residuals. E Comparative analysis of MoCA performance between patients with (PaO2/FiO2 ≤ 300 mmHg) and without (PaO2/FiO2 > 300 mmHg) hypoxemia at hospital admission. The solid point inside the boxes represents the mean, whiskers the minimum and maximum values. F Proportion of positive (MoCA +) and negative (MoCA −) screening for cognitive impairment (clinical cutoff point ≤ 26) among patients with and without hypoxemia at hospital admission. PaO2/FiO2, partial pressure of arterial oxygen/fraction of inspired oxygen ratio; MoCA, Montreal Cognitive Assessment

COVID-19 biomarkers

Age was positively and significantly associated with the number of comorbidities (β = 0.28, P = 0.007); however, neither of these two variables showed significant relationships with PaO2/FiO2 values (age: β =  − 0.04, P = 0.808; comorbidities: β = 0.11, P = 0.320), the thrombo-inflammatory component (age: β =  − 0.01, P = 0.872; comorbidities: β =  − 0.01, P = 0.942), the thrombosis-related tissue damage component (age: β =  − 0.18, P = 0.101; comorbidities: β =  − 0.03, P = 0.799), or the acute inflammation component (age: β = 0.13, P = 0.211; comorbidities: β =  − 0.27, P = 0.050).

Causal model for MoCA subtests

The causal model exposed in Fig. 2B (including MoCA total score as endogenous/outcome variable) also explained the performance in the visuospatial/executive and verbal memory subsets (Supplementary Fig. 1); for both, the model had an acceptable fitting with the data. The model including scores from MoCA’s visuospatial/executive subtest as endogenous/outcome variable yielded the following fit indexes: CFI = 0.99; TLI = 0.99; RMSEA = 0.02; GFI = 0.96; PNFI = 0.65; χ2 (df) = 15.56 (15), P = 0.412, Pbootstrapped = 0.350; χ2/df = 1.04; while fit indexes for the model which included MoCA’s verbal memory scores were as follows: CFI = 0.99; TLI = 0.99; RMSEA = 0.03; GFI = 0.95; PNFI = 0.64; χ2 (df) = 16.38 (15), P = 0.357, Pbootstrapped = 0.305; χ2/df = 1.09.

As in the first model, PaO2/FiO2 values were directly and positively related with the performance on visuospatial/executive (β = 0.26, P = 0.017) and verbal memory (β = 0.20, P = 0.020) subsets, with significant indirect effects between thrombo-inflammatory component and performance in both subtests through PaO2/FiO2 values (visuospatial/executive: β =  − 0.11, P = 0.013; verbal memory: β =  − 0.09, P = 0.015).

Hypoxemia and cognitive performance

We regressed out age from MoCA total scores and explored their correlation with PaO2/FiO2 values based on the path analysis results. As predicted by the model, MoCA scores adjusted by age significantly correlated with PaO2/FiO2 values (Fig. 3D). We compared the cognitive performance between patients with (n = 71, 77.2%) and without (n = 21, 22.8%) hypoxemia. There were no differences in sex, age, years of education, and psychiatric features between groups (Table S4). As shown in Fig. 3E, patients with hypoxemia showed significantly lower MoCA scores than patients without hypoxemia. Furthermore, a larger number of patients with hypoxemia (60.6%) had scores below MoCA’s clinical cutoff point compared to those without hypoxemia (33.3%) (Fig. 3F).

MoCA total scores did not show significant differences (H = 0.78, asymptotic P = 0.68, Monte Carlo P = 0.69, ε2 = 0.01) between patients with mild (n = 31, median 25, IQR 22–29 points), moderate (n = 36, median 25.5, IQR 22.5–27.5 points), or severe (n = 4, median 27.5, IQR 22.5–28 points) hypoxemia. Neither demographic nor psychiatric features showed significant differences among these three groups (Table S5). When including the four severely hypoxemic patients in the moderate group and comparing their cognitive performance with the mild group, the same non-significant results were found (U = 588.50, P = 0.714, r =  − 0.04). Regarding the number of patients with a positive MoCA, there was no statistical difference (χ2(1) = 0.75, P = 0.38, Cramer’s V = 0.10) between patients with mild (54.8%) and moderate/severe (65%) hypoxemia.

Discussion

In this interim analysis of an ongoing study on the long-term neuropsychiatric consequences of SARS-CoV-2 infection, the frequency of new-onset cognitive impairment 6 months after hospital discharge for severe COVID-19 pneumonia was 54.4%, similar to that of 52.6% reported by Frontera et al., in a cohort of 382 patients evaluated 6 months after hospital discharge, assessed with a MoCA telephone version [8]. Even though their population was older (median age 68 years, IQR 55–77) than ours and had a higher burden of neurologic and non-neurologic comorbidities that can negatively affect cognition [30,31,32], in our population, neither age nor comorbidities were associated with worse MoCA scores.

Among critically ill non-COVID-19 patients, IMV is independently associated with poor performance on cognitive evaluations 6 months after hospital discharge [33, 34]; also, prolonged IMV exposing them to higher cumulative doses of sedatives or opioids increases the risk of developing in-hospital delirium [35, 36], a well-recognized risk factor for long-term cognitive impairment after hospitalization [37]. Among hospitalized patients with COVID-19, delirium incidence ranges from 11 to 80% depending on the hospital setting (general ward or ICU) [38, 39]; in our population, we previously reported a frequency of 16.3% regardless of the hospital setting [15], a proportion comparable to the 14.1% of patients included in this study who developed this complication. As reported by other authors in COVID-19 [9, 40], we did not find a direct or indirect effect between IMV duration, in-hospital delirium, and cognitive performance.

In our structural approach, the thrombo-inflammatory component (C-reactive protein, fibrinogen, and neutrophils) obtained from the PCA indirectly affected cognition, an effect mediated through PaO2/FiO2 ratio values. Despite this, there was no association between hypoxemia severity according to the Berlin definition cutoff values for ARDS and total MoCA scores on univariate analysis. In the study conducted by Frontera et al., low oxygen levels or the severity of lung injury was not associated with worse cognitive outcomes [8], as seen in non-COVID-19 critically ill patients [33, 41]; these contrasting results may be related to different methodological and statistical approaches. Interestingly, a case series of 29 patients found an association between residual pulmonary dysfunction assessed 3–4 months after hospital discharge and cognitive impairment [42], suggesting that long-lasting cerebral oxygen delivery restriction is involved in developing this sequela.

Although structural neuroimaging studies were not performed in our patients, it is known that profound hypoxemia is associated with brain atrophy [43], especially in highly susceptible structures such as the hippocampus [44]; among COVID-19 patients, microstructural changes in this structure have been detected as early as 3 months after hospital discharge [45]. Therefore, we hypothesize that the vulnerability of this cognition-related structure to hypoxemia may explain the findings of impaired verbal memory, along with executive dysfunction seen in COVID-19 survivors [10, 46,47,48]. In our model, lower PaO2/FiO2 ratio values were also associated with deficits in the MoCA visuospatial/executive and verbal memory subsets, as well as MoCA total scores.

Besides hypoxemia, the hyperinflammatory and hypercoagulable states promoted by SARS-CoV-2 may also play a crucial role in developing cognitive deficits after COVID-19 [47, 49]. In this regard, elevated D-dimer and C-reactive protein levels during hospitalization for severe COVID-19 correlated with worse cognitive performance in other studies [42, 50]. We did not find individual statistical differences between these biomarkers by univariate analysis; however, when clustered by PCA, the thrombo-inflammatory component had an indirect effect on cognitive impairment. In a case–control study, elevated inflammatory response biomarkers (C-reactive protein, procalcitonin, and interleukin 6) among patients with COVID-19 correlated with microstructural changes of white matter tracts, decreased cortical thickness, and widespread cerebral blood flow reductions, especially in the frontal and limbic systems 3 months after discharge [51].

The aforementioned results support our hypothesis in COVID-19 structural brain changes of cognition-related structures, promoted by a combination of multiple factors, such as hypoxemia (acute or chronic), cerebrovascular endothelium dysfunction related to systemic inflammation, or microthrombosis, that may disrupt the blood–brain barrier facilitating the passage to the brain of pro-inflammatory molecules and cytokines which can cause microglial activation, axonopathy, and production of neurotoxic mediators leading to chronic impairment of neurotransmission [52, 53]. In addition, disruption of the blood–brain barrier can also facilitate the entry of SARS-CoV-2 to the brain [52].

Following hospitalization for severe COVID-19, psychiatric sequela associated with poor cognitive performance, including depression, anxiety, and PTSD, develops in up to 30% of patients [7, 54]. Although only 72.7% of our patients were evaluated for the co-existence of psychiatric disorders, we found no correlation between total MoCA scores and depression, anxiety, or PTSD scores, neither statistical differences according to each screening tool positivity. However, the results of studies analyzing the association between these sequelae and cognitive performance have yielded conflicting results [9,10,11,12,13], which may be related to the heterogeneity of screening or diagnostic instruments and different methodological approaches. These findings suggest that the long-term effects of COVID-19 on cognition are not entirely driven by psychiatric variables, probably due to their duration.

This study has limitations. First, the single-center design limits the generalizability of our findings; also, due to our sample size, we were unable to perform invariance analyses exploring our model fitness across sexes. Second, the potential uncontrolled confounding for variables such as delirium duration, in-hospital treatments that might have modified the clinical course of these patients, psychiatric comorbidities, the single measurement of biomarkers, or measurements of additional inflammatory biomarkers, such as in interleukins which were not available in our center. Third, the PaO2/FiO2 ratio in non-IMV patients may not be a reliable method to define hypoxemia. Fourth, our model results may be biased due to the low proportion of subjects who required IMV or developed in-hospital delirium; however, our patients had a lower burden of comorbidities associated with severity when compared to other similar series [8, 13]. Fifth, we were unable to evaluate the potential neurotropism of SARS-CoV-2 to evaluate its role in developing this sequela. Finally, the lack of structural neuroimaging studies to analyze the relationship of brain changes with this COVID-19 sequela, as well as the administration of a single screening test to measure cognitive performance and not a more extensive battery of neuropsychological tests, limits the analysis of independent domains to characterize the specific patterns of cognitive impairment among severe COVID-19 survivors. Despite these limitations, our statistical approach emphasizes the potential multicollinearity between hypoxemia, COVID-19-associated thrombo-inflammatory response biomarkers, and cognitive performance; the synergistic effects of these variables on cognition should be carefully considered by researchers seeking to characterize the pathogenic drivers of this COVID-19 sequela, particularly those studies conducting univariate approaches. Still, reports of poor cognitive performance among non-hospitalized patients with mild to moderate disease without oxygen requirement suggest that other factors besides hypoxemia are involved in developing this long-COVID manifestation [10, 55, 56].

In conclusion, this study highlights the central role of hypoxemia and the thrombo-inflammatory response as drivers for developing cognitive impairment among patients with severe COVID-19 pneumonia. The remarkable high frequency of cognitive impairment 6 months after hospital discharge in this relatively young population with a low burden of comorbidities should emphasize the need for continuous neuropsychological monitoring for these patients, especially evaluating verbal memory and executive functions regardless of hypoxemia severity, in-hospital delirium development, IMV requirements, or results of follow-up psychiatric evaluations, to timely detect patients who may benefit from intense neurocognitive rehabilitation. Future studies, including a complete neuropsychological battery with structural neuroimaging investigations, are necessary to replicate and improve our structural causal model.