Introduction

Stroke is the second most prevalent cause of death and acquired adult disability worldwide (Saini et al. 2021). Despite a decline in mortality rates, the overall burden of stroke is on the rise.

Approximately 80% of all strokes are ischemic, resulting from the blockage of cerebral arteries (Virani et al. 2020). Ischemic stroke treatment has traditionally focused on reperfusion techniques such as intravenous thrombolysis with tissue plasminogen activator (tPA) and endovascular therapy, which are effective but limited by a narrow therapeutic window and risk of hemorrhage (Powers et al. 2018). Other treatment options include stem cell therapy, which promotes neurogenesis and reduces inflammation; natural products, peptides, and other pharmacological agents which decrease excitotoxicity, oxidative stress, apoptosis, and neuroinflammation; and nanomedicine, which targets drug delivery to the brain to improve outcomes (Lyden 2021; Jia et al. 2024). Although these emerging treatment options showed promising results in animal studies, translational challenges, including low study quality and publication bias in preclinical studies, have hindered their clinical application (Macleod et al. 2008; Sena et al. 2010). Consequently, it is advisable to conduct a well-designed systematic review and meta-analysis of preclinical stroke studies before embarking on clinical trials to provide nonbiased summaries of the available evidence (Bath et al. 2009; Dirnagl and Macleod 2009).

Hydrogen sulfide (H2S) has emerged as a candidate agent, and its effectiveness has been evaluated in various experimental animal models of ischemic stroke, yielding conflicting results (Gopalakrishnan et al. 2019; Jia et al. 2019; Ding et al. 2023). H2S functions as an endogenously produced gasotransmitter and serves as a vital signaling molecule at low concentrations, playing a significant role in vasodilation and blood pressure regulation (Kimura 2021). Growing evidence suggests that exogenous H2S donors protect organs, including the brain, from ischemia-reperfusion injury by mitigating inflammation, oxidative stress, mitochondrial damage, and apoptosis (Gopalakrishnan et al. 2019; Jia et al. 2019; Ding et al. 2023; Emre Aydıngöz et al. 2023). H2S therapy offers significant advantages over tPA by reducing hemorrhagic complications for ischemic stroke treatment (Liu et al. 2016; Jia et al. 2019). Combining H2S with tPA can improve safety and efficacy, suggesting a promising approach for enhancing stroke management (Liu et al. 2016).

Despite the extensive exploration of the beneficial effects of H2S against ischemic stroke in individual animal studies, no clinical trials have verified its efficacy in human settings. Furthermore, although many preclinical studies and narrative reviews have shown that H2S may be effective for treating stroke (Dou et al. 2016; Chan and Wong 2017; Gopalakrishnan et al. 2019; Narne et al. 2019; Ding et al. 2023), a systematic review of animal studies and quantification of the available data by meta-analysis have not yet been performed. Given the persistent translational challenges in preclinical stroke studies, a thorough systematic review and meta-analysis of existing evidence on the effectiveness of H2S against ischemic stroke is crucial.

This systematic review and meta-analysis aimed to determine the sufficiency of overall preclinical evidence to guide the initiation of clinical stroke trials with H2S and provide tailored recommendations for their design.

The research question is “How effective is H2S compared with no treatment on structural and functional outcomes in in vivo animal models of ischemic stroke?”

Methods

Study protocol overview

This systematic review and meta-analysis was conducted following a registered PROSPERO protocol (CRD42023380938, date: 10/01/2023). No amendments were made to the study protocol afterward. The guidelines of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) (Moher et al. 2009) and recommendations for the systematic review and meta-analysis of animal studies (Vesterinen et al. 2014; Soliman et al. 2020) were followed. The PRISMA checklist is presented in Supplement 1.

Literature search

The inclusion criterion was an in vivo animal model of regional ischemic stroke treated with any H2S donor. The exclusion criteria included in vitro, ex vivo, and in silico studies; observational studies; experimental studies lacking a control group; reviews; conference presentations; studies on human subjects; investigations involving interventions increasing endogenous H2S levels; and studies lacking crucial information for meta-analysis. There were no language or date restrictions.

The PubMed, Web of Science, Scopus, EMBASE, and MEDLINE databases were searched using the specific criteria detailed in Supplement 2. Animal filters devised by van der Mierden et al. (2022) and de Vries et al. (2014) were applied to the PubMed and EMBASE databases. The Polyglot Search Translator (Clark et al. 2020) was utilized to translate the search strategies across databases. The initial search was performed on February 23, 2023, with an update on August 1, 2023. Duplicates were removed, and the title/abstract was prescreened using the SyRF screening application (http://app.syrf.org.uk/) (Bahor et al. 2021). Prescreening and full-text screening were performed by two independent reviewers, with discrepancies resolved through discussion with a third reviewer.

Data extraction and effect measures

The key study characteristics extracted were experimental group, number of animals per group, species, sex, age and weight of the animals, comorbidity status, anesthetic agent, ischemic model, cotreatment, H2S donor, dosage, administration time, and route. For the effect measures, data on infarct volume (primary outcome), neurological deficit score, and brain edema were extracted. Mechanistic outcome measures involving biomarkers are reported in five or more studies. In cases where outcomes were reported at different time points for the same group of animals, the last time point measurement was extracted.

For all outcome measures, published continuous data (mean) and variance (standard deviation/standard error of the mean) were extracted without unit conversion. In cases where the variance was given as the standard error of the mean, the standard deviation was estimated on the basis of the sample size. The data presented graphically were extracted using WebPlotDigitizer (version 4.5, August 2021, Pacifica, California, USA). For one study, the standard deviation for infarct volume was obtained from the author of the original paper (Genc et al. 2023). If it was not possible to obtain the necessary effect measures, the paper was not included in the meta-analysis. Two reviewers independently performed the data extraction using the SyRF application. In cases of < 10% difference between extracted data, an average of the two values was taken; discrepancies > 10% were resolved through discussion with a third reviewer. The extracted raw data are presented in Supplement 3.

Meta-analysis

A meta-analysis was conducted on the full texts of a predefined number of studies (at least 10) with similar study designs, comparing the effects of any H2S donor with no treatment on structural and functional outcomes in in vivo animal models of ischemic stroke. All analyses were performed using the “metafor” package (Viechtbauer 2010) in R 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria; https://www.r-project.org/) and guided by the online version “Doing Meta-Analysis with R: A Hands-On Guide” (Harrer et al. 2021). The R codes are presented in Supplement 4.

Pooling of data

The data were pooled for the outcome measures that were reported in six or more studies. Pooling of effect sizes was accomplished through the calculation of the normalized mean difference (NMD) for infarct area, neurological deficit score, and brain edema, considering that the data were on a ratio scale. For other outcome measures, Hedges’ g (bias-corrected standardized mean difference (SMD)) served as the effect size. Random-effect models with a restricted maximum likelihood (RELM) estimate of between-study variance were employed to obtain pooled effect sizes for each outcome. The 95% confidence interval (CI) surrounding the pooled effect was computed using Knapp-Hartung adjustments (Knapp and Hartung 2003). Forest plots were generated to summarize the meta-analysis results. The sensitivity of the analysis was assessed by repeating it with a fixed-effect model and performing the leave-one-out test.

Assessment of heterogeneity

The between-study variance was estimated by calculating Tau2, Q (χ2), and its significance level. The percentage of residual variation attributable to between-study heterogeneity was expressed as I2. Meta-regression (subgroup) analysis was conducted to investigate the impact of various study characteristics on the treatment effect, including the type, dose, and application time of H2S donors; duration and mode of ischemia (temporary vs. permanent); age and species of animals; outcome assessment time; and risk of bias measures. The number of subgroups was minimized, defined according to the cutoff points identified by a data-driven approach (i.e., median), as suggested by Wang et al. (2021). Meta-regression analysis was performed on the pooled infarct volume data, the outcome measure reported by most of the studies.

Assessment of publication bias

Publication bias was assessed using small-study effect methods involving visual inspection of the funnel plot and quantification of asymmetry with Egger’s regression test. The trim-and-fill method was applied to address funnel plot asymmetry and obtain a corrected effect estimate. To avoid false-positive results, NMD was used as the effect size measure in the funnel plot, as suggested by Zwetsloot et al. (2017).

Assessment of quality and risk of bias

The quality of evidence for each study was evaluated using the CAMARADES study quality checklist (Sena et al. 2007), modified by McCann et al. (2016), to incorporate relevant items from the updated STAIR criteria (Fisher et al. 2009). The risk of bias was categorized according to the SYRCLE risk of bias tool (Hooijmans et al. 2014), and the risk of bias plot was generated using the web-based application robvis (https://mcguinlu.shinyapps.io/robvis/) (McGuinness and Higgins 2021). Two reviewers independently assessed study quality and risk of bias, with discrepancies resolved through discussion with a third reviewer.

The significance level was set at p < 0.05 throughout the analysis.

Results

Characteristics of the selected studies

Thirty-four publications meeting the study selection criteria were identified through a systematic review of the literature (Fig. 1). Seven of these studies were conducted in mice, and the remaining 27 were conducted in rats; all of these studies used a middle cerebral artery occlusion model to induce transient stroke. Ischemia was induced for a wide range of durations, from 1 to 24 h. With the exception of three publications where the sex of the animal was not specified, all studies used male animals. Only three studies were carried out on aged animals, and none of the studies used animals with comorbidities. Notably, most studies (26/34) used NaHS as the H2S donor, mostly by intraperitoneal administration, with varying doses (1–40 mg/kg) and times of administration (before, during, or after ischemia). The basic characteristics of the included studies and published measures are summarized in Supplement 5.

Fig. 1
figure 1

PRISMA flow diagram of the systematic review process

Pooled effect measures

Infarct area

The infarct area of 408 H2S-treated animals versus 232 control animals from 54 comparisons was reported in 31 studies—mm3 in 8 studies and the percentage of brain area in the remaining 23 studies. Pooling the NMDs using a random-effect model revealed that H2S decreased the infarct area by 34.5% (95% CI 28.2–40.8%, p < 0.0001) (Fig. 2). The fixed-effect model also revealed a large and significant effect of H2S (36.9% decrease in the infarct area 95% CI 35.2–38.6%, p < 0.01), indicating the sensitivity of the random-effect model to the variability between studies. In the leave-one-out analyses, the pooled estimate ranged between 33.7 and 35.9% when each of the studies was removed from the model and the analysis was repeated, indicating that the results of the meta-analysis were relatively stable and not strongly influenced by any single study (Supplement 6).

Fig. 2
figure 2

Forest plot of individual and overall effect sizes and heterogeneity statistics for infarct area. NMD, normalized mean difference; CI, confidence interval

Heterogeneity analysis indicated substantial variability among the studies, with a χ2 (heterogeneity statistics) of 520.9 (df = 53, p < 0.0001) and an I2 of 89.8%, suggesting that approximately 90% of the total variability in effect sizes is due to true differences between studies rather than sampling error (Fig. 2). In the leave-one-out analysis, the iteration of the model by excluding each study caused the heterogeneity to vary between 88.3 and 90.0%, confirming that the sources of variability among studies were not primarily driven by one particular study (Supplement 6).

Neurological deficit score

The neurological deficits of the animals were assessed in 20 studies by various scales, including the Longa, Garcia, mNSS, Bederson, and Philips scoring systems. The most preferred scale among these was the Longa scoring system, which was employed in 161 H2S-treated animals versus 83 control animals in 15 comparisons from 9 studies. High scores indicate severe neurological deficits on the five-point (0 to 4) Longa scale (Longa et al., 1989). Overall, H2S caused a 37.9% reduction in the Longa score (95% CI 29.0–46.8%, p < 0.0001), with moderate variability between studies (I2 = 63.8%, χ2 = 38.6, df = 14, p = 0.0004) (Fig. 3).

Fig. 3
figure 3

Forest plot of individual and overall effect sizes and heterogeneity statistics for neurological deficit score and brain water content. NMD, normalized mean difference; CI, confidence interval

Brain water content

The brain water content of 99 H2S-treated animals versus 62 control animals was reported in 9 studies as [(ischemic hemisphere wet weight − dry weight)/wet weight] × 100%. Compared to the control group, H2S caused a small but significant reduction in brain water content (3.2%, 95% CI 1.4–4.9%, p = 0.0014), with very high heterogeneity (I2 = 94.6%, χ2 = 277.1, df = 15, p < 0.0001) (Fig. 3).

Biochemical markers for apoptosis and inflammation

The levels of TUNEL and caspase-3, which are markers of apoptosis, and TNF-alpha and IL-1beta, which are proinflammatory cytokines, were reported in at least six studies. When the data were pooled using the random-effect model, H2S significantly decreased only TUNEL (Hedges’ g −2.0, 95% CI −3.0; −0.9, p = 0.0010) but had no significant effect on the remaining variables (Hedges’ g −2.1 [−4.4; 0.3], p = 0.0815 for caspase-3; −0.9 [−1.9; 0.1], p = 0.0670 for IL-1beta; −1.1 [−2.4; 0.2], p = 0.0797 for TNF-alpha) (Fig. 4). This finding may suggest that the inhibitory effect of H2S on caspase-independent apoptotic pathways or the early/intermediate stage of apoptosis, rather than its anti-inflammatory activity, plays a role in its beneficial effect on the experimental stroke model.

Fig. 4
figure 4

Forest plots of individual and overall effect sizes and heterogeneity statistics for biomarkers of apoptosis (TUNEL and caspase-3) and proinflammatory cytokines (TNF-alpha and IL-1beta). SMD, standardized mean difference; CI, confidence interval

Subgroup analysis

The results of subgroup analyses are reported in Table 1, which shows the estimated effect in each subgroup, the p value of the test for subgroup differences, residual heterogeneity, and the amount of heterogeneity accounting for that study variable. As shown in Table 1, animal species; blinded outcome assessment; controlling temperature; the STAIR score of the study; duration of ischemia; measurement unit for infarct area, type, dose, and application time of H2S; and time of outcome measure assessment did not have a significant effect on the observed heterogeneity between studies. However, although the difference was slightly greater than the conventional threshold (p = 0.0551), it is remarkable that H2S donors other than NaHS induced a much greater reduction in infarct volume than did NaHS donors (40.0% vs. 27.9%). The pooled effect of each H2S donor is shown in a forest plot in Supplement 7. An unexpected finding in the subgroup analysis was that randomization significantly increased the effect size (p = 0.01). Unexpectedly, H2S was found to cause a significantly greater reduction in the infarct area in randomized experiments, accounting for 13.3% of the heterogeneity.

Table 1 The results of subgroup analyses

Publication bias

The visual inspection of the funnel plot and Egger’s regression test (t = −0.89, df = 52, p = 0.3784) revealed no evidence of funnel plot asymmetry (Fig. 5). Parallel to this finding, the trim-and-fill method revealed no missing studies and did not adjust the model.

Fig. 5
figure 5

A Funnel plot representing the relationship between effect size and standard error (precision) for the studies included in the meta-analysis. The y-axis displays the standard error of the effect size, inversely proportional to study precision, while the x-axis represents the effect size estimate, which is the normalized mean difference (NMD) as a percent reduction in infarct volume. Each point on the plot corresponds to an individual study included in the meta-analysis. The diagonal line in the center represents the overall effect estimate. B A scatter plot of effect size against precision with the Egger regression line overlaid (t = −0.89, df = 52, p = 0.3784)

Study quality and risk of bias

According to the CAMARADES study quality checklist modified to include relevant items from the updated STAIR criteria, studies had low quality and high risk of bias (median score 5/15, interquartile range 4–9). Randomization of group allocation, blinded induction of ischemia, and blinded assessment of outcome—factors whose absence created the highest risk of bias—were reported in only 65%, 3%, and 50% of the studies, respectively (Table 2). Categorization of risk of bias by using the SYRCLE risk of bias tool revealed that the majority of the included studies had a “high” or “unclear” risk of bias, and none of the studies overall had a “low” risk (Fig. 6).

Table 2 The risk of bias score of each publication according to the CAMARADES study quality checklist was modified to include relevant items from the updated Stroke Therapy Academic Industry Roundtable (STAIR) criteria (Sena et al. 2007; Fisher et al. 2009; McCann et al. 2016)
Fig. 6
figure 6

Weighted bar plots of the distribution of risk of bias judgments within each bias domain of the SYRCLE risk of bias tool for all of the included studies (n = 34)

Discussion

To our knowledge, this is the first meta-analysis of preclinical studies evaluating the effect of H2S in an experimental stroke model. The analysis revealed that compared with no treatment, H2S significantly improved structural and functional outcomes in in vivo animal models of ischemic stroke. However, when considered as a whole, the level of evidence from preclinical studies is not sufficient to proceed to clinical trials due to the low external validity, high risk of bias, and variable design of existing animal studies.

Neuroprotective effects of H2S

Among the studies included in the meta-analysis, infarct volume was reported in 31 (91.2%), whereas neurological deficit scores were reported in only 20 (58.8%). Although these percentages are similar to those found in preclinical stroke studies in the literature, nearly all clinical stroke trials have reported functional outcome as the primary endpoint, and fewer than 15% of clinical trials have reported infarct volume (Schmidt-Pogoda et al. 2020). This factor may reduce the clinical translation of animal studies evaluating the effect of H2S on stroke. A meta-analysis of these studies indicated that H2S is very effective against stroke, decreasing the infarct area by 34.5% (95% CI 28.2–40.8%, p < 0.0001) and improving neurological deficit scores by 37.9% (95% CI 29.0–46.8%, p < 0.0001). This finding is in line with the findings of other potential agents against stroke, which decrease infarct volume and improve functional outcome by 24% on average in experimental studies (Sena et al. 2007; McCann et al. 2016; Schmidt-Pogoda et al. 2020). Due to the high risk of bias and low quality of evidence, which are common problems in preclinical studies, the efficacy of stroke treatments has declined from experimental to clinical trials (Macleod et al. 2004; Sena et al. 2007; McCann et al. 2016; Schmidt-Pogoda et al. 2020). Given the high risk of bias of the studies included in our meta-analyses, H2S is also likely to be less effective in clinical studies than in preclinical studies. Furthermore, functional outcome, the primary endpoint for clinical trials, has been evaluated in preclinical H2S stroke studies by using different methods, a situation that further jeopardizes the translation of animal studies. Twenty studies evaluating neurological deficits utilized various functional scales, including the Longa, Garcia, mNSS, Bederson, and Philips scoring systems, each covering different aspects of neurological function. Since Longa was the most commonly used scale, we performed pooled analyses including only studies using the Longa scale to increase the validity of the meta-analysis. Although this approach reduces between-study heterogeneity, it limits the power of the analysis due to the inclusion of only nine studies.

Another piece of evidence for the neuroprotective effect of H2S was a reduction in the brain water content, which was reported in nine studies using the wet-dry weight method. Brain water content is a measure commonly used in experimental animal stroke studies to assess cerebral edema, which occurs as a result of ischemia-induced inflammation and breakdown of the blood-brain barrier (Kozler et al. 2022). In the context of stroke research, cerebral edema is a significant concern because it can contribute to secondary damage and worsen outcomes following a stroke (Chen et al. 2021). H2S caused a small but significant reduction in brain water content (3.2%, 95% CI 1.4–4.9%, p = 0.0014) when the findings of the nine studies were pooled. As noted by Keep et al. (2012), when the brain water content is calculated in % using the wet-dry weight method, a “small” change in the % water content reflects a large change in the swelling of the tissue. Therefore, keeping in mind the high between-study heterogeneity (I2 = 94.6%, p < 0.0001), H2S significantly reduced brain edema after stroke.

Mechanisms of the neuroprotective effects of H2S

Research in recent decades has suggested that exogenous H2S and H2S donors exert neuroprotective effects on animal stroke models by inhibiting apoptosis, inflammation, oxidative stress, mitochondrial dysfunction, blood-brain barrier leakage, and disrupted cerebrovascular homeostasis, all of which contribute to ischemia and reperfusion injury in the brain (Dou et al. 2016; Chan and Wong 2017; Gopalakrishnan et al. 2019; Narne et al. 2019; Ding et al. 2023) (Fig. 7). The studies included in the meta-analysis evaluated these pathways by measuring various biomarkers. Among these, the ones for which the effect sizes could be pooled were the apoptosis markers TUNEL and caspase-3 and the inflammation markers IL-1beta and TNF-alpha. TUNEL was the only biomarker for which H2S significantly reduced the pooled results; caspase-3, IL-1beta, and TNF-alpha were not significantly affected. The fact that H2S does not affect proinflammatory cytokines but reduces TUNEL staining suggests that its antiapoptotic effect is more important for its neuroprotective effect than its anti-inflammatory effect. The observation that H2S decreases the number of TUNEL-positive cells without altering caspase-3 expression has multiple probable interpretations (Didenko et al. 2002). The antiapoptotic effect of H2S may involve caspase-independent apoptotic pathways or an early/intermediate stage of apoptosis. Since TUNEL staining could also be associated with other forms of cell death, such as necrosis or autophagy, the effect of H2S on TUNEL may also be explained by its inhibitory effect on autophagy induced by stroke (Chen et al. 2014).

Fig. 7
figure 7

Proposed mechanisms mediating the neuroprotective effect of H2S in cerebral ischemic stroke

Source of heterogeneity

To identify the sources of high heterogeneity between studies, we performed subgroup analyses to assess the effect of parameters related to the stroke model (e.g., duration of ischemia), treatment (e.g., type, dose, and time of H2S administration), and time of outcome assessment on the overall treatment effect. None of these parameters had a significant impact on heterogeneity. However, although not at a statistically significant level, H2S donors other than NaHS tended to be more effective in the stroke model. Because many different H2S donors were used in the studies, we were unable to obtain a sufficient number of studies for each donor to draw conclusions on the best donor effective against stroke. However, we observed that dual nitric oxide and H2S donors, such as 8e and 8d (Yin et al. 2016), and slow H2S-releasing agents, such as anethole dithiolethione (ADT) (Powell et al. 2018), are more effective than NaHS in preclinical stroke models. In further animal studies on stroke, we suggest using these or similar H2S donors.

Study quality and risk of bias

Compared with the meta-analyses of preclinical stroke studies in the literature, the quality of the studies included in our model was low (García-Bonilla et al. 2012; Schmidt et al. 2014; McCann et al. 2016; Boboc et al. 2023; Li et al. 2023). In previous studies, the median quality scores ranged between 4 and 7 out of 10 and between 6 and 11 out of 15, whereas the median score for the 34 studies in our analysis was 5 out of 15. Considering that most of the studies were published in 2015 and later, it is surprising that despite all efforts, preclinical stroke studies are still far from the expected quality. Low experimental and reporting quality creates a high risk of bias, making the reliability of the findings questionable. The most critical components for bias in stroke experiments (Sena et al. 2007), which are randomization of group allocation and blinded assessment of outcome, were reported in only 65% and 50% of the studies, respectively. Additionally, none of the studies reported any concern about avoiding the intrinsic neuroprotective effects of anesthesia. Five studies did not specify the anesthetics used. Chloral hydrate (n = 14), isoflurane (n = 12), halothane (n = 1), ketamine (n = 1), and sodium pentobarbital (n = 1) were also used. Although the CAMARADES checklist emphasizes that ketamine is the only anesthetic to avoid its neuroprotective effect (Sena et al. 2007), as there is evidence in the literature for neuroprotective effects of other anesthetics as well (Carbone and Austin 2016), we considered all studies to have a high risk of bias for this item. The lack of sample size calculations is another factor that increases the risk of bias. None of the studies reported any calculation or explanation for the sample size, which can lead to several drawbacks and limitations in the study design and interpretation of results, such as insufficient statistical power, inaccurate effect size estimates, increased type 1 and 2 errors, limited generalizability of the study findings, and increased ethical considerations (Percie du Sert et al. 2020). The use of male animals only, no use of animals with comorbidities, and no cotreatment were other factors that reduced the clinical and translational validity of the studies included in the meta-analysis.

In the subgroup analysis in which we evaluated the effect of the study quality score and risk of bias parameters on heterogeneity, surprisingly, studies reporting randomization and blinding and those with high-quality scores had larger effect sizes, accounting for 13.3%, 1.8%, and 3.6% of the heterogeneity, respectively. Although the p value for only randomization is significant according to the conventional significance threshold, others still indicate a difference at the trend level. The observation that randomization and blinding increase the effect size in an experimental animal study may seem counterintuitive and raise questions about the validity of the included studies. Considering that randomization and blinding correct the impact of sampling variability, selection bias, systemic errors, and confounding variables, we may also suggest that the effect of H2S on stroke is even greater than the estimates (McGough and Faraone 2009). However, given the poor study quality and small sample size, it is more plausible to suggest that randomization might result in seemingly large effects due to the limited number of subjects and insufficient study quality.

Publication bias

Another unexpected finding of the present meta-analysis was the symmetric funnel plot and nonsignificant Egger’s regression test, while the studies included in the model had a high risk of bias. This finding may suggest that publication bias may not be a major concern in our meta-analysis. However, it is crucial to note that funnel plot asymmetry is not solely indicative of publication bias; other factors can also contribute to asymmetry (Page et al. 2021). In our case, funnel plot symmetry may reflect genuine heterogeneity in the effect sizes across studies. A high risk of bias in individual studies may have contributed to this heterogeneity. In the absence of publication bias, heterogeneity in effect sizes may arise from variations in study design, animal models, interventions, or other biological factors. It is also possible that if a high risk of bias within individual studies is evenly distributed across studies and not selectively reported, it may not lead to funnel plot asymmetry.

Limitations

The main limitation of this meta-analysis is the high between-study heterogeneity, which decreases the power and reliability of the pooled effect sizes, making it challenging to detect a true effect of H2S in stroke patients. Subgroup analysis and risk of bias assessment revealed that the possible sources of heterogeneity were the various H2S donors used in the studies and the low quality of evidence. Although the pooled neuroprotective effect of H2S against stroke seems large, the true effect size needs to be confirmed in further well-designed experimental studies.

Conclusions and recommendations for further studies

H2S significantly reduces the stroke-induced infarct area, brain water content, and neurological deficit, primarily through early antiapoptotic effects. These promising findings are limited by the low quality of evidence and the high risk of bias represented by high heterogeneity in the effect sizes across studies. Despite all the efforts to standardize preclinical stroke studies and to increase translational success (Sena et al. 2007; Hooijmans et al. 2014; Schmidt-Pogoda et al. 2020), it is noteworthy that the internal and external validity of the animal studies evaluating the effect of H2S in stroke is low. When the studies included in our analysis are considered in the light of the CAMARADES, STAIR, and IMPROVE guidelines (Sena et al. 2007; Fisher et al. 2009; Percie du Sert et al. 2017), our specific recommendations for the future preclinical stroke studies on H2S are as follows:

  • Use a valid ischemic stroke model

  • Increase translation of findings by including both sexes, different age groups, and animals with comorbidities

  • Perform an accurate sample size calculation before the experiments

  • Report animal loss with reasons

  • Randomly allocate animals to the experimental groups

  • Blindly assess the outcome

  • Evaluate functional outcome, a more clinically relevant endpoint than stroke volume

  • Evaluate long-term effect of H2S

  • Compare H2S with current or emerging treatment options for ischemic stroke

  • Report transparently

In conclusion, although H2S seems promising against stroke, the available preclinical evidence is far from providing a basis for further clinical studies. Preclinical studies that provide high-quality evidence on the effect of H2S in stroke patients are still needed.