Background

Iron deficiency anemia is a significant concern for women of reproductive age (WRA) because it can have adverse maternal and fetal outcomes [1]. Because of the severe consequences of anemia for both WRA and their offspring, the World Health Organization (WHO) has prioritized iron deficiency anemia on the global health agenda [2]. The existing literature reveals that anemic pregnant women are more likely to have miscarriages, intrauterine growth retardation, preterm births, stillbirths, babies born with low birth weight (LBW), and neonatal mortality [3,4,5,6]. Previous studies, for example, show that a 10 gm increase in hemoglobin levels results in a 30% decrease in maternal deaths, indicating a linear relationship between anemia and maternal mortality [7]. Similarly, anemia during pregnancy contributes to 25% of LBW, 44% of preterm births, and roughly a quarter of stillbirths and neonatal deaths in resource-limited settings [8]. A meta-analysis found that anemia during pregnancy increased the risk of premature birth [9]. Furthermore, according to a WHO multi-country survey, severe anemia doubles the risk of maternal death [10].

While anemia is prevalent among WRA in both high and low and middle-income countries (LMICs), the latter experience a higher burden of anemia [11, 12]. Anemia affects approximately 66.7% of WRA in LMICs, possibly due to a poor diet caused by food insecurity, parasitic infections, and decreased iron absorption [13, 14]. Anemia affects roughly half a billion women worldwide, with South-East Asia accounting for 42% of those affected, followed by Africa and the Eastern Mediterranean [15]. In South Asian and African countries, approximately 50% of the burden of anemia among WRA is due to iron deficiency [16, 17]. For example, the burden of anemia ranges between 50 to 90% across diverse states of India [18,19,20]. Similarly, anemia affects 34.5% to 56.8%, 63.1%, and more than a third of the WRA in Ethiopia, Uganda, and Bangladesh, respectively [21,22,23,24].

While the etiology of anemia is multifactorial, inadequate iron store as a result of poor nutrition has been identified as an important risk factor for anemia among WRA [25, 26]. Women of reproductive age are more likely to suffer from iron deficiency due to menstruation, and this risk increases during pregnancy due to increased metabolic demands of pregnancy and fetal growth [25, 26]. To address the high prevalence of anemia, the WHO proposes to distribute iron supplements to all WRA in regions with an anemia burden of more than 20% [27].

Even though several interventions have been shown to reduce the global burden of anemia by 12% between 1992 and 2011, such interventions did not benefit WRA living in LMICs [28]. In addition, numerous epidemiological studies, including randomized controlled trials (RCTs), are being conducted to evaluate the role of iron in reducing the burden of anemia among WRA in LMICs. The findings of these RCTs have been synthesized in a few systematic reviews and meta-analyses; however, these meta-analyses from the last ten years are not without gaps and limitations. For instance, a Cochrane review by Low Yuan et al. (2016) assessed the effect of iron therapy in reducing anemia among menstruating women [29]. This review only included non-pregnant women and excluded pregnant women with a higher burden of anemia during pregnancy and requiring special care to avoid the negative feto-maternal outcomes associated with iron deficiency anemia [29]. In contrast, another systematic review that was undertaken in 2015 only focused on pregnant women and excluded non-pregnant women [30]. Haider et al. (2013) conducted a meta-analysis to evaluate the effect of iron therapy, but the study results were largely based on observational cohort and quasi-experimental studies, which are not free of residual and unmeasured confounding [31]. Finally, rather than studying iron deficiency markers as an outcome, Pasricha et al. (2014) conducted a meta-analysis on iron therapy's effect on women's physical exercise [32].

As previously stated, the limitations of existing reviews limit the capacity of health professionals and decision-makers in LMICs to address the problem of iron deficiency in both pregnant and non-pregnant women. Given the existing gaps in the literature, a systematic review of randomized controlled trials (RCTs) was required to assess the effect of only iron therapy in reducing anemia, primarily among pregnant and non-pregnant women living in LMICs. Hence, we conducted a systematic review and meta-analysis of RCTs to assess the existing evidence on the efficacy of iron therapy in lowering the burden of anemia and improving iron deficiency markers among WRA in LMICs. This review will help health professionals, researchers, and decision-makers in LMICs make evidence-based decisions when prescribing iron therapy to WRA. As a result, the high prevalence of iron deficiency anemia among WRAs living in resource-limited settings will be addressed.

Material and methods

Searching strategy

The updated Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were employed to undertake this review and meta-analysis for the qualitative and quantitative synthesis of literature [33]. We focused on reviewing the evidence for the efficacy of iron as a therapy or intervention to reduce the burden of anemia among WRA in LMICs. Using keywords and specific search terms such as Medical Subject Headings (PubMed) and Emtree, a systematic search of three major electronic databases, including Medline through PubMed, Embase, and Science Direct, was undertaken.

Eligibility criteria

Before conducting a systematic review of the literature and meta-analysis of the studies, we defined eligibility criteria to include the relevant studies undertaken predominantly in LMICs. World Bank’s 2018 guidelines of country classification were used to define a country as an ‘LMIC.’ The eligibility of a study was contingent on being primarily an RCT assessing the efficacy of iron in decreasing anemia among women aged 15–49 years in LMICs and being published in the English language in a peer-reviewed scientific journal from January 2000 to December 2020. A form to screen the studies based on eligibility criteria is given in Table 1.

Table 1 Screening form to assess the eligibility of the potential research articles 

Eligibility criteria were categorized into four major headings using PICO (population, intervention, comparison, and outcome) algorithm (Table 2) [34]. The population for the current review was women of reproductive age (pregnant and non-pregnant women). The intervention was considered an iron supplement alone or in combination with folic acid or other micronutrients, taken orally or intravenously in a dosage of 60 to 100 mg daily for at least one month. To have the potential benefits of 60 to 100 mg of iron in reducing anemia and improving hemoglobin levels, WRA should have consumed iron for at least one month [35]. Since the evidence suggests that iron should be taken by WRA for at least 3 to 4 weeks to show its benefits, we decided to include RCTs where women were given iron for at least one month [35, 36]. Women in the control or comparison group received no iron supplement, a placebo, or any other mineral such as zinc, vitamin A, or B12 without iron or a different dose of iron than the defined intervention (details below). The primary outcome of the review is the improvement in the mean hemoglobin levels of women at the end of the intervention (details below). Lastly, the study design was either individual or cluster randomized controlled trials conducted in hospitals or communities. To include the most appropriate research articles that meet the eligibility criteria, we used filters on the year of publication (2000–2020), gender (females), age of study participants (15–49 years), language (English), and study designs (RCTs).

Table 2 Eligibility criteria according to the PICOS framework

Intervention and primary outcome

Iron therapy in the form of tablets/ syrup/ injectables administered daily, weekly, or monthly in the case of injectables was proposed as an intervention. In addition, we considered interventions that evaluated the effect of iron and folic acid supplementation in addition to the usual iron-rich diet. The main outcome is a change in the mean hemoglobin level. We included RCTs that used hemoglobin levels (grams per deciliter (g/dL)) to label a WRA as anemic using the WHO proposed cut-offs of hemoglobin level of < 12.0 g/dL or Hct < 36% and < 11.0 g/dL or Hct < 33.0% for non-pregnant and pregnant women respectively (52). The secondary outcomes were the prevalence of anemia, mean serum ferritin, serum transferrin, and serum iron at the end of the intervention.

Sources of information and search strategy

A systematic literature search was performed electronically to assess the efficacy of iron as a therapy to reduce anemia among WRA in LMICs. Two authors searched the databases independently for studies using a combination of search terms developed in response to the proposed research question. As a first step, four principal concepts, including (anemia, WRA, iron therapy, and LMICs) were identified. This was followed by using the synonyms of these major concepts, such as low hemoglobin/hematocrit level (anemia), married women/married pregnant women/ married non-pregnant women, iron/iron supplements/iron therapy/iron regimen (Intervention), and developing countries/poor-resource countries/ resource-constrained countries/less developed countries (LMICs). Besides, we used different spellings of these concepts, such as “anemia vs. anemia” and “hemoglobin vs. hemoglobin.” to obtain appropriate studies. Then, to find more research studies with the same root word, we combined the major concepts using combinations (AND, OR) and truncation (*). Further, indexed keywords in the Medical Subject Headings (MeSH) were used to ensure uniform search terms. Initially, the search strategy (shown in Table 3) was developed using the PICO framework. The search strategy was further refined, and finally, the below search strategy was used to search three databases.

“Iron therapy OR iron intervention OR oral iron [MeSH Terms]) AND (anemia* OR iron deficiency anemia*) AND (women OR married women* OR pregnant or non-pregnant women*) AND (poor countries OR developing countries OR low-middle-income countries)”.

Table 3 Search strategy according to PICO criteria

Study selection

We used endnote software to handle research articles exported from the databases [37]. During the study selection process, title and abstract screening were conducted by two review authors independently. All research articles were screened by study title using Endnote software, followed by screening the study abstracts of the shortlisted articles. This was followed by full-text screening conducted by two review authors independently. The full texts of the shortlisted articles were then retrieved and screened against the inclusion and exclusion criteria. A third reviewer, an expert in the field, resolved disagreements between the two. Before ruling any study ineligible, both reviewers independently reviewed the full texts of the articles, and each reviewer provided strong justification. The third reviewer made the final decision to consider an article relevant. The flow diagram generated to illustrate the study selection process is shown in Fig. 1.

Fig. 1
figure 1

PRISMA 2020 flow diagram summarizing the identification and selection of relevant Randomized Controlled Trials

Data collection process

Two independent reviewers completed a customized data extraction sheet for each eligible research article. To ensure that all important results and conclusions were considered in the review, the data extraction tables of the two independent reviewers were tallied. We enlisted the help of a third evaluator to resolve any conflicts or discordant information between the data extraction processes of two independent reviewers. Furthermore, existing research studies on the identified topic were reviewed to identify key items for the data extraction form. The data extraction form includes the following items: study name and author with publication year, study location, sample source, the sample size of both intervention and control groups, characteristics of study participants: baseline hemoglobin (g/dl), type of participants, age category, length of follow-up, intervention type and mode of administration, blinding procedure, randomization method, and key findings for the primary and secondary outcomes.

Assessment of risk of bias

Overall quality was assessed using a revised Cochrane risk-of-bias tool for RCTs (RoB 2.0), which assesses “selection, performance, attrition, detection, and reporting bias by evaluating reported sequence generation, allocation concealment, blinding of participants and personnel, incomplete outcome data, selective outcome reporting, and other possible sources” [38]. Two independent review authors assessed the risk of bias. The conflict or disagreement between two independent review authors regarding the risk of bias assessment was resolved by a thorough discussion between two reviewers, and a third reviewer was invited if the two independent reviewers did not resolve the conflict. A high risk of bias was recognized if randomization or allocation concealment was either absent or judged at a higher risk, participants were not blinded, or there was high or imbalanced attrition across the groups. All RCTs that did not fulfill this criterion were categorized as studies with a low risk of bias. We synthesized the findings of the risk of bias, and the degree of bias was rated as low, high, or unclear. A final graph was generated to visualize the extent of bias in all eligible studies. Further, we assessed publication bias by constructing and assessing the asymmetry of funnel plots.

GRADE assessment for overall certainty of the evidence

Using the GRADE (Grading of Recommendations, Assessment, Development, and Evaluation), we assessed the certainty of evidence [39]. The GRADE provides a reproducible and transparent framework to grade the certainty of evidence. Two independent review authors assessed GRADE. The GRADE assessment helped to rate the certainty of the evidence of iron therapy in improving anemia. The two authors independently assessed the certainty of the evidence for different outcomes such as serum hemoglobin, ferritin, iron, transferrin, and anemia. Since we included all RCTs in the review, the evidence was initially set as highly uncertain due to a lack of residual confounding. However, to increase our confidence for high certainty of the evidence, we used five additional criteria (risk of bias in individual RCTs), inconsistency of findings between RCTs, indirectness of evidence, imprecision of the estimate, and publication bias to make a conclusion about the overall certainty of evidence [39]. We ultimately developed a summary of findings (SoF) table using GRADE.

A narrative synthesis of eligible studies

Before conducting a quantitative analysis of the same studies, we conducted a narrative synthesis of the eligible studies. To begin, we conducted a descriptive analysis on all of the final shortlisted full-text RCTs to collect information on the authors, publication year, study location, sample source, the sample size of both intervention and control groups, characteristics of study participants: baseline hemoglobin (g/dl), type of participants, age category, length of follow-up, intervention type and mode of administration, blinding procedure, and method of randomization. To begin, two reviewers thoroughly read the full text of the articles to extract data and summarize the key findings. The same reviewers then recorded analytical interpretations of the findings to identify important emerging themes. Finally, they highlighted the role of iron in decreasing anemia among WRA in LMICs by summarizing the relevant findings for the primary and secondary outcomes.

Statistical analysis for quantitative results

Open Meta[analyst] software was employed to analyze the data retrieved from all eligible studies [40]. We used the standardized mean difference (SMD) and its respective 95% confidence interval (CI) to identify differences in hemoglobin mean across intervention and comparison groups. We used fixed effects (Mantel–Haenszel method) and random effects (DerSimonian and Laird method) meta-analysis models to compute summary estimates for the overall effect of iron. The primary analysis used a random effects model to generate a pooled effect estimate for the primary and secondary outcomes. Because of true differences in the prevalences of causal partners, it is reasonable to expect that effects will vary across populations. Since the eligible RCTs were from different international populations, using a random effects model was considered the most conservative analysis. Although we used a commonly used fixed effect model at first, the fixed effect model may understate the uncertainty of our findings [41]. Furthermore, the fixed effects model generated narrow confidence intervals that failed to account for actual variance between studies [41]. Also, the fixed model assumes that studies are drawn from populations with the same effect size, which may not be true for real-world data [42]. On the other hand, the random effect model assumes that studies are drawn from populations with varying effect sizes, which seems reasonable [42]. As a result, we kept the results from random-effects models rather than the fixed-effects model. Forest plots were created to visually inspect the degree of heterogeneity and demonstrate individual and pooled effects across eligible RCTs. Q-statistics and I2 were employed to quantify heterogeneity. Significant heterogeneity was found to exist when I2 was more than 50%, and a p-value of < 0.05 was used as a criterion for statistically significant results. Moreover, funnel plots were examined for asymmetry to identify publication bias. Because not all studies measured all iron deficiency markers, such as transferrin receptor or serum iron levels, we did not include these outcomes in the quantitative synthesis; instead, the findings were summarized using a narrative.

Results

Findings for the study selection process

Five hundred fifty-one RCTs were extracted from various databases, with 39 duplicates removed and 512 unique RCTs identified for further screening. Based on the eligibility criteria, 386 irrelevant titles and abstracts were removed after screening. After reviewing 126 identified abstracts, 87 studies were removed, including review articles, animal studies, secondary analysis, and in vitro studies. We read the full texts of 39 records and excluded 20 RCTs based on the inclusion and exclusion criteria. Finally, 19 studies were included in our review and meta-analysis (Fig. 1).

Study characteristics of the eligible studies

A sample size of 19 studies ranged between 50 to 988 participants. However, the total sample size in the intervention (n = 2230) and control (n = 2281) groups was comparable. Most studies were conducted in India (n = 10), followed by Pakistan (n = 2) and Bangladesh (n = 2). One study was conducted in Nigeria (n = 1), Vietnam (n = 1), Tanzania (n = 1), Peru (n = 1), and Indonesia (n = 1). The majority of the studies were conducted in hospitals, including antenatal clinics and tertiary care OBGYN clinics (n = 14), followed by community-based sampling (n = 4) and participants from factory settings (n = 1). Baseline average hemoglobin (g/dl) for the intervention group ranged between 8.38 ± 1.41 to 12.5 ± 1.14, and for the control group, it ranged between 8.27 ± 1.20 to 11.8 ± 1.14. Most participants were aged 16–40 years, except for three studies, which had participants aged 15–49 years. Two studies included non-pregnant women, and nine included pregnant women ranging in gestation from 13–26 weeks (2nd trimester). Four studies included women with gestation periods ranging from 12–36 weeks, and two included pregnant women who were less than 24 weeks gestation at the time of enrollment. The intervention's effect was observed over a minimum of four weeks of gestation and a maximum of four weeks of the post-partum period in pregnant women (n = 17) and 13–26 weeks in non-pregnant women (n = 2) (Table 4).

Table 4 Characteristics of the included studies and their main findings for the primary and secondary outcomes (n=19)

Interventions included

Seven of the 19 RCTs were based on oral iron tablets to be administered daily [42,43,44, 48, 51, 52, 57, 60]. Two studies included oral iron tablets as a weekly dose [45, 49]. Two studies compared the efficacy of parenteral administration of iron supplementation [50, 59]. In one trial, ferric carboxymaltose was given intravenously (IV) and compared with IV iron sucrose complex [59]. Another trial provided an intervention based on three intramuscular doses of 250 mg elemental iron at intervals of 1 month along with the oral doses of 5 mg folic acid twice weekly and compared with a daily oral dose of 100 mg iron and 500 µg folic acid [50]. Of the total, three studies included micronutrient supplements comprised of zinc, minerals (copper, selenium), retinol, iodine, Vitamins: D, E, C, B1, B3, B2, B6, and B12 in different doses, along with iron and folic acid as an intervention and two out of three studies compared with iron (60 mg) and folic acid (400 μg) supplementation tablets and one study provided non-fortified beverage as a placebo [47, 53, 55]. Four studies out of 19 included an optimized preparation based on food and snacks in the form of bars, sauce, fortified ready-to-eat snacks in dried powdered form, and one study included meat, fresh fruits, and vegetable along with iron tablets (60 mg), and folic acid (250 mg) as shown in Table 4 [46, 54, 56, 58].

Pooled effect for outcomes measurement

Pooled effect for the primary outcome, serum hemoglobin

Since the complete data were only available for the hemoglobin, we performed quantitative analysis for the primary outcome (hemoglobin) and one secondary outcome (Serum ferritin). A total of 19 studies (n = 4421 participants) were included in the meta-analysis to estimate the effect size. Meta-analysis indicated overall pooled effect estimate for the role of iron therapy in lowering the burden of anemia among the WRA group was -0.71 (95% CI: -1.27 to -0.14) (p = 0.008). The heterogeneity (I2) across included studies was found to be statistically significant, as indicated by the parameters of heterogeneity (Q = 746.93, I2 = 97.59%, p = 0.000) (Fig. 2).

Fig. 2
figure 2

Forest plot summarizing the overall pooled effect of iron supplementation on the primary outcome, hemoglobin (n = 19)

Figure 2 depicts the overall findings showing that iron therapy positively affects hemoglobin levels. More specifically, iron therapy in any form increased mean hemoglobin levels by 0.40 gms. The intervention arm's mean and standard deviation for hemoglobin was 11.14 ± 1.11 gm/dl, while the comparison group's mean and standard deviation for hemoglobin was 10.74 ± 1.019 gm/dl. However, there were variations in the results of individual studies. For instance, nine studies found that mean hemoglobin levels improved in the intervention group significantly [43, 46, 47, 52, 56,57,58,59,60]. Eight RCTs found that the mean hemoglobin level was not significantly different between the two groups post-intervention [44, 45, 48,49,50,51, 53, 55]. One study showed that the mean hemoglobin level decreased significantly in both groups [54]. Another study documented that the hemoglobin level was decreased in the intervention group after comparing once-daily (intervention group) vs. twice-daily (control group) oral supplementation [42].

Findings for the secondary outcomes

The included studies assessed secondary outcomes, including changes in mean serum ferritin level, serum transferrin receptor, serum iron, and improvement in iron deficiency anemia.

Pooled effect for the secondary outcome, serum ferritin levels

For the secondary outcomes, complete data were only available for serum ferritin; therefore, we performed a quantitative analysis for serum ferritin. A total of 15 studies (n = 3648 participants) were included in the meta-analysis to estimate the effect size. Meta-analysis indicated overall pooled effect estimate for the role of iron therapy in improving the ferritin levels among WRA was -0.76 (95% CI: -1.56 to 0.04) (p = 0.04). The heterogeneity (I2) across included studies was found to be statistically significant, as indicated by the parameters of heterogeneity (Q = 659.95, I2 = 97.88%, p = 0.000) (Fig. 3).

Fig. 3
figure 3

Forest plot summarizing the overall pooled effect of iron supplementation on the secondary outcome, Serum Ferritin (n = 15)

Overall, the results revealed a favorable effect of iron therapy in improving serum ferritin levels, as shown in Fig. 3. More precisely, in simple terms, iron therapy in any form improved the mean ferritin levels by 5.90ug/L. The mean and standard deviation for ferritin levels of the intervention arm was 37.61 ± 20.0, whereas the mean and standard deviation for ferritin levels of the comparison group was 31.71 ± 20.13. However, there were variations in the results of individual studies.

Change in mean serum ferritin levels: qualitative synthesis

Fourteen studies measured mean serum ferritin levels after the intervention as a secondary outcome. Six of the fourteen studies found statistically significant improvements in mean serum ferritin levels in the intervention group versus the control group [43, 46, 47, 50, 53, 57]. One RCT found significant improvement in the mean serum ferritin levels across both groups [52]. In contrast, one study documented a significant increase in the mean serum ferritin levels in the control group [51]. One RCT found a significant reduction in the mean serum ferritin levels in both groups [54], whereas one RCT noticed a substantial reduction in the mean serum levels in the intervention group than the control group [49]. 4 RCTs identified no significant difference in the mean serum ferritin levels between the two groups [42, 44, 59, 60].

Reduction in the anemia prevalence

Six randomized controlled trials investigated the reduction in the prevalence of anemia as an outcome of interest. Two out of six RCTs reported a significant decrease in the prevalence of anemia in the intervention group compared to the control group [46, 58]. While two other RCTs showed an improvement in anemia in the intervention than the control group, the findings were statistically non-significant [55, 60]. Two RCTs did not find a significant difference in the prevalence of anemia across both groups [44, 45]. And one RCT showed negative findings, meaning a significant decline in the prevalence of anemia was observed in the control group than in the intervention group, as shown in Table 4 [51].

Change in mean serum iron

Four out of 19 RCTs assessed change in serum iron levels post-intervention, and two trials reported no significant difference in the mean serum iron between both groups [51, 59]. One RCT reported significant improvement in the mean serum iron levels in both groups [50]. In contrast, another RCT identified a statistically significant reduction in the mean serum iron in both groups [54].

Change in serum transferrin receptor

Two of the 19 RCTs examined the change in serum transferrin receptor after an intervention, with one finding that serum transferrin receptor was significantly lower in the intervention group than in the control group [46]. However, another RCT reported increased serum transferrin receptors in both groups [54].

Overall Quality assessment for RCTs

Table 5 depicts the overall quality assessment of the eligible RCTs. Overall, 6 of 19 studies were deemed to have a low risk of bias based on criteria such as the low risk of randomization, allocation concealment, or blinding [42,43,44, 53, 54, 57]. Of the total, seven studies were found to have a high risk of bias since the randomization method was unclear [45, 46, 51, 52, 54, 55, 58]. A high risk of bias was found in 2 studies where no allocation concealment was done [58, 59], and in 8 studies, concealment was unclear [45,46,47,48,49,50,51, 55]. The absence of blinding was found in 4 studies [56, 58,59,60]. An unclear description of blinding study participants or outcome assessors was found in 5 studies [45, 48,49,50,51, 54, 55] and, subsequently, labeled as having a high overall risk of bias.

Table 5 Risk of bias assessment of the studies included in the meta-analysis (n = 19)

Publication bias

A graphical representation, a funnel plot, was employed to assess publication bias, indicating potential bias because the shape of the graph is not symmetrical, as shown in Fig. 4a and b for studies that assessed primary (hemoglobin) and secondary outcomes (ferritin), respectively. Since the funnel plot is subjective, we used a confirmatory objective statistical test (i.e., Egger’s test) to evaluate the publication bias. The objective test results for the primary outcome revealed no publication bias as the t-test was not statistically significant (Egger t-test = -1.7, p = 0.107), as shown in Fig. 4a. However, the same objective test was statistically significant for the serum ferritin (Egger t-test = -2.41, p = 0.032), suggesting a publication bias (Fig. 4b).

Fig. 4
figure 4

A Funnel plot to evaluate the publication bias among included studies in the meta-analysis for the primary outcome, hemoglobin (n = 19). B Funnel plot to evaluate the publication bias among included studies in the meta-analysis for the secondary outcome, Serum Ferritin (n = 15)

Summary of findings using GRADE assessment

Since the meta-analysis was conducted on RCTs, the overall certainty of the evidence was considered reasonable. However, using the additional five criteria of GRADE, the certainty of the evidence was assessed for five outcomes, and SoF is provided in Table 6. The findings reveal overall certainty of the evidence for outcomes such as serum hemoglobin, serum ferritin, and anemia prevalence was moderate. This implies that the two independent authors believe that the truth or true effect size is probably close to the estimated effect size, and future research for similar outcomes may change the estimate for the effect size. However, the certainty of the evidence for outcomes such as serum iron and transferrin receptor was low, suggesting that further research for these two outcomes will likely change the estimate of the effect.

Table 6 Summary of findings (SoF) table illustrating the summary on certainty of overall evidence for five outcomes using GRADE

Discussion

We conducted this systematic review and meta-analysis to assess the role of iron therapy in reducing anemia in pregnant and non-pregnant women. The review's findings showed that iron supplementation helps to improve markers of iron deficiency anemia, such as serum ferritin and hemoglobin levels. The current systematic review and meta-analysis findings are consistent with previous reviews. Low Yuan et al., for example, conducted a review in 2016 to assess the effectiveness of iron therapy in reducing anemia in menstruating women [29]. The authors demonstrated that daily iron supplementation reduces the burden of anemia and iron deficiency, raises hemoglobin levels, increases iron stores, improves exercise ability, and decreases fatigue. [29]. Likewise, another review conducted five years ago also revealed similar findings, where authors found positive effects of iron on hematological and pregnancy outcomes [30]. However, this review relied on studies with very low-quality evidence [30]. A review by Haider et al., carried out in 2013, showed consistent findings. Haider et al. emphasized the role of iron in improving birth weight rather than intermediate outcomes such as improving anemia or markers of iron deficiency [31]. Regardless of these differences, the analogous findings across these reviews indicate the role of iron in improving hemoglobin and other markers of iron-deficiency anemia among WRA.

The current review and meta-analysis findings are biologically plausible, and several mechanisms regulating iron absorption have been explored in different studies [61,62,63]. More precisely, the findings suggest that daily iron supplementation appears to be an effective intervention to reduce the burden of anemia among WRA. This review's findings complement those of other studies examining the role of iron supplements in reducing anemia in pregnant and non-pregnant women. Based on the results of this review and reasonably comparable findings from other reviews, iron, in any form, for pregnant or non-pregnant women may be beneficial in reducing the burden of maternal anemia. The question arises of how such intervention improves hemoglobin levels. Iron absorption in intestinal cells, followed by iron transfer to bone marrow, muscles, and other tissues, could be one possible answer to the proposed question. Iron is taken up by receptors in these tissues and used for various biological functions or stored [64]. Both animal and human studies reveal an inverse relationship between iron status or stores and the ability to absorb iron from intestinal cells [65]. This implies that an iron-deficient woman has the potential to absorb iron two times more than an iron-non-deficient woman because iron deficiency induces changes in the transport of iron across the intestine [65].

Furthermore, the evidence suggests that an iron-deficient individual should aim to increase hemoglobin concentrations by 1 g/dl every week and be aware of the dietary sources that inhibit iron absorption [29]. Iron absorption from supplements follows the same principles as iron absorption from dietary sources, and consuming vitamin C, meat, and acidic foods increases iron absorption [66]. Tannins, calcium, and phytates reduce the absorption and should not be consumed alongside iron [66]. In addition to the dietary resources, one needs to be aware of the geographical regions before providing iron. For example, health professionals may need to treat malaria in areas with endemic malaria while providing iron therapy to women [67]. Similarly, iron may also be affected by worm infestation. Therefore, healthcare professionals should consider deworming women before prescribing iron therapy to WRA [68]. This suggests that iron therapy may be beneficial when healthcare professionals become cognizant of the facts mentioned above while prescribing iron therapy to a woman.

Evidence from epidemiological studies shows that women from LMICs enter pregnancy with limited iron stores and lower hemoglobin levels than those from high-income countries [69]. Therefore, the demand for iron absorption is higher in women from resource-poor settings with a greater hematological response [70]. In addition, improving iron stores during pregnancy may also decrease the risk of mortality resulting from hemorrhage and may lead to improved hemoglobin and iron levels after pregnancy [9].

Strengths and limitations

This review's main strength is that it provides insights into the effect of iron on a wide range of outcomes, including hemoglobin, serum ferritin, iron, transferrin receptor, and anemia prevalence. In addition, unlike other reviews, we included all studies from LMICs that looked at the role of iron therapy in any form among pregnant and non-pregnant women. In addition, by including only RCTs, the problem of the unknown and unmeasured confounding could be addressed, thereby improving the confidence in the validity of the findings. Additionally, no significant publication bias was found in our meta-analysis, indicating that most trials with positive, negative, or null findings were published in the literature.

However, some inherent limitations of the individual eligible studies need to be considered while interpreting the findings of this review. For example, high-quality studies included in this review and meta-analysis were relatively less (n = 5) because of methodological issues in the randomization methods, no or unclear allocation concealment, and lack of blinding. Moreover, the heterogeneity was found to be very high, which could be explained by factors such as variation in sample size, differences in the follow-up time, differences in the populations (pregnant and non-pregnant women), and substantial variations in the interventions designed (difference in dosages and composition, and duration of interventions). There was a significant variation in the given interventions, for example, daily iron supplementation vs. weekly, once-daily vs. twice daily, oral iron vs. parenteral iron, micronutrient powder and/or iron vs. only iron, fortified snacks in addition to iron vs. placebo, resulting in heterogeneous exposure. In addition, we included studies from 2000 to 2020, which is a relatively long period with a greater degree of variation in the methods of different RCTs. The purpose of including the more extended period was to capture multiple RCTs on the role of iron in improving anemia among WRA. However, considering a longer period may lead to a more significant heterogeneity due to a wide variation in the methods. While it may be challenging to avoid heterogeneity entirely, RCTs can be designed efficiently to assess the role of uniform dose and form of iron and to follow women for the same time in different settings. This will aid in determining the effect of only iron supplements versus a placebo in the control group to isolate the effect of a fixed dose of iron in reducing anemia. Furthermore, although we identified potential secondary outcomes such as iron and transferrin receptors, most RCTs did not measure the secondary outcomes we chose. Hence, the effect size for outcomes sucn as iron and transferrin receptors could not be estimated. Finally, we only included RCTs published in English, limiting our ability to have inferences from studies published in other languages.

Conclusions

This review aimed to determine the effect of iron on hemoglobin levels and anemia in women of reproductive age. Overall, the review found that iron therapy, in any form, increases hemoglobin levels in pregnant and non-pregnant women and reduces iron deficiency anemia, as evidenced by increases in hemoglobin, serum ferritin, and decreased soluble transferrin receptors. The review also revealed that WRA in resource-constrained settings could be given iron in any form. The findings of this systematic review and meta-analysis may help physicians, researchers, and policymakers make informed decisions about providing iron therapy to pregnant and non-pregnant women and prepare them with enough iron stores for adequate fetal growth.

Clinical and research implications

The current review and meta-analysis findings can be used to treat anemia among WRA in LMICs using simple iron therapy in oral or injectable forms, depending on the severity of anemia and a woman's needs. Simple, cost-effective, and culturally appropriate iron and folic acid therapies can be given to women before or during pregnancy to reduce the anemia burden. This improves hemoglobin and iron stores, as evidenced by an increase in ferritin levels, a marker of iron stores.

Although current evidence suggests that iron is important for WRA in LMICs, more research is needed to fill important gaps. For example, none of the eligible RCTs examined the underlying mechanisms by which iron can improve outcomes. There is a need to evaluate the benefits and side effects of iron and adherence to iron intake because there is a dearth of evidence on these outcomes. Further, almost all of the RCTs focused on improving iron status during pregnancy rather than during a critical preconception window when a woman can lay a good foundation for an upcoming baby by eating iron-rich foods or taking iron supplements. Thus, more research is needed in public health settings to assess the role of iron before and during pregnancy on iron markers as well as distal birth outcomes such as birth length and birth weight.

In LMICs, iron supplementation is usually combined with folic acid and/or other micronutrients. As a result, it is unclear how much improvement in anemia can be attributed to iron alone. Thus, more well-designed RCTs are needed to fully understand the efficacy and safety of iron alone in reducing anemia among WRA in LMICs. Finally, secondary outcomes such as serum ferritin, serum transferrin receptor, and transferrin saturation indicate a long-standing and sustained parameter of storage iron, and their depletion can result in iron deficiency. Hence, RCTs should measure these outcomes rigorously after interventions are given for a reasonable time to identify long-term iron stores [71]. Furthermore, serum ferritin, serum transferrin receptor, and transferrin saturation have been shown to be more accurate indicators of iron deficiency anemia. As a result, estimating these parameters will reveal long-term benefits [72].