Background

Sickle cell disease (SCD) is an inherited autosomal recessive haemoglobin (Hb) disorder caused by the replacement of normal Hb (HbA) by mutant Hb (sickle Hb, HbS). In addition to homozygous SCD (SCD-S/S), compound heterozygous forms exist that lead to a variable clinical picture. The 3 most common genotypes of SCD are SCD-S/S, SCD-S/C, and SCD-S/beta0 thalassaemia [1].

SCD-S/S is the most common type of sickle cell disease; copies of the haemoglobin gene are inherited from both parents. SCD-S/C is the second most common type. The HbC gene is inherited from one parent and the HbS gene from the other. People with SCD-S/C have symptoms similar to those with SCD-S/S. SCD-S/beta0 thalassaemia is a combination of the sickle cell mutation and beta-thalassaemia; one parent passes on a sickle cell allele, the other a beta-thalassaemia allele. No HbA is produced.

Other rarer combinations are also known [2]. The production of abnormal sickle-shaped red blood cells causes haemolysis and vaso-occlusion [1]. Symptoms typically start in the first years of life. Common symptoms include pain, infections, and anaemic symptoms. Young children are particularly at high risk of life-threatening complications such as sepsis and acute splenic sequestration. The global prevalence of SCD is estimated at 2.28 per 1000 persons [3]. It varies greatly from region to region and correlates with the prevalence of malaria, as carriers of the HbS mutation are less susceptible to malaria. SCD predominantly occurs in sub-Saharan Africa, as well as in parts of the eastern Mediterranean, the Middle East and India, and has been spread globally by migration [2, 4]. The prevalence estimates per 1000 persons are 10.68 in Africa and 0.07 in Europe [3]. It is estimated that 230,000 children with SCD are born in sub-Saharan Africa per year (0.74% of births), compared with only 1300 children in Europe (< 0.1% of births) [2]. It can be assumed that in European countries, SCD occurs only in descendants from the regions mentioned above.

A US guideline on newborn screening [5], the British National Health Service [6], and the World Health Organization [7] all recommend screening for SCD to ensure that affected newborns receive the necessary care quickly and to reduce mortality and morbidity.

Management strategies aim to avoid the symptoms and complications of triggering factors and vaso-occlusive crises [8,9,10,11,12]. The German guideline of the Consortium of the German Society for Paediatric Oncology and Haematology [13] recommends preventive behavioural measures for SCD management (including awareness of signs of acute complications, see also [8]), prevention of infections (including penicillin prophylaxis and vaccinations, see also [9,10,11,12]), and lifelong, structured monitoring and treatment [13].

SCD can be diagnosed by a blood sample. Biochemical methods such as isoelectric focusing, capillary electrophoresis or high-performance liquid chromatography (HPLC) have previously been used for blood analysis; more recent methods include mass spectroscopy and molecular genetic analysis [2, 14].

The aim of newborn screening using a blood sample is the early detection and management of specific diseases. In the current German newborn screening programme [15], venous or heel blood is collected in the 36th to 72nd hour of life, dripped onto filter paper cards, and examined for several target diseases, which currently do not include SCD. In contrast, newborn screening for SCD has been established in the USA [5], England [16], France [17], Spain [18], Belgium [19], and the Netherlands [20].

This systematic review was conducted to support the decision on whether Germany should launch a national SCD screening programme. However, this work can also serve as a guidance for other health care systems that have not yet established such a programme. The aim of the current article was to systematically review the evidence on the benefit of newborn screening for SCD followed by an earlier start of treatment versus either no newborn screening for SCD or versus newborn screening for SCD (= diagnosis of SCD) not followed by any further measures. The focus of the assessment was on patient-relevant outcomes (mortality, morbidity [e.g. pain, infections, hospitalisations], adverse events, and health-related quality of life).

Methods

Protocol and methodological approach

The review formed part of a health technology assessment (HTA) conducted by the German Institute for Quality and Efficiency in Health Care (Institut für Qualitaet und Wirtschaftlichkeit im Gesundheitswesen, IQWiG). The full (German-language) report and protocol (Commission No. S18-01) are available on the Institute’s website (www.iqwig.de). IQWiG’s responsibilities and methodological approach are described in its methods paper [21]. This review was written according to the PRISMA statement [22] (see Additional file 1). Only previously published studies were used, so an ethical review and patient consent were not required.

Eligibility criteria

The target population comprised newborns preferably investigated in controlled intervention studies of the screening and management strategy. The intervention was screening for SCD, with subsequent initiation of treatment. The timing of the intervention and diagnostic measures were to be comparable to those of the German newborn screening programme, which ranges from the 36th to the 72nd hour of life and uses blood samples taken from a vein or the heel dripped onto filter paper cards. The control intervention was no screening for SCD with subsequent initiation of treatment (either no screening for SCD or screening for SCD [= diagnosis of SCD] not followed by any further measures). The patient-relevant outcomes investigated included overall mortality of children with SCD, morbidity (e.g. pain, infections, and hospitalisations), any adverse events reported, and the health-related quality of life of the child (measured by any validated scale).

Randomised controlled trials (RCTs) were the primary type of evidence to be included in this review. If RCT data were insufficient, non-randomised comparative intervention studies and controlled cohort studies, including retrospective or historical comparisons (comparison of the later intervention group with an earlier, i.e. non-concurrent, control group), were to be considered.

If such studies with direct evidence were not available or were of insufficient quantity or quality, a linked evidence approach was to be used. Merlin et al. defined the full linked evidence approach as “the synthesis of systematically acquired evidence on the accuracy of a medical test, its impact on clinical decision making and the effectiveness of consequent treatment options” and presented several variations and abridged approaches [23]. Here, linked evidence from studies on diagnostic accuracy was to be used, together with controlled intervention studies investigating the benefit and harm of an earlier start of treatment for newborns with SCD. Due to the fact that direct evidence was identified and considered sufficient to draw conclusions (at least on the weakest level according to IQWiG’s grading system, see Table 1), the linked evidence approach is not further described here.

Table 1 Certainty of conclusions regularly inferred for different evidence situations if studies with the same qualitative certainty of results are available according to IQWiG methods

There was no restriction to the duration of the studies, the length of follow-up, and settings.

Search strategy and study selection

We searched for relevant primary studies and secondary publications (systematic reviews and HTA reports) in MEDLINE via PubMed and via Ovid (1946 to July 2020), EMBASE (1974 to July 2020), and the Cochrane Central Register of Controlled Trials (July 2020) as well as trial registries (ClinicalTrials.gov and EU Clinical Trials Register). The Cochrane Database of Systematic Reviews (Cochrane Reviews), MEDLINE, EMBASE, and the Health Technology Assessment Database (Technology Assessments) were searched for relevant systematic reviews. Reference lists of relevant systematic reviews were screened for additional relevant primary studies. There were no restrictions on language or date of publication. Conference abstracts were not eligible for inclusion. The search strategies, which were developed by one information specialist and checked by another, are presented in Additional file 2. After conducting the search, duplicates were removed using the reference management software Endnote X9 (by Clarivate, Philadelphia, USA). The remaining references were then screened with the in-house web trial selection database (webTSDB).

Two reviewers independently screened titles and abstracts of the citations retrieved to identify potentially eligible primary and secondary publications. The full texts of these articles were obtained and independently evaluated by the same two reviewers applying the full set of inclusion and exclusion criteria. Disagreements were resolved by consensus.

Data extraction

The individual steps of the data extraction and risk-of-bias assessment procedures were always conducted by one person and checked by another; disagreements were resolved by consensus. Details of the studies were extracted using standardized tables developed and routinely used by IQWiG.

We extracted information on study characteristics from each study considered (including study design, sample size, place, and period in which the study was conducted, characteristics of the intervention group, characteristics of the control group, main inclusion and exclusion criteria, and patient-relevant outcomes).

Assessment of risk of bias

We assessed the risk of bias for individual studies, as well as for each outcome, and rated these risks as “high” or “low”.

For controlled intervention studies, the risk of bias was assessed by determining the adequacy of the following quality criteria, which closely follow the criteria of the Cochrane risk-of-bias tool [24]: treatment groups were studied in parallel, comparability of groups, blinding of participants and investigators, selective outcome reporting, absence of other factors potentially causing bias, and overall risk of bias (study level). If the risk of bias on the study level was rated as “high”, the risk of bias on the outcome level was also generally rated as “high”. Study results were only considered in the analyses if at least 70% of the study participants had been evaluated.

Data analysis

We reported the treatment effects as odds ratios (ORs) for binary outcomes, including 95% confidence intervals (CIs) and p values (Wald test).

Sensitivity analyses were planned in cases where the effect observed might not have been robustly estimated or unreliable due to methodological factors (e.g. algorithms for replacing missing values, choice of cut-offs for defining outcome variables or length of follow-up when outcomes were measured at several points in time). We planned subgroup analyses to examine the impact of variables such as sex, age, or type of diagnostic test on the effect of the intervention.

All calculations were performed with the statistical software R6.3.6 [25].

Assessment of certainty of evidence

Using the IQWiG methods [21], we graded the results of the analysis into different categories with regard to the respective qualitative certainty of the conclusions (see Table 1). The data provide either “proof” (highest certainty of conclusions), an “indication” (medium certainty of conclusions), or a “hint” (weakest certainty of conclusions) in respect of the benefit or harm of an intervention—or none of these 3 categories apply.

Results

Literature search

Only one controlled intervention study of the screening and management strategy was identified: King et al. [26]. The flowchart of study selection is presented in Fig. 1. Because of the direct evidence available in this study, we dispensed with a search for linked evidence. A list of excluded full texts with reasons for exclusion is presented in Additional file 3.

Fig. 1
figure 1

Flowchart of study selection for direct evidence

Study characteristics

King et al. [26] was a retrospective study evaluating the screening programme for SCD in Jamaica. The intervention group included 395 SCD-S/S-positive infants who had been identified by screening (followed by preventive measures) between 1995 and 2006. Management of SCD in the intervention group included prevention of infections (e.g. penicillin prophylaxis) and education of parents (e.g. how to perform splenic palpation). The control group were participants in a historical cohort observation study initiated to provide information on the course of SCD, including 105 SCD-S/S-positive infants identified by screening between 1973 and 1975 (Jamaican Sickle Cell Cohort Study [27, 28]). These infants remained untreated, as no established treatment for SCD was available at that time. Further characteristics of the study are shown in Table 2.

Table 2 Study characteristics

Risk of bias

King et al.’s study [26] was rated as having a high risk of bias (Table 3). This was due to the historical control group and the lack of control for confounding factors. As the data were collected retrospectively, a lack of blinding was assumed. No baseline data on the newborns in the intervention and control group were available, but since they were born in the same region, comparability was assumed. The high risk of bias at the study level had a direct impact on the risk of bias at the outcome level. The risk of bias for the outcome “mortality” was thus also rated as high.

Table 3 Risk of bias in the study included (King et al. [26])

Effects of newborn screening for SCD (Table 4)

Within the first year of life, 0.01% of newborns with SCD-S/S died in the intervention group, versus 0.10% in the control group. There was a clear difference between groups at 1 year with an OR of 0.09 (95% CI [0.03; 0.30]) and a p value of p < 0.001. This means that for the intervention group (cohort 1995–2006), the odds of dying were reduced to as little as 9% of the odds for the control group (cohort 1973–1975).

Table 4 Results on mortality

The analyses of children up to the age of 5 showed consistent effects. The data showed a mortality rate of 0.01% at 2 and 3 years in the intervention group; the corresponding rate was 0.14% (at 2 years) and 0.17% (at 3 years) in the control group; the ORs were 0.06 (95% CI [0.02; 0.20]) at 2 years and 0.05 (95% CI [0.02; 0.15]) at 3 years, with p < 0.001 in both cases. The results at 5 years also showed a very large effect, with a mortality rate of 0.02% versus 0.19%; the OR was 0.09 (95% CI [0.04; 0.22]) with p < 0.001. The results for mortality at 10 years were not statistically significant (0.09% versus 0.23%; OR 0.33, 95% CI [0.07; 1.64], p = 0.176). As the estimated effect of the intervention was very large and consistent over several years of observation, no further sensitivity analyses were needed to ensure its robustness.

No suitable data were available for subgroup analyses.

As the mortality in children decreased by a factor of about 10 in the first 5 years of life, there was a very large difference between the intervention and the control group, which is unlikely to be explained by bias alone.

Discussion

The results of this systematic review are based on a single, retrospective, historically controlled study evaluating a screening programme for SCD in Jamaica (King et al. [26]). It shows a decrease in mortality by a factor of 10 through screening for SCD followed by preventive measures. In our opinion, this large reduction in mortality may be explained only in part by bias associated with the comparison of non-concurrent groups in different decades or potential confounding factors, such as medical and technological progress. We therefore concluded that, in comparison with no screening, the data provide a hint of a benefit in favour of newborn screening for SCD. The sample size appears sufficient to draw this conclusion, as the precision of estimates is acceptable.

Consideration of linked evidence in addition to direct evidence would not have added much information to this review question. At best, i.e. if supporting evidence existed, the certainty of conclusions would still have remained relatively weak, as linked evidence studies cannot increase the reliability of direct evidence studies. If conflicting results from linked evidence existed, the direct evidence would still in most cases be regarded as more convincing, as linking evidence increases uncertainty [23].

The test procedure used in King et al.’s study [26] no longer complies with the latest laboratory standards. Established diagnostic test procedures for SCD are currently available: The results of test-positive newborns screened using tandem mass spectrometry or HPLC, for example, showed that they were actually affected by SCD, since no false-positive results were reported in selected studies (e.g. [29, 30]). Since the factors influencing test accuracy may vary, essential factors for the widespread use of such screening programmes are the training of laboratory personnel, laboratory standards, and the minimum amount of blood per sample to be analysed. The Jamaican screening programme includes prevention of infections and education of parents. These measures (e.g. penicillin prophylaxis) are also recommended as essential components of the early management of newborns with SCD in clinical guidelines in Western industrial countries [6, 13, 14, 31, 32], although in some countries, e.g. Germany, penicillin is not approved as a preventive measure for asymptomatic children with SCD. In these countries, as well as in Jamaica, the treatment of acute complications—currently and at the time of the study—includes immediate antibiotic treatment for fever and/or an enlarged spleen and blood transfusions for low Hb levels. It can therefore be assumed that the findings of King et al. [26] can be applied both to German and other healthcare systems.

This review excluded screening studies comparing a cohort of children screened and treated after birth with a cohort of children treated as soon as symptoms appeared [8, 33, 34]. This is because it can be assumed that in the latter cohort, children may be missed who died of SCD without being diagnosed; robust conclusions on SCD-related mortality cannot therefore be drawn. However, the data reported in these studies are in line with the decrease in mortality reported in King et al. [26], although the differences between the intervention and control groups are less pronounced. This is plausible, as the potential undetected SCD-related early deaths in the control groups may lead to an underestimation of the survival benefit in children with SCD detected by screening versus children with SCD identified after showing clinical symptoms.

As early as 1986, an RCT [12] showed that penicillin prophylaxis in asymptomatic children with SCD can reduce mortality by reducing the rate of infections. This also supports our conclusion that newborn screening for SCD lowers mortality. Against this background, the conduct of future RCTs to evaluate the benefits of newborn screening for SCD seems unethical and unlikely.

In April 2017, a consensus meeting of European experts (EuroBloodNet) was held, which discussed the organisation and methods of screening for SCD and made recommendations for implementation [35]. For example, the experts discussed which laboratory methods are appropriate (HPLC, capillary electrophoresis, isoelectric focussing, and tandem mass spectrometry) or what the recommended procedure should be after a positive screening result (e.g. confirming the positive screening result with another method using a second sample or re-testing with the same method) [35]. They also noted a great variety of screening and management strategies across European countries and that data on affected newborns were lacking for some regions. On the one hand, future research should therefore focus on regional SCD prevalence rates. On the other, screening and managing strategies should be evaluated to gain information on which strategies are the most suitable for the local healthcare systems and demographics.

Recommendations in the literature

Our literature search identified 4 systematic reviews on newborn screening for SCD: one Cochrane review published in 2000 [36], 2 from the Galician Agency for Health Technology Assessment (original report from 2004 and update from 2013) [37, 38], and one from the Canadian Institute of Health Economics published in 2006 [32]. No review identified high-level evidence (i.e. RCTs) evaluating newborn screening for SCD but also included the study King et al. [26]. All note that no robust evidence to support newborn screening for SCD is available, but nevertheless mostly recommend this intervention.

Limitations

Our results are based on a single, retrospective, historically controlled study in Jamaica with a high risk of bias (King et al. [26]). We therefore concluded that, in comparison with no screening, the data provide only a hint (i.e. the weakest certainty of conclusions according to IQWiG’s grading system) of a benefit in favour of newborn screening for SCD.

The control group in King et al.’s study [26] was also identified by newborn screening for SCD. Since no further specific preventive measures followed after diagnosis, the natural course of disease was investigated and this cohort was considered to be a suitable control group for the present review. Furthermore, only the test-positive newborns were followed up, which was plausible from an economical and practical perspective, even though the follow-up of all tested newborns would have resulted in a more precise data basis.

Conclusion

The decrease in mortality by a factor of 10 is unlikely to be explained alone by the high risk of bias in the single retrospective study considered in our systematic review (largely due to the inclusion of historical controls). Therefore, in terms of mortality, the data suggest a benefit of newborn screening for SCD (followed by preventive measures) versus no newborn screening for SCD (weak certainty of conclusions).