Introduction

Dental caries affects 2.4 billion adults and 621 million children worldwide (Kassebaum et al. 2015). Lancet’s Global Burden of Disease (GBD) assessment generally demonstrated oral diseases being one major challenge for global public health (Peres et al. 2019). In primary teeth, untreated dental caries is estimated to cause costs of > $532 Mio, with richer countries showing a significantly lower prevalence (Vernazza et al. 2021). The main daily work of dentists is still the restoration of carious teeth and the replacement of pre-existing restorations (Dos Santos Pinto et al. 2014; Franzon et al. 2015; Hübel and Mejare 2003; Kavvadia et al. 2004). Facing the global needs, there is a general urgent need for the treatment of primary teeth caries (Peres et al. 2019; Vernazza et al. 2021).

Traditionally, dental amalgam has had its place in paediatric dentistry worldwide (Dutta et al. 2001; Hse and Wei 1997; Sengul and Gurbuz 2015). In 2017, the parties to the Minamata Convention on mercury agreed to phase down the use of dental amalgam, especially in the dental treatment of vulnerable groups such as children, pregnant, or breastfeeding women to reduce their exposure to mercury (Minamata Convention on Mercury 2013). Another important aspect is that for both cement lining and amalgam, there may not be enough space in primary molars due to large pulps. Following these reasons, a clear paradigm shift in favour of minimally invasive preparation and restoration is clearly seen (Attin et al. 2001; Hse and Wei 1997). Consequently, adhesive restorations placed in primary dentition revealed the best clinical performance; however, their use still is time-consuming and considerably technique-sensitive (Laske et al. 2016; Opdam et al. 2014; van de Sande et al. 2013). Hence, in most of the studies considerable amounts of failures have been reported, most of them related to secondary caries (Laske et al. 2016; Opdam et al. 2014). Effective restoration of primary teeth is dependent on several factors, i.e., the compliance of the child, operator skills in both behavioural and technical management, and finally the individual properties of the chosen restorative biomaterial—in decreasing order of clinical importance (Chisini et al. 2018). Furthermore, it is evident that lower age correlates with lower restoration survival of primary tooth restorations (Chisini et al. 2018).

In relation to these patient behaviour management problems, operator skill related factors, and material properties, the ideal restorative material for primary teeth has not been found so far: Amalgam (A) is characterised by low technique-sensitivity but quite aggressive preparation requirements without actually sufficient space for this, since primary teeth are smaller and pulp is closer to the outer surface (Daou et al. 2009; Hilgert et al. 2014). Resin-based composites (CR) are minimally invasive and quite easily bonded when recent universal adhesives are employed. However, they are still considerably technique-sensitive, related to both operator’s abilities and contamination issues (Casagrande et al. 2013; Cavalheiro et al. 2020). Glass-ionomer cements (GIC) are true bulk-fill materials and, therefore, favourable per se, but GIC require undercuts such as amalgam as well and they are prone to fracture due to inferior flexural strength and fatigue characteristics (Espelid et al. 1999; Kilpatrick et al. 1995; Krämer and Frankenberger 2001). Resin-modified glass-ionomer cements (RMGIC) exhibit significantly higher flexural strengths compared to conventional GICs but they also require undercuts; their inferior wear resistance seems to be uncritical for primary teeth (Hübel and Mejare 2003; Kotsanos and Arizos 2011). Last but not least, preformed metal crowns (PMC) offer an overall good clinical performance, on the other hand they are mainly indicated for extensive carious defects in primary teeth and for cuspal coverage/better coronal seal post endodontic treatment, or the Hall-technique. Their aesthetics really are a major concern when dealing with parents in daily dental practice (Donly et al. 2020; Hutcheson et al. 2012).

The aim of the present study was to systematically review the clinical effectiveness of different restorative materials including new biomaterials for the restoration of carious primary teeth. Special focus was placed to evaluate the different failure modes related to material characteristics.

Methods

Protocol and registry

The present systematic review was conducted according to the updated PRISMA guidelines (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) (Page et al. 2021). The review protocol was registered before review commencement in PROSPERO international prospective register of systematic reviews hosted by the National Institute for Health Research (NIHR), University of York, UK, Centre for Reviews and Dissemination (CRD42020221944).

Review’s focused question

What is the clinical effectiveness of restorative materials including new biomaterials used for the restoration of carious primary teeth with a follow-up of at least 12 months?

PICO(S) construct

Population: Children (regardless of sex and age) with primary, previously unrestored carious lesions in primary teeth, which were treated with a restorative approach.

Intervention: (i) Randomised controlled clinical trials (RCTs) comparing different techniques and degrees of caries removal (selective vs. complete caries removal) in combination with the same/different restorative material(s) (adhesive/compomer, adhesive/composite, glass-ionomer cement (GIC), resin-modified glass-ionomer cement (RMGIC), metal-reinforced glass-ionomer cement (MRGIC), bio-active materials (BM), amalgam, preformed metal/zirconia/composite crowns) placed as restorations in primary dentition. (ii) RCTs comparing the same approach for caries removal combined with different restorative materials (adhesive/compomer, adhesive/composite, glass-ionomer cement (GIC), resin-modified glass-ionomer cement (RMGIC), metal-reinforced glass-ionomer cement (MRGIC), bio-active materials (BM), amalgam, preformed metal/zirconia/composite crowns) placed as restorations in primary dentition.

Comparison(s): Conventional restorative approach using another technique or degree of caries removal and/or another restorative material to restore carious lesions in primary teeth.

Outcome: The primary outcomes for the clinical effectiveness of restorations in primary teeth included (i) treatment failure evaluated according to the modified USPHS criteria (Krämer et al. 1999; Roulet 1994; Ryge and Snyder 1973), and (ii) restoration quality assessed according to the following criteria: surface roughness, colour match, marginal integrity, integrity (tooth/filling), proximal contact, change of sensitivity, hypersensitivity and radiographic assessment. To assess the failure of crowns, the following outcome criteria needed to be described: Modified USPHS criteria (restoration failure, proximal contact, marginal integrity, occlusion, secondary caries (Alaki et al. 2020)) or outcome criteria such as success/major failure/minor failure (Santamaria et al. 2017).

Secondary outcomes were: (i) time until restoration failure/re-treatment, (ii) discomfort during restorative treatment or within 24 h after treatment, (iii) patient’s and/or carer’s perceptions of the restorative treatment, (iv) impact of the following factors on the clinical effectiveness of the restorative treatment: Preoperative radiograph, caries lesion depth, affected tooth surface(s), extent of carious removal, type of tooth, isolation technique, type of adhesive, type restorative material.

Study design: Randomised controlled clinical trials (RCTs).

Inclusion criteria

The following criteria of RCTs needed to be fulfilled for inclusion:

  • Children with carious lesions in primary teeth extending into dentine and requiring intervention, preferably with a description of the lesion depth.

  • Vital, asymptomatic teeth with no history of pain, pulp exposure, infection, swelling and no evidence of furcal/periapical inflammation.

  • Restorative treatment with different restorative materials or techniques and degrees of caries removal (selective or complete removal of carious tissue).

  • Clinical follow-up of at least 12 months and a minimum of 40 restorations per group (Chisini et al. 2018).

  • Evaluation of any of the following outcomes: Treatment failure (i. e. according to the modified USPHS criteria) or assessment of restoration quality (surface roughness, colour match, marginal integrity, integrity (tooth), integrity (filling), proximal contact, change of sensitivity, hypersensitivity, and radiographic assessment).

Exclusion criteria

The following exclusion criteria were applied:

  • Any publication not fulfilling the criteria mentioned above or not related to the aim and outcomes of the current systematic review.

  • Studies that were not RCTs.

  • Studies conducted in permanent teeth.

  • Studies with teeth treated by vital and non-vital pulp therapy.

  • Studies with a drop-out rate > 30% (Tedesco et al. 2017).

Search strategy

The literature search was performed by one experienced researcher (DK) screening the following 4 electronic databases on December 28, 2020: MEDLINE (PubMed), EMBASE (via Ovid), Cochrane Library, and LILACS. The search strategy was adjusted to the requirements of each electronic database. Hand search was carried out to find additional studies that had not been identified during electronic database search; reference lists of included studies and of systematic reviews on restorative treatment in primary dentition were screened. No restrictions were applied to language or publication year. The search strategies of all electronic databases are presented in Appendix 1.

Study selection

Two reviewers (CB, SA) carried out study selection independently and in duplicate in the following stages:

  1. 1.

    Initial screening of titles and abstracts of potential papers according to the inclusion criteria by two reviewers (CB, SA), resulting in a database for which full-text articles were retrieved. Rayyan QCRI application was used for the initial filtering of the studies (Ouzzani et al. 2016).

  2. 2.

    Screening by two reviewers (CB, SA) of the full-text papers identified as possibly relevant to the question of the review. Manuscripts of studies published several times were excluded if they presented identical results or data that were not within the scope of this systematic review. Only the latest published manuscript presenting relevant outcomes was included. In case of studies reporting relevant outcomes in several manuscripts, all of them were assessed for eligibility.

  3. 3.

    Handsearching by two reviewers (NNL, SA) in the reference lists of included papers and systematic reviews on restorative treatment in primary dentition for any missed potential papers.

Disagreements between reviewers in any stage were resolved by consensus-based discussion. Reviewers were blinded neither to author names and study sites, nor to the results of the included RCTs. A record of the study selection procedure was kept.

Data collection

Full-text articles were obtained from or ordered through the University of Giessen library for all the initially included studies. Data from the finally selected RCTs were extracted from the full-text papers and entered into an Excel table for further analysis by two independent reviewers (CB, SA). The reviewers performed a calibration training using the first 10 studies of data extraction and risk of bias assessment. For each selected trial the following data were recorded:

  • Authors, title, year of publication

  • Study design (split-mouth or parallel group), country of origin

  • Setting of the clinical study, follow-up intervals

  • Sample size at baseline and at each follow-up > 12 months, age of participants, sex, inclusion and exclusion criteria

  • Detailed description of interventions, techniques and materials used (caries risk, type of tooth, cavity class, type of anaesthesia, type of isolation, technique of caries removal, restorative technique)

  • Primary and secondary outcomes (clinical and radiographic)

  • Results/Findings (success scores, failure scores, reasons for failure)

  • Study conclusions

  • Notes (ethical approval, funding, trial registration, sample size calculation, informed consent, number and calibration of operator(s) and outcome assessor(s), randomisation technique, drop-out, limitations)

Quality assessment of the included studies

The quality assessment of the selected RCTs was performed using the Cochrane risk-of-bias tool for randomized trials version 2 (Sterne et al. 2019). It was conducted independently by two reviewers (CB, SA) without blinding the name of authors, institutions and journals. The aim of the evaluation was to assess the effect of assignment to the intervention (the intention-to-treat effect). Disagreements between reviewers were resolved by consensus-based discussion. A third reviewer (DK) was consulted in cases, where consensus was not reached.

Data analysis

Meta-analyses were planned to be computed if there were studies of similar comparisons reporting the same outcomes. Due to the considerable clinical and methodological heterogeneity among included studies, meta-analyses were not performed.

Calculation of failure rates

Failure rates and annual failure rates (AFR) reported by trial authors were extracted. If Kaplan–Meier statistics had been used by the study authors to estimate survival rates of restorations, failure rates were calculated based on the data presented for survival analysis. In all other cases, parameters of reported evaluation criteria (Frencken 2009; Hickel et al. 2007; Krämer et al. 1999; Roulet 1994; Ryge and Snyder 1973) were transferred into dichotomous data (acceptable/unacceptable clinical performance of restorations) to calculate failure rates (Table 1 according to Dias et al. 2018).

Table 1 Dichotomy of the evaluation criteria applied in the included RCTs

The AFR was computed by division of the failure rate by years of follow-up and according to a formula presented by Opdam et al. (2014):

\(\left( {{1}{-}{\text{y}}} \right)^{\text{z}}\, = \,({1}{-}{\text{x}})\).

x = total failure rate at ‘z’ years.

y = mean AFR.

In addition, reasons for failure of restorations (secondary caries, marginal adaptation, fracture/anatomical form, and loss of retention) were extracted.

Results

Selection of the studies

Based on the selection criteria, 1,676 articles were identified through database screening and 9 additional papers were identified through other sources. Among these records, 629 duplicates were removed. Another 845 records were excluded, because the title and/or the abstract did not fulfil the inclusion criteria. 211 full-text articles were assessed for eligibility. The reasons for exclusion of 182 full-text articles are presented in Fig. 1. Thus, 29 articles remained for qualitative synthesis and no record was included in the quantitative analysis (Fig. 1).

Fig. 1
figure 1

PRISMA flow diagram presenting the study selection process, the numbers of studies identified, eligible, and included in the systematic review. PFS pit and fissure sealing, RCT randomised clinial trial

Characteristics of included studies

The characteristics of the 29 randomised controlled clinical trials included in this systematic review are listed in Tables 3 and 4. Among the 29 RCTs, 27 studies included primary molars, 1 study evaluated the clinical performance of crowns in primary incisors (Alaki et al. 2020), and one study included anterior and posterior primary teeth (Moura et al. 2020). Studies included between 25 and 568 children aged 1.5–11 years. All in all, 5,892 teeth distributed among 2,927 children received a restorative treatment. The follow-up period of the restorations ranged from 12 to 60 months. Seventeen studies were conducted in split-mouth design and 12 studies used a parallel group design. Seventeen of the studies were carried out in universities or dental schools, seven in a school setting, three in private dental practices, and in 2 cases the clinical setting was not reported. Nineteen papers evaluated the restoration quality by modified USPHS or Ryge criteria (Krämer and Frankenberger 2001; Krämer et al. 1999; Roulet 1994; Ryge and Snyder 1973), two used the FDI criteria (Hickel et al. 2010; Hickel et al. 2007), and 7 the criteria for ART restorations (Frencken 2009). Thirteen studies included Class-I restorations; 21 studies focused on Class-II restorations. In two trials, authors reported about crowns as restoration on vital primary teeth. In 14 studies cotton rolls were used for isolation, in 9 trials rubber dam was applied, 1 trial had a combination of both isolation methods, and in 7 papers the isolation technique was not reported. Information about the procedure of caries removal was very limited, since 19 studies gave some information on caries removal, being partial, complete or ART. Studies with ART were separately examined. Limited data was the reason why we did not further evaluate this parameter. All in all, the study designs, the treatment protocols, and the outcomes were described heterogeneously. Tables 3 and 4 list the results split by the different restorative materials or techniques used in the included RCTs.

Table 2 Quality assessment for the potential risk of bias in included studies
Table 3 RCTs with at least 1-year follow-up evaluating restorations on primary teeth: Study design, country, clinical setting, funding, restorative material under investigation, and inclusion criteria of included studies

Quality assessment of the included studies

Twenty-five included RCTs were of an overall high risk of bias. For the remaining 4 studies, the overall risk of bias was unclear. The main reasons for that were: the randomisation process (69% of the trials), missing outcome data (59%) or measurement of the outcome (52%) (Fig. 2 and Table 2). Most of the studies that were marked as high risk due to the randomisation process, had no information on allocation concealment.

Fig. 2
figure 2

Quality assessment for the potential risk of bias

Bias arising from the randomisation process

Twenty-one studies were rated as having a high risk of selection bias based on inadequate information provided about the randomisation process and/or the allocation sequence concealment. The remaining 8 studies were of low risk of bias (Alaki et al. 2020; Cavalheiro et al. 2020; Dermata et al. 2018; Hesse et al. 2016; Lenzi et al. 2017; Moura et al. 2020; Olegario et al. 2019, 2020). A computer-generated randomisation was applied in several studies (Alaki et al. 2020; Cavalheiro et al. 2020; Dermata et al. 2018; Lenzi et al. 2017; Olegario et al. 2019, 2020). Opaque, sealed envelopes (Alaki et al. 2020; Cavalheiro et al. 2020; Moura et al. 2020; Olegario et al. 2020) or a central allocation (Dermata et al. 2018; Lenzi et al. 2017; Olegario et al. 2019) were used to conceal the randomisation sequence.

Bias due to deviations from the intended interventions

In many studies, blinding of participants and trial personnel appealed to be not feasible due to the different materials used for the restoration of primary teeth. Two studies (Faustino-Silva and Figueiredo 2019; Olegario et al. 2020) were graded as low risk of bias. For the other 27 studies, some concerns persisted regarding performance bias, and therefore, these studies were rated with an unclear risk.

Bias due to missing outcome data

Seventeen studies exhibited a high risk of bias for missing outcome data (58.6%) because of drop-out rates exceeding 10%. In 2 studies, there were some concerns about bias due to missing outcome data (Olegario et al. 2019, 2020). Given the drop-out rates of < 10% in the remaining 10 studies, they were judged as having a low risk of attrition bias (Alaki et al. 2020; Alves dos Santos et al. 2010; Deepa and Shobha 2010; Ehlers et al. 2019; El-Housseiny et al. 2019; Hesse et al. 2016; Hse and Wei 1997; Kilpatrick et al. 1995; Kupietzky et al. 2019; Lenzi et al. 2017).

Bias in measurement of the outcome

Given the fact that several included studies compared the clinical performance of different restorative materials or that operator and outcome assessor were the same person, 55.2% (n = 16) of studies were rated as being of high risk of detection bias. In 44.8% (n = 13) of published studies, the outcome assessors were adequately blinded to the intervention.

Bias in selection of the reported results

None of the studies were at high risk of bias in selection of the reported results. In 48.3% (n = 14) of studies the risk of bias was rated as “unclear” because of incomplete reporting of results for pre-specified outcome criteria (Alaki et al. 2020; Deepa and Shobha 2010; Dermata et al. 2018; El-Housseiny et al. 2019; Ertugrul et al. 2010; Gok Baba et al. 2021; Hesse et al. 2016; Hse and Wei 1997; Konde et al. 2012; Lenzi et al. 2017; Moura et al. 2020; Olegario et al. 2019, 2020; Welbury et al. 1991). The rest of the included studies reported the results completely (51.7%, n = 15).

Reported outcomes for the restorative materials

Amalgam

Tables 3 and 4 show the reported outcomes for the restorative materials including four trials that investigated the use of amalgam in primary teeth (Barr-Agholme et al. 1991; Dutta et al. 2001; Kavvadia et al. 2004; Welbury et al. 1991). The studies compared amalgam with composite resin (Barr-Agholme et al. 1991), compomer (Kavvadia et al. 2004), GIC (Welbury et al. 1991) and RMGIC (Dutta et al. 2001). The four studies used the USPHS criteria or Ryge’s criteria to evaluate clinical performance. The follow-up period ranged from 1 to 5 years. There was a high variation of annual failure rates for amalgam ranging from 1 to 28%. The study by Kavvadia et al. (2004) was the only one using rubber dam and reported the lowest AFR of 1% (Kavvadia et al. 2004). Three studies reported on secondary caries ranging from 9 to 12.5%, two studies on marginal adaptation ranging from 17 to 32%, and one study on loss of retention of 2–3.4%.

Given the overall high risk of bias among the included studies, the results for amalgam restorations are based on a low quality of evidence.

Glass-ionomer cements

The clinical performance of conventional (GIC), metal-reinforced (MRGIC), resin-modified (RMGIC), or high-viscosity glass-ionomer cements (HVGIC) was assessed in primary dentition in 9 included randomised controlled clinical trials (Akman and Tosun 2020; Alves dos Santos et al. 2010; Dermata et al. 2018; Dutta et al. 2001; El-Housseiny et al. 2019; Gok Baba et al. 2021; Kilpatrick et al. 1995; Kupietzky et al. 2019; Welbury et al. 1991). The comparators chosen were amalgam (Dutta et al. 2001; Welbury et al. 1991), compomer (Alves dos Santos et al. 2010; Gok Baba et al. 2021), composite resin (Akman and Tosun 2020; Alves dos Santos et al. 2010; Dermata et al. 2018; El-Housseiny et al. 2019; Kupietzky et al. 2019), bulk-fill composite resin (Akman and Tosun 2020), another type of glass-ionomer cement (Gok Baba et al. 2021; Kilpatrick et al. 1995), and glass carbomer (El-Housseiny et al. 2019). One trial solely compared glass-ionomer cements (Kilpatrick et al. 1995). If reported, isolation was achieved by use of cotton rolls in 3 studies (Akman and Tosun 2020; Gok Baba et al. 2021; Welbury et al. 1991) or by rubber dam in 4 studies (Alves dos Santos et al. 2010; Dermata et al. 2018; El-Housseiny et al. 2019; Kupietzky et al. 2019). The follow-up of included studies ranged from 1 to 5 years. Most commonly modified USPHS criteria (8 trials) were used to evaluate the clinical performance. Kupietzky et al. (2019) described their own clinical and radiographic evaluation criteria and assessed the clinical performance by means of photographs and radiographs (Kupietzky et al. 2019). The annual failure rates were 7.6–16.6% for GIC, 29.9% for MRGIC, 1.9–16.9% for RMGIC, and 2.9–25.6% for HVGIC. Reported reasons for failure of glass-ionomer cements were loss of retention (2% (per protocol analysis) up to 25.6%), fractures (0–15.2%), marginal adaptation (0–13.5%), and secondary caries (0–7%). In comparison to amalgam, GIC exhibited a higher (Welbury et al. 1991) and RMGIC a lower failure rate (Dutta et al. 2001). RMGIC achieved similar or slightly higher failure rates compared to compomer and composite resin restorations (Alves dos Santos et al. 2010; Dermata et al. 2018; El-Housseiny et al. 2019). The results for HVGIC showed a high variation. Whereas Akman & Tosun (2020) reported an acceptable clinical performance of HVGIC for Class-II restorations after 1 year (Akman and Tosun 2020), Class-II restorations with compomer and composite resin outperformed HVGIC in two other trials after 1–3 years (Gok Baba et al. 2021; Kupietzky et al. 2019). MRGIC presented the highest failure rate in Class-II restorations with loss of anatomical form and marginal integrity over time (Kilpatrick et al. 1995).

All studies are at high risk of bias, which results in a low quality of evidence for these findings on the restorative treatment of carious primary teeth with various glass-ionomer cements.

Glass carbomer

Only one included clinical study (El-Housseiny et al. 2019) reported on glass carbomer restorations after drilling in primary teeth. Failure rates of glass carbomer were extremely high (up to 46.2% AFR) compared to composite resin (2.1% AFR) or RMGIC (up to 7.5% AFR). Main reasons for failure were marginal adaptation and anatomical form (fracture).

The findings from the single study evaluating the use of glass carbomer in Class-II cavities of primary molars are based on evidence of low quality.

Compomer

Seven of the included randomised controlled clinical trials investigated the clinical effectiveness of compomer restorations in primary teeth (Alves dos Santos et al. 2010; Ehlers et al. 2019; Ertugrul et al. 2010; Gok Baba et al. 2021; Hse and Wei 1997; Kavvadia et al. 2004; Santamaria et al. 2017). The following restorative materials were selected as comparators in these trials: amalgam (Kavvadia et al. 2004), RMGIC (Alves dos Santos et al. 2010), HVGIC (Gok Baba et al. 2021), another compomer (Ertugrul et al. 2010), composite resin (Alves dos Santos et al. 2010; Hse and Wei 1997), bulk-fill composite resin (Ehlers et al. 2019), and alternative caries management approaches (Santamaria et al. 2017). In the majority of trials, Class-II restorations were examined. Rubber dam was used to isolate the teeth during restorative treatment in 3 trials (Alves dos Santos et al. 2010; Hse and Wei 1997; Kavvadia et al. 2004) and cotton rolls with saliva ejector were chosen in another 2 trials (Ertugrul et al. 2010; Gok Baba et al. 2021). Again, the clinical performance of restorations was most commonly assessed by modified USPHS criteria (5 trials). In one trial, FDI criteria were chosen (Ehlers et al. 2019). In another trial, the categories “success”, “minor failure” and “major failure” were described for outcome assessment (Santamaria et al. 2017). The annual failure rate of compomer restorations varied from 0 to 14.7%. In primary teeth, compomer restorations exhibited a similar clinical performance compared to amalgam (Kavvadia et al. 2004), RMGIC and composite resin (Alves dos Santos et al. 2010), composite resin (Hse and Wei 1997), and a flowable bulk-fill composite resin applied without cover layer (Ehlers et al. 2019). In comparison to two HVGICs, the compomer achieved more favourable retention rates (AFR 3.6% vs. 13.4–25.6%) (Gok Baba et al. 2021). One trial including a tooth coloured and a coloured compomer found comparable survival rates after 1 year for both materials in Class-II restorations (Ertugrul et al. 2010). Overall, restorations failed due to secondary caries (0–15%) and loss of retention (1.7–3.6%). One trial reported a combination of fracture and loss of retention in 9% of cases (Santamaria et al. 2017).

The results for compomers are based on studies with high risk of bias leading to a low quality of evidence.

Composite resin

Clinical performance of composite resins in primary teeth was reported in 11 of the included studies, compared to compomer in 3 studies (Alves dos Santos et al. 2010; Ehlers et al. 2019; Hse and Wei 1997), to GIC or RMGIC in 5 studies (Akman and Tosun 2020; Alves dos Santos et al. 2010; Dermata et al. 2018; El-Housseiny et al. 2019; Kupietzky et al. 2019), to amalgam in 1 study (Barr-Agholme et al. 1991), and to glass carbomer in another study (El-Housseiny et al. 2019). Three studies included only composite resins in their experimental groups, testing parameters of the application technique. Therefore, the variables were: (i) dentine etching time 15 s vs. 7 s, (ii) etch-and-rinse vs. self-etch adhesion, (iii) bevel vs. no bevel during cavity preparation (Cavalheiro et al. 2020; Lenzi et al. 2017; Oliveira et al. 2008). Most of the studies used hybrid composite resin and 3 studies reported results for bulk-fill composite resins (Akman and Tosun 2020; Cavalheiro et al. 2020; Ehlers et al. 2019). The most commonly used evaluation criteria were the modified USPHS criteria and duration of the studies was 1–4 years.

Hybrid composite resin restorations in primary teeth exhibited 0–19.5% AFR, while bulk-fill composite resins showed 0–16.9% AFR. Two separate studies reported 0% failure rate for bulk-fill composite resins with different evaluation criteria (FDI and modified USPHS) (Cavalheiro et al. 2020; Ehlers et al. 2019). Compared to amalgam restorations, amalgam demonstrated higher failure rates and more failures due to inferior marginal adaptation (Barr-Agholme et al. 1991). Similar or lower failure rates were recorded for composite resins compared to GIC/RMGIC, except from one study showing lower failure rates for RMGIC (El-Housseiny et al. 2019). Same or higher failure rates were shown for composite resins compared to compomers.

Etching of dentine for 15 s in comparison to 7 s resulted in comparable survival rates of bulk-fill composite resin restorations. Generally speaking, the clinical outcome was more favourable when phosphoric acid etching of dentine was performed for 7 s in contrast to 15 s (Cavalheiro et al. 2020). Etch-and-rinse adhesive technique demonstrated higher failure rates (27.8%) compared to self-etch (10.3%) after 1.5 years (Lenzi et al. 2017). When no bevel was placed, failure rate was twice as high of a value compared to bevel placement and twice as many lost teeth due to secondary caries (Oliveira et al. 2008).

The results for composite resin restorations in primary teeth are mainly based on studies with a high risk of bias (10/11) and 1 study with an unclear risk of bias (1/11). Therefore, the evidence is of low to unclear quality.

New biomaterials

As far as new restorative materials with bio-active molecules are concerned (Imazato et al. 2014), the search retrieved no results for RCTs on the clinical effectiveness of these restorative materials for the treatment of carious primary teeth.

Crowns

Two studies on crowns were included in the systematic review. Both studies were conducted prospectively in dental school settings (Alaki et al. 2020; Santamaria et al. 2017). They were carried out as randomised trials (parallel group design) comparing different restorative materials or techniques. In one study, zirconia crowns (ZC) and composite strip crowns (CSC) filled with a hybrid composite with fine particles (Z100) were evaluated (Alaki et al. 2020). The other study compared preformed metal crowns (PMC) placed using the Hall-technique with compomer restorations and non-restorative caries treatment (Santamaria et al. 2017). The follow-up times varied from 1 to 2.5 years with 40 to 60 restorations being followed. For evaluation, one study used the USPHS criteria (Alaki et al. 2020) and in the other one own criteria (successful, minor and major failures) were applied (Santamaria et al. 2017). The annual failure rate ranged between 3.3% (ZC) and 15% (CSC). Reasons for failure were secondary caries (2.5–6.7%) and loss or fracture of the restorations (2.5–38.3%).

Again, there is a low quality of evidence as the findings are based on studies with a high risk of bias.

Class-I vs. Class-II restorations

Three of the four trials solely evaluated the clinical performance of amalgam in Class-II cavities (Barr-Agholme et al. 1991; Dutta et al. 2001; Kavvadia et al. 2004) however, no comparison with Class-I restorations was made in the same study under the same standardised parameters. In the ART studies that compared Class-I restorations with Class-II, there was a higher annual failure rate for Class-II restorations (5–28.9%) compared to 1.2–14.7% for Class-I. Two out of nine studies with glass-ionomer cements included both Class-I and Class-II restorations (Alves dos Santos et al. 2010; Welbury et al. 1991) and the remaining examined only Class-II restorations. Higher survival rates were shown in Class-I restorations than in Class-II in 4 years (Alves dos Santos et al. 2010; Welbury et al. 1991); however, Welbury et al. 1991 did not provide separate data on the two types of restorations (Welbury et al. 1991). Two trials assessed Class-II compomer restorations combined with additional cavity classes and composite resin restorations compared with other cavity classes, namely, Class-I or Class-III and Class-V cavities (Alves dos Santos et al. 2010; Hse and Wei 1997). Again, Alves dos Santos et al. 2010 reported higher survival rates in Class-I compomer and composite resin restorations than in Class-II in 4 years (Alves dos Santos et al. 2010; Welbury et al. 1991). Hse and Wei 1997 did not give separate data on Class-II, -III and -V restorations but concluded that compomers can be suitable for restoring Class-III and Class-V cavities as well and not only Class-II (Alves dos Santos et al. 2010; Hse and Wei 1997). In addition, six studies reported on Class-II composite resin restorations, one on Class-I and 2 studies gave no information on cavity size. As far as crowns are concerned, 1 study included anterior primary teeth with at least two surfaces affected by carious lesions without mentioning the lesion depth (Alaki et al. 2020). The other RCT included primary molars with Class-II carious lesions extending into the dentine (Santamaria et al. 2017). Further comparison of the results from different study settings was not possible.

Restorative materials used for atraumatic restorative treatment (ART)

Ten trials investigated the use of Atraumatic Restorative Treatment (ART) in primary teeth (Deepa and Shobha 2010; Ersin et al. 2008, 2006; Faustino-Silva and Figueiredo 2019; Hesse et al. 2016; Konde et al. 2012; Moura et al. 2020; Olegario et al. 2019, 2020; Yassen 2009). Five studies evaluated the clinical performance of various types of GIC (conventional GIC, RMGIC, HVGIC, and glass carbomer) for ART (Deepa and Shobha 2010; Faustino-Silva and Figueiredo 2019; Konde et al. 2012; Moura et al. 2020; Olegario et al. 2020) and three studies evaluated different application techniques of GIC for ART (Ersin et al. 2008; Hesse et al. 2016; Yassen 2009). In one study, a compomer was compared with a GIC and a glass carbomer (Olegario et al. 2019), and in another a GIC with a composite resin (Ersin et al. 2006). Eight studies used cotton rolls as a form of isolation, and in the remaining 2 studies the isolation technique was not reported (Hesse et al. 2016; Konde et al. 2012). Various criteria were used to evaluate clinical performance and included USPHS criteria (Ersin et al. 2008, 2006; Faustino-Silva and Figueiredo 2019; Konde et al. 2012), special ART criteria or Frencken’s (Deepa and Shobha 2010; Faustino-Silva and Figueiredo 2019; Hesse et al. 2016; Moura et al. 2020; Yassen 2009). Two studies used Roeleveld criteria (Olegario et al. 2019, 2020). The follow-up period ranged from 1 to 4 years. There was a high variation of the AFR ranging from 1.2 to 37.1%. Only one study reported secondary caries as a reason for failure (Konde et al. 2012), due to the nature of ART, where caries is inherently left below the restoration at placement. The use of polyacrylic acid cavity conditioner prior to placement of GIC did not improve the success rate of the restoration (Yassen 2009). Using a bilayer technique for placement of HVGIC in proximal lesions positively influenced the survival rate when compared to conventional one step placement (Hesse et al. 2016). In the same study, the use of a nano-filled coating increased the restoration longevity. The use of a cavity disinfectant prior to GIC placement did not affect the success rate for both Class-I and Class-II ART restorations (Ersin et al. 2008). The use of composite resin in one study did not show an improved survival rate, when compared to HVGIC (Ersin et al. 2006). Fracture of restoration or loss of retention were the most common causes of failure (2.6–54% for fracture and 5.2–44.7% for loss of retention).

All these findings are premised on studies with a high or unclear risk of bias resulting in a low of unclear quality of evidence for restorative materials chosen for ART in primary dentition.

Quantitative synthesis of the included studies

The lack of standardised protocols impeded a valid interpretation of the actual results through pooled estimates. Substantial differences in the implemented interventions, follow-up duration and investigated outcomes indicated significant heterogeneity. Moreover, and probably more notably, the risk of bias assessment of the included studies indicated high risk in all, except in 4 studies, that were evaluated at unclear risk (rated with 'Some concerns') (Table 2). Again, significant clinical and methodological heterogeneity among these 4 studies was identified. Therefore, a meta-analysis was not feasible.

Table 4 RCTs with at least 1-year follow-up evaluating restorations on primary teeth: the systematic review results for the restorative treatment

Discussion

This systematic review aimed to investigate the quality of the evidence of published RCTs on the clinical effectiveness of contemporary restorative materials in vital primary teeth.

Restorative materials used covered a wide spectrum, being amalgam, different glass-ionomer cements, glass carbomer, compomer, composite resins and preformed crowns. Failure rates of the different materials varied greatly between the studies, as Chisini et al. 2018 also concluded (Chisini et al. 2018). Glass-ionomer cements demonstrated 7.6–29.9%, among them high-viscosity and the metal-reinforced glass-ionomer cements exhibiting the highest failure rates—the later agreeing with existing literature (Chisini et al. 2018). Amalgam restorations reached up to similar high failure rates (1–28%) with glass-ionomer cements. Glass carbomer restorations showed up to 46.2% failure rate; however, only one study with this material was included in the review. Composite resin restorations showed a failure rate of 0–19.5%, being slightly higher than that reported in a previous review (1.7–12.9%) (Chisini et al. 2018). However, included studies differed from the present review, as studies without control group were included in the first one and a language and year of publication restriction was imposed by those authors. The lowest failure rates in our study was shown by GIC, RMGIC and compomers and not by composite resin restorations as in Chisini et al. 2018 (Chisini et al. 2018). Regarding paediatric crowns, similar low failure rates (3.1%) are reported for PMC in the previous literature (Chisini et al. 2018), which does not include zirconia or composite strip crowns, with failure rates in our study of 3.3% and 15% accordingly. However, due to the fact that composite resin accumulates more plaque than zirconia (Alaki et al. 2020) or metal (Adamczyk and Spiechowicz 1990), the higher failure rate of composite strip crowns are justified. It has to be taken into consideration that composite strip crowns can also be classified as multi-surface composite resin restorations. Strictly speaking, strip crowns support the forming of the crown and the restorative material they are filled with decides about the long-term outcome. A systematic review by Alrashdi et al. (2021) showed that zirconia crowns may be an acceptable alternative to PMC showing similar retention rate, fracture resistance, and gingival health parameters (Alrashdi et al. 2021). The results of this systematic review do not allow for a direct comparison of crown failure rates due to the fact that PMCs were placed on primary molars and zirconia crowns on primary incisors. Especially for aesthetic crowns, the extent of the carious lesion may influence the long-term success, as adhesively bonded restorations require a maximum of bonded surface area. Most of the published studies with crowns report on crown placement after pulp therapy and their success may be influenced more by the management of the pulp treatment, rather than the crown itself (Alrashdi et al. 2021). In our review, 2 included studies examined PMC, zirconia or composite strip crowns without prior pulp therapy (Alaki et al. 2020; Santamaria et al. 2017). Based on the small number of included studies and the high risk of bias no certain conclusions can be drawn.

Some studies investigated the type of the restorative material used in various cavity designs, among them mostly on Class-I or Class-II cavities. However, information on the comparison for the same restorative materials in different cavity classes was limited. In general, literature agrees with the results of the present review that Class-I restorations tend to have lower failure rates than Class-II (Chisini et al. 2018).

The restorative materials used for ART were evaluated separately from their use for the conventional restorative approach. As cavity preparation and caries removal are performed with hand instruments for ART, the cavity design and extent of caries removal differ from the conventional restorative approach, where rotary instruments are used. Both may affect the outcome of the restorative treatment. When ART was examined—with most studies placing glass-ionomer cement restorations after caries excavation—the failure rates climbed up to 37.1%, which complies with the Cochrane Review about ART restorations, suggesting that ART shows higher failure rates than conventional high-viscosity glass-ionomer cement restorations (Dorri et al. 2017). Given the higher probability of failure when glass-ionomer cements are applied by ART, this treatment approach may be limited to situations, where rotary cavity preparation is not available or where Class-I cavities need to be restored with a HVGIC in small children with limited compliance.

Apart from the 10 ART studies (Table 3 and 4), only 10 out of 19 studies gave information on the involved caries excavation technique (complete (Alves dos Santos et al. 2010; Cavalheiro et al. 2020; Ehlers et al. 2019; Hse and Wei 1997; Kavvadia et al. 2004; Kilpatrick et al. 1995; Lenzi et al. 2017; Santamaria et al.2017; Welbury et al. 1991) vs. selective (Alves dos Santos et al. 2010; Kavvadia et al. 2004; Kupietzky et al. 2019; Santamaria et al. 2017)). First and foremost, comparison of heterogenic studies would not result to a valid conclusion. Second, since some of the USPHS criteria can be related to residual dentine caries, e. g. hypersensitivity, lack of this information could potentially influence the result of the study and attribute it to the material rather than to the excavation technique. Therefore, it should be suggested that future clinical studies should include more extensive information regarding the implemented procedures.

The level of experience and the number of operators as well as the setting of the clinical study were considerably inconsistent among the 29 included studies. The number of operators was 1–12, usually 1–2 and in some cases, it was not reported at all. Experience levels started from final year undergraduate students and went up to postgraduate students and specialized dentists, although this may have a tremendous influence on the clinical outcome. The exact years of experience of dentists are not mentioned in any study. Study setting for most ART studies was university/dental school type, although this technique is not originally designed for this. Most of the included studies for all other materials were performed in dental colleges and only a few in private practices. All those factors cause diversities within the included studies which could potentially influence their results and thus their comparison.

There was no uniform adhesion technique chosen for composite resin placement. Most of the studies with composite resin restorations (8/12) chose the etch-and-rinse procedure (Akman and Tosun 2020; Alves dos Santos et al. 2010; Barr-Agholme et al. 1991; Cavalheiro et al. 2020; Dermata et al. 2018; El-Housseiny et al. 2019; Hse and Wei 1997; Kupietzky et al. 2019; Oliveira et al. 2008). This makes comparisons between the studies challenging, even if the same restorative material is used. Only one study used the same adhesive in both etch-and-rinse and self-etching mode and was able to directly compare their effect (Lenzi et al. 2017).

With respect to the heterogeneity and the risk of bias of the included studies, only 4 studies out of 29 were rated as having “some concerns” in the quality assessment for the potential risk of bias (Table 2). A quantitative synthesis could not be performed because of the heterogeneity in study designs, comparisons chosen, outcomes described, and the overall high risk of bias of included RCTs. Accordingly, and despite the number of 29 included studies, clear recommendations for clinical practice based on pooling effects from the available evidence were impossible to be drawn.

Compared to Schwendicke et al. 2016, where a review and network meta-analysis was performed, the rationale of this systematic review was different (Schwendicke et al. 2016). The authors of the aforementioned study limited the publication years (2005–2015) and performed a meta-analysis by including “high risk of bias” papers. As a matter of fact, it is questionable if a meta-analysis is meaningful, when high risk of bias is identified and unclear conclusions are drawn.

The quality of evidence for the restorative materials evaluated at a minimum of 12 months varied and depended on factors, such as sample size calculation, randomisation, allocation concealment, blinding technique and dropout rate (Table 2 and in Fig. 2). Randomisation was a conflicting issue, since few authors presented detailed information and allocation concealment was not described thoroughly, if not at all. Based on the nature of the dental procedures and the different handling of different restorative materials, it is to be expected that double-blinding is difficult—if not impossible—to be achieved. Nevertheless, some studies attempted to minimise performance bias by excluding the operator(s) from assessing the effect of interventions. Moreover, time of physiological tooth exfoliation is not known, and is especially important for older children or studies with a wide age range. Therefore, despite the fact that factor age of the child may be involved in the success rates and longevity of the restorations placed, its effect remains uncertain.

Strengths

One of the major strengths of the current review is its stringent inclusion criteria—RCTs with more than 40 restorations per group, with a minimum of 12-month follow-up duration and less than 30% drop-out rate. In contrast to the latest published study on the same subject (Chisini et al. 2018), studies with no control group were excluded as well as when the control group was not standardised (Innes et al. 2011). Only studies with a minimum of 40 restorations per group were included, to comply with recent literature (Chisini et al. 2018) and in cases where at least one group had less than 40 restorations, the study was excluded (Andersson-Wenckert and Sunnegardh-Gronberg 2006). Drop-out ratio was set to a maximum of 30% either at patient level or at tooth level, to make included studies comparable (Tedesco et al. 2017). Drop-out ratios were either provided by the authors of each paper, or calculated at patient level and at tooth level according to the information given. Exfoliated teeth—either mentioned in the text or calculated by the authors—were not included in the drop-out. However, if another publication of the same cohort with a shorter follow-up met the aforementioned criteria, it was considered eligible for inclusion. The exclusive inclusion of randomised controlled clinical trials (RCTs) aimed to diminish the risk of selection bias, as restorative materials were randomly allocated among treatment groups (Schwendicke et al. 2016). Standardising the calculation of the failure rates reported in each trial and calculating the annual failure rate for all restorative procedures of the included papers (Opdam et al. 2014) allowed for direct comparisons between the included studies, offering another great strength. As opposed to other reviews, there was no restriction in the year of publication of included literature, as well as no language restriction.

Limitations

The large number of restorative materials evaluated across the studies, the difference in the study design (split-mouth vs. parallel), as well as their different levels of risk of bias, did not allow for studies to be included in the meta-analysis and that could be regarded as the major limitation of the systematics review. Regarding glass carbomer restorations, only one study was included in the review. Studies with longer observation periods than 12-month set were scarce and high drop-out rates along with a high number of exfoliations occurring in primary dentition did not allow reporting for longer follow-up periods. Despite the fact that the 12-month follow-up was chosen to be in accordance with previous reviews, the number of studies was small and thus the review does not report on failure rates of restorative material in vital primary teeth for longer time periods.

Another limitation is that 70% of the included ART studies were performed in a school setting and 30% at university, although this technique is originally designed to be conducted under field conditions without access to electricity (Frencken 2009). Apart from material characteristics, the choice of the restorative technique and the setting may influence the outcome, which was confirmed by the variation of AFRs reported in the included ART studies. For ART, the caries removal with hand instruments along with the restoration under field conditions (Frencken and Holmgren 1999) may be influential factors for the longevity of the restoration.

The procedure of caries excavation was not evaluated as a parameter, since information in the included papers was limited. Nevertheless, the extent of caries removal is of relevance for the success of the restorative treatment, as it was shown that composite resin restorations in primary molars presented a higher failure rate when selective caries removal was performed compared to complete caries removal, although the former technique was associated with a better outcome for the pulp (Liberman et al. 2020).

No universal study design (split mouth vs. parallel arm) was met, having an effect on quality of evidence. In split-mouth designs each patient acts as his own control which reduces inter-subject variability and/or random error, thus increasing study accuracy and power. Moreover, parameters which can cause confusion in the level of bias—such as age, oral hygiene level, and general health—can be eliminated in split-mouth design. The so-called “carry-across effect” is a major limitation of this study design describing distortion of the treatment effect due to mutual influence of both treatment sides (Pozos-Guillen et al. 2017). In cases of restorative treatment, however, this is unlikely to happen, since each material is placed on a certain tooth. However, a split-mouth design is contraindicated in case of asymmetrical distribution of carious lesions (e. g. varying severity) within the oral cavity (Pozos-Guillen et al. 2017).

Since caries risk was reported in very few studies, and there was no distinction between the sides or quadrants of the mouth, selection of the specific study design should be reconsidered. In addition, eligible patients for split-mouth set ups are more difficult to recruit, since symmetry in appearance of the disease appears more rarely in patients’ mouths. However, non-eligibility of a large number of patients imposes restrictions in the ability to generalize the conclusions of the studies, since they may not apply to the public. For example, choosing patients with cavities in all quadrants might create bias towards patients with a higher total number of cavities in their oral cavity (Chaffee et al. 2017), which is associated with the socioeconomic status (Schwendicke et al. 2015). On the other hand, one major aspect of parallel group studies is randomization and allocation concealment, which in the present review was the most common factor for leveling risk of bias to high risk. This study design also increases the number of subjects to be recruited. However, their statistical analysis is simpler than the one of the split-mouth studies. Other limitations could be factors that may cause diversities, such as the number of operators (1–12), experience level (final year undergraduate students up to specialized dentists) and study setting (university vs. private practice). It is often discussed that response for authors is very limited in systematic reviews, as reported by EAPD review group 1. Due to this experience, we decided not to contact the authors for further information. This also has to be accounted as a limitation of the current review.

Conclusions

Considering any limitations of the present review, the following conclusions can be made:

  • The included RCTs were mainly evaluated as at high-risk of bias, with the domains of randomisation and allocation concealment being the most problematic.

  • The results ranged within the same material group, as different evaluation criteria were used among the studies.

  • In view of phasing down the use of dental amalgam, this restorative material may no longer be recommended in primary dentition given the toxicological concerns and available tooth-coloured restorative materials with lower AFRs.

  • The type of glass-ionomer cement plays an important role on clinical outcome: Due to the inferior performance, MRGIC and HVGIC cannot be recommended. RMGIC may be a treatment option for Class-I and Class-II restorations in primary molars.

  • Compomers, hybrid composite resins and bulk-fill composite resins demonstrated similar failure rates, with compomers exhibiting the lowest AFR among them.

  • Crown material (zirconia or preformed metal) may have an impact on failure rates, although the existing data are extremely limited.

  • Restorative materials used for ART reported comparable AFRs to conventional amalgam restorations.

  • All in all, future thoroughly implemented RCTs evaluating restorations in primary teeth are needed to draft clear recommendations for the best restorative treatment approach.