Background

Periodontitis is characterized by gingivitis and periodontal tissue destruction resulting in alveolar bone and tooth loss [1]. Periodontitis is the sixth most common disease with a standardized prevalence of 11.2%. It is the primary cause of tooth loss and negatively affects oral health, nutrition, self-confidence, and overall health. It associates with various systemic chronic diseases such as angiocardiopathy and diabetes [2, 3]. The global burden of periodontal disease remains high, which increased by 57.3% from 1990 to 2010 [4].

Dental plaque biofilm is the initial factor of periodontitis, which aggregates to trigger immune responses. This tends to destroy surrounded soft tissues and alveolar bone [3]. The goal of periodontal treatment is to control infection, and remove dental plaque, dental calculus, and endotoxins [5]. It is confirmed that subgingival scaling is an effective non-surgical periodontal therapy. In the early stage, subgingival scaling was practiced with manual scaling. In recent years, ultrasonic subgingival scaling has been applied in periodontal clinic [1, 6,7,8,9]. Each technique has advantages and disadvantages. Currently there is no universal protocol or clinical guideline for selecting one technique over another.

Many in-vivo studies did not group by initial probing pocket depth (PPD) when comparing ultrasonic and manual subgingival scaling, but different PPD may greatly influence instrument selection. For example, Beuchat found that when initial PPD < 6 mm, for gingival recession (GR), there was no statistical difference between ultrasonic and manual subgingival scaling (P < 0.05). When initial PPD > 7 mm, attachment level (AL) improved more while GR reduced more in using ultrasonic instruments than using manual instruments [10]. It is controversial in terms of when to use manual subgingival instruments and when to use ultrasonic ones.

In this study, we aimed to compare the work effectiveness between ultrasonic and manual subgingival scaling in different initial PPDs. It can provide new evidence for clinical instrument selection and future study.

Methods

A protocol has been registered at the International Prospective Register of Systematic Reviews (Number CRD42019125067). The content of this article is consistent with the protocol.

Research question

The focused question was developed in accordance with recognized Patient, Intervention, Comparison, and Outcome (PICO) format: “What’s the difference of the work effectiveness between ultrasonic and manual subgingival scaling in periodontal treatment at different initial PPDs?”

Selection criteria

Study type

In-vivo randomized controlled trials (RCT) that compared manual- and ultrasonic- subgingival scaling in periodontal treatment were included. Studies had to be in English. We classify the initial PPDs into: (1) shallow pocket: PPD ≤4 mm; (2) medium pocket: 4 mm < PPD < 6 mm; or, (3) deep pocket: PPD ≥6 mm. A group of studies in each article could not be simultaneously included in different PPD categories. Teeth with single-roots or multi-roots were all included in the study.

Participants

Adults (age ≥ 18) diagnosed with periodontitis unaccompanied by any other oral or systemic disease and not taking antibiotics.

Intervention

Meta-analysis sought to eliminate bias caused by different initial PPDs to compare ultrasonic subgingival scaling with manual ones. No distinctions were made between ultrasonic subgingival instrument makers or models. A Gracey scraper was chosen as the manual instrument.

Outcomes

A study may have primary and secondary indicators. Each indicator was processed differently. Not all outcomes were consistent with the criteria. The outcomes included appear in tables (Table 1 and 2).

  1. (1)

    Primary outcomes:

Table 1 Included studies characteristics (Primary outcome measures)
Table 2 Included studies characteristics (Secondary outcome measures)

PPD and clinical attachment loss (CAL) were the primary outcomes to compare different subgroup outcomes.

  1. (2)

    Secondary outcomes:

Bleeding on probing (BOP), gingival recession (GR), and post-scaling residual dental calculus were used as measures. These indicators are of interest, but data about them could not be extracted for meta-analysis due to the limited number of studies, their different measurements, and definitions. They were analyzed by comparing the results of each study.

Studies meeting the following conditions were excluded: (1) follow-up in less than 3 months; (2) treatment during follow-up. Cluster trials were not included.

Search strategy

PubMed, Cochrane Central Register of Controlled Trials, EMBASE, Medline were searched until January, 2019 for relevant studies. The search was performed using a combination of controlled vocabulary and key words (Appendix 1). Only English articles were searched. No time restrictions were imposed.

For potentially eligible studies, we searched ClinicalTrials.gov for prospective trial registers for controlled trials, with publication time up to January, 2019. In addition, the reference lists from the selected articles were checked for further studies qualifying for the review. Only articles written in English were selected. In this process, an eligible study contrasted ultrasonic and manual subgingival scaling, whether or not other methods were compared at the same time. And the initial PPD of each subject in an eligible study could be classified into shallow, medium or deep pocket at the same time.

Data collection and analyses

Study selection and quality assessment

Three review authors (XZ, ZH, and XSZ) independently searched and included eligible studies, using the same search strategy which had been completed and improved. The quality of each study was reviewed and evaluated for 3 times by three authors and relevant data were extracted. When there was a disagreement whether to include or not, a discussion with other authors (JC and WL) was held and an agreement reached on inclusion or exclusion.

A study’s methodological quality was assessed using the original publication. Trial quality was evaluated using Cochrane review bias assessment risk criteria [11]. The GRADE approach was used to assess quality of evidence.

This included random sequence generation (selection bias), allocation concealment (selection bias), blinding of participants and personnel blinding (performance bias), outcome assessment blinding (detection bias), incomplete outcome data (attrition bias), selective reporting (reporting bias) and other biases. Possible ratings were ranked by risk: low (L); high (H); uncertain (U).

Statistical analyses

When appropriate, data extracted was combined for meta-analysis using Review Manager 5.3. Effect size was estimated and reported as the mean difference (MD) for continuous variables with a 95% confidence interval (CI). Weight was calculated in individual studies based on the inverse of variance. This study used a random-effects model for analyses due to expected heterogeneity of the studies selected. Study statistical homogeneity was assessed using a Cochran test and by examining the observed variances in effect sizes and residual variance. I2 was calculated to quantify heterogeneity. I2 > 50% was considered significant [11] . No statistical corrections were used to adjust for multiple analyses. We set the ultrasonic subgingival scaling as the experimental group and the manual subgingival scaling as the control group.

According to Cochrane reviews [11], in meta-analysis, studies with baseline changes as outcomes could be combined with those with final measurements as outcomes. In randomized trials, differences in mean values obtained from baseline changes were usually analyzed on the basis of final measurements and obtained the same effects. In this meta-analysis, baseline changes and final measurements from different studies were combined.

The reason of disagreement from previous studies about ultrasonic and manual subgingival scaling might be that they did not compare them in the same initial pocket depth. In other words, they did not control the variate of initial PPD. So, in this study, we established three subgroups based upon initial pocket depth: shallow (≤4 mm); medium (4-6 mm); and deep (≥6 mm), to eliminate the effect of the initial PPD to the result.

If a study had two initial PPD groups that could be included in one subgroup, data was combined to conform to the depth classification, using the formulas in Appendix 2 [11].

Sensitive analysis

Pre-planned sensitivity analysis had been done, analyzing the data in the same follow-up time and initial pocket depth: shallow (≤4 mm), medium (4-6 mm), or deep (≥6 mm). And we deleted a study each time and performed a new meta-analysis to see if the heterogeneity had changed significantly. If a study was deleted and the heterogeneity was significantly reduced, it was considered to be the main source of heterogeneity, needing further read and evaluation.

Results

Study selection

There were 1434 studies searched. References in selected papers were searched and no additional studies were located. After reading full texts, ten studies were finally selected. Process selection appears in Fig. 1.

Fig. 1
figure 1

Selection process PRISMA flow chart

There were 495 duplicates in the 1434 studies. 875 studies were ruled out after reading titles and abstracts. Another 43 studies were ruled out after reading the full text. There were 11 unavailable or awaiting classifications. Finally, 10 studies were included. Studies eventually included were from 2004 to 2015.

Study characteristics

All research included were RCTs. Follow-up periods were 3 and 6 months. Characteristics of articles selected for primary outcomes appear in Table 1. Characteristics of articles selected for secondary outcomes appear in Table 2. Reasons for study exclusion appear in Table 3 [5, 6, 10, 24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63].

Table 3 Reasons for excluding studies

Limited information of 11 studies could be obtained from the publication. Inclusion, or exclusion could not be decided because of unobtainable full texts and unknown specific conditions. These appear in Appendix 3.

Quality and risk of bias assessment

Bias analysis results for the studies appear in Appendix 4 and 5. Most studies did not have high bias risks. Funnel plots could not be done due to limited number of studies (< 10).

Meta-analysis results of primary outcomes

The follow-up period lengths of the studies varied. Most were 3 and 6 months. This allowed for grouping into 3- and 6-months and reduced heterogeneity.

Outcome 1:PPD (Fig.2)

  1. (1)

    3 months: (Fig. 2a)

  1. a)

    Initial PPD ≤4 mm

    Only one met the criteria, which reported no statistical differences between ultrasonic and manual subgingival scaling [12].

  2. b)

    Initial PPD > 4 mm

    When initial PPD was medium, differences between ultrasonic and manual subgingival scaling were statistically significant. PPD reduction after manual subgingival scaling was greater than ultrasonic instruments (MD 0.14, 95% CI [0.02, 0.26], P = 0.02). Heterogeneity was great (Tau2 = 0.01; Chi2 = 28.94, df = 3 (P < 0.00001); I2 = 90%) [12,13,14,15].

    When initial PPD was deep, heterogeneity was acceptable (Tau2 = 0.01; Chi2 = 4.24, df = 3 (P = 0.24); I2 = 29%). PPD reduction after the two treatments were not statistically significant (MD 0.13, 95%CI [− 0.02, 0.28], P = 0.09) [12,13,14, 16].

  1. (2)

    6 months: (Fig. 2b)

  1. a)

    Initial PPD ≤4 mm

Fig. 2
figure 2

a. Forest plot comparing PPD at 3-months with manual subgingival scaling versus ultrasonic subgingival scaling of initial PPD > 4 mm in terms of the following: 1.1.1 initial PPD 4-6 mm; 1.1.2 initial PPD ≥ 6 mm. b. Forest plot comparing PPD at 6-months with manual subgingival scaling versus ultrasonic subgingival scaling of initial PPD > 4 mm in terms of the following: 1.2.1 initial PPD 4-6 mm; 1.2.2 initial PPD ≥ 6 mm

Two studies [12, 17] met the criteria as they used baseline changes as outcomes and one baseline changes value of 0. A meta-analysis could not be done using the two studies. They both reported no statistical differences between ultrasonic and manual subgingival scaling.

  1. b)

    Initial PPD > 4 mm

When initial PPD was medium, differences between ultrasonic and manual subgingival scaling were not statistically significant (MD 0.19, 95%CI [0.11, 0.27], P = 0.22). Heterogeneity was large (Tau2 = 0.02; Chi2 = 25.89, df = 3 (P < 0.0001); I2 = 88%) [12, 15, 17, 18].

When initial PPD was deep, heterogeneity was also great (Tau2 = 0.08; Chi2 = 8.74, df = 3 (P = 0.03); I2 = 66%). PPD reduction after manual subgingival scaling was greater than ultrasonic instruments with a statistically significant difference (MD 0.50, 95%CI [0.10, 0.89], P = 0.01) [12, 16,17,18].

Outcome 2:CAL (Fig. 3)

  1. (1)

    3 months: (Fig. 3a)

  1. a)

    Initial PPD ≤4 mm

Fig. 3
figure 3

a. Forest plot comparing CAL at 3-months with manual subgingival scaling versus ultrasonic subgingival scaling of initial PPD > 4 mm in terms of the following: 2.1.1 initial PPD4-6 mm; 2.1.2 initial PPD ≥ 6 mm. b. Forest plot comparing CAL at 6-months with manual subgingival scaling l versus ultrasonic subgingival scaling of initial PPD were shallow in terms of the following: 2.2.1 initial PPD ≤ 4 mm;2.2.2 initial PPD4-6 mm; 2.2.3 initial PPD ≥ 6 mm

Only one met the criteria, reporting no statistically significant difference between the two methods [12].

  1. b)

    Initial PPD > 4 mm

When initial PPD was medium, differences between ultrasonic and manual subgingival scaling were not statistically significant (MD -0.08, 95%CI [− 0.18, 0.03], P = 0.14). Heterogeneity was great (Tau2 = 0.01; Chi2 = 10.92, df = 3 (P = 0.01); I2 = 73%) [12,13,14,15].

When initial PPD was deep, difference between ultrasonic and manual subgingival scaling was not statistically significant (MD -0.06, 95%CI [− 0.58, 0.46], P = 0.81). Heterogeneity was also high (Tau2 = 0.16; Chi2 = 23.28, df = 3 (P < 0.0001); I2 = 87%) [12,13,14, 16].

  1. (2)

    6 months: (Fig. 3b)

  1. a)

    Initial PPD ≤4 mm

The heterogeneity of the two studies was too large (Tau2 = 0.10; Chi2 = 10.55, df = 1 (P = 0.001); I2 = 91%) for a meta-analysis to be performed. They both indicated no statistically significant differences between ultrasonic and manual instruments [12, 17].

  1. b)

    Initial PPD > 4 mm

No statistically significant differences were found between the ultrasonic and manual subgingival scaling when initial PPD was medium (MD -0.06, 95%CI [− 0.17, 0.06], P = 0.33). Heterogeneity was slightly great (Tau2 = 0.01; Chi2 = 6.29, df = 3 (P = 0.10); I2 = 52%) [12, 15, 17, 18].

At deep pocket depth, differences between the two were not statistically significant (MD 0.28, 95%CI [− 0.20, 0.77], P = 0.26). Heterogeneity was large (Tau2 = 0.15; Chi2 = 10.37, df = 3 (P = 0.02); I2 = 71%) [12, 16,17,18].

Secondary outcome measures

  1. (1)

    GR: Sculean et al. [18] indicated no statistical differences studying single or multiply-root teeth between ultrasonic and manual subgingival scaling at 6-months when initial PPD was deep. Kargas et al. [15] noted that, at medium depths, there were no statistical differences between ultrasonic and manual subgingival scaling at either 3 or 6-months.

  2. (2)

    BOP: Christgau et al. [17] found that at 6-months, manual subgingival scaling showed greater BOP reduction at initial deep pocket depth compared to ultrasound.

  3. (3)

    Residual dental calculus: Schwarz et al. [19] indicated that for single-root teeth at deep initial depth, ultrasonic subgingival device was superior to manual instruments in removing subgingival dental calculus. Yukna et al. [20] found no statistical differences in residual dental calculus rates between ultrasonic and manual subgingival scaling with initial PPD at 5-6 mm, 7-8 mm or > 9 mm. Gellin et al. [21] found no statistical differences in dental calculus clearance rates between the two methods when initial PPD was 0-3 mm, 4-5 mm, or, 6-12 mm. When ultrasonic subgingival scaling was combined with manual instruments, the effectiveness was superior to either ultrasonic or manual instruments individually [21].

Discussion

Sensitivity analysis

Outcome: PPD

At 3-months, at medium depth, heterogeneity was great (I2 = 90%, Fig. 2a). After sensitivity analysis, four studies were found highly heterogeneous to each other and were unsuitable for meta-analysis. After a bias analysis, the heterogeneity source was thought to be: (1) small number of studies; (2) the fact that tissue healing took time and early probing disrupted attachment gains. At 3-months, PPD and CAL reductions were unstable, causing large heterogeneity.

At medium depth, 6-months, Kargas et al. [15] was found to have significant heterogeneity. After excluded, heterogeneity decreased to 0% (Tau2 = 0.00; Chi2 = 0.02, df = 2 (P = 0.99); I2 = 0%). The results showed statistically significant differences between manual and ultrasound groups. PPD reduction after manual subgingival scaling was greater than ultrasonic subgingival scaling (MD 0.19,95%CI [0.11, 0.27], P < 0.00001). Compared with the other three studies, only non-smokers were included in this study, which might be the reason for heterogeneity (Fig. 4a).

Fig. 4
figure 4

a. Sensitivity analysis of PPD at 6-months in terms of the following: 1.2.1 initial PPD 4-6 mm; 1.2.2 initial PPD ≥6 mm. b. Sensitivity analysis of CAL at 3-months in terms of the following: 2.1.1 initial PPD 4-6 mm; 2.1.2 initial PPD ≥6 mm. c. Sensitivity analysis of CAL at 6-months in terms of the following: 2.2.1 initial PPD ≤ 4 mm; 2.2.2 initial PPD 4-6 mm; 2.2.3 initial PPD ≥6 mm

When initial PPD was deep at 6 months, D’Ercole [16] was a major origin of heterogeneity. After exclusion, heterogeneity decreased to (Tau2 = 0.03; Chi2 = 3.86, df = 2 (P = 0.15); I2 = 48%) (Fig. 4a).

Outcome:CAL

When initial PPD was medium, at 3-months follow-up, the heterogeneity of CAL was high (Tau2 = 0.01; Chi2 = 10.92, df = 3 (P = 0.01); I2 = 73%). According to a sensitivity analysis, four studies were highly heterogeneous with each other, making them unsuitable for meta-analysis (Fig. 4b). The heterogeneity source was also thought to be: (1) too few studies; (2) tissue healing took time and early intervening probing may damage attachment gain. When the follow-up period was only 3 months, CAL were unstable which caused great heterogeneity.

Heterogeneity was also large in the following three groups: 1) deep pocket, at 3-months follow-up; 2) medium pocket, at 6-months follow-up; 3) deep pocket, at 6-months follow-up. After excluding Ioannou 2009 [12], heterogeneity decreased from: 87, 52, 71 to 24%, 0, 0%. This study was the only one in which 50% of the patients were smokers, while other papers were unclear about the ratio of smokers or had a small number of smokers, which might be the reason for heterogeneity. After exclusion, at deep pocket depth of a 6-months follow-up, after manual subgingival scaling, CAL reduction was more than ultrasonic subgingival scaling and were statistically different. (MD 0.58, 95%CI [0.27, 0.89], P = 0.0002) (Fig. 4b, Fig. 4c).

In the same study, heterogeneity of PPD and CAL was much greater at 3-months than at 6-months follow-up. The influence was quite apparent at medium PPD, 3-months. At deep initial depths, either 3-months or 6-months, heterogeneity was acceptable. The reason could be that the tissue took time to heal. In deep pocket, tissue contacted and attached to the bone better, resulting in shorter healing time and more stable condition within 3 months. At medium pocket depths, tissue did not contact bone as readily as at deeper pockets, so healing time was longer. Probing too early in healing process may damage tissue and influence attachment gain, and led to unstable results.

Quality of evidence

After evaluating the quality of data with the GRADE system, the following results were obtained: the data of PPD at 3-months and 6-months follow-up of shallow pocket, and CAL at 3-months follow-up of shallow pocket was of very low quality; the data of PPD at 3-months follow-up of medium pocket, CAL at 3-months follow-up of medium and deep pocket and at 6-months follow-up of shallow pocket was of low quality; the data of PPD at 3-months follow-up of medium pocket, at 6-months follow-up of medium and deep pocket and CAL at 6-months follow-up of medium and deep pocket was of moderate quality. Details were given in the Appendix 6. The data of shallow pocket mostly was of very low quality because of too small sample size and publication bias, so that we could not draw a reliable conclusion and more studies are required. The data of 3-months follow-up was of low or very low quality, which might due to the fact that tissue healing took time and early probing disrupted attachment gains. We thought 3-months follow-up was too short and we cannot draw reliable conclusions according to the data of it. The data of PPD and CAL at 6-months follow-up for the medium and deep pocket groups was of moderate quality. Based on the above, we thought the conclusion should be drawn according to the data of 6-months follow-up of medium and deep pocket groups.

According to the above analysis, different indicators showed statistical significance between ultrasonic and manual subgingival scaling, which indicated the different effectiveness in clinic after the treatment of ultrasonic and manual instruments.

In addition, in shallow pocket, CAL increased after both ultrasonic and manual subgingival scaling, which might be resulted from junctional epithelium attachment damage [17]. We have also found it clinically. Therefore, manual subgingival scaling is not recommended when PPD is less than 4 mm. In clinical practice, when PPD is less than 4 mm and there is symptom such as bleeding on probing or subgingival dental calculus, ultrasonic subgingival working tip can be used for deep cleaning.

In terms of GR, at medium or deep PPD, there was no differences between manual and ultrasonic subgingival scaling, whatever the roots were single or multiple [15, 18].

BOP results of one study [17] showed, at 6-months follow-up, more BOP reduction at deep depths after manual scaling than ultrasound.

Residual calculus provided different results. Two studies [20, 21] indicated that, regardless of depth, there were no statistical differences in calculus clearance rates between ultrasound and manual treatment. Schwarz [19] indicated when PPD was deep, for a single-root tooth, ultrasonic dental calculus removal was more effective than manual subgingival scaling.

Above all, ultrasonic subgingival scaling is an efficient non-surgical treatment [7], yet manual subgingival scaling is also essential and cannot be replaced by ultrasonic method.

According to the newly-released 2018 periodontitis classifications, there are something about it:

  1. 1.

    Different economic and health care developments between developed and developing countries make different influences on periodontitis [22, 23]. Primary CAL in developing countries was three times of developed countries [23]. Only one study [13] involved a developing country (Turkey). Whether the conclusions reached in this paper apply to developing countries is unknown.

  2. 2.

    Smoking is confirmed as affecting progress of periodontitis and was considered in the new classification [22]. Most of the included studies chose patients according to the 1999 classification and did not consider the impact of smoking, which may lead to heterogeneity.

Summary

Significance to clinical practice

  1. (1)

    Combining our above analysis and quality of evidence, we believed that only 6-months follow-up results could be used to reach following conclusions:

    1. a.

      When initial PPD was shallow, no conclusions were drawn due to the limited number of studies.

    2. b.

      When initial PPD was medium, PPD reductions proved that manual subgingival scaling was superior. CAL and GR results showed no statistical differences. More studies are needed before any conclusion can be drawn.

    3. c.

      When initial PPD was deep, manual subgingival scaling was superior in terms of PPD, CAL and BOP results, while GR results showed no statistical differences. This conclusion also needs more study because of the limited number of studies.

  2. (2)

    In terms of residual dental calculus, there was no conclusion could be drawn.

Significance to research

  1. (1)

    Inclusion and exclusion criteria could refer to the new classification to reduce bias;

  2. (2)

    Studies should consider other indicators such as BOP, PI, and GI, bacterial changes, when comparing in different PPDs;

  3. (3)

    More studies are needed in developing countries;

  4. (4)

    Single and multiple root teeth should be measured separately;

  5. (5)

    Further studies should enlarge sample sizes to improve credibility;

  6. (6)

    Inclusion or exclusion criteria for smokers should be standardized.

  7. (7)

    Studies with a follow-up period of 6 months or longer are suggested to determine reliable results.

Conclusions

Ultrasonic subgingival scaling is an efficient non-surgical treatment of periodontitis. However, when initial PPD was 4-6 mm, PPD reduction proved manual subgingival scaling was superior, but CAL results showed no statistical differences between the two means. When initial PPD was ≥6 mm, PPD and CAL reductions suggested that manual subgingival scaling was superior. Manual subgingival scaling is significant and cannot be completely replaced by ultrasonic subgingival scaling. We suggest, when initial PPD is medium or deep, using ultrasonic and manual subgingival instruments together.