Introduction

Controlling the vertical facial dimension during orthodontic treatment is especially important among patients with skeletal hyperdivergence and is often challenging. It is commonly believed that individualized treatment goals and protocols according to the patient’s vertical configuration contribute to improved vertical control. In patients with an increased vertical dimension, this goal would be to minimize molar extrusion, since that would lead to clockwise mandibular rotation [1, 2]. Apart from treatment mechanics, orthodontic treatment including systematic extractions of four premolars has long been thought to result in favorable outcomes in hyperdivergent cases [3].

The bite-closing tendency associated with premolar extractions has been attributed to protraction of the posterior teeth into the extraction sites that might allow the mandible to rotate counterclockwise, thereby reducing the so-called “wedge effect”. Furthermore, reciprocal space closure with retraction and possible retroclination of the anterior teeth could additionally deepen the bite, the so-called “drawbridge effect” [4]. These changes associated with extraction therapy are considered beneficial and could lead to a decrease in lower anterior facial height and a decrease in lip strain [1, 5], thus, improving esthetics in hyperdivergent cases. However, the impact of extraction therapy on the inclination of the mandible and the corresponding reduction in anterior facial height is not unanimously supported in the literature. A systematic review concluded that there is no specific effect of systematic premolar extractions on the skeletal vertical measurements but stated that uncertainty exists in these findings due to the high risk of bias of available studies [6].

Previous studies assessing the effect of extractions on the skeletal vertical dimension included only class II patients thereby limiting generalizability [7,8,9,10], did not control for crowding or bimaxillary protrusion [11,12,13] thereby not allowing for estimation of the “wedge effect”, or did not ensure comparability between extraction and non-extraction groups [7, 14, 15]. Thus, the present study aimed to cephalometrically investigate the effect of four premolar extractions on the vertical dimension in hyperdivergent patients with fixed appliances, irrespective of anteroposterior relationships.

Materials and methods

This retrospective cohort study was based on the archives of patients treated at the Clinic of Orthodontics and Pediatric Dentistry, University of Zurich, Switzerland. The protocol of the study was developed in advance, but it was not registered, and received ethical approval by the Cantonal Ethics Committee Zurich (BASEC-No.: 2020-02717). All patients or their guardians signed an informed consent prior to their treatment initiation.

Patients’ eligibility was consecutively proved on the basis of the following criteria without considering other factors, such as treatment outcome, duration, patient’s cooperation: (i) available diagnostic records of high quality, (ii) hyperdivergent patients with a SN-ML angle of 34° or more, (iii) no more than 6 mm crowding in the lower dental arch, (iv) patients treated with fixed edgewise appliances with extraction of four premolars for orthodontic purposes or without premolar extractions, (v) no missing teeth or tooth anomalies, (vi) no syndromes or clefts, physical or mental impairment, and (vii) no previous orthodontic treatment.

Sample size calculation was based on a previous study assessing the treatment of patients with skeletal hyperdivergence [10], from which the assumed response for the SN-ML in the non-extraction group was taken (mean 39.3°; standard deviation [SD] 4.5°). Assuming a minimal clinical difference of 10% with the same SD, a total of 62 patients (31 patients per group) would be needed to identify a minimal clinically relevant difference of 10% (with common SD) using an independent-samples t‑test at alpha (α) = 5% and beta (β) = 10%. It was decided to set power to 90%, as previous studies had failed to identify any differences. Therefore, all consecutively eligible patients were included in this study until a minimum of 31 patients per group were included.

All patients were treated by postgraduate students in orthodontics under the direct supervision of experienced clinical instructors. Treatment planning and procedures followed the protocol of the clinic. All present patients received standard edgewise fixed appliances with a 0.018-inch slot. Archwire sequence and treatment mechanics were specified according to each patient’s needs. Either sliding mechanics or closing loops were used for space closure with a 0.016 × 0.022-inch stainless steel wire.

Patient characteristics (sex, age, teeth extracted, overjet, overbite, crowding, orthodontic appliances) were extracted from the patients’ files. Lateral cephalograms were taken in natural head position at two timepoints: prior to treatment initiation (T1) and after debonding (T2). They were traced to assess several skeletal and dental parameters (Supplementary Table 1). The primary outcome of this study was the divergence of the skeletal mandibular base (SN-ML angle). To assess consistency of the measurement method, a set of 50 lateral cephalograms was remeasured by the first author within a 2-week interval and another set of 50 lateral cephalograms was remeasured by the last author.

Although all material for this study was coded, no blinding was possible at outcome measurement, since extraction treatment can easily be identified on lateral cephalograms. However, statistical analysis was undertaken blindly using a coded dataset.

Statistical analyses

Initially, descriptive statistics were calculated after checking normality, including means and SDs for normally distributed continuous outcomes (medians and interquartile ranges [IQR] for skewed data) or absolute/relative frequencies for categorical outcomes. Differences in baseline characteristics were assessed with independent-samples t‑tests, Mann–Whitney tests, or Χ2 tests. Crude differences in outcomes were assessed with linear regression using T2 values as dependent variables, the extraction group as independent variable and T1 values as covariates. Adjusted regressions were run to control for potential confounders, which were selected with a 10% change-in-estimate approach. All analyses were run in StataSE 13.0 (StataCorp, College Station, TX, USA) with a two-sided α of 5% and an open dataset available through Zenodo [16].

Results

In all, 76 patients, of which 51% (n = 39) were females, with a mean age of 11.9 ± 1.7 years, were included in the present study (Table 1). From these patients, 31 were treated with four premolar extractions and 45 were treated non-extraction. No significant differences (P > 0.05) in patients’ age, sex, overjet, or overbite were observed at baseline. The average overjet was 5.1 ± 2.5 mm and the average overbite was 2.4 ± 1.9 mm. The only difference was the available arch space, which in the extraction group was significantly less than in the non-extraction group for both the upper (−2.1 versus −0.7 mm) and the lower (−2.0 versus +0.7 mm) arches.

Table 1 Tab. 1 Demographics of included patientsDemografische Details der aufgenommenen Patienten

Cephalometrically, the two groups were very similar for most skeletal and dental parameters (Table 2). The included sample showed an increased vertical dimension with a mean SN-ML of 40.2° (SD 3.5°), SN-NL of 6.8° (SD 3.7°), and SN-OP of 20.4° (SD 3.3°) and a class II tendency with an ANB of 4.6° (SD 2.3°). The only significant differences between groups at baseline were seen for NL-ML (34.5° and 31.9° in the extraction and non-extraction groups, respectively) and SpaMe:NMe (59.1% and 58.0% in the extraction and non-extraction groups, respectively). The inclinations of the upper and lower incisors were similar between the groups and normal at baseline with U1-NL and L1-ML of 109.8° (SD 7.2°) and 91.9° (SD 7.3°), respectively.

Table 2 Tab. 2 Baseline cephalometric assessment of included patientsKephalometrische Ausgangsuntersuchung der aufgenommenen Patienten

As far as treatment appliances are concerned, the only statistically significant difference between the groups was the use of a lingual arch in 13% of the patients in the non-extraction group and 44% of the patients in the extraction group (Supplementary Table 2).

Orthodontic treatment with extraction of four premolars had little effect on the vertical dimension (Table 3). Changes in mandibular divergence (SN-ML) were similar in the non-extraction (−0.6°) and extraction (−0.5°) groups, to a difference in means of 0.1° (95% confidence interval [CI] = −0.9 to 1.1°; P = 0.88). Similarly, no significant differences were found for SN-NL (0.8° and 0.9° in the non-extraction and extraction groups, respectively) or for NL-ML (−1.3° and −1.4° in the non-extraction and extraction groups, respectively). The only difference was observed for the SN-OP with significantly less anterior rotation in the non-extraction group (−0.2°) compared to the extraction group (−1.5°), resulting in a difference in means of −1.5° (95% CI −2.9 to −0.1°; P = 0.04). Furthermore, no significant differences were found for SAr:ArGo (P = 0.88), SGo:NMe (P = 0.47), or SpaMe:NMe (P = 0.29).

Table 3 Tab. 3 Cephalometric changes during treatment in the non-extraction and extraction groupsKephalometrische Veränderungen während der Behandlung in der Nichtextraktions- und in der Extraktionsgruppe

As far as sagittal skeletal parameters are concerned, the extraction group presented a greater reduction (−1.5°) than the non-extraction group (0°; P = 0.003) in the SNA angle, as well as a greater reduction (−1.6°) than the non-extraction group (−0.4°; P = 0.004) in the ANB angle.

Incisor inclination was only slightly affected by the decision to extract four premolars or not. The upper incisors were slightly more retroclined in the extraction group (−3.2°) than in the non-extraction group (−1.9°), which was not statistically significant. On the other hand, significant differences were observed for the inclination of the lower incisors, which were proclined in the non-extraction group (+3.5°) and slightly retroclined in the extraction group (−2.2°).

No significant effects from confounders (Supplementary Table 3) were noted for the primary outcome (SN-ML) or the two selected secondary outcomes (SGo-NMe, SpaMe:NMe; Table 4). The effect of premolar extractions on both SN-ML and SGo:NMe was nonsignificant in the crude analysis and remained so after adjusting for confounders. The effect of extractions on SpaMe:NMe was nonsignificant on average in the crude analysis (−0.44°), but hints of confounding according to the specific premolars extracted were identified. Different effects on SpaMe:NMe were observed for the extraction of four first premolars (difference (Δ) −1.39% to the non-extraction group), upper first/lower second premolars (Δ −0.86% to the non-extraction group), or four second premolars (Δ 0.47% to the non-extraction group), with statistically significant differences among them (P = 0.04).

Table 4 Tab. 4 Effect of four premolar extractions on the vertical parameters at T2 (with the vertical parameter at T1 as covariate) using either crude models or models adjusted for selected baseline covariates (in Supplementary Table 3)Auswirkung der Extraktion von 4 Prämolaren auf die vertikalen Parameter bei T2 (mit dem vertikalen Parameter bei T1 als Kovariable) unter Verwendung von Rohmodellen bzw. für ausgewählte Ausgangskovariablen angepasste Modellen (vgl. ergänzende Tab. 3)

For all cephalometric measurements, concordance was almost perfect within examiner (range 0.9–0.996) and between examiners (range 0.925–0.997). Systematic bias was also extremely low both within and between examiners (up to 0.35° for all cephalometric angles and 0.65% for the cephalometric ratios; Supplementary Table 4).

Discussion

This study assessed the effect of systematic premolar extractions on the vertical skeletal dimension in 76 children with skeletal hyperdivergence treated with fixed appliances. Extraction of four premolars had no consistent effect on the skeletal vertical dimension of the face represented by the cephalometric measurements SN-ML, SpaMe:NMe, and SGo:NMe. The only significant influences pertained to dental parameters like SN-OP and L1-ML, where that the extraction group presented an increased anterior rotation of the occlusal plane and an increased retroclination of the lower incisors. Furthermore, the SNA and, subsequently, ANB angles were decreased in the extraction group (Table 3), but this could still be due to treatment-induced alterations to point A [17]. Apart from the decision which teeth to be extracted on the variable SpaMe:NMe, none of the covariates (dental or skeletal parameters as well as orthodontic appliances) identified (Supplementary Table 3) and tested (Table 4) had a considerable confounding effect on any of the skeletal vertical cephalometric parameters.

According to the results of the present study, it seems that the effect of four premolar extraction therapy on the vertical skeletal dimension is not apparent in preadolescent and adolescent children with skeletal hyperdivergence. On the contrary, in a similar hyperdivergent class II sample, Porto et al. [18] reported a statistically significant decrease in the SN-GoGn angle and facial axis (SN-Gn angle) of less than 1° in the extraction group compared with an increase of less than 1.5° in the non-extraction group. A similar beneficial effect on the anterior facial height was also noted for the extraction group compared to the non-extraction group in both class I and II patients [5]. Furthermore, in a slightly older borderline sample selected with discriminant analysis based on parameters relating to the extraction decision, significantly different effects (P = 0.04) on the vertical dimension were seen according to premolar extractions: a small reduction in the SN-GoGn angle was found in the extraction group, but a small increase in the non-extraction group [19].

Even if statistically significant, the effect of four premolar extraction treatment on the vertical cephalometric variables seems to be of questionable clinical importance due to its small magnitude, which falls within the range of systematic error or small baseline differences. Also, data from many studies indicate that treatment with four premolar extractions does not actually affect the vertical skeletal pattern [20,21,22,23,24,25,26]. It seems that the majority of evidence in support of extraction treatment to control or reduce the vertical dimension originates from older publications, which did not directly investigate the effect of extractions and/or were case reports or opinion papers [4, 27, 28].

With respect to the decision of which teeth to be extracted, only the extraction of four second premolars or molars led to a counterclockwise mandibular rotation, while extraction of four first premolars had no significant treatment effect on the mandible in a prospective investigation of 15 years olds with skeletal open bite [29]. On the other hand, no difference in the vertical facial dimension could be identified between first and second premolar extraction treatment in patients with skeletal hyperdivergence in other studies [30, 31]. On the contrary, in the present study, compared to non-extraction therapy, a slight decrease in the anterior facial height ratio was observed only for the extraction of four first premolars, but not for the extraction of four second premolars. It should be noted, however, that in the present study, 53% of patients in the extraction group was treated with two extractions of the first and two extractions of the second premolars. Since the sample was collected irrespective of anteroposterior discrepancies, different anchorage needs in the sagittal dimension most probably might have dictated which teeth to be extracted.

The strengths of the present study include the a priori study design with hypotheses defined in advance to ensure appropriateness of the study population, selected outcomes, and covariates [32]. Furthermore, the sample size was calculated in advance to ensure enough power and prevent inflated results. The variety of orthodontists and clinical instructors can enhance generalizability of the present results. Finally, blinding at the statistical and interpretation level could reduce potential bias and the openly available dataset increases transparency. On the other hand, this study certainly presents some limitations primarily relating to the retrospective study design. First, retrospective nonrandomized studies have been shown to present inflated intervention effects in orthodontic research [33]. Second, apart from the general treatment protocol of the clinic, each patient’s treatment was carried out in accordance with each patient’s needs. Although treatment appliances were controlled for their effect in this study, a prospective study design could have better ensured standardization of treatment procedures. In that context, the only statistically significant difference between the groups regarding treatment appliances was the lingual arch, which was used more in the extraction than non-extraction group and might have prevented molars from tipping into the extraction sites [34].

Conclusions

In children with skeletal hyperdivergence, minimal to moderate crowding, and normal incisor inclinations, it seems that orthodontic treatment with extraction of four premolars is not necessarily associated with improved facial height compared with non-extraction treatment.