Introduction

Technological advances in the orthodontic practice led clear aligner treatment (CAT) meet the demand for a minimally visible and customized treatment modality; however, it is still one of the main topics of scientific research of how to maximize treatment efficiency [1, 2]. Orthodontic treatment without the use of bands, brackets, and wires was first described in 1945 by Kesling [3] who used a flexible tooth positioning appliance to move the teeth. Following his philosophy, the Invisalign® system (Align Technology Inc, Santa Clara, CA, USA) took it further by using computer-aided design and computer-aided manufacturing (CAD-CAM) technology to produce a series of transparent and removable appliances [4, 5]. This new treatment approach offered the advantage of improved aesthetics, increased patient comfort, and better oral hygiene when compared to the conventional fixed mechanics [1, 6, 7]. In this system, digital scans are converted into virtual models via stereolithographic (STL) technology and processed with the ClinCheck™ software (Align Technology Inc, Santa Clara, CA, USA) to simulate virtual tooth movements and to plan where, when and how much interproximal reduction (IPR) is needed [4, 8, 9]. During the process, IPR is planned according to the clinical case requirements such as the amount of crowding, Bolton excess, molar and canine relationships, and overjet to achieve well-aligned teeth with optimal interproximal contacts [1, 4].

The success of CAT relies on various factors. These are patient-related factors such as compliance to the appliances, bone turn-over rate, and crown and root morphology of the teeth, operator-related factors such as accurate execution of the planned IPR, and an appropriate and realistic treatment plan, and mechanical factors such as the shape and position of the attachments, and material and thickness of the clear aligners [10,11,12,13,14]. Furthermore, the malocclusion, tooth movements required to address it, and the type of tooth to be moved are also influential on the final result.

IPR is used in the orthodontic practice to reduce the mesiodistal width of a tooth in order to resolve crowding and eliminate Bolton discrepancy, to treat black triangles by reshaping and approximating neighboring tooth, and, in the case of CAT, to provide teeth the space to carry out planned tooth movements [15,16,17,18,19,20]. Clinically, the most preferred IPR techniques include thin diamond burs or diamond-coated discs used with a handpiece, and hand- or motor-operated abrasive metal strips [15, 17]. Diamond-coated metal strips adapt easily to the proximal contours of the teeth and bend without deformation owing to its flexible nature while providing optimum tactile control and protection for the lips and cheeks [21]. Although metal strip systems are claimed to offer more precise IPR, they require longer chair-time when compared to the motor-operated systems. On the other hand, motor-operated oscillating segmental discs are designed to be one-sixth the size (60°) of a standard disc and remove the enamel by making oscillating movements with a pivot angle of 30°. Like metal strips, oscillating segmental discs are also shown to protect soft-tissues which eliminates the need for lip and cheek protectors [22].

Precise execution of IPR is considered to be crucial for proper fitting of the aligners and complete realization of digitally planned tooth movements in CAT. However, there is a lack of reliable evidence on the accuracy of different IPR methods. Therefore, the aims of this study were to compare the consistency between planned IPR and executed IPR during CAT using 3 different IPR methods; hand-operated diamond strips, motor-driven 3/4 oscillating segmental discs, and motor-driven abrasive strips, and to evaluate patient perception of discomfort and anxiety with these methods.

The null hypothesis was that there is no statistically significant difference between the planned and executed amounts of IPR, regardless of the method.

Materials and methods

This prospective clinical study was approved by …Başkent University Institutional Review Board and Ethics Committee (Project no: D-KA 21/13) and supported by …Başkent University Research Fund. Inclusion criteria were (1) patients receiving Lite, Moderate or Comprehensive Invisalign® treatment packages (Align Technologies Inc, San Jose, CA, USA) with IPR prescription, (2) patients presenting mild to moderate crowding, and (3) full permanent dentition without impacted, missing or supernumerary teeth. Exclusion criteria were patients who (1) were candidates for extraction treatment, (2) received any dental procedure during the treatment which altered the mesiodistal width of the teeth other than IPR, (3) presented active periodontal disease, and (4) had undergone orthodontic treatment previously.

Sample size calculation performed with 80% power to detect an outcome of 0.5 mm of difference between the planned and the executed IPR per arch, with a significance level of 0.05 and 10% potential drop-out suggested that 42 patients should be included in the study [1]. Patients were randomly assigned to one of the 3 IPR groups in the order of starting treatment. Patients in group 1 (n = 14, 150 teeth) received IPR with hand-operated abrasive strips (ContacEZ, Ortho Classic®, Vancouver, WA, USA), patients in group 2 (n = 14, 134 teeth) received IPR with motor-driven 3/4 oscillating segmental discs (KOMET, Sterisafe® A6, Rock Hill, SC, USA), and patients in group 3 (n = 14, 133 teeth) received IPR with motor-driven abrasive strips (SWISS, Orthofile, Swiss Dentacare®, Switzerland) (Fig. 1).

Fig. 1
figure 1

A Hand-operated abrasive strips; B motor-driven 3/4 oscillating segmental discs; C motor-driven abrasive strips; D interproximal gauge

IPRs were planned to be performed when contacts were easily accessible using the ClinCheck™ software, and all IPRs were performed by the same experienced orthodontist (…A.A.Ö.) in a single center. The amount of crowding at the beginning of treatment and linear displacement between the contact points of the anterior teeth (Little’s Irregularity Index (LII)) at the session of the first IPR were calculated (MeshLab ver. 2022.02, ISTI-CNR, Rome, Italy) [23]. A metal interproximal gauge (KOMET, Rock Hill, SC, USA) was used to quantify the amount of stripping by entering the IPR site parallel to the long axes of the neighboring teeth without exerting pressure and pushing the teeth away from each other.

Scans were taken at the beginning of treatment (T0) and at the end of first set of aligners (T1) using an intraoral scanner (iTero Element 5D, Align Technologies Inc, San Jose, CA, USA). Prior to the scans, patients were asked to brush their teeth if dental plaque was evident and teeth were dried thoroughly to remove any kind of remnants which may lead to false readings.

Mesiodistal width of the teeth was noted using the arch length-tooth size discrepancy table (Bolton function) of the ClinCheck™ software at T0 and T1. It was assumed that prescribed IPR was carried out equally on either neighboring tooth. The difference between T1 and T0 gave the exact amount of executed IPR. Accuracy of IPR was assessed by calculating the absolute mean difference between planned and executed IPR, where absolute values prevented losing data. The reliability of the arch length-tooth size discrepancy table of the ClinCheck™ software was validated by using T0 and T1 widths of the teeth that were not subjected to IPR (n = 315). As these teeth were kept intact throughout the treatment, both measurements were expected to be the same and ICC value equal to 1.

Patients were asked to complete an anonymous questionnaire, on the session of the first IPR, to evaluate discomfort and anxiety levels induced by the IPR methods. The questionnaire statements were taken from validated questionnaire studies written in the native language of the patients [24, 25]. The finalized questionnaire consisted of 17 questions, 2 for demographic data, and 15 for the evaluation of discomfort and anxiety (Supplementary material). Of these 15 questions, 3 were yes-or-no questions, and 12 were rating scale questions on a 100-mm visual analog scale (VAS). Cutoff points for rating scale questions were determined as follows; none (0–4), mild (5–44), moderate (45–74), and severe (75–100) [26]. Questionnaire statements were validated on a representative sample of 20 before commencement of the study.

Statistical analysis

Statistical analyses were performed using SPSS software package (version 22; IBM, Armonk, NY, USA). Shapiro–Wilk test showed that the data was non-normally distributed. Mann–Whitney U, Kruskal–Wallis H, Wilcoxon signed rank, and Chi-squared tests were used for comparisons of the data derived from the clinical step of the study, and Kruskal–Wallis H and Spearman’s signed rank tests were used for the results of the questionnaire.

Results

A total of 42 patients and 417 teeth were included in the data analysis. The ICC value calculated for the reliability of the arch length-tooth size discrepancy table of the ClinCheck™ software was found to be 0.996 with good repeatability. The mean difference between T0 and T1 readings was − 0.09 mm (median, − 0.07 mm). The Cronbach’s alpha was 0.83 for the questionnaire statements.

Demographic characteristics, distribution of the malocclusion type, the amount of crowding at the beginning of treatment, and LII values at the session of the first IPR were found similar between the groups (Table 1).

Table 1 Demographic characteristics of the groups

Table 2 shows the comparison between planned and executed IPR within the groups. The overall average value of executed IPR was significantly less than the planned amount in group 1 (p = 0.003), yet similar in groups 2 (p = 0.511) and 3 (p = 0.659). On quadrant basis, executed IPR in the upper left quadrant in group 1 was significantly less than the planned amount (p = 0.036); however, it was significantly more in the upper right quadrant in group 2 (p = 0.021).

Table 2 Comparison of planned IPR (mm) and executed IPR (mm) within groups

Table 3 presents the comparison of accuracy of the IPR methods by means of the average difference between planned and executed IPR. According to this, the accuracy of the IPR method in group 1 was significantly low for teeth 11, 21, 32, 33, and 43 (p = 0.021). Table 4 presents the accuracy of IPR between the arches (upper/lower), sides (left/right), and groups of teeth (incisors/canines/premolars), where no significant difference was evident.

Table 3 Comparison of the accuracy of execution of planned IPR (mm) within and between the groups
Table 4 Comparison of the accuracy of execution of planned IPR (mm) according to arch, side, and group of teeth

The results of the questionnaire showed that patients’ discomfort and anxiety levels were similar between the groups regardless of the IPR method; however, scores in group 1 were slightly higher than the other 2 groups in general. Patients who were priorly informed about the procedure were less likely to make research (Table 5). Moreover, correlation tests revealed that a negative correlation existed between age, and the severities of pain (r =  − 0.548, p = 0.042) and tooth sensitivity (r =  − 0.540, p = 0.046).

Table 5 Summary of the questionnaire results

Discussion

IPR is an essential component of CAT which creates room for teeth to perform digitally planned movements and has to be executed precisely in order to achieve the desired outcome. Therefore, the primary aim of this study was to investigate the consistency between planned and executed IPR for 3 IPR methods including hand-operated abrasive diamond strip system, motor-driven oscillating segmental discs, and motor-driven abrasive strips, in patients receiving CAT with the Invisalign® system. The secondary aim was to assess discomfort and anxiety levels induced by these methods. This study stands out among the others in the literature with regard to its in vivo nature and prospective design which made standardization of the interventions possible. Furthermore, all IPRs were executed by an experienced orthodontist to minimize the risk of uncontrolled IPR, especially with the motor-driven methods that would lead to over-stripping and distortion of the contact surface morphology. This is also the first study to assess patients’ perceptions and experiences with different IPR methods during CAT.

The null hypothesis is rejected. The overall executed IPR was less than the planned amount with the hand-operated abrasive strip system (ContacEZ), yet similar with the motor-driven systems (KOMET and SWISS), even though the amount of contact displacement was similar between the groups prior to the first IPR. These findings are in line with De Felice et al.’s [1] who showed that manual strip system fell short to fully execute the planned IPR. Although this system is claimed to provide a more reliable and precise stripping process due to the flexibility of the strips, they have to be used incrementally from thinner to thicker strips, and the clinician needs to enter the same region iteratively with a significant pressure [8, 16, 17]. This may lead to displacement of the tooth in its socket and lead to a false reading on the interproximal gauge. Furthermore, it is clinically more tiring and time consuming, especially when a marked amount of IPR is planned, which may give the clinician a false impression that the targeted amount is reached. On the contrary, motor-driven IPR methods are shown to provide more effective stripping than hand-operated methods by means of speed and the need for less amount of pressure entering the contact points [27,28,29].

Executed IPR was significantly deficient on the mandibular canines with the hand-operated abrasive strip system, which is in line with the findings of Kalemaj and Levrini [16], because canines are in tight contact with the adjacent teeth and are frequently pushed out of the dental arch in the case of crowding, which makes it challenging to precisely implement the planned IPR [30]. Furthermore, mandibular teeth are usually in a more crowded state than the maxillary teeth, which is supported by the findings of the present study showing that IPR was dominantly carried out on the mandibular arch (61.6%) when compared to the maxillary arch (38.3%) that was also documented by Hariharan et al. [31].

Executed IPR exceeded the planned amount on the upper right quadrant with motor-driven oscillating segmental discs and could not reach the planned amount on the upper left quadrant with the hand-operated abrasive strip system; however, the overall amount of executed IPR was similar between upper and lower arches, as well as left and right quadrants when the sample was evaluated regardless of the IPR system.

Johner et al. [14] evaluated the accuracy of 3 different IPR methods (hand-operated abrasive strips, motor-driven oscillating discs, and motor-driven abrasive strips) on extracted premolar teeth with an in vitro study design. They showed that the amount of IPR was generally less than expected in all 3 groups. Although their results are in line with the abrasive strip group in the present study, their motor-driven IPR groups present conflicting results with ours. These differences seem to arise from the study designs, one being conducted in an actual biological setting with teeth surrounded by its periodontal ligament, differing in crown morphology and in a crowded state, and the other in an artificial setting with only premolar teeth mounted in silicone to mimic the periodontal ligament.

Based on the findings of this study, motor-driven methods have proven to be more effective when compared to the hand-operated ones by means of precision, speed, and patient comfort. However, if the clinician favors a hand-operated method, it may be advised to perform slightly more IPR especially on mandibular canines and maxillary central incisors. Also, a brief preliminary alignment phase grants easy access to the contact surfaces which is believed to increase the accuracy of IPR with less damage to the tooth shape and contact surface morphology. Overall, a precisely executed IPR in clear aligner treatment is the way to complete realization of the planned tooth movements and ultimately to a successful treatment.

According to the results of the questionnaire, patients were mildly anxious about IPR before the procedure, and their anxiety levels decreased after. A negative correlation existed between age and anxiety and discomfort levels. On the other hand, patients reported similar levels of pain and discomfort with both hand-operated and motor-driven IPR methods, which was unexpected because motor-driven IPR is thought to be more perturbative for the patients. Considering that motor-driven methods provide more accurate IPR, and that patients’ anxiety and discomfort levels are similar with the hand-operated method, motor-driven methods can be preferred over hand-operated methods.

Today, websites and social media platforms are frequently referred by the patients for healthcare research. However, the lack of regulations and the fact that any individual can upload a content pose the risk of misinformation [32]. Results of the questionnaire showed that patients who were priorly informed about the IPR procedure tended to make less research. This means that patients can be kept away from the information pollution on the internet and the treatment can be led by the orthodontist in a more reliable environment.

Limitations

Although the arch length-tooth size discrepancy table of the ClinCheck™ software is claimed to be prone to measurement errors, its repeatability was found to be very good which shows that the quantitative uncertainty was very low. There is some margin of error in all measurement methods (manual or digital), but the system can be accepted as reliable as long as this does not significantly alter the expected effect [16, 33, 34]. Blindness, on the other hand, is another favorable feature of the software.

The demonstrated differences between the IPR methods are statistically significant; however, they may not be indicative of clinical significance.

Conclusions

  • The overall results of this in vivo study showed a discrepancy between planned and executed IPR with the hand-operated abrasive strip system, tending to produce less enamel reduction than planned.

  • The hand-operated abrasive strips fell short to fully realize the planned amount of IPR, especially on the maxillary central incisors and mandibular canines.

  • The consistency of IPR with motor-driven oscillating segmental discs and motor-driven abrasive strips was high.

  • Although the difference between the hand-operated and motor-driven methods was statistically significant, it may not be clinically relevant.

  • According to the results of the questionnaire, both motor-driven and hand-operated methods caused similar levels of discomfort.