Introduction

Mid-facial growth is an important outcome measure when evaluating cleft lip and palate treatment. In patients with a unilateral cleft lip and palate (UCLP), normal maxillary growth is often impeded, resulting in a relative retrusion of the mid-face [13]. As untreated cleft patients often show a normal mid-facial growth potency, iatrogenic changes induced by the surgical treatment are likely the greatest cause for maxillary hypoplasia [36]. Delaying hard palate closure is therefore believed to minimize mid-facial growth interference as a larger portion of maxillary growth is already established. Nevertheless, delayed hard palate closure may in turn lead to less favourable speech results. To circumvent this dilemma, a two-stage approach was firstly introduced in the 1950s [7]. It was initially believed that early soft palate repair allows for adequate speech development while secondary delayed palate repair reduces the degree of growth restriction. In addition, the soft palate closure is thought to approximate the palatal shelves, avoiding the need for extensive palatal dissection at palate closure [8]. However, later studies investigating the potential mid-facial growth benefit of delayed hard palate closure showed contradictory results [4, 9]. The correlation between timing of closure and subsequent craniofacial growth therefore seems to be less evident. Other factors than timing of surgery are of influence and should be taken into account [4, 9].

Because the effects of surgery become increasingly apparent as patients mature [3], long-term assessment is essential to make a comprehensive evaluation of a treatment protocol and to identify the different factors affecting facial growth. So far, few long-term studies are available.

The objective of the present study is to evaluate and compare long-term mid-facial growth after two-stage palatoplasty in an adult group of UCLP patients with hard palate closure at the age of 3 years. Evaluation will be carried out by cephalometric analysis and results will be compared to the previously described treatment protocols, including those of the Eurocleft study.

Methods

Patients

Patients were selected from our cleft database at the Wilhelmina Children’s Hospital. We analysed all medical files of cleft patients that had been invited for a long-term assessment of their treatment (n = 148). This long-term multidisciplinary assessment was implemented since 2008 for all cleft patients of 17 years and older, who had cleft treatment at our hospital. Each medical file was scanned for surgical data and medical history, including the type of cleft. Only patients with a complete cleft lip and palate, including a complete cleft of the alveolus were considered for analysis. Furthermore, cleft repair had to comprise two-stage palatoplasty performed by one of the two surgeons of the Wilhelmina Children’s Hospital at that time. Patients with additional anomalies (n = 4), Simonart’s bands (n = 1), non-Caucasian ethnicity (n = 6), partial treatment elsewhere (n = 34), incomplete information regarding the timing of surgery or treatment according to a different protocol (n = 25) were excluded from the present study.

Out of the 148 patients that were invited for a last follow-up, 78 met the inclusion criteria for this study. From this group, 52 patients eventually attended follow-up (67 %). Out of the 26 patients that did not attend, 9 indicated that they were not interested in follow-up or unable to attend. The remaining patients were lost to follow-up either due to non-response or incorrect contact details (n = 17, 22 %). Factors that may have contributed to this loss are the lack of electronic files and the lack of standard follow-up after completion of the orthodontic treatment (before 2000). Lastly, the hospital changed location in 1999, which also might have led to the loss of up-to-date contact details.

The follow-up consultations took place between 2008 and 2014. Patient characteristics are described in Table 1. In order to ensure that the attended group (n = 52) was representable for the whole UCLP group (n = 78), we compared the treatment variables of the attended with the non-attended group in Table 1. We did not observe any significant differences between the two groups in terms of surgical timing or incidence of secondary surgeries.

Table 1 Patients’ characteristics baseline table

Surgical and orthodontic protocol

Surgeries were performed by two plastic surgeons from the Wilhelmina Children’s Hospital specialized in cleft lip and palate surgery. The Utrecht treatment protocol used is summarized in Table 2. The mean age of patients at each surgical intervention are shown in Table 1.

Table 2 Overview of the treatment protocols

Patients presenting with persisting velopharyngeal insufficiency despite additional speech therapy directed at improvement of velum mobility, were offered a subsequent pharyngoplasty. Speech enhancing surgery was performed according to the modified Honig technique [10].

Orthodontic treatment was carried out by our orthodontist (E.E.), also experienced in cleft care. Pre-surgical plates were only applied in case of significant feeding problems or tongue thrust during infancy and therefore not used as standard. Orthodontic treatment was started at least 6 months before alveolar bone grafting and resumed 6 months after the surgical intervention. Removable appliances were used to widen the maxilla establishing a normal transversal occlusion prior to alveolar bone grafting. A fixed orthodontic appliance was used to correct the vertical plane of the central incisors. The transverse expansion was maintained for at least 6 months after alveolar bone grafting. After these 6 months, the fixed appliances were reapplied to establish the best possible occlusion. Facial masks were not used in our protocol.

Cephalometric analysis

Standardized lateral cephalograms were obtained on the day of long-term follow-up. If the patient had a history of orthognathic surgery, the pre-operative cephalogram was analysed in this study. The Orthophos XG 3® (Sirona group, Salzburg, Germany) was used for imaging. Each cephalogram was made in natural head position with teeth occluded. Images were stored as a DICOM file and subsequently exported to Viewbox 4.0, a software program for cephalometric analysis (dHAL Software®, Athens, Greece, 2014). Each image was rescaled before analysis. The first stage of analysis involved determining 12 landmark points on each cephalogram. These landmark points were used to determine 5 reference lines, from which 10 angles and 1 ratio variable (Fig. 1) could be calculated. The calculated values included the angles used by the Eurocleft study, Nollet et al. and Friede et al. [2, 11, 12]. In order to calculate the inter- and intra-observer agreement, all cephalograms were scored twice by a maxillofacial trainee (G.B.) and a medical student (I.K.) under the same conditions, with an interval of at least a week. The cephalometric variables were compared to normal values described by Thilander et al. [13].

Fig. 1
figure 1

Skeletal reference points and reference lines. Reference points: A subspinal, deepest anterior point in the concavity of the anterior maxilla. Ai apex inferius, apex of the root of the most prominent lower central incisor. Ans anterior nasal spine, most anterior point of the anterioposterior profile of the upper jaw. As apex superius, apex of the root of the most prominent upper central incisor. B supramental point, deepest anterior point in the concavity of the anterior mandible. Gn gnathion, the most inferior point on the mandibular symphysis. N nasion, most anterior point of the frontonasal suture. Pg pogonion, most anterior point on the mandibular symphysis. Pns posterior nasal spine, most posterior point of the bony palate defined by the junction of the hard palate, the soft palate and the extension of the pterygomaxillary fissure. S sella, the centre of the sella turcica. Skeletal reference lines: ILI axis of lower incisors, line through ai and as. ILS axis of upper incisors, line through as and is; ML mandibular line, the tangent to the lower border of the mandible through the lowest point of the mandibular symphysis (Gn). NL nasal line or palatal plane, the line through Ans and Pns. NSL nasion-sella-line, line through the central sella (point S) and nasion (point N)

Statistical analysis

Statistical analysis was carried out using SPSS (IBM SPSS Statistics 20.0, IBM Inc., NY, USA) and RStudio (©2009–2015 RStudio, Inc., Boston, USA). A p value below 0.05 was considered as significant. The chi-square test or independent sample t test was used to compare patient groups where relevant. The Mann-Withney U test was used to compare median values.

Cephalometric analysis was repeated multiple times in order to calculate the inter- and intra-rater variability. For statistical analysis, we used the mean value of the angles obtained during the different scoring sessions. The pre-operative cephalogram was missing in 5 of the 11 patients that had orthognathic surgery (9.6 % of total). To prevent a possible bias, we predicted and imputed the values for these missing cephalograms using a multiple regression analysis. Known values of baseline characteristics, surgical history, and the timing of each surgical procedure were used as predictors in the model. The procedure was repeated five times, resulting in 5 separate datasets. From these datasets, a pooled mean estimate was derived for each of the missing cephalometric values.

The observed mean cephalometric values were subsequently compared to normal values [13] and the mean values of previous studies including those of the Eurocleft study [2, 11, 12]. The one-way ANOVA test was used to compare all means [2, 11, 12]; contrast tests were used to calculate the mean differences between each study and to determine the statistical significance of these differences. Given the multiple comparisons, a modified Bonferroni correction was applied to calculate an adjusted p value after each one-way ANOVA test in order to maintain higher statistical power [14].

In the study of Thilander et al., normal values were originally described for men and women separately [13]. In order to maintain statistical power, a pooled mean value for both sexes was calculated and used for comparison according to the method described above. The pooled standard deviations were calculated using the following formula: pooled sd = √ ((S1 × (n1 − 1)) + (S2 × (n2 − 1))/((n1 − 1) + (n2 − 1)) (S1 = variance for women (variance = sd^2), n1 = total number of women, S2 = variance for men (variance = sd^2), n2 = total number of men ).

The interclass correlation coefficient (ICC) was used to calculate the intra-observer and inter-observer reliability of the cephalometric analysis. ICC values above 0.75 are considered as excellent, values between 0.40–0.74 are fair to good and values below 0.40 are considered as poor [15]. In addition, reliability was measured by calculating the difference, 95 % limits of agreement, and error of the method as described by Bland and Altman [16]. Error of the method is therefore calculated using the following equation: √(sd^2/n).

Ethical approval

The Medical Ethics Committee of the University of Utrecht approved the protocol (14/416) and methods used for this study which was performed in accordance with the 1964 Declaration of Helsinki and its later amendments.

Results

Measurement reliability

The intra- and inter-observer analyses demonstrated good intra-class correlation coefficients for all obtained cephalometric values, varying from 0.795 to 0.977 and from 0.725 to 0.983, respectively (Table 3). The inter-observer differences and limits of agreement were greatest for the dentoalveolar values and small for the SNA, SNB and ANB values (Table 3).

Table 3 Agreement of the cephalometric values

Comparison to normal values

The mean cephalometric values were compared to the pooled normal values in Table 4 and shown in Fig. 2. The maxillary angle (SNA) and the maxillomandibular angle (ANB) were significantly smaller and negative compared to the normal population, indicating maxillary hypoplasia. Both the vertical maxillary inclination and mandibular inclination were significantly increased (NSL-NL, NSL-ML). We also observed an increase of the ILS-NA angle and more obtuse inter-incisor angle (ILS-ILI).

Table 4 Comparison of the presently obtained values to normal values
Fig. 2
figure 2

Composite tracing based on the mean cephalometric values of the studied cohort. Reference points are explained in the legend of Fig. 1

Comparison to previously reported long-term values after cleft treatment

Our mean cephalometric values are compared to previous studies in Table 5 and presented in Fig. 2. An example of a cephalogram can be seen in Fig. 3.

Table 5 Comparison of cephalometric values with previous studies
Fig. 3
figure 3

A lateral cephalogram made in natural head position with teeth occluded, scaled and exported to Viewbox

Eurocleft Centre D showed a significantly less favourable mean SNA angle. [11] The NSL-NL angle was also increased. Mandibular angles were comparable to the present study (NSL-ML, SNPg). Vertical facial proportions were more harmonious in Centre D, Centre A and the Centre of Nijmegen. (N-Ans/N-Gns ×100 %).

The maxillary (SNA) and maxillomandibular (ANB) relationships of Centre A and of Nijmegen, both applying two-stage closure, were comparable to the current study. Further, the group treated according to the Gothenburg protocol showed better maxillary (SNA) and mandibular relationships (SNPg, NSL-ML) compared to this study.

All Eurocleft Centres showed significantly reduced angles between the upper incisor and palatal plane (ILS-NL) resulting in a more obtuse interincisor angle (ILI-ILS).

Discussion

This study reports the long-term craniofacial morphology in UCLP adults after delayed hard palate closure performed at the age of 3 years. Compared to the general population, the present cohort observed mid-facial growth differences recognized as typical characteristics of individuals with a repaired unilateral complete cleft lip and palate (UCLP). [11, 12]. The maxillary and mandibular angles SNA and SNB were reduced and we observed a more retruded position of the maxilla in relation to the mandibula (ANB) (Table 4). Further, a more obtuse gonial angle and steeper mandibular plane (NSL-ML, NL-ML) was observed in the current cohort. General consensus is that these growth differences are both a direct and an indirect result of cleft palate surgery [35]. Scar tissue along the dental arch may also cause a significant deviation of the dentoalveolar process resulting in dental malocclusions and often retroclined incisors [17]. The observed mean ILS-NA angle and inter-incisor angle (ILI-ILS) deviated from the normal values, likely due to an increased upper incisor inclination as a result of orthodontic treatment, and a more retroclined position of point A, which is often seen in cleft individuals. We therefore found a significant growth impairment despite delaying the hard palate closure until the age of 3 and performing the osteoplasty relatively late.

When comparing our cephalometric values to previous long-term results, the observed maxillary (SNA) and maxillomandibular (ANB) relationships were similar to studies applying both one- (Eurocleft Centres B, E, F) and two-stage closure (Eurocleft Centre A, Nijmegen) [2 11, 12] (Table 5). Although there was a trend of increased maxillary retrusion in relation to the mandibula (ANB) in centres applying early palate closure (Centres D and F), this difference did not reach statistical significance. This lack of difference might however be related to the small patient groups, resulting in low statistical power [11]. It is questionable whether the timing of surgery contributed to this trend, as other one-stage centres of the Eurocleft study show relatively better outcomes (Centres B, E). The outcome of Center D may have been affected by the inconsistency of their protocol and participation of low volume surgeons. Similarly, the moderate results of Centre F likely resulted from the implementation of primary bone grafting during lip closure (Table 2) [1820]. A review of long-term studies after two-stage closure found that disappointing growth outcomes after two-stage closure can often be explained by surgical variations or other factors within the treatment protocol [21]. Although some studies found better growth results after delayed closure [22], a direct correlation between timing of closure and growth outcome seems to be lacking and factors other than timing should be taken into account [9, 23].

The amount and location of scar tissue may have a greater influence on mid-facial growth than the specific age of hard palate closure during the first decade of life. Several studies have attributed the decreased maxillary growth after a two-stage protocol to increased scar tissue formation [24, 25]. Scar tissue around the sutures, such as the vomero-premaxillary suture, can restrict the forward and downward expansion of the maxilla [17, 26, 27]. A significant proportion of the final length of the maxilla is gained during the maxillary growth spurt. According to cephalometric analysis of the general population, this increase in growth velocity takes place at the age of 6–10 years in girls and 8–14 years in boys [13]. Delayed hard palate closure at the age of 3 years might therefore be too early as post-operative scar tissue can still interfere significantly with growth. This may also explain why Schweckendieck obtained such good results after hard palate closure at the mean age of 13 (range 8–22 years), after the previously described growth spurt [28]. The growth benefit of delayed hard palate closure may therefore only be achieved when closing at a significantly later age, when the greatest proportion of the final maxillary length is already achieved.

The extent of scar tissue formation is influenced by multiple factors such as surgical skill and experience, a patient’s inherent propensity for scar formation [3] and the amount of secondary surgeries. This study found a high incidence of revision cheiloplasty (46 %), pharyngoplasties (42 %) and fistulas (27 %) needing surgical closure. Previous studies identified a significantly higher rate of secondary procedures (including pharyngoplasties) in patients with poor growth outcomes or patients needing orthognathic surgery [24, 29, 30]. Similarly, the worst scoring Eurocleft Centre D had the highest number of surgeries per patient (6.0 surgeries compared to 4.4 and 4.8 in Centres E and A, respectively) [31]. Extensive fibrosis following primary surgery might contribute to the need for secondary surgeries (a pharyngoplasty due to a rigid velum) as well as to the development of maxillary hypoplasia. The observed correlation between the number of surgeries and maxillary growth outcome may therefore not be causal. Attempt to minimize scar formation during primary surgery should be considered an important goal. For this reason, expertise, skill and caseload of the surgeon might also have a great influence on long-term results after cleft treatment. Unfortunately, the actual contribution of each surgical and non-surgical treatment factor to impaired mid-facial growth is difficult to quantify because of their interplay and collective action.

Orthodontic treatment to optimize dental occlusion is an important non-surgical treatment factor. Of significance in the present results, the mean inter-incisor angle differed substantially from all Eurocleft Centres except for Centre A [11]. (Table 5) The inter-incisor angle in our group was however still significantly smaller compared to normal values. (Table 4) This smaller inter-incisor angle is likely caused by a more pronounced upper incisor proclination. Liao et al., reported previously that early closure of the palate resulted in more retroclined incisors [32]. However, these incisors tend to procline over time due to a dento-alveolar compensation mechanism in response to the unfavourable arch relationships [32]. Further, as the von Langenbeck procedure limits the amount of scar tissue adjacent to the anterior alveolus, this technique is believed to facilitate the compensatory anterior incisor’s adjustment [19, 32]. Orthodontic treatment may in turn enhance this inclination, resulting in overcompensation and steep incisor angles. The underlying cause for this compensation, maxillary hypoplasia, however remains present. Our findings may therefore be explained by the rigid orthodontic treatment that was followed in Utrecht, while maintaining a conservative approach towards orthognathic surgery. According to Good et al., the decision for performing a maxillary advancement is based on subjective criteria and possibly influenced by cultural differences in patient expectations and surgical preferences [29]. The decision to perform orthognathic surgery will therefore vary amongst centres. The studied patients were possibly reluctant to undergo orthognathic surgery, which may have been influenced by the opinion of our orthodontists at that time. Of note, the 95 % limits of agreement were more dispersed for the dentoalveolar values making the error of the method larger for these measurements.

In summary, heterogeneity of treatment protocols and multifactorial influences on treatment outcomes pose a challenge for clinical cleft lip and palate research. Focus has previously been on the timing of cleft closure, however as these results suggest surgical, non-surgical and patient factors all play a role.

Limitations

In this retrospective study, quality of data partly depends on the accuracy of medical records. Looking back as far as 30 years, some data could not be retrieved and the cases within this study were not consecutive. This increases the risk of selection bias. Further, as cephalometric normal values for the Dutch population are lacking, Swedish normal values were used as they are geographically the most closely related. However, mid-facial growth patterns may vary amongst Caucasian populations [33] and we cannot ensure that mid-facial growth in the Swedish population exactly relates to the growth patterns of the Dutch.

An uncertainty was implemented in the data by using the technique of multiple imputation to insert missing values (n = 5). The currently described mean values were therefore based on pooled estimates. However, as many predictors were used in the imputation model and the procedure was repeated five times before calculating the pooled estimates, this uncertainty could be considered negligible. Removing five cases that contained missing values would have further reduced the statistical power and might also have introduced a bias. In addition, the patient cohorts described in the previous studies and by Thilander et al., were sometimes small (varying from 20 to 50) [2, 1113]. and therefore already had a decreased statistical power.

Despite a possible difference in cephalometric values between male and female groups, we did not analyse these groups separately as it would again affect the power of this study. The mean values for males and females are however separately reported in Table 4.

Although the patient characteristics of those that did or did not attend follow-up was not significantly different, (Table 1), patients with a more intensive treatment or less favourable outcome were possibly more inclined to attend follow-up. This might have led to an underestimation of results.

Conclusion

This study describes the long-term craniofacial morphology in adults treated for an UCLP with hard palate closure at a mean age of 3 years. The mean maxillary angle SNA and mandibular angle SNPg were comparable to previous studies both applying early and delayed hard palate closure. The observed upper incisor proclination is likely caused by orthodontic overcorrection in response to the unfavourable jaw relationships. No clear growth benefit of this protocol could be demonstrated.

The cause for impeded mid-facial growth after cleft surgery is multifactorial. In future treatment protocols, emphasis should not solely be on the timing of surgery but also on the minimization of palatal scar tissue in order to reduce growth disturbances later in life. The high rate of secondary surgeries and resulting scar tissue formation is possibly one of the contributing factors to the moderate growth results in our group.