Introduction

Tooth wear and its management present new challenges in dentistry [1]. Tooth wear is increasingly common, especially among children and adolescents [2, 3]. The loss of occlusal tooth substance in tooth wear may result in a reduced vertical dimension of occlusion (VDO) and altered shape of teeth. Patients may need complex restorative treatment such as a full rehabilitation at an increased VDO [4,5,6,7]. Changes in tooth shape, both due to wear and its restorative treatment, may have an effect on speech.

Speech is the ability to communicate through articulated sounds. The production of speech is divided into five levels: (1) Respiration, the air flow that is generated by the lungs; (2) Phonation, the production of sounds by the vocal cords; (3) Resonance, vibrations created by the pharynx, the oral and the nasal cavity; (4) Articulation, the formation of sounds by movements of the articulators and (5) Prosody, suprasegmental characteristics like rhythm, intonation and pitch [8]. Tooth abnormalities or missing teeth are most likely to have an effect on articulation. Along with the tongue, the teeth are, for example, involved in the production of /f/ and /v/ and help to produce the frication in sounds like /s/ and /z/ as the breath stream passes over the lower edges of the incisor teeth [9]. Speech sounds which are thought to be most frequently misarticulated after an increase in VDO include /s/ and /sh/, sounds which are produced when exhaled air enters the mouth and is guided by the tongue [10]. Although previous studies have investigated the effect of tooth wear on oral functions [11,12,13], there is to our knowledge little to no research about the effect on speech of a full rehabilitation of the worn dentition with an increase in VDO.

To evaluate speech sounds, the parameter centre of gravity (COG) may be used. This objective parameter quantifies a spectral detail that corresponds to the frequency of the speech sound [14]. A subjective evaluation of speech characteristics as experienced by patients with tooth wear is possible with the use of a questionnaire, in this case the Dutch Speech Handicap Index (SHI) [15].

The aim of this first evaluation study was to explore changes in speech characteristics and self-perceived quality of speech in tooth wear patients, after a full rehabilitation with an increase in VDO. A second aim was to evaluate the possible adaptation in speech characteristics and self-perceived quality of speech at follow-up.

Materials and methods

Patients

Patients with tooth wear referred by general dental practitioners to the Department of Dentistry of the Radboud University Medical Center (Nijmegen, The Netherlands) were eligible for inclusion. The inclusion took place in the period January 2015 till January 2018. Ethical approval was granted before study commencement (NL2014-1252). All patients who were asked to participate agreed and signed an informed consent before entering the study.

Inclusion/exclusion criteria

The following inclusion criteria were applied: (1) age ≥ 18 years, (2) generalized tooth wear with need for a full rehabilitation, (3) full dental arches, with a maximum of one missing tooth in the posterior area, (4) absence of general health problems (ASA-score ≤ 3) and (5) Dutch native speaker.

The following exclusion criteria were used: (1) mouth opening < 3.5 cm, (2) temporomandibular disorders, (3) periodontitis (pockets > 4 mm) and (4) acute dental caries or endodontic problems.

Tooth Wear Evaluation System

The Tooth Wear Evaluation System (TWES) is a modular clinical guideline, which aims at optimising the diagnostic process (both qualifying and quantifying) and improving the selection of proper management strategies regarding tooth wear for the permanent dentition [16]. TWES was evaluated on intraoral 3D scans of the patients’ dentitions. For occlusal and incisal grading an eight-point ordinal scale was used. A three-point ordinal scale for non-occlusal/non-incisal grading was used.

VDO increase and lengthening of upper incisors

To determine the applied VDO and the difference in length of the upper incisors after treatment, intraoral 3D scans, made before and after treatment, were used. Meshlab (http://meshlab.sourceforge.net/) was used to evaluate VDO increase (the difference in height measured at the first molars) and incisal lengthening of the upper central incisors due to the full rehabilitation.

Full rehabilitation of tooth wear

Based on initial examination, an individual treatment plan was made for each patient [4]. The required VDO increase was estimated taking into account the amount of tooth wear at the first molars and the need or possibility to lengthen the anterior teeth. The optimal length of the anterior teeth was determined with an intraoral mock-up in the upper jaw from cuspid to cuspid with direct composite restorations, using the lip-generated smile design [17]. The increase usually lies between 2 and 4 mm in the first molar region. Based on the digital scan and intra-oral mock-up, a digital wax-up was made by a dental technician, and indirect composite restorations (LAVA Ultimate, 3 M™) were milled from a pre-polymerized block [4]. Restorations were placed in multiple sessions, and a final appointment was made to make direct composite veneers (Filtek Supreme XTE, 3 M™) on the upper and lower anterior teeth.

Self-evaluation of speech

To assess the patients’ perspective regarding his/her speech, the Dutch Speech Handicap Index (SHI) [15] was used on three occasions: before treatment (T0), immediately after treatment (T1) and 1 month after treatment (T2). SHI is a self-evaluation questionnaire originally designed and validated to measure the physical, functional and emotional impact on speech due to dysarthria. Since there is no questionnaire available to evaluate speech in relation to tooth wear, SHI was considered the best available option. This questionnaire consisted of two parts. Part 1 contains 15 items, split into three subscales of 5 items each: physical, functional and emotional functions. Each item is scored on a five-point scale (never (0), almost never (1), sometimes (2), almost always (3) and always (4)). The maximum score per subscale was 20; the total overall score was 60. Finally, all answers were summed to obtain an overall score. The second part consists of 3 general questions on which the patient could choose a numeric rating score (NRS) between 0 to 10, meaning 0 no impact, 10 meaning maximal impact. These questions are a summary of the previous questions, but the scoring scale is different. The scores of part 2 are not used in the total score but give an extra impression of the experience of the patient. In the analysis in this study, only part 1 was used for clear interpretation; part 2 was excluded from analysis.

Speech recordings

Recordings of the patients’ speech on the same three occasions: before treatment (T0), immediately after treatment (T1) and 1 month after treatment (T2). The recorded material consisted of the Dutch text ‘De Koning’ [18]. All speech samples were taken with the same Tascam DR-05 recording machine, under standard conditions in a sound isolated room at the audiological center of the Radboud University Medical Center. The recording at T1 took place immediately after the final treatment session, in which no anaesthesia (possibly interfering with speech) was required. The final session took place approximately 1 month after treatment (T2). After every speech recording, the patients filled out the SHI.

Acoustic analysis

The speech sounds evaluated were the /s/, /f/, /v/, /d/, /t/, /m/. The sounds located at the beginning of words were analysed because this position makes it easier and more reliable to isolate them for acoustic analysis. In the text ‘De Koning’, the /s/, /v/, /d/, /t/, /m/ occur 5 times at the beginning of a word, the /f/ 4 times. Using PRAAT software (version 6.0.43) [14], digital samples of the sounds were made. The signals were sampled at 44.1 kHz. The cursor was placed manually at the start and end of each sound using a Hamming window. A PRAAT script was developed to derive the spectral characteristic centre of gravity (COG) expressed in Hz, which meant that the highest value in Hz was taken into account for the analysis. The value of the COG was measured for every appointed sound in the three different recordings. The use of a filter or a pre-emphasis was not deemed necessary because all recordings took place in a sound isolated room.

Statistical analysis

Paired Student’s t test was used to calculate intra-observer reproducibility of the acoustic analysis. Reliability coefficient was calculated as Pearson’s correlation coefficient through analysing 10% of the recordings twice. The duplicate measurement error was calculated as the standard deviation of the differences between round 1 and 2, divided by √2. To analyse a possible non-random difference between repeated measurements, the mean difference (95% CI, p < 0.05) was calculated.

SHI scores were analysed with paired t tests to analyse the overall effect of the treatment and 1 month of adaptation (i.e. comparison between outcomes at T0-T2). To evaluate each question separately, a Wilcoxon signed rank test was used. For all six sounds, the change of COG in the period directly after treatment was analysed using a multiple regression model. As we have multiple observations (4 or 5) within one sound for each patient, a multilevel model was used with a random intercept to allow for the presence of data clustered in patients. To this model, the increase in length of the 2 middle incisors of the upper jaw was added as independent variable. For ease of interpretation, this value was centered around its mean (2.6 mm). Also, for all six sounds, the effect of adaptation on COG (T2-T1) was analysed using a multiple regression model, taken into account the amount of change in COG for the period T1-T0 as independent variable. All statistical analyses were performed with the Statistical package for Social Sciences (SPSS v.24) or R (v.3.4.3).

Results

In total, 24 patients were included in the study. Four patients were not able to go the audiological center due to mobility problems (e.g. wheelchair user), and for three patients recordings could not be used due to poor quality of at least one of the recordings. Therefore, the recordings of 17 patients were included: 14 men and 3 women with a mean age of 41.2 ± 10.4 years. TWES scores are shown in Table 1. For the occlusal/incisal wear, the highest score was 3a in 5 patients, 3b for 11 patients and 4 for 1 patient. The mean VDO increase was 2.7 ± 0.73 mm. The mean lengthening of the central incisors of the upper jaw was 2.6 ± 1.2 mm. The time interval between treatment and 1-month evaluation (T1 to T2) was on average 42 ± 17 days.

Table 1 Distribution of overall occlusal/incisal and nonocclusal/nonincisal tooth wear scores, measured on 3D scans. Not all patients were fully dentate; 2.7% of the teeth were missing

The results of the intra-observer reproducibility analysis for the acoustic analysis are shown in Table 2. The p values of the reliability scores are all > 0.98, indicating no statistically significant differences. Only the sound /m/ showed a significant p value of 0.04.

Table 2 Results of the intra-observer analysis with the program PRAAT

In Table 3, results of the paired t test analysis of the SHI scores were given for T0-T2. The overall score between T0 and T2 did not change significantly, p = 0.054, but had a trend for positive improvement, as the overall score was lower at T2. One subscale, functional, showed a significant change, p = 0.001. The other subscales, physical and emotional, showed no significant difference, but for all three subscales individually, the trend was for a positive improvement in speech after treatment. To evaluate the change per question, a Wilcoxon signed rank test was used. Three questions stood out for this specific study group; P2: ‘I experience difficulties in articulation when I speak’; F3: ‘I find it difficult to communicate with people I don’t know’ and F4: ‘Because of my speech problem people ask me often to repeat myself’. These questions showed a significant positive difference, resp. 0.021, 0.008 and 0.015. Table 4 summarizes the change in COG over time. For all sounds, the intercept was negative (signifying a mean drop in COG), but this was only statistically significant for /s/ and /f/. The effect of the extent of incisor lengthening varied. For the sounds /t/ and /f/ that effect was statistically significant, with a larger change in incisor length being associated with an increase in COG. For the adaptation effect (T1T2), the intercept in the analysis does not have a natural interpretation, but the effect of the initial change (between T0 and T1) on the change between T1 and T2 was clearly negative for all sounds, with always p < 0.001. That implies that for all sounds the change between T1 and T2 can be seen as a rebound to the initial change of the COG. Figure 1 visualizes this rebound effect for the sound /v/. The size of the rebound varied between 29 and 68% of the initial change in COG (Table 4).

Table 3 Mean overall and subscale SHI-scores for T0 (before treatment) and T2 (1 month after treatment), and results of paired t tests to evaluate the changes over time. T0 was compared with T2 to evaluate the impact of the treatment after adaptation
Table 4 Results of the statistical analysis of the COG-changes for the analysed sounds. For the difference between T0 and T1, the intercept of the regression models shows the change in COG (at the centered mean increase of the length of the incisors: 2.6 mm). The effect of the actual extent of change in incisor length (factor ∆length incisors) varied between the sounds. For the difference between T1 and T2, the intercept of the regression models is the change in COG in the situation where the change in COG between T0 and T1 was 0. This intercept does not have a natural interpretation. The relevant information is contained in the factor ∆COG T1-T0, which represents the effect of the initial (or treatment) change (between T0 and T1) on the change between T1 and T2 (or adaptation). The ICC explained the clustering of repeated sounds (within one moment (T0/T1 or T2) within the patient. ICC = 0, no clustering, ICC = 1, distribution between patients, not within the patients themselves
Fig. 1
figure 1

Graph of the COG of the sound /v/ at the three different measurement moments for each patient (data points linked by lines per patient). The rebound effect may be observed for most patients, both when the initial change was positive and when it was negative

Discussion

To our knowledge, this is a first evaluation study to investigate the effect of a full occlusal rehabilitation of tooth wear with increase in VDO on speech characteristics. No power analysis was performed on forehand but the results showed, for the majority, a very low p value.

In collaboration with a speech-language pathologist, it was decided to evaluate the sounds most likely to be affected by the increase in vertical dimension or the change in shape of the upper incisors. The intraoral areas that are involved into articulation of the different sounds in the Dutch language has previously been described by Rietveld et al. [19]. Another decision made based on the expertise of the speech-language pathologist, due to the lack of evidence in the literature, was to evaluate the sounds at the beginning of words. This decision was made to make it easier and more reliable to isolate the sounds using the program PRAAT. Intra-observer reliability rates were very high: > 0.98. Only for the sound /m/ the re-measurement showed a significant inconsistency with a p value of 0.04. The /m/ is harder to isolate at the end of the sound because the /m/ easily transits into the next sound. The effect of the next sound of the word was analysed, but it showed no significant influence on the COG, so the sounds could be averaged.

The reason for analysing the parameter COG of each sound and not to look at other characteristics within the speech recordings was that the COG is an objective parameter, that can be acoustically analysed with the program PRAAT. The COG is a measurement that represents the average height of spectral frequencies. It is the most valuable characteristic of a sound [14]. Speech characteristics are variable within the same person [20], and it is possible to investigate speech with qualitative evaluation (an expert listening and assessing) but that remains subjective. Since there is little to no knowledge about the speech characteristics after a full rehabilitation of tooth wear, the objective analysis using COG was chosen as a starting point.

The patients’ subjective opinion about their own speech, quality was evaluated using the SHI, showing very low means. This questionnaire may not have focused enough on tooth wear to evaluate the subtle changes in patients with a full rehabilitation due to tooth wear, resulting in no significant differences after the adaptation period of 1 month compared to the baseline (T0) registration, p = 0.054. Only the subscale functional showed a significant improvement of 0.39 scale point.

Looking deeper into the different questions, three questions stood out for this specific study group, namely question P3, F3 and F4. Before treatment, many patients pointed out these problems to their dentist, even before filling in the SHI. So overall, the main speech problems in daily life for these patients were represented in the questionnaire.

To assess the effect of the dental changes during treatment, the applied VDO in mm and the lengthening of the upper central incisors in mm were introduced in a regression model of COG changes. The applied VDO had no significant impact on any of the sounds and was therefore left out of the analysis. This study showed that the mean COG for the sounds /s/, /t/, /d/, /f/, /v/ and /m/ decreased due to the treatment, when taking into account the mean change in the length of the central incisors. For the sounds /t/ and /f/, the degree of lengthening of the incisors significantly influenced the treatment effect.

The effect on these sounds may be explained by their formation: the /f/ is produced through a light contact between the inside of the lower lip and the upper front teeth, while the /t/ is produced with the tip of the tongue touching the roof of the mouth near or against the front teeth [21]. This may have led to a substantial effect on COG. The absence of a significant effect for the sounds /v/ and /d/ may also be found in the articulatory information, as the /v/ and /d/ are voiced sounds, unlike the /t/ and /f/ which are voiceless. Looking at the results of the SHI questionnaire, this pronounced effect did not lead to a negative impact on speech perception by the patient.

We expected that the increase of length of incisor teeth would have an effect on the /s/ sound; however, this was not observed. The impact of the whole treatment did show a significant impact, p < 0.001, for the sound /s/. The verbal complaints of patients of a temporarily feeling of lisping, which was over within a few weeks, cannot be explained with the increase in length of the incisors. It is possible that for the /s/, the thickness on palatal side of the incisors may have had a greater effect, causing the temporarily feeling of lisping. The /s/ is created involving two constrictions: the narrow channel between the tongue and the palate or alveolar ridge, and the second constriction is formed by the teeth, against which the airflow is directed.

The COG after 1 month of adaptation showed a rebound effect, dependent on the initial treatment change in COG T0-T1. Interestingly, the rebound occurred irrespective of the direction, positive or negative, of the treatment effect.

The clinical relevance of this finding is not yet explained, but it may be a positive sign that the impact of a full rehabilitation does not result in lasting and possibly detrimental changes in the COG of the investigated sounds.

It is difficult to compare the COG values in this study to a ‘golden standard’ for speech. There is a lot of variations possible in the pronunciation, and dialects may have an influence [21]. On the website www.taalportaal.org, the Dutch language is investigated covering syntax, morphology and phonology. For the different investigated sounds: /s/, /f/, /v/, /d/, /t/ and /m/, there are indications in which region the COG in Hz may be located [21]. These values strongly depend on the way they are obtained for example quality of the recordings, or amount of background noise that had to be cancelled out with a filter. Therefore, absolute COG values should probably not be compared across studies. Comparisons with other studies in the field of dentistry are also difficult because of methodological differences and the very limited literature on speech in relation to dentistry. But in the case of our study, the exact interpretation of the COG value (i.e. low or high COG value) is not important because the parameter COG was only used to detect speech changes within the same individual during repeated measurements in the course of time.

Further investigation may entail more recording moments, different parameters such as listening experiments or questionnaires specific designed for patients with tooth wear to come to a conclusion about the clinical implication of a full rehabilitation on speech characteristics.

Conclusion

Acoustic speech characteristics of tooth wear patients tend to change after full composite rehabilitation with a rebound effect after 1 month of adaptation, dependent on the size of the initial change right after treatment completion. Tooth wear patients perceive a subjective improvement in speech function after treatment.