Introduction

A greater number of people today retain their natural teeth until later in life. For example, in Switzerland, only 6.5% of the population above the age of 65 years are reported to be edentulous [1]. Overall, the incidence of edentulism is declining due to the success of both preventive and conservative dentistry [2] as well as restorative dental techniques [3]. Current assessments from Germany, however, indicate that despite these preventive efforts, the prevalence of oral infections is still increasing with age. Later in life, periodontal disease is commonly found to be the main reason for tooth loss [4].

In elderly patients who are in need of help for their everyday activities (activities of daily living (ADL)) both the oral and denture hygiene is often poor [5, 6]. Thus, more frequently, periodontal inflammation may be leading to pain, tooth loss, decreased masticatory function and reduced quality of life [7, 8]. Furthermore, it has been shown that inappropriate oral health may further compromise general health especially in elderly patients. In these cases, the accumulated biofilm built on teeth and dentures can be released into saliva, the oropharynx and further be aspirated into the lungs possibly causing aspiration pneumonia [9]. A number of studies suggested that aspiration pneumonia is an important risk factor for morbidity and mortality in elderly patients [10, 11]. On the other hand, Yoneyama and co-workers conducted a study in nursing homes demonstrating that rigorous oral hygiene regimes may result in a significant risk reduction for the development of aspiration pneumonia [12].

The potential pathogens such as Staphylococcus aureus can be detected in 20% and enterobacteria in 8% of the elderly population. Furthermore, yeasts are highly prevalent in people living in long-term care facilities [13]. They form biofilms on teeth, mucosa and denture surfaces, which could most commonly be removed mechanically using a toothbrush and toothpaste. However, many elderly people are not able to perform a homecare oral hygiene regimen as indicated. They often lack the capacity for adequate oral hygiene due to decreasing manual dexterity, impaired vision or frailty [14].

Several indications have been described in which antimicrobial mouthwashes may be of additional value to mechanical plaque control. These clinical situations may comprise poor mechanical control (e.g. arthritis of the hands), xerostomia, use of splints, fixed partial prostheses, exposed root surfaces, overdenture abutments or oral surgery [15]. In such cases, antimicrobial mouthwashes may add a significant benefit, especially chlorhexidine in concentrations of 0.1 to 0.2% [16]. Other mouthwashes containing fluoride were shown to exhibit anti-inflammatory effects, however, demonstrating common side effects such as mouth-burning sensations and staining of both teeth and tongue [17].

Essential oils and other plant extracts have evoked interest as a source of natural medicinal products. Both oils and plant extracts have been evaluated for their use as an alternative medicine in the treatment of numerous infectious diseases [18]. Specifically, essential oils were proven to exert antibacterial, antifungal, antiviral and antioxidant properties. However, their mechanisms of action have not been fully elucidated [19]. Salvia officinalis (sage) has been used in medicine and in dentistry for several hundred years. A recent study demonstrated an anticandidal effect of Salvia officinalis essential oil against Candida albicans [20]. Delamare and co-workers [21] demonstrated an antibacterial activity of the essential oils of Salvia officinalis and Salvia triloba. The essential oils of both species exhibited bacteriostatic properties against several microorganisms, among them Staphylococcus aureus [21].

Further information on the effect of sage-containing mouthwashes on periodontal health may lead to improved management of periodontal diseases in elderly individuals enrolled in supportive periodontal therapy (SPT). Therefore, the main aim of the current study was to evaluate the potential of a sage-containing mouthwash to reduce inflammatory signs of both intra-oral mucosa and gingiva in comparison with a water/alcohol-based placebo.

Null-hypothesis H0: In elderly patients under supportive periodontal therapy, there is no difference between a sage-containing mouthwash and a water/alcohol-based, taste-adjusted placebo mouthwash in alleviating inflammatory signs of intra-oral mucosa and gingiva.

Material and methods

Subjects, study design and setting

This was a randomised, placebo-controlled, double-blind, parallel design clinical study. The consolidation standards of reporting trials (CONSORT) guidelines for clinical trials were followed [22]. The study was reviewed, registered and approved by the appropriate Regional Ethical Board in Bern, Switzerland (Basec-No.: 2016-01383). The study was registered on the ClinicalTrials.gov Protocol Registration and Results System (PRS) under the ID: NCT0283080. The recruitment period lasted from 2016 to 2017. Consecutive patients from a private geriatric practice (Laupen, Switzerland), two homes for the elders (Burgerspittel am Viererfeld, Bern and Betagtenzentrum Laupen) and the School of Dental Medicine (Bern, Switzerland) were informed about the study and asked whether they wanted to participate. Inclusion criteria were the ability to give informed consent and the willingness to participate in the study. Subjects needed to be 65 years and older while presenting a full-mouth percentage of bleeding on probing (BOP) > 50% [23]. Exclusion criteria were allergy to one of the components of the irrigation solution, complete edentulism, alcohol dependency, prescription of antibiotics or immunosuppressant (e.g. glucocorticoids) and anti-inflammatory mouthwash. There was no minimum time period that must have elapsed from the last recall for the patients to be included in the study. One calibrated dentist, who was blinded to the type of mouthwash investigated in this study, conducted all clinical examinations. Measurements were recorded on case report forms (CRF) specifically designed for this study. For all subjects, the baseline examination (BL) included an independent oral examination together with the collection of the general medical history.

A priori sample size calculation could not be performed due to lack of previous reference data. Therefore, a sample size of n = 40 was chosen and a recruitment of n = 48 was deemed necessary to compensate for dropouts. This study was designed as a pilot randomised clinical trial (RCT).

Randomisation was performed according to an online-generated randomisation list (www.randomizer.org) with blocs of n = 8. Attribution to either the test group or the placebo group was noted and concealed in non-transparent envelopes. Following initial screening, signing of informed consent and study inclusion as well as the baseline examination by the calibrated dentist, the envelope was opened and the subject was thus attributed to the test group or the placebo group, respectively.

Intervention

For the duration of the study, the test group rinsed with a sage-containing mouthwash (Dr. Hauschka Med Mundspülung Salbei, WALA Heilmittel GmbH, Bad Boll/Eckwälden, Germany) containing extraction of Althaea officinalis leaf extract, Salvia officinalis leaf extract, Krameria Triandra root extract, Calendula officinalis flower extract, Potentilla erecta root extract, Aesculus hippocastanum bark extract, Melia azadirachta leaf extract, Commiphora Myrrha resin extract, water and alcohol solvent (9 vol%). The placebo group rinsed with a placebo mouthwash (WALA Heilmittel GmbH) that was similar in taste, colour and alcohol content (9 vol%). Stability tests were performed for all mouthwashes used, and a safety assessment was conducted according to the EU Cosmetic Law 1223/2009. Both test and placebo mouthwashes were packed in identical white bottles which were labelled with consecutive numbers only. Hence, both operator and subjects were blinded to the assignment of the study group.

In addition to their individual oral hygiene measures, the subjects were asked to rinse with the allocated mouthwash once daily (in the evening after brushing or before bedtime) for the duration of 6 weeks. Patients were instructed to rinse vigorously for 30 s. They have not been given instructions to improve oral hygiene. For the standardisation of the time used for rinsing, an hourglass was provided. After rinsing with the mouthwash, participants were advised to avoid further rinsing, eating or drinking for at least 30 min.

Primary outcome measure

For the purpose of this study, the Sulcus Bleeding Index (SBI) was used as the primary outcome measure [24]. The criteria for scoring were Score 0 (healthy gingiva and absence of bleeding on probing (BOP)), Score 1 (healthy gingiva and presence of BOP), Score 2 (change in colour of the gingiva, no oedema, presence of BOP), Score 3 (change in colour of the gingiva, slight oedema, presence of BOP), Score 4 (change in colour of the gingiva, obvious oedema, presence of BOP) and Score 5 (change in colour of the gingiva, marked oedema, spontaneous bleeding). Four gingival sites were scored at each tooth: mid-buccal, mid-lingual and the mesio-buccal and disto-buccal papillary gingiva. The average scores recorded determined the SBI.

Secondary outcome measures

Plaque Index

Plaque deposits were recorded using the Plaque Index (PLI) according to Löe and Silness: Score 0 (absence of plaque), Score 1 (plaque disclosed after running the periodontal probe along the gingival margin), Score 2 (visible plaque) and Score 3 (abundance of plaque) [25].

Tooth Staining Index

The vestibular and oral surfaces were evaluated for discolouration, ignoring wisdom teeth, crowned teeth and dentures using the Tooth Staining Index (TSI) [17]. The staining was graded: Grade 0 (no discolouration), Grade 1 (up to 1/4 of the tooth surface was discoloured), Grade 2 (up to 1/2 of the tooth surface was discoloured), Grade 3 (up to 3/4 of the tooth surface was discoloured) and Grade 4 (more than 3/4 of the tooth surface was discoloured). The TSI was calculated as the sum of all discolouration areas over the sum of all surfaces.

Xerostomia

Subjective symptoms of dry mouth were assessed using the German 14-item version of the Xerostomia Inventory (XI) [26]. Subjects were asked to indicate the frequency of 14 occurrences that are related to dry mouth and throat using a 5-point ordinal rating scale. The XI summary score is the sum of the scores of the 14 single items with potential values ranging from 0 to 56 and with higher values indicating more symptoms of dry mouth. Therefore, the XI was considered a problem index. Reliability of the XI was assessed as the instrument’s internal consistency.

Stomatitis Index

Oral soft tissues were further assessed using the Stomatitis Index (SI) according to Newton [27] as follows: Grade 0 (none), Grade 1 (punctate hyperaemic lesions, local erythema), Grade 2 (diffuse erythema, generalised simple inflammation) and Grade 3 (hyperplastic granular surface, inflammatory papillary hyperplasia) [27].

Statistical analysis

A descriptive analysis was conducted, and absolute and relative frequencies, mean values and standard deviations were reported. The data were evaluated using the statistical software package R, Release 3.3.3 [28].

With the perceived xerostomia, a Cronbach’s alpha [29] was computed along with the average inter-item correlation using all 14 XI items. Alpha values were interpreted according to guidelines with values of 0.70 or above suggesting satisfactory reliability [30]. For broader higher-order constructs such as dry mouth symptoms, an average inter-item correlation of at least 0.15 was considered satisfactory [31]. Cronbach’s alpha (0.90) and average inter-item correlation (0.39) revealed that the instrument’s internal consistency was excellent.

The baseline results of both groups were tested using Students t test for independent samples and Fisher’s exact test. Differences between baseline and re-examination were tested using both Student’s paired t test and Fisher’s exact test. Both tests were corrected for repeated measurements according to the method described by Holm. For the prediction of the difference of numeric values between the baseline and the follow-up examination, multiple linear regression models were used. The explanatory variable ‘tooth brushing frequency’ and their response options ‘once, occasionally or never’ were combined in one group as opposed to the frequency ‘twice or more’. As level of significance, a p value of < 0.05 was chosen.

Results

Subjects

Out of 48 initially included subjects, 45 completed the study (test n = 23, control n = 22, Fig. 1). Reasons for dropout were no compliance to the study protocol (n = 2) or no show at the final examination (n = 1). The mean age of the subjects was 77.5 years (± 7.3), and the majority were non-smokers (83.3%). Out of the n = 48 subjects, 52.1% (n = 25) were females. The mean number of drugs taken by the study participants was 4.6 (± 5.16, min = 0, max = 19), and none of them was diabetic. In the control group, 14 participants brushed their teeth at least twice a day and 10 once per day. In the test group, 19 participants brushed their teeth at least twice a day, another 4 once per day and 1 occasionally. The difference was not significant (p = 0.270). Baseline and demographic characteristics were not statistically significantly different between the two study groups (Table 1).

Fig. 1
figure 1

The CONSORT 2010 flow diagram of the present study

Table 1 Baseline demographic data and unadjusted qualitative analysis of possible confounders in n = 48 subjects

Outcome measures

Gingival inflammation as assessed by the SBI in the test group improved from 1.3 (± 0.9) to 0.8 (± 0.7) (p = 0.0029) and 1.4 (± 0.9) to 1.1 (± 0.7) (p = 0.0105) in the placebo group, respectively. Furthermore, the calculated change of SBI for each site of measurement was not statistically significant between test and placebo groups (chi-square test, p = 0.8823). Similarly, the PLI in the test group improved from 1.2 (± 0.5) to 1.0 (± 0.3) (p = 0.0080) and 1.3 (± 0.4) to 1.1 (± 0.6) (p = 0.0087) in the placebo group, respectively (Fig. 2). No statistically significant differences were found, however, between test and placebo groups at both baseline and the 6-week follow-up (p > 0.05) (Fig. 2; Supplementary Table A).

Fig. 2
figure 2

Mean Sulcus Bleeding Index (SBI) and Plaque Index (PLI) from n = 23 test subjects and n = 22 control subjects at baseline and 6 weeks. *p < 0.05, statistically significant difference

There was no statistically significant difference in the pooled analysis concerning the degree of prosthetic stomatitis at baseline (p = 0.6824) and follow-up (p = 0.2362), tooth staining in both upper and lower jaw (baseline and follow-up: p > 0.05) and xerostomia at baseline and follow-up and no difference between groups (p > 0.05), respectively (Table 2).

Table 2 Descriptive summary of the degree of stomatitis, tooth staining and xerostomia inventory per group and clinical visit

Bi- and multivariate analyses

The linear regression analysis of the intervention effect revealed that either increase or decrease of SBI and PLI could not be predicted from the groups. There was a tendency, however, that the improvement in PLI could be predicted from age since younger subjects tended to show a more improved PLI (Table 3).

Table 3 Regression analysis for the Sulcus Bleeding Index (SBI) and further indicators of gingival health

Discussion

Summary of the results

During the course of the study, periodontal indices related to gingival health and oral hygiene improved in both study groups reaching statistical significance. However, no between-group differences were found among test and placebo nor could the effect of the test group be predicted from the statistical model for age, gender, frequency of tooth brushing, and the allocation of the study group. Therefore, the null-hypothesis H0 cannot be rejected. However, it has to be noted that the study revealed an effect size for the improvement of the main outcome parameter gingival inflammation as assessed with the SBI of 0.42. An effect size of 0.5 was classified as ‘medium’ and could be ‘visible to the naked eye of a careful observer’ according to Sullivan and Feinn [32]. Hence, it can be assumed that the current study was slightly underpowered.

Results in the context of existing knowledge

The effects of plant extracts and essential oils have attracted considerable interest, but most of these studies had been performed in vitro [33]. They described bacteriostatic and antifungal effects and alteration to the morphology of the bacterial organisms. The present study revealed that both the placebo solution and the sage-containing mouthwash had a similar effect on gingival inflammatory parameters. It has to be noted, however, that the placebo solution possibly contained effective agents in the same concentration such as alcohol. Therefore, a possible beneficial effect of the alcohol and peppermint oil content of the placebo on the composition of the intraoral flora could be taken into consideration. However, it has to be noted that Chalhoub and co-workers in their RCT did not find a beneficial effect of using an alcohol-free essential oil-containing mouthwash containing peppermint oil in institutionalised patients with dementia [34].

In an in vitro study, Sissons and co-workers demonstrated that prolonged application of ethanol of up to 40% is necessary to inhibit growth of plaque biofilms [35]. With an alcohol concentration as solvent of 9 vol%, both test and control solutions of the present study were not expected to have a statistically significant inhibitory effect on the microbial flora. In a 3-day plaque regrowth model Marchetti and co-workers found no difference between an alcohol-free mouthwash containing essential oils compared with an alcohol-based essential oil mouthwash [36]. The lack of a placebo group rinsing with a non-alcohol-containing placebo may be considered a limitation of the present study.

In a systematic review, van Leeuwen and co-authors analysed the effect of an alcohol vehicle solution compared with an essential oil mouthwash and a water-based control [37]. The evidence from their review revealed that the reduction in plaque and gingivitis between the alcohol vehicle solution and the water control was not significantly different. Therefore, it may be concluded that the antiseptic effect of the hydro-alcohol solution appears to be negligible [37].

One further possible explanation of the improvement in both SBI and PLI may be attributed to the Hawthorne effect. More specifically, the improvement of the PLI over 6 weeks highly reaching statistical significance supports this assumption. The Hawthorne effect, i.e. ‘the observer effect’ was described as the reaction of individuals of being observed and consequently, change their behaviour because of the circumstance of being observed. It was suggested that the knowledge of being subject to research, and subsequent higher attention to the individual would lead to an increase in productivity or observed action [38]. The awareness of being studied in the present RCT may have had a positive impact on the oral hygiene behaviour of the subjects in our control group. Such consequences have been repeatedly described, but very little is known about the mechanism under which they operate [39].

Another RCT by Bellomo and co-workers [40] assessed the improvement of oral hygiene measures for elderly people in a long-term care facility. Their experimental group benefitted from ergo-therapy for the improvement of the autonomously performed oral hygiene. Their control group received regular manicure. Interestingly, in both groups, oral and denture plaque scores improved [40]. Their observation may also suggest a similar psychological effect observed in the present study.

The influence of alcohol containing mouthwashes on the oral mucosa is repeatedly discussed in the medical and dental literature. The effect of alcohol on keratinocytes as the most common cell type has been recently investigated in an in vitro study by Calderòn-Montāno and co-workers [41]. After exposition of keratinocytes to ethanol concentrations commonly used in mouthwashes, a marked cytotoxic effect was observed for concentrations of 20% and above. Cytotoxicity appears to be influenced by ethanol concentration and exposure time. Interestingly enough, no cytotoxicity was noted when cells were exposed to ethanol 10% for 5 min. Therefore, we could not expect that the test and placebo solution with a concentration of 9 vol% alcohol to cause any adverse effects on the oral mucosa.

Boffetta and co-workers pointed out that most of the available mouthwashes contain 20% alcohol and thus suggested a possible relationship with oral cancer [42]. Due to many confounding factors, however, conclusive evidence is lacking. Based on the risk estimation of three to four cases per one million, it will hardly be possible to epidemiologically detect and determine the level of risk from alcohol containing mouthwashes on oral cancer [43].

For the evaluation of xerostomia, the German adaption of the XI score according to Thomson and co-workers has been evaluated in the current study [26]. These authors have shown a change of 6 or more points in the XI score being clinically meaningful. We did not find a worsening effect on xerostomia (point difference, 0.5–1.8), despite that placebo and test solution were containing alcohol. This is in agreement with the data of Nair and co-workers who evaluated the 7-day rinsing effect of a 20% alcohol-containing mouthwash versus mouthwash without alcohol [44]. In contrast with our data, their patients were rinsing two times a day for 30 s. We conclude that even long-term use of alcohol containing concentration < 10% mouthwashes are not worsening xerostomia.

Strengths and weaknesses of the study

This study was designed and conducted as a randomised, placebo-controlled, double-blind, parallel design clinical study. Hence, it fulfils the highest standards of clinical trial designs. If the study had been designed as a case series instead—without placebo control—the results would have appeared much more positive for the tested mouthwash, i.e. significant improvement of the inflammatory signs.

Taking the medium effect size of 0.42 for the SBI into consideration, it has to be extrapolated that the study was slightly underpowered. Although hypothetical, it is likely that a statistically significant effect could have been demonstrated if the sample size had been chosen to be higher. However, RCTs are very expensive and laborious studies and recruitment must be planned a priori.

The compliance of the participants to the study protocol was impossible to supervise on a daily basis; however, at the end of the study, the filling level of the three bottles was checked.

A further strength of this study is that it was mainly practice-based and aimed at including old and very old dentate patients. Practice-based studies reflect much more dentists and their patients in normal life. Often, University-based studies lack the authenticity as diagnosis and treatment do not depend on financial and time constraints.

Clinical implications

Mouthwashes used on a regular basis appear to positively influence gingival health of elderly people. Participation in a study may account for an additional increase in the patient’s motivation and awareness of oral hygiene measures supporting the importance of repeated patient motivation.

Suggestions for further research

In future trials evaluating sage-containing mouthwashes, it may be beneficial to add either a negative control group irrigating with a placebo solution not containing alcohol or a positive control group rinsing with chlorhexidine digluconate. Moreover, an irrigation time of 1 min twice per day may show more favourable results. Additionally, the follow-up time may be extended to two and 3 months in order to evaluate whether the expected improvement of the selected outcome measures may be demonstrated both mid- and long term. The setting of this type of trial including subjects in long-term care would be of additional interest. Several attempts have been made to improve oral hygiene in care settings; however, study results remained inconclusive.

Conclusion

In this study, the irrigation with a sage-containing mouthwash did not result in a superior beneficial effect on inflammatory parameters and plaque indices compared with the placebo. The potential impact of this plant extract on oral health may need further investigation.