Introduction

Loss of the sense of smell is a well-recognized symptom of coronavirus disease-2019 (COVID-19) with an estimated prevalence of 40 to 75% [1]. Loss of smell resolves completely in the vast majority of cases within two to four weeks of disease resolution [2,3,4], but it persists in 15–20% of cases [5]. Hence, persistent olfactory dysfunction (OD) is the most frequent long-term morbidity in COVID-19.

Loss of smell can severely impair quality of life due to its negative effects on food enjoyment, nutritional balance, social communication, cognitive skills, and mental functioning. Furthermore, it exposes patients and their families to hazardous situations [6, 7]. Considering the prevalence of COVID-19 in the general community, it is likely that a large number of people will seek treatment for this disabling symptom. To date, various therapies have been investigated for the treatment of post-viral olfactory dysfunction (PVOD), and olfactory training (OT) has emerged as a promising treatment option [8,9,10,11]. OT involves daily smelling of a set of odorants over a period of several months to improve olfactory function based on the neuronal plasticity of the human olfactory system [12]. Hummel et al. [12] first described a structured method for OT using suprathreshold concentrations of four odors (rose, eucalyptus, lemon, and cloves), which is now called ‘classical olfactory training’ (COT). They reported the benefit of COT over 12 weeks in the treatment of olfactory disorders of various etiologies. Since its introduction, OT has been an attractive therapeutic approach because of its ease of administration, low cost, and negligible adverse effects [13].

The exact pathophysiological mechanism underlying the OD associated with COVID-19 remains incompletely understood, but it is thought to be different from that of other respiratory viruses [14, 15]. This raises the question of whether the efficacy of OT on COVID-19-related OD differs from that of other respiratory viruses. In this context, our aim was to conduct a prospective, controlled study to investigate the effect of COT on post-COVID-19 persistent OD.

Materials and methods

Patients

Consecutive patients who complained of loss of smell following COVID-19 were assessed in the Rhinology Clinic of the Department of Otorhinolaryngology of Kocaeli University, Kocaeli, Turkey. Patients were either self-referred to our department or referred from the Post-COVID-19 Follow-up Clinic held in the Infectious Disease Department. All patients were subjected to the Sniffin’ Sticks test in order to identify those with dysosmia. Patients also underwent a thorough medical history, and a complete otorhinolaryngologic examination, which included a nasal endoscopy for assessing the patency of the olfactory cleft and identifying other sinonasal pathologies. The inclusion criteria were as follows: (1) adults 18 years of age or older; (2) confirmed history of COVID-19 (positive real-time polymerase chain reaction test for severe acute respiratory syndrome coronavirus 2); (3) anosmia/hyposmia diagnosed using the Sniffin’ Sticks test; and (4) loss of smell for more than four weeks. The exclusion criteria were as follows: pregnancy; active cigarette smoking; previous OD; diagnosis of sinonasal disease; any respiratory allergy; any neurodegenerative diseases; and prior nasal/paranasal sinus surgery.

All patients were informed about the natural course of PVOD and the COT procedure was explained. Fifty-one patients who met the study criteria were then asked to choose between following the COT scheme or waiting for spontaneous recovery. According to their preference, participants were stratified into two groups: (1) patients who opted to perform classical olfactory training (COT group); and (2) patients who decided to wait for spontaneous recovery (control group). Demographic information about the participants was collected, including age, gender, and duration of loss of smell. The presence or absence of parosmia and/or phantosmia was also noted. All participants were warned of the dangers of loss of smell, and recommendations were made regarding the use of smoke and gas detectors and monitoring expiration dates on foods.

The study was performed in accordance with the Declaration of Helsinki on Biomedical Studies Involving Human Subjects. The study design was approved by the ethics committee of the Medical Faculty of Kocaeli University, Kocaeli, Turkey (KAEK 2021/01.06.). All participants provided signed informed consent.

Olfactory performance assessment

The Sniffin’ Sticks test (Burghart GmbH, Wedel, Germany) was used for the psychophysical assessment of olfactory function [16]. This test has been validated in a Turkish population [17]. The Sniffin’ Sticks test contains three separate felt-tip pen-like odor dispenser sets for odor threshold, discrimination, and identification. The odor threshold (T) is assessed by a single-staircase, three-alternative forced-choice procedure. Subjects had to detect the odorized stick among three samples with the other two blank control sticks. Odor discrimination (D) is determined by distinguishing the target odor in a triplet from two other identical odors. Odor identification (I) is carried out using 16 common odors, which have to be identified as a multiple-choice task from four possibilities presented as terms. The sum of the three tests (T, D, and I) scores gives the threshold–discrimination–identification (TDI) score with a maximum of 48 points. According to the most recent update of TDI normative values by Oleszkiewicz et al. [18], normosmia, hyposmia and anosmia were respectively defined by TDI ≥ 30.75, 30.5 ≥ TDI ≥ 16.25 and TDI ≤ 16. The Sniffin’ Sticks test was performed twice, at the baseline and at the follow-up visit 12 weeks later.

Olfactory training

In the present study, OT was performed in accordance with the protocol of Hummel et al. [12]. Patients in the training group were provided with four odorants (lemon, rose, cloves, and eucalyptus) in amber-colored glass jars and a written document describing the training procedure. Patients were instructed to expose themselves to each odor twice a day for 12 weeks. Patients were advised to sniff each jar for 20 s, with a 20-s break between each scent. Each session lasted about three to five minutes. Participants were asked to keep diaries throughout the training period to enable the monitoring of their adherence to the training scheme. Patients who did not complete at least 90% of their training sessions were excluded from the study. Patients in the OT group were also all contacted via telephone by the experimenter during the first and second months of the treatment period to give guidance when patients encountered difficulties and also to encourage adherence to the protocol. Patients in the control group did not receive any treatment.

Outcome measures

The primary outcome was the comparison of the clinical improvement in olfactory sensitivity at 12 weeks between the COT group and the control group. As previously described, a TDI score increase of 5.5 points or more was considered a minimal clinically important difference for subjective improvement in olfactory function [19]. Other outcome measures were the comparison of mean TDI and subunit (T, D, and I) scores between the groups. Additionally, the variables that might have been associated with clinical improvement in olfactory function, including age, gender, duration of OD, presence of parosmia, and baseline TDI score, were evaluated. Furthermore, the adherence rate of the patients to the training scheme was assessed. Any side effects related to COT were noted.

Statistical analysis

All statistical analyses were performed using IBM SPSS for Windows, version 20.0 (SPSS, Chicago, IL, USA). The Shapiro–Wilk test was used to assess the assumption of normality. Descriptive statistics of qualitative data are presented in the form of numbers and percentages, whereas for quantitative data mean ± standard deviation (SD) is used when the data fitted the normal distribution, and median and interquartile range (IQR) if they did not. Categorical variables are summarized as counts (percentages). Comparisons of continuous variables between groups were performed using an independent samples t test or Mann–Whitney U test, as appropriate. Comparisons of dependent samples were carried out by the paired t test or the Wilcoxon signed rank test. A chi-square test was used to determine relationships between categorical variables. Associations between continuous variables were assessed by Pearson or Spearman's correlation analyses. The significance level was p < 0.05.

Results

Patients’ characteristics

Of 51 patients, 31 (61%) chose the COT option and 20 (39%) chose not to undergo COT and were thus included in the control group. However, eight were excluded from the study due to missed follow-up (n = 4, two in the control group and two in the COT group) or insufficient commitment to the training scheme (n = 4). Thus, 43 of 51 cases were included in the analysis of therapeutic effect, of whom 24 (56%) were female. The median age was 38 years (IQR 26–46; range 18–67). The median duration of OD was 5 months (IQR 2–8; range 1–11). Parosmia was present in 54% of the subjects, while phantosmia was present in 7%.

The COT group consisted of 25 patients (15 female and 10 male, mean age 38 ± 14 years, range: 18–67 years) and the control group of 18 patients (9 female and 9 male, mean age 37 ± 10 years, range 22–58 years). The mean duration of OD in the COT group was 5.8 ± 3.4 months (range 1–11 months) and 4.7 ± 3.3 months (range 1–11 months) in the control group. In the COT group, 64% of patients had parosmia and 4% had phantosmia; in the control group, these values were 39% and 11%, respectively. At baseline assessment, there were no statistically significant differences between the two groups in terms of age, sex distribution, duration of OD, and presence of parosmia (all p > 0.05) (Table 1).

Table 1 Baseline characteristics of the training and control groups

Baseline olfactory evaluation

The initial mean TDI score in the COT group was 22.99 ± 5.24 (T = 5.39 ± 2.54, D = 9.12 ± 1.48, and I = 8.48 ± 2.43) and in the control group was 22.61 ± 6.36 (T = 4.67 ± 1.87, D = 9.56 ± 2.81, and I = 8.56 ± 2.53). The TDI scores were dominated by the deterioration of threshold ability. At baseline, there was no significant difference between the groups in respect of measured olfactory function (p = 0.312 for T, p = 0.513 for D, p = 0.922 for I, and p = 0.832 for TDI). Concerning the degree of OD, there were 4 anosmic (16%) and 21 hyposmic (84%) patients in the COT group, and 3 anosmic (17%) and 15 hyposmic (83%) patients in the control group.

Follow-up olfactory evaluation

After 12 weeks, the mean TDI score in the COT group was 28.67 ± 5.94 (T = 6.91 ± 2.60, D = 11.36 ± 2.14, and I = 10.40 ± 2.52) and in the control group, this was 24.07 ± 6.64 (T = 5.07 ± 1.81, D = 9.72 ± 2.93, and I = 9.28 ± 2.70) (Table 2, Fig. 1). The mean TDI score changed by 5.68 ± 2.78 in the COT group and 1.46 ± 3.19 in the control group (Fig. 2). A statistically significant change in the mean TDI score was observed in the COT group (p < 0.001) but not in the control group (p = 0.077). Patients in the COT group exhibited significantly higher mean scores in all subsets (T, D, and I) of the Sniffin' Sticks test (all p < 0.001), whereas only the I score showed a statistically significant change in the control group (p = 0.261 for T, p = 0.740 for D, and p = 0.022 for I). When evaluated individually, TDI scores improved in all 25 patients (100%) in the COT group, whereas this was observed in 14 of 18 patients in the control group (78%). Deterioration in TDI score was observed in 4 of the 18 (22%) subjects in the control group.

Table 2 Baseline and twelfth week olfactory test results
Fig. 1
figure 1

Mean threshold (T), discrimination (D), identification (I), and composite TDI scores at the beginning and twelfth weeks for the control group (no training) and the COT (classical olfactory training) group

Fig. 2
figure 2

Boxplot indicates the difference in the mean threshold–discrimination–identification (TDI) score between the end of the twelfth week and the start of the study for the control (no training) and COT (classical olfactory training) groups

Finally, the change in measured olfactory function according to clinical significance was evaluated (≥ 5.5 point increase in TDI score). Ten of 25 (40%) subjects in the COT group exhibited a clinical improvement, whereas it was observed in only 1 of 18 (6%) subjects in the control group. The comparison of clinical improvement rates between the groups was significant (p = 0.014). Age (p = 0.567), gender (p = 0.678), duration of OD (p = 0.818), presence of parosmia (p = 0.803), and initial TDI score (p = 0.800) had no effect on clinical improvement after COT.

When investigating the patients in terms of the degree of olfaction, in the training group, 3 of the 4 anosmic patients became hyposmic and 1 remained anosmic; 10 of 21 hyposmic patients became normosmic and 11 continued to be hyposmic. In the control group, 2 of 3 anosmic patients became hyposmic and 1 remained anosmic; 3 of 15 hyposmic patients became normosmic and 12 continued to be hyposmic.

In terms of adherence, the average commitment rate to the training sessions was 87%. Four out of 29 patients did not reach our specified compliance rate; subsequently, they were excluded from the study. None of the patients reported any side effects related to olfactory training.

Discussion

The current study showed that COT for 12 weeks was effective in improving persistent OD associated with COVID-19. Specifically, the COT group had a higher rate of olfactory score above the minimal clinically relevant difference for subjective improvement of smell than those in the control group who did not receive any therapy (40% versus 6%). All patients who performed COT exhibited an increase in their TDI scores. The TDI scores of almost three-quarters of patients in the control group also improved, albeit they showed less increase in their olfactory test scores compared to the training patients. Significant improvements in the means of odor discrimination, odor identification, and odor threshold were accompanied by an improvement in the mean TDI score in the COT group.

Since the initial description of OT by Hummel et al. [12], several meta-analyses [10, 11, 20] have demonstrated the beneficial effects of this method and its modifications in the treatment of PVOD. In a recent meta-analysis, Kattar and colleagues [20] pooled only the results from the COT regimens with different follow-up intervals and revealed that COT is associated with a higher rate of clinical improvement in PVOD compared to controls. They also pointed out that longer OT duration is associated with greater improvements in olfactory function. However, all five included studies that included 12 weeks of COT had clinically significant improvements in olfactory function at rates of 21% [21], 23% [22], 27% [7], 28% [12], and 53.3% [23]. In the present study, 12 weeks of COT was also effective in post-COVID-19 persistent OD, with a clinical improvement rate of 40%.

To date, there are only a few studies investigating the efficacy of olfactory training in the treatment of post-COVID-19 persistent OD. A recent study by Vandersteen and colleagues [24] reported a significant clinical recovery after OT in patients with COVID-19-related persistent OD. Their cohort was comprised of forty-three patients with a mean OD duration of 5.8 ± 3.2 months. All participants underwent OT based on the protocol of Hummel et al. [12], with an average period of 14 weeks. They observed a significant increase of 6.2 points in the average TDI score (from 24.7 ± 8.9 to 30.9 ± 9.8) following OT. The patients and symptom characteristics in this study were similar, and the results were consistent with our study. However, as the study of Vandersteen et al. did not include a control group, we could not compare the spontaneous recovery rates. In another study, Altundag et al. [25] reported that modified olfactory training (MOT) is effective in the treatment of post-COVID-19 parosmia. Among the findings of this study, statistically significant increases in the mean TDI scores of the MOT group in the third, sixth, and ninth months were observed. In keeping with our findings, at the end of the first trimester of the MOT, the mean TDI score significantly increased by 4.4 points. Another prospective study by Le Bon et al. [26] reported that the combination of a short course of oral corticosteroids (OCS) plus OT was safe and may be beneficial in the treatment of patients with long-lasting dysosmia due to COVID-19. Patients in the OCS + OT group improved their olfactory scores by 7.7 points on average after 10 weeks, compared to a 2.1 point increase in the OT-only group. In contrast to our findings, they did not find a significant improvement in OD by OT alone. This result may be due to a lower adherence rate of 31% to the training scheme and possibly a slightly shorter training period compared to that used in our study.

Regarding the effectiveness of OT across three different olfactory abilities—smell identification, discrimination, and threshold—several meta-analyses [10, 11, 20] reported a great variance in the improvement of these subscores. In patients with post-COVID-19 persistent OD who performed OT, Vandersteen and colleagues [24] observed a significant T and I improvement, followed by a non-significant D improvement. Our study, however, produced a significant increase in the mean values of all subsets of the Sniffin’ Sticks test after the COT. This result suggests that the therapeutic effect of COT may occur in both the peripheral and central olfactory systems.

In terms of characteristics that predict the success of OT in PVOD, some studies commented that a shorter duration of OD prior to OT initiation was associated with greater improvement in olfactory function [21, 27]. Liu et al. revealed that higher residual olfactory function and older age were inversely associated with relevant improvement in olfactory function after OT, while gender had no impact [28]. Furthermore, in another study by Liu et al. [29], it was found that patients who had parosmia accompanying PVOD had better outcomes after OT in terms of odor identification and discrimination compared with those who did not have parosmia. The current study did not yield any significant effect of age, gender, duration of OD, presence of parosmia, or initial olfactory score on the clinical improvement of olfaction after 12 weeks of COT. Thus, clear conclusions regarding whether these factors have any impact on the success of OT in COVID-19-related OD could not be made. A larger sample size would be required to provide a definitive assessment of such variables.

The adherence rate to the 12-week training plan in the present study was high at 87%. Fornazieri et al. [30] reported an 88% adherence rate after three months of OT, and a moderate decline after that. Some studies [30, 31] suggested that adequate adherence to the training scheme was significantly correlated with the improvement in olfactory test scores derived from OT. Our high adherence rate can be attributed to factors such as the short training period, a high rate of smell improvement during the training period, the keeping of a diary, and regular interactive feedback via direct contact with the experimenter. None of the patients in our study reported any adverse effects during the training period.

The strengths of our study were: (1) the participants had a homogenous etiology for their OD; (2) the inclusion of a control group to exclude the confounding effect of spontaneous recovery; (3) similar baseline patient characteristics in both the COT and control groups; (4) the inclusion of only those patients with an adherence rate of more than 90% to training sessions; and (5) using a well-validated tool to assess olfactory function.

This study has limitations, which must be considered when interpreting the data. First, our study was not a placebo-controlled study. Subsequently, due to the lack of intervention in the control group, the second olfactory test results might have been affected by the loss of motivation. However, odorless training jars could be detected by subjects or relatives. Second, the COT and control groups were formed according to the patient’s preference rather than through random allocation. However, a random assignment may not be ethical. Third, this was a single-center study with a limited number of patients. Therefore, the statistical power was low. A final limitation was that the adherence rate to the training scheme was evaluated entirely by gathering data from patients' records in their diaries. This might have resulted in unreliable data to assess the adherence rate of the training scheme. Despite these limitations, this is the first study to investigate the effect of COT in patients with persistent OD after COVID-19, involving a control group to eliminate the effect of spontaneous recovery. Larger cohort studies are needed to confirm our findings and develop alternate methods to enhance the efficacy of OT.

Conclusion

In patients with persistent olfactory dysfunction after COVID-19, classical olfactory training produced significantly better improvement in olfactory sensitivity than that of the natural course of the disease in this cohort. A twelve-week period of classical olfactory training is a useful, practical, and well-tolerated procedure with a high level of compliance.