Introduction

COVID-19 (Coronavirus Disease 2019) is a recently emerged high-risk infectious disease with a long incubation period that has progressed very rapidly and has become a pandemic affecting millions of people around the world [1]. Since the pandemic outbreak, a growing number of studies have been conducted to investigate the effect of the disease on physical, psychological, and social health [2,3,4]. More specifically, the disease seems to have the potential of affecting various systems in the human body including the respiratory, gastrointestinal, neurological, cardiovascular, renal, and even reproductive systems [5,6,7,8]. Aside from the available data, ongoing studies are still trying to explore the underlying mechanism of these effects.

The production of the human voice is a complex phenomenon that requires an integrated working model, especially with the respiratory and neurological systems. A problem in these systems may disrupt the integrity and also the functionality of voice production, which further results in a disordered voice [9]. It is well established in the studies that the lung is the primary affected organ in COVID-19 disease and there are disease-specific pathological changes in an infected lung [10, 11]. One of the most important tasks of the respiratory system is to provide airflow for voice production. In severe cases, COVID-19 can significantly affect the respiratory system and cause pneumonia and acute respiratory distress syndrome. It has been reported that people who are infected experience difficulties not only in the inspiratory phase of respiration but also in the expiratory phase [12]. Since the energy source required for phonation is provided by the air exhaled from the lungs, both respiratory phases should have adequate functionality. On the other hand, cough, sore throat, rhinorrhea, sneezing, and vomiting are among the general symptoms of the disease which are closely related to the anatomical structures involved in voice production [13]. These symptoms could cause mechanical alterations on the vocal folds and lead to deterioration in voice quality. Another issue that may have a potential effect on the phonatory mechanism is post-viral vagal neuropathy (PVVN). According to Saniasiaya et al. [14], since it contributes to laryngeal sensory and motor disruptions, PVVN should be considered as an important factor that effecting the voice of patients with COVID-19.

The prevalence of voice disorders (dysphonia) in COVID-19 patients was reported to be between 22.3% and 43.7% [15,16,17]. To date, several studies have been performed to investigate the vocal characteristics of patients with COVID-19. In a study by Saki and colleagues, significant differences were found in auditory-perceptual parameters (both for CAPE-V and GRBAS) between the patients and healthy volunteers [18]. In another study, it was stated that the Voice Handicap Index-10 (VHI-10) scores were higher in patients and all scores of CAPE-V were significantly worse in patients than in controls [19]. In the target population, Tohidast et al. [20] investigated the voice quality and vocal tract discomfort symptoms and reported higher scores in all parameters of the GRBAS scale and a greater frequency of vocal tract discomfort symptoms than healthy subjects. In their study, Asiaee et al. [21] used acoustic voice analyses to compare the vocal quality of the patients with healthy participants. The results of the study revealed statistically significant differences between the two groups for CPP, HNR, H1H2, F0SD, jitter, and shimmer parameters. In all the above-mentioned studies, only subjective voice assessment methods were used except for Asiaee’s study. However, it is well known that both objective and subjective methods should be included in the voice evaluation protocols for better management [22,23,24,25]. In a study by Eadie et al. [22], it was mentioned that combining acoustic and auditory-perceptual measurements improved the classification accuracy between dysphonic and normophonic voices by up to 100%. In a more recent study, a multiparameter voice assessment protocol including objective and subjective methods was recommended to the clinicians in the diagnosis and treatment of voice disorders [25].

To the best of our knowledge, there is no previous data evaluating the vocal characteristics of the patients who recovered from COVID-19 in a comprehensive manner that consists of both the objective and the subjective parameters. We, therefore, conducted the present study to compare the vocal outcomes of recovering patients with a control group using acoustic voice analyses, durational measurements, self-reported questionnaires, and auditory-perceptual evaluations.

Materials and methods

Study design

The present study was designed as a case–control study at Prof. Dr. Necmettin Akyıldız Hearing, Speech, and Voice Center. Ethical approval was approved by the institutional review board of Gazi University Clinical Research Ethics Committee (IRB No. 328) and all data of the participants were collected after written informed consents were obtained.

Participants and recruitment

The individuals who recovered from COVID-19 disease were the target population for the current study. The subjects with voice disorders due to any functional, structural, or neurogenic causes, those with any malignancy in the nasal cavity, oropharynx, and larynx, those with a history of surgery in the head and neck region (except for septoplasty), those receiving regular inhaled medication due to any reason, and those with hearing loss were excluded from the study.

A total of 40 recovering patients (20 male, 20 female) with a mean age of 39.9 ± 8.8 (range, 21–53) and 40 gender and age-matched healthy individuals (mean age, 37.3 ± 8.8; range, 21–54) were included to the study. Body mass index (BMI) for all participants was collected and did not differ between the groups. Data of time after infection, day of turning negative polymerase chain reaction (PCR) test, and the VAS (visual analog scale) scores of symptoms experienced during the disease (coughing, difficulty in breathing, tiredness, difficulty in speaking, and pain) were gathered from the study population. Table 1 provides the demographic information and clinical variables of all participants.

Table 1 Demographic and clinical characteristics of the participants

Recording of voice samples

All voice samples were recorded in a sound-treated room with an ambient noise level of 30 ± 5 dBC using CSL software (Model 3700, Version 3.4.1, 2000–2001 Kay PENTAX, Montvale, NJ). A unidirectional Rode NT1 Cardioid Condenser Microphone (Rode microphones, Torrance, CA) with a frequency response of 20 Hz to 20 kHz (± 2 dB) was placed at the level of participants’ mouths with a distance of 15 cm. Recordings were performed with a sampling frequency of 44,100 Hz and 16-bit resolution. Before the voice recordings, an experienced SLP (Speech and Language Pathologist) explained the CAPE-V procedure in detail to each participant. According to the procedure, all individuals were asked to phonate the sustained vowels (/a/ and /i/), read the sentences in the Turkish version of CAPE-V, and perform a running speech sample (at least 20 s of natural conversational speech) using a standard interview question “Tell me about your voice” [26, 27]. All voice samples were then saved in.wav format on a desktop computer for further analysis.

Acoustic analyses

Voice perturbation measurements [mean fundamental frequency (F0), jitter local, shimmer local, and harmonic-to-noise ratio (HNR)] and cepstral peak prominence (CPP) values were obtained from sustained phonation and connected speech, respectively. F0, measured in Hertz, is defined as the number of vocal cycles produced per second. Jitter local is the average absolute difference between consecutive periods, divided by the average period and shimmer local is the average absolute difference between the amplitudes of consecutive periods, divided by the amplitude [28]. HNR specifies the ratio of harmonic energy to noise energy in an acoustic voice signal. CPP is a frequency-based acoustic measurement obtained from the cepstrum of a sound wave. Since it measures the degree of harmony within a voice sample, the greater value of CPP corresponds to a more periodic voice signal and a greater degree of harmonic energy [29]. Praat software (Version 6.0.17, Paul Boersma and David Weenink, Department of Phonetic Sciences, University of Amsterdam, Amsterdam, Netherlands) was used for all acoustic voice analyses. For perturbation measurements, the mid-vowel segment of the most stable part of the sustained /a/ phonation was used. Because it consists of all voiced phonemes, the CPP values were obtained from the third sentence of the Turkish version of CAPE-V (“Arda onca yılın ardından aradı”, phonetic transcription; “Ʌrdʌ onʤʌ jɯɫɯn ʌrdɯndʌn ʌrʌdɯ”) for each participant according to a previously published data which shows the steps and parameter setting in Praat software for the extraction of CPP [30].

Durational measurements

Maximum phonation time (MPT) and s/z ratio values were obtained during sustained phonations. For MPT, participants were instructed to inhale deeply and produce the sound /a/ as long as they could at their usual speaking volume for three consecutive times. The maximum value of the trials was accepted as MPT and noted in seconds. The same method of MPT was used for s/z ratio, the maximum phonation time of sustained /s/ divided to the maximum phonation time of sustained /z/. Before all durational measurements, the examiner performed the task for each individual for proper execution.

The questionnaires

Turkish version of both Voice Handicap Index-10 (VHI-10) [31] and Voice-Related Quality of Life (V-RQOL) [32] questionnaires were used to collect data about the patient-reported outcomes. VHI-10 is a questionnaire that consists of 10 items with a score of spanning from 0 to 40, each item has five possible answers: 0—never; 1—almost never; 2—sometimes; 3—almost always; and 4—always. V-RQOL also has 10 items that are rated on a 5-point scale (1–5) measuring the social-emotional and physical function of the patients with voice disorders. A higher score from the questionnaire indicates greater severity of dysphonia.

Auditory-perceptual evaluation

Auditory-perceptual voice judgments were performed based on the running speech part of voice recordings (“Tell me about your voice”). An SLP (primary rater), who has at least 5 years of experience in the evaluation and measurement of voice disorders, blindly assessed all voice samples in accordance with the CAPE-V protocol. To capture intra- and inter-rater reliability, twenty percent of data were selected randomly and repeat judgments were done by the primary rater and a second rater. All parameters namely, overall severity, roughness, breathiness, strain, pitch, and loudness were evaluated by the raters in the sound-treated room using a headphone.

Data analysis

All statistical analyses were performed with the Statistical Package for the Social Sciences (SPSS Inc., Chicago, IL) software version 25. Demographic data and clinical characteristics of the participants were analyzed descriptively. Mann–Whitney U test was used to compare age, BMI, acoustic voice parameters (mean F0, jitter local, shimmer local, HNR, and CPP), durational measurements (MPT and s/z ratio), patient-reported outcomes (VHI-10 and V-RQOL), and auditory-perceptual evaluation results between the groups. After the comparisons between the study and the control group, gender-based comparisons were performed to obtain gender-specific values of the related voice parameters. The Spearman correlation analysis was performed to determine the correlations between the clinical characteristic of the participants and the measured outcomes. Two-way random effects, absolute agreement, and intraclass correlation coefficients (ICC) were used to calculate intra- and inter-rater reliabilities for all CAPE-V parameters (overall severity, roughness, breathiness, strain, pitch, and loudness). The ICC results were interpreted as follows: if ≥ 0.90, excellent; if between 0.75 and 0.90, good; if between 0.50 and 0.75, moderate; and if < 0.50, poor [33]. A p value of < 0.05 was considered statistically significant.

Results

Comparisons of acoustic voice parameters between the study group (n = 40) and the control group (n = 40) did not significantly differ for mean F0 (p = 0.686), jitter local (p = 0.202), shimmer local (p = 0.268), HNR (p = 0.482), and CPP (p = 0.874). Among the durational measurements statistically significant difference was found for MPT (p = 0.020), but s/z ratio did not differ between the groups (p = 0.201). VHI-10 scores were significantly higher in the study group (mean ± sd; 2.48 ± 3.81, range; 0–16) than the control group (mean ± sd; 0.95 ± 2.40, range; 0–10). Significantly higher scores for V-RQOL were also observed for the study group (mean ± sd; 12.68 ± 3.56, range; 10–27) than the control group (mean ± sd; 10.95 ± 2.45, range; 10–24). The p values for VHI-10 and V-RQOL scores were obtained 0.002 and < 0.001, respectively. Table 2 provides more detailed data of measured outcomes.

Table 2 Comparisons of acoustic voice parameters, durational measurements, and scores of the questionnaires between the groups

Gender-based comparisons were conducted for further analyses of acoustic voice parameters, durational measurements, and scores of the questionnaires. The findings showed that there was no statistically significant difference between the males in the study group and the males in the control group for any parameter. Whereas, comparisons between the female participants showed significantly lower scores for MPT (p = 0.038), significantly higher scores were obtained for both VHI-10 (p < 0.001) and V-RQOL (p = 0.001) questionnaires in the study group than in the control group. More information about the parameters is given in Table 3.

Table 3 Gender-based comparisons of acoustic voice parameters, durational measurements, and scores of the questionnaires between the groups

Intra-rater reliability was found ‘excellent’ for overall severity and roughness, ‘good’ for breathiness, strain, pitch, and loudness parameters of CAPE-V. Inter-rater reliability was found ‘excellent’ for overall severity and loudness, ‘good’ for roughness, breathiness, and strain, and ‘moderate’ for pitch parameters of CAPE-V. ICC values and interpretation are defined in Table 4.

Table 4 Intra-rater and inter-rater agreement for the scores of CAPE-V

Comparisons of CAPE-V parameters revealed statistically higher scores in the study group for overall severity (p = 0.023), but no significant differences were found for the other parameters between the groups. When gender-based comparisons were conducted for the parameters, the results only showed a statistically significant difference for overall severity parameter in the male participants with a higher score in the study group (mean ± sd; 4.60 ± 11.3, range; 0–50 versus mean ± sd; 0.25 ± 1.11, range; 0–5). Comparisons between the female participants showed no significant differences for any parameters. Comparisons of CAPE-V parameters between the groups are detailed in Tables 5 and 6.

Table 5 Comparisons of CAPE-V parameters between the groups
Table 6 Gender-based comparisons of CAPE-V parameters

The Spearman analyses revealed significant correlations of both questionnaires with the symptom scores during COVID-19 infection. The score of VHI-10 was significantly correlated with coughing (r = 0.442, p = 0.004), difficulty in breathing (r = 0.402, p = 0.010), tiredness (r = 0.529, p < 0.001), and pain (r = 0.314, p = 0.049). Furthermore, the score of V-RQOL was significantly correlated with age (r = 0.368, p = 0.020), difficulty in breathing (r = 0.412, p = 0.008), tiredness (r = 0.466, p < 0.002), and difficulty in speaking (r = 0.329, p < 0.038). The results of correlation analyses for the scores of questionnaires are presented in Table 7. However, no statistically significant correlation of age, BMI, time after infection, time of negative PCR, and symptom scores was found with the other voice-related evaluations (acoustic variables, durational measurements, and CAPE-V parameters).

Table 7 Correlations between the characteristics of recovering patients and the scores of questionnaires

Discussion

This study investigated the voice-related parameters in the patients who recovered from COVID-19 disease to compare outcomes with a group of non-infected healthy control cohort. Although several studies have been conducted to reveal the effect of the disease on the human voice, to the best of our knowledge, this is the first study considering this issue in a wide range of assessments including acoustic voice analyses, durational measurements, self-reported outcomes, auditory-perceptual judgments, and clinical symptoms of the subjects.

Although the previous studies have investigated the relationship between COVID-19 and voice-related parameters [18,19,20,21], acoustic analyses were discussed only in one study [21]. Asiaee et al. compared the acoustic voice results of 64 patients with 70 healthy participants and stated significant differences in CPP, HNR, H1H2, F0SD, jitter, and shimmer values. However, in our cohort, mean F0, jitter local, shimmer local, HNR, and CPP parameters did not show any difference between the study group and the control group. Furthermore, correlation analysis did not reveal any relationship between the symptom scores during COVID-19 infection and acoustic variables. The logical explanation of the difference between the studies is that all patients included in the previous study were actively infected patients at the time of enrollment. However, in the present study, our study group consisted of individuals who had a previous history of COVID-19 and recovered from the disease. As such, the mean time passed after infection was 8.2 ± 4.3 months (range, 1–17 months) in the study group. Therefore, it could be said that the acoustic voice characteristics of the patients might deteriorate only during the exacerbation of the disease.

Among the durational parameters, only MPT differed between the groups with a lower score in the study group, but it was not related to any of the symptom scores during COVID-19 infection and the time passed after infection. Previous studies have also described a reduction of MPT in COVID-19 patients [18, 21]. Our results combined with prior research indicate that the disease can affect the respiratory system and decrease the duration of phonation in this patient population, which is more evident in females.

In a previous study, the VHI-10 questionnaire was used for patient-reported vocal outcomes and it was stated that the total score was higher in patients with COVID-19, as we found [19]. However, the average total VHI-10 score was found to be 7.31 ± 6.67 which was relatively high from our result (2.48 ± 3.81). We also used the V-RQOL questionnaire and the results showed us that COVID-19 caused a reduction in voice-related quality of life in recovering patients. The possible reason for this difference between the studies may be due to the time of evaluation. While the previous study was conducted with the active COVID-19 patients, our study was conducted with the subjects who recovered from the disease. It is known that the symptoms encountered during the respiratory tract infections such as cough, sore throat, nasal congestion, and rhinorrhea alter the basic characteristics of voice [34]. In our cohort of recovering patients, significant correlations were found especially between the symptom scores during COVID-19 infection and the scores of both VHI-10 and V-RQOL questionnaires. We therefore clearly conclude that exacerbation of the symptoms causes an increase in self-reported vocal complaints and patients with COVID-19 are more likely to have deterioration in voice quality after the infection period.

In the current study, we used the CAPE-V procedure for auditory-perceptual evaluation. All parameters (overall severity, roughness, breathiness, strain, pitch, and loudness) were used for comparisons between the study and the control group. Except for the overall severity (p = 0.023), there were no significant differences for the other parameters. Our auditory-perceptual evaluation results are comparable to those from previous studies [18,19,20]. For example, Saki et al. [18] aimed to investigate the auditory-perceptual characteristics of patients with different severities of COVID-19. In the study, they only used the overall severity parameter of CAPE-V and found significant differences between the subgroups of patients and their healthy counterparts. The overall severity scores of patients with mild, moderate, and severe COVID-19 were reported as 24.55 ± 5.49, 46.31 ± 5.99, and 64.11 ± 6.23, respectively. In another study by Tahir et al. [19], all CAPE-V parameters were reported significantly higher in patients than in healthy controls except for the strain and the pitch parameters. In the study, the mean overall severity score was found 20.29 ± 15.74 which indicates a mild to moderate deviance in voice quality [26]. Compared to the prior results, the findings of the current study revealed a relatively lower score for auditory-perceptual evaluation (2.63 ± 8.43). Because we found a significant difference between the recovering patients and healthy controls in a long-term period after infection, it is possible to conclude that even after the infection the overall severity of dysphonia persists in patients who recovered from COVID-19 disease.

Conclusion

As far as we know, this is the first study that evaluates all voice-related parameters namely, acoustic voice analyses, durational measurements, patient-reported outcomes, and auditory-perceptual evaluations together in a cohort of patients who recovered from COVID-19 disease. The findings showed us that the recovering patients had significantly lower MPT, increased VHI-10 scores, decreased voice-related quality of life based on the V-RQOL questionnaire, and a mild amount of voice deviancy in auditory-perceptual evaluation. However, a similar trend was not observed in acoustic voice analyses between the study and the control group. Correlation analyses showed that symptom scores of the disease have a close relationship with self-reported voice complaints. Since the current study was conducted with the patients who had a prior history of COVID-19, these results suggest that the target population may experience various vocal impairments after the infection period. However, because of the small sample size and a relatively short time of follow-up, the results should be interpreted with caution. Thus, further studies investigating the effect of COVID-19 on human voice with a larger cohort and longer follow-up period are needed to specify the present findings.