Acoustical recognition of laryngeal pathology: a comparison of two strategies based on sets of features
- 46 Downloads
The efficiency of sets of acoustical features discriminating pathological voices from control voices is reported. Two strategies were compared. The first (called the ‘distance strategy’) was built upon a statistical distance of voice features to reference values obtained for a set of healthy (reference) voices. The second strategy (called the ‘range strategy’) is based on the position inside or outside normal ranges established from a reference population; results based on this strategy were presented in a previous paper. Reference values were calculated from a database of 200 healthy voices distributed into 10-year age groups ranging from 20 to 70. Comparisons were made using a second database of 220 voices, including 65 control, 51 functional dysphonia, 50 with nodules on the vocal folds and 54 recurrent nerve palsy. The phonetic material was compared of 17 French vowels: 11 vowels in a sentence, three isolated vowels and three segments (beginning, middle and end) of the sustained vowel/a/. Four acoustical features were considered for each vowel: the voice fundamental (f0) and the first three formant frequencies. Acoustical features were calculated on an ILS (Interactive Laboratory System) analysis system (workstation). The separation of each pathological group from the control group, using sets of acoustical features, was statistically assessed. From the strategy point of view, results indicated that (i) the fundamental frequency f0 was the best measure to separate normal from pathological voices with the distance strategy; (ii) when the formants were taken, the range strategy performed better in separating the voices. For classification of pathologies, the best separation coefficients were obtained with nodules and the worst with recurrent nerve palsy. Overall, it was seen that the separation between control and pathological voices was most efficient when measured using the distance strategy for f0. The range strategy was useful with formant frequencies.
KeywordsVoice Laryngeal pathology Fundamental frequency Formants Vowels
Unable to display preview. Download preview PDF.
- Choi, S. E., Kim, H. N., andKim, G. R. (1980): ‘The medicosonagraphic study of Korean hoarseness due to laryngeal pathology’,J. Med. Sci.,13, p. 82Google Scholar
- Holst, M., Hertegard, S., andPersson, A. (1990): ‘Vocal dysfunction following cricothyroidotomy: A prospective study’,Laryngoscope,100, pp. 749–755Google Scholar
- Kwang-Moon, K., Yuki, K., andMinoru, H. (1982): ‘Sound spectrographic analysis of the voice of patients with recurrent nerve laryngeal paralysis’,Folia Phoniatrica,34, pp. 124–133Google Scholar
- Perrin, E., Berger-Vachon, C., Le Dissez, C., andMorgon, A. (1994): ‘The voice of cochlear implanted children’,Adv. O.R.L.,50, pp. 167–173Google Scholar
- Perrin, E., Collet, L., andBerger-Vachon, C. (1996): ‘Influence of the vocalic context on the acoustical recognition of voice impairment in laryngeal pathology’,Innov. Tech. Biol. Med.,17, pp. 505–515Google Scholar
- Saporta, G. (1988): ‘Probabilités, analyse des données et statistique’, Technips EdsGoogle Scholar