Exploring the feasibility of smart phone microphone for measurement of acoustic voice parameters and voice pathology screening
- 411 Downloads
The objective of this study is to evaluate the reliability of acoustic voice parameters obtained using smart phone (SP) microphones and investigate the utility of use of SP voice recordings for voice screening. Voice samples of sustained vowel/a/obtained from 118 subjects (34 normal and 84 pathological voices) were recorded simultaneously through two microphones: oral AKG Perception 220 microphone and SP Samsung Galaxy Note3 microphone. Acoustic voice signal data were measured for fundamental frequency, jitter and shimmer, normalized noise energy (NNE), signal to noise ratio and harmonic to noise ratio using Dr. Speech software. Discriminant analysis-based Correct Classification Rate (CCR) and Random Forest Classifier (RFC) based Equal Error Rate (EER) were used to evaluate the feasibility of acoustic voice parameters classifying normal and pathological voice classes. Lithuanian version of Glottal Function Index (LT_GFI) questionnaire was utilized for self-assessment of the severity of voice disorder. The correlations of acoustic voice parameters obtained with two types of microphones were statistically significant and strong (r = 0.73–1.0) for the entire measurements. When classifying into normal/pathological voice classes, the Oral-NNE revealed the CCR of 73.7 % and the pair of SP-NNE and SP-shimmer parameters revealed CCR of 79.5 %. However, fusion of the results obtained from SP voice recordings and GFI data provided the CCR of 84.60 % and RFC revealed the EER of 7.9 %, respectively. In conclusion, measurements of acoustic voice parameters using SP microphone were shown to be reliable in clinical settings demonstrating high CCR and low EER when distinguishing normal and pathological voice classes, and validated the suitability of the SP microphone signal for the task of automatic voice analysis and screening.
KeywordsAcoustic analysis Voice screening Smart phone
This study was supported by grant VP1-3.1- ŠMM-10-V-02-030 from the Ministry of Education and Science of Republic of Lithuania.
Compilance with ethical standards
Conflict of interest
No conflicts of interest to declare.
- 4.Cohen SM, Kim J, Roy N, Courey M (2014) Delayed otolaryngology referral for voice disorders increases health care costs. Am J Med 128:11–18Google Scholar
- 5.Dejonckere PH, Bradley P, Clemente P, Cornut G, Crevier-Buchman L, Friedrich G et al (2001) A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur Arch Otorhinolaryngol 258:77–82CrossRefPubMedGoogle Scholar
- 19.Elliott AC, Woodward WA (2007) Statistical analysis quick reference guidebook: with SPSS examples. Sage Publications, New YorkGoogle Scholar
- 22.Brümmer N, de Villiers E (2013) The BOSARIS toolkit: Theory, algorithms and code for surviving the new dcf. ArXiv Preprint ArXiv 1304.2865Google Scholar