Abstract
This study examined the correlation of, and agreement between, cepstral peak prominence (CPP) measures obtained from three acoustic analysis programs: Analysis of Dysphonia in Speech and Voice (ADSV), SpeechTool, and VoiceSauce. Voice data recorded from sustained /a/ vowel and connected speech of two cohorts of vocally healthy female participants were analysed using program default settings to measure smoothed CPP (CPPS) in ADSV, CPPS and CPP in SpeechTool, and CPP in VoiceSauce. Intraclass correlation coefficients, linear regression, and Bland–Altman plots were used for testing the correlation and agreement between these programs. There was good correlation between ADSV and SpeechTool with respect to vowel CPPS in both cohorts. Connected speech CPPS from these two programs showed moderate correlation in cohort 1 and good correlation in cohort 2. CPP values obtained from VoiceSauce were highly correlated with those from SpeechTool in both tasks. Bland–Altman plots showed that there were differences between programs in CPPS and CPP values. While CPPS and CPP values from these programs were correlated, they did not show absolute agreement. This implied possible different thresholds of detecting dysphonic severity across different acoustic analysis programs.
Similar content being viewed by others
References
Helou, L.B., Solomon, N.P., Henry, L.R., Coppit, G.L., Howard, R.S., Stojadinovic, A.: The role of listener experience on Consensus Auditory-perceptual Evaluation of Voice (CAPE-V) ratings of postthyroidectomy voice. Am. J. Speech Lang. Pathol. 19(3), 248–258 (2010). https://doi.org/10.1044/1058-0360(2010/09-0012)
Oates, J.: Auditory-perceptual evaluation of disordered voice quality. Folia Phoniatr. Logop. 61(1), 49–56 (2009)
Maryn, Y., Roy, N., De Bodt, M., Van Cauwenberge, P., Corthals, P.: Acoustic measurement of overall voice quality: a meta-analysis. J. Acoust. Soc. Am. 126(5), 2619–2634 (2009). https://doi.org/10.1121/1.3224706
Dejonckere, P.H., Bradley, P., Clemente, P., Cornut, G., Crevier-Buchman, L., Friedrich, G., Van De Heyning, P., Remacle, M., Woisard, V.: A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Eur. Arch. Otorhinolaryngol. 258(2), 77–82 (2001). https://doi.org/10.1007/s004050000299
Awan, S.N., Roy, N., Jetté, M.E., Meltzner, G.S., Hillman, R.E.: Quantifying dysphonia severity using a spectral/cepstral-based acoustic index: comparisons with auditory-perceptual judgements from the CAPE-V. Clin. Linguist. Phon. 24(9), 742–758 (2010)
Frohlich, M., Michaelis, D., Strube, H.W., Kruse, E.: Acoustic voice analysis by means of the hoarseness diagram. J. Speech Lang. Hearing Res. 43(3), 706–720 (2000)
Halberstam, B.: Acoustic and perceptual parameters relating to connected speech are more reliable measures of hoarseness than parameters relating to sustained vowels. J. Oto Rhino Laryngol. Head Neck Surg. 66(2), 70–73 (2004)
Gorham-Rowan, M.M., Laures-Gore, J.: Acoustic-perceptual correlates of voice quality in elderly men and women. J. Commun. Disord. 39(3), 171–184 (2006)
Hillenbrand, J., Cleveland, R.A., Erickson, R.L.: Acoustic correlates of breathy vocal quality. J. Speech Hearing Res. 37(4), 769–778 (1994)
Mathew, M.M., Bhat, J.S.: Soft phonation index—a sensitive parameter? Indian J. Otolaryngol. Head Neck Surg. 61(2), 127–130 (2009). https://doi.org/10.1007/s12070-009-0050-4
Akif Kiliç, M., Ögüt, F., Dursun, G., Okur, E., Yildirim, I., Midilli, R.: The effects of vowels on voice perturbation measures. J. Voice 18(3), 318–324 (2004)
Bielamowicz, S., Kreiman, J., Gerratt, B.R., Dauer, M.S., Berke, G.S.: Comparison of voice analysis systems for perturbation measurement. J. Speech Hearing Res. 39(1), 126–134 (1996)
Godino-Llorente, J.I., Osma-Ruiz, V., Saenz-Lechon, N., Cobeta-Marco, I., Gonzalez-Herranz, R., Ramirez-Calvo, C.: Acoustic analysis of voice using WPCVox: a comparative study with multi dimensional voice program. Eur. Arch. Otorhinolaryngol. 265(4), 465–476 (2008)
Heman-Ackah, Y.D., Michael, D.D., Goding Jr., G.S.: The relationship between cepstral peak prominence and selected parameters of dysphonia. J. Voice 16(1), 20–27 (2002)
Qi, Y., Hillman, R.E.: Temporal and spectral estimations of harmonics-to-noise ratio in human voice signals. J. Acoust. Soc. Am. 102, 537 (1997)
Hillenbrand, J., Houde, R.A.: Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J. Speech Hearing Res. 39(2), 311–321 (1996)
Awan, S.N., Roy, N.: Outcomes measurement in voice disorders: application of an acoustic index of dysphonia severity. J. Speech Lang. Hearing Res. 52(2), 482–499 (2009)
Lowell, S.Y., Colton, R.H., Kelley, R.T., Hahn, Y.C.: Spectral- and cepstral-based measures during continuous speech: capacity to distinguish dysphonia and consistency within a speaker. J. Voice 25(5), e223–e232 (2011). https://doi.org/10.1016/j.jvoice.2010.06.007
Peterson, E.A., Roy, N., Awan, S.N., Merrill, R.M., Banks, R., Tanner, K.: Toward validation of the cepstral spectral index of dysphonia (CSID) as an objective treatment outcomes measure. J. Voice 27(4), 401–410 (2013). https://doi.org/10.1016/j.jvoice.2013.04.002
Boersma, P., Weenink, D.: Praat: doing phonetics by computer [Computer program].http://www.praat.org/ (2011). Accessed 07 Sept 2011
Kay Elemetrics: Multi-dimensional voice program (MDVP) [Computer program]. Kay Elemetrics, Pine Brook (1993)
Elisei, N.G.: Acoustic analysis of normal and pathological voices using two different systems: anagraf and praat (Análisis acústico de la voz normal y patológica utilizando dos sistemas diferentes: anagraf y praat). Interdisciplinaria 29(2), 339–357 (2012)
Hillenbrand, J.M.: SpeechTool [Computer Program]. http://homepages.wmich.edu/~hillenbr (2002). Accessed 24 Apr 2012
PentaxMedical: Analysis of dysphonia in speech and voice—ADSV. https://www.pentaxmedical.com/pentax/en/99/1/Analysis-of-Dysphonia-in-Speech-and-Voice-ADSV. Accessed Mar 2018
Shue, Y.L.: VOICESAUCE: a program for voice analysis. J. Acoust. Soc. Am. 126(4), 2221 (2009)
Shue, Y.-L., Keating, P., Vicenik, C., Yu, K.: VoiceSauce: a program for voice analysis. In: 17th International Congress of Phonetic Sciences, pp. 1846–1849 (2011)
Maryn, Y., Weenink, D.: Objective dysphonia measures in the program Praat: smoothed cepstral peak prominence and acoustic voice quality index. J. Voice 29(1), 35–43 (2015). https://doi.org/10.1016/j.jvoice.2014.06.015
Watts, C.R., Awan, S.N., Maryn, Y.: A comparison of cepstral peak prominence measures from two acoustic analysis programs. J Voice 31(3), e1–e10 (2017). https://doi.org/10.1016/j.jvoice.2016.09.012
Sauder, C., Bretl, M., Eadie, T.: Predicting voice disorder status from smoothed measures of cepstral peak prominence using praat and analysis of dysphonia in speech and voice (ADSV). J. Voice 31(5), 557–566 (2017). https://doi.org/10.1016/j.jvoice.2017.01.006
Shue, Y.L., Keating, P., Vicenik, C.: VOICESAUCE. In: p. Program available online at http://www.seas.ucla.edu/spapl/voicesauce/. UCLA (2009). Accessed 21 April 2015
Awan, S.N., Giovinco, A., Owens, J.: Effects of vocal intensity and vowel type on cepstral analysis of voice. J Voice 26(5), e615–e620 (2012). https://doi.org/10.1016/j.jvoice.2011.12.001
Shue, Y.-L.: The voice source in speech production: data, analysis and models. University of California, Los Angeles (2010)
Bland, J.M., Altman, D.G.: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 327(8476), 307–310 (1986)
Koo, T.K., Li, M.Y.: A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J. Chiropr. Med. 15(2), 155–163 (2016). https://doi.org/10.1016/j.jcm.2016.02.012
Adobe Systems Inc. https://www.adobe.com/au/products/audition.html?sdid=V6NZKW5P&mv=search&ef_id=WjoC_gAAAHySFHNG:20180516063911:s. Accessed Mar 2018
Fairbanks, G.: Voice and Articulation Drillbook, 2nd edn. Harper & Row, New York (1960)
Shrout, P.E., Fleiss, J.L.: Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. 86(2), 420–428 (1979)
Gravetter, F.J., Wallnau, L.B.: Statistics for the Behavioral Sciences, 4th edn. West Publishing Company, St. Paul (1996)
Kim, G.H., Lee, Y.W., Park, H.J., Bae, I.H., Kwon, S.B.: A study of cepstral peak prominence characteristics in ADSV, SpeechTool and Praat. J Speech Hearing Disord (in Korean) 26(3), 99–111 (2017)
Bland, J.M., Altman, D.G.: A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement. Comput. Biol. Med. 20(5), 337–340 (1990)
Heman-Ackah, Y.D., Sataloff, R.T., Laureyns, G., Lurie, D., Michael, D.D., Heuer, R., Rubin, A., Eller, R., Chandran, S., Abaza, M., Lyons, K., Divi, V., Lott, J., Johnson, J., Hillenbrand, J.: Quantifying the cepstral peak prominence, a measure of dysphonia. J. Voice 28(6), 783–788 (2014). https://doi.org/10.1016/j.jvoice.2014.05.005
Brinca, L.F., Batista, A.P., Tavares, A.I., Goncalves, I.C., Moreno, M.L.: Use of cepstral analyses for differentiating normal from dysphonic voices: a comparative study of connected speech versus sustained vowel in European Portuguese female speakers. J. Voice 28(3), 282–286 (2014). https://doi.org/10.1016/j.jvoice.2013.10.001
Baken, R.J., Orlikoff, R.F.: Clinical Measurement of Speech and Voice, 2nd edn. Singular Publishing Group, San Diego (2000)
Ladefoged, P.: A Course in Phonetics, 3rd edn. Harcourt Brace Jovanovich Inc, Fort Worth (1993)
Acknowledgements
The authors would like to thank the Australian Acoustical Society for funding this research. Thanks also to Professor Shaheen Awan for provision of detailed information and support regarding using of the ADSV and to Tara Cliffe for her assistance with editing and analysis of the data. We would also like to acknowledge the support of the Dr Liang Voice Program at The University of Sydney.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Madill, C., Nguyen, D.D., Eastwood, C. et al. Comparison of Cepstral Peak Prominence Measures Using the ADSV, SpeechTool, and VoiceSauce Acoustic Analysis Programs in Vocally Healthy Female Speakers. Acoust Aust 46, 215–226 (2018). https://doi.org/10.1007/s40857-018-0139-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40857-018-0139-6