Abstract
This study examined the effect of native language background on listeners’ perception of native and non-native vowels spoken by native (Hong Kong Cantonese) and non-native (Mandarin and Australian English) speakers. They completed discrimination and an identification task with and without visual cues in clear and noisy conditions. Results indicated that visual cues did not facilitate perception, and performance was better in clear than in noisy conditions. More importantly, the Cantonese talker’s vowels were the easiest to discriminate, and the Mandarin talker’s vowels were as intelligible as the native talkers’ speech. These results supported the interlanguage speech native intelligibility benefit patterns proposed by Hayes-Harb et al. (J Phonetics 36:664–679, 2008). The Mandarin and English listeners’ identification patterns were similar to those of the Cantonese listeners, suggesting that they might have assimilated Cantonese vowels to their closest native vowels. In addition, listeners’ perceptual patterns were consistent with the principles of Best’s Perceptual Assimilation Model (Best in Speech perception and linguistic experience: issues in cross-language research. York Press, Timonium, 1995).


Similar content being viewed by others
References
Bauer, R. S., & Benedict, P. K. (1997). Modern Cantonese phonology. Berlin and New York: Mouton de Gruyter.
Benoit, C., Mohamadi, T., & Kandel, S. (1994). Effects of phonetic context on audiovisual intelligibility of French. Journal of Speech and Hearing Research, 37(5), 1195–1203.
Bent, T., & Bradlow, A. R. (2003). The interlanguage speech intelligibility benefit. The Journal of Acoustical Society of America, 114(3), 1600–1610.
Best, C. T. (1995). A direct realist view of cross-langauge speech perception. In W. Strange (Ed.), Speech perception and linguistic experience: Issues in cross-language research (pp. 171–232). Timonium, MD: York Press.
Boersma, P. (2001). Praat, a system for doing phonetics by computer. Glot International, 5(9/10), 341–345.
Campbell, R., Dodd, B., & Burnham, D. (Eds.). (1998). Hearing by eye II: Advances in the psychology of speechreading and auditory-visual speech. Hove, UK: Psychology Press.
Chen, T. H., & Massaro, D. W. (2004). Mandarin speech perception by ear and eye follows a universal principle. Perception & Psychophysics, 66(5), 820–836.
Chen, Y., & Hazan, V. (2007). Language effects on the degree of visual influence in audiovisual speech perception. Germany: Saarbrueken.
Davis, C., & Kim, J. (2004). Audio-visual interactions with intact clearly audible speech. Quaterly Journal of Experimental Psychology, 57A, 1103–1121.
Dodd, B., & Campbell, R. (Eds.). (1987). Hearing by eye: The psychology of lip-reading. Hillsdale, NJ: Lawrence Erlbaum Associates.
Erber, N. P. (1969). Interaction of audition and vision in the recognition of oral speech stimuli. Journal of Speech and Hearing Research, 12(2), 423–425.
Flege, J. E. (1995). Second language speech learning: Theory, findings, and problems. In W. Strange (Ed.), Speech perception and linguistic experience: Issues in cross-language research (pp. 233–277). Timonium, MD: York Press.
Flege, J., Bohn, O.-S., & Jang, S. (1997). The effect of experience on nonnative subjects’ production and perception of English vowels. Journal of Phonetics, 25, 437–470.
Gilbert, G., & Micheyl, C. (2005). Influence of competing multi-talker babble on frequency-importance functions for speech measured using a correlational approach. ACTA Acustica United with Acustica, 91, 145–154.
Harrington, J., Cox, F., & Evans, Z. (1997). An acoustic phonetic study of broad, general, and cultivated Australian English vowels. Australian Journal of Linguistics, 17(2), 155–184.
Hayes-Harb, R., Smith, B. L., Bent, T., & Bradlow, A. R. (2008). The interlanguage speech intelligibility benefit for native speakers of Mandarin: Production and perception of English word-final voicing contrasts. Journal of Phonetics, 36, 664–679.
Hazan, V., Kim, J., & Chen, Y. (2010). Audiovisual perception in adverse conditions: Language, speaker and listener effects. Speech Communication, 52(11–12), 996–1009.
Ingram, J. C. L., & Park, S.-G. (1997). Cross-language vowel perception and production by Japanese and Korean learners of English. Journal of Phonetics, 25, 343–370.
Iverson, P., Hazan, V., & Bannister, K. (2005). Phonetic training with acoustic cue manipulations: A comparison of methods for teaching English /r/-/l/ to Japanese adults. The Journal of Acoustical Society of America, 118(5), 3267–3278.
Ladefoged, P., & Maddieson, I. (1996). The Sounds of the World’s Languages. Oxford: Blackwell.
Levy, E. S., & Strange, W. (2008). Perception of French vowels by American English adults with and without French language experience. Journal of Phonetics, 36(1), 141–157.
MacLeod, A., & Summerfield, A. Q. (1987). Quantifying the contribution of vision to speech perception in noise. British Journal of Audiology, 21, 131–141.
Markham, D., & Hazan, V. (2002). Speaker intelligibility of adults and children. In Proceedings of the international conference for spoken language processing (pp. 1685–1688). Denver.
McCandliss, B. D., Fiez, J. A., Protopapas, A., Conway, M., & McClelland, J. L. (2002). Success and failure in teaching the [r]-[l] contrast to Japanese adults: Tests of a Hebbian model of plasticity and stabilization in spoken language perception. Cognitive, Affective, & Behavioral Neuroscience, 2(2), 89–108.
McGurk, H., & MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264, 746–748.
Milborrow, S., & Nicolls, F. (2008). Locating facial features with an extended active shape model. In P. T. David Forsyth & Andrew Zisserman (Eds.), Proceedings of European conference on computer vision (pp. 504–513). Heidelberg, Berlin: Springer.
Miyawaki, K., Strange, W., Verbrugge, R., Liberman, A. M., Jenkins, J. J., & Fujimura, O. (1975). An effect of linguistic experience: The discrimination of [r] and [l] by native speakers of Japanese and English. Perception and Psychophysics, 18, 331–340.
Navarra, J., & Soto-Faraco, S. (2007). Hearing lips in a second language: Visual articulatory information enables the perception of second language sounds. Psychological Research, 71(1), 4.
Ortega-Llebaria, M., Faulkner, A., & Hazan, V. (2001). Auditory-visual L2 speech perception: Effects of visual cues and acoustic-phonetic context for Spanish learners of English. Speech, Hearing and Language: Work in progress, 13, 39–51.
Parikh, G., & Loizou, P. C. (2005). The influence of noise on vowel and consonant cues. The Journal of Acoustical Society of America, 118(6), 3874–3888.
Polka, L. (1991). Cross-language speech perception in adults: Phonemic, phonetic, and acoustic contributions. The Journal of Acoustical Society of America, 89(6), 2961–2977.
Polka, L. (1992). Characterizing the influence of native language experience on adult speech perception. Perception & Psychophysics, 52(1), 37–52.
Polka, L. (1995). Linguistic influences in adult perception of non-native vowel contrasts. The Journal of the Acoustical Society of America, 97(2), 1286–1296.
Polka, L., Escudero, P., & Matchett, S. (2002). Assimilation and discrimination of Canadian French vowels by English-speaking adults. The Journal of Acoustical Society of America, 112(5), 2250.
Ross, L. A., Saint-Amour, D., Leavitt, V. M., Javitt, D. C., & Foxe, J. J. (2007). Do you see what I am saying? Exploring visual enhancement of speech comprehension in noisy environments. Cerebral Cortex, 17(5), 1147–1153.
So, C. K., & Best, C. T. (2010). Cross-language perception of non-native tonal contrasts: Effects of native phonological and phonetic influences. Language and Speech, 53(2), 273–293.
Strange, W. (Ed.). (1995). Speech perception and linguistic experience: Issues in cross-language research. Timonium, MD: York Press.
Sumby, W. H., & Pollack, I. (1954). Visual contribution to speech intelligibility in noise. The Journal of the Acoustical Society of America, 26(2), 212–215.
Wang, Y., Behne, D. M., & Jiang, H. (2008). Linguistic experience and audio-visual perception of non-native fricatives. The Journal of the Acoustical Society of America, 124(3), 1716–1726.
Werker, J. F., & Tees, R. C. (1984). Phonemic and phonetic factors in adult cross-language speech perception. The Journal of Acoustical Society of America, 75, 1866–1878.
Wikipedia. (2012, 10 February 2012). Chinese vowel diagram Retrieved 17 August 2012.
Acknowledgments
This study was supported by the Australian Research Council Discovery Project Grant (DP0880654) to the first author. We thank Haisam Haidar, Meerab Jacob, Shaheen Pordily, and Lee-Anne Michael for their assistance with data collection. We thank Amanda Reid for proofreading.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
So, C.K., Attina, V. Cross-Language Perception of Cantonese Vowels Spoken by Native and Non-native Speakers. J Psycholinguist Res 43, 611–630 (2014). https://doi.org/10.1007/s10936-013-9270-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10936-013-9270-6

