Abstract
When the distinctive formant transition of a synthetic syllable is presented to one ear while the remainder (the “base”) is presented to the opposite ear, listeners report hearing the original syllable in the ear receiving the base—a phenomenon called “spectral/temporal fusion” by Cutting (1976). We have found that the mere onset (i.e., the first pitch pulse, 10 msec in duration) of an isolated, contralateral third-formant (F3) transition can be sufficient to cue the /da/-/ga/ distinction in this way. We also varied the relative onset times of isolated F3 and base and compared three types of F3 segments (50-msec time-varying, 50-msec constant, 10-msec onset) under both dichotic and diotic presentation. Time-varying F3 segments were superior to constant ones, especially when they lagged behind the base. Diotic performance exceeded dichotic performance, but only when F3 preceded the base, suggesting that upward spread of masking occurred in diotic presentation when F3 coincided with energy in the lower formants. Perhaps most interestingly, subjects’ tolerance of temporal asynchrony (roughly ±50 msec) was about the same in dichotic and diotic conditions, suggesting that the temporal integration mechanism that combines phonetic information from the isolated F3 segment and the base operates similarly in both conditions.
Article PDF
Similar content being viewed by others
References
Bentin, S., &Mann, V. A. (1983). Selective effects of masking on speech and nonspeech in the duplex perception paradigm.Haskins Laboratories Status Report on Speech Research,SR-76, 65–85.
Broadbent, D. E. (1955). A note on binaural fusion.Quarterly Journal of Experimental Psychology,7, 46–47.
Broadbent, D. E., &Ladefoged, P. (1957). On the fusion of sounds reaching different sense organs.Journal of the Acoustical Society of America,29, 708–710.
Cutting, J. E. (1976). Auditory and linguistic processes in speech perception: Inferences from six fusions in dichotic listening.Psychological Review,83, 114–140.
Danaher, E. M., &Pickett, J. M. (1975). Some masking effects produced by low-frequency vowel formants in persons with sensorineural hearing loss.Journal of Speech and Hearing Research,18, 261–271.
Darwin, C. J., Howell, P., &Brady, S. A. (1978). Laterality and localization: A right ear advantage for speech heard on the left. In J. Requin (Ed.),Attention and performance VII. Hillsdale, NJ: Erlbaum.
Liberman, A. M. (1979). Duplex perception and integration of cues: Evidence that speech is different from nonspeech and similar to language. In E. Fischer-Jørgensen, J. Rischel, & N. Thorsen (Eds.),Proceedings of the IXth International Congress of Phonetic Sciences (Vol. 2). Copenhagen: University of Copenhagen.
Liberman, A. M. (1982). On finding that speech is special.American Psychologist,37, 148–167.
Liberman, A. M., Isenberg, D., &Rakerd, B. (1981). Duplex perception of cues for stop consonants: Evidence for a phonetic mode.Perception & Psychophysics,30, 133–143.
Mann, V. A., &Liberman, A. M. (1983). Some differences between phonetic and auditory modes of perception.Cognition,14, 211–235.
Nabelek, I. V., Nabelek, A. K., &Hirsh, I. J. (1970) Pitch of tone bursts of changing frequency.Journal of the Acoustical Society of America,48, 536–553.
Nearey, T. M., &Levitt, A. G. (1974). Evidence for spectral fusion in dichotic release from upward spread of masking.Haskins Laboratories Status Report on Speech Research,SR-39/40, 81–89.
Nusbaum, H. C., Schwab, E. C., &Sawusch, J. R. (1983). The rote of “chirp“ identification in duplex perception.Perception & Psychophysics,33, 323–332.
Nye, P. W., Nearey, T. M., &Rand, T. C. (1974). Dichotic release from masking: Further results from studies with synthetic speech stimuli.Haskins Laboratories Status Report on Speech Research,SR37/38, 123–137.
Pastore, R. E., Schmuckler, M. A., Rosenblum, L., &Szczesiul, R. (1983) Duplex perception with musical stimuli.Perception & Psychophysics,33, 469–474.
Pastore, R. E., Szczesiul, R., Rosenblum, L. D., &Schmuckler M. A. (1982). When is a [p] a [t], and when is it not.Journal of the Acoustical Society of America,72 (Supplement No 1), S16. (Abstract)
Rand, T. C. (1974). Dichotic release from masking for speech.Journal of the Acoustical Society of America,55, 678–680.
Repp, B. H., Milburn, C., &Ashkenas, J. (1983). Duplex perception: Confirmation of fusion.Perception & Psychophysics,33, 333–337.
Schwab, E. C. (1981).Auditory and phonetic processing for tone analogs of speech. Unpublished doctoral dissertation, SUNY at Buffalo.
Author information
Authors and Affiliations
Additional information
This research was supported by NICHD Grant HD01994 and BRS Grant RR05596 to Haskins Laboratories. Shlomo Bentin was supported by a stipend from the Jesselson foundation.
Rights and permissions
About this article
Cite this article
Repp, B.H., Bentin, S. Parameters of spectral/temporal fusion in speech perception. Perception & Psychophysics 36, 523–530 (1984). https://doi.org/10.3758/BF03207512
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03207512