Abstract
When humans talk without conventionalized arrangements, they engage in conversation—that is, a continuous and largely nonsimultaneous exchange in which speakers take turns. Turn-taking is ubiquitous in conversation and is the normal case against which alternatives, such as interruptions, are treated as violations that warrant repair. Furthermore, turn-taking involves highly coordinated timing, including a cyclic rise and fall in the probability of initiating speech during brief silences, and involves the notable rarity, especially in two-party conversations, of two speakers’ breaking a silence at once. These phenomena, reported by conversation analysts, have been neglected by cognitive psychologists, and to date there has been no adequate cognitive explanation. Here, we propose that, during conversation, endogenous oscillators in the brains of the speaker and the listeners become mutually entrained, on the basis of the speaker’s rate of syllable production. This entrained cyclic pattern governs the potential for initiating speech at any given instant for the speaker and also for the listeners (as potential next speakers). Furthermore, the readiness functions of the listeners are counterphased with that of the speaker, minimizing the likelihood of simultaneous starts by a listener and the previous speaker. This mutual entrainment continues for a brief period when the speech stream ceases, accounting for the cyclic property of silences. This model not only captures the timing phenomena observed in the literature on conversation analysis, but also converges with findings from the literatures on phoneme timing, syllable organization, and interpersonal coordination.
Article PDF
Similar content being viewed by others
References
Ackermann, H., Gräber, S., Hertrich, I., &Daum, I. (1997). Categorical speech perception in cerebellar disorders.Brain & Language,60, 323–331.
Ackermann, H., Mathiak, K., &Ivry, R. B. (2004). Temporal organization of “internal speech” as a basis for cerebellar modulation of cognitive functions.Behavioral & Cognitive Neuroscience Reviews,3, 14–22.
Allen, J. S., Miller, J. L., &DeSteno, D. (2003). Individual talker differences in voice-onset-time.Journal of the Acoustical Society of America,113, 544–552.
Bavelas, J. B., Chovil, N., Coates, L., &Roe, L. (1995). Gestures specialized for dialogue.Personality & Social Psychology Bulletin,21, 394–405.
Beattie, G. W. (1979). Contextual constraints on the floor- apportionment function of speaker-gaze in dyadic conversations.British Journal of Social & Clinical Psychology,18, 391–392.
Beattie, G. W., Cutler, A., &Pearson, M. (1982). Why is Mrs. Thatcher interrupted so often?Nature,300, 744–747.
Beebe, B., Alson, D., Jaffe, J., Feldstein, S., &Crown, C. (1988). Vocal congruence in mother-infant play.Journal of Psycholinguistic Research,17, 245–259.
Bloom, K., Russell, A., &Wassenberg, K. (1987). Turn taking affects the quality of infant vocalizations.Journal of Child Language,14, 211–227.
Brown, G. D. A., Preece, T., &Hulme, C. (2000). Oscillator-based memory for serial order.Psychological Review,107, 127–183.
Burgess, N., &Hitch, G. (1992). Towards a network model of the articulatory loop.Journal of Memory & Language,31, 429–460.
Burgess, N., &Hitch, G. (1999). Memory for serial order: A network model of the phonological loop and its timing.Psychological Review,106, 551–581.
Burle, B., &Bonnet, M. (1999). What’s an internal clock for? From temporal information processing to temporal processing of information.Behavioral Processes,45, 59–72.
Buzsáki, G., &Draguhn, A. (2004). Neuronal oscillations in cortical networks.Science,304, 1926–1929.
Caspers, J. (1998). Who’s next? The melodic marking of question vs. continuation in Dutch.Language & Speech,41, 375–398.
Caspers, J. (2003). Local speech melody as a limiting factor in the turntaking system in Dutch.Journal of Phonetics,31, 251–276.
Clark, H. H., &Fox Tree, J. E. (2002). Using uh and um in spontaneous speaking.Cognition,84, 73–111.
Coates, J. (1994). No gap, lots of overlap: Turn-taking patterns in the talk of women friends. In D. Graddol, J. Maybin, & B. Stierer (Eds.),Researching language and literacy in social context (pp. 177–192). Philadelphia: Multilingual Matters.
Coates, J. (1997). The construction of a collaborative floor in women’s friendly talk. In T. Givón (Ed.),Conversation: Cognitive, communicative and social perspectives (pp. 55–89). Philadelphia: Benjamins.
Coates, J., &Sutton-Spence, R. (2001). Turn-taking patterns in deaf conversation.Journal of Sociolinguistics,5, 507–529.
Craig, H. K., &Gallagher, T. M. (1982). Gaze and proximity as turn regulators within three-party and two-party child conversations.Journal of Speech & Hearing Research,25, 65–75.
Crown, C. L., Feldstein, S., Jasnow, M. D., Beebe, B., &Jaffe, J. (2002). The cross-modal coordination of interpersonal timing: Sixweek-olds infants’ gaze with adults’ vocal behavior.Journal of Psycholinguistic Research,31, 1–23.
Cutler, A. (1991). Linguistic rhythm and speech segmentation. In J. Sundberg, L. Nord, & R. Carlson (Eds.),Music, language, speech and brain (pp. 157–166). London: Macmillan.
Dittmann, A. T., &Llewellyn, L. G. (1968). Relationship between vocalizations and head nods as listener responses.Journal of Personality & Social Psychology,9, 79–84.
Duncan, S., &Niederehe, G. (1974). On signaling that it’s your turn to speak.Journal of Experimental Social Psychology,10, 234–247.
D’Urso, V., &Zammuner, V. (1990). The perception of pause in question-answer pairs.Bulletin of the Psychonomic Society,28, 41–43.
Elias, G., Hayes, A., &Broerse, J. (1986). Maternal control of covocalization and inter-speaker silences in mother-infant vocal engagements.Journal of Child Psychology & Psychiatry,27, 409–415.
Ford, C. E., &Thompson, S. A. (1996). Interactional units in conversation: Syntactic, intonational, and pragmatic resources for the management of turns. In E. Ochs, E. A. Schegloff, & S. A. Thompson (Eds.),Interaction and grammar (pp. 134–184). New York: Cambridge University Press.
Fox Tree, J. E. (2000). Coordinating spontaneous talk. In L. Wheeldon (Ed.),Aspects of language production (pp. 375–406). Philadelphia: Taylor & Francis.
Fox Tree, J. E. (2002). Interpreting pauses and ums at turn exchanges.Discourse Processes,34, 37–55.
French, P., &Local, J. (1983). Turn-competitive incomings.Journal of Pragmatics,7, 17–38.
Hafez, O. M. (1991). Turn-taking in Egyptian Arabic: Spontaneous speech vs. drama dialogue.Journal of Pragmatics,15, 59–81.
Harrigan, J. A. (1985). Listeners’ body movements and speaking turns.Communication Research,12, 233–250.
Harris, H. D. (2002). Holographic reduced representations for oscillator recall: A model of phonological production. In W. D. Gray & C. D. Schunn (Eds.),The proceedings of the 24th Annual Meeting of the Cognitive Science Society (pp. 423–428). Hillsdale, NJ: Erlbaum.
Hartley, T. (2002). Syllabic phase: A bottom-up representation of the structure of speech. In J. A. Bullinaria & W. Lowe (Eds.),Connectionist models of cognition and perception: Proceedings of the Seventh Neural Computation and Psychology Workshop (pp. 277–288). Singapore: World Scientific.
Hartley, T., &Houghton, G. (1996). A linguistically constrained model of auditory verbal short-term memory.Journal of Memory & Language,35, 1–31.
Heritage, J. (1984).Garfinkel and ethnomethodology. Cambridge: Polity.
Holmlund, C. (1995). Development of turntakings as a sensorimotor process in the first 3 months: A sequential analysis. In K. E. Nelson & Z. Reger (Eds.),Children’s language (Vol. 8, pp. 41–64). Hillsdale, NJ: Erlbaum.
Ivry, R. B., &Gopal, H. S. (1992). Speech production and perception in patients with cerebellar lesions. In D. E. Meyer & S. Kornblum (Eds.),Synergies in experimental psychology, artificial intelligence and cognitive neuroscience (pp. 771–802). Hillsdale, NJ: Erlbaum.
Izdebski, K., &Shipp, T. (1978). Minimal reaction times for phonatory initiation.Journal of Speech & Hearing Research,21, 638–651.
Jasnow, M., &Feldstein, S. (1986). Adult-like temporal characteristics of mother-infant vocal interactions.Child Development,57, 754–761.
Jefferson, G. (1973). A case of precision timing in ordinary conversation: Overlapped tag-positioned address terms in closing sequences.Semiotica,9, 47–96.
Jefferson, G. (1984). Notes on some orderliness of overlap onset. In V. D’Urso & P. Leonardi (Eds.),Discourse analysis and natural rhetorics (pp. 11–38). Padova: CLEUP.
Jefferson, G. (1989). Preliminary notes on a possible metric which provides for a “standard maximum” silence of approximately one second in conversation. In D. Roger & P. Bull (Eds.),Conversation: An interdisciplinary perspective (pp. 166–196). Clevedon: Multilingual Matters.
Jungers, M. K., Palmer, C., &Speer, S. R. (2002). Time after time: The coordinating influence of tempo in music and speech.Cognitive Processing,1-2, 21–35.
Keller, H., Schölmerich, A., &Eibl-Eibesfeldt, I. (1988). Communication patterns in adult-infant interactions in Western and non-Western cultures.Journal of Cross-Cultural Psychology,19, 427–445.
Kent, R. D., Kent, J. F., Rosenbek, J. C., Vorperian, H. K., &Weismer, G. (1997). A speaking task analysis of the dysarthria in cerebellar disease.Folia Phoniatrica et Logopædica,49, 63–82.
Kjaerbeck, S. (1998). The organization of discourse units in Mexican and Danish business negotiations.Journal of Pragmatics,30, 347–362.
Koiso, H., Horiuchi, Y., Tutiya, S., Ichikawa, A., &Den, Y. (1998). An analysis of turn-taking and backchannels based on prosodic and syntactic features in Japanese map task dialogs.Language & Speech,41, 295–321.
Körner, E., Gewaltig, M.-O., Körner, U., Richter, A., &Rodemann, T. (1999). A model of computation in neocortical architecture.Neural Networks,12, 989–1005.
Kuriki, S., Mori, T., &Hirata, Y. (1999). Motor planning center for speech articulation in the normal human brain.NeuroReport,10, 765–769.
La France, M. (1974). Nonverbal cues to conversational turn taking between black speakers.Personality & Social Psychology Bulletin,1, 240–242.
Large, E. W., &Jones, M. R. (1999). The dynamics of attending: How people track time-varying events.Psychological Review,106, 119–159.
Large, E. W., &Palmer, C. (2002). Perceiving temporal regularity in music.Cognitive Science,26, 1–37.
Lerner, G. H. (1991). On the syntax of sentences-in-progress.Language in Society,20, 441–458.
Lerner, G. H. (1996). On the “semi-permeable” character of grammatical units in conversation: Conditional entry into the turn space of another speaker. In E. Ochs, E. A. Schegloff, & S. A. Thompson (Eds.),Interaction and grammar (pp. 238–276). Cambridge: Cambridge University Press.
Lerner, G. H. (2002). Turn-sharing: The choral co-production of talkin-interaction. In C. E. Ford, B. A. Fox, & S. A. Thompson (Eds.),The language of turn and sequence (pp. 225–256). New York: Oxford University Press.
Lerner, G. H., &Takagi, T. (1999). On the place of linguistic resources in the organization of talk-in-interaction: A co-investigation of English and Japanese grammatical practices.Journal of Pragmatics,31, 49–75.
Levinson, S. C. (1983).Pragmatics. New York: Cambridge University Press.
MacNeilage, P. F. (1998). The frame/content theory of evolution of speech production.Behavioral & Brain Sciences,21, 499–546.
MacNeilage, P. F., Davis, B. L., Kinney, A., &Matyear, C. L. (2000). The motor core of speech: A comparison of serial organization patterns in infants and languages.Child Development,71, 153–163.
Masataka, N. (1993). Effects of contingent and noncontingent maternal stimulation on the vocal behaviour of three- to four-month-old Japanese infants.Journal of Child Language,20, 303–312.
Masataka, N. (1995). The relation between index-finger extension and the acoustic quality of cooing in three-month-old infants.Journal of Child Language,22, 247–257.
McFarland, D. H. (2001). Respiratory markers of conversational interaction.Journal of Speech, Language, & Hearing Research,44, 128–143.
Mesch, J. (2000). Tactile Swedish Sign Language: Turn taking in signed conversations of people who are deaf and blind. In M. Metzger (Ed.),Bilingualism and identity in deaf communities (pp. 187–203). Washington, DC: Gallaudet University Press.
Mesch, J. (2001).Tactile sign language: Turn taking and questions in signed conversations of deaf-blind people. Hamburg, Germany: Signum.
Miura, I. (1993). Switching pauses in adult-adult and child-child turn takings: An initial study.Journal of Psycholinguistic Research,22, 383–395.
Murata, K. (1994). Intrusive or co-operative? A cross-cultural study of interruption.Journal of Pragmatics,21, 385–400.
Penttonen, M., &Buzsáki, G. (2003). Natural logarithmic relationship between brain oscillators.Thalamus & Related Systems,2, 145–152.
Povel, D.-J. (1981). Internal representations of simple temporal patterns.Journal of Experimental Psychology: Human Perception & Performance,7, 3–18.
Povel, D.-J., &Essens, P. J. (1985). Perception of temporal patterns.Music Perception,2, 411–440.
Robbins, O., Devoe, S., &Wiener, M. (1978). Social patterns of turntaking: Nonverbal regulators.Journal of Communication,28, 38–46.
Rutter, D. R., &Durkin, K. (1987). Turn-taking in mother-infant interaction: An examination of vocalizations and gaze.Developmental Psychology,23, 54–61.
Sacks, H., Schegloff, E. A., &Jefferson, G. (1974). A simplest systematics for the organization of turn-taking for conversation.Language,50, 696–735.
Schaffer, D. (1983). The role of intonation as a cue to turn taking in conversation.Journal of Phonetics,11, 243–257.
Schegloff, E. A. (1982). Discourse as an interactional achievement: Some uses of “uh huh” and other things that come between sentences. In D. Tannen (Ed.),Analyzing discourse: Text and talk (pp. 71–93). Washington, DC: Georgetown University Press.
Schegloff, E. A. (2000). Overlapping talk and the organization of turntaking for conversation.Language in Society,29, 1–63.
Sellen, A. J. (1995). Remote conversations: The effect of mediating talk with technology.Human-Computer Interaction,10, 401–444.
Shea, S. A., Walter, J., Pelley, C., Murphy, K., &Guz, A. (1987). The effect of visual and auditory stimuli upon resting ventilation in man.Respiration Physiology,68, 345–357.
Shockley, K., Santana, M.-V., &Fowler, C. A. (2003). Mutual interpersonal postural constraints are involved in cooperative conversation.Journal of Experimental Psychology: Human Perception & Performance,29, 326–332.
Sidnell, J. (2001). Conversational turn-taking in a Caribbean English Creole.Journal of Pragmatics,33, 1263–1290.
Stephens, J., &Beattie, G. (1986a). On judging the ends of speaker turns in conversation.Journal of Language & Social Psychology,5, 119–134.
Stephens, J., &Beattie, G. (1986b). Turn-taking on the telephone: Textual features which distinguish turn-final and turn-medial utterances.Journal of Language & Social Psychology,5, 211–222.
Stevenson, M. B., Ver Hoeve, J. N., Roach, M. A., &Leavitt, L. A. (1986). The beginnings of conversation: Early patterns of mother-infant vocal responsiveness.Infant Behavior & Development,9, 423–440.
Streeck, J. (1996). A little Ilokano grammar as it appears in interaction.Journal of Pragmatics,26, 189–213.
Street, R. L. (1984). Speech convergence and speech evaluation in factfinding interviews.Human Communication Research,11, 139–169.
Strogatz, S. (2003).Sync: The emerging science of spontaneous order. New York: Hyperion.
Tanaka, H. (2000a). The particle ne as a turn-management device in Japanese conversation.Journal of Pragmatics,32, 1135–1176.
Tanaka, H. (2000b). Turn projection in Japanese talk-in-interaction.Research on Language & Social Interaction,33, 1–38.
Thomas, A. P., &Bull, P. (1981). The role of pre-speech posture change in dyadic interaction.British Journal of Social Psychology,20, 105–111.
Trimboli, C., &Walker, M. B. (1984). Switching pauses in cooperative and competitive conversations.Journal of Experimental Social Psychology,20, 297–311.
Vousden, J. I., Brown, G. D. A., &Harley, T. A. (2000). Serial control of phonology in speech production: A hierarchical model.Cognitive Psychology,41, 101–175.
Walker, M. B. (1982). Smooth transitions in conversational turn- taking: Implications for theory.Journal of Psychology,110, 31–37.
Walker, M. B., &Trimboli, C. (1984). The role of nonverbal signals in co-ordinating speaking turns.Journal of Language & Social Psychology,3, 257–272.
Ward, L. M. (2003). Synchronous neural oscillations and cognitive processes.Trends in Cognitive Sciences,7, 553–559.
Warner, N., &Arai, T. (2001). The role of the mora in the timing of spontaneous Japanese speech.Journal of the Acoustical Society of America,109, 1144–1156.
Wayland, S. C., Miller, J. L., &Volaitis, L. E. (1994). The influence of sentential speaking rate on the internal structure of phonetic categories.Journal of the Acoustical Society of America,5, 2694–2701.
Wells, B., &MacFarlane, S. (1998). Prosody as an interactional resource: Turn-projection and overlap.Language & Speech,41, 265–294.
Wilson, T. P., Wiemann, J. M., &Zimmerman, D. H. (1984). Models of turn-taking in conversational interaction.Journal of Language & Social Psychology,3, 159–180.
Wilson, T. P., &Zimmerman, D. H. (1986). The structure of silence between turns in two-party conversation.Discourse Processes,9, 375–390.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was partially supported by the Max Planck Institute for Human Cognition and Neuroscience, Munich.
Rights and permissions
About this article
Cite this article
Wilson, M., Wilson, T.P. An oscillator model of the timing of turn-taking. Psychonomic Bulletin & Review 12, 957–968 (2005). https://doi.org/10.3758/BF03206432
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.3758/BF03206432