Chunking as a function of sequence length

Tosatto, Laure; Fagot, Joël; Nemeth, Dezso; Rey, Arnaud

doi:10.1007/s10071-024-01835-z

Chunking as a function of sequence length

Original Paper
Open access
Published: 02 March 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Animal Cognition Aims and scope Submit manuscript

Chunking as a function of sequence length

Download PDF

Laure Tosatto ORCID: orcid.org/0000-0001-9008-9341^1,2,7,
Joël Fagot^1,2,3,8,
Dezso Nemeth^4,5,6 &
…
Arnaud Rey^1,2^nAff8

725 Accesses
3 Altmetric
Explore all metrics

Abstract

Chunking mechanisms are central to several cognitive processes. During the acquisition of visuo-motor sequences, it is commonly reported that these sequences are segmented into chunks leading to more fluid, rapid, and accurate performances. The question of a chunk’s storage capacity has been often investigated but little is known about the dynamics of chunk size evolution relative to sequence length. In two experiments, we studied the dynamics and the evolution of a sequence’s chunking pattern as a function of sequence length in a non-human primate species (Guinea baboons, Papio papio). Using an operant conditioning device, baboons had to point on a touch screen to a moving target. In Experiment 1, they had to produce repeatedly the same sequence of 4 movements during 2000 trials. In Experiment 2, the sequence was composed of 5 movements and was repeated 4000 times. For both lengths, baboons initially produced small chunks that became fewer and longer with practice. Moreover, the dynamics and the evolution of the chunking pattern varied as a function of sequence length. Finally, with extended practice (i.e., more than 2000 trials), we observed that the mean chunk size reached a plateau indicating that there are fundamental limits to chunking processes that also depend on sequence length. These data therefore provide new empirical evidence for understanding the general properties of chunking mechanisms in sequence learning.

Cognitive load theory and educational technology

Article 01 August 2019

No one knows what attention is

Article Open access 05 September 2019

NeuroKit2: A Python toolbox for neurophysiological signal processing

Article 02 February 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

A key mechanism allowing our cognitive system to compress information and increase short term memory capacity is the formation of chunks (Mathy and Feldman 2012; Miller 1956). Chunking is defined as the process of associating and grouping several items together into a single processing unit (Gobet et al. 2001, 2016). Several studies have questioned the maximum number of chunks that can be stored in short-term memory. While Miller (1956) initially proposed that humans have a short-term storage capacity of 7 plus or minus 2 chunks, Cowan (2001) suggested that this capacity might be more limited to a set of approximately four chunks. Other studies were concerned by the number of items that can be stored into a single chunk and have shown that chunks seem to have their own limits regarding storage, seemingly 3 or 4 items per chunk (Allen and Coyne 1988; Chase and Simon 1973; Johnson 1970). Yet, this absolute number of chunk size varies depending on experimental paradigms and factors such as expertise. For example, Gobet and Clarkson (2004) found that chess Masters were able to chunk many pieces of information (up to 15 items). Here, we aim to investigate the size of chunks and their evolution during sequence learning and the effect of sequence length on the chunking pattern.

In the field of perceptual-motor learning, chunking has been considered as the main motor sequence integration mechanism (Diedrichsen and Kornysheva 2015; Wymbs et al. 2012). Motor sequence learning is commonly described as the process by which a sequence of movements is acquired and executed with increased speed and accuracy (Willingham 1998). This process is largely related to the question of chunking as individuals spontaneously parse sequences of movements into chunks corresponding to subparts of the sequence. This process of parsing into chunks becomes clear when studying the pattern of successive response times (RTs) in typical sequential button-press tasks: long temporal gaps between two successive responses are usually observed and are assumed to mark chunk boundaries (Abrahamse et al. 2013; Bottary et al. 2016). The resulting chunking pattern therefore reflects the sequence’s organization in memory (Sakai et al. 2003) and inform us about the length chunks can have.

If many studies report that chunks typically contain 3 or 4 items, sometimes 5 (e.g., Nissen and Bullemer 1987; Sakai et al. 2003; Verwey 1996; Verwey et al. 2002), other studies found much larger chunk sizes of 7 or 8 (e.g., Kennerley et al. 2004). One factor that can explain this heterogeneity of results is practice. Indeed, some studies include a very limited number of repetitions of the same sequence (e.g., only 36 repetitions in Rosenbaum et al. 1983) whereas others are interested in extended practice and include hundreds of trials (e.g., 588 in Verwey 2003). Throughout extended practice, chunks were found to evolve and grow larger as if more compression of information was possible with increasing familiarity with the sequence (e.g., Acuna et al. 2014; Bera et al. 2021; Ramkumar et al. 2016; Wright et al. 2010). These conclusions are not limited to humans and identical results have been obtained in other animals, particularly non-human-primates (e.g., Ramkumar et al. 2016; Terrace 2002; Scarf et al. 2018). Animals too appear to spontaneously chunk sequences and the chunking pattern can evolve through extended practice.

Another factor that may influence chunk size is the length of the sequence. Indeed, temporal gaps between items of the sequence seem to emerge only after sequences of 3 or 4 items (Bo et al. 2009; Verwey and Eikelboom 2003). This suggests that a single chunk can be formed for very short sequences and that as the sequence gets longer, more chunks can emerge. For instance, Verwey (2003) found no segmentation in 2 and 4-item sequences whereas chunking occurred when participants performed 6-item sequences. This experiment however does not specifically study the evolution of chunk sizes in relationship to sequence length.

In a recent study, Tosatto et al. (2022) studied the evolution of chunks during the repeated execution of a single visuo-motor sequence in non-human primates (i.e., Guinea baboons papio papio). Using a serial response time (SRT) task, baboons had to repeatedly produce the same sequence composed of 9 different locations for a thousand trials. Consistent with previous studies, results showed that baboons initially parsed the sequence into small chunks that progressively became fewer and longer throughout the task. Indeed, the average chunk size was initially equal to 2.2 items per chunk and it increased up to 3.38 items per chunk at the end of the experiment, after extended practice. On some occasions, longer chunks of 8 or 9 items were also observed.

This experiment also showed that the evolution of the chunks was governed by two reorganization mechanisms: concatenation (i.e., the process by which two successive chunks are performed more fluidly and the temporal gap between them decreases leading to a single and longer chunk) and recombination (i.e., the emergence of a new segmentation pattern across chunks, such as two chunks of 3 items become a chunk of 4 items followed by a chunk of 2 items). Tosatto et al.’s (2022) study therefore informs us about the relative flexibility of chunks throughout learning, but this study remains limited because using only 9-items sequences does not provide information about the relationship between the initial chunking pattern, its evolution and the length of the sequence.

The aim of the present research is to study the dynamics of chunking for shorter extensively repeated sequences, in comparison to the results obtained for 9-item sequences. We designed two experiments to study the evolution of chunk size for different sequence lengths. In the first experiment, baboons were trained on a single repeated sequence of 4 items for 2000 trials. This specific sequence length was chosen as it is generally accepted in the literature that chunks can store up to 3 or 4 items. Therefore, we expected either no segmentation in producing the sequence, or an initial segmentation of the sequence followed by a progressive increase in chunk size up to 4 items. The second experiment was similar to the first, but baboons were trained on a 5-item sequence for 4000 trials. This larger sequence length was used as a proxy to infer the evolution of chunk size for a sequence length between 9 and 4 items, using the data already collected for these two latter lengths. We also increased the total number of trials to determine if greater extended practice would still lead to a linear increase of the average chunk size.

Experiment 1

Method

Participants

Twenty-five Guinea baboons (Papio papio) from the CNRS primate facility in Rousset (France) were tested in this study. For practical reasons, we stopped the experiment after 17 monkeys completed all scheduled trials (fourteen female and three male, age range 2.8–24.8 years). Water was provided ad libitum during the test, and the monkeys received their normal food ration of fruits every day at 5 PM.

Materials

Apparatus

This experiment was conducted with a computer-learning device based on the voluntary participation of baboons (for details, see Fagot and Bonté 2010). Baboons implanted with a RFID microchip had free access to 10 automatic operant conditioning learning devices equipped with touch screens. Each time a monkey entered a test chamber, it was identified by its microchip and the system resumed the trial list where the subject left it at its previous visit. The experiment was controlled by E-prime (Version 2.0, Psychology Software Tools, Pittsburgh, PA, USA).

Task and stimuli

The screen was divided into nine uniformly spaced predetermined locations represented by white crosses on a black background, virtually labeled as Position 1 to 9 (see Fig. 1A). A trial began with the presentation of a yellow fixation cross at the bottom of the screen. Once pressed, the fixation cross disappeared and the nine white crosses were displayed, one of them being replaced by the target, a red circle. When the target was touched, it was immediately replaced by the cross. The red circle then replaced the next position in the sequence until it was touched, and a new position was displayed. Reward (grains of dry wheat) was provided at the end of a sequence of four touches (see Fig. 1B).

If baboons touched an inappropriate location (incorrect trial) or failed to touch the screen within 5000 ms after the red circle’s appearance (aborted trial), a green screen was displayed for 3000 ms as a marker of failure. Aborted trials were not retained and therefore presented again, while incorrect trials were not. The time elapsed between the appearance of the red circle and the baboon’s touch on this circle was recorded as the response time (RT) for that location in the sequence. To learn the task, baboons initially received random trials that were rewarded after three touches. Then, the number of touches in a trial was increased to four.

Design of the sequences

To control the motor difficulty of the transitions to be produced in the sequence, a random phase of sequence production was first conducted, where thirteen baboons performed random sequences of six positions for 1000 trials. For each of these 13 baboons, we computed all the mean transition times from one location to another, leading to a 9 × 9 matrix of RTs (with no values on the diagonal of the matrix). We then correlated each matrix of each baboon to the matrix of all the other baboons and, on average, the correlation between these matrices was 0.42 (SD = 0.19), indicating that there was a good consistency between the baboons’ performances. This result allowed us to compute an average baseline measure for all possible transitions for the entire group of baboons (see Appendix 1).

Based on these baseline measures, we designed four sequences of four serial positions for which each transition T was numerically faster (or equally fast) to produce on average than the next one (i.e., T1 ≤ T2 ≤ T3). Ideally, all the transitions should be matched to equate each transition for motor difficulty and to study the segmentation/chunking of the sequence. This was not possible for sequences of 9 positions (i.e., in Tosatto et al. 2022) and that is why we constructed the repeated sequences by systematically choosing increasing or equal transition times from the first transition to the last. Therefore, to make this study (with sequences of 9 positions) comparable with the present study with shorter sequences, we adopted the same logic when constructing the sequences. However, we also made sure that there was no significant difference between all the transition times of each sequence so that at the beginning of learning there was no significant difference on any transition (see Supplementary Materials). Appendix 2 provides the details of all the sequences we used, i.e., the sequence itself, the average response times for each transition and the number of monkeys that were presented with each sequence.

Procedure

To neutralize the potential effect of one specific sequence, baboons were exclusively presented with either Sequence 1 (n = 4), 2 (n = 5), 3 (n = 3) or 4 (n = 5) and had to produce their sequence repeatedly for 2000 successive trials. RTs for each position of the sequence were recorded for all the trials.

Results

On average, baboons required 2.82 days (SD = 1.19) to complete the 2000 trials, with a mean of 708.33 trials per day and a mean accuracy level of 99.44% (SD = 1.65). Incorrect trials were removed from the dataset (0.56%). RTs greater than 1000 ms were excluded and an additional recursive trimming procedure excluded RTs greater or smaller than 2.5 standard deviations from the subject’s mean for each of the four possible positions (15%). Note that by not removing any outlier, this does not change the main trends of our results (see Supplementary Materials). RTs for each of the four positions and for the 2000 trials were divided into 20 Blocks of 100 trials.

General sequence learning was estimated by computing on each trial the average of RTs over the four positions in the sequence. For each participant, we averaged these mean RTs for each Block of 100 trials and Fig. 2 represents the evolution of mean RTs for the entire group of monkeys. These values were entered in a repeated measures one-way ANOVA with Block (1–20) as the within factor. The effect of Block was highly significant, F(19, 304) = 21.175, p < 0.001, η² = 0.57. A linear regression also indicates that mean RTs decreased throughout the blocks of trials, F(1, 38) = 255, p < 0.001, Adjusted R² = 0.93, (Block 1, M = 430.43, SD = 38.25; Block 20, M = 340.34, SD = 34.71), suggesting that monkeys learned the sequence.

We adopted the same method as previously used for sequences composed of 9 positions (i.e., Tosatto et al. 2022) to study the chunking pattern of the sequence by monkeys. We considered successive positions A and B to be part of the same chunk as long as the transition time from one position to the next did not correspond to a significant increase in RT, otherwise an AB transition was supposed to mark a chunk boundary (Kennerley et al. 2004). Statistical significance was assessed through paired-sample t-tests for each pair of successive positions (significance threshold is set at 0.01 to correct for multiple comparisons^{Footnote 1}). Each time the RT of a pair's second position was significantly higher than the first position, it marked a chunk boundary. This analysis was applied on the mean RTs obtained at each position, for each Block of 100 trials and for each monkey (see Fig. 3 for an illustration of this procedure for one monkey).

With this method, we were able to quantify the number of chunks and their average size produced on each block by each monkey. Two linear regressions were conducted to test the effect of Block on the mean number of chunks and the mean chunk size respectively. These analyses revealed that the number of chunks significantly decreased across blocks (Block 1, M = 2.18, SD = 0.45; Block 20, M = 1.47, SD = 0.32; F(1, 18) = 98.6, p < 0.001, Adjusted R² = 0.84) and that chunk size significantly increased across blocks (Block 1, M = 2.12, SD = 0.59; Block 20, M = 3.06, SD = 0.64; F(1, 18) = 83.4, p < 0.001, Adjusted R² = 0.81). Note that we get exactly the same linear trends if we change the block size by taking 40 blocks of 50 trials or 10 blocks of 200 trials (see Supplementary Materials).

Additionally, for the average chunk size, we combined the present data on sequences of 4 items with the data collected in Tosatto et al. (2022) on sequences of 9 items to conduct a multiple regression analysis testing the effect of Block (1–20), Length (4 or 9) and the interaction of these two factors (see Fig. 4 for a representation of these data). This analysis revealed an effect of Block, Length and a significant interaction between these predictors (F(3, 26) = 34.8, p < 0.001, Adjusted R² = 0.78). The individual predictors showed no main effect of Block (t = − 0.01, p = 0.98) and no main effect of Length (t = 1.04, p = 0.31), but a significant interaction indicating that the increase of chunk size across blocks differs between 4-items and 9-items sequences, (t = 3.12, p = 0.004).

Finally, we studied the two reorganization mechanisms reported in Tosatto et al. (2022). We found that 52.94% of the reorganizations were concatenations (which were observed in all monkeys) and 47.06% of the reorganizations were recombinations (which were observed in 15 monkeys). Table 1 provides the total number of concatenations and recombinations obtained for each block and for all monkeys in Experiment 1. A repeated measure ANOVA with Block and Mechanisms (concatenation vs. recombination) did not reveal any significant difference between these two reorganization mechanisms (all ps > 0.05).

Table 1 Total number of concatenations and recombinations per block for Experiment 1

Full size table

Discussion

Two main findings were obtained in the present study. First, we confirmed the results of a previous experiment led on the evolution of chunks while learning a 9-item visuo-motor sequence. As it was the case with a longer sequence, non-human primates learn 4-item sequences by segmenting the sequence into small chunks and, with extended practice, these chunks become longer and fewer. Second, this decrease in the number of chunks and this increase in chunks’ size is due to two types of reorganizations: the recombination of several preexisting chunks and the concatenation of two distinct chunks into one.

It is interesting to note that the final mean chunk size after producing the sequence 1000 times is only 2.51 (CI [1.88, 3.14]; Min = 1; Max = 4), and that after 2000 trials, it is still different from 4 (Mean = 3.06; CI [2.42, 3.7]; Min = 1; Max = 4), indicating that baboons continue, on average, to segment this short sequence in several chunks. This is even more interesting considering that the mean chunk size for 9-item sequences was 3.38 (CI [2.85, 3.91]; Min = 1; Max = 8) after 1000 trials, indicating that chunk size varies with the length of the sequence. Chunking processes therefore seem to operate in interaction with the size of the sequence.

To further explore this interaction, we used the linear regressions presented in Fig. 4 to extrapolate the slope of the linear regression that should be obtained for a sequence length between 4 and 9. Indeed, it is possible to use regression models as predictive models as illustrated in Eq. (1):

$$y = \beta_{0} + \beta_{1} x_{1} + \beta_{2} x_{2} + \beta_{1.2} x_{1.2}$$

(1)

Here, the linear regression predicts the mean chunk size y as a function of the intercept β₀, the slope coefficient β₁ at block x₁, the slope coefficient β₂ for a sequence of length x₂ and the interaction effect β_1.2 between block and length. Using this formula, we can replace x₂ by a constant C = 5 to model the predicted evolution of the mean chunk size across blocks for a sequence of 5 items. The resulting predictive line is represented on Fig. 4. According to this model, a mean chunk size greater than 4 (4.89) should be observed after 4000 trials for sequences of 5-items. This indicates that large chunks could only be formed after greater extended practice. Experiment 2 was designed to assess the predictive power of that model and test the hypothesis that the relationship between block, sequence length and chunk size is linear.