Introduction

A number of interesting phenomena can emerge from the collective behaviour of a group of agents interacting with each other that cannot be explained in terms of the dynamics of an individual agent1,2. One phenomenon of extreme importance is the emergence of leadership. Leadership plays a crucial role in determining the success or the failure of a variety of activities both in animals (e.g. protecting the animal group against predatory attacks, locating food resources) and human groups (e.g. steering opinion dynamics, playing music or sport). Many studies exist in the literature addressing coordination and leadership emergence in the case of groups of animals3,4,5,6,7, where the problem is tackled by studying how agents informed about a migration route or food position influence collective behaviour. In this context, results show that, as group size becomes larger, only a small proportion of informed individuals is needed to guide the group behaviour.

In the case of human groups, several studies investigated the problem of emergence of leadership from a psychological or anthropological point of view8,9,10,11. For example, the choice of leading a group may be taken often by individuals who put a high value on decision rights or who desire to control others’ outcomes12,13. On the other hand, the leader role can be seen as a burden and so avoided by individuals acting mostly as followers14. Dominance versus egalitarianism allows us to make different predictions about the personalities of individuals emerging as leaders in formerly leaderless groups. Indeed, dominance-view leadership is expected to correlate with selfish and egoistic traits whereas egalitarianism-view leadership is expected to correlate with pro-social personality traits9,10.

In contrast, limited results are available on leadership emergence in situations where interactions are nonverbal. Examples include the role of a conductor in an orchestra15,16, group behaviour in dance17,18, consensus decision making with informed and non informed individuals19,20,21. However, there has been little quantitative analysis or mathematical modelling of the interaction patterns within a group that yield the phenomenon of leadership22,23.

Here, we study the emergence of leadership in a group of people walking together24,25,26,27. The goal is to propose a rigorous mathematical methodology capable to assess whether leadership emerges and whether it facilitates group coordination and cohesiveness. Understanding leadership emergence in groups of humans walking together can be relevant in many applications. For example, during evacuation from a building, the presence of a leader with global knowledge of the structure can safely guide people towards the exit avoiding fatal consequences28. Emergency situations are often characterised by high complexity, uncertainty, chaos, and ambiguity, thus requiring a coordinated and synchronised collective action. In such situations, leader identification is a crucial task to steer the group towards a desired collective behaviour. By finding, influencing or disconnecting the leaders from the rest of the crowd, efficient containment strategies can be accomplished29. Other applications may be the analysis of pedestrian flows in a shopping centre identifying crowd patterns and cluster formation, or preventing violent behaviour conducted by a small part of fanatic supporters at a football match from spreading among the crowd.

Past studies have observed that, in these situations, some people have a higher probability of becoming leaders according to their attitude or qualities, e.g. ability to handle emergency situations, charisma, confidence, friendliness, effective speaking30. Another interesting attitude that may play an important role in this context is the willingness of few individuals to shoulder responsibility and take common decisions, given the responsibility aversion of the majority14. It has been observed that such a leader is characterised by specific behavioural or physiological traits such as a common tendency to act first31. On the other hand, the spontaneous emergence of leadership may depend on contextual factors, such as an individual’s spatial position in the group6.

To investigate leadership emergence in human groups, we consider the simplest scenario in which four participants are asked to walk together across the room. Our aim is to unveil whether, when no designated roles are assigned, a leader emerges in the group. Specifically, we adopt a time-dependent cross correlation analysis to unfold the leader-follower relationships established within the group by instructing all participants to change their heading direction or speed twice at their will during every trial. We then analyse each trial to identify the individual whose change of locomotion (direction or speed) is readily followed by other group members. That individual is then labelled as the leader on that trial. We further demonstrate that relative position in the group, or a participant’s characteristic locomotion behaviour can influence leadership emergence.

Our findings serve as proof of existence of different types of leadership emerging from walking crowds. Precisely, we identify an “emergent leaderhip” scenario depending on local visual couplings between group members that try to match their speed and heading direction accordingly, confirming the observations made in25,26,32. In addition, our data analysis identifies an “individual leadership” type showing that in some situations a participant’s characteristic locomotion behaviour can be so influential as to dominate the group independent of his/her position.

Results

Two groups of four participants were tested during the experiment, designated Group 1 and Group 2. Before each trial, group members were positioned on the vertices of a square configuration, labelled as FL (front left), FR (front right), BL (back left) and BR (back right) (refer to “Methods”, Fig. 4). To identify an individual participant, we use the notation “Px.y” where x is the group and y is a sequential number identifying a specific participant, for example P1.3 indicates participant 3 in group 1. During each trial, group members were instructed to change direction (turn left/turn right) or speed (speed up/slow down) twice while staying together as they walked across the room. Three different initial inter-personal distances (IPD) between participants were tested: 1, 2 and 4 m. Participants’ positions were shuffled between each trial.

The data were analysed to assess the effect on leadership emergence of (1) spatial position in the group and (2) individual locomotor behaviour. We compared the time series of heading (or speed) for each participant by adapting the time-dependent cross correlation metric proposed in33 to characterise leadership emergence in groups of trawling bats. Briefly, the method quantifies the temporal dependence of turns (or speed changes) between group members, allowing us to compute an individual leadership index for each participant on each trial (for further details and formal definition of the index see “Methods”).

Effect of pedestrian position in the group

Due to the square configuration used during the experiments, we expected one of the pedestrians in the front row to act as the leader because they can influence participants in the back row (i.e. the back row is visually coupled to the front row), but not vice versa.

Figure 1 shows the influence of participant position on the individual leadership index, averaged over all trials, in the two conditions (heading direction and speed). Results indicate that participants in the front left and front right positions have the highest value of the leadership index.

Figure 1
figure 1

Averaged percent leadership evaluated grouping the trials by position. Four bar charts are reported, 2 groups \(\times\) 2 condition (speed and heading direction). Each bar represents the mean value of the leadership grouped by position over the number of trials, whereas black error bars are the standard error of the mean. FL front left, FR front right, BL back left, BR back right.

This is confirmed by a formal statistical analysis. A three-way Mixed ANOVA (2 Conditions \(\times\) 4 Positions \(\times\) 3 IPD) was performed on percent leadership, in which Condition and IPD were treated as “between-subject” factors, Position was treated as a “within-subject” factor, and Group was considered a random effect. The analysis revealed a significant main effect of Position on the measured leadership (\(F(3,117)=19.865, p < 0.05, \eta ^2 = 0.337\)).

The Condition main effect and the Condition \(\times\) Position interaction were not in themselves significant, \(F(1,39)=1.334, p = 0.255, \eta ^2 = 0.033\) and \(F(3,117)=0.196, p = 0.899, \eta ^2 = 0.005\), respectively, indicating a similar influence of position on leadership for heading and speed. Furthermore, neither the IPD main effect, \(F(2,39)=0.133, p = 0.876, \eta ^2 = 0.007\), nor the IPD \(\times\) Position interaction, \(F(6,117)=0.906, p = 0.493, \eta ^2 = 0.044\), were significant, indicating a similar influence at all distances. Finally, the three-way Condition \(\times\) IPD \(\times\) Position interaction was not significant, \(F(6,117)=1.301, p = 0.262, \eta ^2 = 0.063\). Thus, the factor Position alone dominates percent leadership.

Since the factor Position was statistically significant, we performed Bonferroni post hoc test for a pairwise comparison in order to understand in detail what positions influence most leadership emergence. We found that the differences between the front positions and the back positions are statistically significant (\(p < 0.05\)) while the differences between the two positions in the front and between the two in the back are not (\(p=1\)). These results demonstrate a clear contextual effect of position on emergent leadership.

Effect of individual locomotor behaviour

The same data are replotted by participant in Fig. 2, where the mean value and standard error of the mean (SEM) of the individual leadership index are shown for each pedestrian. Notice that some participants have consistently higher indices than the others. A one-way ANOVA (4 Participants) on mean percent leadership was performed separately for each group revealing a main effect only in direction-change trials for Group 2, \(F(3,24) = 4.663, p < 0.05, \eta ^2 = 0.368\).

Figure 2
figure 2

Averaged percent leadership grouping the trials by individual pedestrian. Four bar charts are reported, 2 groups \(\times\) 2 condition (speed and heading direction). Each bar represents the mean value of the leadership grouped by a specific pedestrian over the number of trials independently from his/her position, whereas black error bars are the standard error of the mean.

In these trials, pedestrian P2.4 became the leader independently of his/her position in the group as shown in Fig. 3 (details about leader-follower patterns inferred from the data are reported in Supplementary Info, Supplementary Figs. 1,2, 3 and 4). Pairwise comparisons of participant means using a Bonferroni post-hoc test confirmed that the behaviour of pedestrian P2.4 has a significant influence on the behaviour of the whole group (\(p < 0.05\)). This influence is absent in the speed-change condition.

We attribute the leadership of pedestrian P2.4 to a characteristic behaviour, the tendency to initiate a change by acting first, which is evident in the time series (see Supplementary Info, Supplementary Figs. 1,2, 3 and 4). The question arises, however, as to how this individual could influence other group members from the back row when changing direction, but not when changing speed. This discrepancy may be explained by the observation that on direction-change trials, P2.4 often moved into the peripheral field of view of other group members, whereas on speed-change trials, P2.4 remained directly behind the front row, out of view.

Figure 3
figure 3

Percent leadership of four sample trials having P2.4 in each position. Percent leadership of four sample trials are reported having the pedestrian P2.4 (in purple) in each of position. The x-axis represents the trial, the y-axis the position (Front Left, Front Right, Back Left, Back Right), whereas the z-axis reports the value of leadership in percentage. Each pedestrian is identified by different colour.

Discussion

In this work, we carried out a data-driven experimental investigation of leadership emergence in a group of walking pedestrians. Two groups of four volunteers each were asked to walk across a room together, and to change their heading or speed. We then analysed the acquired data on their movement and speed by defining a novel “individual leadership index” to identify the leading participant and reconstruct leader-follower interactions in the group. This metric is based on time-dependent correlation analysis and was adapted from the indexes33 that were used previously to study the leader-follower behaviour of flying bats. To the best of our knowledge, it is applied here for the first time to a human group in order to predict leader-follower patterns.

Our analysis shows that both contextual factors, such as the position of an individual in the group, as well as personal factors, such as a characteristic locomotor behaviour, contribute significantly to the emergence of a leadership role. In particular, our findings confirm that participants in the front row, to whom those in the back row are visually coupled, are usually elected as leaders via the mechanism of collective motion, in which pedestrians match the heading direction of neighbours in their field of view24. However, we also find that a specific individual assumed a leadership role more than 60% of the time, initiating a change in direction irrespective of their position in the group. By deciding to turn first, this participant exploited the mechanism of collective motion to seize the leadership, a self-appointed leader.

These findings demonstrate that leadership in walking groups is a complex phenomenon that does not only depend on geometric conditions, such as the leader location in the group (emergent leadership), but also on other features related to the characteristics of individual participants (individual leadership); for example, the tendency to initiate movements or changes of direction that when perceived by neighbours propagate to the rest of the group. This observation might be relevant to understand how coordination in walking groups emerges in situations, such as emergency evacuations or some sport scenarios, where verbal communication among group members is limited. Of course, these observations need to be further investigating with additional experiments that are beyond the scope of this paper which is focused on presenting the methodology.

We wish to emphasise that the present methodology reveals information about specific leader-follower relationships within a group and how they change over time, as illustrated in Figs. 5 and 6. Moreover, the method can be generalised to larger pedestrian groups and other scenarios. An understanding of leadership emergence and the factors on which it depends can be extremely useful in emergency situations when it is critical to influence group dynamics, such as guiding a crowd to a safe exit.

Methods

Experiment

Participants

Two groups of 4 participants were tested in separate sessions as part of a larger study. The research protocol was approved by Brown University’s Institutional Review Board, in accordance with the Declaration of Helsinki. Informed consent was obtained from all participants, who were compensated for their time. All participants had normal or corrected vision, and none reported a motor impairment.

Task and procedure

Before each trial, participants were positioned at the vertices of a square configuration marked on the floor (“front left” (FL), “front right” (FR), “back left” (BL), “back right” (BR)), at one of three initial inter-personal distances (IPD of 1, 2, or 4 m) (see Fig. 4). They were instructed to change direction or speed twice at self-selected times as they walked together across the room (about 20m), and to “try to stay together with your neighbours”. There were nine possible instructions: change direction in left/right sequence (LL, RR, LR, or RL), change speed in low/fast sequence (SS, FF, SF, or FS), or no-change control (CC). The experimental design was thus 2 Conditions (heading direction, speed) \(\times\) 4 Sequences \(\times\) 3 IPD (1, 2, 4 m), with one trial in each of the 24 conditions plus three control trials. Given that no effects of IPD were found, data were collapsed across sequence and IPD for analysis, yielding 12 trials per analysed condition.

Figure 4
figure 4

Group configuration. Participants are placed at the vertices of a square (FL, FR, BL, BR) with an initial inter-personal distance (IPD) between them of 1, 2 or 4 m.

At the beginning of each trial, a sequence instruction was given, then the group started walking upon a verbal “ready, begin” command. Before the next trial, participants shuffled their positions in the starting configuration. Each group received one practice trial, followed by a block of 12 heading change trials, and a block of 12 speed change plus 3 control trials, for a total of 27 test trials. Trial order was randomised within blocks, and block order was counterbalanced across the two groups. A test session lasted about 40 min. Further details about each single trial are reported in Supplementary Info, Supplementary Tables 1 and 2.

Data acquisition and analysis

The experiments were carried out at Brown University in a large hall with a 14 m \(\times\) 20 m tracking area marked on the floor. Head position was recorded at 60 Hz with a 16-camera infrared-reflective motion capture system (Qualisys Oqus). Each participant wore a bicycle helmet having 5 markers placed on 30–40 cm stalks in a unique configuration, so that each helmet could be identified.

The time series of head positions of all four pedestrians were recorded in three dimensions, but only two dimensions (xy-plane) were analysed further. Time series were filtered using a forward and backward 4th-order low-pass Butterworth filter to reduce tracker error and gait oscillations, before differentiating to obtain time series of speed and heading. A 1.0 Hz cutoff was used to reduce anterior-posterior oscillations on each step before computing speed, and a 0.6 Hz cutoff for reducing medial-lateral oscillations on each stride before computing heading. Files were truncated to eliminate endpoint error. Leadership was then evaluated by assessing the difference in heading direction at each time between each pair of pedestrians by adapting the time-dependent delayed directional correlation \(C^d\) (see Leadership metrics)33. Summing the percentage of time in which a specific pedestrian is identified as leader, we were able to reconstruct a network representing the influence that a specific pedestrian has on any other. Similarly, leadership in speed was evaluated by adapting the previous algorithm using time-dependent delayed speed correlation \(C^s\) instead.

Leadership metrics

Time-dependent delayed directional correlation (TDDC)

Using time-dependent delayed directional correlation (TDDC), we characterised leadership in the walking group in terms of the heading direction of each individual, spotting whether some of the participants adjusted their own direction to that of the others and exhibited a follower role. Accordingly, a leader is defined as the participant whose heading direction influences the others the most, in the sense that the other participants match their headings with that of the leader after a time delay corresponding to the natural human visual–locomotor delay.

Denoting by \(v_k \in R^2\) the velocity vector of pedestrian k in the plane, we compute:

$$\begin{aligned} h_{ij}(t,\tau ) = \dfrac{v_i(t)}{||v_i(t)||} \dfrac{v_j(t+\tau )}{||v_j(t+\tau )||} \quad \in [-1, 1], \end{aligned}$$
(1)

which represents the scalar product between the heading direction of pedestrian i at time t and that of pedestrian j at delayed time \(t+\tau\).

Formally, TDDC is defined as follows:

$$\begin{aligned} C^d_{ij}\left( t,\tau \right) = \frac{1}{2\omega + 1}\sum _{k=-\omega }^{\omega }h_{ij} \left( t+k\Delta t, \tau \right) , \end{aligned}$$
(2)

where \(\Delta t\) is the sampling time and \(\omega > 0\) is a suitably selected constant. In order to compensate for high-frequency noise and measurement errors, \(h_{ij}(t,\tau )\) is averaged over a symmetric time-window \(2 \omega \Delta t\). As the measured human visual–locomotor delay to the leader’s change in heading is approximately equal to 1000 ms and the sampling time for the collected data is equal to \(\Delta t = \dfrac{1}{60\,\, {\text {Hz}}}\), we have that \(2 \omega \Delta t > 1s\) and so \(\omega\) was set to 40.

To identify the leader, \(C^d_{ij}(t, \tau )\) was evaluated for each pair of pedestrians (ij) at every time instant t along the trial and for different values of delay \(\tau\). If the value of \(\tau _{ij}(t)\) maximising the heading direction correlation is positive then we say that pedestrian i leads pedestrian j at time instant t, or otherwise if that value is negative.

A left-right trial carried out by Group 1 with an initial IPD of 4m is taken as a representative example to illustrate the analysis based on time-dependent directional correlation metric. Figure 5d shows the heat maps characterising values of \(C^d_{i,j}(t, \tau )\) for the whole duration of the trial with different time delay \(\tau\) between each pair (i, j) of pedestrians.

Figure 5
figure 5

Left-right representative trial. (a) Pedestrians’ trajectories in the xy-plane with different colours. Velocity vectors are represented by an arrow each 3 s, the colour goes from black to grey as time increases. (bc) Position along x-axis and velocity along y-axis. Pedestrians are identified by their position (FL, FR, BL, BR). (d) TDDC evaluated for each pair of pedestrians. The x-axis represents time instant t, whereas the y-axis represents different values of the time-delay \(\tau\). Different colours are different values of TDDC, with blue representing \(C^d_{ij}(t,\tau )\le 0.5\) and red representing \(C^d_{ij}(t,\tau )=1\). Dashed white lines represent level curves where \(C^d_{ij}(t,\tau )=0.95\), whereas the dashed black line represents \(\tau (t)=0\). The solid green line on top of the heat map represents the value of \(\tau (t)\) maximising the heading direction correlation between two pedestrians. (e) Percent leadership. Each grey bar refers to a different member in the group. (f) Network reconstruction. Each node represents a pedestrian, whereas the arrow directed from node i to node j indicates the fact that the heading direction of pedestrian i influences that of pedestrian j. The arrow’s width is the strength of such interaction.

Time-dependent delayed speed correlation (TDSC)

To assess leadership roles in speed-change trials, we define:

$$\begin{aligned} \Delta s_{ij}(t, \tau ) = \left| \; ||v_i(t)|| - ||v_j(t+\tau )|| \; \right| \end{aligned}$$
(3)

which represent the difference between the speed of pedestrian i at time t and that of pedestrian j at delayed time \(t+\tau\). The time-dependent speed correlation (TDSC) is defined as:

$$\begin{aligned} C^s_{ij}\left( t,\tau \right) = \frac{1}{2\omega + 1}\sum _{k=-\omega }^{\omega }\Delta s_{ij} \left( t+k\Delta t, \tau \right) , \end{aligned}$$
(4)

where the parameter \(\omega\) is the length of the symmetric window used to average \(s_{ij}(t,\tau )\) in order to compensate for noise and measurements errors as done before. As the measured human visual–locomotor delay to the leader’s change in speed is approximately equal to 500 ms26, we have set \(\omega\) to 20. In this case pedestrian i is said to lead pedestrian j at time t if the value of \(\tau _{ij}(t)\) minimising the speed correlation is positive, or otherwise if it is negative.

A fast–slow trial carried out by Group 1 with an initial IPD of 2m is taken as a representative example to illustrate the analysis of the data based on time-dependent speed correlation metric. Figure 6d shows the heat maps characterising values of \(C^s_{i,j}(t, \tau )\) for the whole duration of the trial with different time delay \(\tau\) between each pair (i, j) of pedestrians.

Figure 6
figure 6

Fast–slow representative trial. (a) Pedestrians’ trajectories in the xy-plane with different colours. Velocity vectors are represented by an arrow each 3 s, the colour goes from black to grey as time increases. (bc) Position along x-axis and velocity along y-axis. Pedestrians are identified by their position (FL, FR, BL, BR). (d) TDSC evaluated for each pair of pedestrians. The x-axis represents time instant t, whereas the y-axis represents different values of the time-delay \(\tau\). Different colours are different values of TDSC, with blue representing \(C^s_{ij}(t,\tau )=0\) and red representing \(C^s_{ij}(t,\tau )\ge 0.5\). Dashed white lines represent level curves where \(C^s_{ij}(t,\tau )=0.05\), whereas the dashed black line represents \(\tau (t)=0\). The solid green line on top of the heat map represents the value of \(\tau (t)\) minimising the speed correlation between two pedestrians. (e) Percent leadership. Each grey bar refers to a different member in the group. (f) Network reconstruction. Each node represents a pedestrian, whereas the arrow directed from node i to node j indicates the fact that the speed of pedestrian i influences that of pedestrian j. The arrow’s width is the strength of such interaction.

Individual leadership index

Knowledge of \(\tau _{ij}(t)\), maximising/minimising the heading direction/speed correlation, for each trial and pair of pedestrians allows us to quantitatively estimate how much a participant acts as a leader in the group. Specifically we define the individual leadership index for each participant as the overall average percentage of time for which the computed \(\tau _{ij}(t)\) is positive (Figs. 5e, 6e).

Network reconstruction

Knowledge of \(\tau _{ij}(t)\) also allows us to infer leader-follower interactions between each pair of group members. Each pedestrian is represented as a node in the network and the influence that each pedestrian has on the others is represented as an edge (an edge directed from node i to node j means that pedestrian j is influenced by pedestrian i). The strength of this influence is captured by the edge weight \(w_{i,j} \in [0,1]\) representing the normalised percentage of time in which pedestrian i leads pedestrian j. The network was reconstructed repeatedly within 5 consecutive time windows in order to evaluate also the evolution of the interactions over time (Figs. 5f, 6f).

The topologies inferred were then refined by means of DPI techniques (data processing inequality) and by a further thresholding to remove false positive links between nodes, as suggested in34. Specifically: i) links whose weights \(w_{i,j}\) are null were discarded from the reconstructed network, and ii) non-zero weights \(w_{i,j}\) were checked among all triplets of connected nodes so that, on the basis of their intensity, one of the triplet can be possibly regarded as a false connection and set to zero (so that the corresponding link was removed). For example, let us consider three nodes i, j and k. If the weight \(w_{i,k}\) between the nodes i and k was lower than \(w_{i,j}\) between the nodes i and j, and was also lower than \(w_{j,k}\) between the nodes j and k, then the weight \(w_{i,k}\) was set to zero and the link between the two nodes i and k was removed.