A Bayesian actor-oriented multilevel relational event model with hypothesis testing procedures

Vieira, Fabio; Leenders, Roger; McFarland, Daniel; Mulder, Joris

doi:10.1007/s41237-023-00203-4

A Bayesian actor-oriented multilevel relational event model with hypothesis testing procedures

Original Paper
Open access
Published: 17 July 2023

Volume 51, pages 37–74, (2024)
Cite this article

Download PDF

You have full access to this open access article

Behaviormetrika Aims and scope Submit manuscript

A Bayesian actor-oriented multilevel relational event model with hypothesis testing procedures

Download PDF

Fabio Vieira ORCID: orcid.org/0000-0003-4253-1953¹,
Roger Leenders^2,3,
Daniel McFarland⁴ &
…
Joris Mulder¹

1116 Accesses
3 Citations
Explore all metrics

This article has been updated

Abstract

Relational event network data are becoming increasingly available. Consequently, statistical models for such data have also surfaced. These models mainly focus on the analysis of single networks; while in many applications, multiple independent event sequences are observed, which are likely to display similar social interaction dynamics. Furthermore, statistical methods for testing hypotheses about social interaction behavior are underdeveloped. Therefore, the contribution of the current paper is twofold. First, we present a multilevel extension of the dynamic actor-oriented model, which allows researchers to model sender and receiver processes separately. The multilevel formulation enables principled probabilistic borrowing of information across networks to accurately estimate drivers of social dynamics. Second, a flexible methodology is proposed to test hypotheses about common and heterogeneous social interaction drivers across relational event sequences. Social interaction data between children and teachers in classrooms are used to showcase the methodology.

Discovering trends of social interaction behavior over time: An introduction to relational event modeling

Article Open access 10 May 2022

Nodal Heterogeneity can Induce Ghost Triadic Effects in Relational Event Models

Article 06 March 2024

Social Networks and Causal Inference

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Current technological advancements, combined with constant development of new communication applications, has originated huge amounts of data on social interactions (Eagle and Pentland 2003). This has enabled researchers to build and test increasingly rich and complex models of human interaction using time-stamped data. Social network analysis is among the fields that have benefited the most from having access to rich temporal interaction data. In this paper, we specifically focus on the Relational Event network model (Butts and Marcum 2017; Stadtfeld and Block 2017; Mulder and Leenders 2019)

A relational event network is comprised of multiple interactions among a finite set of actors. Observation units are called “relational events", which are defined as discrete instances of interactions among social entities along a timescale (Butts and Marcum 2017). In a directed network, events contain a clear indication of sender and receiver. Thus, every observation unit displays information on which actor was the sender, which actor was the receiver, and at which point in time the interaction occurred. The stream of events in a given network is often referred to as a relational event sequence.

One of the traditional approaches to social network analysis is based on Markovian random graphs theory and involves the aggregation of events into a graph (Van Der Hofstad 2009). Transitions on the graph structure are then modeled via sufficient statistics, which are functions of the observed network ((Frank and Strauss , 1986; Hanneke et al. , 2010; Lusher et al. , 2013)). A different approach was taken by Butts (2008), who employed survival analysis concepts and introduced the relational event model which profits from the temporal structure of relational events. In this framework, the main goal is to model the rates of communication between senders and receivers via a log-linear function, without requiring the data to be aggregated. This is done by employing sufficient statistics that capture important social patterns and actor-specific covariates.

Since then, the relational event model has gained increasing popularity and received multiple extensions. For instance, Vu et al. (2011) introduced a model with time-varying parameters, based on the additive Aalen model (Aalen 1989), and developed an algorithm for online inference in social networks. Perry and Wolfe (2013) used a partial likelihood approach to modeling who the receiver will be a given sender. Vu et al. (2015) implemented case–control sampling to decrease the number of computations in estimating relational event models. Later, the control–case sampling approach was further explored by Lerner and Lomi (2020) to estimate relational event models in large networks. Stadtfeld et al. (2017a) and Stadtfeld and Block (2017) introduced a two-step model, where the first step consists of modeling the activity rate of the sender and the second one features the choice of the receiver conditional on the sender. Mulder and Leenders (2019) proposed a way to analyze the temporal evolution of effects in the social network, by estimating effects in different subsets of the data determined by overlapping intervals, which they called moving windows, this approach was further developed by Meijerink et al. (2022).

These contributions have mainly focused on the statistical analysis of a single relational event sequence (with a notable exception of DuBois et al. (2013)); while in practice, we also often observe multiple independent relational event sequences which show similar but not identical social interaction behavior. For example, Blonder and Dornhaus (2011) analyses information flow in multiple relational event networks of ant colonies, DuBois et al. (2013) studies several relational event sequences of high-school students interactions and Kiti et al. (2016) examines social contact in numerous networks of Kenyan households. The gold standard for analyzing such clustered (or hierarchical) data is using a multilevel or mixed effects approach. In a relational event modeling approach, this implies that a probability distribution is specified for the network effects (or drivers) of social interaction behavior (such as inertia, reciprocity, transitivity, or group effects) across clusters. A multilevel approach will result in ‘pooled’ estimates of the cluster-specific network effects where information about social interaction behavior in other clusters is borrowed to improve the estimation (Gelman and Hill 2006). We achieve this by modeling the relational event sequences in a Bayesian hierarchical framework via Markov chain Monte Carlo (MCMC) methods.

Therefore, our first contribution is a multilevel extension for the dynamic actor-oriented model (Stadtfeld et al. 2017a) for independent relational event sequences. In this framework, a relational event between a sender and a receiver in a network is modeled by separately modeling (i) when an actor decides to initiate an event as a sender given the past history, and (ii) modeling the receiver that is chosen given the sender and the past event history between the actors. Thereby, this approach differs from the dyadic approach of Butts (2008), and the hierarchical extension by DuBois et al. (2013), where the time, sender, and receiver are jointly modeled using the same set of parameters (Stadtfeld et al. 2017b). The actor-oriented approach is therefore more flexible by separately modeling the behavior of actors to become a sender and the behavior of senders to choose a receiver in contrast to the dyadic approach. A second important difference is that we explicitly model the full unstructured covariance matrix of the network effects across sequences. Thereby, the dependency between interaction behavior across sequences, such as inertia and reciprocity (which is generally nonzero), is included in the analysis resulting in improved estimation using the concept of pooling. A final important difference between a dyadic relational event model and an actor-oriented approach is the computational burden, which is considerably larger in the case of a dyadic approach. This can be explained by the size of the risk set (i.e., the possible events that can be observed at every step), which is equal to $N(N-1)$ in the case of a network consisting of N actors where $N(N-1)$ dyads are at risk for each event, while the risk set of the sender model in an actor-oriented model consists of N actors, and the risk set of the receiver model consists of $N-1$ actors (when assuming that the sender cannot be equal to the receiver). In the case of moderate to large networks, this difference can have a huge impact on the computational costs when using a dyadic approach.

Our second contribution is an extensive set of statistical tests using the Bayes factor (Jeffreys 1961; Kass and Raftery 1995) for evaluating hypotheses about social interaction behavior in the case of independent relational event sequences. The first test can be used to assess whether specific interaction behavior, such as inertia (which quantifies the degree of habitual behavior between actors to keep sending messages of one actor towards another actor), is equal or differs across sequences (i.e., whether it should be modeled as a fixed effect or as a random effect). In the case of different interaction behavior across sequences, a second test is proposed to assess the degree of heterogeneity (or variability) of network effects on social interaction behavior, e.g., to assess whether inertia behavior varies more across clusters than reciprocity, or whether the impact of a group effect (e.g., whether the sender is the teacher in a class room or not) on social interaction dynamics is more or less heterogeneous than another group effect (e.g., whether the sender is male or female). Third, a test is proposed for evaluating hypotheses with equality and order constraints on the average relative importance across clusters (i.e., on the fixed effects) of drivers on social interaction dynamics. The methodology is flexible to also allow for testing the absolute values of network effects, which is useful when the exact sign is unknown before observing the data. These tests build on and further extends recent developments on Bayes factor hypothesis testing (Mulder et al. 2021).

Our methods have been implemented in R (R Core Team 2017), interfacing with Stan through the rstan package (Stan Development Team 2018). We provide the full code to facilitate the fitting of other relational event models (https://github.com/Fabio-Vieira/bayesian_dynamic_network).

The remainder of this paper is divided as follows: in Sect. 2, we describe the relational events framework and present the hierarchical actor-oriented relational event model. This Section also introduces the Bayesian specification of the model by presenting prior distributions and briefly discussing a reparametrization that allows more efficient sampling; we detail the hypothesis testing methods in Sect. 3 and conduct the empirical application in Sect. 4. The paper ends with a discussion in Sect. 5, where we go through some limitations of the model and possible routes for future research.

2 The relational event framework

The relational event model (REM) is used to model the rate of interactions in dynamic social networks (Butts 2008). In this framework, events happen among actors at particular points in time, being represented by tuples in the form $e_t = (s, r, t)$, where s is the sender, r is the receiver, and t is the time point of the interaction. Thus $e_t$ is called a relational event. Therefore, we have K clusters (in this paper we will use cluster, groups, relational event sequence and network interchangeably), each of which with $N_1, \dots , N_K$ social actors allowed to be senders or receivers at any given time. At time t, in cluster k, sender $s \in \{1, 2, \dots , N_{k} \}$ interacts with receiver $r \in \{1, 2, \dots , N_{k}\ |\ r \ne s \}$, forming a dyad $(s, r) \in {\mathcal {R}}_{k}(t)$, where ${\mathcal {R}}_{k}(t) = \big \{(s, r): s, r \in \{1, 2, \dots , N_{k}\},\ s \ne r \big \}$ is called the risk set, which comprises all possible dyads (s, r) at a particular point in time.

The main idea consists of assuming that, for each cluster $k,\ \text {for}\ k = 1, \dots , K$, we are able to observe an ordered sequence of $M_{k}$ dyadic events among $N_{k}$ individuals on the time window $[0, \tau _{k}) \in \mathrm{I\!R}^+$. Thus, a relational event sequence for cluster k is formally defined as

$$\begin{aligned} {{\textbf {E}}}_{k} = \{e_{t_{m}} = (s_{m}, r_{m}, t_{m}):\ (s_{m}, r_{m}) \in {\mathcal {R}}_{k}(t_{m}),\ 0< t_1< \dots< t_{M_k} < \tau _{k}\}, \end{aligned}$$

where $t_m$ is the time at which the mth event occurred, in cluster k. Then, following Butts (2008), who borrows concepts from survival analysis, the relational event history ${{\textbf {E}}}_{k}$ is modeled as a stochastic process, where the rate of events from sender s to receiver r, with $(s, r) \in {\mathcal {R}}_{k}(t)$, is given by the intensity $\lambda _{s r} \big (t | {{\textbf {E}}}_{k} \big )$. This intensity function has the form of a Cox proportional hazards model (Cox 1972). Moreover, the intensity is assumed constant between subsequent events and the waiting times conditionally exponentially distributed. This amounts to the well-known piece-wise constant exponential model for survival data (Friedman 1982). Thus, the survival function is given by $S_{s r} \big (t_{m} - t_{m-1} | {{\textbf {E}}}_{k} \big ) = \exp \big \{ - (t_{m} - t_{m-1}) \lambda _{s r} \big (t_{m} | {{\textbf {E}}}_{k} \big ) \big \}$.

2.1 The actor-oriented multilevel relational event model

In this paper, we focus on an alternative relational event model that was proposed by Stadtfeld et al. (2017a) and Stadtfeld and Block (2017). Their approach models the receiver given the sender, similar to Perry and Wolfe (2013). Conceptually, it builds on the same tradition as the stochastic actor-oriented model (Snijders 1996), where the evolution of the network is assumed to be a product of actors’ individual behaviors as they constantly seek to maximize their own utilities. The basic framework consists of a two-step approach based on log-linear predictors. First, the waiting time until an actor becomes active is modeled. After that, a multinomial choice model (McFadden 1973) is employed to determine the choice of the receiver by the active sender actor. These two steps are assumed to be conditionally independent given the available information about the past up to that event.

The Bayesian hierarchical approach we will develop for multiple relational event sequences (denoted as "clusters") will build on this model. At each point in time, in cluster k, sender $s \in \{1, 2, \dots , N_{k} \}$ starts an interaction with intensity $\lambda _{s} \big (t | {{\textbf {E}}}_{k} \big )$. This intensity is directly proportional to the probability of a given actor s to be the next sender. This probability is given by $P\big (s_m = s | {{\textbf {E}}}_{k}\big ) = \lambda _s\big (t | {{\textbf {E}}}_{k}\big )/\sum _{h \in k} \lambda _h\big (t | {{\textbf {E}}}_{k}\big ),\ \forall \ h \in \{1, 2, \dots , N_{k} \}\ \text {and}\ m \in \{1, 2, \dots , M_{k}\}$, where $M_{k}$ is the number of events in cluster k. Next, this sender chooses the receiver $r \in \{1, 2, \dots , N_{k} | r \ne s \}$, forming the dyad $(s, r) \in {\mathcal {R}}_{k}(t)$, with intensity $\lambda _{r | s} \big (t | s, {{\textbf {E}}}_{k} \big )$. The receiver intensity represents the rate at which actor s chooses actor r to form a dyad, which is proportional to the probability of observing dyad $(s, r) \in {\mathcal {R}}_{k}(t)$ as the next one in the sequence. This probability is given by $P\Big (r_m = r | s_m = s, {{\textbf {E}}}_{k}\Big ) = \lambda _{r|s}\Big (t | s, {{\textbf {E}}}_{k}\Big )/\sum _{h \in k} \lambda _{h|s}\Big (t | s, {{\textbf {E}}}_{k}\Big ),\ \forall \ h \in \{1, 2, \dots , N_{k}\ |\ h \ne s \}\ \text {and}\ m \in \{1, 2, \dots , M_{k}\}$. Then, these intensities for cluster k, for the sender and receiver steps of this model, will be given by the following log-linear functions:

$$\begin{aligned} \begin{aligned} \lambda _{s}\big (t | {{\textbf {E}}}_{k}\big ) =&\exp \{ \varvec{\phi }' \varvec{z}_{s}(t) + \varvec{\gamma }'_k \varvec{x}_{s}(t)\},\\ \lambda _{r|s} \big (t | s, {{\textbf {E}}}_{k} \big ) =&\exp \{\varvec{\psi }' \varvec{z}_{s r}(t) + \varvec{\beta }_{k}' \varvec{x}_{s r}(t)\}, \end{aligned} \end{aligned}$$

(1)

where $\varvec{\phi }$ and $\varvec{\psi }$ are a vectors of fixed-effect parameters, $\varvec{\gamma }_k$ and $\varvec{\beta }_k$ are a vectors of random-effect parameters for cluster k, with $k = 1, \dots , K$. The vectors of statistics $\varvec{z}_{s}(t)$ and $\varvec{x}_{s}(t)$ are associated with actor $s \in \{1, 2, \dots , N_{k} \}$, whereas $\varvec{z}_{sr}(t)$ and $\varvec{x}_{sr}(t)$ are vectors of statistics associated with dyad $(s, r) \in {\mathcal {R}}_{k}(t)$.

Assuming this log-linear form will allow us to conduct inference at the actor level, unveiling effects that make actors more (or less) prone to start interactions or more (or less) likely to be chosen as the next receiver. Also, this idea helps us to limit the size of the set of possible dyads that need to be analyzed at each point in time: if actor s is the one starting an interaction at time t, then all dyads, where s is not the sender, become impossible to happen. Thus, for the actor-oriented model, the risk set, ${\mathcal {R}}_{k}(t)$, will have size $N_{k}-1$ (given the sender), whereas for a dyadic model, such as in DuBois et al. (2013), the risk set has size $(N_{k}-1)N_{k}$, which can easily become massive for medium to large networks.

The likelihood for the actor-oriented model is given by

$$\begin{aligned} \begin{aligned} p({{\textbf {E}}} | \varvec{\theta }, \varvec{Z}, \varvec{X}) = \prod _{k = 1}^{K}&\prod _{m = 1}^{M_{k}} \Bigg [ \Bigg ( \lambda _{s_m}\Big (t_m | {{\textbf {E}}}_{k}\Big ) \prod _{s \in k} S_s\Big (t_m - t_{m-1} | {{\textbf {E}}}_{k}\Big ) \Bigg ) \times \\&\times \frac{\lambda _{r_m | s_m}\Big (t_m| s_m, {{\textbf {E}}}_{k}\Big )}{\sum _{r \in k} \lambda _{r | s_m}\Big (t_m| s_m, {{\textbf {E}}}_{k}\Big )} \Bigg ] \times \prod _{s \in k} S_{s}\Big (\tau _{k} - t_{M_{k}} | {{\textbf {E}}}_{k}\Big ), \end{aligned} \end{aligned}$$

(2)

where $\varvec{\theta }$ is the vector containing all parameters and $\varvec{Z}$ and $\varvec{X}$ are matrices with fixed and random-effects covariates, respectively. The time of the last observed event in cluster k is denoted by $t_{M_{k}}$ and $\tau _{k}$ the end of the observation period. In most empirical applications, it is assumed that $t_{M_{k}} = \tau _{k}$. This way the last part of the likelihood is equal to 1, since due to the piece-wise constant exponential assumption we have $S_{s_{m}} \big (t_{m} - t_{m-1} | {{\textbf {E}}}_{k} \big ) = \exp \big \{ - (t_{m} - t_{m-1}) \lambda _{s} \big (t_{m} | {{\textbf {E}}}_{k} \big ) \big \}$. In this setting, at time $t_m$, $P\big (s = s_m | {{\textbf {E}}}_{k}\big ) = \frac{\lambda _s}{\sum _{h \in k} \lambda _h},\ \forall \ s \in k$ is the probability of sender s being active. The probability of actor r being the receiver given that s is the sender is given by $P\big (r_m = r | s_m = s, {{\textbf {E}}}_{k}\big ) = \frac{\lambda _{r|s}}{\sum _{h \in k} \lambda _{h|s}},\ \forall \ r \in k$.

The likelihood in Eq. (2) is a product of the two pieces. The first one, representing the sender model, is a piece-wise constant exponential likelihood

$$\begin{aligned} p_{\text {sender}}({{\textbf {E}}} | \varvec{\theta }, \varvec{Z}, \varvec{X}) = \prod _{k = 1}^{K} \prod _{m = 1}^{M_{k}} \left [ {\Bigg ( \lambda _{s_m}\Big (t_m | {{\textbf {E}}}_{k}\Big ) \prod _{s \in k} S_s\Big (t_m - t_{m-1} | {{\textbf {E}}}_{k}\Big ) \Bigg )} \right], \end{aligned}$$

and the second one, representing the receiver model, is a multinomial likelihood

$$\begin{aligned} p_{\text {receiver}}({{\textbf {E}}} | \varvec{\theta }, \varvec{Z}, \varvec{X}) = \prod _{k = 1}^{K} \prod _{m = 1}^{M_{k}} \left [ {\frac{\lambda _{r_m | s_m}\Big (t_m| s_m, {{\textbf {E}}}_{k}\Big )}{\sum _{r \in k} \lambda _{r | s_m}\Big (t_m| s_m, {{\textbf {E}}}_{k}\Big )}} \right]. \end{aligned}$$

In the literature, it has been shown that both of these models are special cases of the Poisson model. Holford (1980) and Laird and Olivier (1981) are examples of cases where the Poisson representation of the piece-wise exponential model is discussed. Baker (1994) extensively discusses the multinomial-Poisson transformation and how their likelihoods yield identical estimates. For proofs, see Appendix A. The Poisson regression model has been extensively studied, and it is well understood in the statistical literature (Frome 1983; Consul and Famoye 1992; Hayat and Higgins 2014).

The second level of the hierarchical actor-oriented relational event model specifies the multivariate distributions of network effects across the K sequences. We follow the standard approach in multilevel modeling by assuming multivariate normal distributions for $\varvec{\beta }_k$ under the sender model and for $\varvec{\gamma }_k$ under the receiver model, i.e.,

$$\begin{aligned} \varvec{\beta }_k\sim & {} {\mathcal {N}} (\varvec{\mu }_{\beta }, \varvec{\Sigma }_{\beta }),\\ \varvec{\gamma }_k\sim & {} {\mathcal {N}}(\varvec{\zeta }_{\gamma }, \varvec{\Sigma }_{\gamma }), \end{aligned}$$

for $k = 1, \dots , K$. The mean vectors quantify the overall, global effect of the network effects and the unstructured covariance matrices quantify the variability of the effects across clusters and the dependency structure between the effects across clusters. Thus, when estimating these distributions when fitting the multilevel model using independent relational event sequences, the estimated means and (co)variances of the second level are used to improve the estimates of the specific parameters in the separate sequences (especially in the case of short sequences).

2.2 Prior specification

In a Bayesian approach, prior distributions have to be specified which reflect our uncertainty about the model parameters before observing the data. Throughout this paper, we shall work with vague, noninformative priors (which are completely dominated by the data).

The prior choice of the random-effects covariance matrices is most important. We decompose the random-effects covariance matrix under the receiver model according to

$$\begin{aligned} \varvec{\Sigma }_{\beta } = \varvec{\sigma }_{\beta } \times \varvec{\Omega }_{\beta } \times \varvec{\sigma }_{\beta }, \end{aligned}$$

(3)

where $\varvec{\sigma }_{\beta } {:}{=}diag(\sigma _{\beta ,1}, \dots , \sigma _{\beta ,P})$ is a diagonal matrix of standard deviations and $\varvec{\Omega }_{\beta }$ is a correlation matrix (Gelman and Hill 2006). Following Carpenter et al. (2017), $\varvec{\Omega }$ will have a Lewandowski–Kurowicka–Joe (LKJ) prior and $\varvec{\tau }$ a half-Cauchy prior as follows

$\sigma _{\beta ,p} \sim \text {half-Cauchy}(0, \tau _{\beta }),\ p = 1,\dots , P,$
$\varvec{\Omega }_{\beta } \sim \text {LKJCorr}(\eta _{\beta }).$

It has been shown that a half-Cauchy prior for the random-effects standard deviation results in desirable estimates in the case of multilevel data with few clusters (Gelman and Hill 2006). Furthermore, by setting very large prior scale parameters, we can obtain approximately flat priors for the standard deviations. Further note that a half-Cauchy prior for the standard deviation corresponds to a F distribution on the variance (Mulder and Pericchi 2018), which is a common distribution for modeling variance components.

The LKJ prior is defined as $\text {LKJCorr}(\varvec{\Omega } | \eta ) \propto \text {det}(\varvec{\Omega })^{\eta - 1}$, with $\eta \in \mathrm{I\!R}^{+}$. This distribution allows us to sample uniformly from the space of positive definite correlation matrices and has a behavior similar to the beta distribution (Wang et al. 2018; Lewandowski et al. 2009). For example, when $\eta = 1$, it has a uniform behavior, when $\eta < 1$ it favors stronger correlation; whereas when $\eta > 1$, it favors weaker correlation. For the random-effects covariance matrix under the sender $\varvec{\Sigma }_{\gamma }$, the same prior is specified.

Finally, multivariate normal priors are specified for the fixed effects and random effects, i.e.,

$$\begin{aligned} \varvec{\phi }\sim & {} N({{\textbf {0}}},\varvec{\Psi }_{\phi }),\\ \varvec{\psi }\sim & {} N({{\textbf {0}}},\varvec{\Psi }_{\psi }),\\ \varvec{\mu }_{\beta }\sim & {} N({{\textbf {0}}},\varvec{\Psi }_{\mu }),\\ \varvec{\zeta }_{\gamma }\sim & {} N({{\textbf {0}}},\varvec{\Psi }_{\zeta }). \end{aligned}$$

For the prior covariance matrices, diagonal matrices are specified with very large variances. This builds upon existing knowledge of weakly informative prior specification for mixed effects generalized linear models, which have become the standard approach under this class of mixed effects models (Gelman et al. 2013; Gelman and Hill 2006).

2.3 Reparameterization to improve Bayesian computation

Due to the hierarchical structure of the data, the random-effects parameters $\varvec{\beta }_k$ and $\varvec{\gamma }_{k}$ are highly correlated with the population parameters $\varvec{\mu }$, $\varvec{\zeta }$ and $\varvec{\Sigma }$. This introduces severe computational inefficiencies in the sampling process. When the data are sparse, which is a characteristic of most social network data, the geometry of the posterior distribution makes it very difficult to sample from the highest posterior density areas. Betancourt and Girolami (2015) called these issues pathologies of the hierarchical model. Therefore, to ease the burden on the sampler, we take advantage of the multivariate normal structure of the random effects and apply a non-centered linear transformation to those parameters.

Lemma 1

Let $\varvec{\beta } \sim {\mathcal {N}}(\varvec{\mu }, \varvec{\Sigma })$, where $\varvec{\beta } \in \mathrm{I\!R}^p$. Then, with $\varvec{\mu } \in \mathrm{I\!R}^p$ and $\varvec{A}$ being a $p \times p$ matrix, such that $\varvec{A} \varvec{A}' = \varvec{\Sigma }$, one can write $\varvec{\beta } = \varvec{A} \varvec{Z} + \varvec{\mu }$, where $\varvec{Z} \in \mathrm{I\!R}^p$ and $\varvec{Z} \sim {\mathcal {N}}(\varvec{0}, \varvec{I})$, where $\varvec{I}$ is a $p \times p$ identity matrix.

This transformation can be applied to both $\varvec{\beta }$ and $\varvec{\gamma }$. A natural candidate for matrix $\varvec{A}$ is the Cholesky factorization of the covariance matrix $\varvec{\Sigma }$. For details see appendix B.

This reparameterization is more efficient for two reasons. First, it reduces the dependency between the random-effects parameters and the population parameters by sampling from independent standard normal distributions. This simplifies the geometry of the posterior and avoids inverting $\varvec{\Sigma }$ at every evaluation of the multivariate normal density (Carpenter et al. 2017). Therefore, we can safely and efficiently transform the random-effects parameters without causing any change in the prior specification of population parameters.

The model has been implemented in Stan, a probabilistic programming language that employs Hamiltonial Monte Carlo (HMC) algorithms to sample from posterior distributions. The advantage of HMC methods is that they avoid the random walk behavior and the sensitivity to posterior correlations that plague many Bayesian applications (Hoffman and Gelman 2014), including hierarchical models. This particularities allow HMC to generally converge to high dimensional distributions much faster than Metropolis–Hastings or Gibbs sampler methods (Hoffman and Gelman 2014; Betancourt and Girolami 2015).

3 Bayesian hypothesis testing under the actor-oriented multilevel relational event model

In multilevel analysis, researchers are usually interested in testing which theories receive the most support from the observed data, so that inferences about the population can be conducted. This kind of analysis is carried out through a process called hypothesis testing. From a Bayesian perspective, those tests are usually performed by the computation of Bayes factors (BF) (Jeffreys 1961). The BF is given by the ratio of marginal likelihoods under the parameter space of competing hypotheses, which quantifies the probability of observing the data under one hypothesis relative to another hypotheses, and thereby providing a quantification of the relative evidence in the data between the hypotheses. Let ${{\textbf {E}}}$ be the observed data, $\varvec{\theta }$ a vector of parameters in the space $\varvec{\Theta }$, and $\text {H}_{0} \in \varvec{\Theta }_{0}$ be a hypothesis that will be tested against $\text {H}_{1} \in \varvec{\Theta }_{1}$, then $\text {BF}_{01}$ is expressed as

$$\begin{aligned} \text {BF}_{01} = \frac{m({{\textbf {E}}} | \text {H}_{0})}{m({{\textbf {E}}} | \text {H}_{1})}, \end{aligned}$$

(4)

where $m({{\textbf {E}}} | \text {H}_{i}) = \int _{\varvec{\theta }_{i} \in \varvec{\Theta }_{i}} p(\varvec{E} | \varvec{\theta }_{i}) p(\varvec{\theta }_{i}) d\varvec{\theta }_{i}$, for $i = 0, 1$, with $p(\varvec{E} | \varvec{\theta }_{i} )$ being the likelihood and $p(\varvec{\theta }_{i})$ the prior. Also, $\varvec{\Theta }_{0} \cap \varvec{\Theta }_{1} = \varnothing$, with both $\varvec{\Theta }_{0}$ and $\varvec{\Theta }_{1}$ being subsets of $\varvec{\Theta }$. Kass and Raftery (1995) provide a rule-of-thumb for Bayes Factors interpretation. In their setting, the evidence provided by the $\text {BF}_{01}$ in favor $\text {H}_{0}$ can be seen as “insufficient" if $1< \text {BF}_{01} < 3$, “positive" if $\text {BF}_{01} > 3$, “strong" if $\text {BF}_{01} > 20$, and “very strong" if $\text {BF}_{01} > 150$. These are rough guidelines to aid the interpretation of Bayes factors and should not be used as strict cut-off values.

Furthermore, when prior probabilities of the hypotheses have been formulated before observing the data, the prior odds can be updated with the Bayes factor to obtain the posterior odds of the hypotheses, i.e.,

$$\begin{aligned} \frac{P(H_0|{{\textbf {E}}})}{P(H_1|{{\textbf {E}}})}= BF_{01} \times \frac{P(H_0)}{P(H_1)}. \end{aligned}$$

For example, when both hypotheses are equally likely a priori, i.e., $P(H_0)=P(H_1)=\frac{1}{2}$, the posterior probabilities of the hypotheses are given by $P(H_0|{{\textbf {E}}})=\frac{BF_{01}}{BF_{01}+1}$ and $P(H_1|{{\textbf {E}}})=\frac{1}{BF_{01}+1}$. These posterior probabilities quantify the probabilities that each of these hypotheses are true after observing the data (when assuming that one of these hypotheses is true). Because Bayes factors are consistent under mild conditions, the evidence for the true hypothesis goes to infinity and the posterior probability of the true hypothesis goes to 1 as the sample goes to infinity.

Below, we introduce Bayes factors for three types of hypothesis testing problems for the multilevel relational event modeling literature. Test I is useful for testing the homogeneity of network effects across independent sequences. Test II is useful for testing the degree of heterogeneity of network effects across sequences using order constraints on random-effects variances. Test III is useful to test equality and order constraints on fixed effects. Prior specification and numerical computation, which are important aspects when computing Bayes factors and posterior probabilities of the hypotheses, are discussed for each test separately.

3.1 Test I: testing for homogeneous social interaction behavior across sequences

When building multilevel relational event models, a central question is whether a driver of specific social interaction behavior (as quantified via the network effects) is constant over the sequences or whether it varies across the sequences. Testing this is important from a statistical point of view (i.e., to keep the model as parsimonious as possible, and thus to avoid an enormous overparameterization using different effects across all K sequences) but also from a substantive point of view (i.e., to understand which drivers of social interaction behavior in relational event data are constant across sequences and which (highly) differ). Thereby, testing this contributes to a better understanding of the heterogeneity of social interaction behavior.

The hypothesis test for the pth random network effect in the receiver model can be formulated as $H_0:\sigma _{\beta ,p}=0$ versus $H_0:\sigma _{\beta ,p}>0$, or equivalently as, $H_0:\beta _{p,1}=\ldots =\beta _{p,K}$ versus $H_1:\hbox { not}\ H_0$. The second formulation of the hypothesis is simpler as we avoid the need of testing whether the variance is equal to the boundary of 0, but instead we only need to test whether the random effects are equal across the K clusters. Using the fact that the distribution of the random effects effectively serves as a prior for the random effects on the first level, we can simply compute a Savage–Dickey density ratio using the estimates of the random-effects distribution from the data, i.e.,

$$\begin{aligned} \text {BF}_{01} = \frac{\pi (\xi _{1}, \dots , \xi _{K-1} = 0| {{\textbf {E}}})}{\pi (\xi _{1}, \dots , \xi _{K-1} = 0)}, \end{aligned}$$

where $\xi _{p,k}=\beta _{p,k}-\beta _{p,k-1}$, for $k=2,\ldots ,K$, are the differences between the random effects.

To compute the posterior and prior densities at the null value, we use the fact that our actor-oriented multilevel relational event model can be written as two independent mixed effects Poisson regression models having specific forms (Appendix A). Thus, the two independent components of the model belong to the family of generalized linear mixed effects models, which is well understood in the statistical literature (e.g., Gelman and Hill 2006; Bürkner 2017). Moreover, the model results in unimodal posteriors when using weakly informative priors, as done in this paper. Additionally, we use the general property that posterior distributions converge to Gaussian distributions in well-behaved problems following large sample theory (Gelman et al. 2004, Ch. 4; Rafter 1995; Kim and Ibrahim 2000). We illustrate the accuracy of this Gaussian approximation later in this paper. The Gaussian approximation of the posterior is then used to compute the numerator (then having an analytic expression). Moreover, the prior distribution, which is needed to compute the denominator, follows a multivariate Gaussian distribution by definition because it is the second level of our multilevel model, $\varvec{\beta }_k\sim N(\varvec{\mu }_{\varvec{\beta }},\varvec{\Sigma }_{\varvec{\beta }})$, and thus the joint prior distribution of the contrasts ($\xi$) also follows a multivariate Gaussian distribution.

One interesting aspect of this Bayes factor is that the prior is fully determined from the data, similar to empirical Bayesian estimation of hierarchical models. This property is especially useful here as Bayes factors are known to be sensitive to the choice of the prior. Note that empirical Bayesian approaches to obtain Bayes factors have been proposed for testing regression coefficients using g priors (e.g., see Liang et al. 2008, and the references therein), but (to our knowledge) not for testing variance components as we do here. The advantage of this approach is that no external prior information is required and no other ad hoc choices for prior distributions are needed. The outcome of the test can be used to quantify the relative evidence in the data of whether a hierarchical structure for the network effects is applicable or not given the observed data.

3.2 Test II: testing the degree of heterogeneity of network effects across sequences

After establishing which network effects are heterogeneous across sequences (using the test from the previous subsection), it is useful to investigate the relative degree of heterogeneity of network effects across sequences, again with the goal to better understand the heterogeneity of social interaction behavior across independent relational event sequences. This comes down to testing whether a specific random effect varies more across sequences than another random effect, or, equivalently, whether one random-effect variance is larger than another. When generalizing this further, we would test specific orderings of random-effect variances (see also Böing-Messing et al. 2020, who consider testing order constraints on variances in a nonhierarchical setting).

Two different tests are proposed for this purpose. First, a confirmatory test is proposed of whether an anticipated ordering regarding the degree of heterogeneity of network effects across sequences is present:

$$\begin{gathered} {\text{H}}_{0} :\sigma _{1} < \sigma _{2} < \ldots < \sigma _{P} \hfill \\ {\text{H}}_{1} :{\text{not}}{\mkern 1mu} H_{0} . \hfill \\ \end{gathered}$$

Second, an exploratory test is proposed to determine which ordering out of all P! possible orderings receives most evidence from the data, i.e.,

$$\begin{aligned} \text {H}_{1}&:&\sigma _1< \sigma _2< \dots< \sigma _{P}\\ \text {H}_{2}&:&\sigma _2< \sigma _1< \dots< \sigma _{P}\\ \vdots{} & {} \\ \text {H}_{P!}&:&\sigma _P< \sigma _{P-1}< \dots < \sigma _{1}. \end{aligned}$$

Following the existing Bayesian literature on order-constrained hypothesis testing (e.g., Klugkist et al. 2005), we specify the prior under an order-constrained hypothesis as a truncated version of an unconstrained prior truncated in the order-constrained subspace. The Bayes factor of an order-constrained hypothesis against the unconstrained hypothesis$, H_u$, can then be expressed as the ratio of the posterior and prior probability that the constraints hold (Appendix C):

$$\begin{aligned} \text {BF}_{1u} = \frac{P(\sigma _1< \sigma _2< \dots< \sigma _{{P}}| {{\textbf {E}}},H_u)}{P(\sigma _1< \sigma _2< \dots < \sigma _{{P}}|H_u)}. \end{aligned}$$

(5)

The Bayes factors between constrained hypotheses of interest can then be obtained using the transitive property of the Bayes factors, e.g., $B_{12}=B_{1u}/B_{2u}$.

The Bayes factor will not be sensitive to the prior as long as very vague identical priors are specified for the variances. In this case, the prior probability in the denominator will be equal to $(P!)^{-1}$, and the posterior probability will be fully determined by the information in the data (similar as in Bayesian estimation). Bartlett’s paradox is, thus, not an issue in Bayesian order or one-sided hypothesis testing (Jeffreys 1961; Klugkist and Hoijtink 2007; Liang et al. 2008; Mulder 2014a).

To compute the posterior probability that the order constraints hold under the unconstrained model, we can simply fit the unconstrained model (using independent vague priors) and compute the proportion of draws satisfying the order constraints. The current test extends the use of Bayes factors for testing order hypotheses on variance components (Böing-Messing et al. 2017; Mulder and Fox 2019; Böing-Messing et al. 2020, e.g.,) to multilevel relational event models.

3.3 Test III: testing common and average network effects over all sequences

In most applications, hypothesis tests are formulated on the common network parameters across clusters (i.e., the fixed effects) and on the average of the random-effects across clusters (i.e., the global random-effects means). Since both parameters have the same role, hypothesis tests of these parameters are both discussed here using the same methodology.

First, a common test is of whether a network statistic has no effect, a negative effect, or a positive effect on the interaction behavior, which could be formulated as

$$\begin{aligned} H_0:\psi _1=0 \text { versus } H_1:\psi _1<0 \text { versus } H_2:\psi _1>0 \end{aligned}$$

for the first fixed effect of the receiver model, for instance. Second, hypotheses are often formulated on combinations of parameters using equality and/or order constraints on multiple parameters of interest based on existing theories or scientific expectations (Hoijtink et al. 2019), e.g.,

$$\begin{aligned} H_1:\psi _1=\psi _2<\psi _3 \text { versus } H_2: \text { `not} H_1', \end{aligned}$$

where hypothesis $H_1$ assumes that the first effect ($\psi _1$) is equal to the second effect ($\psi _2$) and larger than the third effect ($\psi _3$), and hypothesis $H_2$ assumes the complement is true. Third, when testing the effects of categorical (dummy) variables on the event rate, scientific expectations may be formulated which categorical variable has the largest impact on the event rate. A challenging testing problem is when the interest is in assessing which categorical variable has the largest impact when it is not known which category of each categorical variable results in the largest event rate. For example, it may be of interest whether the dichotomous gender variable (0 or 1) has a larger, equal, or smaller impact on the event rate than a dichotomous race variable (0 or 1), but it is not of particular interest which category results in the largest rate. This could be translated to constrained hypotheses on the absolute values of the effects of these two categorical variables according to

$$\begin{aligned} H_0:|\psi _1|=|\psi _2| \text { versus } H_1:|\psi _1|<|\psi _2| \text { versus } H_2:|\psi _1|>|\psi _2|. \end{aligned}$$

Finally, it is possible to test parameters between the sender model and the receiver model. This allows researchers to assess whether variable has a larger/equal/smaller to predict the next sender than to predict the next receiver.

The procedure for Test III builds on default Bayes factor methodology where the information in the data is split between a minimal subset, which is used for default prior specification (in combination with a noninformative prior) and a maximal subset which is used for hypothesis testing (O’Hagan 1995; Pérez and Berger 2002, among others). When using these methods, the resulting Bayes factors do not depend on the undefined normalizing constants of the improper priors (O’Hagan 1995), which practically means that arbitrarily vague priors can be used, as we do in this paper. These Bayes factors can be computed in an automatic fashion and manual prior specification can be avoided.^{Footnote 1} Here, we use fractional Bayes factors where a (minimal) fraction of the data, denoted by b, is used to construct a (default) fractional prior, given by

$$\begin{aligned} \pi (\varvec{\phi },\varvec{\psi },\varvec{\mu },\varvec{\zeta },\varvec{\Sigma }_{\beta },\varvec{\Sigma }_{\gamma } | {{\textbf {E}}}^b) \propto p({{\textbf {E}}} | \varvec{\phi },\varvec{\psi },\varvec{\mu },\varvec{\zeta },\varvec{\Sigma }_{\beta },\varvec{\Sigma }_{\gamma })^b \pi (\varvec{\phi },\varvec{\psi },\varvec{\mu },\varvec{\zeta },\varvec{\Sigma }_{\beta },\varvec{\Sigma }_{\gamma }), \end{aligned}$$

(6)

and the remaining fraction is used for hypothesis testing. It has been shown that fractional Bayes factors follow many important properties which are not shared by all default Bayes factors, such as large sample consistency, satisfying the likelihood principle, invariance to transformations of the data (O’Hagan 1997), and information consistency (Mulder 2014b). To properly incorporate the model complexity of order-constrained hypotheses, we apply a prior adjustment of the fractional prior (Mulder 2014b), and Gaussian approximations are applied to the posterior and fractional prior to simply the computation (Kim and Ibrahim 2000; Gu et al. 2017). The accuracy of the Gaussian approximations will be discussed in the next section. The approximated fractional Bayes factor of a constrained hypothesis (e.g., with any set of equality and/or constraints on certain parameters) against an unconstrained model is then computed as the integral over the unconstrained posterior over the constrained subspace divided by the integral over the unconstrained fractional prior over the constrained subspace as a Savage–Dickey density ratio (Mulder 2014b). Existing functions in R using the ‘mvtnorm’ package (Genz et al. 2021) can be used to compute the Bayes factors (Mulder et al. 2021).

The choice of the fraction ‘b’ is based on the recommendation by Mulder and Fox (2019), which implies selecting a minimal sample that is based on the ratio between the total number of parameters (excluding group specific effects) and the total number of observations. Since sender and receiver models are two different processes assumed to be conditionally independent given the past, we have to define two difference fractions, one for each model (Hoijtink et al. 2019, see also). In the sender model, we have P random effects and Q fixed effects, hence the fraction is $b_{\text {Snd}} = \frac{(P(P+1)/2) + (P+Q)}{\sum _{k = 1}^{K} N_{k}}$ and in the receiver model we have V random effects and U fixed effects, resulting in the fraction $b_{\text {Rec}} = \frac{(V(V+1)/2) + (V+U)}{\sum _{k = 1}^{K} N_{k}}$.

4 Estimating and testing classroom dynamics using the multilevel relational event model

The data that are considered here were collected by McFarland (2001) in a study to investigate student rebellion in the classroom. The data feature observations of interactions among high-school students in two different schools in the United States. For this illustration, we consider 15 independent classrooms from Magnet High School during the 1996–1997 school year. The student body of this high school can be considered academically homogeneous. The data were collected through classroom observations in which the conversations within the classroom were coded. In each of these fifteen classes, a teacher is present in addition to the students. The number of events (the number of times one person said something to another person) ranges from 86 to 628, and the number of persons (the students plus the teacher) ranges between 19 and 30 across the event sequences. The conversations happened in an orderly lecture-like fashion, so only one person was speaking at each time. The aim of this application is twofold. At first, we present a sequential analysis that is used as proof of concept for the proposed model and statistical testing procedures. Second, we aim to provide insights into classroom dynamics by showing the evolution of network effects over time and extensively applying our tests to showcase multiple substantively interesting hypothesis tests. Below, we first discuss the actor-oriented multilevel relational event model that we use to analyze the data.

4.1 Model specification

We fit the hierarchical actor-oriented relational event model (described in Sect. 2) to the data. Initially, all effects are considered random, so the need for a hierarchical structure can be evaluated using the Bayes factor presented in Sect. 3. This first step will allow us to determine whether some effects can be treated as fixed and the model can hence be simplified.

We set the prior parameters to $\sigma ^2_{\phi } = \sigma ^2_{\mu _{\beta }} = \sigma ^2_{\psi } = \sigma ^2_{\zeta _{\gamma }} = \sigma _{\tau } = 10$, making the priors for the fixed and random effects relatively vague, and $\eta = 2$, slightly favoring smaller correlations. For our illustration of the proposed methods, we will include the following covariates into the model:

Teacher: A dummy variable that indicates whether the actor is the teacher (one if the actor is the teacher and zero otherwise).

Gender: A dummy variable that indicates the gender of the actor (one if the actor is male and zero otherwise).

Race: A dummy variable that indicates the race of the actor. McFarland (2001) notes that 50 percent of the Magnet High population is Caucasian. Hence, the variable is one if the actor is Caucasian and zero otherwise.

Inertia: Inertia captures the persistence of the communication, where past interaction is likely to be repeated (Leenders et al. 2016). It is computed as the accumulated volume of past communication from a specific sender to a specific receiver. Because this statistic is dyadic, it will be included only in the receiver model.

Participation shifts: These statistics are used to reflect expectations of adherence of communication norms in small groups (Butts 2008). Gibson (2005) describes the framework for several types of participation shifts. In this application two distinct types of participation shifts will be included. The first belongs to the group of “turn-receiving" and is represented by the event pattern ABBA: an interaction from person A to person B is immediately followed by an interaction from B to A. The second is ABAB, which can be considered as a special case of "turn-continuing": an interaction from person A to B is immediately followed by another interaction event from A to B.

The ABBA and ABAB participation shifts are concerned with dyads, therefore they will be included in the receiver model only. However, we do include adapted versions in the sender model. Here, the statistics become ABB (after A has spoken to B, the next event is B starting the conversation) and ABA (after A has spoken to B, A speaks again).

Activity: This set of covariates captures the effect of actor activity as a sender or as a receiver (Vu et al. 2017). The first is the outgoingness of a person, defined as the number of events sent by one actor up until a specific point in time. This captures the tendency of the person to start conversations (or just to talk). The second is the popularity of a person, given by the number of events received by one actor up until a specific point in time. This captures the popularity of an individual as a receiver of the conversation.

Relational event models are vulnerable to process explosion, which happens when $\lambda (t) \rightarrow \infty$ often due to a feedback loop that may be caused by using statistics that are computed as cumulative sums (Aalen et al. 2008). This is particularly a problem for inertia and activity statistics. One way to alleviate this problem is via z-score standardization at every time point, which is defined as $z(t) = (x(t) - {\bar{x}}(t))/S_{x}(t)$, where x(t) is the value of the statistic, ${\bar{x}}(t)$ is the sample mean, and $S_{x}(t)$ is the sample standard deviation at time t. We use standardized statistics in this application.

4.2 Exploring social interaction dynamics as class time progresses

By analyzing the relational event sequences as the time during class progresses, we can explore how statistical certainty increases over time using the proposed multilevel relational event model, and how interaction dynamics between children and the teacher evolves as class time progresses. To study this, we analyze the sequences with increasing batches of 20% of the total number of events in each sequence (i.e., 20%, 40%, 60%, 80%, and 100%).

4.2.1 Testing for homogeneous social interaction behavior across school classes

The first step is to fit the model with all effects considered random. We ran 2000 MCMC iterations and discarded the first 1000 samples as burn-in. We used ${\hat{R}}$ as a convergence diagnostic measure. For all models, the metric values were fairly close to 1 all, falling all below the 1.05 value recommended by Carpenter et al. (2017). As an example, Fig. 1 shows several trace plots for the model run with 100% of the events. The chains show convergence. The number of posterior draws is not high because the Hamiltonian dynamics in Stan’s algorithm allow for faster convergence than standard MCMC methods (Hoffman and Gelman 2014). To check this, we investigated the bulk effective sample size and the tail effective sample size, which fell between 800 and 1100 for all parameters ( Carpenter et al. (2017) recommends these values to be both above 100 for the sample to be considered reliable). This confirms the computational efficiency of the sampler. For other sample size recommendations, see Hecht and Zitzmann (2021). Since a more parsimonious model is preferred, we consider an effect to vary across classrooms via a random effect when the posterior probability for the hypothesis that an effect is fixed, $\text {H}_{0}$, is less than 0.25, corresponding to $\text {BF}_{01} < 1/3$ (which can be interpreted as positive evidence that the effect varies across clusters; Kass & Raftery, 1995).

Figure 2 shows the evolution of the posterior probabilities of $\text {H}_{0}$ as we increase the number of events. In the beginning, when the number of events is small (varying from 17 to 125 events across school classes), the uncertainty in the posterior is relatively large leading to considerable overlap in the posterior distributions of the random effects. Therefore, the posterior probabilities for a fixed effect tend to start out large and eventually decrease to zero when the full sequences are considered for most effects, providing strong evidence for their heterogeneous nature. This is the case for most effects that are clearly heterogeneous if the sample sizes are large enough. On the other hand, for some covariates, the posterior probability gets larger as the sample sizes near 100% of the events. This is the case for ABA, outgoingness and popularity in the sender model and for race in the receiver model, resulting in posterior probabilities for a fixed effect of 0.768, 0.681 and 0.579, 0.777, respectively. Thus, the parameters corresponding to those effects will be fixed in our model. Therefore, we continue with a mixed-effect model, where the sender model has ABA, outgoingness, and popularity as fixed effects, and intercept, teacher, gender, race, and ABB as random effects, and a receiver model where only race is a fixed effect and all other effects are considered random.

To understand the nonmonotonic development of the posterior probabilities where the lines first go down and eventually increase, it is useful to check the corresponding estimates of the random-effects. As an example, the top left panel of Fig. 3 shows random-effect estimates of the race covariate in the receiver model, where each line represents a five different classrooms (to keep the plot clear; the other classrooms showed similar patterns). As the lines show we see that the random effects become more heterogeneous until 80% of the events are included in the analysis but when all 100% of the events are included the estimates become more homogeneous which explains the evidence for a fixed effect for the race effect in the receiver model in Fig. 2 when all 100% of the events are considered.

In addition, in the bottom right panel of Fig. 3, which presents the random-effect estimates for outgoingness in the sender model, a similar pattern is observed. The random effects becomes heterogeneous between 40% and 60% of the events, but they rapidly converge displaying a pattern similar to race in the sender model. This behavior is reflected in the posterior probabilities, with the line going down for 40% and 60% of the events and then getting back up again. On the other hand, the estimates on the left-hand side panels of Fig. 3, which display popularity in the receiver model and teacher in the sender model, are consistently heterogeneous as class time progresses. This is corroborated by the posterior probabilities with both effects presenting small evidence for the null.

4.2.2 Evaluating the shrinkage behavior of random effects

Given that there is asymmetry in the distribution of the number of events across classrooms (varying from 86 to 628 events over classrooms), there will be a variation regarding the statistical uncertainty of the estimated random effects across classrooms. Due to the multilevel structure of the model, estimated effects with a relatively large uncertainty which also deviate much from the other estimates will then be (slightly) pulled towards the global mean of the effect over all class rooms. This statistical behavior is also known as shrinkage, or borrowing information across groups, which is an intrinsic property of multilevel models. The current sequential analysis where the sample sizes grows allow us to investigate this behavior, since with fewer events we should have less information and, therefore, less accurate classroom-specific estimates.

We use the R package remstimate (Arena et al. 2022) to obtain the estimates of the classroom-specific effects which ignores the multilevel structure of the data. Figure 4 shows the shrinkage for several effects in the sender and receiver models. The lines were plotted using a gray scale where lighter shades mean smaller sample sizes. In each panel, we have the estimates based on the estimation of the separate classrooms at the top, and the multilevel estimates at the bottom.

First, we see that there is considerably more variation of the estimated effects in the separate analyses (top of each panel), which ignore the multilevel structure, in comparison to the multilevel estimates (bottom of each panel). This confirms that the proposed multilevel model shows the anticipated shrinkage behavior. Second, we see that when using subsets of only 20% complete event sequences in the classrooms, we obtain more extreme estimates than when using the proposed multilevel model. Finally, we see that estimates for classrooms with shorter event sequences (represented with light gray lines) are on average more extreme than estimates for classrooms for longer event sequences. This illustrates that the proposed multilevel relational event models is particularly useful when the event sequences are relatively short to avoid extreme estimates.

4.2.3 Evolution of classroom behavior

Another interesting aspect to look at in the sequential analysis is the potential change of the estimated of the effects as class time progresses. This will give us an insight into the social interaction dynamics in classrooms. Figure 5 shows the evolution of some mean effects in the sender and receiver models. Here, we restrict ourselves to some trends of average network behavior across all classrooms. Note that the intervals are relatively wide because the limited lengths of the event sequences in a class period.

First, we see that the teacher effect in the sender model is positive and large in the sender model. This was expected given the dominant role of the teacher during lectures. The plot in Fig. 5 (upper left panel) also shows a slight decrease as class time progresses. This suggests that the dominant role of the teacher slightly decreases as the lectures approach the end. This indicates that teachers may loose some of the control towards the end or that more students become engaged or activated towards the end. This is also confirmed by the ABA effect in the sender model (Fig. 5, upper right panel)) which is initially positive and decrease to just below zero towards the end of the class period. Thus, it becomes less likely that the sender of the previous messages (typically the teacher because these are lectures) becomes the sender of the next event.

Another interesting trend can be observed for the global inertia effect across classrooms in the receiver model (Fig. 5, lower left panel) In the beginning of the class period, inertia starts out negative which indicates that it is less likely that actors who were the receiver of relational events in the past (typically a question or remark by the teacher towards a student because lectures are considered) become a receiver of the next. This indicates that teachers aim to address different students in the beginning of the lecture with the goal to get more students engaged or activated during lectures. The effect, however, gradually increases as the class time progressed. As the class period nears the end, the inertia effect becomes positive which implies that it has become more likely that a student is addressed by the teacher who was also addressed by the teacher in the past. This suggests that teachers tend to focus more on the same students (possibly the more motivated students) rather than trying to involve other less active students in the discussion. This is also confirmed by the positive and increasing ABAB effect in the receiver model during class time (Fig. 5, lower right panel).

4.3 Mixed-effects analysis on the full sequences

In this subsection, we carry out the other proposed tests under the mixed-effects model obtained from the homogeneity tests that were carried out in the previous section. We only restrict ourselves to the results for the full event sequences to keep the discussion of the results as concise as possible. We ran 4 chains, each with 5000 MCMC iterations with a burn-in of 2500 draws. Figure 6 shows the trace plots of a few parameters, all chains show convergence. Here, we also used the ${\hat{R}}$ as a convergence measure, they were essentially 1 in all chains for all parameters. In addition, the bulk and tail effective sample sizes were all close to around 2000 samples.

Moreover, Figs. 7 and 8 display the density estimated from each of the 4 chains and a Gaussian approximation on top. As can be seen, the Gaussian approximation seems to be acceptable in the bulk as well as in the tails of the distribution. Therefore, empirically, the approach of conducting the tests using these approximations as proxies for the posterior distributions seems reasonable. From now on, given that the results for the four chains are virtually identical, we proceed to analyze the results of a single chain.

We can see that the proposed testing procedure to determine which effects are random and which are fixed works correctly because a similar fit to the data is obtained in the more parsimonious model (where some effects are fixed) compared to the larger (i.e., all random) model. We can see this by computing the point-wise deviance residuals for both models under the full multilevel model where all effects are random across classrooms and the mixed effects model where certain effects are assumed fixed across classrooms (according to the proposed Bayes factor test). If a similar fit is obtained for both models, the point-wise deviance residuals for both models are similar meaning that none of the two models dominates the other in terms of fit. Figure 9 clearly shows that that is the case: most points lie around the gray diagonal where the values are equal. This indicates that the fit provided by both models is similar. Details of the point-wise deviance computation are provided in appendix E. Finally, Table 1 shows a comparison of the posterior estimates for both models, where the bold ones are effects treated as fixed in the mixed-effects model. In sum, the mixed effects model results in a comparable fit but with much less parameters to estimate, resulting in a more parsimonious model and a simpler explanation of social interaction behavior across classrooms.

Table 1 Parameter estimates of the mean effects

Full size table

4.3.1 Testing the degree of heterogeneity of network effects across school classes

By testing order constraints on the random-effects variance parameters, we can assess which characteristics cause most variation in the social interaction behavior across school classes. In particular, we focus on the random effects of the teacher, gender, and race variables in the sender model (which are all dummy variables). The same test can also be applied to the variance parameters in the receiver model in a similar manner. The objective is to determine the amount of evidence regarding the level of heterogeneity among these effects. Let $\sigma ^2_{{teacher}}$, $\sigma ^2_{{gender}}$, and $\sigma ^2_{{race}}$ be the between classrooms variances of the teacher, gender, and race effects, respectively, where these parameters are extracted from the diagonal of $\varvec{\Sigma }_{\gamma }$. Given the special (dominant) role of the teacher in a classroom, and the fact that different teachers have different teaching styles, it is expected that the variance of the teacher effect is largest. We tested all six possible order hypotheses for the random-effects variances:

$$\begin{aligned} \begin{aligned} \text {H}_{1}&: \sigma ^2_{teacher}> \sigma ^2_{gender}> \sigma ^2_{race}\\ \text {H}_{2}&: \sigma ^2_{teacher}> \sigma ^2_{race}> \sigma ^2_{gender}\\ \text {H}_{3}&: \sigma ^2_{race}> \sigma ^2_{teacher}> \sigma ^2_{gender}\\ \text {H}_{4}&: \sigma ^2_{race}> \sigma ^2_{gender}> \sigma ^2_{teacher}\\ \text {H}_{5}&: \sigma ^2_{gender}> \sigma ^2_{teacher}> \sigma ^2_{race}\\ \text {H}_{6}&: \sigma ^2_{gender}> \sigma ^2_{race} > \sigma ^2_{teacher}. \end{aligned} \end{aligned}$$

The goal is to determine which hypothesis receives most evidence from the data. Below, the Bayes factors for each one of these hypotheses against one another is represented in the form of an evidence matrix:

$\begin{gathered} \begin{array}{*{20}c} {\;\;\;\;\;\;\;\;\;\;{\text{H}}_{1} } & {\;\;{\text{H}}_{2} } & {\;\;\;{\text{H}}_{3} } & {\;\;\;\;\;\;{\text{H}}_{4} } & {\;\;\;\;\;\;\;{\text{H}}_{5} } & {\;\;\;\;\;\;\;\;\;{\text{H}}_{6} } \\ \end{array} \hfill \\ \begin{array}{*{20}c} {{\text{H}}_{1} } \\ {{\text{H}}_{2} } \\ {{\text{H}}_{3} } \\ {{\text{H}}_{4} } \\ {{\text{H}}_{5} } \\ {{\text{H}}_{6} } \\ \end{array} \left( {\begin{array}{*{20}c} {1.00} & {1.47} & {5788.40} & {615.79} & {5788.40} & {275.64} \\ {0.73} & {1.00} & {4200.00} & {446.81} & {4200.00} & {200.00} \\ {0.00} & {0.00} & {1.00\;\;\;\;\;} & {0.00\;\;\;} & {0.00\;\;\;\;\;} & {0.00\;\;\;} \\ {0.00} & {0.00} & {8.40\;\;\;\;\;} & {1.00\;\;\;} & {8.40\;\;\;\;\;} & {0.40\;\;\;} \\ {0.00} & {0.00} & {0.00\;\;\;\;\;} & {0.00\;\;\;} & {1.00\;\;\;\;\;} & {0.00\;\;\;} \\ {0.00} & {0.00} & {20.00\;\;\;} & {2.13\;\;\;} & {20.00\;\;\;} & {1.0\;\;\;\;\;} \\ \end{array} } \right). \hfill \\ \end{gathered}$

Each cell of the matrix represent the comparison of the hypothesis in the rows against the hypothesis in the columns. So, for example, the evidence for $\text {H}_1$ against $\text {H}_2$ is equal to 1.5. The evidence for $\text {H}_2$ against $\text {H}_1$ is then $1/1.50 = 0.67$.

The evidence matrix clearly shows that the evidence for hypotheses $\text {H}_1$ and $\text {H}_2$ is much larger than the evidence for any of the other hypotheses, which indicates that the teacher effect shows indeed most heterogeneity across the fifteen classrooms. When inspecting the estimates of the variances, this is also confirmed: ${\bar{\sigma }}^2_{teacher} = 1.65$, ${\bar{\sigma }}^2_{gender} = 0.25$, and ${\bar{\sigma }}^2_{race} = 0.22$. Note that the added value of the Bayes factor complementary to eyeballing the estimates is that the effect sizes and their uncertainty are combined in a principled manner to determine which effects are most heterogeneous across classrooms. Finally note that there is no clear evidence that either the gender effect or the race effect is more heterogeneous across classrooms. Thus, we can conclude that teachers have the biggest impact on the variability of social interaction behavior across classrooms.

4.3.2 Testing the impact of nodal characteristics on classroom dynamics

In this subsection, we test the effects of nodal characteristics how they affect classroom dynamics. In the sender model, due to the special role of the teacher to control interaction behavior during lectures, the teacher variable is expected to have the strongest impact on the rate at which an actor starts an interaction when compared to the other two personal-trait variables gender and race. No expectations are formulated about which of these latter two variables has the largest effect in the model, or about which category of these dichotomous variables results in a positive effect. This results in the following hypotheses where constraints are formulated on the absolute values of the respective effects of these variables

$$\begin{aligned} \begin{aligned} \text {H}_{1}&: |\zeta _{{teacher}}|> |\zeta _{{gender}}|> |\zeta _{{race}}|\\ \text {H}_{2}&: |\zeta _{{teacher}}|> |\zeta _{{gender}}| = |\zeta _{{race}}|\\ \text {H}_{3}&: |\zeta _{{teacher}}|> |\zeta _{{race}}| > |\zeta _{{gender}}|\\ \hbox {H}_{4}&: \text {neither} H_1 \hbox {,} H_2, nor H_3. \end{aligned} \end{aligned}$$

The complement hypothesis $H_4$ is included as a “safety net” in case our expectations about the dominance of the teacher variable would not be supported by the data. Once again, we present the resulting fractional Bayes factors in an evidence matrix form

$\begin{gathered} \begin{array}{*{20}c} {\;\;\;\;\;\;\;\;\;\;{\text{H}}_{1} } & {\;\;\;{\text{H}}_{2} } & {\;\;\;\;{\text{H}}_{3} } & {} \\ \end{array} \;\;\;\;{\text{H}}_{4} \hfill \\ \begin{array}{*{20}c} {{\text{H}}_{1} } \\ {{\text{H}}_{2} } \\ {{\text{H}}_{3} } \\ {{\text{H}}_{4} } \\ \end{array} \left( {\begin{array}{*{20}c} {1.00} & {0.21} & {4.74} & {2.39{\text{e}}8} \\ {4.56} & {1.00} & {21.63} & {1.09{\text{e}}9} \\ {0.21} & {0.05} & {1.00} & {5.05{\text{e7}}} \\ {0.00} & {0.00} & {0.00} & {1.00\;\;\;} \\ \end{array} } \right) \hfill \\ \end{gathered}.$

Again, we translate these Bayes factors to posterior probabilities when assuming that each hypothesis is equally likely a priori, which results in posterior probabilities of 0.235, 0.723, 0.042, and 0.000, for $H_1$, $H_2$, $H_3$, and $H_4$, respectively. Based on these results we can safely rule out the complement hypothesis $H_4$. Moreover, hypothesis $H_2$ (which assumes that the teacher variable has the largest effect, and the gender effect and the race effect are equal) receives most evidence; but only approximately 3 times more evidence than the hypothesis $H_1$ (which assumes that gender plays a larger role than race after the teacher effect). In order to draw more decisive conclusions of whether $H_1$ or $H_2$ is true more data would be required. These results are confirmed when looking at the posterior estimates, i.e., 2.22, 0.289, 0.143, and their posterior standard deviations, i.e., 0.344, 0.131, 0.103, of the absolute values of the teacher, gender, and race effect. Note again that the Bayes factors and posterior probabilities are a principled probabilistic methodology to summarize these findings without requiring subjective eyeballing of the estimates.

Under the receiver model, we consider a more exploratory approach by considering all possible order hypotheses on the absolute effects of these nodal characteristics. Note that even though the teacher still has a dominant role, this may not necessarily be the case in the receiver model. The following hypotheses are considered

$$\begin{aligned} \begin{aligned} \text {H}_{1}&: |\mu _{{teacher}}|> |\mu _{{gender}}|> |\mu _{{race}}|\\ \text {H}_{2}&: |\mu _{{teacher}}|> |\mu _{{race}}|> |\mu _{{gender}}|\\ \text {H}_{3}&: |\mu _{{race}}|> |\mu _{{teacher}}|> |\mu _{{gender}}|\\ \text {H}_{4}&: |\mu _{{race}}|> |\mu _{{gender}}|> |\mu _{{teacher}}|\\ \text {H}_{5}&: |\mu _{{gender}}|> |\mu _{{race}}|> |\mu _{{teacher}}|\\ \text {H}_{6}&: |\mu _{{gender}}|> |\mu _{{teacher}}| > |\mu _{{race}}|, \end{aligned} \end{aligned}$$

where $|\mu |$ is the absolute value of $\mu$, which represents the global mean of the random effects in the receiver model. We also display the results in the form of an evidence matrix which yields:

$\begin{gathered} \;\;\;\;\;\;\;\;\begin{array}{*{20}c} {{\,\,\,\text{H}}_{1} } & {\;\;\;{\text{H}}_{2} \;} & {\;\;\;\;\;{\text{H}}_{3} \;} & {\;\;\;\;\;\;{\text{H}}_{4} \;} & {\;\;\;{\text{H}}_{5} \;} & {\;\;\;{\text{H}}_{6} } \\ \end{array} \hfill \\ \begin{array}{*{20}c} {{\text{H}}_{1} } \\ {{\text{H}}_{2} } \\ {{\text{H}}_{3} } \\ {{\text{H}}_{4} } \\ {{\text{H}}_{5} } \\ {{\text{H}}_{6} } \\ \end{array} \left( {\begin{array}{*{20}c} {1.00} & {45.52\;} & {354.37\;} & {254.63} & {3.82} & {0.44} \\ {0.02} & {1.00\;\;\;} & {7.78\;\;\;\;} & {5.39\;\;\;} & {0.08} & {0.01} \\ {0.00} & {0.12\;\;\;} & {1.00\;\;\;\;} & {0.69\;\;\;} & {0.01} & {0.00} \\ {0.00} & {0.18\;\;\;} & {1.44\;\;\;\;} & {1.00\;\;\;} & {0.02} & {0.00} \\ {0.26} & {11.91\;\;} & {92.69\;\;} & {64.25\;} & {1.00} & {0.12} \\ {2.28} & {103.89} & {888.80} & {560.61} & {8.73} & {1.00} \\ \end{array} } \right) \hfill \\ \end{gathered}$

and translating these to posterior probabilities result in the following posterior probabilities for the respective order hypotheses: 0.291, 0.006, 0.001, 0.001, 0.068, 0.633. These results indicate that almost all probabilities mass goes to $H_1$ and $H_6$, which suggests that the race variable has the smallest impact. Furthermore, the gender variable is expected to play the largest role in the receiver model among these nodal characteristics. These results are confirmed when looking at the posterior estimates, which equal 0.243, 0.310, 0.070, and posterior standard deviations 0.172, 0.100, and 0.045, for the absolute teacher, gender, and race effect, respective.

4.3.3 Testing the impact of network statistics on classroom dynamics

Next, we focus on testing the effects of endogenous (network) characteristics of actors. Specifically, we evaluate whether the fixed effects of popularity or outgoingness makes actors more likely to be the next sender. Let $\phi _{\text {popularity}}$ be the mean effect of popularity and $\phi _{\text {outgoingness}}$ be the mean outgoingness effect, both in the sender model. We test the following hypotheses against one another:

$$\begin{aligned} \begin{aligned} \text {H}_{1}&: \phi _{\text {popularity}} = \phi _{\text {outgoingness}}\\ \text {H}_{2}&: \phi _{\text {popularity}} < \phi _{\text {outgoingness}}\\ \text {H}_{3}&: \phi _{\text {popularity}} > \phi _{\text {outgoingness}}. \end{aligned} \end{aligned}$$

Having more than two hypotheses, we show the Bayes factors in an evidence matrix form, which yields

$\begin{gathered} \begin{array}{*{20}c} {\;\;\;\;\;\;\;\;\;\;\;\;{\text{H}}_{1} } & {\;\;\;\;\;\;\;\;{\text{H}}_{2} } & {\;\;\;\;\;\;\;{\text{H}}_{3} } \\ \end{array} \hfill \\ \begin{array}{*{20}c} {{\text{H}}_{1} } \\ {{\text{H}}_{2} } \\ {{\text{H}}_{3} } \\ \end{array} \left( {\begin{array}{*{20}c} {*20l1.00} & {48.55\;\;} & {0.43} \\ {0.02\;\;\;\;\;\;} & {1.00\;\;\;\;} & {0.01} \\ {2.29\;\;\;\;\;\;} & {110.99\;} & {1.00} \\ \end{array} } \right) \hfill \\ \end{gathered} .$

Assuming equal prior probabilities of the three hypotheses, the posterior probabilities are equal to $P(\text {H}_1|{{\textbf {E}}})=0.384$, $P(\text {H}_2|{{\textbf {E}}})=0.006$, and $P(\text {H}_3|{{\textbf {E}}})=0.610$. The results indicate that $\text {H}_{3}$ is most likely to be true after observing the data, being approximately twice more likely than $\text {H}_{1}$, and approximately 100 times more likely than $\text {H}_2$. For this reason, we can with almost complete certainty state that the popularity effect is not smaller than the outgoingness effect. This is confirmed by the posterior estimates of the popularity and the outgoingness effect which are equal to 0.17 and 0.07, respectively; see also the posteriors in Fig. 10. More data are required to obtain more decisive evidence of whether $\text {H}_{1}$ or $\text {H}_{3}$ is true.

In addition to testing the fixed effects as above, we can also test the global means of random effects. Here, we test the participation shift ABBA is more likely than the participation shift ABAB (both are random effects) after controlling for the other effects in the receiver model. Let $\mu _{\text {abba}}$ be the global mean effect of ABBA and $\mu _{\text {abab}}$ be the global mean effect of ABAB. This comparison captures the tension between an individual’s tendency to continue to speak to the same individual or the societal norm of reciprocity where the previous receiver because the sender of the next event and the previous sender because the receiver. In polite conversation norms, we would expect to see more turn-switches between two individuals (A speaks to B and then B responds to A) than turn-continuing (where A directs a comment to B and then continues to speak without giving B the opportunity to respond first). The estimates in Table 1 suggests that this is indeed the case since the mean effect of ABBA is almost four times as large as the mean effect of ABAB, 4.27 and 1.18, respectively. Here, we formally test this by considering the following three hypotheses:

$$\begin{aligned} \begin{aligned} \text {H}_{1}&: \mu _{\text {abba}} = \mu _{\text {abab}}\\ \text {H}_{2}&: \mu _{\text {abba}} < \mu _{\text {abab}}\\ \text {H}_{3}&: \mu _{\text {abba}} > \mu _{\text {abab}}. \end{aligned} \end{aligned}$$

This yields the following evidence matrix:

$\begin{gathered} \begin{array}{*{20}c} {\;\;\;\;\;\;\;\;\;\;\;\;{\text{H}}_{1} } & {\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;{\text{H}}_{2} } & {\;\;\;\;\;\;\;\;\;\;\;\;{\text{H}}_{3} } \\ \end{array} \hfill \\ \begin{array}{*{20}c} {{\text{H}}_{1} } \\ {{\text{H}}_{2} } \\ {{\text{H}}_{3} } \\ \end{array} \left( {\begin{array}{*{20}c} {*20l1.00\;\;\;\;\;\;\;\;\;} & {143.26\;\;\;\;\;\;\;} & {0.00} \\ {0.00\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;} & {1.00\;\;\;\;\;\;\;\;\;\;\;} & {0.00} \\ {4.19 \times 10^{{12}} \;\;\;\;\;\;} & {6.00 \times 10^{{14}} \;} & {1.00} \\ \end{array} } \right) \hfill \\ \end{gathered} .$

The evidence favoring $\text {H}_{3}$ is overwhelmingly larger than the evidence for any other hypothesis, strongly supporting the hypothesis that the effect of immediate reciprocity is larger than that of immediate inertia across these fifteen classrooms at Magnet High school. This is also confirmed by the posterior probabilities $P(\text {H}_1|{{\textbf {E}}})=0.000$, $P(\text {H}_2|{{\textbf {E}}})=0.000$, and $P(\text {H}_3|{{\textbf {E}}})=1.000$, when assuming equal prior probabilities.

Finally, we showcase a hypothesis test of network effects between the receiver model and the sender model. Specifically, we test whether the teacher variable has a larger effect on the rate of being a sender or on the rate of being chosen as receiver. Given the dominant role of the teacher as a sender, we expect the teacher effect is larger in the sender model. We formulate the hypotheses as follows:

$$\begin{aligned} \begin{aligned} \text {H}_{1}&: \zeta _{{teacher}} = \mu _{{teacher}}\\ \text {H}_{2}&: \zeta _{{teacher}} < \mu _{{teacher}}\\ \text {H}_{3}&: \zeta _{{teacher}} > \mu _{{teacher}}, \end{aligned} \end{aligned}$$

where hypothesis $H_3$ corresponds to our scientific expectation.

The resulting evidence matrix for the Bayes factors is:

$\begin{array}{*{20}c} {{\text{H}}_{1} } \\ {{\text{H}}_{2} } \\ {{\text{H}}_{3} } \\ \end{array} \mathop {\left( {\begin{array}{*{20}l} \vline & {20l1.00} \hfill & {92.15} \hfill & {0.00} \hfill \\ \vline & {0.01} \hfill & {1.00} \hfill & {0.00} \hfill \\ \vline & {5.84 \times 10^{4} } \hfill & {5.38 \times 10^{6} } \hfill & {0.00} \hfill \\ \end{array} } \right)}\limits^{{\begin{array}{*{20}l} {{\text{H}}_{1} \qquad {\text{H}}_{2} \qquad {\text{H}}_{3} } \hfill \\ \end{array} }} .$

Indeed, the evidence matrix shows convincing evidence that the teacher effect is larger in the sender model than in the receiver model. The posterior probabilities of the three hypotheses are equal to $P(\text {H}_1|{{\textbf {E}}})=0.000$, $P(\text {H}_2|{{\textbf {E}}})=0.000$, and $P(\text {H}_3|{{\textbf {E}}})=1.000$, when assuming equal prior probabilities, which results in the same conclusion. The results are also confirmed by the estimates which are equal to ${\bar{\zeta }}_{{teacher}} = 2.22$ and ${\bar{\mu }}_{{teacher}} = 0.20$.

5 Discussion

In this paper, we presented a Bayesian actor-oriented multilevel relational event model for studying social interaction behavior from independent relational event sequences. This model allows for inferences at the actor level, thus opening the possibility of unveiling effects that make an actor more prone to send or receive an interaction in the population under study. Our results show that the model is able to capture the effects that have the largest impacts in actors preferences, even when those effects are different in the sender activity and receiver choice rates. The models can be estimated using the Stan programming language, the Stan code for the actor-oriented relational event model is available on Github (https://github.com/Fabio-Vieira/bayesian_dynamic_network).

Furthermore, a flexible set of hypothesis testing procedures was proposed for this class of models, facilitating inferences on population parameters of the estimated effects. These tests can be used for testing whether effects should be treated as fixed or random across sequences, for testing the relative heterogeneity of different network effects across sequences, and for testing equality and order constraints on the fixed effects, even on their absolute values. These tests are useful to get a better understanding about the heterogeneity of social interaction behavior across independent relational event sequences. These novel testing procedures can also be applied to other mixed effects models to better understand the heterogeneity in other types of multilevel or clustered data.

The computational issues originating from the inefficiencies that result from the hierarchical structure of the model are the main limitations in the estimation process. We have taken advantage of the multivariate normal structure of the hierarchical prior to induce a non-centered transformation in the random-effects parameters, which is more efficient in practice for the reasons aforementioned. An issue that we have not directly addressed in our paper is the sparsity of relational event data which may cause the geometry of the posterior distribution to be highly complex (Betancourt and Girolami (2015)). The alien form of the likelihood of the relational event model also presents a challenge to the estimation of hierarchical relational event models. An especially attractive next step would to construct alternative representations of the model so that posterior distributions in closed form could be derived, improving efficiency in the estimation process.

Another promising direction to improve computational feasibility when fitting multilevel relational event models to the large clustered relational event sequences is using meta-analytic approximations (Borenstein et al. 2010). A Gibbs sampler could be derived using the point estimates from the independent relational event sequences as data observations. The question would then be how large the relational event sequences should be in order for these meta-analytic approximation to be accurate enough to make reliable statistical inferences. Our flexible Bayes factor tests could then be built on top of that. Finally, another important direction for future research would be to model time-varying coefficients across sequences in order to discover complex temporal changes of the network drivers of social interactions over time.

Data availability

Requests for data access should be directed to Daniel McFarland.

Change history

22 December 2023
Funding note was published incorrectly and corrected in this version.

Notes

Alternatively, JZS priors (Bayarri and García-Donato 2007; Rouder et al. 2009; Wetzels et al. 2012) could also be considered for this test but, to our knowledge, this class of priors has not yet been proposed for generalized linear mixed effects models. Moreover, note that the prior scale of the key parameter that is tested still needs to be manually specified, unlike the proposed fractional Bayes factor using a minimally informative fractional prior.

References

Aalen O (1989) A linear regression model for the analysis of life times. Stat Med 8(8):907–925
Article Google Scholar
Aalen O, Borgan O, Gjessing H (2008) Survival and event history analysis: a process point of view. Springer, New York
Book Google Scholar
Arena G, Lakdawala R, Meijerink-Bosman M, Karimova D, Shafiee Kamalabad M, Generoso Vieira F (2022) remstimate: optimization tools for tie-oriented and actor-oriented relational event models [Computer software manual]. Retrieved from https://github.com/TilburgNetworkGroup/remstimate (R package version 1.0)
Baker SG (1994) The multinomial-Poisson transformation. J Royal Stat Soc: Ser D (The Statistician) 43(4):495–504
Google Scholar
Bayarri MJ, García-Donato G (2007) Extending conventional priors for testing general hypotheses in linear models. Biometrika 94(1):135–152
Article MathSciNet Google Scholar
Betancourt M, Girolami M (2015) Hamiltonian Monte Carlo for hierarchical models. Curr Trends Bayesian Methodol Appl 79(30):2–4
Google Scholar
Blonder B, Dornhaus A (2011) Time-ordered networks reveal limitations to information flow in ant colonies. PloS one 6(5):e20298
Article Google Scholar
Böing-Messing F, Mulder J (2020) Bayes factors for testing order constraints on variances of dependent outcomes. Am Stat 75(2):152–61
Article MathSciNet Google Scholar
Böing-Messing F, van Assen MA, Hofman AD, Hoijtink H, Mulder J (2017) Bayesian evaluation of constrained hypotheses on variances of multiple independent groups. Psychol Methods 22(2):262
Article Google Scholar
Borenstein M, Hedges LV, Higgins JP, Rothstein HR (2010) A basic introduction to fixed-effect and random-effects models for meta-analysis. Res Synth Methods 1(2):97–111
Article Google Scholar
Bürkner P-C (2017) brms: an r package for Bayesian multilevel models using Stan. J Stat Softw 80:1–28
Article Google Scholar
Butts CT (2008) A relational event framework for social action. Sociol Methodol 38(1):155–200
Article Google Scholar
Butts CT, Marcum CS (2017) A relational event approach to modeling behavioral dynamics. Group processes. Springer, Cham, pp 51–92
Chapter Google Scholar
Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, Riddell A (2017) Stan: a probabilistic programming language. J Stat Softw. https://doi.org/10.18637/jss.v076.i01
Article Google Scholar
Collett D (2015) Modelling survival data in medical research. CRC Press
Book Google Scholar
Consul P, Famoye F (1992) Generalized Poisson regression model. Commun Stat-Theory Methods 21(1):89–109
Article Google Scholar
Cox DR (1972) Regression models and life-tables. J Royal Stat Soc: Ser B (Methodological) 34(2):187–202
MathSciNet Google Scholar
Dickey JM (1971) The weighted likelihood ratio, linear hypotheses on normal location parameters. Ann Math Stat 42(1):204–223
Article MathSciNet Google Scholar
DuBois C, Butts CT, McFarland D, Smyth P (2013) Hierarchical models for relational event sequences. J Math Psychol 57(6):297–309
Article MathSciNet Google Scholar
Eagle N, Pentland AS (2003) Social network computing. In: International conference on ubiquitous computing, pp 289-296
Frank O, Strauss D (1986) Markov graphs. J Am Stat Assoc 81(395):832–842
Article MathSciNet Google Scholar
Friedman M (1982) Piecewise exponential models for survival data with covariates. Ann Stat 10(1):101–113
Article MathSciNet Google Scholar
Frome EL (1983) The analysis of rates using Poisson regression models. Biometrics 39(3):665–674
Article Google Scholar
Gelman A, Hill J (2006) Data analysis using regression and multilevel/hierarchical models. Cambridge University Press
Book Google Scholar
Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB (2013) Bayesian data analysis. CRC Press
Book Google Scholar
Gibson DR (2005) Taking turns and talking ties: networks and conversational interaction. Am J Sociol 110(6):1561–1597
Article Google Scholar
Hanneke S, Fu W, Xing EP (2010) Discrete temporal models of social networks. Electron J Stat 4:585–605
Article MathSciNet Google Scholar
Hayat MJ, Higgins M (2014) Understanding Poisson regression. J Nurs Educ 53(4):207–215
Article Google Scholar
Hecht M, Zitzmann S (2021) Sample size recommendations for continuous-time models: compensating shorter time series with larger numbers of persons and vice versa. Struct Equ Model: Multidiscip J 28(2):229–236
Article MathSciNet Google Scholar
Hoffman MD, Gelman A (2014) The no-u-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J Mach Learn Res 15(1):1593–1623
MathSciNet Google Scholar
Hoijtink H, Gu X, Mulder J (2019) Bayesian evaluation of informative hypotheses for multiple populations. Br J Math Stat Psychol 72(2):219–243
Article Google Scholar
Holford TR (1980) The analysis of rates and of survivorship using log-linear models. Biometrics 36(2):299–305
Article MathSciNet Google Scholar
Jeffreys H (1961) Theory of probability, 3rd edn. Oxford University Press, Oxford
Google Scholar
Kass RE, Raftery AE (1995) Bayes factors. J Am Stat Assoc 90(430):773–795
Article MathSciNet Google Scholar
Kiti MC, Tizzoni M, Kinyanjui TM, Koech DC, Munywoki PK, Meriac M et al (2016) Quantifying social contacts in a household setting of rural Kenya using wearable proximity sensors. EPJ Data Sci 5(1):1–21
Article Google Scholar
Klugkist I, Hoijtink H (2007) The Bayes factor for inequality and about equality constrained models. Comput Stat Data Anal 51(12):6367–6379
Article MathSciNet Google Scholar
Klugkist I, Laudy O, Hoijtink H (2005) Inequality constrained analysis of variance: a Bayesian approach. Psychol Methods 10(4):477
Article Google Scholar
Laird N, Olivier D (1981) Covariance analysis of censored survival data using log-linear analysis techniques. J Am Stat Assoc 76(374):231–240
Article MathSciNet Google Scholar
Leenders RTA, Contractor NS, DeChurch LA (2016) Once upon a time: understanding team processes as relational event networks. Organ Psychol Rev 6(1):92–115
Google Scholar
Lerner J, Lomi A (2020) Reliability of relational event model estimates under sampling: how to fit a relational event model to 360 million dyadic events. Netw Sci 8(1):97–135
Article Google Scholar
Lewandowski D, Kurowicka D, Joe H (2009) Generating random correlation matrices based on vines and extended onion method. J Multivar Anal 100(9):1989–2001
Article MathSciNet Google Scholar
Liang F, Paulo R, Molina G, Clyde MA, Berger JO (2008) Mixtures of g priors for Bayesian variable selection. J Am Stat Assoc 103(481):410–423
Article MathSciNet Google Scholar
Lusher D, Koskinen J, Robins G (2013) Exponential random graph models for social networks: theory, methods, and applications. Cambridge University Press
Google Scholar
McFadden D (1973) Conditional logit analysis of qualitative choice behavior. In: Zarembka P (ed) Frontiers in econometrics. Academic Press, pp 105–142
Google Scholar
McFarland DA (2001) Student resistance: how the formal and informal organization of classrooms facilitate everyday forms of student defiance. Am J Sociol 107(3):612–678
Article Google Scholar
Mulder J (2014) Bayes factors for testing inequality constrained hypotheses: issues with prior specification. Br J Math Stat Psychol 67(1):153–171
Article MathSciNet Google Scholar
Mulder J (2014) Prior adjusted default Bayes factors for testing (in) equality constrained hypotheses. Comput Stat Data Anal 71:448–463
Article MathSciNet Google Scholar
Mulder J, Fox J-P (2019) Bayes factor testing of multiple intraclass correlations. Bayesian Anal 14(2):521–552
Article MathSciNet Google Scholar
Mulder J, Leenders RTA (2019) Modeling the evolution of interaction behavior in social networks: a dynamic relational event approach for real-time analysis. Chaos, Solitons Fractals 119:73–85
Article MathSciNet Google Scholar
Mulder J, Pericchi LR (2018) The matrix-f prior for estimating and testing covariance matrices. Bayesian Anal 13(4):1193–1214
Article MathSciNet Google Scholar
Mulder J, Williams DR, Gu X, Tomarken A, Böing-Messing F, Olsson-Collentine A et al (2021) Bfpack: flexible Bayes factor testing of scientific theories in r. J Stat Softw 100:1–63
Article Google Scholar
O’Hagan A (1995) Fractional Bayes factors for model comparison. J Royal Stat Soc: Ser B (Methodological) 57(1):99–118
MathSciNet Google Scholar
O’Hagan A (1997) Properties of intrinsic and fractional Bayes factors. Test 6(1):101–118
Article MathSciNet Google Scholar
Pérez JM, Berger JO (2002) Expected-posterior prior distributions for model selection. Biometrika 89(3):491–512
Article MathSciNet Google Scholar
Perry PO, Wolfe PJ (2013) Point process modelling for directed interaction networks. J Royal Stat Soc: Ser B (Stat Methodol) 75(5):821–849
Article MathSciNet Google Scholar
R Core Team (2017) R: A language and environment for statistical computing [Computer software manual]. Vienna, Austria. Retrieved from https://www.R-project.org/
Rouder JN, Speckman PL, Sun D, Morey RD, Iverson G (2009) Bayesian t tests for accepting and rejecting the null hypothesis. Psychon Bull Rev 16:225–237
Article Google Scholar
Snijders TA (1996) Stochastic actor-oriented models for network change. J Math Sociol 21(1–2):149–172
Article Google Scholar
Stadtfeld C, Block P (2017) Interactions, actors, and time: dynamic network actor models for relational events. Sociol Sci 4:318–352
Article Google Scholar
Stadtfeld C, Hollway J, Block P (2017) Dynamic network actor models: investigating coordination ties through time. Sociol Methodol 47(1):1–40
Article Google Scholar
Stadtfeld C, Hollway J, Block P (2017) Rejoinder: dynams and the grounds for actor-oriented network event models. Sociol Methodol 47(1):56–67
Article Google Scholar
Stan Development Team (2018) RStan: the R interface to Stan. Retrieved from http://mc-stan.org/ http://mc-stan.org/ (R package version 2.17.3)
Van Der Hofstad R (2009) Random graphs and complex networks. Available on http://www.win.tue.nl/rhofstad/NotesRGCN.pdfhttp://www.win.tue.nl/rhofstad/NotesRGCN.pdf
Vu D, Hunter D, Smyth P, Asuncion AU (2011) Continuous-time regression models for longitudinal networks. In: Advances in neural information processing systems, pp 2492–2500
Vu D, Pattison P, Robins G (2015) Relational event models for social learning in MOOCs. Soc Netw 43:121–135
Article Google Scholar
Vu D, Lomi A, Mascia D, Pallotti F (2017) Relational event models for longitudinal network data with an application to interhospital patient transfers. Stat Med 36(14):2265–2287
Article MathSciNet Google Scholar
Wang Z, Wu Y, Chu H (2018) On equivalence of the lkj distribution and the restricted Wishart distribution. arXiv preprint arXiv:1809.04746
Wetzels R, Grasman RP, Wagenmakers E-J (2012) A default Bayesian hypothesis test for ANOVA designs. Am Stat 66(2):104–111
Article MathSciNet Google Scholar

Download references

Funding

This research was supported by the Netherlands Organization for Scientific Research (NWO) Grant to JM and FV (452-17-006), as well as the ERC Starting Grant “TIMEISNOW” to JM and RL (758791).

Author information

Authors and Affiliations

Department of Methodology and Statistics, Tilburg University, PO Box 90153, 5000 LE, Tilburg, The Netherlands
Fabio Vieira & Joris Mulder
Department of Organization Studies, Tilburg University, Tilburg, The Netherlands
Roger Leenders
Jheronimus Academy of Data Science, ’s-Hertogenbosch, The Netherlands
Roger Leenders
Department of Education, Stanford University, Stanford, CA, USA
Daniel McFarland

Authors

Fabio Vieira
View author publications
You can also search for this author in PubMed Google Scholar
Roger Leenders
View author publications
You can also search for this author in PubMed Google Scholar
Daniel McFarland
View author publications
You can also search for this author in PubMed Google Scholar
Joris Mulder
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabio Vieira.

Ethics declarations

Competing Interests

The corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A Actor-oriented relational event model as a Poisson regression model

The likelihood of the actor-oriented relational event model is a product of a piece-wise constant exponential likelihood and a multinomial likelihood. Where the former corresponds to the sender model and the latter to the receiver model. Both of these models are special cases of the Poisson model.

Sender model as a Poisson regression

Proof

Assuming M events are observed, and the risk set contains N actors,

$$\begin{aligned} P ({{\textbf {E}}}\ |\ \varvec{\gamma } )&= \prod _{m = 1}^{M} P \Big ((s_m, t_m) | \varvec{\gamma }, (e_{1}, \dots , e_{m-1}) \Big ) = \\&= \prod\limits_{{m = 1}}^{M} {\lambda _{{s_{m} }} } \; \times \exp \left\{ { - (t_{m} - t_{{m - 1}} )\sum\limits_{{s \in {\mathcal{R}}}} {\lambda _{s} } } \right\} \\&= \prod _{m = 1}^{M} \exp \left\{{\varvec{\gamma } \varvec{x}'_{s_m} }\right\} \times \exp \left\{ { - \exp \left\{ {\upsilon _{m} } \right\}\sum\limits_{{s \in {\mathcal{R}}}} {\exp } \left\{ {\gamma x^{\prime}_{{s_{m} }} } \right\}} \right\} & {} \big (\upsilon _{m} = \log (t_{m} - t_{m-1})\big )\\&\propto \prod _{m = 1}^{M} \exp \{\upsilon _{m} + \varvec{\gamma } \varvec{x}'_{s_m} \} \times \prod _{s \in {\mathcal {R}}} \exp \{ - \exp \{\upsilon _{m} + \varvec{\gamma } \varvec{x}'_{s_m} \} \} \\&= \prod _{m = 1}^{M} \prod _{s \in {\mathcal {R}}} {\exp \{\upsilon _{m} + \varvec{\gamma } \varvec{x}'_{s_m} \}}^{y_{s}} \times \exp \{ - \exp \{\upsilon _{m} + \varvec{\gamma } \varvec{x}'_{s_m} \} \} \\&= \prod _{m = 1}^{M} \prod _{s \in {\mathcal {R}}} P \Big ( y_{s_m} | \lambda _{s_m}= \exp \{\upsilon _{m} + \varvec{\gamma } \varvec{x}'_{s_m } \} \Big ), \end{aligned}$$

where $y_{s_{m}} = 1$ if actor s is observed and zero otherwise. As a result, the factorial term in the Poisson likelihood will always be equal to one, since $1! = 1$. $\square$

Receiver model as a Poisson regression:

Proof

Once again, assuming M events are observed, and the risk set contains N actors, the likelihood function of the receiver model has the following form

$$\begin{aligned} P({{\textbf {E}}} | \varvec{\beta } )&= \prod _{m=1}^{M} \frac{\lambda _{r_{m} | s_{m}}}{\sum _{r \in {\mathcal {R}}} \lambda _{r | s_{m}}} = \prod _{m=1}^{M} \frac{\exp \{ \varvec{\beta } \varvec{x}'_{s_{m} r_{m}} \}}{\sum _{r \in {\mathcal {R}}} \exp \{ \varvec{\beta } \varvec{x}'_{s_m r} \}}. \end{aligned}$$

Taking the logarithm of $p({{\textbf {E}}} | \varvec{\beta })$ yields

Then, when considering an alternative formulation using a Poisson model, an equivalent likelihood can be written as

$$\begin{aligned} \tilde{\varvec{Y}}_{r_{m} | s_{m}} \sim \text {Poisson} \Big ( \exp \{ \varvec{\beta } \varvec{x}'_{s_{m} r_{m}} + \alpha _{m} \} \Big ), \end{aligned}$$

where $\alpha _{m}$ is an event-specific intercept and ${\tilde{Y}}_{r | s } = 1$ if the dyad (s, r) was observed and zero otherwise. The likelihood of the Poisson model can then be written

$$\begin{aligned} P(\tilde{\varvec{Y}}_{r | s} | \varvec{\beta }, \varvec{\alpha })&= \prod _{m = 1}^{M} \prod _{r \in {\mathcal {R}}} {\Big ( \exp \{ \varvec{\beta } \varvec{x}'_{s_{m} r_{m}} + \alpha _{m}\} \Big )}^{{\tilde{y}}_{r_m | s_m}} \times \\&\times \exp \{ - \exp \{ \varvec{\beta } \varvec{x}'_{s_{m} r} + \alpha _{m}\} \} \times {{\tilde{y}}_{r_m | s_m}}^{-1}, \end{aligned}$$

where ${\tilde{y}}_{r_m | s_m} = 1$. To write the likelihood as a function of $\varvec{\beta }$, we need to derive the maximum likelihood estimate of $\alpha _m$. Take the logarithm and deriving with respect to $\alpha _m$ yields the following MLE expression

$$\begin{aligned} {\hat{\alpha }}_{m} = - \log \Big ( \sum _{r \in {\mathcal {R}}} \exp \{ \varvec{\beta } \varvec{x}'_{s_m r} \} \Big ). \end{aligned}$$

Plug in the MLE into the likelihood

$$\begin{aligned} , \end{aligned}$$

which is equal to $(\star )$. $\square$

B Linear transformation of multivariate normal distribution

Here, we show that the transformation of the random-effects parameters is valid. Let $\varvec{X}$ be a p-dimensional random variable, with $\varvec{X} \sim {\mathcal {N}}(\varvec{\mu }, \varvec{\Sigma })$, where $\varvec{\mu } \in \mathrm{I\!R}^p$ is the mean vector and $\varvec{\Sigma }$ is a $p \times p$ covariance matrix. Thus, the moment generating function of $\varvec{X}$ is given by

$$\begin{aligned} M_{\varvec{X}}(\varvec{t}) = \exp \Big \{ \varvec{t}' \varvec{\mu } + \frac{1}{2} \varvec{t}' \varvec{\Sigma } \varvec{t} \Big \}. \end{aligned}$$

Now, assuming we can write $\varvec{X} = \varvec{\mu } + \varvec{A} \varvec{Z}$, where $\varvec{A}$ is a $p \times p$ matrix, with $\varvec{\Sigma } = \varvec{A}' \varvec{A}$, and $\varvec{Z} = (Z_1, Z_2, \dots , Z_p)$ is an independent normal random vector, with $Z_{i} \sim {\mathcal {N}}(0, 1),\ \text {for}\ i = 1, \dots , p$. Therefore, if we can show that $M_{\varvec{X}}(\varvec{t}) = M_{(\varvec{\mu } + \varvec{A} \varvec{Z})}(\varvec{t})$, then the transformation holds. So, we derive the moment generating function of the transformation as

Proof

$$\begin{aligned} M_{(\varvec{\mu } + \varvec{A} \varvec{Z})}(\varvec{t})&= \text {E}(e^{\varvec{t}' \varvec{X}}) = \text {E}(e^{\varvec{t}' (\varvec{\mu } + \varvec{A}\varvec{Z})})\\&= e^{\varvec{t}' \varvec{\mu }}\ \text {E}(e^{\varvec{l}' \varvec{Z}}),\ \text {where}\ \varvec{l}' = \varvec{t}' \varvec{A} \\&= e^{\varvec{t}' \varvec{\mu }}\ \text {E}\Big (e^{\sum _{i = 1}^{p} l_{i} Z_{i}}\Big )\\&= e^{\varvec{t}' \varvec{\mu }}\ \prod _{i = 1}^p \text {E}(e^{l_{i} Z_{i}})\\&= e^{\varvec{t}' \varvec{\mu }}\ \prod _{i = 1}^p e^{l^2_{i}/2} \\&= \exp \Big \{ \varvec{t}' \varvec{\mu }\ + \sum _{i = 1}^{p} \frac{l^2_{i}}{2} \Big \}\\ \end{aligned}$$

$\square$

Thus, $M_{\varvec{X}}(\varvec{t}) = M_{(\varvec{\mu } + \varvec{A} \varvec{Z})}(\varvec{t})$. Hence, we can transform the random-effects parameters in an MCMC algorithm by following the steps,

1.
Obtain posterior samples of mean vector $\varvec{\mu } \in \mathrm{I\!R}^p$;
2.
Obtain posterior samples of the $p \times p$ covariance matrix $\varvec{\Sigma }$;
3.
Compute the Cholesky factor $\varvec{A}$ of $\varvec{\Sigma }$;
4.
Sample a p-dimensional vector of independent and identically distributed standard normal variables $\varvec{Z}$;
5.
Compute the transformation $\varvec{\beta } = \varvec{\mu } + \varvec{A} \varvec{Z}$.

C Bayes factor of constrained against unconstrained hypotheses

Let us suppose that we wish to test an informative hypothesis $\text {H}_{i}: \varvec{\theta } \in \varvec{\Theta }_{i}$, where $\varvec{\Theta }_{i}$ can be seen as a truncation of the space of $\varvec{\theta } \in \varvec{\Theta }$, with $\varvec{\Theta }_{i} \subset \varvec{\Theta }$. Thus, if we define the prior in the space $\varvec{\Theta }_{i}$ as a truncation of the prior in $\varvec{\Theta }$, such as $\pi _{i}(\varvec{\theta }) = c^{-1} \pi (\varvec{\theta })\ \mathbbm {I} \{\varvec{\theta } \in \varvec{\Theta }_{i}\}$, where c is a normalizing constant of the form $c = \int _{\varvec{\theta } \in \varvec{\Theta }_{i}} \pi (\varvec{\theta }) d\varvec{\theta }$ and $\pi (\varvec{\theta })$ is the prior under the unconstrained space. Then, the Bayes factor of $\text {H}_{i}$ against $\text {H}_{u}$ will be given by

$$\begin{aligned} \begin{aligned} \text {BF}_{iu}&= \frac{m({{\textbf {E}}} | \text {H}_{i})}{m({{\textbf {E}}} | \text {H}_{u})} = \frac{\int _{\varvec{\theta } \in \varvec{\Theta }_{i}} \pi _{i}(\varvec{\theta }) \pi ({{\textbf {E}}} | \varvec{\theta }) d\varvec{\theta }}{\int \pi (\varvec{\theta }) \pi ({{\textbf {E}}} | \varvec{\theta }) d\varvec{\theta }} = c^{-1} \int _{\varvec{\theta } \in \varvec{\Theta }_{i}} \frac{\pi (\varvec{\theta }) \pi ({{\textbf {E}}} | \varvec{\theta }) d\varvec{\theta }}{\int \pi (\varvec{\theta }) \pi ({{\textbf {E}}} | \varvec{\theta }) d\varvec{\theta }} =\\&= c^{-1} \int _{\varvec{\theta } \in \varvec{\Theta }_{i}} \pi (\varvec{\theta } | {{\textbf {E}}}) d\varvec{\theta } = \frac{\int _{\varvec{\theta } \in \varvec{\Theta }_{i}} \pi (\varvec{\theta } | {{\textbf {E}}}) d\varvec{\theta }}{\int _{\varvec{\theta } \in \varvec{\Theta }_{i}} \pi (\varvec{\theta }) d\varvec{\theta }} = \frac{P(\varvec{\theta } \in \varvec{\Theta }_{i} | {{\textbf {E}}})}{P(\varvec{\theta } \in \varvec{\Theta }_{i})}. \end{aligned} \end{aligned}$$

Therefore, the Bayes factor of a constrained against an unconstrained hypothesis can be reduced to a ratio of posterior and prior probabilities in the space of the constrained hypothesis. In case of an exact hypothesis, such as $\text {H}: \varvec{\theta } = \varvec{0}$, the ratio of probabilities becomes the Savage–Dickey density ratio (Dickey 1971).

D Bayes factor for testing random-effect structures

In this appendix, we show details on the derivation of the parameters in the Bayes factor for testing random-effect structures. Assuming we have K social networks and each of them has its own $\beta _{k},\ \text {for}\ i = 1, \dots , K$, regression parameter. Where $\beta _{k} \sim {\mathcal {N}}(\mu , \sigma ^2)$. Thus, let ${\bar{\mu }}$, ${\bar{\sigma }}^2$, ${\bar{\beta }}_k$ be posterior estimates for $\mu$, $\sigma ^2$ and $\beta _k,\ \forall \ k$. Also, let ${\bar{\tau }}^2_k$ be the point estimate for the variance of the posterior distribution of $\beta _k$. Then, we can write

$$\begin{aligned} \begin{aligned} \text {posterior:}\ {}&\beta _{k}| \text {E}_{k} \sim {\mathcal {N}}({\bar{\beta }}_{k}, {\bar{\tau }}^2_{k})\\ \text {prior:}\ {}&\beta _{k} \sim {\mathcal {N}}({\bar{\mu }}, {\bar{\sigma }}^2). \end{aligned} \end{aligned}$$

(7)

Assuming we want to test $\text {H}_0: \beta _1 = \dots = \beta _K$ against $\text {H}_1: \text {"at least one is different"}$. Thus we can do so by writing $\xi _{j} = \beta _{j+1} - \beta _{j}$, for $j = 1, \dots , K-1$. Therefore, we can approximate the joint distribution of $\varvec{\xi } = (\xi _1, \dots , \xi _{K-1})$ by a multivariate normal

$$\begin{aligned} \begin{aligned} \text {posterior:}\ {}&\varvec{\xi }| \varvec{\text {E}} \sim {\mathcal {N}}(\bar{\varvec{\mu }}_{_\xi },\ \bar{\varvec{\Sigma }}_{_\xi })\\ \text {prior:}\ {}&\varvec{\xi } \sim {\mathcal {N}}(\varvec{0},\ \bar{\varvec{\Lambda }}_{_\xi }). \end{aligned} \end{aligned}$$

Thus, we can derive the parameters of the distributions of every component $j = 1, \dots , K-1$ of $\varvec{\xi }$ as follows

$$\begin{aligned} \text {E}(\xi _{j} | {{\textbf {E}}})=\, & {} \text {E}(\beta _{j+1} - \beta _{j})\\=\, & {} \text {E}(\beta _{j+1}) - \text {E}(\beta _{j}) = {\bar{\beta }}_{j+1} - {\bar{\beta }}_{j},\\ \text {Var}(\xi _{j}| {{\textbf {E}}})=\, & {} \text {Var}(\beta _{j+1} - \beta _{j})\\=\, & {} \text {Var}(\beta _{j+1}) + \text {Var}(\beta _{j}) =\, {\bar{\tau }}^2_{j+1} + {\bar{\tau }}^2_{j},\\ \text {Cov}(\xi _{j}, \xi _{j+h} | {{\textbf {E}}})=\, & {} \text {E}(\xi _{j} \xi _{j+h}) - \text {E}(\xi _{j}) \text {E}(\xi _{j+h})\\=\, & {} \text {E}\Big ((\beta _{j+1} - \beta _{j}) (\beta _{j+h+1} - \beta _{j+h})\Big ) - \text {E}\Big ((\beta _{j+1} - \beta _{j})\Big ) \text {E}\Big ((\beta _{j+h+1} - \beta _{j+h})\Big )\\=\, & {} \text {E}(\beta _{j+1} \beta _{j+h+1}) - \text {E}(\beta _{j+1} \beta _{j+h}) - \text {E}(\beta _{j} \beta _{j+h+1}) + \text {E}(\beta _{j} \beta _{j+h}) - \\{} & {} - \text {E}(\beta _{j+1}) \text {E}(\beta _{j+h+1}) + \text {E}(\beta _{j+1}) \text {E}(\beta _{j+h}) + \text {E}(\beta _{j}) \text {E}(\beta _{j+h+1}) - \text {E}(\beta _{j}) \text {E}(\beta _{j+h})\\= & {} {\left\{ \begin{array}{ll} - \text {Var}(\beta _{j+1}) =\, - {\bar{\tau }}^2_{j+1},&{} \text {if } h =\, 1\\ \ \ 0, &{}\text {if } h > 1. \end{array}\right. } \end{aligned}$$

The parameters in the prior are derived in the same way with $\text {E}(\xi _{j}) = 0$, $\text {Var}(\xi _{j}) = 2 {\bar{\sigma }}^2$ and $\text {Cov}(\xi _{j}, \xi _{j+h}) = - {\bar{\sigma }}^2$, if $h = 1$, and zero otherwise.

E Point-wise deviance residuals

After estimation, it is common practice to check how the proposed model fits the data under study. In DuBois et al. (2013), they used the posterior draws to evaluate the log likelihood at every data point to determine the dynamic adequacy of the model to every data point. Thus, for the actor-oriented relational event model, for $k = 1, \dots , K$ and $m = 1, \dots , M_{k}$, this quantity would be computed as

$$\begin{aligned} \begin{aligned} \text {Res}_m = -2&\Bigg \{ \log \Big (\lambda _{s_m}\Big (t_m| {{\textbf {E}}}_{k}\Big )\Big ) + \log \Big (\lambda _{r_m|s_m}\Big (t_m | s_m, {{\textbf {E}}}_{k}\Big )\Big ) - \\&- \log \Bigg (\sum _{s,r} \lambda _{r|s_m}\Big (t_m| s_m, {{\textbf {E}}}_{k}\Big )\Bigg ) - (t_m - t_{m - 1}) \Big (\sum _{i} \lambda _{s}(t_m| {{\textbf {E}}}_{k}\Big )\Big ) \Bigg \}. \end{aligned} \end{aligned}$$

(8)

This measure is usually called residual deviance (Collett 2015), and it is useful to compare models and see which one fits better each data point by having smaller values of $\text {Res}_m$.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Vieira, F., Leenders, R., McFarland, D. et al. A Bayesian actor-oriented multilevel relational event model with hypothesis testing procedures. Behaviormetrika 51, 37–74 (2024). https://doi.org/10.1007/s41237-023-00203-4

Download citation

Received: 02 February 2023
Accepted: 21 June 2023
Published: 17 July 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s41237-023-00203-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Bayesian actor-oriented multilevel relational event model with hypothesis testing procedures

Abstract

Similar content being viewed by others

Discovering trends of social interaction behavior over time: An introduction to relational event modeling

Nodal Heterogeneity can Induce Ghost Triadic Effects in Relational Event Models

Social Networks and Causal Inference

1 Introduction

2 The relational event framework

2.1 The actor-oriented multilevel relational event model

2.2 Prior specification

2.3 Reparameterization to improve Bayesian computation

Lemma 1

3 Bayesian hypothesis testing under the actor-oriented multilevel relational event model

3.1 Test I: testing for homogeneous social interaction behavior across sequences

3.2 Test II: testing the degree of heterogeneity of network effects across sequences

3.3 Test III: testing common and average network effects over all sequences

4 Estimating and testing classroom dynamics using the multilevel relational event model

4.1 Model specification

4.2 Exploring social interaction dynamics as class time progresses

4.2.1 Testing for homogeneous social interaction behavior across school classes

4.2.2 Evaluating the shrinkage behavior of random effects

4.2.3 Evolution of classroom behavior

4.3 Mixed-effects analysis on the full sequences

4.3.1 Testing the degree of heterogeneity of network effects across school classes

4.3.2 Testing the impact of nodal characteristics on classroom dynamics

4.3.3 Testing the impact of network statistics on classroom dynamics

5 Discussion

Data availability

Change history

22 December 2023

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing Interests

Additional information

Publisher's Note

Appendices

A Actor-oriented relational event model as a Poisson regression model

Proof

Proof

B Linear transformation of multivariate normal distribution

Proof

C Bayes factor of constrained against unconstrained hypotheses

D Bayes factor for testing random-effect structures

E Point-wise deviance residuals

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation