Power analysis of longitudinal studies with piecewise linear growth and attrition

Moerbeek, Mirjam

doi:10.3758/s13428-022-01791-x

Power analysis of longitudinal studies with piecewise linear growth and attrition

Open access
Published: 07 February 2022

Volume 54, pages 2939–2948, (2022)
Cite this article

Download PDF

You have full access to this open access article

Behavior Research Methods Aims and scope Submit manuscript

Power analysis of longitudinal studies with piecewise linear growth and attrition

Download PDF

Mirjam Moerbeek ORCID: orcid.org/0000-0001-5537-1237¹

3013 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

In longitudinal research, the development of some outcome variable(s) over time (or age) is studied. Such relations are not necessarily smooth, and piecewise growth models may be used to account for differential growth rates before and after a turning point in time. Such models have been well developed, but the literature on power analysis for these models is scarce. This study investigates the power needed to detect differential growth for linear–linear piecewise growth models in further detail while taking into account the possibility of attrition. Attrition is modeled using the Weibull survival function, which allows for increasing, decreasing or constant attrition across time. Furthermore, this work takes into account the realistic situation where subjects do not necessarily have the same turning point. A multilevel mixed model is used to model the relation between time and outcome, and to derive the relation between sample size and power. The required sample size to achieve a desired power is smallest when the turning points are located halfway through the study and when all subjects have the same turning point. Attrition has a diminishing effect on power, especially when the probability of attrition is largest at the beginning of the study. An example on alcohol use during middle and high school shows how to perform a power analysis. The methodology has been implemented in a Shiny app to facilitate power calculations for future studies.

Assessing mediational processes using piecewise linear growth curve models with individual measurement occasions

Article 09 September 2022

Bayesian Methods and Model Selection for Latent Growth Curve Models with Missing Data

A Review of Time Scale Fundamentals in the g-Formula and Insidious Selection Bias

Article 15 June 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

In the social and behavioral sciences, subjects are often measured at multiple points across time or age in order to study changes in abilities, behavior, opinion, attitude and so forth. Data that arise from such longitudinal studies are often analyzed by means of a multilevel model (Goldstein, 2011; Hox et al., 2018; Raudenbush & Bryk, 2002) or a latent growth curve model (Duncan et al., 2006).

Most often, smooth growth trajectories, such as those modeled by linear, polynomial and nonlinear (e.g., exponential) relations between time (or age) and response, are fitted. However, some longitudinal studies may show non-smooth patterns of change. Growth may show a sharp change after the occurrence of some important life event, such as first criminal offense, entry into parenthood, retirement, or death of spouse. One example is a study on the change in alcohol use across middle and high school (Li et al., 2001). The authors found a higher linear growth rate in high school as compared to middle school. Discontinuities in growth may also be observed in experimental studies where the beginning or end of an intervention is the turning point. An example is a study on the change in bulimia severity, depression and self-concept of female patients during and after treatment with guided self-change treatment or cognitive behavioral therapy. Cognitive behavioral therapy showed greater improvement during therapy, while guided self-change treatment showed more continued improvement post-treatment. Data obtained from such studies are known as interrupted time-series data and can be analyzed by means of multilevel or latent growth curve models (Duncan et al., 2004; Flora, 2008; Grimm & Marcoulides, 2016; Harring et al., 2021; Muggeo et al., 2014) using one or more turning points to distinguish different phases across time and by specifying differential growth rates across these phases.

Longitudinal research often requires considerable effort, money and time from both the researchers and the participants. It is therefore important that a longitudinal study is designed carefully. Among other components, the number of subjects, number of measurements per subject and the study duration must be decided upon, and it has to be determined whether sufficient statistical power can be achieved. Over the past three decades, a few dozen papers on the design of longitudinal studies have appeared (e.g., De Jong et al., 2010; Fan, 2003; Galbraith & Marschner, 2002; Hedeker et al., 1999; Moerbeek, 2008, 2011; Raudenbush & Liu, 2001; Zhang & Wang, 2009) The focus of these papers is on smooth growth trajectories with linear or polynomial growth. This implies that these methods cannot be used for piecewise growth models, as a power analysis for a certain model cannot be done on the basis of another model. Moreover, design questions for a piecewise growth model are different from those of a polynomial growth model. For instance, power may be determined not only by the number of measurements per subject, but also by the number of measurements per phase. In addition, the model parameter for which a power analysis is to be done depends on what type of model is used. In longitudinal studies with smooth growth trajectories, it is the growth rate across time and/or the interaction of the growth rate with another variable, such as treatment condition. In piecewise growth models it is the change in growth from one phase to the next.

Sample size guidelines for piecewise growth models are scarce; to the author’s knowledge there exist only two relevant papers. Diallo and Morin (2015) conducted a simulation study with 6, 8 and 10 measurements and a turning point at time point 2, 3 or 4. They showed that power increases with increasing sample size, number of measurements, the difference between the two slopes and the correlation between the two slopes. Larger power was observed when the turning point was at the third or fourth time point than at the second time point. Power decreased when the variance of the second slope increased. Segalas et al. (2019) also conducted a simulation study; they used study duration of 21. Each subject was allowed to have their own turning point, and attrition was assumed to be absent or to occur at a constant rate. Larger power was observed for a larger slope difference, a larger sample size and when attrition was absent. Power was larger when the turning point was located at time point 15 than at time point 10, and when the variability in turning points was lower.

Although these two studies are very useful, they also have their limitations. Diallo and Morin restricted their work to scenarios in which each subject had the same turning point, which may not always be realistic. For instance, the age at which subjects graduate from college or enter parenthood varies across subjects. They also ignored the possibility of attrition, while in longitudinal studies attrition is the rule rather than the exception. Segalas and coauthors did take into account the variability in turning points and the possibility of attrition. However, they assumed constant attrition rates across time, while attrition rates may very well vary across time. Furthermore, both papers based their power calculations on simulations, which may be time-consuming and require specific software (in their case SAS and Mplus, which are not free of charge).

The aim of this contribution is to further investigate the power to detect a difference in growth rates in piecewise linear–linear growth models. The study investigates how power is influenced by the location of turning points; in other words, is the highest power achieved when most subjects have a turning point at the beginning, halfway or at the end of the study? Furthermore, it investigates whether higher power is achieved when all subjects have the same turning point or when they have different turning points. In addition, the loss in efficiency due to attrition is studied. Attrition is modeled using the Weibull survival function, which allows for increasing, decreasing and constant attrition rates during the course of the study. The methodology has been implemented in a Shiny app to facilitate power analysis for future studies.

The remainder of this paper is organized as follows. In the next section the multilevel mixed model for piecewise growth is presented. In the following section it is shown how power to detect differential growth is calculated. This section also introduces the Shiny app. The section thereafter shows how power is influenced by the location of and variability in turning points in studies without attrition. The following section quantifies the loss in efficiency due to attrition. The final section presents conclusions and a discussion, with directions for future research.

Multilevel mixed model

Repeated measurements across time are nested within subjects; hence the data have a multilevel structure and can be analyzed using the multilevel mixed model (Goldstein, 2011; Hox et al., 2018; Snijders & Bosker, 2012), which is also known as the hierarchical (linear) model (Raudenbush & Bryk, 2002). An alternative model is the latent growth curve model within the general framework of structural equation models (Duncan et al., 2006; Flora, 2008).

The duration of the study is denoted as D. The aim is to measure each subject i = 1, …, n at equidistant time points t = 0, 1, 2, …, D. As a measurement is also taken at baseline (t = 0), the aim is to measure each subject at m = D + 1 time points. However, subjects may prematurely drop out of the study, meaning that the number of measurements may vary across subjects. The number of measurements for subject i is denoted as m_i.

The study is split in two different time phases, with phase 1 beginning at time point t = 0 and phase 2 beginning at time point t = T_i. The latter time point is the turning point, which may be either constant or varying across subjects. When all subjects have the same turning point (i.e., T_i = T ∀ i), the turning point cannot be located at t = 0 or t = D because that would mean the study has only one phase.

In both phases a linear relation between time and response is assumed. The multilevel mixed model for subject i at time point t is then given by

$${y}_{ti}={\pi}_{0i}+{\pi}_{1i}{t}_{1 ti}+{\pi}_{2i}{t}_{2 ti}+{e}_{ti}.$$

(1)

The variables t_1ti and t_2ti are time indicators for the first and second phase of the study for subject i. They are coded as follows:

$${\displaystyle \begin{array}{c}{t}_{1 ti}=t\kern0.5em \mathrm{if}\ t\le {T}_i\kern0.5em \mathrm{and}\ {t}_{1 ti}={T}_i\kern0.75em \mathrm{if}\kern0.5em t>{T}_i,\\ {}{t}_{2 ti}=0\kern0.5em \mathrm{if}\kern0.5em t\le {T}_i\kern0.5em \mathrm{and}\kern0.5em {t}_{2 ti}=t-{T}_i\kern0.5em \mathrm{if}\kern0.5em t>{T}_i\end{array}}$$

(2)

Consider as an example subject i with m_i = 7 time points and a turning point T_i = 3. The design matrix for this subject is given by

$${\boldsymbol{X}}_i=\left(\begin{array}{ccc}1& 0& 0\\ {}1& 1& 0\\ {}1& 2& 0\\ {}1& 3& 0\\ {}1& 3& 1\\ {}1& 3& 2\\ {}1& 3& 3\end{array}\right).$$

(3)

The associated regression weights π_0i, π_1i and π_2i are the baseline score and growth rates in phase 1 and 2, respectively. Each of them is assumed to randomly vary across subjects:

$${\displaystyle \begin{array}{c}{\pi}_{0i}={\beta}_0+{u}_{0i},\\ {}\begin{array}{c}{\pi}_{1i}={\beta}_1+{u}_{1i},\\ {}{\pi}_{2i}={\beta}_2+{u}_{2i}.\end{array}\end{array}}$$

(4)

Here, the regression weights β₀, β₁ and β₂ are the average intercept and growth rates, and the random variables u_0i, u_1i and u_2i are the deviations of subject i from these averages. As each of the three regression coefficients π_0i, π_1i and π_2i has a random effect, the design matrix Z_i for the random part is equal to the design matrix X_i for the fixed part.

The random variables u_0i, u_1i and u_2i are assumed to follow a multivariate normal distribution with means equal to zero and covariance matrix

$$\mathit{\operatorname{cov}}\left({\boldsymbol{u}}_i\right)=\mathit{\operatorname{cov}}\left(\begin{array}{c}{u}_{0i}\\ {}{u}_{1i}\\ {}{u}_{2i}\end{array}\right)=\left(\begin{array}{ccc}{\sigma}_{u0}^2& {\sigma}_{01}& {\sigma}_{02}\\ {}{\sigma}_{01}& {\sigma}_{u1}^2& {\sigma}_{12}\\ {}{\sigma}_{02}& {\sigma}_{12}& {\sigma}_{u2}^2\end{array}\right).$$

(5)

These random variables are assumed to be independent from the residuals e_i0, e_i2, …, e_iD. These residuals are assumed to follow a multivariate normal distribution with means equal to zero and covariance matrix ${\sigma}_e^2{\boldsymbol{I}}_i$, where I_i is the (m_i + 1) × (m_i + 1) identity matrix and ${\sigma}_e^2$ is the variance of the residual term e_ti.

The model for subject i can be written in matrix notation:

$${\boldsymbol{y}}_i={\boldsymbol{X}}_i\boldsymbol{\beta} +{\boldsymbol{Z}}_i{\boldsymbol{u}}_i+{\boldsymbol{e}}_i$$

(6)

with y_i the vector of responses, X_i the design matrix for the fixed part, β = (β₀, β₁, β₂)′ the vector of regression weights, Z_i the design matrix for the random part, u_i = (u_0i, u_1i, u_2i)′ the vector of random variables and e_i = (e_i1, e_i2, …, e_i(m + 1))′ the vector of residuals.

Given the covariance matrices for the random effects, the covariance matrix (conditional on the fixed effects) of the responses of subject i is

$$\mathit{\operatorname{cov}}\left({\boldsymbol{y}}_i|\ {\boldsymbol{X}}_i\boldsymbol{\beta} \right)={\boldsymbol{V}}_i={\boldsymbol{Z}}_i\mathit{\operatorname{cov}}\left({\boldsymbol{u}}_i\right){{\boldsymbol{Z}}_i}^{\prime }+{\sigma}_e^2{\boldsymbol{I}}_i.$$

(7)

Once the variances and covariances in cov(u_i) and the variance ${\sigma}_e^2$ have been estimated, they can be plugged into the equation above to get ${\hat{\boldsymbol{V}}}_i$. The vector of regression coefficient is then estimated by

$$\hat{\boldsymbol{\beta}}={\left(\sum \nolimits_{i=1}^n{\boldsymbol{X}}_i^{\prime} {\left({\hat{\boldsymbol{V}}}_i\right)}^{-1}{\boldsymbol{X}}_i\right)}^{-1}\sum \nolimits_{i=1}^n{\boldsymbol{X}}_i^{\prime}{\left({\hat{\boldsymbol{V}}}_i\right)}^{-1}{\boldsymbol{y}}_i.$$

(8)

This is the maximum likelihood estimator of fixed effects of the linear mixed effects model in equation (6). The associated covariance matrix is estimated as

$$\hat{\mathit{\operatorname{cov}}}\left(\hat{\boldsymbol{\beta}}\right)=\left(\begin{array}{ccc}\hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_0\right)& \hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_0,{\hat{\beta}}_1\right)& \hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_0,{\hat{\beta}}_2\right)\\ {}\hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_0,{\hat{\beta}}_1\right)& \hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_1\right)& \hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_1,{\hat{\beta}}_2\right)\\ {}\hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_0,{\hat{\beta}}_2\right)& \hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_1,{\hat{\beta}}_2\right)& \hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_2\right)\end{array}\right)={\left(\sum \nolimits_{i=1}^n{\boldsymbol{X}}_i^{\prime} {\left({\hat{\boldsymbol{V}}}_i\right)}^{-1}{\boldsymbol{X}}_i\right)}^{-1}.$$

(9)

The variances $\hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_1\right)$ and $\hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_2\right)$ and covariance $\hat{\mathit{\operatorname{cov}}}\left({\hat{\beta}}_1,{\hat{\beta}}_2\right)$ are used to study the relation between sample size and power to detect a difference in growth rates across the two phases.

Statistical power to detect differential growth

The main question is whether the growth rates in the two phases are equal to one another. The corresponding null hypothesis is H₀ : β₁ = β₂, which can also be formulated as H₀ : β₁ − β₂ = 0. This difference is estimated by plugging in the estimates of the regression coefficients, and the associated variance is estimated as $\hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_1-{\hat{\beta}}_2\right)=\hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_1\right)+\hat{\mathit{\operatorname{var}}}\left({\hat{\beta}}_2\right)-\hat{2\mathit{\operatorname{cov}}}\left({\hat{\beta}}_1,{\hat{\beta}}_2\right)$. The variances and covariance at the right side of this equation follow from the covariance matrix (9).

If there indeed exists a difference in growth rates across the two phases in the population, then one would like to detect it with sufficient statistical power. The relation between the difference in growth rates β₁ − β₂, the variance var(β₁ − β₂), statistical power 1 − β, and type I error rate α is given by

$$\frac{\beta_1-{\beta}_2}{\sqrt{\mathit{\operatorname{var}}\left({\beta}_1-{\beta}_2\right)}}={z}_{1-\alpha }+{z}_{1-\beta }.$$

(10)

This relation holds for a one-sided alternative H₁ : β₁ − β₂ > 0 or H₀ : β₁ − β₂ < 0; for two-sided alternative H₀ : β₁ − β₂ ≠ 0, z_1 − α is replaced by z_{1 − α/2}.

In the design phase of a study, the difference in means is often not known. This causes a vicious cycle: the study is to be conducted to gain insight into the difference in growth rates, but to design the study such that it has sufficient power, the population value of the difference in growth rates needs to be known in advance. To escape the vicious cycle, one can consult the literature for similar studies in the past to gain insight into plausible values for the difference in growth rates.

The variance var(β₁ − β₂) depends on the number of subjects, the number of measurements per subject and the location of the turning point in the case where all subjects have the same turning point. In the case of varying turning points, the variance depends on the distribution of turning points. Furthermore, this variance also depends on the rate of attrition across the study. In addition, it is a function of the variance and covariance components in Eq. (5) and the variance ${\sigma}_e^2$. The expression for the variance var(β₁ − β₂) cannot be captured by a simple mathematical expression. For that reason, matrix algebra should be used to calculate the value of the variance for each specific study at hand. The online 14 shows the results of a small simulation study. The power as calculated using matrix algebra is almost the same as that obtained from simulation.

Shiny app

To facilitate the use of the methodology presented herein, a Shiny app was developed to study the relation between number of subjects and power. First, the user has to specify the duration of the study and the distribution of turning points: the proportion of subjects that have a turning point at each of the time points t = 0, 1, …, D. Second, the values of all variance and covariance components have to be specified, along with the residual variance. Third, the population values of the growth rates in both phases have to be specified, along with the type I error rate and whether a one- or two-sided alternative is used. Finally, the parameters ω and γ of the Weibul attrition function have to be specified (see later in this contribution for an explanation of these parameters). Once all parameters have been specified, the app shows the relation between number of subjects and power for the growth rates in phases 1 and 2 and for the difference in growth rates. Power levels are shown for the user-selected degree of attrition and for zero attrition. By hovering over a graph, the power level for a selected number of subjects is displayed. The app can be found online at https://utrecht-university.shinyapps.io/Power_Piecewise_Growth/

Designing studies without attrition

The aim of this section is to study how the distribution of turning points affects the sample size to achieve a power 1 − β = 0.8 to detect a slope difference in turning points in a two-sided test with type I error rate α = 0.05 in a study without attrition.

Parameter values

Statistical power depends on the values of the model parameters. The population values of the regression coefficients, variance and covariance components are taken from Diallo and Morin (2015); a rationale for these values can be found in that paper.

The average response at t = 0 is β₀ = 1. The mean growth rate in the first phase is β₁ = 0.16, while the average growth rate in the second phase is β₂ = 0.11, 0.0 or 0.55. Given these values, the difference in growth rates is β₁ − β₂ = 0.05, β₁ − β₂ = 0.16 or β₁ − β₂ = 0.39, respectively. The mean growth curves in the first and second phase of the study are presented in Fig. 1.

The variance component for the random intercept is var(u_0j) = 0.2, the variance component for the growth rate in the first phase is var(u_1j) = 0.1, and the variance component for the growth rate in the second phase is var(u_2j) = 0.16. The correlation between the random intercept and phase 1 slope is cor(u_0j, u_1j) = 0.1, the correlation between the random intercept and phase 2 slope is cor(u_0j, u_2j) = 0, and the correlation between the two random slopes is cor(u_1j, u_2j) = 0. Finally, the residual variance is var(e_ij) = 0.2 and does not vary across the time points.

Distribution of turning points

Figure 2 gives the nine distributions of turning points T_i that will be used in this section and the next. The bars in this figure show the proportion of subjects that have a turning point at time points t = 0, 1, …, 12 in a study with duration D = 12. In the top row the turning points are located at the beginning of the study, with a mean μ_T = 3; in the middle row they are located halfway through the course of the study (μ_T = 6) and in the bottom row at the end of the study (μ_T = 9). In the left column all subjects have the same turning point, meaning the variance in turning points ${\sigma}_T^2$ is zero. In the middle column there is a small variance in turning points (${\sigma}_T^2=1.33$), while in the right column there is large variance (${\sigma}_T^2=2.22$)

Results

Table 1 shows the required sample size to achieve a power 1 − β = 0.8 to detect a difference between the two slopes as a function of the distribution of turning points and for three differences between the two slopes (two-sided test with α = 0.05). As is obvious, smaller sample sizes are needed for larger slope differences. Furthermore, a larger sample size is needed when there is a larger variability in turning points. Finally, the location of turning point(s) has an effect on sample size. For any difference between turning points and for any variability in turning points, the smallest sample size is needed if the mean turning point is located halfway through the study (μ_T = 6). In the case where all subjects have the same turning point (zero ${\sigma}_T^2$), the required sample size for μ_T = 3 is equal to that for μ_T = 9. In the case where subjects have different turning points, the required sample size for μ_T = 3 is smaller than that for μ_T = 9.

Table 1 Sample size to achieve a power level 1 − β = 0.8 for the test on differential slopes

Full size table

The latter finding is further illustrated in Fig. 3, which shows the required sample size to detect a slope difference β₁ − β₂ = 0.05 as a function of the mean turning point and variability in turning points. The curve for the case of zero variability is symmetric around μ_T = 6, while the other two curves are not. In the case of between-subject variation in turning points, a larger sample size is needed for a late turning point than for an early turning point (having the same time difference from μ_T = 6).

Designing studies with attrition

Attrition implies that subjects drop out during the course of a study. As a result, a larger sample size is needed to achieve a desired power level as compared to a study that is not hampered by attrition. This section investigates the increase in the required sample size due to attrition based on the Weibull survival function.

Weibull survival function

It is assumed that the underlying attrition process is continuous, meaning subjects may drop out at any time during the course of the study. Furthermore, it is assumed that attrition depends on the study time elapsed, but not on the number of measurements that are planned to be taken on each subject during the course of the study. The survival function gives the probability of staying in the study up to at least time point t: S(t) = P(τ > t), where τ is a continuous random variable measuring the elapsed study time. There exist many survival functions; in this paper the Weibull survival function is used. This is a flexible survival function in the sense that it allows for increasing, decreasing or constant attrition rates over time. The survival function is S(t) = exp(−λt^γ). For the sake of convenience, time is rescaled by dividing by the study duration D, so that t₁ = 0 is baseline and t_m = 1 is the last measurement. Furthermore, the parameter λ is replaced by − log(1 − ω), where ω ∈ [0, 1] is the proportion of subjects who drop out during the course of the study. The Weibull survival function is then formulated as $S(t)={\left(1-\omega \right)}^{t^{\gamma }}$. The parameter γ ∈ [0, ∞] determines the shape of the survival function. For γ < 1, the attrition rate decreases during the course of the study, meaning that attrition is concentrated at the beginning of the study. The opposite is the case for γ > 1, where the attrition rate increases during the course of the study, meaning that attrition is concentrated at the end of the study. A constant attrition rate is observed when γ = 1. Figure 4 shows survival functions for ω = 0.2, 0.5, 0.8 and for $\gamma =\frac{1}{2},1,2$.

To calculate the effect of attrition on the power to detect a difference in growth rates, the vector N = (n₁, n₂, …, n_m)′, with n_j the number of subjects having j time points, needs to be known beforehand. However, this vector is random, with associated probability vector p = (p₁, p₂, …, p_m)′, where p_j is the probability of having exactly j measurements. For each possible vector N, the variance in the estimator of the difference in growth rates across the two phases can be calculated. The expected variance is then the weighted variance across all possible vectors N, with the weights equal to the probability of each vector. This procedure becomes difficult to apply in studies where the number of time points, and hence the number of vectors N, is large. It is then useful to approximate the variance in the estimator of the difference in growth rates using a sampling procedure (Verbeke & Lesaffre, 1999). The vector N is sampled a large number of times using probability vector p, and for each draw the variance in the estimator of the difference in growth rates is calculated. The mean of these variances across all draws is then used to calculate the effect of attrition. A good approximation is made when the number of draws is large, which makes this procedure time-consuming. For that reason, a further approximation is made in this contribution. The vector N is replaced by its expectation E(K) = n × p. This procedure produces results similar to the sampling procedure (Galbraith & Marschner, 2002).

The variance in the treatment effect estimator is calculated based on Eq. (9). The following algorithm has been implemented in the Shiny app. First, given the distribution of turning points and sample size, calculate how many subjects there are for each turning point. Second, for each turning point, calculate the number of subjects with 1, 2, …, D measurement occasions. This number follows from the Weibull survival function with parameters γ and ω. Third, for each combination of turning point and number of measurements, construct the design matrix X_i and covariance matrix of the random effects V_i and calculate ${\boldsymbol{X}}_i^{\prime }{\left({\hat{\boldsymbol{V}}}_i\right)}^{-1}{\boldsymbol{X}}_i$. Fourth, multiply these terms by their associated sample sizes, sum up and take the inverse.

Results

Table 2 shows the percentage increase in the required sample size to detect a difference in growth rates of β₁ − β₂ = 0.05 with a power level 1 − β = 0.8 in a two-sided test at type I error rate α = 0.05 as compared to a study without attrition. In the worst case, sample size needs to be increased by 259%, and in the best case by only 4%. As is obvious, a larger percentage increase is observed when more subjects drop out (i.e., larger ω) and when the risk of dropout is highest at the beginning of the study (i.e., larger τ). Furthermore, the largest percentage increase in sample size is observed when the turning points are located at the end of the study (larger μ_T). This is obvious because, in that case, many subjects may have dropped out before their turning point. The variability in turning points, however, has a minor effect on the percentage increase in sample size.

Table 2 Percentage increase in sample size to achieve a power level 1 − β = 0.8 for the test on differential slopes as compared to a study without attrition

Full size table

Example: alcohol use in middle and high school

Li et al. (2001) demonstrated the use of a piecewise growth model to study how alcohol use develops during middle (grades 6–8) and high school (grades 9–12). They used a mixture model to distinguish pupils with high (N = 57) and low (N = 122) initial status and developed piecewise models for both these groups. All subjects had the same turning point.

Suppose a replication of this study is to be conducted and an a priori sample size calculation is requested by the funding agency. The main research question is whether the growth rates during middle and high school differ from one another. Here it is illustrated how to calculate the required sample size for the high initial status group. Parameter estimates from Li et al. (2001) are used as input for the sample size calculation: ${\hat{\beta}}_0=2.594$, ${\hat{\beta}}_1=0.022$, ${\hat{\beta}}_2=0.255$, ${\hat{\sigma}}_{u0}^2=0.116$, ${\hat{\sigma}}_{u1}^2=0.092$, ${\hat{\sigma}}_{u2}^2=0.054$, ${\hat{\sigma}}_{01}=-0.031$, ${\hat{\sigma}}_{02}=-0.022$ and ${\hat{\sigma}}_{12}=-0.051$. Note that Li et al. (2001) analyzed their data using the latent growth curve model, which allows the residual variance e_ti to vary across the measurement occasions. The methodology for power analysis in this manuscript is based on the multilevel model, which is restricted to equal residual variance across time. For that reason, the mean of the estimates was used: $mean\left({\hat{\sigma}}_e^2\right)=0.23$.

Figure 5 shows the relation between sample size and power in the case where attrition is absent and when attrition is present and modeled by the Weibull survival function with ω = 0.25 and γ = 1 (meaning 25% of the students drop out during the study, and they do so at a constant rate). In the case where attrition is absent, a sample of size N = 60 should be used to detect a difference in growth rates at power 1 − β = 0.8, and type I error rate α = 0.05 in a two-sided test. This sample size is only slightly larger than the actual sample size used by Li et al. (2001). If attrition is present, then the required sample size increases a little further to N = 66.

Conclusions and discussion

This study investigated the power to detect differential growth in linear–linear piecewise growth models. The relation between sample size and power was calculated for the multilevel mixed model. In a study without attrition, the required sample size is smallest when all subjects have the same turning point, which is located halfway through the study. Attrition increases the required sample size, especially when many subjects drop out and most of them do so at the beginning of the study.

A Shiny app was developed to facilitate the performance of a power analysis for future studies. To use the app, a priori estimates of the (co)variance components, residual variance and growth rates in both phases need to be specified. These can be obtained from the literature. In the example on alcohol use in middle and high school, parameter estimates from the literature were used. The required sample size turned out to be only slightly larger than that actually used by Li et al. (2001). However, that will not always be the case, and a power analysis is always preferred to basing the sample size for a future study on that of similar studies in the past. For that reason, it is important that estimates of (co)variance components, residual variance and growth rates in both phases are clearly reported in the literature, so that these can be used to calculate sample size for future studies.

Furthermore, the distribution of turning points needs to be specified a priori, along with the parameters ω and γ of the Weibull attrition function. In some studies the distribution of the turning points is under the control of the researcher. For instance, in psychotherapy trials, the therapy and follow-up phases may be of fixed duration, meaning that all participants have the same turning point and the location of the turning point is known beforehand. In trials in which a stepped-wedge design is used (Mdege et al., 2011), subjects move from the control to the intervention condition at preset points in time. In such trials there is variability in turning points but the number of turning points and the number of subjects that switch to the intervention at each turning point are under control of the experimenter and hence known beforehand. In observational studies, on the other hand, the distribution of turning points is often not known beforehand. In studies on developmental psychology, for instance, the turning point may be the transition from childhood to adolescence, and this turning point varies across subjects. However, the literature may provide good insight into the distribution of such a turning point. For studies in which no prior information about the distribution of turning points is available, the Shiny app may be used to explore the effects of various realistic distributions. The same applies, of course, to the parameters ω and γ of the Weibull attrition function.

This study extends previous work on power for piecewise growth models (Diallo & Morin, 2015; Segalas et al., 2019) by allowing for variability in turning points and non-constant attrition. Future extensions may focus on studies with more than one turning point (Cudeck & Harring, 2007; Harring et al., 2021; Marcoulides, 2018), studies with nonlinear growth in one or more phases (Flora, 2008; Harring et al., 2021; Zvoch, 2016), and studies with individually varying times of observation (Liu et al., 2015). It is also of interest to focus on models for discontinuous growth, meaning that there is not only a change in growth rate at the turning point, but also a change in level (Grimm & Marcoulides, 2016). Finally, it is worthwhile to focus on power analysis in the case of non-continuous outcome variables and to explore the effects of covariates on power.

In conclusion, this contribution presents power analysis for linear–linear piecewise growth models, taking into account the possibility of variability in turning points and non-constant attrition rates. I hope the results presented in this contribution, along with the Shiny app, will be helpful in calculating sample sizes for future research.

Data availability

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

References

Cudeck, R., & Harring, J. R. (2007). Analysis of nonlinear patterns of change with random coefficient models. Annual Review of Psychology, 58, 615–637. https://doi.org/10.1146/annurev.psych.58.110405.085520
Article Google Scholar
De Jong, K., Moerbeek, M., & Van Der Leeden, R. (2010). A priori power analysis in longitudinal three-level multilevel models: an example with therapist effects. Psychotherapy Research, 20(3), 273–284. https://doi.org/10.1080/10503300903376320
Article Google Scholar
Diallo, T. M. O., & Morin, A. J. S. (2015). Power of latent growth curve models to detect piecewise linear trajectories. Structural Equation Modeling: A Multidisciplinary Journal, 22(3), 449–460. https://doi.org/10.1080/10705511.2014.935678
Article Google Scholar
Duncan, T. E., Duncan, S. C., Strycker, L. A., & Li, F. (2004). A latent growth curve modeling approach to pooled interrupted time series analyses. Journal of Psychopathology and Behavioral Assessment, 26(4), 271–278. https://doi.org/10.1023/B:JOBA.0000045342.32739.2f
Article Google Scholar
Duncan, T. E., Duncan, S. C., & Strycker, L. A. (2006). An introduction to latent variable growth curve modeling (2nd ed.). Erlbaum.
Google Scholar
Fan, X. (2003). Power of latent growth modeling for detecting group differences in linear growth trajectory parameters. Structural Equation Modeling: A Multidisciplinary Journal, 10(3), 380–400. https://doi.org/10.1207/S15328007SEM1003_3
Article Google Scholar
Flora, D. B. (2008). Specifying piecewise latent trajectory models for longitudinal data. Structural Equation Modeling: A Multidisciplinary Journal, 15(3), 513–533. https://doi.org/10.1080/10705510802154349
Article Google Scholar
Galbraith, S., & Marschner, I. C. (2002). Guidelines for the design of clinical trials with longitudinal outcomes. Controlled Clinical Trials, 23(3), 257–273. https://doi.org/10.1016/s0197-2456(02)00205-2
Article Google Scholar
Goldstein, H. (2011). Multilevel statistical models (4th ed.). Wiley.
Google Scholar
Grimm, K., & Marcoulides, K. (2016). Individual change and the timing and onset of important life events: Methods, models, and assumptions. International Journal of Behavioral Development, 40(1), 87–96. https://doi.org/10.1177/0165025415580806
Article Google Scholar
Harring, J. R., Strazzeri, M. M., & Blozis, S. A. (2021). Piecewise latent growth models: beyond modeling linear-linear processes. Behavior Research Methods, 53(2), 593–608. https://doi.org/10.3758/s13428-020-01420-5
Article Google Scholar
Hedeker, D., Gibbons, R. D., & Waternaux, C. (1999). Sample size estimation for longitudinal designs with attrition: comparing time-related contrasts between two groups. Journal of Educational and Behavioral Statistics, 24(1), 70–93. https://doi.org/10.3102/10769986024001070
Article Google Scholar
Hox, J. J., Moerbeek, M., & Van de Schoot, R. (2018). Multilevel analysis. Techniques and applications. Routledge.
Google Scholar
Li, F., Duncan, T. E., & Hops, H. (2001). Examining developmental trajectories in adolescent alcohol use using piecewise growth mixture modeling analysis. Journal of Studies on Alcohol, 62(2), 199–210. https://doi.org/10.15288/jsa.2001.62.199
Article Google Scholar
Liu, Y., Liu, H., Li, H., & Zhao, Q. (2015). The effects of individually varying times of observations on growth parameter estimations in piecewise growth model. Journal of Applied Statistics, 42(9), 1843–1860. https://doi.org/10.1080/02664763.2015.1014884
Article Google Scholar
Marcoulides, K. M. (2018). Automated Latent Growth Curve Model Fitting: A Segmentation and Knot Selection Approach. Structural Equation Modeling: A Multidisciplinary Journal, 25(5), 687–699. https://doi.org/10.1080/10705511.2018.1424548
Article Google Scholar
Mdege, N. D., Man, M. S., Taylor nee Brown, C. A., & Torgerson, D. J. (2011). Systematic review of stepped wedge cluster randomized trials shows that design is particularly used to evaluate interventions during routine implementation. Journal of Clinical Epidemiology, 64(9), 936–948. https://doi.org/10.1016/j.jclinepi.2010.12.003
Article Google Scholar
Moerbeek, M. (2008). Powerful and cost-efficient designs for longitudinal intervention studies with two treatment groups. Journal of Educational and Behavioral Statistics, 33(1), 41–61. https://doi.org/10.3102/1076998607302630
Article Google Scholar
Moerbeek, M. (2011). The effects of the number of cohorts, degree of overlap among cohorts and frequency of observation on power in accelerated longitudinal designs. Methodology, 7(1), 11–24. https://doi.org/10.1027/1614-2241/a000019
Article Google Scholar
Muggeo, V. M. R., Atkins, D. C., Gallop, R. J., & Dimidjian, S. (2014). Segmented mixed models with random changepoints: A maximum likelihood approach with application to treatment for depression study. Statistical Modelling, 14(4), 293–313. https://doi.org/10.1177/1471082X13504721
Article Google Scholar
Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models. Applications and data analysis methods. Sage Publications.
Google Scholar
Raudenbush, S. W., & Liu, X. (2001). Effects of study duration, frequency of observation, and sample size on power in studies of group differences in polynomial change. Psychological Methods, 6(4), 387–401. https://doi.org/10.1037/1082-989X.6.4.387
Article Google Scholar
Segalas, C., Amieva, H., & Jacqmin-Gadda, H. (2019). A hypothesis testing procedure for random changepoint mixed models. Statistics in Medicine, 38(20), 3791–3803. https://doi.org/10.1002/sim.8195
Article Google Scholar
Snijders, T. A. B., & Bosker, R. J. (2012). Multilevel analysis: an introduction to basic and advanced multilevel modelling (2nd ed). Sage.
Google Scholar
Verbeke, G., & Lesaffre, E. (1999). The effect of dropout on the efficiency of longitudinal experiments. Journal of the Royal Statistical Society Series C, 48(3), 363–375. https://doi.org/10.1111/1467-9876.00158
Article Google Scholar
Zhang, Z., & Wang, L. (2009). Statistical power analysis for growth curve models using SAS. Behavior Research Methods, 41(4), 1083–1094. https://doi.org/10.3758/BRM.41.4.1083
Article Google Scholar
Zvoch, K. (2016). The use of piecewise growth models to estimate learning trajectories and RTI instructional effects in a comparative interrupted time-series design. The Elementary School Journal, 116(4), 699–720. https://doi.org/10.1086/686304
Article Google Scholar

Download references

Code availability

The Shiny app is available at https://utrecht- university.shinyapps.io/Power_Piecewise_Growth/. The source code is available at https://github.com/MirjamMoerbeek/Power_Piecewise_Growth

Author information

Authors and Affiliations

Department of Methodology and Statistics, Utrecht University, PO Box 80140, 3508 TC, Utrecht, the Netherlands
Mirjam Moerbeek

Authors

Mirjam Moerbeek
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All work has been done by Mirjam Moerbeek.

Corresponding author

Correspondence to Mirjam Moerbeek.

Ethics declarations

Competing interests

The author has no relevant financial or non-financial interests to disclose.

Ethics approval

Not applicable

Consent to participate

Not applicable

Consent for publication

Not applicable

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

ESM 1

(DOCX 18 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Moerbeek, M. Power analysis of longitudinal studies with piecewise linear growth and attrition. Behav Res 54, 2939–2948 (2022). https://doi.org/10.3758/s13428-022-01791-x

Download citation

Accepted: 03 January 2022
Published: 07 February 2022
Issue Date: December 2022
DOI: https://doi.org/10.3758/s13428-022-01791-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Power analysis of longitudinal studies with piecewise linear growth and attrition

Abstract

Similar content being viewed by others

Assessing mediational processes using piecewise linear growth curve models with individual measurement occasions

Bayesian Methods and Model Selection for Latent Growth Curve Models with Missing Data

A Review of Time Scale Fundamentals in the g-Formula and Insidious Selection Bias

Introduction

Multilevel mixed model