Abstract
When assessing the impact of a family or youth intervention, program evaluators often consider whether and how baseline characteristics moderate the effect of treatment. Parametric and non-parametric approaches are both possible, but each has limitations. In this article, we argue for and illustrate the use of semi-parametric methods. A hybrid of parametric and non-parametric approaches, semi-parametric modeling techniques combine the strengths of each approach. This article demonstrates how this approach can be used to evaluate whether an intervention effect was larger for those with most serious problems at baseline. Using data from the Fast Track project—a comprehensive intervention designed to prevent serious conduct problems among children at high risk—we investigated the moderating effect of two key baseline characteristics: parent-reported and teacher-reported problem behavior. Our analyses do not demonstrate a significant moderation of the intervention effect.
Similar content being viewed by others
Notes
The literature on semi-parametric and non-parametric methods is large, diffuse and growing. We found these three sources particularly helpful (Yatchew 2003; Wood 2006; Fox 2000). Our presentation of this methodology draws heavily on those sources. We also refer the reader to Li and Racine (2007) for a detailed treatment.
The mean square error of an estimate of an univariate function diminishes to zero with \( \sqrt N \). (The estimator is said to be “root-N consistent.”) Because of the curse of dimensionality, a function of two variables is \( \sqrt[3]{T} \) consistent—the estimator converges to the “truth” much more slowly. If one can separate the function into two pieces, each a function of one variable, the overall estimator is still \( \sqrt N \)-consistent (Yatchew 1998).
Estimation is somewhat more complicated than indicated by Eq. 3. In particular, differencing can involve subtracting a weighted average the k preceding observations. Doing so smoothes out additional variation in the relationship between the covariate and the outcome: doing so improves the efficiency of the resulting estimator. In our empirical example below, we employ fifth-order differencing using the optimal weights derived in Yatchew (1997). For additional details, see that article.
Researchers typically transform the polynomial into an alternative format or basis—that is, a set of functions that provide an equivalent fit to the piecewise cubic polynomial. Those functions have been refined over time in order to improve the computational properties of estimation and include models such as the B-Splines (Wood 2006). Other models set the second derivative at the knots to zero. These are labeled “natural splines”. Other models restrict the model to be linear in the tails (Durrleman and Simon 1989).
The intuition is the same as in linear regression—adding a square term or an interaction can never cause the R-square to decrease.
In an extreme case, if λ were set to zero and the knots were set at the actual data points, the fitted model would involve simple interpolation “connect the dots”. If λ were set to infinity, the resulting line would be the least squares line.
The denominator of the GCV includes the term, \( [n - \gamma tr(A)] \). When γ equals one, this is the residual degrees of freedom. The research literature suggests that there is some tendency for the GAM to overfit even when the GCV is used to determine λ. For that reason, researchers often set γ equal to 1.4. We have followed that convention in our empirical illustration below.
References
Aday, L.A., Andersen, R.: A framework for the study of access to medical care. Health Serv. Res. 9(3), 208 (1974)
Aday, L.A., Begley, C.E., Lairson, D.R., Slater, C.H., Richard, A.J., Montoya, I.D.: A framework for assessing the effectiveness, efficiency, and equity of behavioral healthcare. Am. J. Manag. Care. Jun 25;5 Spec no: SP25-44 (1999)
Aos, S., Lieb, R., Mayfield, J., Miller, M., Pennucci, A.: Benefits and Costs of Prevention and Early Intervention Programs for Youth. Washington State Public Policy Institute, Seattle (2004)
Arellano, M.: Panel Data Econometrics. Oxford University Press, Oxford (2003)
Baron, R.M., Kenny, D.A.: The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J. Pers. Soc. Psychol. 51(6), 1173–1182 (1986)
Chamberlain, P., Reid, J.B.: Parent observation and report of child symptoms. Behav. Assess. 9, 97–109 (1987)
Cohen, M.A.: The monetary value of saving a high-risk youth. J. Quant. Criminol. 14, 5–33 (1998)
Conduct Problems Prevention Research Group: A developmental and clinical model for the prevention of conduct disorders: the Fast Track Program. Dev. Psychopathol. 4, 509–527 (1992)
Conduct Problems Prevention Research Group: Initial impact of the fast track prevention trial for conduct problems: I. The high-risk sample. J. Consult. Clin. Psychol. 67(5), 631–647 (1999)
Conduct Problems Prevention Research Group: Fast track randomized controlled trial to prevent externalizing psychiatric disorders: findings from grades 3 to 9. J. Am. Acad. Child Adolesc. Psychiatry 46(10), 1250–1262 (2007)
Dawson-McClure, S., Sandler, I., Wolchik, S., Millsap, R.: Risk as a moderator of the effects of prevention programs for children from divorced families: a six-year longitudinal study. J. Abnorm. Child Psychol. 32(2), 175–190 (2004)
Deater-Deckard, K.: Physical discipline among African American and European American mothers: links to children. Dev. Psychol. 32(6), 1065–1072 (1996)
Dishion, T.J., Dodge, K.A.: Peer contagion in interventions for children and adolescents: moving towards an understanding of the ecology and dynamics of change. J. Abnorm. Child Psychol. 33(3), 395–400 (2005)
Durrleman, S., Simon, R.: Flexible regression models with cubic splines. Stat. Med. 8(5), 551–561 (1989)
Foster, E.M., Jones, D., Conduct Problems Prevention Research Group: Can a costly intervention be cost-effective?: an analysis of violence prevention. Arch. Gen. Psychiatry 63, 1284–1291 (2006)
Foster, E.M., Jensen, P.S., Schlander, M., et al.: Treatment for ADHD: is more complex treatment cost-effective for more complex cases? Health Serv. Res. 42(1 Pt 1), 165–182 (2007)
Fox, J.: Multiple and Generalized Nonparametric Regression. Sage Publications, Thousand Oaks (2000)
Greene, W.H.: Econometric Analysis, 2nd edn. Macmillan, New York (1993)
Greene, W.H.: Econometric Analysis, 6th edn. Prentice Hall, Upper Saddle River (2008)
Greenland, S., Pearl, J., Robins, J.M.: Causal diagrams for epidemiologic research. Epidemiology 10(1), 37–48 (1999)
Grossman, M.: The human capital model. In: Culyer, A.J., Newhouse, J.P. (eds.) Handbook of Health Economics, vol. 1, pp. 347–405. Elsevier, New York (2000)
Jacobson, L.: The family as a producer of health—an extended Grossman model. J. Health Econ. 19, 611–637 (2000)
Jacoby, W.G.: Loess: a nonparametric, graphical tool for depicting relationships between variables. Elect. Stud. 19(4), 577–613 (2000)
Kraemer, H.C., Wilson, G.T., Fairburn, C.G., Agras, W.S.: Mediators and moderators of treatment effects in randomized clinical trials. Arch. Gen. Psychiatry 59(10), 877–883 (2002)
Kusche, C., Greenberg, M.: The PATHS Curriculum. Developmental Research and Programs, Seattle (1993)
Li, Q., Racine, J.S.: Nonparametric Econometrics: Theory and Practice. Princeton University Press, Princeton (2007)
Morgan, S.L., Winship, C.: Counterfactuals and Causal Inference: Methods and Principles for Social Research. Cambridge University Press, Cambridge (2007)
Moyé, L.A.: Multiple Analyses in Clinical Trials: Fundamentals for Investigators. Springer, New York (2003)
Murray, D.M., Varnell, S.P., Blitstein, J.L.: Design and analysis of group-randomized trials: a review of recent methodological developments. Am. J. Public Health 94(3), 423–432 (2004)
Owens, E.J., Hinshaw, S.P., Kraemer, H.C., et al.: Which treatment for whom for ADHD? Moderators of treatment response in the MTA. J. Consult. Clin. Psychol. 71(3), 540–552 (2003)
Pearl, J.: Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge (2000)
R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna (2007)
Sandler, I.N., Ayers, T.S., Wolchik, S.A., et al.: The Family Bereavement Program: efficacy evaluation of a theory-based prevention program for parentally bereaved children and adolescents. J. Consult. Clin. Psychol. 71(3), 587–600 (2003)
Shaffer, D., Fisher, P., Dulcan, M.K., et al.: The NIMH diagnostic interview schedule for children version 2.3 (DISC-2.3): description, acceptability, prevalence rates, and performance in the MECA study. Methods for the epidemiology of child and adolescent mental disorders study. J. Am. Acad. Child Adolesc. Psychiatry 35(7), 865–877 (1996)
Spieker, S.J., Larson, N.C., Lewis, S.M., Keller, T.E., Gilchrist, L.: Developmental trajectories of disruptive behavior problems in preschool children of adolescent mothers. Child Dev. 70(2), 443–458 (1999)
Werthamer-Larsson, L., Kellam, S.: Teacher Observation of Classroom Adaptation-revised (TOCA-R). In Prevention Center Training Manual. Johns Hopkins University, Baltimore (1989)
Wood, S.: Generalized Additive Models: An Introduction With R. Chapman & Hall, London (2006)
Wood, S., Wood, M.S.: The mgcv Package (2007). http://cran.r-project.org/web/packages/mgcv/mgcv.pdf
Yatchew, A.: An Elementary Estimator of the Partial Linear Model. Econ. Lett. 57(2), 135–143 (1997)
Yatchew, A.: Nonparametric regression techniques in economics. J. Econ. Lit. 36(2), 669–721 (1998)
Yatchew, A.: Semiparametric Regression for the Applied Econometrician. Cambridge University Press, New York (2003)
Acknowledgments
This work was supported by National Institute of Mental Health (NIMH) grants R18 MH48043, R18 MH50951, R18 MH50952, and R18 MH50953. The Center for Substance Abuse Prevention and the National Institute on Drug Abuse also has provided support for Fast Track through a memorandum of agreement with the NIMH. This work was also supported in part by Department of Education grant S184U30002 and NIMH grants K05MH00797 and K05MH01027. The economic analysis of the Fast Track project is supported through R01MH62988. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute On Drug Abuse or the National Institutes of Health.
Author information
Authors and Affiliations
Corresponding author
Appendix 1: R code for the model
Appendix 1: R code for the model
-
# outcome—indicates the outcome of interest
-
# parent—parent-reported behavior problems at baseline
-
# teacher—teacher-reported behavior problems at baseline
-
# trt—treatment status
-
(#1)
Estimate the GAM using the variables that have been differenced and residualized
-
#
Variables that have been differenced have the prefix DT or DP, indicating whether
-
#
the variable has been differenced using the parent (DP) or teacher (DT) variables.
-
#
All variables have a suffix of R, indicating that they have been residualized using the
-
#
other covariates (such as site).
$$ {\tt{Model1 <- gam}}\, {\tt{(DT}} {\underline{\tt{outcome}}} {\tt{R \sim s}} ({\tt{DT}}{\underline{\tt{parent}}}{\tt{R}}), \quad {\tt{bs = tp,}} \quad {\tt{gamma = 1.4)}} $$ -
#
-
(#2)
Estimate a standard linear model for comparison purposes
$$ {\tt{Model2 <- lm}} ({\tt{DT}} {\underline{{\tt{outcome}}}} {\tt{R\sim DT}} {\underline{\tt{parent}}} {\tt{R}}) $$ -
(#3)
Estimate the test statistic using an F statistic
$$ {\tt{anova}}{\tt{(Model2}},\quad {\tt{Model1}},\quad {\tt{test}} = {\tt{\hbox{``}}}{\tt{F}}{\tt{\hbox{''})}}$$-
#
We conducted similar steps using the teacher report as the outcome.
-
#
-
(#4)
Estimate a model with each of the predictors included flexibly
$$ {\tt{Model3<-gam}}{\tt{(}}{\underline{{\tt{outcome}}}}{\tt{R}}\sim {\tt{s}}{\tt{(}}{\underline{{\tt{parent)}}}}{\tt{R}}+{\tt{s}}{\tt{(}}{\underline{{\tt{teacher}}}}{\tt{R)}},\quad{\tt{gamma}}=1.4,\quad{\tt{bs}}={\tt{tp)}}$$-
#
note that differenced variables are no longer used; each of the two baseline variables
-
#
are now included as predictors
-
#
-
(#5)
Estimate a model that allows for flexible interaction between the parent and teacher
-
#
reports
$$ {\tt{Model4<-gam}}{\tt{(}}{\underline{{\tt{outcome}}}}{\tt{R}}\sim {\tt{s}}{\tt{(}}{\underline{{\tt{parent}}}}{\tt{R}},\quad{\underline{{\tt{teacher}}}}{\tt{R}}{\tt{)}},\quad{\tt{gamma}}=1.4,\quad {\tt{bs}}={\tt{tp)}}$$
-
#
-
(#6)
Estimate the test statistic for the additivity assumption
$$ {\tt{anova}}{\tt{(Model3}},\quad {\tt{Model4}},\quad {\tt{test}} = {\tt{\hbox{``}}}{\tt{F}}{\tt{\hbox{''})}}$$ -
(#7)
Estimate a model that estimates the smoothed model separately for the
-
#
treatment and comparison cases
$$ {\tt{Model5<-gam}}{\tt{(outcomeR}}\sim {\tt{s}}{\tt{(}}{\underline{{\tt{parent}}}}{\tt{R}},{\underline{{\tt{teacher}}}}{\tt{R)}}+{\tt{s}}{\tt{(}}{\underline{{\tt{parent}}}}{\tt{R}},{\underline{{\tt{teacher}}}}{\tt{R}},{\tt{by}}={\tt{trt}}),\quad{\tt{gamma}}=1.4,{\tt{bs}}={\tt{tp)}}$$
-
#
-
(#8)
Estimate the test statistic for the assumption that the same model fits both groups
$$ {\tt{anova}}{\tt{(Model4}},\quad {\tt{Model5}},\quad {\tt{test}} = {\tt{\hbox{``}}}{\tt{F}}{\tt{\hbox{''})}}$$
Rights and permissions
About this article
Cite this article
Michael Foster, E., Watkins, S. Semi-parametric regression as a tool for assessing moderation: an analysis of the Fast Track intervention. Health Serv Outcomes Res Method 10, 67–85 (2010). https://doi.org/10.1007/s10742-010-0064-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10742-010-0064-0