Abstract
In many areas of science, interest centers on the time it takes for an event to occur. For example, product engineers monitor how long a device lasts before it fails; developmental psychologists study the emergence of language; and educators track how long it takes students to master some task. Historically, issues of this nature were investigated by researchers studying mortality, so the name “survival analysis” is used as an umbrella term to cover any sort of “time-to-event” analysis, even when the event has nothing to do with life or death. For example, the time it takes an athlete to reach maximum heart rate on a treadmill (see final section of Chap. 11) can be modeled using survival analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
To be precise, the form of censoring being discussed is termed “right censoring.” Left censoring occurs when the starting point (not the end point) is missing. In this chapter, we will consider only right censoring as it is the more common of the two types.
- 2.
The data are fabricated, but the percentage of teens who have tried drugs in our fictitious sample matches estimates of recreational drug use (including alcohol) among American teenagers.
- 3.
The ℛ code that accompanies this section includes a second estimator known as the Fleming-Harrington estimator. We will discuss this procedure later in this chapter.
- 4.
If no survival rate falls below this standard, the median is not reported.
- 5.
Bear in mind that the accuracy of the test requires a large sample, which is not characteristic of our example.
- 6.
If the predictor is a two-group categorical variable, the score test is asymptotically equivalent to the log rank test discussed earlier.
- 7.
The significance of the test can be assessed by referring the absolute value of the obtained value to a normal distribution or by referring the squared test statistic to a χ2distribution with 1 df.
- 8.
We will concern ourselves only with a single predictor, but when multiple predictors are present, there will be separate Schoenfeld residuals for each predictor.
- 9.
Because the test is used with multiple predictors, it will not be illustrated with our single predictor data set. For those wishing a more complete treatment, the test is implemented in ℛ using the function: cox.zph(mod), with the survival times from a Kaplan-Meier estimate used as the measure of time, and the scaled residuals found by dividing each Schoenfeld residual by its respective variance. Additional information appears in Therneau and Grambsch (2002, pp. 130–134.)
- 10.
The comparable value using Breslow ties is b = − .3678, p = .04, and the interpretation is similar to the one using Efron ties.
- 11.
In some cases, the Weibull distribution includes a location parameter. When it is not specified, as is true in this text, it is assumed to be 0.
- 12.
The ℛ code that accompanies this section adds a weighting scheme to update σ more gradually.
- 13.
The survreg function in ℛ uses a different parameterization than the one described here. The conversion is as follows: survreg’s scale σ = 1/γ; survreg’s intercept = log(λ).
- 14.
If you look back to Eq. (12.13), you will see that the Fleming-Harrington (FH) estimator can be obtained from the Nelson-Aalen (NA) estimator: FH = exp − (NA)
References
Breslow, N. E. (1974). Covariance analysis of censored survival data. Biometrics, 30, 88–99.
Collett, D. (2014). Modeling survival data in medical research (3rd ed.). London: Chapman & Hall.
Cox, D. R. (1972). Regression models and life tables. Journal of the Royal Statistical Society Series B, 34, 187–220.
Cox, D. R., & Oakes, D. O. (1984). Analysis of survival data. London: Chapman & Hall.
Efron, B. (1977). The efficiency of Cox’s likelihood function for censored data. Journal of the American Statistical Association, 72, 557–565.
Fleming, T. R., & Harrington, D. P. (1991). Counting processes and survival analysis. New York: Wiley.
Grambsch, P., & Therneau, T. (1994). Proportional hazards tests and diagnostics based on weighted residuals. Biometrika, 81, 515–526.
Greenwood, M. (1926). The natural duration of cancer. Reports on Public Health and Medical Subjects, 33, 1–26.
Hertz-Picciotto, I., & Rockhill, B. (1997). Validity and efficiency of approximation methods for tied survival times in Cox regression. Biometrics, 53, 1151–1156.
Kalbfleisch, J. D., & Prentice, R. L. (2011). The statistical analysis of failure time data (2nd ed.). New York: Wiley.
Klein, J. P., & Moeschberger, M. L. (2003). Survival analysis: Techniques for censored and truncated data (2nd ed.). Berlin, Germany: Springer.
Therneau, T., & Grambsch, P. (2002). Modeling survival data: Extending the Cox model. Berlin,Germany: Springer.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this chapter
Cite this chapter
Brown, J.D. (2018). Survival Analysis. In: Advanced Statistics for the Behavioral Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-93549-2_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-93549-2_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93547-8
Online ISBN: 978-3-319-93549-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)