Genome-wide linkage analysis of longitudinal phenotypes using σ2A random effects (SSARs) fitted by Gibbs sampling

Palmer, Lyle J; Scurrah, Katrina J; Tobin, Martin; Patel, Sanjay R; Celedon, Juan C; Burton, Paul R; Weiss, Scott T

doi:10.1186/1471-2156-4-S1-S12

Genome-wide linkage analysis of longitudinal phenotypes using σ²_A random effects (SSARs) fitted by Gibbs sampling

Proceedings
Open access
Published: 31 December 2003

Volume 4, article number S12, (2003)
Cite this article

Download PDF

You have full access to this open access article

BMC Genetics Aims and scope Submit manuscript

Genome-wide linkage analysis of longitudinal phenotypes using σ²_A random effects (SSARs) fitted by Gibbs sampling

Download PDF

Lyle J Palmer^1,2,
Katrina J Scurrah³,
Martin Tobin³,
Sanjay R Patel¹,
Juan C Celedon¹,
Paul R Burton³ &
…
Scott T Weiss¹

3971 Accesses
8 Citations
Explore all metrics

Abstract

The study of change in intermediate phenotypes over time is important in genetics. In this paper we explore a new approach to phenotype definition in the genetic analysis of longitudinal phenotypes. We utilized data from the longitudinal Framingham Heart Study Family Cohort to investigate the familial aggregation and evidence for linkage to change in systolic blood pressure (SBP) over time. We used Gibbs sampling to derive sigma-squared-A-random-effects (SSARs) for the longitudinal phenotype, and then used these as a new phenotype in subsequent genome-wide linkage analyses.

Additive genetic effects (σ²_A.time) were estimated to account for ~9.2% of the variance in the rate of change of SBP with age, while additive genetic effects (σ²_A) were estimated to account for ~43.9% of the variance in SBP at the mean age. The linkage results suggested that one or more major loci regulating change in SBP over time may localize to chromosomes 2, 3, 4, 6, 10, 11, 17, and 19. The results also suggested that one or more major loci regulating level of SBP may localize to chromosomes 3, 8, and 14.

Our results support a genetic component to both SBP and change in SBP with age, and are consistent with a complex, multifactorial susceptibility to the development of hypertension. The use of SSARs derived from quantitative traits as input to a conventional linkage analysis appears to be valuable in the linkage analysis of genetically complex traits. We have now demonstrated in this paper the use of SSARs in the context of longitudinal family data.

Background

The choice of phenotype is critical for the success of gene discovery in complex human diseases. In the case of most common, complex diseases, however, the choice is far from simple. Diseases such as cardiovascular disease are characterized by heterogeneity and are associated with a large number of intermediate phenotypes, which are themselves under complex genetic control and which may change markedly over time. The study of change in these intermediate phenotypes over time is important, as genes for disease severity or effect modification may be as or more important than genes encoding disease risk per se.

In this paper we explore a new approach to phenotype definition in the genetic analysis of longitudinal phenotypes. The generalized linear mixed model (GLMM) [1] provides the unifying framework and Gibbs sampling [2, 3] a means of fitting these models.

We utilized data from the longitudinal Framingham Heart Study Family Cohort to investigate the familial aggregation and evidence for linkage to change in systolic blood pressure (SBP) over time. Our primary aim was to assess evidence for genetic effects on the rate of increase of SBP with age, as well as genetic effects on SBP at any given time. Our secondary aim was to derive sigma-squared-A-random-effects (SSARs) for the longitudinal phenotype, and to use these as a new phenotype in subsequent genome-wide linkage analyses.

Methods

Data

We analyzed the longitudinal Framingham Heart Study Family Cohort data set, provided for the Genetic Analysis Workshop, which included 4692 individuals in all 330 pedigrees. The primary response variable of the GLMMs and linkage models reported in this paper was the SBP measured over time. Explanatory covariates for all models included: sex, age, height, weight, number of drinks per week, number of cigarettes per day. Sex (m = 1, f = 0) was analyzed as a binary covariate. Other variables were treated as continuous. All continuous covariates were centered at or close to their mean. In order to allow for a nonlinear increase in SBP with age, quadratic and cubic age terms were also included. Although the model is Bayesian, the effects of each covariate on the phenotype can also be assessed using a pseudo-likelihood approach.

The covariates drink (number of drinks per week) and CPD (number of cigarettes per day) were only measured at 11 and 18 time points, respectively, in the first cohort. However, the majority of individuals had only one or two distinct values for these covariates throughout the study, so each missing value of drink and CPD was replaced with the mean of the covariates for that individual in the two observations surrounding the missing observation. This enabled us to increase the amount of useful observations without excluding important covariates from the model. The phenotypic component of the model included 24,746 observations for 2860 individuals. These observations all had non-missing values for SBP, height, weight, age, drink, and CPD.

Variance components model

The models fitted in this analysis are extensions of our previously described variance components models for extended pedigrees with normal, binary, or censored survival phenotypes [4, 5]. The original models are GLMMs with components of variance due to additive polygenic effects (σ²_A), common family environment (σ²_C), and common sibling environment (σ²_Cs), and since the phenotype in this analysis is assumed normally distributed, an error variance (σ²_E). The complex familial correlation structure is modeled through the use of shared random effects for each variance component. The model for the j^th individual in the i^th pedigree can be represented as:

μ_ij = β₀ + β^TX_ij + A_ij + C_ij + Cs_ij

y_ij ~ N(μ_ij, σ²_E),

where y_ij is the observed phenotype, X_ij is a matrix of covariates, A_ij ~ N(0, V_A), C_ij ~ N(0, V_C), Cs_ij ~ N(0, V_Cs) and the random effects are sampled from appropriate multivariate normal distributions with variance-covariance matrices (V_A, V_C, and V_Cs) parameterized by σ²_A, σ²_C, and σ²_Cs, respectively, and structured as described in [5]. Individuals without phenotypic data are used in generating the random effects but not in the main model.

The extensions not only allow for repeated measures on individuals, but also use the extra information available to assess evidence for genetic effects on the rate of change of the outcome over time [6]. At the k^th observation for the j^th individual in the i^th pedigree, the model can be expressed as:

μ_ijk = β₀ + β^TX_ijk + A_ij + C_ij + Cs_ij + age_ijk × (β_age + A.time_ij + C.time_ij + Cs.time_ij + E.time_ij)

y_ijk ~ N(μ_ijk, σ²_E),

where y_ijk is the observed phenotype, X_ijk is a matrix of covariates, β_age is the mean gradient of age (for all individuals), A_ij ~ N(0, σ²_A), C_ij ~ N(0, σ²_C), Cs_ij ~ N(0, σ²_Cs), A.time_ij ~ N(0, σ²_A.time), C.time_ij ~ N(0, σ²_C.time), Cs.time_ij ~ N(0, σ²_Cs.time), E.time_ij ~ N(0, σ²_E.time).

In this model time was measured using the covariate age. The random effects for the gradient of age, A.time_ij, C.time_ij, Cs.time_ij, and E.time_ij, have a covariance structure equivalent to that described previously [5], with each variance component replaced by its counterpart for the gradient, so (for example) in this part of the model σ²_A is represented by σ²_A.time. However, the variance components for the intercepts (σ²_A, σ²_C, σ²_Cs, and σ²_E) are independent of the variance components for the corresponding gradients (σ²_A.time, σ²_C.time, σ²_Cs.time, and σ²_E.time), and the estimated random effects for each variance component for the intercept are different from the estimated random effects for each variance component for the gradient. All random effects are constant across observations on the same individual; they do not change over time. Just as the size of σ²_A can indicate whether additive genetic effects influence the outcome, the size of σ²_A.time can indicate whether additive genetic effects influence the rate of change of the outcome with time. The narrow-sense heritability (h²_N) was defined as the ratio of variance due to additive genetic effects (σ²_A or σ²_A.time) to the total phenotypic variance in SBP or rate of change in SBP [7].

The models were fitted using Gibbs sampling in WinBUGS v1.3 [8]. Because WinBUGS uses a Bayesian formulation, prior distributions must be specified for all parameters of interest. Vague normal N(0, 1000) priors were specified for fixed regression coefficients. Vague Pareto(1, 0.001) priors were specified on the precisions (inverses of the variances) of all random effects; these were equivalent to specifying uniform priors on the corresponding variances. Models were run for 45,000 iterations after a burn-in of 5000 iterations.

Blood pressure adjustment

The problem of how to model continuous SBP when some individuals are on blood pressure treatment is complex. However, since complex modelling of SBP was not the main aim of this particular analysis, we chose a simple method of adjustment based on known average treatment effects [9, 10]. This involved adding a constant (10 mm Hg) to each phenotype where the individual was on treatment, to reflect the 'true' SBP that might have been observed if the individual had not been on treatment.

Linkage analysis

As described previously, the random effects due to σ²_A (A_ij, the sigma-squared A residuals, or SSARs) can be used as a continuous phenotype in a linkage analysis [11, 12]. The SSARs for the gradient (A.time_ij, the sigma-squared A time residuals or SSATRs), may also be used in such a way. The estimated SSARs and SSATRs were used as continuous phenotypes in a multipoint variance components linkage analysis that was performed using SOLAR v1.7.4 [13]. All 395 markers were included. The most common approach undertaken in previous genetic analyses of longitudinal data has been to estimate a simple linear gradient in each subject and then use this as a phenotype. Therefore, in order to compare our SSAR methodology to such an approach, we also estimated a simple linear gradient in each subject and used this as an outcome into the SOLAR linkage analyses.

Results

Results from fitting the variance components model in WinBUGS are shown in Table 1. Both means ('best estimates') and 95% credible intervals are presented for each parameter. We deliberately excluded certain intermediate phenotypes (such as cholesterol and BMI) from the model even though they were clearly related to SBP, because we did not want to model away their contribution to variation in SBP.

Table 1 Results from main model

Full size table

The size of σ²_A.time (relative to the other variance components for the gradient) gave a h²_N estimate of ~9.2%, suggesting that genes do influence the rate of change of SBP with age, but that nongenetic influences due to shared familial or sibling environment are more important. In contrast, the variance components estimates for the intercept suggest that additive genetic effects are the largest component of variance for SBP at the mean age (h²_N ~ 43.9%).

The mean rate of change of SBP with each year was 0.62, suggesting that for the average individual, SBP increases by approximately 6.2 mm Hg every 10 years. A nonlinear effect was also apparent, since the credible intervals for both the quadratic and cubic terms excluded 0 (Figure 1).

Other covariates also appeared to influence SBP. Males tended to have lower SBP than females, and individuals from the second cohort tended to have lower SBP than those included in the first cohort. Both increasing weight and number of drinks consumed per day was significantly associated with increased SBP. Height and number of cigarettes per day appeared to show little association with SBP.

Results changed little when σ²_E.time was excluded, when σ²_C and σ²_C.time were excluded, when σ²_Cs and σ²_Cs.time were excluded, and when the SBP adjustment (for those on blood pressure treatment) was changed to +5 mm Hg or +20 mm Hg (instead of +10 mm Hg). A number of genome-wide significant [14] linkages were found for both the SSARs and the SSATRs (Tables 2 and 3). No evidence of genome-wide significant linkage was found for the simple linear slope. The results suggested that one or more major loci regulating change in SBP over time may localize to chromosomes 2, 3, 4, 6, 10, 11, 17, and 19 (Table 2). The results also suggested that one or more major loci regulating level of SBP may localize to chromosomes 3, 8, and 14 (Table 3).

Table 2 Summary of linkage results to change in SBP (SSATR; LOD > 3)

Full size table

Table 3 Summary of linkage results to SBP (SSAR; LOD > 3)

Full size table

Discussion

Our methods for the analysis of longitudinal data in pedigrees have several advantages. They make full use of the information contained in the repeated observations for each individual, without first requiring the information to be summarized in some way. It is simple to include extra covariates or single-nucleotide polymorphism genotypes (i.e., association analysis). They allow the inclusion of all pedigree members, without requiring decomposition of the pedigrees into smaller structures. These adjusted phenotypes can then be usefully used in linkage analyses. The results obtained suggest that our extended variance components models perform well in practice. They are relatively straightforward to fit, and produce useful and biologically plausible results.

Previous analyses of these data suggested a major locus regulating longitudinal SBP on chromosome 17 (multipoint LOD = 4.97 at 67 cM) [15]. This finding is broadly consistent (within 25 cM) of our finding on chromosome 17 for SSATR (multipoint LOD = 3.97 at 89 cM) (Table 2). Differences in results between this previous study [15] and our study may relate to differences in phenotypic definition (Levy et al. analyzed a residualized linear slope adjusted for BMI), differences in dealing with antihypertensive treatment, and differences in the sub-set of the Framingham Heart Study data analyzed.

Our results support a genetic component to both SBP and change in SBP with age, and are consistent with a complex, multifactorial susceptibility to the development of hypertension. The use of SSARs derived from quantitative traits as input to a conventional linkage analysis has previously been investigated using both real [11] and simulated data [12], and appears to be valuable in the linkage analysis of genetically complex traits. An analysis of the GAW13 simulated data using these methods [6] produced answers that corresponded well with the simulating model. We have now demonstrated in this paper the use of SSARs in the context of longitudinal family data.

References

Breslow N, Clayton D: Approximate inference in generalized linear mixed models. J Am Stat Assoc. 1993, 88: 9-25. 10.2307/2290687.
Google Scholar
Zeger S, Karim M: Generalized linear models with random effects; a Gibbs sampling approach. J Am Stat Assoc. 1991, 86: 79-86. 10.2307/2289717.
Article Google Scholar
Guo S, Thompson E: Monte Carlo estimation of variance components models. IMA J Math Appl Med Biol. 1991, 8: 171-189.
Article CAS PubMed Google Scholar
Burton P, Tiller K, Gurrin L, Cookson W, Musk A, Palmer L: Genetic variance components analysis for binary phenotypes using generalized linear mixed models (GLMMs) and Gibbs sampling. Genet Epidemiol. 1999, 17: 118-140. 10.1002/(SICI)1098-2272(1999)17:2<118::AID-GEPI3>3.3.CO;2-M.
Article CAS PubMed Google Scholar
Scurrah KJ, Palmer LJ, Burton PR: Variance components analysis for pedigree-based censored survival data using generalized linear mixed models (GLMMs) and Gibbs sampling in BUGS. Genet Epidemiol. 2000, 19: 127-148. 10.1002/1098-2272(200009)19:2<127::AID-GEPI2>3.0.CO;2-S.
Article CAS PubMed Google Scholar
Scurrah KJ, Tobin MD, Burton PR: Longitudinal variance components models for systolic blood pressure, fitted using Gibbs sampling. BMC Genetics. 2003, 4 (suppl 1): S25-10.1186/1471-2156-4-S1-S25.
Article PubMed Central PubMed Google Scholar
Khoury M, Beaty T, Cohen B: Fundamentals of Genetic Epidemiology. Oxford, Oxford University Press. 1993
Google Scholar
Spiegelhalter D, Best N, Thomas A: WinBUGS Version 1.3 User Manual. Cambridge, Cambridge University MRC Biostatistics Unit. 2000
Google Scholar
Palmer LJ: Loosening the cuff: important new advances in modeling antihypertensive treatment effects in genetic studies of hypertension. Hypertension. 2003, 41: 197-198. 10.1161/01.HYP.0000051503.09587.E4.
Article CAS PubMed Google Scholar
Cui JS, Hopper JL, Harrap SB: Antihypertensive treatments obscure familial contributions to blood pressure variation. Hypertension. 2003, 41: 207-210. 10.1161/01.HYP.0000044938.94050.E3.
Article CAS PubMed Google Scholar
Palmer LJ, Tiller KJ, Burton PR: Genome wide linkage analysis using genetic variance components of alcohol-dependency-associated censored and continuous traits. Genet Epidemiol. 1999, 17: 283-288.
Article Google Scholar
Palmer LJ, Jacobs KB, Scurrah KJ, Xu X, Horvath S, Weiss ST: Genome wide linkage analysis in a general population sample using s2A random effects (SSARs) fitted by Gibbs sampling. Genet Epidemiol. 2001, 21: 674-679.
Google Scholar
Almasy L, Blangero J: Multipoint quantitative-trait linkage analysis in general pedigrees. Am J Hum Genet. 1998, 62: 1198-1211. 10.1086/301844.
Article PubMed Central CAS PubMed Google Scholar
Lander E, Kruglyak L: Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results. Nat Genet. 1995, 11: 241-247. 10.1038/ng1195-241.
Article CAS PubMed Google Scholar
Levy D, DeStefano AL, Larson MG, O'Donnell CJ, Lifton RP, Gavras H, Cupples LA, Myers RH: Evidence for a gene influencing blood pressure on chromosome 17. Genome scan linkage results for longitudinal blood pressure phenotypes in subjects from the Framingham Heart Study. Hypertension. 2000, 36: 477-483.
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Channing Laboratory, Brigham & Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA
Lyle J Palmer, Sanjay R Patel, Juan C Celedon & Scott T Weiss
Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, Ohio, USA
Lyle J Palmer
Department of Epidemiology and Public Health and Institute of Genetics, University of Leicester, United Kingdom
Katrina J Scurrah, Martin Tobin & Paul R Burton

Authors

Lyle J Palmer
View author publications
You can also search for this author in PubMed Google Scholar
Katrina J Scurrah
View author publications
You can also search for this author in PubMed Google Scholar
Martin Tobin
View author publications
You can also search for this author in PubMed Google Scholar
Sanjay R Patel
View author publications
You can also search for this author in PubMed Google Scholar
Juan C Celedon
View author publications
You can also search for this author in PubMed Google Scholar
Paul R Burton
View author publications
You can also search for this author in PubMed Google Scholar
Scott T Weiss
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lyle J Palmer.

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Palmer, L.J., Scurrah, K.J., Tobin, M. et al. Genome-wide linkage analysis of longitudinal phenotypes using σ²_A random effects (SSARs) fitted by Gibbs sampling. BMC Genet 4 (Suppl 1), S12 (2003). https://doi.org/10.1186/1471-2156-4-S1-S12

Download citation

Published: 31 December 2003
DOI: https://doi.org/10.1186/1471-2156-4-S1-S12

Genome-wide linkage analysis of longitudinal phenotypes using σ²_A random effects (SSARs) fitted by Gibbs sampling

Abstract

Background

Methods

Data

Variance components model

Blood pressure adjustment

Linkage analysis

Results

Discussion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Genome-wide linkage analysis of longitudinal phenotypes using σ2A random effects (SSARs) fitted by Gibbs sampling

Abstract

Background

Methods

Data

Variance components model

Blood pressure adjustment

Linkage analysis

Results

Discussion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Genome-wide linkage analysis of longitudinal phenotypes using σ²_A random effects (SSARs) fitted by Gibbs sampling