Progressive effects of single-nucleotide polymorphisms on 16 phenotypic traits based on longitudinal data

Li, Donghe; Kang, Hahn; Lee, Sanghun; Won, Sungho

doi:10.1007/s13258-019-00902-x

Progressive effects of single-nucleotide polymorphisms on 16 phenotypic traits based on longitudinal data

Research Article
Open access
Published: 04 January 2020

Volume 42, pages 393–403, (2020)
Cite this article

Download PDF

You have full access to this open access article

Genes & Genomics Aims and scope Submit manuscript

Progressive effects of single-nucleotide polymorphisms on 16 phenotypic traits based on longitudinal data

Download PDF

Donghe Li¹,
Hahn Kang²,
Sanghun Lee³ &
…
Sungho Won^1,4,5

1645 Accesses
1 Altmetric
Explore all metrics

Abstract

Background

There are many research studies have estimated the heritability of phenotypic traits, but few have considered longitudinal changes in several phenotypic traits together.

Objective

To evaluate the progressive effect of single nucleotide polymorphisms (SNPs) on prominent health-related phenotypic traits by determining SNP-based heritability ($h_{snp}^{2}$) using longitudinal data.

Methods

Sixteen phenotypic traits associated with major health indices were observed biennially for 6843 individuals with 10-year follow-up in a Korean community-based cohort. Average SNP heritability and longitudinal changes in the total period were estimated using a two-stage model. Average and periodic differences for each subject were considered responses to estimate SNP heritability. Furthermore, a genome-wide association study (GWAS) was performed for significant SNPs.

Results

Each SNP heritability for the phenotypic mean of all sixteen traits through 6 periods (baseline and five follow-ups) were significant. Gradually, the forced vital capacity in one second (FEV1) reflected the only significant SNP heritability among longitudinal changes at a false discovery rate (FDR)-adjusted 0.05 significance level ($h_{snp}^{2} = 0.171$, FDR = 0.0012). On estimating chromosomal heritability, chromosome 2 displayed the highest heritability upon periodic changes in FEV1. SNPs including rs2272402 and rs7209788 displayed a genome-wide significant association with longitudinal changes in FEV1 (P = 1.22 × 10⁻⁸ for rs2272402 and P = 3.36 × 10⁻⁷ for rs7209788). De novo variants including rs4922117 (near LPL, P = 2.13 × 10⁻¹⁵) of log-transformed high-density lipoprotein (HDL) ratios and rs2335418 (near HMGCR, P = 3.2 $\times$ 10⁻⁹) of low-density lipoprotein were detected on GWAS.

Conclusion

Significant genetic effects on longitudinal changes in FEV1 among the middle-aged general population and chromosome 2 account for most of the genetic variance.

A toolkit for quantification of biological age from blood chemistry and organ function test data: BioAge

Article 02 November 2021

The Emerging Theoretical Framework of Life Course Health Development

Examining the relationship between genetic polymorphisms (BDKRB2, GNB3, HIF1A, MCT1, NOS3) and endurance athlete status

Article Open access 16 May 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Single nucleotide polymorphism (SNP)-based heritability ($h_{snp}^{2}$) indicates the relative proportion of genetic variance explained based on SNPs used for genome-wide association studies (GWASs). For $h_{snp}^{2}$ estimation, the genomic restricted maximum likelihood (GREML) for linear mixed models (LMMs) is often implemented in the genetic complex trait analysis (GCTA) tool (Yang et al. 2011). GREML first calculates the genetic relatedness matrices (GRM), which are used as variance–covariance matrices for random effects. The significance of estimates obtained through GREML depends on the study design; if it is applied to family-based samples, it displays pedigree-based heritability, but for unrelated subjects, it estimates $h_{snp}^{2}$ (Kim et al. 2015; Yang et al. 2017). Estimation of $h_{snp}^{2}$ involves considerable differences across not only methodologies but also procedures requiring careful interpretation of results (Evans et al. 2018; Ni et al. 2018). Moreover, the estimated heritability is potentially biased and misleading owing to measurement errors at various degrees. To overcome these challenges, the heritability determined from longitudinal data is more reliable than that determined from cross-sectional data. While most studies on $h_{snp}^{2}$ focused on the primary effect of SNPs, significant effects of SNPs on the average annual differences indicate the SNP-by-age interaction. Numerous examples illustrate the importance of age on longitudinal changes (Genetic epidemiologic studies on age-specified traits. NIA Aging and Genetic Epidemiology Working Group 2000; Nishimura et al. 2012; van de Pol and Verhulst 2006). For instance, one study showed that an annual decline in lung function is associated with age (Kim et al. 2016), and another study reported a genetic influence on changes in both lipoprotein risk factors and systolic blood pressure over a decade (Friedlander et al. 1997). Therefore, the $h_{snp}^{2}$ should be estimated on the basis of not only the mean of observed traits but also changes in the sufficient period. Hence, we applied a two-stage approach, which is a convenient method of analyzing longitudinal data by combining linear regression models to investigate the effect of SNPs on both average and longitudinal differences in phenotypic traits.

In this study, we investigated the magnitude of SNPs effect on average and longitudinal differences by using both genomic data and 16 phenotypic traits associated with major health indices using a phenotype-genotype dataset of unrelated individuals in a community-based cohort and evaluated their importance. Except for baseline, each phenotype was objectively measured every 2 years for 10-year follow-up, and six repeated measurements (maximum) were obtained for each individual. For each subject, both the average phenotypic traits and their longitudinal changes were estimated via subject-specific regression analysis, using intercepts and coefficients of ages, respectively. Each $h_{snp}^{2}$ value was estimated using GCTA. Our results show that lung function has the only significant $h_{snp}^{2}$ for longitudinal changes, while all average phenotypes of the 16 traits yielded a significant $h_{snp}^{2}$ value. Furthermore, the GWAS revealed certain novel genome-wide significant SNPs associated with the phenotypes analyzed herein.

Materials and Methods

Ethics approval and consent to participate

The respective Institutional Review Board (IRB) of Seoul National University reviewed and approved the informed consent, the study protocols and other documents (Permit No. E1605/002-003). All methods were performed in accordance with the relevant guidelines and regulations.

Korea Associated Resource (KARE) cohort data

Korea Associated Resource (KARE) data are based on a community-based epidemiological study and comprises subjects residing in Ansan (urban area) and Ansung (rural area) in the Gyeonggi Province of South Korea (Cho et al. 2009). A baseline survey was completed in 2001–2002, and 10,030 participants aged 40–69 years were recruited. After that, biennial repeated surveys were conducted, and the last survey were completed in 2013–2014 (Kim et al. 2017). Six different surveys were conducted in total. These measurement periods are indicated as periods 1–6 throughout this study, each with a different number of subjects (e.g., period 1: 8543 subjects [4052 male, 4491 female]; period 6: 5391 subjects [2502 male, 2889 female]). The number of overlapping subjects throughout the six periods was 4306 (2009 male, 2297 female). Among these, subjects whose traits were measured at least three times were considered, and 6843 participants (3273 male, 3570 female) were assessed in total.

Many participant phenotypes were recorded by trained interviewers through questionnaires and clinical measurement, and we only considered the 16 quantitative traits that they were measured objectively and associated with major health indices; these were classified into four groups: anthropometric, biochemistry, cardiopulmonary, and red blood cell traits (Table 1). As glycated hemoglobin (HbA1c), fasting blood glucose (GLU0), high-density lipoprotein (HDL), triglycerides (TG), and systolic blood pressure (SBP) displayed skewed distributions, they were log-transformed and denoted by log(HbA1c), log(GLU0), log(HDL), log(TG), and log(SBP), respectively. The missing rate of HbA1c was larger than 0.5 at period 2 and was excluded from further analyses. For each trait, subjects with more than three measurement observations were assessed.

Table 1 Sixteen phenotypic traits associated with major health indices

Full size table

Genotypes

Genotype data for the KARE cohort were obtained using the Affymetrix Genome-Wide Human SNP array 5.0 (Cho et al. 2009). Quality control (QC) analysis of SNPs and subjects was conducted using PLINK (Purcell et al. 2007) and ONETOOL (Song et al. 2018). SNPs with P-values from Hardy–Weinberg equilibrium (HWE) analysis < 10⁻⁵, minor allele frequencies (MAFs) < 0.05, and genotype call rates < 95% were excluded. Furthermore, subjects with missing genotype call rates > 5% or sex-based inconsistencies were excluded. The missing genotypes for typed SNPs were imputed on the basis of the 1000-genome sequence reference data. After QC analysis, 305,158 SNPs were analyzed for SNP heritability and GWAS (Fig. 1).

Data availability

The data used in this study underwent an application process according to the Korean Genome and Epidemiology study data access policy, and can be downloaded from following website: http://nih.go.kr/index.es?sid=a5#a.

Determination of phenotypic averages and longitudinal changes in each subject

The phenotypic averages and longitudinal changes for each subject were determined and used to estimate SNP heritability and for the GWAS. Significant differences in phenotypic variances were observed for each period, and such heteroscedasticity was considered for phenotypic averages and longitudinal changes for each subject as follows. First, the linear regression model for subjects of the same period was adjusted for traits. The effect of sex, age, and top 10 principal component (PC) scores estimated from the genetic relationship matrix were adjusted as covariates. Considering $w_{jk}$ as the residual variances of trait k (k = 1, …, 16) at the period j, the following linear regression model was adjusted with repeated measures for each subject i as follows:

$$y_{ijk} = \beta_{ik0} + \beta_{ik1} \left( {age_{ij} - \overline{{age_{i} }} } \right) + \varepsilon_{ijk} , \varepsilon_{ijk} \sim N\left( {0,\frac{1}{{w_{jk} }}\sigma_{ik}^{2} } \right)$$

(1)

Here, i indicates the ith subject, and $\overline{{age_{i} }}$ indicates the mean of ages at the observed time points. In the regression model (1), $y_{ijk}$ is the jth period observed value of subject i for trait k, $\beta_{ik0}$ indicates the expected phenotypic mean of subject i for trait k when he or she is $\overline{{age_{i} }}$ years old, and $\beta_{ik1}$ is the average longitudinal change in trait k. The estimated values of $\beta_{ik0}$ and $\beta_{ik1}$ were used to estimate the heritability and for GWAS analysis. For convenience, both are denoted by $B_{0}$ and $B_{1}$, respectively.

Estimation of heritability

$B_{0}$ and $B_{1}$ were separately used to estimate SNP heritability. Analyses were conducted using GCTA (Yang et al. 2011), and the restricted maximum likelihood estimator was used. Effects of age and sex were adjusted as covariates.

GWAS analysis

$B_{0}$ and $B_{1}$ were separately analyzed to identify disease susceptibility loci for 16 different traits. The $\overline{{age_{i} }}$, sex, and 10 PC scores estimated from the genetic relationship matrix were included as covariates to adjust the population stratification.

Results

Estimation of heritability

A schematic representation of the heritability analysis and genome-wide association study is shown in Fig. 1. For 16 different traits of 6843 subjects, the mean and standard deviation values of each trait at period 1 are shown in Table 2 (see Table 1 for detailed information). Some missing values resulted in differences in the total number of subjects depending on the phenotype, and the sample sizes of those traits and descriptive statistics including sex and age were summarized.

Table 2 Descriptive statistics of 16 traits

Full size table

For those 6843 subjects, a multidimensional scaling (MDS) plot was generated (Fig. 2). As shown in Fig. 2, subjects from the 1000 Genomes Project were also included, and our analyses were not affected by population stratification.

We calculated the descriptive statistics for $B_{0}$ and $B_{1}$ (Table 3, see “Materials and methods” for details). $B_{0}$ in Eq. (1) indicates the means of the predicted traits at $\overline{{age_{i} }}$ years of age. $B_{1}$ stands for the longitudinal changes in the traits of each subject. Table 3 shows that the means of $B_{0}$ are similar to those of period 1. Means of $B_{1}$ were generally closer to 0. Figure 3 shows the estimates of heritability with $B_{0}$ as the response in the GREML model, and the estimated heritability of height for the data peaked at 0.318 (P = 1.665 $\times$ 10⁻¹⁶, FDR = 2.664 $\times$ 10⁻¹⁵). The subsequent three highest heritability traits were total cholesterol (TCHL), log(HDL), and low-density lipoprotein (LDL), with values of 0.265 (P = 3.895 $\times$ 10⁻¹², FDR = 3.116 $\times$ 10⁻¹¹), 0.241 (P = 8.911 $\times$ 10⁻¹⁰, FDR = 4.753 $\times$ 10⁻⁹), and 0.222 (P = 5.178 $\times$ 10⁻⁹, FDR = 1.657 $\times$ 10⁻⁸), respectively. These three traits are cholesterol-related. The heritability of waist was 0.218 (P = 5.016 $\times$ 10⁻⁹, FDR = 1.657 $\times$ 10⁻⁸) and that of weight was 0.196 (P = 2.046 $\times$ 10⁻⁷, FDR = 5.456 $\times$ 10⁻⁷). The heritability was 0.195 for Hb (P = 4.926 $\times$ 10⁻⁷, FDR = 9.852 $\times$ 10⁻⁷) and 0.192 for log(TG) (P = 4.419 $\times$ 10⁻⁷, FDR = 9.852 $\times$ 10⁻⁷). The heritability of the other traits with an FDR larger than 1 $\times$ 10⁻⁶ were less than 0.19.

Table 3 Summary of $B_{0}$ and $B_{1}$ of 16 traits

Full size table

Figure 4 shows the heritability estimates for $B_{1}$, which are generally less than those for $B_{0}$, and we found that lung function traits FVC and FEV1, wasit, diastolic blood pressure (DBP), BMI, and log(SBP) are relatively high. The highest heritability estimate was observed for FEV1 (0.171) and its FDR-adjusted P value was 0.0189. The heritability estimates of other traits were less than 0.1. The second highest heritability was 0.0941 for FVC, and its FDR-adjusted P value was 0.166. The heritability of waist was also relatively higher than that of other traits. Waist heritability and the FDR-adjusted P values were 0.0082 and 0.0657, respectively. The higher heritability estimates for $B_{1}$ indicate that the decreasing/increasing rates were associated with genetic factors. Height displayed the highest heritability estimates for $B_{0}$, but its estimate for $B_{1}$ was low (0.0297). Height does not usually change after the age of 20, and its lower value here is probably attributable to it. For the other traits including log(HbA1c), LDL, log(HDL), TCHL, and Hb levels, SNP heritability estimates tended towards 0.

Furthermore, we determined chromosomal heritability estimates of FEV1, which displayed the highest heritability in the $B_{1}$ model. Consequently, chromosome 2 accounted for the highest proportion of phenotypic variance ($h_{snp}^{2}$ = 0.0397), albeit with a high standard error (Fig. 5).There was a significant positive correlation between chromosome length and heritability (r = 0.58, P = 0.0045) in FEV1 (Fig. 6).

Genome-wide association studies

$B_{0}$ and $B_{1}$ were considered responses for GWAS. Tables 4 and 5 show genome-wide significant SNPs at a significance level of 1 × 10⁻⁷. Subsequent findings are summarized in the Supplementary material (Tables S1 and S2). Table 4 shows that SNPs have relatively lower P values for log(TG) and log(HDL) than any other trait. The most significant variant for log(TG) was rs6589566 in ZPR1 with a P-value of 7.9 $\times$ 10⁻³⁸, the lowest P value among all 16 traits. Furthermore, ZPR1 is associated with TG (Coram et al. 2013). The most significant variant of log(HDL) was rs16940212 with a P value of 2.08 $\times$ 10⁻¹⁸ in ALDH1A2, which is associated with HDL (Spracklen et al. 2017). Certain other significant variants were significantly associated with proximal genes and with traits assessed herein. The variant rs180349 (P = 8.86 $\times$ 10⁻³⁵) of log(TG) was proximal to BUD13, which is associated with TG (Hoffmann et al. 2018). The variant rs17482753 (P = 3.199 $\times$ 10⁻¹⁸) is proximal to LPL, which was strongly associated with HDL (Hoffmann et al. 2018). Herein, we also detected some de novo variants including rs4922117 (P = 2.13 $\times$ 10⁻¹⁵) of log(HDL) and rs2335418 (P = 3.2 $\times$ 10⁻⁹) of LDL, which were previously unknown; however, both their proximal genes LPL and HMGCR are significantly associated with each trait (Hoffmann et al. 2018). The Manhattan Plot and QQ plot for the model with $B_{0}$ as the response are provided in the Supplementary material (Figures S1 and S2).

Table 4 Results of the genome-wide association study with $B_{0}$ as the response

Full size table

Table 5 Results of the genome-wide association study with $B_{1}$ as the response

Full size table

Table 5 shows the results of GWAS for $B_{1}$. Based on the results, rs2272402 (SLC6A1) was the most significant variant both in FEV1 (P = 1.22 $\times$ 10⁻⁸) and FVC (P = 1.40 $\times$ 10⁻⁹) (Regional plots are shown in Supplementary Figures S5 and S7), and the SLC6A1 enhancer was associated with lung function. Other variants including rs7209788 (NARF, P = 3.36 $\times$ 10⁻⁷) for FEV1 and rs2668162 (FAM19A1, P = 6.18 $\times$ 10⁻⁷) for FVC had P-values less than the 1 $\times$ 10⁻⁶ threshold (S2 Table). From the regional plot (Supplementary Figure S6), we found that rs4789777(HEXDC, P = 4.599 $\times$ 10⁻⁶) is highly correlated with rs7209788 of FEV1. The Manhattan Plot and QQ plot for the model with $B_{1}$ as the response are provided in the Supplementary material (Figures S3 and S4).

Discussion

In this study, SNP-based heritability estimates of 16 phenotypic traits were estimated longitudinal data from a 10-year follow-up of the KARE cohort. The GCTA tool was used with a two-stage approach to determine the heritability estimate of phenotypic mean and longitudinal changes in each trait. Moreover, chromosomal heritability estimates were determined and GWAS analyses were performed using the same approach. Overall, heritability estimates within the population-based cohort including KARE are potentially lower than those of pedigree or twin studies for all 16 traits, regardless of whether the response is $B_{0}$ that phenotypic mean of traits or $B_{1}$ which stands for the changes by time of traits. For example, the heritability of height herein was estimated to be approximately 0.318 with $B_{0}$ as the response, which is lower than the conventional heritability estimate of height of approximately 0.8 based on the assumption-free model (Visscher et al. 2006). In the case of TCHL and LDL, each heritability estimate was determined to be 0.265 and 0.22, respectively, which are also lower than the heritability estimates of 0.67 and 0.69 for TCHL and LDL, respectively, on familial and pedigree analysis (van Dongen et al. 2013). The underlying reason may be explained on the basis of the missing heritability, which describes the difference in values between heritability estimated via GWAS and via familial studies (Sandoval-Motta et al. 2017). However, systemic inflation of estimated heritability estimates of polygenic phenotypes in familial studies may be confounded owing to a shared environment or environment-dependent genetic effects (Robinson et al. 2017). Therefore, the population-based design similar to that of the present study potentially represents the average genetic effects regardless of various confounding environmental factors.

Based on the present $B_{0}$ and $B_{1}$ model, the heritabilities of $B_{1}$ are markedly lower than those of $B_{0}$, indicating that most of the genetic variance of traits are not temporally influenced. Here, $B_{0}$ was not determined from the baseline measurements of traits but rather the average values of repeated measurements to yield a more robust and reasonable result. If baseline measurement and longitudinal changes $(B_{1} )$ calculated from those were considered responses during the estimation of heritability, the estimate may have been potentially inaccurate owing to the correlation between baseline and $B_{1}$ values. Thus, by applying a regression model to estimate the average $B_{0}$ and longitudinal changes $B_{1}$, the effect of $B_{0}$ on $B_{1}$ in each subject could be removed.

On GWAS, the two-stage model elucidated significant variants associated with the traits and their changes in the longitudinal data. We confirmed several proven variants and identified some other significant unreported variants. In the case of the $B_{0}$ model, rs4922117 (P = 2.13 $\times$ 10⁻¹⁵) of log (HDL) and rs2335418 (P = 3.2 $\times$ 10⁻⁹) of LDL were both unreported; however, their proximal genes LPL and HMGCR, respectively, were significantly associated with each trait (Hoffmann et al. 2018). Furthermore, unreported genes, such as rs180349, including non-coding SNPs with a significant P value for TG are proximal to BUD13, which is strongly associated with TG (Hoffmann et al. 2018). Variants including rs17482753 also had significant P values and was proximal to LPL, which is strongly associated with the HDL trait (Hoffmann et al. 2018). In the $B_{1}$ model, rs2272402 (SLC6A1, P = 1.22 $\times$ 10⁻⁸) was significant in both FEV1 and FVC lung function. The SLC6A1 enhancer is associated with pulmonary function. Therefore, the present results are concurrent with previous findings regarding genes associated with each phenotype.

Among the 16 phenotypic traits in this study, only FEV1 displayed longitudinally significant heritability herein (Fig. 4), thus reliably reflecting the physiological state of the lungs and airways and acting as a predictor of morbidity and mortality in the general population; FEV1 is also widely used to define chronic obstructive pulmonary disease (COPD) (Young et al. 2007). Lung function develops in early life, peaks at a specific time point in early adulthood, and subsequently declines with age. Therefore, the decline of lung function in middle-aged and older individuals is suggested to be heritable in the general population (Gottlieb et al. 2001). However, longitudinal studies on FEV1 and FEV1/FVC have suggested several significant genetic regions that markedly differ from the numerous genetic variants associated with lung function, with FEV1 being estimated at a single time point (John et al. 2017; Tang et al. 2014). Hence, gene-environment interactions and significant genetic heterogeneity in lung function have been observed in diseases such as asthma or COPD (Hansel et al. 2013; Imboden et al. 2012). Accordingly, the present study included the middle-aged general population with similar environmental exposure without specific lung diseases, thus suggesting that intact FEV1 decreased due to aging. Therefore, the present results show that FEV1 has significant SNP heritability for longitudinal changes (FDR = 0.0012 for FEV1).

This study has several limitations. First, the analysis of new variants in the present GWAS was not replicated for other cohorts. Second, the two-stage approach is statistically inefficient even though it is computationally fast. However, the sample size was very large, which hopefully minimized this problem. Furthermore, we considered subjects with at least three or more measurements, which potentially minimize statistical power loss. Third, gene-environment interactions were not analyzed, although the estimation of random effects in the mixed model was elusive. Fourth, GCTA itself has limitations for reasons such as data overfitting and skewed singular values (Kumar et al. 2016). Though our study optimized parameters to attain accurate results using GCTA, our sample size might have resulted in certain variations in comparison with other large studies. Furthermore, the issue regarding missing heritability was inevitable to an extent because the Affymetrix genotypic array represents only common variants for SNPs, while rare genetic SNP variants were not included herein (Bandyopadhyay et al. 2017).

Despite the aforementioned limitations, our study elucidates heritability estimates via a two-stage approach using a mixed model in GCTA and a GWAS, which further determines longitudinal change effects independently with a linear model, followed by estimating heritability using regression coefficients. This approach provides a reasonable and easy method to estimate heritability in longitudinal data and potentially assess both heritability of the phenotypic mean and changes through several periods. Essentially, our results show that significant SNP heritability is objectively confirmed for longitudinal changes in lung function decline including FEV1 in comparison with other health-related indices. Therefore, genetic studies on longitudinal FEV1 decline among the middle-aged general population and chromosome 2, which attributes the most in genetic variance should be encouraged.

References

Bandyopadhyay B, Chanda V, Wang Y (2017) Finding the sources of missing heritability within rare variants through simulation. Bioinform Biol Insights 11:1177932217735096
Article Google Scholar
Cho YS, Go MJ, Kim YJ, Heo JY, Oh JH, Ban HJ, Yoon D, Lee MH, Kim DJ, Park M et al (2009) A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat Genet 41:527–534
Article CAS Google Scholar
Coram MA, Duan Q, Hoffmann TJ, Thornton T, Knowles JW, Johnson NA, Ochs-Balcom HM, Donlon TA, Martin LW, Eaton CB et al (2013) Genome-wide characterization of shared and distinct genetic components that influence blood lipid levels in ethnically diverse human populations. Am J Hum Genet 92:904–916
Article CAS Google Scholar
Evans LM, Tahmasbi R, Vrieze SI, Abecasis GR, Das S, Gazal S, Bjelland DW, de Candia TR, Goddard ME, Neale BM et al (2018) Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits. Nat Genet 50:737
Article CAS Google Scholar
Friedlander Y, Austin MA, Newman B, Edwards K, Mayer-Davis EI, King MC (1997) Heritability of longitudinal changes in coronary-heart-disease risk factors in women twins. Am J Hum Genet 60:1502–1512
Article CAS Google Scholar
Genetic epidemiologic studies on age-specified traits (2000) NIA aging and genetic epidemiology working group. Am J Epidemiol 152:1003–1008
Article Google Scholar
Gottlieb DJ, Wilk JB, Harmon M, Evans JC, Joost O, Levy D, O’Connor GT, Myers RH (2001) Heritability of longitudinal change in lung function. The Framingham study. Am J Respir Crit Care Med 164:1655–1659
Article CAS Google Scholar
Hansel NN, Ruczinski I, Rafaels N, Sin DD, Daley D, Malinina A, Huang L, Sandford A, Murray T, Kim Y et al (2013) Genome-wide study identifies two loci associated with lung function decline in mild to moderate COPD. Hum Genet 132:79–90
Article Google Scholar
Hoffmann TJ, Theusch E, Haldar T, Ranatunga DK, Jorgenson E, Medina MW, Kvale MN, Kwok PY, Schaefer C, Krauss RM et al (2018) A large electronic-health-record-based genome-wide study of serum lipids. Nat Genet 50:401–413
Article CAS Google Scholar
Imboden M, Bouzigon E, Curjuric I, Ramasamy A, Kumar A, Hancock DB, Wilk JB, Vonk JM, Thun GA, Siroux V et al (2012) Genome-wide association study of lung function decline in adults with and without asthma. J Allergy Clin Immunol 129:1218–1228
Article Google Scholar
John C, Soler Artigas M, Hui J, Nielsen SF, Rafaels N, Pare PD, Hansel NN, Shrine N, Kilty I, Malarstig A et al (2017) Genetic variants affecting cross-sectional lung function in adults show little or no effect on longitudinal lung function decline. Thorax 72:400–408
Article Google Scholar
Kim Y, Lee Y, Lee S, Kim NH, Lim J, Kim YJ, Oh JH, Min H, Lee M, Seo HJ et al (2015) On the estimation of heritability with family-based and population-based samples. Biomed Res Int 2015:671349
PubMed PubMed Central Google Scholar
Kim SJ, Lee J, Park YS, Lee CH, Yoon HI, Lee SM, Yim JJ, Kim YW, Han SK, Yoo CG (2016) Age-related annual decline of lung function in patients with COPD. Int J Chron Obstr Pulm Dis 11:51–60
CAS Google Scholar
Kim Y, Han BG, KoGESg (2017) Cohort profile: The Korean Genome and Epidemiology Study (KoGES) Consortium. Int J Epidemiol 46:1350
Article CAS Google Scholar
Kumar SK, Feldman MW, Rehkopf DH, Tuljapurkar S (2016) Correction for Krishna Kumar et al., Limitations of GCTA as a solution to the missing heritability problem. Proc Natl Acad Sci USA 113:813
Article Google Scholar
Ni G, Moser G, Schizophrenia Working Group of the Psychiatric Genomics CSH, Wray NR, Lee SH (2018) Estimation of genetic correlation via linkage disequilibrium score regression and genomic restricted maximum likelihood. Am J Hum Genet 102:1185–1194
Article CAS Google Scholar
Nishimura M, Makita H, Nagai K, Konno S, Nasuhara Y, Hasegawa M, Shimizu K, Betsuyaku T, Ito YM, Fuke S et al (2012) Annual change in pulmonary function and clinical phenotype in chronic obstructive pulmonary disease. Am J Respir Crit Care Med 185:44–52
Article Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ et al (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575
Article CAS Google Scholar
Robinson MR, English G, Moser G, Lloyd-Jones LR, Triplett MA, Zhu Z, Nolte IM, van Vliet-Ostaptchouk JV, Snieder H, LifeLines Cohort S et al (2017) Genotype-covariate interaction effects and the heritability of adult body mass index. Nat Genet 49:1174–1181
Article CAS Google Scholar
Sandoval-Motta S, Aldana M, Martinez-Romero E, Frank A (2017) The human microbiome and the missing heritability problem. Front Genet 8:80
Article Google Scholar
Song YE, Lee S, Park K, Elston RC, Yang HJ, Won S (2018) ONETOOL for the analysis of family-based big data. Bioinformatics 34:2851–2853
Article CAS Google Scholar
Spracklen CN, Chen P, Kim YJ, Wang X, Cai H, Li S, Long J, Wu Y, Wang YX, Takeuchi F et al (2017) Association analyses of East Asian individuals and trans-ancestry analyses with European individuals reveal new loci associated with cholesterol and triglyceride levels. Hum Mol Genet 26:1770–1784
Article CAS Google Scholar
Tang W, Kowgier M, Loth DW, Soler Artigas M, Joubert BR, Hodge E, Gharib SA, Smith AV, Ruczinski I, Gudnason V et al (2014) Large-scale genome-wide association studies and meta-analyses of longitudinal change in adult lung function. PLoS One 9:e100776
Article Google Scholar
van de Pol M, Verhulst S (2006) Age-dependent traits: a new statistical model to separate within- and between-individual effects. Am Nat 167:766–773
Article Google Scholar
van Dongen J, Willemsen G, Chen WM, de Geus EJ, Boomsma DI (2013) Heritability of metabolic syndrome traits in a large population-based sample. J Lipid Res 54:2914–2923
Article Google Scholar
Visscher PM, Medland SE, Ferreira MA, Morley KI, Zhu G, Cornes BK, Montgomery GW, Martin NG (2006) Assumption-free estimation of heritability from genome-wide identity-by-descent sharing between full siblings. PLoS Genet 2:e41
Article Google Scholar
Yang J, Manolio TA, Pasquale LR, Boerwinkle E, Caporaso N, Cunningham JM, de Andrade M, Feenstra B, Feingold E, Hayes MG et al (2011) Genome partitioning of genetic variation for complex traits using common SNPs. Nat Genet 43:519–525
Article CAS Google Scholar
Yang J, Zeng J, Goddard ME, Wray NR, Visscher PM (2017) Concepts, estimation and interpretation of SNP-based heritability. Nat Genet 49:1304–1310
Article CAS Google Scholar
Young RP, Hopkins R, Eaton TE (2007) Forced expiratory volume in one second: not just a lung function test but a marker of premature death from all causes. Eur Respir J 30:616–622
Article CAS Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the support given by J.G, Sangchul Park, and all the subjects who have participated in this study. This study was supported by the Bio-Synergy Research Project (NRF-2017M3A9C4065964) of the Ministry of Science, ICT and Future Planning through the National Research Foundation, by Basic Science Research Program through the NRF of Korea funded by the Ministry of Education (NRF-2016R1D1A1A09919610) and by the Industrial Core Technology Development Program (20000134) funded by the Ministry of Trade, Industry and Energy (MOTIE, Korea).

Author information

Authors and Affiliations

Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
Donghe Li & Sungho Won
Biology Department, Morrissey College of Arts and Sciences, Boston College, Boston, MA, USA
Hahn Kang
Department of Medical Consilience, Graduate School, Dankook University, Yongin, Republic of Korea
Sanghun Lee
Institute of Health and Environment, Seoul National University, Seoul, Republic of Korea
Sungho Won
Department of Public Health Science, Graduate School of Public Health, Seoul National University, 1 Kwanak-ro Kwanak-gu, Seoul, 151-742, Republic of Korea
Sungho Won

Authors

Donghe Li
View author publications
You can also search for this author in PubMed Google Scholar
Hahn Kang
View author publications
You can also search for this author in PubMed Google Scholar
Sanghun Lee
View author publications
You can also search for this author in PubMed Google Scholar
Sungho Won
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

DL contributed to analysis and drafting of the article. SW and DL contributed to the study conception and design. SW, HK and SL contributed to critical revision of the paper. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Sanghun Lee or Sungho Won.

Ethics declarations

Conflict of interest

Donghe Li, Hahn Kang, Sanghun Lee and Sungho Won declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 2019 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, D., Kang, H., Lee, S. et al. Progressive effects of single-nucleotide polymorphisms on 16 phenotypic traits based on longitudinal data. Genes Genom 42, 393–403 (2020). https://doi.org/10.1007/s13258-019-00902-x

Download citation

Received: 16 November 2019
Accepted: 02 December 2019
Published: 04 January 2020
Issue Date: April 2020
DOI: https://doi.org/10.1007/s13258-019-00902-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Progressive effects of single-nucleotide polymorphisms on 16 phenotypic traits based on longitudinal data

Abstract

Background

Objective

Methods

Results

Conclusion

Similar content being viewed by others

A toolkit for quantification of biological age from blood chemistry and organ function test data: BioAge

The Emerging Theoretical Framework of Life Course Health Development

Examining the relationship between genetic polymorphisms (BDKRB2, GNB3, HIF1A, MCT1, NOS3) and endurance athlete status

Introduction

Materials and Methods

Ethics approval and consent to participate

Korea Associated Resource (KARE) cohort data

Genotypes

Data availability

Determination of phenotypic averages and longitudinal changes in each subject

Estimation of heritability

GWAS analysis

Results

Estimation of heritability

Genome-wide association studies

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Electronic supplementary material

Supplementary material 1 (DOCX 2019 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation