Introduction

Type 2 diabetes is characterised by an elevation in blood glucose in the fasting state and/or following a glucose challenge resulting from insulin resistance and insufficient compensatory insulin secretion by pancreatic beta islet cells. Insulin action, as the insulin sensitivity index (S I), can be estimated from the frequently sampled IVGTT with minimal model. Other indices include the acute insulin response to glucose (AIRg, reflecting insulin secretion) and the disposition index (DI=S I×AIRg, measuring overall glucose homeostasis and taking account of the hyperbolic relationship between S I and insulin secretion). Glucose effectiveness (S G) represents an insulin-independent effect whereby glucose mediates its own disposal from plasma. Impairments in these insulin action and glucose metabolism indices are recognised as prediabetic phenotypes involving pathogenic development and pathogenetic processes of type 2 diabetes. Exercise training improves peripheral S I and S G in healthy human subjects [1], and significant improvements in S I, AIRg, DI and S G in response to 20 weeks of endurance exercise training have been observed and reported in the HERITAGE Family Study [2]. Recent investigations in HERITAGE provide further evidence that physiological training responses vary appreciably from person to person, and these individual differences are influenced by genetic factors [3].

Genome-wide linkage scans localising quantitative trait loci for type 2 diabetes and associated traits are numerous. However, there are no scans for such unique traits of glucose and insulin metabolism phenotypes in response to endurance exercise training, except for the baseline values [4] and fasting values [5] in HERITAGE. In this study, S I, AIRg, DI and S G derived from an IVGTT using the Minimal Model Millennium software [6] were obtained before and after 20 weeks of endurance exercise training. Training responses were defined as differences between post-training and baseline values.

We searched for quantitative trait loci that may harbour candidate genes influencing changes in S I, AIRg, DI and S G in response to endurance exercise training in sedentary and non-diabetic whites and blacks.

Subjects, materials and methods

HERITAGE was designed to investigate the role of the genotype in cardiovascular, metabolic and hormonal responses to aerobic exercise training, and the contribution of exercise training to changes in type 2 diabetes and cardiovascular disease risk factors. In general, subjects in the 20-week training programme were required to be in good health. All subjects had been sedentary at baseline for at least 6 months prior to study entry [7]. Age was ≤65 years for parents and ≥17 years for offspring. Subjects with uncontrolled endocrine and metabolic disorders or diabetes were excluded from participation. Resting blood pressure was ≤159/99 mm Hg. None of the subjects took medications to control hypertension or for dyslipidaemia. BMI had to be less than 40 kg/m2. However, several subjects with their BMI slightly higher than 40 kg/m2 were approved for participation by supervising physicians, because they were considered to be in good health and able to exercise. HERITAGE design, protocol, population, inclusion and exclusion criteria are described elsewhere [7].

A total of 441 subjects from 98 white families and 187 subjects from 90 black families had complete data. Sample size descriptions are given in Table 1 for whites and Table 2 for blacks. The Institutional Review Boards at five participating centres of HERITAGE approved the study protocol. Written informed consent was obtained from each participant.

Table 1 Basic characteristics of whites
Table 2 Basic characteristics of blacks

The training programme has been described elsewhere [8]. Each subject was administered a comprehensive battery of tests prior to the 20-week training programme, including an IVGTT. Subjects then completed 60 sessions of endurance exercise on cycle ergometers that were computer-controlled to maintain the subjects’ heart rates at levels associated with fixed percentages of their baseline maximal oxygen uptake. The full test battery, including the IVGTT, was administered again after completion of the training programme. The IVGTT protocol [9] has been detailed previously [4]. It was performed in the morning after overnight fasting for 12 h. Blood samples to measure plasma glucose and insulin were taken 1, 3, 5, 10, 15, 20, 30, 45, 60, 75, 90, 120, 150 and 180 min after the end of the glucose injection in the opposite arm.

Plasma insulin was measured by radioimmunoassay after polyethylene glycol separation [10]. Polyclonal antibodies that cross-react more than 90% with proinsulin, and presumably its conversion intermediates, were used [11, 12]. The intra- and interassay coefficients of variation were 7.7 and 10.3%, respectively. Plasma glucose was enzymatically determined using a reagent kit distributed by Diagnostic Chemicals (Oxford, CT, USA). AIRg was computed as the incremental integrated area under the insulin curve for the first 10 min of the IVGTT. S I, AIRg, DI and S G were all derived from the Minimal Model Millennium software [6].

S I, AIRg and DI exercise training responses were skewed and were approximately normalised using a square-root transformation. S I, AIRg, DI and S G exercise training responses were adjusted for the effects of age, age2, age3, BMI and their respective baseline values (to assess genetic determinants on these exercise training response phenotypes independently of the effects of their baseline levels) within each of the sex by generation groups, separately by race, in both the mean and the variance using a stepwise multiple regression procedure. For each of the regressions, only terms that were significant at the 5% level were retained. Finally, each of the adjusted variables was standardised (mean of 0, SD of 1) within sex×generation groups, separately by race.

PCR conditions and genotyping methods have been described previously [13]. Incompatibilities of Mendelian inheritance were checked, and markers showing incompatibilities (between 5 and 10% depending on marker) were retyped completely from PCR reaction to the genotyping. Microsatellite markers were mainly selected from the Marshfield panel version 8a (http://www.marshfieldclinic.org/geneticsresearch). Map locations in location database (LDB) composite units were derived from the Location Database of Southampton, UK (http://cedar.genetics.soton.ac.uk/public_html). The LDB units were obtained by integrating different types of data (genetic linkage maps, radiation hybrid maps, physical maps, cytogenetic data, and mouse homology) into a single map. The data were obtained from the Internet, published sources, and the Wessex Human Genetics Institute. Here, LDB units were used to perform linkage scans, and the physical distance in Mb, obtained from the National Center for Biotechnology Information physical map, build 34.3, was used to present scan results.

Multipoint linkage analyses were performed using the variance components model as implemented in the computer program SEGPATH [14, 15]. Under this model, a phenotype is under the influence of the additive effects of a trait locus (g), a residual familial background modelled as a pseudopolygenic component (G R), and a residual non-familial component (r). The effects of the trait locus and the pseudopolygenic component on the genotype are quantified by the heritabilities h g 2 and h r 2 respectively. Allele-sharing probabilities at each marker location for each sib pair were estimated using the multipoint approach in the computer program MAPMAKER/SIBS [16], and were entered into the SEGPATH model. Other parameters in the model include spouse resemblance (u), additional sibling resemblance (b) and the phenotype means and variances. The linkage hypothesis is tested by restricting h g 2=0. A likelihood ratio test contrasting the null versus the alternative hypotheses is asymptotically distributed as a 50:50 mixture of a χ 2 with 1 degree of freedom and a point mass at zero [17]. The lod score is computed as χ 2/(2×loge10).

Results

Means and SD for S I, AIRg, DI and S G exercise training responses, together with the baseline values, separately by sex and generation groups, are presented in Table 1 for whites and Table 2 for blacks. Favourable and significant (p<0.05) exercise training responses have been observed in both races and have been reported elsewhere [2]. Significant (p<0.05) racial differences in means of the exercise training response phenotypes were not found within each of the sex×generation groups.

In general, age accounted for little variance in the exercise training responses (<10% in DI [white sons], AIRg [black daughters] and S G [black mothers]). In contrast, the baseline values were significant predictors of the exercise training responses in both races for most groups, accounting for 20–50% of the variance (except baseline AIRg in white mothers and black men). After transformation and adjustment of the data, both skewness (0.34–0.66) and kurtosis (−0.10 to 1.20) for all the exercise training response phenotypes across races became acceptable.

The mean (±SD) heterozygosities for 654 markers used in this study were 0.72±0.15 in whites and 0.76±0.16 in blacks, with an average intermarker spacing of 2.6 Mb. Residual genetic heritabilities for all the four training response phenotypes, as expected in dissecting other complex traits, were modest to moderate, the estimates ranging from 10% in whites to 20–30% in blacks. All linkage results with lod scores ≥1.0 are summarised in Table 3. They include cytogenetic regions of 1p, 1q, 2p, 3q, 6p, 7q, 10p, 10q, 12q, 13q, 15q, 18q, 19q, 20q and 22q. All autosomal linkage scan results are presented for whites in Fig. 1a and for blacks in Fig. 1b. In addition, linkage analysis results on chromosome 6 are depicted in Fig. 2a for whites and Fig. 2b for blacks; those on chromosome 7 are depicted in Fig. 3a for whites and those on chromosome 19 are depicted in Fig. 3b for blacks. The strongest linkage is on 19q (54–62 Mb) for S G exercise training response in blacks with promising lod scores ≥1.75 [18] at four markers. These markers include the human skeletal muscle glycogen synthase (GYS1) gene locus (54 Mb, lod score 1.8), D19S601 (57 Mb, lod score 2.1), D19S589 (58 Mb, lod score 2.0) and D19S254 (62 Mb, lod score 3.1).

Table 3 Summary of multipoint linkage scan results with lod scores ≥1.0 in whites and blacks
Fig. 1
figure 1

Multipoint linkage scan results on all autosomes for S I (green circles), AIRg (black dots), DI (blue stars) and S G (red triangles) exercise training responses among whites (a) and blacks (b). Lod score of horizontal reference line is 1.0 for interesting linkages

Fig. 2
figure 2

Multipoint linkage analysis results on chromosome 6 for S I (circles), AIRg (dots), DI (stars) and S G (triangles) exercise training responses in whites (a) and blacks (b)

Fig. 3
figure 3

Multipoint linkage analysis results on chromosome 7 for S I (circles), AIRg (dots), DI (stars) and S G (triangles) exercise training responses in whites (a) and on chromosome 19 for these phenotypes in blacks (b)

Discussion

The strongest linkage evidence was found on 19q13 in the GYS1 gene locus to D19S254 (54–62 Mb) for S G exercise training response in blacks (but not in whites), with a maximum lod score of 3.1. HERITAGE and several other studies previously reported linkages in this region for type 2 diabetes and associated risk factors. At D19S589 (58 Mb, lod score 2.0, for S G exercise training response), there are at least four findings in the literature that are in agreement. They include linkages for a principal component score (lod score 2.1, primarily percentage body fat, plasma HDL cholesterol and triglycerides) and for fasting insulin (lod score 1.6), both in HERITAGE whites [5, 19], for familial combined hyperlipidaemia in Dutch families (lod score 1.3) [20], and for 2-h glucose in Pima Indians (lod score 1.1) [21]. Moreover, linkages in this region have also been reported in the Framingham Offspring Study (D19S178, 49 Mb, lod score 1.8) for 2-h insulin, in Finnish families (D19S412-D19S867, 52–55 Mb, highest lod score 2.8) for an empirical insulin-resistance index and in multiplex familial type 2 diabetes families (D19S178; GSY1 gene locus, 49–54 Mb, highest lod score 3.2) for plasma triglycerides [2224]. Recently, two groups reported linkages at this region for type 2 diabetes, one in a subset of the Japanese population with age at diagnosis younger than 45 years (D19S571, 58 Mb, lod score 1.7), and the other one in a Dutch population (D19S246–D19S601, 56–57 Mb, highest lod score 1.3) [25, 26]. Follow-up studies on 19q13 to search for type 2 diabetes susceptibility genes or genes linked to glucose–insulin–lipid metabolism, particularly in response to endurance exercise training, are strongly justified.

S G represents the ability of glucose itself to mediate its own disposal from plasma, independently from the dynamic changes due to basal insulin. It involves both the autoregulatory effect of hepatic glucose output and peripheral glucose utilisation. Although S G is different from S I, it is also important in the assessment of glucose tolerance [27]. Decreased S G levels may predict the development of type 2 diabetes; controversially, others [28] found an opposite association of increased S G levels with type 2 diabetes using 20 normoglycaemic first-degree relatives of type 2 diabetes patients against 20 matched controls. The storage of glucose as glycogen in human skeletal muscles is frequently impaired in patients with type 2 diabetes and their non-diabetic relatives, partially because of impaired activation of the GYS1 gene by insulin [29]. The GYS1 gene, here linked with S G exercise training response, encodes glycogen synthase, a key enzyme in glucose storage. GYS1 associations with type 2 diabetes and insulin resistance have been suggested [30] but not confirmed [31]. Insulin stimulates GYS1 mRNA expression, and impaired stimulation of GYS1 gene expression by insulin in type 2 diabetes is thought to be secondary to chronic hyperglycaemia [32]. Exercise of large muscle masses is one of the most efficacious modes to prevent the development of insulin resistance by reducing skeletal muscle stores of glycogen and triglycerides [33]. Studies designed to clarify the effects of sequence variation in the GYS1 gene on S G exercise training response are practicable. Finally, and cautiously, given that S I is associated insulin activation of muscle GYS1 activity, the fact that our linkage finding at the GYS1 gene locus for S G but not for S I in blacks but not in whites suggests that validations and replications from other independent studies should be strongly encouraged.

Interestingly, all nominal linkages especially found in whites were for the DI exercise training response. DI, S I and AIRg are non-independent traits. DI is a measure reflecting the balance between insulin secretion and insulin action on the disposition of glucose. In other words, DI is an index of insulin secretion adjusted for insulin sensitivity. Thus, finding stronger results for DI than for AIRg alone should not be surprising. The best replicated region across races for DI exercise training response was 6p, with a lod score of 1.2 at D6S299 (24 Mb) in whites and of 1.0 at D6S2239 (26 Mb) in blacks. Previously, an interesting linkage in this region was found for baseline S I (D6S2439, 27 Mb, lod score 1.7) in HERITAGE [4]. We propose that a quantitative trait locus in this region may harbour gene(s) that influence DI exercise training response with pleiotropic effects on baseline S I. Two previous studies reported quantitative trait loci at D6S276 (24 Mb, lod score 2.1) for type 2 diabetes in Chinese Hans [34] and D6S273 (32 Mb, lod score 1.8) for fasting glucose in Pima Indians [21]. Candidate genes surrounding this locus include the TNFα (32 Mb), BAT2 (32 Mb), HLA-DQB1 (33 Mb), PPARD (35 Mb) and GLP1R (39 Mb) genes. TNF-α is an inflammatory cytokine with a wide range of anti-tumour and immune functions. Its role in the development of insulin resistance has been established [35]. The BAT2 gene is within the human major histocompatibility complex class III region. Its product is associated with the age at onset of type 1 diabetes, and possibly with the inflammatory process of pancreatic beta cell destruction during the development of type 1 diabetes. The HLA-DQB1 gene product specifies the autoimmune response against insulin-producing islet cells that leads to type 1 diabetes. Peroxisome proliferator-activated receptor (PPAR)-δ is ubiquitous, and belongs to one of the transcription factors of the superfamily of nuclear receptors. PPARs are major regulators of lipid and glucose metabolism and insulin sensitivity [36]. Glucagon-like peptide 1, a hormone derived from the preproglucagon molecule, and secreted by intestinal L cells, is the most potent stimulator of glucose-induced insulin secretion. It has potential for the treatment of type 2 diabetes because of its glucose-dependent insulinotropic and glucagonostatic properties [37]. Though the magnitude of the linkage signals on 6p is not striking, consistencies across whites and blacks and across several other cohorts at this locus warrant further studies.

Among the linkages on 1p, 3q, 7q, 10p and 12q in whites and 1p, 2p, 10, 12q, 13q, 15q, 18q, 20q and 22q in blacks, that on 7q at the PON1 gene locus D7S821 (95–96 Mb, highest lod score 1.4) for DI exercise training response in whites was supported by two previous findings. They include one at the PON1 gene locus (lod score 1.4) for fasting insulin exercise training response in HERITAGE whites [4], and one at D7S1799 (103 Mb, Z score 1.9) for type 2 diabetes in Pima Indian sib pairs [38].

In conclusion, our genomic scans for glucose and insulin metabolism phenotypes in response to endurance exercise training yielded three quantitative trait loci of interest. A promising locus for S G exercise training response was identified on 19q13 at the GYS1 locus; the GYS1 gene regulates glycogen storage in skeletal muscles. Two possible loci on 6p and 7q were captured for DI exercise training responses, accompanied by consistent findings for relevant traits across other studies.