Introduction

The central role of mitochondria in metabolism has led to much effort to uncover their potential role in the metabolic syndrome and type 2 diabetes mellitus. The primary role of mitochondria is to produce energy in the form of ATP by oxidative phosphorylation. The mitochondrial DNA (mtDNA) encodes 13 proteins, all of which are subunits of the energy-generating oxidative phosphorylation (OXPHOS) complexes. Several lines of evidence suggest that mtDNA is central to the aetiology of diabetes. Defects in OXPHOS are nearly always found in type 2 diabetes, particularly in the aged and in offspring of type 2 diabetes patients [1, 2]. Genes involved in OXPHOS are also expressed at lower levels in type 2 diabetes patients than healthy controls [3, 4]. The link between glucose recognition and insulin secretion by pancreatic beta cells depends on metabolic changes and signal generation within mitochondria [5]. It has been noted that fewer mitochondria can be seen microscopically in diabetes patients than in controls, and those that can be seen are smaller than normal [6]. Mitochondrial variants have been indirectly linked with diabetes by the observation that there is a bias towards maternal transmission of diabetes, which increases with mean age of onset [7].

More direct evidence comes from the observation that there are more than 40 diseases caused by mtDNA mutations that involve hyperglycaemia [8]. The A3243G mutation, when found in only a small fraction of mitochondria in the cell is known to cause maternally inherited diabetes and deafness [9]. This is the most common known molecular cause of diabetes worldwide, accounting for 1% of all cases of type 2 diabetes. This mutation is in a mitochondrial leucine tRNA. As the fraction of mutant mitochondria increases, so does the severity of the associated metabolic disease, with mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS) being the most severe. Mitochondria may also play a role in the metabolic syndrome, as evidenced by a mutation in the tRNA ILE gene (also known as MT-TI) at nucleotide position 4,291 causing hypertension, hypercholesterolaemia and hypomagnesaemia [10]. A recent study using rats with identical nuclear genomes but differing mitochondria showed that mitochondrial variants have a direct effect on a number of metabolic phenotypes, including triacylglycerol levels [11].

The large amount of evidence for mitochondrial variants contributing to diabetes risk has led to a number of association studies being undertaken. These have included both case–control studies of type 2 diabetes and studies on metabolic traits that are known risk factors for diabetes. More than 25 mitochondrial variants have been posited as being associated with type 2 diabetes [7, 12], but in general there have been conflicting findings regarding potential associations. A large study in the UK population using 500 cases and controls showed an association with the T16189C polymorphism, which is found at a frequency of 6–10% in Europe [13]. This finding was replicated in the Asian population by two separate studies [14, 15]. However, a meta-analysis of studies in European populations suggested it was unlikely that the region containing the mt16189 polymorphism played a role in increasing susceptibility to type 2 diabetes [16]. One of the largest studies to date, carried out in the Finnish population, confirmed the association with mt16189, and also reported a weak association with haplogroup J [17]. This is the first reported association with a major haplogroup. Some earlier studies reported no associations with major haplogroups, but suffered from small sample sizes, particularly for certain haplogroups [18]. The association with haplogroup J was supported by a study in a white Brazilian population, which also found an association with haplogroup T [19]. More recently, there has been a report that haplogroup N9a confers resistance to type 2 diabetes in Asians [20]. In the largest association study to date [21], an association between both diabetes disease status and related metabolic traits and common European mitochondrial variants was tested for in several populations of European origin. The only significant association found for increased risk of diabetes was mt12612 in the Polish sample. No significant associations were found for any of the metabolic traits in non-diabetic individuals. Previous mitochondrial association studies of type 2 diabetes and the metabolic syndrome are summarised in Table 1. It should be noted that this table is not an exhaustive list of mitochondrial association studies and that further small-scale studies have been carried out.

Table 1 Results from previous mitochondrial association studies of type 2 diabetes and the metabolic syndrome

A previous association study [21] was, to the best of our knowledge, the first to look at the potential role of mitochondria in quantitative metabolic traits related to type 2 diabetes. Our study examines whether common mitochondrial variants contribute to variation in a larger number of biochemical traits in healthy adolescent twins. The traits examined include plasma lipids, HDL-cholesterol and triacylglycerol, which are known risk factors for cardiovascular disease and the metabolic syndrome [22]. Twin studies have shown these traits to be highly heritable, with higher estimates in younger samples than older ones [23]. However, the onset of cardiovascular disease usually begins later in life, and so our study provides an opportunity to study normal variation in these traits in premorbid adolescents. In addition, the concentration of uric acid, ferritin and a number of liver enzymes (γ-glutamyltransferase [GGT] and alanine aminotransferase [ALT]) that are risk factors for obesity, cardiovascular disease, diabetes and the metabolic syndrome were examined. Uric acid has been shown to be associated with the metabolic syndrome [24], while serum GGT and ALT, which have been shown to be correlated [25], have been associated with cardiovascular disease [26, 27], hypertension and diabetes [27, 28]. The sample was also measured for weight and height, and hence BMI. We believe this to be the first mitochondrial association study of quantitative traits to be undertaken using family data and we present herein a method, which has been implemented in the freely available program Sib-Pair, for carrying out such a study.

Methods

Sample

Families consisted of adolescent twins and their siblings who had been recruited as part of two overlapping studies into the genetic and environmental influences on cognition [29] and melanoma risk factors [30]. Twins between the ages of 12 and 16 years and their families were recruited by contacting principals of schools in the Brisbane area and through media appeals. For the melanoma study, twins were recruited as close as possible to their 12th and 14th birthdays, and for the cognition study, twins were measured as close as possible to their 16th birthday. For all of the studies, participants gave informed consent to the questionnaire, interview and collection of blood. The studies were approved by the Queensland Institute of Medical Research Human Research Ethics Committee.

Table 2 gives the breakdown of the number of individuals measured at each time point and the overlap between the groups. The sample has been shown to be representative of the general Queensland population with respect to cognition [29] and skin mole counts [31]. Further information on the sample, including recruitment strategy, have been described previously [31, 32].

Table 2 Breakdown of zygosity by age group and overlap between groups

A total of 2,484 individuals were measured on at least one visit. Of those who had been measured at least once, 2,179 (87.72%) had been genotyped for a set of 69 mitochondrial-tagging single-nucleotide polymorphisms (SNPs). The sample was divided into three separate age groups that corresponded with the time of the visit. The first group contained all individuals <13 years of age on their first visit, the second group contained all those <15 years but >13 years at the time of their first or second visit, and the final group contained all remaining individuals. A further analysis was carried out using the average of an individual’s measurements if they had been measured on more than one occasion. To reduce bias due to their being greater numbers of individuals in particular age groups, and different means and SDs in each age group, the standardised residuals after adjusting for age and sex were averaged over the three age groups. Ancestry data were collected as part of the study, and a total of 24 families with non-European ancestry were removed from the analysis. A total of 908 families were included in the analysis

Genotyping

The details of the genotyping have been described elsewhere [33, 34], but briefly, Saxena et al. [21] identified 64 mtSNPs that tag all mitochondrial variants of >1% frequency in the European population with an r 2 of 0.8. For this study, 61 of the 64 tagSNPs were genotyped as well as an additional nine SNPs, which tag the three non-typed tagSNPs from Saxena et al. as well as further variation at r 2 > 0.8, that otherwise required the use of multi-SNP haplotypes using a method described in a previous study [33]. A common D-loop variant, mt16189, which has previously been implicated as being associated with type 2 diabetes was also genotyped. This gave a total of 71 SNPs, but two were found to be monomorphic in our sample, leaving 69 SNPs in the final analysis. SNPs were typed using iPLEX Gold chemistry (Sequenom, San Diego, CA, USA) and analysed using a Sequenom MassARRAY Compact Mass Spectrometer. After PCR reactions, a two-step 200 short cycles program was used for a post-PCR extension reaction. The iPLEX reaction products were spotted on a SpectroChip (Sequenom) and data were processed and analysed by MassARRAY TYPER 3.4 software (Sequenom).

All SNPs had a high call rate (>95%). There were 62 (0.03%) ‘heterozygous’ calls, indicating that there is little heteroplasmy in the sample. Some of those calls could be attributed to genotyping error. Heterozygous calls were treated as missing for the purposes of this study. Any individual with >5% missing genotypes (two individuals) was removed from the analysis.

The maternal inheritance pattern of mitochondria means that when a mutation occurs, it is transmitted in a particular background of variants and therefore is linked to all of them. These sets of linked variants are known as haplogroups. It has been previously shown by us [33] and others [21] that the tagging SNPs genotyped here can be used to tag all the major European haplogroups. Using a linear discriminant function analysis method described by us previously [33], each haplotype was assigned to a haplogroup. Because of missing genotypes, a small number of haplotypes (<1%) could not unambiguously be assigned to haplogroups.

Traits

The sample was analysed for a total of nine traits. Seven of these were measures of the concentration of a number of biochemical markers: non-fasting glucose blood level, and serum HDL-cholesterol, GGT, triacylglycerol, ALT, uric acid and ferritin. Height and BMI were also tested for association. Further information regarding the measurement of these variables have been described [25, 35]. Participants were not instructed to fast before measurements were taken. This might have affected the measurements of some traits, but most particularly blood glucose and HDL-cholesterol, which can be affected by the amount of carbohydrate and fat, respectively, consumed in a recent meal. As such, the variance of these traits will be affected by the variability in time to the last meal of the individuals. All traits that deviated from normality (ALT, GGT, ferritin and BMI) were log10 transformed to ensure they had a normal distribution. Individuals with a z score >3.5 were excluded from the analysis for that trait. The number of individuals excluded for a particular trait in a specific age group ranged from 0 (height and BMI in 12 year age group) to 11 (ALT in the 16 year age group).

Analysis

Simple regression of a trait on mitochondrial genotype is not suitable when using family data due to nuclear genetic sharing between family members. To account for the nuclear effects, a linear mixed model is used to model the inheritance of both mitochondrial and nuclear genetic effects. The full model is

$$ {\text{y}} = \mu + \beta {\text{x}} + {\text{Zu}} + {\text{e}} $$

where y is a vector of phenotypes and μ is the mean. The mitochondrial inheritance is modelled as a fixed effect (β), where x is a vector of mitochondrial genotype indicators and β is a scalar containing an estimate of the effect of the mitochondrial variant. The nuclear genetic inheritance is modelled with a random effect Zu, where Z is a matrix relating individuals to phenotypes and u is a vector of nuclear additive polygenetic effects and e is a vector of residuals. The random effects u and e are assumed to be distributed as multivariate normal, where \( u\sim N\left( {0,A\sigma_a^2 } \right) \), and \( e\sim N\left( {0,I\sigma_e^2 } \right) \), where A is the standard additive genetic relationship matrix, and I is an identity matrix. The heritability can be estimated as \( {\sigma ^{2}_{a} } \mathord{\left/ {\vphantom {{\sigma ^{2}_{a} } {{\left( {\sigma ^{2}_{a} + \sigma ^{2}_{e} } \right)}}}} \right. \kern-\nulldelimiterspace} {{\left( {\sigma ^{2}_{a} + \sigma ^{2}_{e} } \right)}} \). The model above fits an AE variance components model (u is the additive genetic variance and e is the environmental variance) which assumes that the phenotypic variance in the sample can be explained by additive genetic variance (A) and unique environmental variance (E) (u represents additive genetic variance and e is the environmental variance in the full model). This model assumes that there is no variance due to family members, in particular twin pairs, sharing common environmental factors. If such common environmental variance does exist, it will be confounded with additive genetic variance, leading to an upward bias of heritabilities. The model was fitted using the program Sib-Pair (www.qimr.edu.au/davidD/index.html#sib-pair, accessed 28 April 2009) via the ‘mit’ command, and the effect of the mitochondrial variant was estimated using maximum-likelihood estimation.

Two SNPs were found to be monomorphic in the 12 and 14 year age groups, meaning that there were only 67 SNPs tested for association. The effective number of independent tests being carried out was estimated by calculating the linkage disequilibrium between all SNPs and their correlation with haplogroups using the SNPSpD program [36] and using the previously described method [37]. The estimated number of independent tests was 65. The number of independent traits, 8.1, was estimated using the previously described method [38]. Accordingly, the experiment-wide significance level was set at p = 6.85 × 10−4 (5% significance level with a Bonferroni correction for the 73 tests, 65 variants and eight traits, being carried out).

Results

The breakdown of and overlap between the groups is shown in Table 2. The number of individuals included in the analysis of each trait and the means and SDs for each trait in each age group are given in Table 3. The number of families included in each analysis is also presented. All traits were significantly different across the different age groups at the 0.001 level (paired t test), except for GGT, which was not significantly different between the 12 and 14 year age groups (p = 0.106), and ferritin (p = 0.132 for 14–16 year comparison).

Table 3 The total number of individuals analysed in each group

The heritability estimates for each trait in our sample are high (Table 4), due to the heritability being estimated under an AE model, and there are strong phenotypic correlation across the three measurement time points [25, 35, 39].

Table 4 Estimates of heritability of traits across the three different age groups

Table 5 shows the most significant associations for each trait in the 12, 14 and 16 year age groups. Between the three analyses of the different age groups, there was only one variant for one trait that was found to be significant after correction for multiple testing—mt14365 with triacylglycerol in the 12 year age group. This association was not found in the other two groups. The p values for the association between mt14365 and triacylglycerol in the two other age groups were 0.1416 and 0.3412, respectively. Figure 1 shows a QQ plot of the observed vs expected test statistic for the 77 tests performed for triacylglycerol levels in the 12 year age group. It shows that there is an increase in the test statistic at the high end of the distribution above that which would be expected by chance. This increase was not found in the other age groups. This SNP did have the lowest p value in the analysis of the average of the phenotypes. Several other variants that approached significance in one group were non-significant in other groups, indicating that they are most likely approaching significance by chance. QQ plots for all variants with each of the nine traits in the 12, 14 and 16 year age groups are shown in the Electronic supplementary material (ESM) Figs 1, 2 and 3, respectively. Excluding the increase noted in the 12 year group for triacylglycerol, the distribution of the test statistic follows the expected χ 2 distribution.

Table 5 The most significantly associated variant for each trait in each age group
Fig. 1
figure 1

QQ plot of observed vs expected distribution of test statistic for triacylglycerol in the 12 year age group

Haplogroup W was the most significantly associated variant with uric acid in both the 14 and 16 year groups, but the association is not significant after multiple testing. After correction for multiple testing, there appear to be no significant associations between any of the traits examined here and the major European haplogroups.

The frequencies of all SNPs and haplogroups, and their results with each trait in the analysis of the average of the phenotypes are given in ESM Table 1.

Discussion

We have carried out a family-based mitochondrial association study on a number of traits related to type 2 diabetes and the metabolic syndrome in adolescent twins. The maternal inheritance pattern of mitochondria means that mitochondrial variants do not segregate in families in the same way as autosomal variants, and all children in a family will carry the same mtDNA sequence. This means that a family-based design will have less power than an equivalent mitochondrial association study using unrelated individuals, although some of the loss of power will be offset due to the increased accuracy of trait measurement that is achieved when many family members with the same genotype are measured. In spite of this loss of power, there are some advantages in using family-based data in such a study.

For this study, we have genotyped all family members for whom phenotypic data were collected. However, given that all family members share the same mitochondrial genome, future studies would only require genotyping of one child per family, while the phenotypes of all siblings could be included. Therefore, for the same amount of genotyping, more power is obtained if the number of phenotypes is not a restriction.

Phenotypic similarities between family members will not only be due to sharing mitochondrial variants, but also because they share large portions of the nuclear genome in common, and this needs to be taken into account in looking for potential associations. Because the family resemblance is controlled for, the error variance is decreased, increasing the power of the study. Until now, mitochondrial association studies have focused on case–control data. However, many large family cohorts have been previously collected for linkage analysis and are now being genotyped using the latest SNP chips for association studies. The latest of these SNP chips contain a number of common mitochondrial variants. In order for a proper test of association with mitochondrial variants to be carried out, a method similar to the one described here would need to be used. The implementation of this methodology in Sib-Pair will allow such analyses to be carried out efficiently in the future.

In spite of the loss of optimal power because of relatedness, our study still has at least equivalent power to a study of just one individual per family. The overall analysis contains data from 908 families, so our study has almost 100% power to detect a hypothetical mitochondrial variant with a frequency of 1%, which explains 5% of the variance of a quantitative trait, and ~50% power to detect a variant explaining 1% of the variance [33].

We find no evidence to suggest that common European mitochondrial variants contribute to variation in metabolic traits in adolescent twins. We have typed the mt16189 polymorphism, haplogroup J and mt12612, all of which have been previously posited as potentially being associated with type 2 diabetes, but find no evidence of their playing a role in normal variation in metabolic traits. However, as the association with type 2 diabetes was not directly tested, the hypothesis that these polymorphisms, or other mitochondrial variants increase susceptibility cannot be rejected.

There were no strong associations across age groups; however, this is not inconsistent with previous reports, which have suggested that different genetic factors may act upon these phenotypes at different times [25, 35]. The increased number of large test statistics relative to the expected distribution for the association with triacylglycerol suggests that common mitochondrial variants may influence this trait in early adolescence. However, this result will need to be replicated in further studies with even larger sample sizes than this study. Such further studies may also lead to the discovery of associations with more modest effects than can be detected here.

The traits being analysed here were studied because they are found to be increased in individuals with the metabolic syndrome and diabetes. To the best of our knowledge, none of the individuals analysed in our adolescent sample suffered from these conditions. However, the incidence of these conditions increases with age and so the hypothesis that common mitochondrial variants contribute to disease risk cannot be rejected here. It is possible that onset of disease might be a result of gene–environment interactions that act over time and that certain mitochondrial variants predispose to disease only after certain environmental factors (e.g. diet) have acted over a period of time. Such an interaction may not be picked up in a study of adolescents. In addition, we have not examined genes encoding other proteins in the OXPHOS complex, or their potential interaction with mitochondrial variants in this study as potential contributors to variation in these traits.

It has previously been proposed that the gradual accumulation of mutations in mtDNA may contribute to age-related diseases such as diabetes [10]. Current mitochondrial association studies cannot test this hypothesis as they only test for the association between the consensus mitochondrial genotype in the tissue under study with a disease or quantitative phenotype. It has been shown that less heteroplasmy can be detected in blood DNA than other tissues [21, 40], and so even if there are mutant mitochondria predisposing to disease, they may not be detected. Another limitation of mitochondrial association studies is that they are potentially confounded by the nuclear background as there is significant cross-talk between the two systems. Some of the most convincing evidence for a role of mtDNA in metabolic phenotypes comes from studies of mice with identical nuclear genomes, but different mitochondrial genomes [11]. In this regard, twin studies may prove to be a valuable resource in trying to uncover the role of mitochondria in disease. As monozygotic twins have identical nuclear genomes, the detection of differences in mtDNA caused by proliferation of somatically mutant mtDNAs could potentially explain phenotypic discordance between co-twins. This approach was used by Detjen et al. [41] to test whether phenotypic variability between monozygotic twins in the autosomal dominant disorder neurofibromatosis type I was due to differences in mtDNA. Such a method, when applied to large-scale association studies of quantitative phenotypes could reduce the potential for confounders and provide more direct evidence of a role of mtDNA in common complex diseases.