Long term conservation of human metabolic phenotypes and link to heritability

Yousri, Noha A.; Kastenmüller, Gabi; Gieger, Christian; Shin, So-Youn; Erte, Idil; Menni, Cristina; Peters, Annette; Meisinger, Christa; Mohney, Robert P.; Illig, Thomas; Adamski, Jerzy; Soranzo, Nicole; Spector, Tim D.; Suhre, Karsten

doi:10.1007/s11306-014-0629-y

Long term conservation of human metabolic phenotypes and link to heritability

Original Article
Open access
Published: 26 February 2014

Volume 10, pages 1005–1017, (2014)
Cite this article

Download PDF

You have full access to this open access article

Metabolomics Aims and scope Submit manuscript

Long term conservation of human metabolic phenotypes and link to heritability

Download PDF

Noha A. Yousri^1,2,
Gabi Kastenmüller³,
Christian Gieger⁴,
So-Youn Shin^9,11,
Idil Erte¹⁰,
Cristina Menni¹⁰,
Annette Peters⁵,
Christa Meisinger⁵,
Robert P. Mohney⁶,
Thomas Illig⁷,
Jerzy Adamski⁸,
Nicole Soranzo⁹,
Tim D. Spector¹⁰ &
…
Karsten Suhre^1,3

3442 Accesses
53 Citations
1 Altmetric
Explore all metrics

Abstract

Changes in an individual’s human metabolic phenotype (metabotype) over time can be indicative of disorder-related modifications. Studies covering several months to a few years have shown that metabolic profiles are often specific for an individual. This “metabolic individuality” and detected changes may contribute to personalized approaches in human health care. However, it is not clear whether such individual metabotypes persist over longer time periods. Here we investigate the conservation of metabotypes characterized by 212 different metabolites of 818 participants from the Cooperative Health Research in the Region of Augsburg; Germany population, taken within a 7-year time interval. For replication, we used paired samples from 83 non-related individuals from the TwinsUK study. Results indicated that over 40 % of all study participants could be uniquely identified after 7 years based on their metabolic profiles alone. Moreover, 95 % of the study participants showed a high degree of metabotype conservation (>70 %) whereas the remaining 5 % displayed major changes in their metabolic profiles over time. These latter individuals were likely to have undergone important biochemical changes between the two time points. We further show that metabolite conservation was positively associated with heritability (rank correlation 0.74), although there were some notable exceptions. Our results suggest that monitoring changes in metabotypes over several years can trace changes in health status and may provide indications for disease onset. Moreover, our study findings provide a general reference for metabotype conservation over longer time periods that can be used in biomarker discovery studies.

Instability of personal human metabotype is linked to all-cause mortality

Article Open access 28 June 2018

Whole-Genome Sequencing Analysis of Human Metabolome in Multi-Ethnic Populations

Article Open access 30 May 2023

A comprehensive study of metabolite genetics reveals strong pleiotropy and heterogeneity across time and context

Article Open access 21 October 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The ‘omics’ field has facilitated measuring thousands of biological entities (e.g., genes, proteins, mRNA transcripts, and metabolites) with the aim of detecting correlations between them or their possible association to a disease phenotype. Metabolomics aims at measuring the concentrations of small molecules or metabolites in a given biological fluid such as plasma, urine, saliva, and breath (Fiehn 2002; Lindon et al. 2006; Martinez-Lozano Sinues et al. 2013; Zhang et al. 2012). Today, with advances in biotechnology, modern mass spectrometry has allowed for comprehensive measurement of many endogenous metabolites in a biological fluid. Non-targeted metabolomics approaches enable the measurement of several hundred to a thousand or more metabolites from a variety of different metabolic pathways (Kettunen et al. 2012; Suhre et al. 2011a). Such technologies facilitate detailed metabolic profiles to an extent that has not been accessible before. The metabolites that are measured for each individual characterize the human metabolic phenotype (or “metabotype”), which is defined as “a probabilistic multi-parametric description of an organism in a given physiological state based on the analysis of its cell types, biofluids or tissues” (Gavaghan et al. 2000). Human metabotypes thus represent a comprehensive readout of the biological state of the human body and have been associated with a number of human disorders (Amara and Standaert 2013; Fiehn 2002; Halama et al. 2013; Holmes et al. 2008a, b; Kaddurah-Daouk et al. 2008; Langley et al. 2013; Suhre et al. 2010, 2011a; Wang et al. 2013).

Metabolomics provides information about the joint effects of both environmental and lifestyle factors (such as dietary patterns) and genomics. Changes in the metabolome can be described by the activity of genes, enzymes, and proteins that can be associated with conventional or new target therapies (Corona et al. 2012). This application is relevant for identifying disease biomarkers from metabolites, as can be possible in diabetes, obesity, and cancers (Griffin and Shockcor 2004; Suhre et al. 2010; Menni et al. 2013) where alterations of metabolite concentrations can provide early evidence of disease onset (Assfalg et al. 2008). Thus, pharmaco-metabolomics, which combines metabolite profiling and chemometrics to model and predict the efficacy of drug intervention in individuals, has benefited from this technology (Corona et al. 2012). The field of nutrigenomics, in which appropriate dietary choices are sought to avoid metabolic imbalances leading to disease, presents another motivation for profiling individual metabolites (German et al. 2004). However, such fields mainly depend on the “individuality” or “uniqueness” of human metabolic profiles because treatment, diet, or drug selection is specific for each individual based on a specific profile and one dietary plan can work optimally for an individual but may predispose others to disease (German et al. 2004).

Although a large day-to-day variability has been shown in individual metabotypes (Krug et al. 2012; Dallmann et al. 2012), the metabotype tends to be characteristic of each person. Both genetic differences and environmental factors play a role in such “individuality” (Assfalg et al. 2008; Bernini et al. 2009). The variability in an individual could be due to “diurnal changes (Walsh et al. 2006), hormonal status (Bollard et al. 2005), and stage in the menstrual cycle for women of reproductive age (Wallace et al. 2010)” while differences that characterize individual metabotypes could be due to factors such as “gender, age, and adiposity (Gu et al. 2009; Kochhar et al. 2006; Rasmussen et al. 2011; Winnike et al. 2009) or from less well-characterized habitual dietary patterns and other environmental and cultural influences (Lenz et al. 2003; Holmes et al. 2008a, b)” (Fave et al. 2011). (Heinzmann et al. 2012) indicated that, regardless of dietary patterns, each individual has a core metabolic fingerprint, influenced by a combination of many factors such as host metabolism, gut microbiota composition, dietary habits, physical activity, and body composition. Several studies have considered this individuality; for example, (Sampson et al. 2013) studied within-subject and between-subject variability and identified metabolites with high within-subject variability that can be used to distinguish among individuals/metabotypes.

Rather than being only characteristic to each individual at any given time point, metabotypes should be monitored for their persistence to a specific individual, or “conservation” over time. Assfalg et al. (2008), and Bernini et al. (2009) observed clusters of metabotypes of the same individual taken at different time points, which resulted from higher variability in inter-individual profiles compared to intra-individual profiles. Martinez-Lozano Sinues et al. (2013) observed the persistence of individual signatures over time in some biofluids, such as breath. However, the time intervals considered by such studies tend to be relatively short. For example, (Saude et al. 2007) studied the variability in human urine over of a period of 1 month, and (Nicholson et al. 2011) investigated the stable components in plasma and urine metabolites over a period of 4 months. In general, studies have monitored metabolic profiles for short periods, ranging from a few months and up to 2 or 3 years. However, the persistence of the individuality of metabotypes over longer time periods remains unexplored.

Metabolites shape the persistence of metabotype individuality over time, and identifying the degree of stability of a metabolite over long periods is thus necessary, in particular for biomarker discovery studies; biomarker concentration should not vary too much over the short term within an individual because such variation would undermine the predictive association from a single sample (Nicholson et al. 2011). In addition, such markers should not be completely heritable if environmental factors significantly influence disease risk. The predictive power of a biomarker has been speculated to be nested within the metabolite’s longitudinally stable component, yielding an intriguing question about the stability component of a metabolite versus its heritability component (Nicholson et al. 2011). However, as noted, only short periods have been used for such studies, and the challenge is that (Nicholson et al. 2011) expected a gradual smooth decay in stable behavior of metabolites with an increasing time scale from months to years.

We study the conservation of individual metabolic profiles or metabotypes over a long term while considering the effect of metabolite conservation (also referred to as stability). Note that we use the term “longitudinal study” in the narrow sense of a study that considers only two points in time for every individual. We also address the question of whether metabolite stability decays over time by studying the conservation of metabolites. We further examine the relation of heritability estimates to metabolite conservation, to understand the metabolome. While previous studies covered time spans on the order of months or only 2 or 3 years, the present study is by far the first to expand research into a longitudinal study of metabotypes/metabolites to time spans of up to 7 years with information on metabolite heritability. Pearson correlation was used in this study to measure correlation of a metabolic profile to itself (intra-correlations), as well as correlations to other profiles (inter-correlations) at the two time points. Being also done for metabolites, testing the difference in distributions between intra-correlations and inter-correlations is an initial step of investigating conservation behavior. A measure derived from these correlations is the conservation index (see Sect. 2), which we have used to show that over an interval of 7 years, a large fraction of the population displays a high degree of metabotype conservation that correlates with metabotype heritability and that drastic changes in metabotype occur in only a small fraction of study participants. These latter differences may be indicative of important physiological and potentially disease-related changes.

2 Materials and methods

2.1 Study population

Cooperative Health Research in the Region of Augsburg (KORA) is an epidemiological research cohort with participants randomly selected from the general population in the region of Augsburg in Southern Germany (Holle et al. 2005). Here we use samples and data that were collected during the fourth survey (KORA S4) between 1999 and 2001 and from a follow-up study (KORA F4) that was conducted 7 years later between 2006 and 2008. Extensive examinations and phenotyping using standardized protocols have been applied and are described in detail elsewhere (Wichmann et al. 2005 and references therein). Of 3,080 individuals who attended both studies, 818 who had metabolite profiles and other covariates measured at both baseline and follow-up visits were included in this analysis. These participants were between the ages of 54 and 75 years at baseline with an equal distribution of males and females. KORA is used here as the discovery study. For replication, we used data and samples from a subset of 83 unrelated participants of the TwinsUK cohort. TwinsUK is a British adult-twin registry with predominantly female participants. Samples not suitable for this project from TwinsUK, such as data from a second twin or having a too short time span between two samplings, were removed. Study participants were enrolled from the general population through national media campaigns and were shown to be comparable to age-matched population singletons in terms of disease-related and lifestyle characteristics (Andrew et al. 2001). The samples used in this study were derived from participants ages 30–75 (median 58 years) and are all from females (because most TwinsUK participants are female). Multiple samples of TwinsUK participants were collected at varying time intervals. Here we selected samples that were collected in variable ranges, with a minimum of 1 year and a maximum of 13 years apart (mean 8 years; 75 % of the data points have a range of 6–13 years; histogram is given in Supplemental Figure 1). For both studies, all participants gave written informed consent. The studies were approved by the local ethics committees, the Bayerische Landesärztekammer for KORA and Guy’s and St. Thomas’ Hospital Ethics Committee for TwinsUK.

2.2 Blood sampling

Blood for KORA F4 was drawn between 8:00 a.m. and 10:30 a.m. after 10 h fasting. Material was drawn into serum gel tubes, gently inverted twice, rested for 30 min at room temperature to obtain complete coagulation, and then centrifuged for 10 min at 2,750×g. Serum was divided into aliquots and kept for a maximum of 6 h at 4 °C, after which it was stored at −80 °C until analysis. A similar blood draw protocol was used in KORA S4 (Rathmann et al. 2003). For the TwinsUK study, blood samples were taken after at least 6 h of fasting. The samples were immediately inverted three times, followed by 40 min of resting at 4 °C to obtain complete coagulation. The samples were then centrifuged for 10 min at 2,000×g. Serum was removed from the centrifuged brown-topped tubes as the top, yellow, translucent layer of liquid. Aliquots were stored at −45 °C until sampling.

2.3 Metabolomics measurements

Metabolic profiling was done on serum using ultrahigh-performance liquid-phase chromatography and gas-chromatography separation, coupled with tandem mass spectrometry (UHPLC/MS/MS2 and GC/MS, respectively) at Metabolon, Inc. (Durham, NC, USA) using established procedures and technology (Evans et al. 2009; Suhre et al. 2011b). Briefly, Metabolon is a commercial supplier of metabolic analyses that developed a platform integrating chemical analysis, including the identification and relative quantification, data reduction, and quality-assurance components of the process. Samples are submitted to three analyses: to positive- and negative-mode UHPLC/MS/MS2 and to GC/MS. The UHPLC injections were optimized for basic and acidic species. The resulting MS/MS2 data were searched against a standard library generated by Metabolon that included retention time, molecular mass-to-charge ratio (m/z), and preferred adducts and in-source fragments as well as their associated MS/MS spectra for all molecules in the library. The library allowed for the identification of the experimentally detected molecules on the basis of a multi-parameter match without the need for additional analyses. RSD (relative standard deviation) was determined using repeated measurements of the technical replicates in pooled samples. Supplemental Table 1 gives details of measurements associated with each metabolite. Metabolomics measurements for KORA S4, KORA F4, and TwinsUK were performed in separate batches at Metabolon. Metabolites with more than 20 % missing values or that were detected only in either the S4 or the F4 samples were removed, resulting in a dataset of 212 metabolite levels for 818 participants in KORA and 203 metabolites for 83 participants in TwinsUK. A total of 135 metabolites were common to the KORA and the TwinsUK datasets. Missing data were imputed to the average over all valid observations of that metabolite at the respective time point. Metabolite concentrations were z-scored normalized over all samples.

2.4 Statistical analysis

Analysis was done using the R package (2.15.2). The following definitions are used throughout this paper: For an individual, the term metabotype is used to refer to the set (or vector) of metabolite concentrations over the entire set of metabolites. Time points (t₁ or t₂) refer to the first and second data point in each study, i.e., in KORA, t ₁ refers to data from the initial S4 survey and t ₂ to the F4 follow-up. The term metabotype correlation refers to the Pearson correlation between the metabolite profiles of two individuals ind _i and ind _j from a cohort C (KORA or TwinsUK) at two time points, written as r(ind ^C_i (t ₁), ind ^C_j (t ₂)). Longitudinal metabotype intra-correlation refers to the correlation of a metabotype of individual i at the first time point to that of the same individual at the second time point and is denoted as r(ind ^C_i (t ₁), ind ^C_i (t ₂)). Longitudinal metabotype inter-correlation refers to the correlation of a metabotype of individual i at the first time point to that of a different individual j at the second time point and is denoted as r(ind ^C_i (t ₁), ind ^C_j (t ₂)). Metabolite correlation refers to the Pearson correlation between two metabolite concentration vectors met ^C_k (t ₁) and met ^C_l (t ₂) of metabolites k and l at time points t ₁ and t ₂ and is denoted as r(met ^C_k (t ₁), met ^C_l (t ₂)). Weighted metabotype correlation refers to the Pearson correlation between two metabotypes at two time points using metabolite correlations as weights, calculated using the following formula:

$$r_{w} \left( {x,y} \right) = \frac{{cov_{w} \left( {x,y} \right)}}{{\sqrt {cov_{w} \left( {x,x} \right) \cdot cov_{w} \left( {y,y} \right)} }}$$

$$cov_{w} \left( {x,y} \right) = \mathop \sum \limits_{k = 1}^{M} (x_{k} w_{k} - \overline{x}_{w} )(y_{k} w_{k} - \overline{y}_{w} )$$

$$\overline{x}_{w} = 1/M\mathop \sum \limits_{k = 1}^{M} x_{k} w_{k}$$

where x represents a metabolite concentration vector and M is the number of metabolites. The weight is calculated as w _k = r(met ^C_k (t ₁), met ^C_k (t ₂)). The terms weighted longitudinal metabotype inter- and intra correlations are used as for the normal correlations defined above, but using weighted correlations.

The conservation index is used for both metabotypes and metabolites. The metabotype conservation index of an individual i is defined as the relative rank of the longitudinal metabotype intra-correlation of that individual with respect to all longitudinal metabotype inter-correlations of that individual with all other individuals from the same study cohort. To calculate this index, the intra-correlations are converted to ranks to measure a metabotype’s or metabolite’s similarity to itself when compared to its similarity to other metabotypes or metabolites. It is calculated as 1 − ((rank(i) − 1)/(N − 1)), where N is number of metabotypes. This index quantifies the comparison of intra-correlations to inter-correlations, yielding a value in the range [0,1]. The metabolite conservation index of a metabolite is calculated in the same manner as the metabotype conservation index by replacing vectors of metabotypes with metabolite concentration vectors. A value of 1 indicates a fully conserved metabotype or metabolite. For example, in a 3-subjects set (A, B, C) (N = 3), metabotype A has a correlation of 0.5 to itself after 7 years, and a correlation of 0.6 and 0.7 to B and C, respectively, after 7 years. Thus, its similarity to itself (0.5) is ranked third among all the other similarities to other metabotypes, and its conservation index is 0. If B has a correlation of 0.8 to itself and 0.5 to C, then its self-correlation is ranked first among correlations to others, meaning that it is fully conserved, thus resulting in a conservation index of 1. The weighted metabotype conservation index is defined similarly using weighted metabotype correlation.

2.5 Heritability estimates

Heritability estimates (h) were obtained from previous work based on a large cross-sectional metabolomics dataset from the TwinsUK study using the Metabolon platform (Shin et al., submitted manuscript). A total of 212 metabolites overlapped between the metabolite sets used in that study and the present work. Briefly, heritability was computed using monozygotic and dizygotic twin pairs under the ACE [additive genetic effects (A), shared family environment (C), and unique environment (E)] model (Zyphur et al. 2013), which models trait variance as a function of additive genetics, common environment, and unique environment and/or error effects. The narrow-sense heritability was inferred from the proportion of the total variance explained by estimated additive genetic effect. Calculations were carried out using maximum likelihood methods implemented in OpenMx software (Boker and Neale 2011) while adjusting for age, gender, and batch effects.

2.6 Association with age, gender, and BMI

To estimate the impact of age, gender, and BMI, metabolite levels at KORA S4 were modeled using multi-linear regression in R, with cofactors gender, age, and BMI [R code: lm (metabolites ~ age + gender + BMI)]. The 15 most strongly phenotype-associated metabolites were selected for visualization.

2.7 Principal component analysis (PCA)

R function prcomp was used to obtain the principal components.

3 Results

3.1 Metabolic profiles of the same individual taken at time points 7 years apart correlate

Pearson correlation was used to calculate intra- and inter-correlations between the metabotypes of individuals at two time points, designated as “longitudinal” intra- and inter-correlations (see “Sect. 2”). These values were calculated based on 818 individual metabotypes using 212 metabolites studied at the two time points S4 and F4 for the KORA cohort. The distributions of the pairwise longitudinal inter- and intra-correlations for KORA are shown in Fig. 1a. The median of the longitudinal intra-correlations was 0.35 and significantly different from zero (p < 2.2 × 10⁻¹⁶), but there was no observable correlation among the metabotypes of different individuals (longitudinal metabotype inter-correlation median = −0.0012). This observation was replicated in the TwinsUK study based on 83 unrelated female study participants and 203 metabolites taken at two time points that were on average 8 years apart. The median for the longitudinal metabotype intra-correlation was 0.26; for the inter-correlations, it was −0.0042 (Supplemental Figure 2a).

3.2 Unique identification of 40 % of KORA study participants based on their metabolic profiles is possible after 7 years

As a measure of human metabolic profile persistence over time, the metabotype conservation index was used. This index measures the relative rank of the individual metabotype’s longitudinal intra-correlation (correlation to self over time) within its longitudinal inter-correlation values (correlations to others over time). A conservation index value of 1 was observed for 334 out of the 818 KORA study participants, indicating that 40 % of the study participants could be uniquely identified after 7 years based on information about their metabolic profiles alone. Moreover, 95 % of the metabotypes had a conservation index above 0.7; i.e., the correlation of a metabotype in S4 to itself in F4 was ranked among the 30 % highest correlations with all other metabotypes in F4. Conversely, only 5 % of the individuals showed low conservation over time; i.e., they drastically changed their metabolic profiles over the 7-year period (black curve in Fig. 2a). This observation was also replicated in the TwinsUK study: 37 % of the participants showed a metabotype conservation index value of 1, and 95 % of all metabotypes had a conservation index above 0.57 (black curve in Fig. 2b).

3.3 Metabolic traits are also conserved over time

We computed metabolite conservation indices and Pearson correlations for each metabolite (using correlations between individual metabolite concentrations from all individuals) for both KORA (212 metabolites) and TwinsUK (203 metabolites) (Supplemental Figure 3). Medians of metabolite longitudinal intra-correlations were 0.322 and 0.28 for KORA and TwinsUK, respectively. For 135 metabolites that were observed in both studies, we compared the metabolite conservation between the KORA and the TwinsUK studies and observed a high rank correlation (r = 0.69, p < 2.2 × 10⁻¹⁶) between the Pearson correlations of the metabolites from both sets. For KORA the 10 most strongly conserved metabolites are shown at the top of Table 1 and comprise 6 out of 10 sterols and steroids. With regard to other metabolite classes, lysolipids appeared among the 25 % least-conserved metabolites, and 10 out of 16 long-chain fatty acids belonged to the 50 % least-conserved metabolites. Nine out of twelve metabolites associated with valine, leucine, and isoleucine metabolism were in the top 25 % of the most-conserved metabolites. Carbohydrates were more divided, with some showing higher (e.g., 1.5 anhydroglucitol) and others lower conservation over time (e.g., pyruvate).

Table 1 Selected metabolites with conservation [as longitudinal intra-correlations (r)] and heritability estimates (h), restricted to metabolites with conservation or heritability greater than 0.45, which is the union of two regions of heritability ranks bounded by a ceiling of 28 and conservation ranks bounded by a ceiling of 46; ranks and difference in ranks between conservation and heritability for each metabolite are given, significant association (p < 0.05 after Bonferroni correction for 212 tests) of metabolites with age, gender and BMI as to a linear model (see “Sect. 2”) are indicated by ‘x’

Full size table

3.4 Weighting metabotype correlation using metabolite conservation increases the uniquely identifiable fraction in KORA to 52 %

We hypothesized that metabolites that show a higher conservation over time also carry more information regarding an individual’s metabotype. Figure 1b shows the distributions of weighted longitudinal intra- and inter-correlations between the metabotypes of KORA at S4 and F4, where weights are the longitudinal intra-correlations of metabolites (replication for the TwinsUK set is presented in Supplemental Figure 2b). The weighting increased the median of the metabotype intra-correlations from 0.35 to 0.58 for the KORA set and from 0.26 to 0.53 for the TwinsUK set. Extending this weighting scheme to the conservation index (see Sect. 2), we observed a 30 % increase in the number of individuals who could be uniquely identified based on their metabolite profiles (from 40 % to 52 %; red curve in Fig. 2a). On an individual basis, 43 % of the participants showed an increased conservation index while only 22 % had a decreased index under this weighting procedure. For 95 % of the individuals, the metabotype conservation index was larger than 0.83, compared to 0.7 without weighting. In the TwinsUK replication study, the fraction of uniquely identifiable individuals increased from 37 to 64 % after weighting, with 95 % of all individuals having a conservation value over 0.78 (red curve in Fig. 2b).

3.5 Individuals who display a strong change in their metabotype over time are not different from the general population

About 5 % of all individuals showed low longitudinal metabotype conservation (Fig. 2). To investigate whether these individuals represented outliers with extreme metabotypes, we conducted PCA. Figure 3 shows the first two dimensions of the PCA for the KORA S4 and F4 datasets, respectively. The 5 % of the least-conserved metabotypes were within the data distribution of all other individuals and thus not different from the normal population (Fig. 3).

3.6 Most highly conserved metabolites are also highly heritable

We expected conservation of individual metabotypes to be influenced by genetic factors and hence partially heritable. To compare metabolite conservation to metabolite heritability, heritability estimates were obtained from an independent TwinsUK heritability study of over 6,000 twins (Shin et al., submitted). Figure 4 shows a cross-plot of heritability ranks and conservation ranks for all metabolites. The rank correlation between heritability and conservation was 0.74 (p < 2.2 × 10⁻¹⁶). Table 1 presents metabolite heritability estimates, correlation values, and respective ranks for a selected set of metabolites (full dataset, Supplemental Table 1).

3.7 Gender, age, and BMI are associated with metabolite heritability and conservation

Gender is a conserved phenotype while BMI might change slightly over time. On the other hand, age increases identically for all individuals. Metabolites that are biomarkers for these phenotypes should thus display a higher-than-average conservation over time. To identify metabolites strongly associated with these phenotypes in the present study, linear regressions of metabolite concentrations to gender, age, and BMI were calculated (using data from the KORA S4 dataset). For each phenotype, metabolites significantly regressing (p < 0.05/212) and non-significantly regressing were compared for their heritability and conservation ranks, using the Wilcoxon rank test. Metabolites significantly associating with gender were also more significantly associated with high conservation ranks (p = 4.7 × 10⁻⁹) and with high heritability ranks (p = 7.1 × 10⁻⁵). Age-related metabolites also correlated with high conservation (p = 6.9 × 10⁻⁴) and with high heritability (p = 3.8 × 10⁻⁴), as did BMI (p = 4.6 × 10⁻⁶ and 2.4 × 10⁻³, for high conservation and high heritability, respectively). The 15 metabolites that associated most strongly with each phenotype are highlighted in Fig. 4b and Supplemental Table 2.

4 Discussion

4.1 Metabotype conservation

Metabolite Pearson correlations (intra-correlations between two time points) were used in this analysis to calculate a weighted metabotype conservation index as described in Sect. 2. Weighting of the metabotype correlations with the conservation of the individual metabolites over time was motivated by the observation that not all metabolites may be equally informative to identify individuals with drastic changes in their metabotype over time. For instance, variability can result from a stronger dependency on varying lifestyle factors (e.g., metabolites from diet) but also from lower measurement quality (higher RSD; see Fig. 4a). The impact of metabolite conservation on the metabotype conservation index (i.e., the 30 % increase in metabotypes with a conservation index of 1 after weighting with metabolite longitudinal intra-correlations) shows that highly conserved metabolites can be used to better distinguish individuals. This result is also supported by the findings of (Sampson et al. 2013) because some of the highly conserved metabolites from our study (Table 1) have also shown high inter-subject variability, i.e., proving to be better at discriminating individuals (see Supplemental Table 3).

Metabotypes can be divided into three categories: strictly conserved metabotypes, or those with a conservation index value of 1 (52 % of the population); highly conserved metabotypes, or those with a conservation index value in the interval [0.83,1] (43 % of the population); and least-conserved metabotypes, or those with a value in the interval [0,0.83] (5 % of the population). However, the 5 % least-conserved metabotypes presented an intriguing question regarding whether they are entirely different from the “normal” population.

To address this question, we applied PCA to KORA S4 and F4 samples to determine whether the 5 % least-conserved metabotypes are separated from the remaining metabotypes. When projected onto the first two principal components (Fig. 3), the PCA did not distinguish the least-conserved metabotypes as outliers relative to the remaining population. Other explorative techniques (such as hierarchical clustering) were used to determine if the 5 % metabotypes could be distinguished as having extreme behavior from the 95 % metabotypes when exploring each of S4 and F4 separately, as well as exploring the PCA of the mean metabotype behavior (calculated as the average of the S4 and F4 metabolic profiles for each metabotype). The results showed that no extreme behavior of the metabotypes could be determined using these techniques, either (data not shown). Thus, the two time points together rather than one time point (i.e., the longitudinal study using the conservation index) can be used to distinguish such highly changing metabotypes, once again highlighting the importance of long-term studies in detecting the abnormal behavior of metabotypes. Whether such individuals have experienced important changes in their lifestyles or developed severe diseases requires additional investigation. Results from metabotype conservation analysis further motivate the study of factors affecting metabolite behavior over time, whether because of lifestyle, environment, or genetics.

4.2 Metabolite conservation analysis

The conservation analysis addressed the question posed by Nicholson et al. (2011) regarding the decay of metabolite stability over time. Our results indicate that even after 7 years, some metabolites remain highly conserved and contribute to increasing metabotype conservation. In exploring the conservation behavior of metabolites in different pathways, we found that the 6 steroids in the top 10 most-conserved metabolites are mostly in the androsterone pathway, which is explained by the discriminative power of gender as a “natural” individual classifier. Results from regression with gender, age, and BMI and the Wilcoxon test indicate that gender-related metabolites are more significantly associated with high conservation compared to age- and BMI-related metabolites. The conservation of gender-related metabolites has its role in increasing the metabotype conservation index, as indicated previously. This finding suggests their usability in studying the uniqueness of individual metabotypes over the long term. On the lower end of the conservation spectrum are lysolipids and the majority of the long-chain fatty acids; lysolipids are affected by nutrition but are also associated with high RSD values (>25; see Supplemental Table 1), implying that further investigation is needed for these metabolites. Examples of long-chain fatty acids that are not highly conserved include palmitate, oleate, and stearate, which are fatty acids that occur naturally in various animal and vegetable fats and oils (HMDB: http://www.hmdb.ca), and eicosenoate, which is found in a variety of plant oils. Another is margarate (heptadecanoic acid), which is known as a biological marker of long-term milk fat intake in populations with a high consumption of dairy. Food intake and lifestyle thus highly affect these long-chain fatty acids. Among the more conserved carbohydrates are 1,5-anhydroglucitol (1,5-AG), mannose, glucose, lactate, and erythronate. Because 1,5-AG is a known biomarker for short-term glycemic control (Buse et al. 2003), it is thus expected to be stable over time. It also shows a higher stability than glucose, which makes it even a stronger biomarker for diabetes. Urate appears in the top-conserved metabolites and is known to be a biomarker for Parkinson’s disease (Cipriani et al. 2010). With this overview, we have provided an example of using these results to identify metabolites that can be potentially used as biomarkers because of their stability.

Carbohydrates falling in the low-conserved region are pyruvate, erythrose, glycerate, and fructose, likely because of their high technical variance with RSD values above 25. Pyruvate showed a very low conservation compared to the more highly conserved glucose and lactate, which are in the same glycolysis pathway, and thus the RSD might explain this contradiction (Supplemental Table 1 shows annotation with pathways and superpathways together with correlation and heritability ranks of metabolites). This finding also suggests that low-conserved metabolites should be avoided when studying a metabolic disorder over time because their change with time arises from their instability rather than from an effect of the disorder’s metabolic influence.

To detect whether the conservation of some metabolites is affected by a higher conservation in one sex than the other, males and females were separated and the intra-correlations of metabolites calculated for both sexes separately, but we found no significant variation between the values obtained for each sex (Supplemental Figure 4).

4.3 Heritability versus conservation study

Deviation between heritability ranks and conservation ranks can be used to identify metabolites that may be conserved as a result of dietary patterns or lifestyle from those that are more conserved because of their genotype association. We ordered metabolites in a descending order based on the absolute difference in heritability and conservation ranks. With this approach, taking 100 as the lower bound of absolute difference, two groups of metabolites are at the top. The first group consists of metabolites that are more conserved than heritable (they appear below the diagonal, and more towards the lower right corner of heritability graph in Fig. 4), which includes 4-methyl-2-oxopentanoate, 3-methyl-2-oxobutyrate, 3-methyl-2-oxovalerate from valine, leucine, and isoleucine metabolism, methyl palmitate from fatty acid metabolism, and glucose and lactate from the glycolysis pathway. The second group consists of metabolites that are more heritable than conserved (they appear above the diagonal and towards the upper left corner of heritability graph in Fig. 4), and these include theobromine, glycerate, and homostachydrine. Supplemental Table 2 highlights the significance of association of some of these metabolites with gender, age, and BMI. Glucose and lactate are examples of metabolites that are only moderately conserved (with conservation ranks of 59 and 60 and heritability ranks of 161 and 169, respectively), in contrast to 1,5-anhydroglucitol, which shows a high heritability and a high conservation, and pyruvate, which shows least heritability and conservation. From the second group of metabolites, those that are more heritable than conserved, glycerate significantly regressed with gender, and homostachydrine significantly regressed with age. Homostachydrine is a food compound found in citrus fruits, and citrus fruit intake undergoes both seasonal and daily variations in Germany. Because the KORA surveys were conducted over periods that are longer than 1 year and participants were enrolled randomly with respect to season, low correlation between the availability of citrus fruits to individual participants between S4 and F4 is to be expected, which may explain the lack of conservation of homostachydrine levels between S4 and F4 despite its high heritability.

Other groups of metabolites show high heritability and high conservation or low heritability and low conservation. The first group of metabolites appears in Table 1 and is significantly associated with gender, as also confirmed with results of regression to gender, age, and BMI. The other group is near the lysolipids region (upper right corner of heritability graph in Fig. 4a) and where metabolites are also associated with high RSD values (Fig. 4a). Pathway and subpathway annotation of metabolites on the heritability graph is given in Supplemental Table 1.

Results from associating heritability with conservation reveal variability among metabolites and relate it to the biological background. Along with the results of the effect of conservation on distinguishing metabotypes, these findings can be used to distinguish disorder-related phenotypes and characterize them as arising from heritability or lifestyle. Disorder-related metabolites can also be used in prediction of abnormalities in longitudinal studies.

4.4 Limitations of the present study

Although the conservation of metabotypes confirms earlier findings from comparatively short-term studies and metabolite conservation shows results consistent with stable phenotypes (such as gender), several limitations of this study should be kept in mind. Some variation may have resulted from laboratory/technical errors associated with sample storage and the variation in the time of day at which the samples were collected at each time point. Other influences that might be limiting include the stability of serum between extraction and metabolomics analysis and variations attributable to the fasting behavior of the participants.

We use simple Pearson correlation between time points, neglecting possible influences of age, gender, and BMI on the correlation values of metabolites. However, a linear regression model that corrects for those factors was also used to calculate the correlation of metabolites and the resulting conservation index in order to evaluate the effect of these covariates. Using this more complex model did not substantially change our main results, as presented in Supplemental Figure 5.

The TwinsUK cohort presented varying time differences among participants (see Supplemental Figure 1) on the metabotype conservation. We therefore only used it for replication. It would have been interesting to study the impact of the time difference on metabolite and metabotype conservation. However, KORA involved only a fixed time difference between the two time points (i.e., 7 years), while the number of participants in the TwinsUK cohort was too low to expect statistically significant results from such an analysis, which thus was not done.

5 Conclusion

We studied the long-term conservation of human individual metabolic profiles over 7 years, an essential step for extrapolation from short-term studies. We also analyzed metabolite conservation and identified poorly and highly conserved examples. More than half of the study participants could be uniquely identified after 7 years for both the KORA and TwinsUK cohorts, based on their metabotype conservation index. Highly conserved metabolites increased this uniqueness. Heritability and the 7-year conservation of metabolites were highly correlated, and the two measures together revealed variations in metabolite behavior. Metabolites that showed extremely high conservation compared to heritability or vice versa were explored for biological relevance. Results confirm the long-term conservation of individuality of metabotypes, further increasing the possibility of using metabolomics as a surrogate for understanding the systems biology underlying normal and diseased phenotypes. Metabolites reported here may be investigated as potential long-term biomarkers to detect normality/abnormality of changes in human metabolic profiles. They also stand as a reference when studying long-term changes in a metabolic disorder and to identify whether changes are the result of metabolite or disorder instability over time. The characterization of metabolites based on heritability and conservation will also be useful in understanding disease pathways and interpreting clinical studies.

References

Amara, A. W., & Standaert, D. G. (2013). Metabolomics and the search for biomarkers in Parkinson’s disease. Movement Disorders, 28(12), 1620–1621. doi:10.1002/mds.25644.
Article PubMed PubMed Central Google Scholar
Andrew, T., Hart, D. J., Snieder, H., de Lange, M., Spector, T. D., & MacGregor, A. J. (2001). Are twins and singletons comparable? A study of disease-related and lifestyle characteristics in adult women. Twin Research, 4(6), 464–477.
Article CAS PubMed Google Scholar
Assfalg, M., Bertini, I., Colangiuli, D., Luchinat, C., Schafer, H., Schutz, B., et al. (2008). Evidence of different metabolic phenotypes in humans. Proceedings of the National Academy of Sciences USA, 105(5), 1420–1424. doi:10.1073/pnas.0705685105.
Article CAS Google Scholar
Bernini, P., Bertini, I., Luchinat, C., Nepi, S., Saccenti, E., Schafer, H., et al. (2009). Individual human phenotypes in metabolic space and time. Journal of Proteome Research, 8(9), 4264–4271. doi:10.1021/pr900344m.
Article CAS PubMed Google Scholar
Boker, S., Neale, M., Maes, H., Wilde, M., Spiegel, M., Brick, T., et al. (2011). OpenMx: An open source extended structural equation modeling framework. Psychometrika, 76(2), 306–317. doi:10.1007/s11336-010-9200-6.
Bollard, M. E., Stanley, E. G., Lindon, J. C., Nicholson, J. K., & Holmes, E. (2005). NMR-based metabonomic approaches for evaluating physiological influences on biofluid composition. NMR in Biomedicine, 18(3), 143–162. doi:10.1002/nbm.935.
Article CAS PubMed Google Scholar
Buse, J. B., Freeman, J. L., Edelman, S. V., Jovanovic, L., & McGill, J. B. (2003). Serum 1,5-anhydroglucitol (GlycoMark): A short-term glycemic marker. Diabetes Technology and Therapeutics, 5(3), 355–363. doi:10.1089/152091503765691839.
Article CAS PubMed Google Scholar
Cipriani, S., Chen, X., & Schwarzschild, M. A. (2010). Urate: A novel biomarker of Parkinson’s disease risk, diagnosis and prognosis. Biomarkers in Medicine, 4(5), 701–712. doi:10.2217/bmm.10.94.
Article CAS PubMed PubMed Central Google Scholar
Corona, G., Rizzolio, F., Giordano, A., & Toffoli, G. (2012). Pharmaco-metabolomics: an emerging “omics” tool for the personalization of anticancer treatments and identification of new valuable therapeutic targets. Journal of Cellular Physiology, 227(7), 2827–2831. doi:10.1002/jcp.24003.
Article CAS PubMed Google Scholar
Dallmann, R., Viola, A. U., Tarokh, L., Cajochen, C., & Brown, S. A. (2012). The human circadian metabolome. Proceedings National Academy of Sciences USA, 109(7), 2625–2629. doi:10.1073/pnas.1114410109.
Article CAS Google Scholar
Evans, A. M., DeHaven, C. D., Barrett, T., Mitchell, M., & Milgram, E. (2009). Integrated, nontargeted ultrahigh performance liquid chromatography/electrospray ionization tandem mass spectrometry platform for the identification and relative quantification of the small-molecule complement of biological systems. Analytical Chemistry, 81(16), 6656–6667. doi:10.1021/ac901536h.
Article CAS PubMed Google Scholar
Fave, G., Beckmann, M., Lloyd, A. J., Zhou, S., Harold, G., Lin, W., et al. (2011). Development and validation of a standardized protocol to monitor human dietary exposure by metabolite fingerprinting of urine samples. Metabolomics, 7(4), 469–484. doi:10.1007/s11306-011-0289-0.
Article CAS PubMed PubMed Central Google Scholar
Fiehn, O. (2002). Metabolomics—The link between genotypes and phenotypes. Plant Molecular Biology, 48(1–2), 155–171.
Article CAS PubMed Google Scholar
Gavaghan, C. L., Holmes, E., Lenz, E., Wilson, I. D., & Nicholson, J. K. (2000). An NMR-based metabonomic approach to investigate the biochemical consequences of genetic strain differences: Application to the C57BL10J and Alpk:ApfCD mouse. FEBS Letters, 484(3), 169–174.
Article CAS PubMed Google Scholar
German, J. B., Bauman, D. E., Burrin, D. G., Failla, M. L., Freake, H. C., King, J. C., et al. (2004). Metabolomics in the opening decade of the 21st century: Building the roads to individualized health. Journal of Nutrition, 134(10), 2729–2732.
CAS PubMed Google Scholar
Griffin, J. L., & Shockcor, J. P. (2004). Metabolic profiles of cancer cells. Nature Reviews Cancer, 4(7), 551–561. doi:10.1038/nrc1390.
Article CAS PubMed Google Scholar
Gu, H., Pan, Z., Xi, B., Hainline, B. E., Shanaiah, N., Asiago, V., et al. (2009). 1H NMR metabolomics study of age profiling in children. NMR in Biomedicine, 22(8), 826–833. doi:10.1002/nbm.1395.
Article CAS PubMed PubMed Central Google Scholar
Halama, A., Riesen, N., Moller, G., Hrabe de Angelis, M., & Adamski, J. (2013). Identification of biomarkers for apoptosis in cancer cell lines using metabolomics: Tools for individualized medicine. Journal of Internal Medicine, 274(5), 425–439. doi:10.1111/joim.12117.
Article CAS PubMed Google Scholar
Heinzmann, S. S., Merrifield, C. A., Rezzi, S., Kochhar, S., Lindon, J. C., Holmes, E., et al. (2012). Stability and robustness of human metabolic phenotypes in response to sequential food challenges. Journal of Proteome Research, 11(2), 643–655. doi:10.1021/pr2005764.
Article CAS PubMed Google Scholar
Holle, R., Happich, M., Lowel, H., & Wichmann, H. E. (2005). KORA—A research platform for population based health research. Gesundheitswesen, 67(Suppl 1), S19–S25. doi:10.1055/s-2005-858235.
Article PubMed Google Scholar
Holmes, E., Loo, R. L., Stamler, J., Bictash, M., Yap, I. K., Chan, Q., et al. (2008a). Human metabolic phenotype diversity and its association with diet and blood pressure. Nature, 453(7193), 396–400. doi:10.1038/nature06882.
Article CAS PubMed Google Scholar
Holmes, E., Wilson, I. D., & Nicholson, J. K. (2008b). Metabolic phenotyping in health and disease. Cell, 134(5), 714–717.
Article CAS PubMed Google Scholar
Kaddurah-Daouk, R., Kristal, B. S., & Weinshilboum, R. M. (2008). Metabolomics: A global biochemical approach to drug response and disease. Annual Review of Pharmacology and Toxicology, 48, 653–683. doi:10.1146/annurev.pharmtox.48.113006.094715.
Article CAS PubMed Google Scholar
Kettunen, J., Tukiainen, T., Sarin, A. P., Ortega-Alonso, A., Tikkanen, E., Lyytikainen, L. P., et al. (2012). Genome-wide association study identifies multiple loci influencing human serum metabolite levels. Nature Genetics, 44(3), 269–276. doi:10.1038/ng.1073.
Article CAS PubMed PubMed Central Google Scholar
Kochhar, S., Jacobs, D. M., Ramadan, Z., Berruex, F., Fuerholz, A., & Fay, L. B. (2006). Probing gender-specific metabolism differences in humans by nuclear magnetic resonance-based metabonomics. Analytical Biochemistry, 352(2), 274–281. doi:10.1016/j.ab.2006.02.033.
Article CAS PubMed Google Scholar
Krug, S., Kastenmuller, G., Stuckler, F., Rist, M. J., Skurk, T., Sailer, M., et al. (2012). The dynamic range of the human metabolome revealed by challenges. The Faseb Journal, 26(6), 2607–2619. doi:10.1096/fj.11-198093.
Article CAS PubMed Google Scholar
Langley, R. J., Tsalik, E. L., van Velkinburgh, J. C., Glickman, S. W., Rice, B. J., Wang, C., et al. (2013). An integrated clinico-metabolomic model improves prediction of death in sepsis. Science Translational Medicine, 5(195), 195ra195. doi:10.1126/scitranslmed.3005893.
Lenz, E. M., Bright, J., Wilson, I. D., Morgan, S. R., & Nash, A. F. (2003). A 1H NMR-based metabonomic study of urine and plasma samples obtained from healthy human subjects. Journal of Pharmaceutical and Biomedical Analysis, 33(5), 1103–1115.
Article CAS PubMed Google Scholar
Lindon, J. C., Holmes, E., & Nicholson, J. K. (2006). Metabonomics techniques and applications to pharmaceutical research and development. Pharmaceutical Research, 23(6), 1075–1088. doi:10.1007/s11095-006-0025-z.
Article CAS PubMed Google Scholar
Martinez-Lozano Sinues, P., Kohler, M., & Zenobi, R. (2013). Human breath analysis may support the existence of individual metabolic phenotypes. PLoS ONE, 8(4), e59909. doi:10.1371/journal.pone.0059909.
Article PubMed PubMed Central Google Scholar
Menni, C., Fauman, E., Erte, I., Perry, J. R., Kastenmuller, G., Shin, S. Y., et al. (2013). Biomarkers for type 2 diabetes and impaired fasting glucose using a non-targeted metabolomics approach. Diabetes,. doi:10.2337/db13-0570.
PubMed PubMed Central Google Scholar
Nicholson, G., Rantalainen, M., Maher, A. D., Li, J. V., Malmodin, D., Ahmadi, K. R., et al. (2011). Human metabolic profiles are stably controlled by genetic and environmental variation. Molecular System Biology, 7, 525. doi:10.1038/msb.2011.57.
Article Google Scholar
Rasmussen, L., Savorani, F., Larsen, T., Dragsted, L., Astrup, A., & Engelsen, S. (2011). Standardization of factors that influence human urine metabolomics. Metabolomics, 7(1), 71–83. doi:10.1007/s11306-010-0234-7.
Article CAS Google Scholar
Rathmann, W., Haastert, B., Icks, A., Lowel, H., Meisinger, C., Holle, R., et al. (2003). High prevalence of undiagnosed diabetes mellitus in Southern Germany: Target populations for efficient screening. The KORA survey 2000. Diabetologia, 46(2), 182–189. doi:10.1007/s00125-002-1025-0.
Article CAS PubMed Google Scholar
Sampson, J. N., Boca, S. M., Shu, X. O., Stolzenberg-Solomon, R. Z., Matthews, C. E., Hsing, A. W., et al. (2013). Metabolomics in epidemiology: Sources of variability in metabolite measurements and implications. Cancer Epidemiology Biomarkers and Prevention, 22(4), 631–640. doi:10.1158/1055-9965.epi-12-1109.
Article CAS Google Scholar
Saude, E., Adamko, D., Rowe, B., Marrie, T., & Sykes, B. (2007). Variation of metabolites in normal human urine. Metabolomics, 3(4), 439–451. doi:10.1007/s11306-007-0091-1.
Article CAS Google Scholar
Suhre, K., Meisinger, C., Doring, A., Altmaier, E., Belcredi, P., Gieger, C., et al. (2010). Metabolic footprint of diabetes: A multiplatform metabolomics study in an epidemiological setting. PLoS ONE, 5(11), e13953. doi:10.1371/journal.pone.0013953.
Article PubMed PubMed Central Google Scholar
Suhre, K., Shin, S. Y., Petersen, A. K., Mohney, R. P., Meredith, D., Wagele, B., et al. (2011a). Human metabolic individuality in biomedical and pharmaceutical research. Nature, 477(7362), 54–60. doi:10.1038/nature10354.
Article CAS PubMed Google Scholar
Suhre, K., Shin, S. Y., Petersen, A. K., Mohney, R. P., Meredith, D., Wagele, B., et al. (2011b). Human metabolic individuality in biomedical and pharmaceutical research. Nature, 477(7362), 54–60. doi:10.1038/nature10354.
Article CAS PubMed Google Scholar
Wallace, M., Hashim, Y. Z., Wingfield, M., Culliton, M., McAuliffe, F., Gibney, M. J., et al. (2010). Effects of menstrual cycle phase on metabolomic profiles in premenopausal women. Human Reproduction, 25(4), 949–956. doi:10.1093/humrep/deq011.
Article CAS PubMed Google Scholar
Walsh, M. C., Brennan, L., Malthouse, J. P., Roche, H. M., & Gibney, M. J. (2006). Effect of acute dietary standardization on the urinary, plasma, and salivary metabolomic profiles of healthy humans. American Journal of Clinical Nutrition, 84(3), 531–539.
CAS PubMed Google Scholar
Wang, T. J., Ngo, D., Psychogios, N., Dejam, A., Larson, M. G., Vasan, R. S., et al. (2013). 2-Aminoadipic acid is a biomarker for diabetes risk. Journal of Clinical Investigation, 123(10), 4309–4317. doi:10.1172/JCI64801.
Article CAS PubMed PubMed Central Google Scholar
Wichmann, H. E., Gieger, C., & Illig, T. (2005). KORA-gen-resource for population genetics, controls and a broad spectrum of disease phenotypes. Gesundheitswesen, 67(Suppl 1), S26–S30. doi:10.1055/s-2005-858226.
Article PubMed Google Scholar
Winnike, J. H., Busby, M. G., Watkins, P. B., & O’Connell, T. M. (2009). Effects of a prolonged standardized diet on normalizing the human metabolome. American Journal of Clinical Nutrition, 90(6), 1496–1501. doi:10.3945/ajcn.2009.28234.
Article CAS PubMed PubMed Central Google Scholar
Zhang, A., Sun, H., & Wang, X. (2012). Saliva metabolomics opens door to biomarker discovery, disease diagnosis, and treatment. Applied Biochemistry and Biotechnology, 168(6), 1718–1727. doi:10.1007/s12010-012-9891-5.
Article CAS PubMed Google Scholar
Zyphur, M. J., Zhang, Z., Barsky, A. P., & Li, W.-D. (2013). An ACE in the hole: Twin family models for applied behavioral genetics research. The Leadership Quarterly, 24(4), 572–594. doi:10.1016/j.leaqua.2013.04.001.
Article Google Scholar

Download references

Acknowledgments

We thank all KORA study participants and all members of the field staff in Augsburg who planned and conducted the study. We thank all TwinsUK study participants who provided us with replication data and heritability analysis. We also thank Khalid A. Fakhro and Anna Halama for their helpful comments and for paper proof reading. The KORA study was initiated and financed by the Helmholtz Zentrum München—German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria. Part of the metabolomics measurements were funded by BMBF grant 0315494A (project SysMBo) and was further supported by the German Federal Ministry of Education and Research (BMBF) to the German Center for Diabetes Research (DZD e.V.). NY and KS are supported by ‘Biomedical Research Program’ funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation. The statements made herein are solely the responsibility of the authors. The TwinsUK study was funded by the Wellcome Trust; European Community’s Seventh Framework Programme (FP7/2007-2013). The study also receives support from the National Institute for Health Research (NIHR) BioResource Clinical Research Facility and Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust and King’s College London. Tim Spector is holder of an ERC Advanced Principal Investigator award. SNP Genotyping was performed by The Wellcome Trust Sanger Institute and National Eye Institute via NIH/CIDR.

Conflict of interest

RPM is employed by Metabolon, Inc. He contributed only to the logistics and optimization of MS spectroscopy and to MS data interpretation. Metabolon Inc. was not involved in the design of the study, statistical analyses or interpretation of the results. All other authors declare that they have no competing interests.

Author information

Authors and Affiliations

Department of Physiology and Biophysics, Weill Cornell Medical College in Qatar, Education City, Qatar Foundation, P.O. Box 24144, Doha, Qatar
Noha A. Yousri & Karsten Suhre
Computers and System Engineering Department, Faculty of Engineering, Alexandria University, Alexandria, Egypt
Noha A. Yousri
Institute of Bioinformatics and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
Gabi Kastenmüller & Karsten Suhre
Institute of Genetic Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
Christian Gieger
Institute of Epidemiology II, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
Annette Peters & Christa Meisinger
Metabolon, Inc., Durham, NC, USA
Robert P. Mohney
Hannover Unified Biobank, Hannover Medical School, Carl-Neuberg-Straße 1, 30625, Hannover, Germany
Thomas Illig
Institute of Experimental Genetics, Genome Analysis Center, Helmholtz Zentrum München, German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
Jerzy Adamski
Human Genetics, Wellcome Trust Sanger Institute, Hinxton, CB10 1HH, UK
So-Youn Shin & Nicole Soranzo
Department of Twin Research and Genetic Epidemiology, Kings College London, London, SE1 7EH, UK
Idil Erte, Cristina Menni & Tim D. Spector
MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
So-Youn Shin

Authors

Noha A. Yousri
View author publications
You can also search for this author in PubMed Google Scholar
Gabi Kastenmüller
View author publications
You can also search for this author in PubMed Google Scholar
Christian Gieger
View author publications
You can also search for this author in PubMed Google Scholar
So-Youn Shin
View author publications
You can also search for this author in PubMed Google Scholar
Idil Erte
View author publications
You can also search for this author in PubMed Google Scholar
Cristina Menni
View author publications
You can also search for this author in PubMed Google Scholar
Annette Peters
View author publications
You can also search for this author in PubMed Google Scholar
Christa Meisinger
View author publications
You can also search for this author in PubMed Google Scholar
Robert P. Mohney
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Illig
View author publications
You can also search for this author in PubMed Google Scholar
Jerzy Adamski
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Soranzo
View author publications
You can also search for this author in PubMed Google Scholar
Tim D. Spector
View author publications
You can also search for this author in PubMed Google Scholar
Karsten Suhre
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Karsten Suhre.

Electronic supplementary material

Below is the link to the electronic supplementary material.

11306_2014_629_MOESM1_ESM.pptx

Supplemental Figure 1: Ranges of time differences between the two time points in the TwinsUK data set. Mean is 8 years, 25 % quantile is 6 years, and 75 % quantile is 10 years. (PPTX 76 kb)

11306_2014_629_MOESM2_ESM.pptx

Supplemental Figure 2: Metabotype pairwise longitudinal inter correlations versus intra correlations distributions between TwinsUK two time points. (a) Pearson correlation of the metabolite levels between two time points for the same individual, or intra-correlations (median is 0.26 (red histogram)) and for pairwise inter correlations (median is -0.00422 (blue histogram)). (b) as in (a), but using metabolite correlations as weights to metabotype correlations (medians are 0.53 for intra-correlations (red) and -0.0047 for pairwise inter-correlations (blue)). (PPTX 187 kb)

11306_2014_629_MOESM3_ESM.pptx

Supplemental Figure 3: Metabolite correlation and metabolite conservation index. Distribution of the Pearson correlation coefficients for the correlation of metabolites between the two study time points for KORA (a) (212 metabolites) and TwinsUK (b), (203 metabolites); the metabolite conservation index for KORA (c) and TwinsUK (d). (PPTX 195 kb)

11306_2014_629_MOESM4_ESM.pptx

Supplemental Figure 4: Scatterplot of Pearson correlation of metabolites computed on the male and female subgroups in KORA alone. (PPTX 113 kb)

11306_2014_629_MOESM5_ESM.pptx

Supplemental Figure 5: Metabolite conservation index computed using a simple Pearson correlation (black) and using a linear model including age, gender and BMI as covariates (red). (PPTX 91 kb)

11306_2014_629_MOESM6_ESM.xlsx

Supplemental Table 1: Excel file with the complete data (see Table 1), incl. metabolite identifiers, m/z, retention time, RSD information. (XLSX 73 kb)

Supplemental Table 2 (DOCX 28 kb)

Supplemental Table 3 (DOCX 19 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Yousri, N.A., Kastenmüller, G., Gieger, C. et al. Long term conservation of human metabolic phenotypes and link to heritability. Metabolomics 10, 1005–1017 (2014). https://doi.org/10.1007/s11306-014-0629-y

Download citation

Received: 12 November 2013
Accepted: 27 January 2014
Published: 26 February 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s11306-014-0629-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Long term conservation of human metabolic phenotypes and link to heritability

Abstract

Similar content being viewed by others

1 Introduction

2 Materials and methods

2.1 Study population

2.2 Blood sampling

2.3 Metabolomics measurements

2.4 Statistical analysis

2.5 Heritability estimates

2.6 Association with age, gender, and BMI

2.7 Principal component analysis (PCA)

3 Results

3.1 Metabolic profiles of the same individual taken at time points 7 years apart correlate

3.2 Unique identification of 40 % of KORA study participants based on their metabolic profiles is possible after 7 years

3.3 Metabolic traits are also conserved over time

3.4 Weighting metabotype correlation using metabolite conservation increases the uniquely identifiable fraction in KORA to 52 %

3.5 Individuals who display a strong change in their metabotype over time are not different from the general population

3.6 Most highly conserved metabolites are also highly heritable

3.7 Gender, age, and BMI are associated with metabolite heritability and conservation

4 Discussion

4.1 Metabotype conservation

4.2 Metabolite conservation analysis

4.3 Heritability versus conservation study

4.4 Limitations of the present study

5 Conclusion

References

Acknowledgments

Conflict of interest

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation