Background

EMPHASIS (Epigenetic Mechanisms linking Pre-conceptional nutrition and Health Assessed in India and Sub-Saharan Africa, www.emphasisstudy.org) is a collaboration between investigators in the UK, India and The Gambia designed to profile genome-wide DNA methylation in children whose mothers participated in two pre- and peri-conceptional micronutrient supplementation trials (the Mumbai Maternal Nutrition Project in India, MMNP [1] and the Peri-conceptional Multiple Micronutrient Supplementation Trial, PMMST, in rural Gambia [2]). The main objectives of the study are to identify methylation differences associated with the interventions and correlate these with health-related phenotypes in the children, including size at birth, post-natal growth, and childhood body composition, skeletal health, cardiometabolic risk markers and cognitive function. We hypothesise that maternal nutritional supplementation around the time of conception will result in altered DNA methylation profiles in the children, and that the distinct methylation patterns identified will show potentially causal associations with phenotypic characteristics in the children. We further expect to identify methylation differences unrelated to supplementation that are associated with the measured phenotypes.

Context

Poor quality diets and the resulting micronutrient deficiencies are major public health problems in low-and-middle-income countries (LMICs). In pregnant women they impair fetal development, and recent evidence suggests that they are also associated with longer term health problems in the offspring including stunting [3], impaired neurodevelopment [4] and, through ‘metabolic programming’, with increased vulnerability to adult non-communicable chronic diseases (NCDs) such as obesity, type 2 diabetes, cardiovascular disease and osteoporosis [5,6,7].

Long-term effects of fetal nutrition on later health would require mechanisms by which a ‘memory’ of the early environment is retained into later life and influences metabolism. Epigenetic signatures, including patterns of DNA methylation that are modifiable by environmental exposures, are leading candidate mechanisms [8, 9]. DNA methylation is a mitotically heritable epigenetic mark that plays a key role in the transcriptional regulation of cellular processes, including cell differentiation, genomic imprinting and X-chromosome inactivation. DNA methylation depends on the supply of methyl groups through the 1-carbon pathway, which requires vitamins B2, B6, B12, folate, methionine, choline and betaine, and amino acids serine and glycine, for normal function. The peri-conceptional period is a critical window when the process of establishing methylation marks is sensitive to nutrition [8, 9].

The initial ‘proof of principle’ of nutritional programming mediated by changes in DNA methylation came from the Agouti mouse model, in which natural variation in methylation at the Agouti locus influences coat colour, adult adiposity and glucose tolerance [8]. This locus is a metastable epiallele (ME), a genomic region characterised by inter-individual variation in methylation patterns that are established in the early embryo before gastrulation, and are therefore highly correlated across tissues derived from all three germ layers. Feeding pregnant dams ‘methyl donor’ nutrients (vitamin B12, folic acid, betaine and choline) increased methylation of the agouti locus and reduced agouti gene expression, leading to fewer obese yellow offspring and more lean brown offspring, characteristics that persisted into adult life [8]. Dietary manipulations in pregnancy affect the methylation and expression of offspring genes other than MEs. For example, in rats, maternal protein-restriction reduces methylation of offspring peroxisome proliferator activated receptor 1 alpha (PPARα) and glucocorticoid and angiotensin receptor genes [10]. Maternal folic acid supplementation prevents both the methylation and phenotypic effects (e.g. high blood pressure) induced by maternal protein restriction in the offspring.

There is evidence in humans that epigenetic changes induced by the nutritional environment in early life alter later phenotype, including body composition and cardiometabolic health. For example, DNA methylation at a number of loci (insulin-like growth factor 2 (IGF2), retinoid X receptor alpha (RXRA), endothelial nitric oxide synthase (eNOS), PGC1α, and cyclin-dependent kinase inhibitor 2 (CDKN2a) genes in cord tissue, cord blood or children’s leucocytes is associated with adiposity in later childhood [11,12,13,14]. Studies in The Gambia, where there is marked seasonal variation in maternal nutrition, have shown that season of conception is related to DNA methylation of human infant MEs [15], and that methylation is predicted by elements of the mother’s methyl donor metabolome at conception [16]. One implicated gene is the maternally imprinted tumour suppressor and immune function regulator vault RNA2–1 (VTRNA2–1) [17], making it a promising candidate for exploring mechanisms linking season of conception and infectious disease mortality in Gambians [18]. A methylation variant affecting expression of the pro-opio-melanocortin (POMC) gene has also been associated with child and adult obesity. This locus is an ME, and in Gambian infants methylation is associated with season of conception and maternal 1-carbon metabolites at conception [19].

Evidence for epigenetic programming in humans has hitherto relied mainly on observational studies. Randomised trials of peri-conceptional maternal nutritional interventions with follow-up of the children represent a stronger study design in which to examine effects on DNA methylation and health-related phenotypes, reducing the problems of confounding and bias that affect observational studies, thus providing stronger evidence of causality. The EMPHASIS study will be an important step towards understanding mechanisms underpinning the developmental origins of health and disease (DOHaD), identifying biomarkers of early life exposures associated with later disease risk, and designing more effective nutritional interventions for mothers preparing for pregnancy.

Design and methods

EMPHASIS is a follow-up study of two cohorts of children born to mothers who took part in separate randomized controlled trials of nutritional supplementation before and during pregnancy.

The original trials and the cohorts of children

Mumbai maternal nutrition project

MMNP (also known as Project SARAS [‘excellent’]; ISRCTN62811278) was a non-blinded individually randomized trial among Indian women living in Mumbai slums (2006–2012) [1]. The intervention was a daily snack, eaten in addition to normal diet, made from naturally micronutrient-rich local foods (green leafy vegetables, fruit and milk). Control snacks contained foods of low micronutrient content (e.g. potato, onion). Intervention snacks contained 10–23% of the WHO Reference Nutrient Intake (RNI) for β-carotene, vitamins B2 and B12, folate, calcium and iron, and 0.7 MJ of energy and 6 g of protein, compared with 0–7% RNI for the micronutrients, 0.4 MJ of energy and 2 g of protein in control snacks. At recruitment, non-pregnant women had detailed anthropometry, and data were collected on socio-economic status (Standard of Living Index [20]) and habitual diet by food frequency questionnaire. They received either intervention or control snacks; intake was supervised and recorded daily. Women who became pregnant continued supplementation until delivery, and were supplied with routine iron (100 mg) and folic acid (500 μg) supplements as per Indian government recommendations. Fetal biometry was recorded three times during pregnancy (at approximately 10, 20 and 29 weeks gestation, estimated from last menstrual period date and ultrasound measures) [21]. Plasma folate and vitamin B12 concentrations were measured in early pregnancy (~10 weeks gestation). An oral glucose tolerance test (WHO 1999 protocol) was performed at 28–32 weeks gestation [22]. Main outcomes were newborn anthropometry and gestational age at delivery. Of 6513 women recruited, 2291 became pregnant, leading to 1962 live singleton deliveries.

In the intention to treat analysis, there were no differences in birth weight or other newborn measurements between allocation groups [1]. In the per protocol analysis, limited to women who started supplementation at least 3 months before conception, a period that was considered long enough to achieve the maximal effect on maternal nutritional status, birth weight increased by a mean 48 g (p = 0.05). In both analyses there was an interaction between maternal BMI and the intervention, with a larger birth weight effect in mothers of BMI >18.5 kg/m2 (intention to treat: +63 g [95%CI 11, 115]; per protocol: +96 g [95%CI 35, 154]; p for interaction 0.001). The intervention reduced the prevalence of gestational diabetes (intention to treat: 7.3% compared with 12.4% in controls; OR: 0.56; 95% CI: 0.36, 0.86; P = 0.008) [22]. It had no effect on fetal size assessed using standard ultrasound measures [21].

The children of mothers who participated in MMNP are currently (2013–2018) being studied at 5–7 years of age (“SARAS KIDS” study) to measure anthropometry, body composition, skeletal development, cardio-metabolic risk markers and cognitive function (Fig. 1, Tables 1 and 2). Venous blood samples and buccal swabs are collected for DNA and RNA, and are stored in -80 °C freezers until transportation in batches to the laboratory on dry ice. The DNA samples and phenotype data will be used for the EMPHASIS study, in which we will limit the sample to the 1562 children born to mothers in the per protocol group. Data collection will be completed by the end of January 2018.

Fig. 1
figure 1

Flow diagram of the MMNP trial in Mumbai, India and children’s follow-up (SARAS KIDS)

Table 1 Maternal, newborn and child characteristics for the children in India and The Gambia who have participated in the EMPHASIS study
Table 2 Data collected among the Indian and Gambian children

West kiang Peri-conceptional multiple micronutrient supplementation trial

PMMST (ISRCTN13687662) was a double-blind individually randomized trial among women living in rural West Kiang, The Gambia (2006–2008) [2] (Fig. 2). The intervention was a daily multiple micronutrient tablet (UNIMMAP) providing the RNI of vitamins A, B1, B2, niacin, B6, folic acid, B12, C, D and E and iron, zinc, copper, selenium and iodine [23]. Control women received matching placebo tablets. At recruitment, non-pregnant women had anthropometry. Compliance with supplementation was assessed by fortnightly tablet counts. Women stopped the supplement when they became pregnant, confirmed either by pregnancy test or by ultrasound at approximately 12 weeks gestation, and then both groups were supplied with routine iron (60 mg) and folic acid 250 μg) supplements and anti-malarial prophylaxis. Serial ultrasound scans were performed and newborn anthropometry was recorded. Primary outcomes were mid-gestation indices of utero-placental vascular-endothelial function (ratio of plasminogen-activator inhibitor [PAI] 1 to PAI-2), mean uterine-artery resistance index and fetal-to-maternal measles antibody ratio as an index of placental active transport capacity at delivery.

Fig. 2
figure 2

Flow diagram of the PMMST trial in The Gambia, and the children’s follow-up

Of 1156 women recruited, 376 had live singleton deliveries. There were no differences in PAI-1/PAI-2 or measles antibody ratio between trial arms, but there was a 0.02-unit reduction in uterine artery resistance index between 18 and 32 weeks of gestation (95% CI: -0.03, −0.00; P = 0.040) among women in the intervention group [2]. There was no significant effect of supplementation on birth weight. Two small pilot studies subsequently identified preliminary evidence of intervention-associated methylation differences in cord blood DNA, and in peripheral blood at the age of 9 months [24, 25]; no analyses were conducted relating DNA methylation to phenotypes.

For EMPHASIS, we aimed to study as many as possible of the 376 children; 356 were re-traced using the West Kiang Demographic Surveillance System [26], 298 of these were studied during 2016–2017 at the age of 7–9 years (Fig. 2). Similar outcomes were measured as in MMNP, using harmonised methods (Table 2). Blood samples and buccal swabs were collected for DNA and RNA and stored at -80 °C; DNA was isolated in The Gambia and DNA samples were transported to the laboratory in India on dry ice.

DNA methylation profiling

DNA methylation profiling for both cohorts will be carried out at the CSIR-Centre for Cellular and Molecular Biology, Hyderabad, India. In a stage 1 ‘discovery’ analysis, genome-wide DNA methylation will be measured at >850,000 CpG methylation sites in ~700 Mumbai children and all the Gambian children with available DNA (N = 293) using the Illumina Infinium MethylationEPIC array (EPIC) (Illumina Inc., San Diego, USA), to identify differentially methylated positions (DMPs), regions (DMRs), and variably methylated positions and regions (VMPs and VMRs). Technical validation of significant DMPs and DMRs will be performed by pyrosequencing a subset of samples spanning the range of observed methylation values, using a Pyromark 96 pyrosequencer (Qiagen, Hilden, Germany). In addition, a small number of selected candidate loci not present on the EPIC array will also be assayed in both cohorts by pyrosequencing. These have been selected a priori following a literature review (manuscript in preparation) of other studies in which DNA methylation has been associated with maternal nutritional exposures and/or health outcomes of interest (Table 3). Replication of the technically validated loci will be performed using pyrosequencing in an independent sample of size n = 200–400 MMNP samples (sample size will depend on the observed effect size in the discovery analysis). In a cross-tissue analysis, technically validated significant loci will be examined in buccal DNA samples (n~50 from each cohort). All samples from both cohorts will also be genotyped using the Illumina Global Screening Array (GSA). Blood samples collected into Paxgene tubes for RNA isolation will be stored for later transcriptomic studies.

Table 3 Genes not on the EPIC array with previous evidence of associations with nutritional exposures and/or phenotypes

Data analysis

A detailed analysis plan can be found on the EMPHASIS website (www.emphasisstudy.org).

Stage 1: Intervention-methylation associations (Fig. 3 ): Data from the two cohorts will be analysed separately. In a ‘hypothesis-free’, genome-wide analysis, the raw intensity data from the EPIC arrays will undergo pre-processing, quality control and normalization. Intervention-methylation associations will be identified at DMRs and DMPs using appropriate methods, and controlling for the false discovery rate (FDR). Loci and regions showing differences in methylation variance (VMPs and VMRs) will be identified, both genome wide and in an analysis targeted to MEs and imprinting control regions (ICRs). The candidate gene data will be analysed in parallel, using a similar strategy to the one outlined above to identify methylation differences associated with nutritional intervention. Technical validation will be carried out in a sub-set (10%) of samples using pyrosequencing. Significant hits will be those with p value <0.05 after correction for multiple testing. For the cross-tissue analysis, correlations of blood versus buccal methylation will be assessed using Pearson correlations.

Fig. 3
figure 3

Stage 1 analysis of the impact of the nutritional interventions on DNA methylation

Statistical power is estimated based on a sample size of 700 in the Indian cohort and 293 in The Gambia, to detect DMPs at a single locus using two-sample t-tests with alpha = 0.05, using a conservative assumption that tested loci have a methylation standard deviation at the 95th percentile of those observed (ie. within the 5% most variable probes). Significance thresholds were Bonferroni-adjusted assuming 800,000 independent tests, allowing for some filtering of probes. We estimate that we will be able to detect mean methylation differences between intervention and control groups, with 80% power, of 3 and 5% in the Indian and Gambian cohorts respectively. For the replication study using pyrosequencing in independent samples from the Indian cohort, and in the candidate loci studies in both cohorts, we estimate 80% power to detect a 2% difference in the Indian cohort and 3% in the Gambian cohort.

Stage 2: Methylation-outcome associations (Figs. 4 and 5 , Table 4 ): Significant loci associated with the nutritional intervention in either cohort from the EPIC array analysis, and all candidate loci, will be tested for associations with phenotype data measured in the children at the time of DNA collection, and also with birth outcomes (newborn anthropometry and gestation) (Table 4, Fig. 4). Loci identified in a separate meta-analysis of Stage 1 associations across both cohorts will also be considered. We will additionally carry out a ‘hypothesis-free’ analysis to identify loci where methylation is associated with outcomes, irrespective of intervention-methylation associations (Fig. 5).

Fig. 4
figure 4

Stage 2 Associations of intervention-associated DMRs and loci with health outcomes

Fig. 5
figure 5

Associations of methylation and outcomes

Table 4 Phenotypic outcomes in the children in both cohorts

Other analyses

To gain insights into underlying mechanisms, gene pathways analysis will be performed for the intervention-methylation-outcome genome wide association analysis. We will compare results from the intervention-methylation, methylation-outcome, and pathways analyses between the cohorts to identify commonalities and differences and explore further opportunities for meta-analysis. We will examine potential single nucleotide polymorphism (SNP) effects on methylation through methylation Quantitative Trait Loci (mQTL) analysis using the genome-wide genotype data on the children. We will also consider options for performing causal analysis with generated genotype data using Mendelian Randomisation (MR) [27,28,29].

DNA methylation assays for the discovery sample of Mumbai children and all of the Gambian children are scheduled to be completed in late 2017. Assays for the replication sample of Mumbai children will be completed in early 2018. The full EMPHASIS analysis (Stages 1 and 2) will be completed by mid-late 2018.

Discussion

EMPHASIS is the first study in humans to examine the effects of maternal pre- and peri-conceptional nutrition on genome wide DNA methylation in children in a randomized controlled trial setting and to relate nutrition-related DNA methylation to a range of health outcomes. Recent technical advances offer the ability to study the methylome at high resolution and affordable cost. This gives us an unprecedented opportunity to investigate the effects of nutrition on methylation at a critical period (peri-conception), when the epigenome undergoes extensive remodelling. Most previous studies investigating these effects in humans have been observational, with limited scope for causal inference due to issues of confounding and reverse causality, or are quasi-experimental (eg famine studies) with imprecise exposure measures and/or large losses to follow-up. EMPHASIS is a unique opportunity to test the developmental origins of health and disease (DOHaD) hypothesis [5, 6, 9] and its underlying mechanisms.

The two-country design has strong advantages. Findings replicated in both cohorts will provide persuasive evidence for globally-relevant mechanisms with implications for policy. Differences between the cohorts will delineate some of the complex interrelationships between ethnicity, environment, nutrition and epigenetics, highlighting important context-specific factors. The two trials have commonalities and differences. The intervention comprised multiple micronutrients in both (from foods in MMNP and tablets in PMMST); the quantities of micronutrients were about fourfold higher in PMMST, while the MMNP food-based supplements provided some nutrients not present in the tablets used in PMMST (e.g. fatty acids). The timing of the intervention differed (continued throughout pregnancy in MMNP, stopped in early pregnancy in PMMST). The baseline nutritional status of the populations differed (the mothers in India were thinner and shorter, and the babies more growth restricted, than in PMMST (Table 2)). Vitamin B12 deficiency is common in India but not in The Gambia; and seasonal variation in diets is more marked in West Kiang than in Mumbai. We therefore expect the findings to reflect these commonalities and differences, revealing both shared and cohort-specific effects.

Evidence of cross-tissue, genotype-independent stochastic variation in DNA methylation at nutrition-associated loci will provide strong evidence that these loci are MEs, programmed in the early embryo. Evidence of nutrition-related epigenetic programming at peri-conception with the potential to influence gene expression in multiple tissue types would be an important finding. Methylation data from two tissues of different developmental origin will give further information about their stability across populations. The potential utility of buccal cells to measure epigenetic changes will in future allow non-invasive testing at multiple points in the lifecourse and relationships with disease progression to be followed.

High resolution genomic data will enable the investigation of potential confounding effects due to mQTL, genetic variants that influence methylation [30,31,32,33,34]. This may be particularly relevant for cross-cohort replication where differences in genetic background between cohorts should be taken into account. The mQTL can also be used as genetic instruments for causal analysis using MR [27,28,29]. Our study design is particularly well suited for ‘two-sample’ MR where instruments (mQTL) are identified in one sample, and analysed for their association with phenotype in the other. A related and particularly powerful approach is to use existing large genome wide association study (GWAS) datasets with relevant phenotypes and a similar genetic background as the second cohort in a two-sample MR analysis.

A limitation of EMPHASIS is the relatively small size of the Gambian sample, reducing the power to detect small methylation changes. The lack of perinatal DNA samples, in both cohorts, for methylation assays makes it more difficult to establish evidence for the direction of causality when correlating methylation with birth outcomes, although causal analysis would be expected to help to in this respect.

South Asian and sub-Saharan African countries stand out on the world map of maternal undernutrition and micronutrient deficiencies, low birth weight and childhood stunting [35]. Despite large investment in supplementation programmes for pregnant women there has been slow progress in reducing intra-uterine growth restriction and stunting. The evidence that adult non-communicable disease (NCD) risk is increased by fetal and infant under-nutrition suggests that these persistent problems contribute to the high and rising burden of NCDs in these countries [36, 37]. EMPHASIS will improve understanding of the biology linking maternal nutrition to fetal development and later health, potentially leading to better interventions.