Genetic variants in the folate pathway and risk of childhood acute lymphoblastic leukemia
- First Online:
- Cite this article as:
- Metayer, C., Scélo, G., Chokkalingam, A.P. et al. Cancer Causes Control (2011) 22: 1243. doi:10.1007/s10552-011-9795-7
- 358 Views
Folate is involved in the one-carbon metabolism that plays an essential role in the synthesis, repair, and methylation of DNA. We examined whether child’s germline genetic variation in the folate pathway is associated with childhood acute lymphoblastic leukemia (ALL), and whether periconception maternal folate and alcohol intake modify the risk.
Seventy-six single nucleotide polymorphisms (SNPs), including 66 haplotype-tagging SNPs in 10 genes (CBS, DHFR, FOLH1, MTHFD1, MTHFR, MTR, MTRR, SHMT1, SLC19A1, and TYMS), were genotyped in 377 ALL cases and 448 controls. Log-additive associations between genotypes and ALL risk were adjusted for age, sex, Hispanic ethnicity (when appropriate), and maternal race.
Single and haplotype SNPs analyses showed statistically significant associations between SNPs located in (or adjacent to) CBS,MTRR, TYMS/ENOFS, and childhood ALL. Many regions of CBS were associated with childhood ALL in Hispanics and non-Hispanics (p < 0.01). Levels of maternal folate intake modified associations with SNPs in CBS, MTRR, and TYMS.
Our data suggest the importance of genetic variability in the folate pathway and childhood ALL risk.
KeywordsCase–control studyChildrenDNA methylationFolateGenetic polymorphismsLeukemia
Leukemia is the most common cancer worldwide in children (ages 0–14 years), with more than 2,600 cases diagnosed annually in the USA . The main histological type is acute lymphoblastic leukemia (ALL) representing 80% of all cases in the USA . The causes of childhood leukemia remain largely unknown, and, as for many cancers, genetic susceptibility combined with environmental exposures is likely to play a significant role in the mechanisms of childhood leukemia development .
Folate is a water-soluble B vitamin involved in one-carbon metabolism that plays an essential role in the synthesis, repair, and methylation of DNA. Folate metabolism provides one-carbon units necessary for the synthesis of nucleic acid bases and enables the conversion of methionine into S-adenosylmethionine (SAM), via its ability to methylate homocysteine. SAM is the universal methyl group donor in the majority of biochemical reactions including DNA methylation [4, 5]. Low folate intake or defects in folate metabolism may lead to DNA strand breaks, reduced DNA repair, and aberrant DNA methylation. Low levels of folate in utero have been hypothesized to increase the risk of cancer, including childhood leukemia [5–10].
Although there is growing evidence that genetic variants of methylenetetrahydrofolate reductase (MTHFR), a key enzyme in the regulation of folate metabolism, interact with folate bioavailability in the association with cancer risk , few studies on childhood leukemia have investigated gene-environment interactions, and those that have included limited information on folate intake and yielded conflicting results [9, 12, 13]. Moreover, these previous reports mainly focused on MTHFR polymorphisms [7–9, 12–19], and few studies have examined other genes in the folate pathway [10, 20–26].
Description of genes and SNPs included in the analyses (the Northern California Childhood Leukemia Study, 1996–2002)
GENE, HUGO nomenclature [aliases, where appropriate]
Function of the protein encoded by the gene
N. of SNPs selected for genotyping
SNPs excluded from analyses
CBS: cystathionine-beta-synthase [HIP4]
Catalyzes homocysteine to cystationine (first step of the transsulfuration pathway)
DHFR: dihydrofolate reductase
Converts dihydrofolate into tetrahydrofolate (methyl group shuttle required for the de novo synthesis of purines, thymidylic acid, and certain amino acids)
FOLH1: folate hydrolase (prostate-specific membrane antigen) 1 [PSM, FGCP, FOLH, GCP2, PSMA, Mgcp, GCPII, NAALAD1, NAALAdase]
Acts as a glutamate carboxypeptidase on different alternative substrates, including the nutrient folate; dysfunction may be associated with impaired intestinal absorption of dietary folates, resulting in hyperhomocysteinemia
MTHFD1: methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1, methenyltetrahydrofolate cyclohydrolase, formyltetrahydrofolate synthetase [MTHFC, MTHFD]
Catalyzes three sequential reactions in the interconversion of 1-carbon derivatives of tetrahydrofolate (substrates for methionine, thymidylate, and de novo purine syntheses)
MTHFR: 5,10-methylenetetrahydrofolate reductase (NADPH)
Catalyzes the conversion of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate (cosubstrate for homocysteine remethylation to methionine)
MTR: 5-methyltetrahydrofolate-homocysteine methyltransferase [MS, FLJ33168, FLJ43216, FLJ45386]
Catalyzes the final step in methionine biosynthesis
MTRR: 5-methyltetrahydrofolate-homocysteine methyltransferase reductase [MSR, MGC129643]
Regenerates a functional 5-methyltetrahydrofolate-homocysteine methyltransferase via reductive methylation (which becomes inactive due to the oxidation of its cob(I)alamin cofactor)
SHMT1: serine hydroxymethyltransferase 1 (soluble) [SHMT, CSHMT, MGC15229, MGC24556]
Catalyzes the reversible conversion of serine and tetrahydrofolate to glycine and 5,10-methylene tetrahydrofolate (this reaction provides one carbon units for synthesis of methionine, thymidylate, and purines in the cytoplasm)
SLC19A1: solute carrier family 19 (folate transporter), member 1 [CHMD, FOLT, IFC1, REFC, RFC1]
Main entry point of folate into cell; plays a role in maintaining intracellular concentrations of folate
TYMS: thymidylate synthetase [TS, TMS, Tsase, HsT422, MGC88736]
Catalyzes the methylation of deoxyuridylate to deoxythymidylate using 5,10-methylenetetrahydrofolate (methylene-THF) as a cofactor (this function maintains the dTMP (thymidine-5-prime monophosphate) pool critical for DNA replication and repair)
Materials and methods
This study was approved by the University of California, Berkeley Committee for the Protection of Human Subjects, the California Health and Human Services Agency Committee for the Protection of Human Subjects, and the Institutional Review Boards of the nine participating hospitals. Written informed consent was obtained prior to interview from the responding parent of each participating child.
Selected characteristics of participating cases and controls (the Northern California Childhood Leukemia Study, 1996–2002)
B-cell lineage: c-ALL
B-cell lineage: other
Age at diagnosis/reference datea (years)
At least 10
Maternal alcohol consumption in the 3 months before pregnancy
Less than 1 drink per month
1–3 drink(s) per month
1 drink per week
2–4 drinks per week
5–6 drinks per week
At least 1 drink per day
Maternal alcohol consumption during pregnancy
Less than 1 drink per month
At least 1 drink per month
Maternal folate intake from food and supplements 3 months before pregnancy (mcg DFEs/day)
Household annual income (USD)
Biospecimen collection and DNA processing
Buccal cells were collected as the primary DNA source for case and control children. Buccal cytobrushes were collected at the time of interview by trained interviewers. DNA from cytobrush samples was extracted by heating (98–100°C) in the presence of NaOH, followed by neutralization with Tris–HCl buffer, and whole-genome amplification (WGA) using GenomePlex reagents (Sigma–Aldrich, St. Luis, MO). In the event that buccal cell DNA was insufficient for genotyping (26.6% of subjects), archived newborn blood (ANB) specimens were used as a secondary DNA source. ANB specimens are collected at birth on a paper card for each child born in California and archived at −20°C by the California Department of Public Health. The NCCLS receives one spot containing approximately 60 microliters of blood, per child. A small piece of the bloodspot was excised, and DNA was extracted using the QIAamp DNA mini-extraction kit. Isolated ANB DNA was whole-genome amplified using REPLI-g reagents (Qiagen, Hilden, Germany). WGA products were tested for minimum acceptable amplifiable human DNA content using an ALUq real-time PCR method published elsewhere . When analyzed using multiplexed GoldenGate genotyping (Illumina, San Diego, CA), whole-genome amplified DNA from both buccal cells and ANB specimens yielded genotypes that were highly concordant with those from genomic DNA from peripheral blood [31, 32].
SNP selection and genotyping
Within the context of a study of candidate pathways, we selected 89 SNPs in 10 folate metabolism genes (CBS, DHFR, FOLH1, MTHFD1, MTHFR, MTR, MTRR, SHMT1, SLC19A1, and TYMS) (Table 1) on the basis of two criteria: (i) SNPs that result in amino acid changes and (ii) SNPs that tag haplotype blocks, or groups of linked SNPs, within regions encompassing the target genes and 10,000 base pairs up- and down-stream of the gene. Using HaploView , in conjunction with SNP data from both the 30 Caucasian trios in the HapMap project  (Release 19, Build 34) and the 23 Hispanics in the SNP500Cancer project , we applied the method of Gabriel et al.  to select haplotype-tagging SNPs (htSNPs) that captured at least 80% of the haplotype diversity for common haplotypes (>5% frequency) in the Caucasian or Hispanic populations.
In addition, 82 ancestry informative markers (AIMs) for defining European versus African versus Native American ancestral origin were selected for genotyping. Genetic ancestry was estimated using a maximum likelihood approach as described by Chakraborty et al.  and Hanis et al. . Ancestral population AIM allele frequencies were input along with the genotypes of the admixed participants to improve ancestry estimation. Genotyping of 385 cases and 456 controls was performed using a custom Illumina GoldenGate genotyping panel (1536 SNPs). Across this panel, we used a GenTrain cutoff of 0.25, a SNP-wise call rate threshold of 90% and a subject-wise call rate threshold of 95%. Quality of genotyping was verified by comparing duplicate samples: (i) 59 samples were run in duplicate after processing with the same WGA method and typed on the same plate; these showed a 0.88% discordance of genotype; (ii) DNA specimens extracted from both buccal cell and ANB spots were genotyped for 9 subjects and showed a 1.05% discordance of genotype. In addition, we estimated the Mendelian errors in 10 trios from the HapMap Centre d’Etude du Polymorphisme Humain (CEPH), and only 28 pedigree errors in 25 markers were found (overall Mendelian error rate = 0.2%).
Out of the 79 SNPs that satisfied the call rate threshold described above, two were excluded because they departed from Hardy–Weinberg equilibrium (p < 0.01) in both Hispanic and non-Hispanic controls, and one SNP was excluded because the minor allele frequency was less than 5% in both Hispanic and non-Hispanic controls (Table 1). In addition, eight cases and eight controls were below the subject-wise call rate threshold, leaving a total of 76 SNPs in 377 ALL cases and 448 controls available for analysis.
Diet and alcohol intake assessment
Details of data collection have been published elsewhere . In brief, maternal folate intake estimates were assessed using a modified version of the Block Food Frequency Questionnaire (FFQ) administered during the in-home interviews to assess the pre-pregnancy diet of the mother. This period was chosen rather than diet during pregnancy because it represents the probable state of nutritional adequacy at the time of conception and during early pregnancy. Daily folate intake in micrograms per day of dietary folate equivalents (mcg DFEs) was calculated as a composite variable of vitamin/supplement use and dietary intake (folate from natural source and from fortification when it was in effect), taking into account all food items, the reported portion sizes, the folate content per 100 g, and the bioavailability corresponding to the source of folate (USDA SR 16-1, 2004) (Table 2).
Alcohol consumption was also measured through the FFQ. Consumption of different types of alcohol (beer, wine or wine coolers, and hard liquor or mixed drinks) was assessed in frequency before and during pregnancy, and combined variables were derived for total alcohol consumption (number of drinks per month, week, or day) (Table 2).
We used unconditional logistic regression models to estimate the odds ratios (ORs) associated between childhood ALL and genetic polymorphisms, maternal folate intake, and alcohol consumption, adjusting for child’s age at diagnosis/reference date, sex, Hispanic ethnicity (for analyses including all subjects), and maternal race. These models employed the following variables of interest: none, one or two copies of the variant allele for SNPs; folate intake using continuous or categorical variables (below or above the median of 430 mcg DFE per day in the control population for folate intake); alcohol consumption using ordinal categories (<1 drink per month, 1–3 drink(s) per month, 1 drink per week, 2–4 drinks per week, 5–6 drinks per week, or at least 1 drink per day) or two categories (<3 drinks per month, at least 3 drinks per month). Analyses involving folate or alcohol consumption were adjusted for household income. No statistically significant main effects were detected between the risk of ALL and maternal folate intake (ORcontinuous = 1.01 per 100 DFE unit, p = 0.65; and ORcategorical = 1.29 for high versus low folate level, p = 0.13) and alcohol consumption before pregnancy (ORordinal = 0.98, p = 0.63; and OR1 + drink per week versus less = 0.90, p = 0.56), or alcohol consumption during pregnancy (OR1 + drink per month versus less = 0.86, p = 0.66). No statistically significant interaction was reported between folate (as a continuous variable) and alcohol intake (ever/never) variables before (p = 0.21) or during (p = 0.41) pregnancy. These environmental factors were only used to test for interaction with genetic factors thereafter, using the Wald chi-square test. Interactions between SNPs and folate intake were estimated after adjustment for alcohol consumption, and interaction with alcohol consumption was estimated after adjustment for folate intake.
Nominal log-additive p values were calculated for all 76 SNPs for all subjects together and separately for Hispanic and non-Hispanic subjects. To test for heterogeneity between ethnic groups, a p value for the multiplicative interaction term between the genotype and Hispanic ethnicity was calculated for each SNP, using the Wald chi-square test.
We used a “sliding window” approach to conduct systematic analyses of haplotypes, with a “window” defined as 2–5 contiguous SNPs . For each possible window, haplotype frequencies were computed, and a global score statistic (Obs) was calculated. We then calculated 1,000 simulated global score statistics (Simi) by randomly permuting case/control status. The p value for the window was defined as the number of times Simi was greater than Obs, divided by 1,000. These analyses were conducted separately in Hispanics and non-Hispanics when there was a statistically significant interaction between at least one SNP within the gene and Hispanic ethnicity (p < 0.05).
Haplotype blocks with the lowest p value across all sliding windows (including single SNP windows) were selected for additional haplotype analyses. Analyses to examine specific haplotypes in the significant genomic regions were performed with haplotype trend regression techniques to calculate ORs associated with each copy of a specific haplotype using the most frequent haplotype as the referent group .
Adjustment for genetic ancestry derived from AIMs, in addition to maternal race, did not change the risk estimates associated with SNPs by more than 10% and was therefore not included in the final models for any SNP or haplotype.
Thirteen children with Down syndrome were excluded from all the analyses of SNPs located on chromosome 21 (gene CBS and SLC19A1). Analyses were implemented using SAS 9.1.3 and R 2.4.1 (Haplo.stats and SNPassoc packages). Graphical representations of the sliding window results were constructed using GrASP .
Association between the risk of childhood acute lymphoblastic leukemia and SNPs in the folate pathway genes, overall and by Hispanic ethnicity (the Northern California Childhood Leukemia Study, 1996–2002)
Odds ratio estimates for SNPs located in coding regions of genes (the Northern California Childhood Leukemia Study, 1996–2002)
OR (95% CI)a
OR (95% CI)a
OR (95% CI)a
Haplotype trend regression results (the Northern California Childhood Leukemia Study, 1996–2002)
OR (95% CI)a
CBS (rs400660, rs8128028, rs11909493, rs719037)—Hispanics
Global p: 0.01
CBS (rs400660, rs8128028, rs11909493, rs719037)—Non-Hispanics
Global p: 0.04
CBS (rs4920037, rs234705, rs2851391, rs234715, rs9982015)—Hispanics
Global p: 0.06
MTHFD (rs3783731, rs1950902, rs11627525)—Hispanics
Global p: 0.04
MTHFR (rs4846052, rs9651118)—Both ethnic groups
Does not converge
Global p: 0.02
MTRR (rs162037, rs2287779, rs10380)—Hispanics
Global p: 0.04
TYMS (rs3744962, rs11081251, rs2260821, rs495139)—Both ethnic groups
Global p: 0.04
Other gene regions, which did not include significant single SNPs, were associated with childhood ALL (minimum p value <0.05) as shown in Fig. 1 (“sliding window” analyses) and Table 5 (haplotype trend regression analyses). These gene regions included (1) the haplotype block tagged by rs3783731 to rs11627525 in MTHFD1 among Hispanics, where associations were mainly driven by the rare haplotypes; (2) the haplotype block tagged by rs4846052 and rs9651118 in MTHFR, where the GG haplotype appears to be slightly more prevalent in controls compared to cases (p = 0.12); (3) the haplotype block tagged by rs162037 to rs10380 in MTRR in Hispanics, where the GAG haplotype was associated with an increased risk of childhood ALL; and (4) the haplotype block tagged by rs3744962 to rs495139 in gene TYMS/ENOFS1, where the AAAC haplotype conferred a two-fold increased risk of childhood ALL.
Interactions between child’s SNPs in folate pathway genes and maternal folate intake (the Northern California Childhood Leukemia Study, 1996–2002)
SNP rs number
Odds ratiosa (95% CI)
p for interactionb
Maternal folate intake less than 430 mcg DFEs per day (n = 152 ca. and 204 co.)
Maternal folate intake at least 430 mcg DFEs per day (n = 196 ca. and 208 co.)
We examined the relationship between 76 SNPs in 10 genes involved in the folate pathway (i.e., CBS, DHFR, FOLH1, MTHFD1, MTHFR, MTR, MTRR, SHMT1, SLC19A1, and TYMS) and the development of childhood ALL. Our single SNP analyses did not provide evidence for an association with the most commonly studied MTHFR polymorphisms, i.e., 1298A>C (rs1801131) and 677C>T (rs1801133). We observed associations between childhood ALL and single SNPs in CBS and TYMS. Haplotype blocks in CBS, MTHFD1, MTRR, and MTHFR, as well as blocks just outside CBS and TYMS, were associated with ALL risk in children. Analyses conducted separately for Hispanic and non-Hispanic children showed differences in risk, especially for CBS and MTRR. In a few instances, associations with SNPs in CBS, MTRR, and TYMS—but not MTHFR—vary by level of maternal folate intake around the time of conception/early pregnancy.
Previous studies examining the association between childhood leukemia and MTHFR variants 1298A>C or 677C>T and self-reported maternal folate intake have shown positive, negative, or null associations [7–10, 12–16, 18–24]. The lack of associations with 1298A>C and 677C>T variants in our series is consistent with results from a comprehensive meta-analysis where the pooled OR for 677C>T variant was 0.88 (95% CI: 0.73–1.06; p = 0.18) and the pooled OR for 1298A>C variant was 0.80 (95% CI: 0.56–1.16; p = 0.24), suggesting that these variants are not likely to contribute to childhood ALL risk . It is well established that the 677C>T variant modulates folate and homocysteine levels, while the biological significance of 1298A>C polymorphism has been questioned as it does not appear to influence folate and homocysteine levels in populations mildly deficient in folate . Alternatively, 1298A>C polymorphism may only have an effect among subjects with low folate level [9, 23]. Our null findings for the 1298A>C and 677C>T variants, as well as the lack of interaction between MTHFR variants and maternal folate intake, may be due to the increase in folate serum levels among women of childbearing age since the early 1990s in the USA . A case-only study in Australia  found no evidence of a multiplicative interaction between MTHFR variants 1298A>C and 677C>T and self-reported maternal folate supplementation during pregnancy, although this finding was based on small numbers. Similarly, one study in Germany  reported no association between childhood ALL with these MTHFR variants overall and after taking into account calendar years when folate supplementation during pregnancy was implemented. These observations contrasted with positive findings reported earlier in a Canadian study .
To our knowledge, no studies of childhood leukemia to date have examined SNPs in the MTHFR other than the 1298A>C (rs1801131) and 677C>T (rs1801133) variants. We examined eight tagging SNPs in MTHFR and report an association between childhood ALL and single SNP rs1537515 (p = 0.008) located in the 3’UTR regulatory region of the gene. This association, however, did not persist through increasingly larger haplotype windows. The haplotype analyses revealed that the block composed of rs4846052 and rs9651118 was also associated with childhood ALL risk (minimum p value <0.05), although no specific haplotype could be identified as responsible for this association.
With the exception of MTHFR, few studies have examined the role of other key genes in the folate pathways [10, 20–23, 25]. We report associations between childhood ALL and single and haplotype-tagging SNPs, inside and outside the boundaries of CBS localized on chromosome 21 (minimum p < 0.01). While no conclusions in terms of causality can be drawn from observations derived from non-synonymous SNPs, our data provide useful information for future validation studies. Several CBS gene variants have been previously associated with adult cancers at various sites, i.e., brain , digestive tract [46, 47], and lung . The CBS enzyme is involved in the trans-sulfuration pathway, inducing irreversible catalysis of homocysteine to cystathionine. As a result, overexpression of CBS decreases levels of homocysteine inducing functional folate deficiency (“folate trap”). Because homocysteine is at the intersection of the DNA synthesis and repair pathway and the DNA methylation pathway, overexpression of CBS may lead to alteration in all these functions . CBS is over-expressed in children with trisomy 21 (Down syndrome) , and it has been postulated that the cellular lesions caused by the “folate trap” may contribute to leukemia risk in children with Down syndrome. There were only 13 children with DS in our series; these were excluded from our analyses of genes localized on the chromosome 21. It will be of interest in future studies to consider possible interactions of CBS SNPs with risk for cytogenetic subtypes of ALL containing additional copies of chromosome 21. A clue to the possible functional properties of CBS SNPs might be finding that specific CBS alleles are overrepresented among the additional chromosome 21 within hyperdiploid leukemias, thus suggesting a selection for CBS SNP variants during leukemogenesis. In addition to comparing germline and leukemic CBS genotypes, future studies in our group are underway to examine the correlation between CBS gene variants, folate intake, cytogenetic markers, and methylation profiles.
The SNPs in SLC19A1, a folate transport gene located on chromosome 21 and close to CBS, were not associated with childhood leukemia. However, coverage of that gene was limited. The enzyme regulated by MTRR maintains the MTR enzyme in its active form, which is essential for maintaining an adequate level of methionine, a precursor of the universal methyl group donor SAM. We observed a statistically significant increased risk of childhood ALL in children carrying the A variant of SNP rs2287779 (L412L) in exon 9 of MTRR, where the association appears to be limited to Hispanic children. In contrast to recent studies [10, 20, 21], we did not report association with rs1801394 (I49 M) in MTRR.
We reported associations between childhood ALL and intronic SNPs in ENOFS1, a gene regulating the expression of TYMS in the DNA synthesis pathway. One childhood leukemia study examined two polymorphisms (2R>3R and 1494del6) of TYMS and found no difference in haplotype distribution between ALL cases and controls . Lower risks of childhood ALL were reported in Malay carriers of both the 3′-TYMS 1494 -6p/-6p genotype and SLC19A1 80G>A (rs1051266) . No statistically significant associations were seen between the TYMS 6 bp and 2R>3R polymorphisms and adult ALL, while a decreased risk of adult AML was reported for the TYMS 6 bp polymorphism . Although single SNPs in MTHFD1 were not associated with childhood ALL in our series, haplotype analyses suggest an association with the gene region tagged by rs3783731 to rs11627525. No single SNP or haplotype in the SHMT1 and MTR genes was associated with childhood ALL risk in our study. Null findings were also previously reported for MTHFD1 1958 G>A (rs2236225) , MTHFD1 401 G>A (rs1950902) , TYMS 28-bp  SHMT1 1420 C>T (rs1979277) [10, 20, 23], and MTR 2756A>G (rs1805087) [10, 20] polymorphisms. In contrast, a large study in the UK reported an association between MTR 2756A>G and ALL and AML in children, specifically in the presence of MLL chromosomal abnormality . We reported that associations between several folate pathway genes and risk of childhood ALL varied by levels of maternal folate and alcohol intake. Although similar findings have been observed for various solid cancers in adults [51–54], no previous studies of childhood leukemia have comprehensively examined gene-environment interaction.
The current study presents several strengths and limitations. Haplotype analyses using tagging SNPs provide useful information on gene regions possibly involved in the etiology of childhood leukemia, but are not sufficient to establish causality. Although coverage of the folate genes in our study is not complete, it is more comprehensive than most investigations of this pathway in childhood ALL studies to date. Our analyses were initiated based on a priori hypotheses about the role of genes in folate metabolism pathways. This candidate gene approach, however, still suffers from the possibility of false positive reporting as none of the observed single SNP associations remained statistically significant after strict multiple testing correction using the Bonferroni method (results not shown). In contrast to single SNP analyses, the “sliding window” haplotype analyses have the advantage of providing additional power for mapping disease genes by examining larger sets of neighboring SNPs . Most individual epidemiologic studies of childhood leukemia are hampered by low statistical power. To overcome this limitation, the Childhood Leukemia International Consortium (http://clic.berkeley.edu) was recently established to conduct pooled and replication studies of genetic and environmental factors with greater sample size within the Consortium.
Our study population is unique in that it includes approximately 40% Hispanics. We observed several differences by Hispanic ethnicity in the association between several SNPs and childhood ALL, but the underlying reasons remain unclear. The analyses of ancestry informative markers in our series showed that potential confounding by population stratification was very low, likely due to close matching on race and ethnicity. We used all available information to select haplotype-tagging SNPs based on ethnic groups, but this was not possible for all SNPs. As a result, coverage of genetic variation by the selected SNPs might be unequal in the two populations, limiting our ability to interpret the observed ethnic differences. Haplotype-tagging SNPs have been mostly defined in US residents from northern and western European ancestry, such as in the HapMap project, and any given haplotype may be more or less common depending on the population. Finally, findings by ethnic group may be due to chance alone due to low statistical power for stratified analyses, and therefore, need to be replicated.
In our study, annual household income, a potential confounder of alcohol and/or folate intake, was lower in cases compared to controls. We previously assessed whether this observation may be due to selection bias , using information available on California birth certificates to compare characteristics such as parental age and education and maternal reproductive history between participating controls (i.e., whose location was successfully traced and provided consent) and “ideal” controls from the birth registry (with no tracing effort). We found little difference in socio-demographic characteristics between the two , suggesting that participating controls are representative of the study population base, therefore, reducing the potential for selection bias. In the current study, the amount of maternal alcohol consumption before and during pregnancy increases with the level of household income (p < 0.01). Although we adjusted for this possible confounder, we cannot rule out residual confounding by socio-economic status. In contrast, folate intake was not associated with household income (p = 0.82).
Estimates of maternal folate intake were derived from a detailed food frequency questionnaire with information on dietary and supplemental source 3 months prior to conception, as a surrogate for “folate environment” of the fetus during early pregnancy. No data on folate supplementation during the entire pregnancy were available for this analysis. Inherent to the case–control design, collection of self-reported information may suffer from recall bias. The public knowledge of “healthy” diets and lifestyles and their impacts on disease prevention may have led to differential recall between case and control mothers. However, the direction of a possible bias is difficult to evaluate since both over- and under-reporting may have occurred. The lack of association between maternal intake of alcohol and folate and childhood leukemia in our series may be due to non-differential misclassification of diet between case and control mothers, leading to risk estimates biased toward the null . However, the data collection methods and estimation of maternal folate level were similar between cases and controls. An alternative explanation is that folate levels are high in the California population  so that lack of variation in exposure may have affected our ability to identify possible associations or interactions.
In conclusion, our data do not support associations between childhood ALL and SNPs or haplotypes in the MTHFR gene. The strongest associations were found for the CBS gene, with SNPs located in or adjacent to the gene region. There were also suggestions of associations with haplotype blocks in genes MTRR, and TYMS/ENOFS1.
This research could not have been conducted without the strong support from our clinical collaborators and participating hospitals which include: University of California Davis Medical Center (Dr. Jonathan Ducore), University of California San Francisco (Dr. Mignon Loh and Dr Katherine Matthay), Children’s Hospital of Central California (Dr. Vonda Crouse), Lucile Packard Children’s Hospital (Dr. Gary Dahl), Children’s Hospital Oakland (Dr. James Feusner), Kaiser Permanente Sacramento (Dr. Vincent Kiley), Kaiser Permanente Santa Clara (Dr. Carolyn Russo and Dr. Alan Wong), Kaiser Permanente San Francisco (Dr. Kenneth Leung), and Kaiser Permanente Oakland (Dr. Stacy Month), and the families of the study participants. We also acknowledge our collaborators at the California Department of Public Health, and the entire Northern California Childhood Leukemia Study staff for their effort and dedication. Financial support: Children With Leukemia, UK, grants 2005/027 and 2006/051; National Institute of Environmental Health Sciences, grants P42-ES04705 and R01 ES09137. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Environmental Health Sciences.
Conflict of interest
The authors declare no competing financial interests.