A substantial evidence base, particularly from in vivo toxicology studies, has demonstrated a causal relationship between metal combinations on human health. Ex vivo studies have shown that high concentrations of metal mixtures are associated with cognitive impacts in the mammalian brain, including mercury (Hg), arsenic (As), cadmium (Cd), lead (Pb), and manganese (Mn) (Andrade & Aschner, 2017; Andrade et al., 2015; Cobbina et al., 2015; Rai et al., 2010). The mechanisms of action have been demonstrated through oxidative stress, inhibition of acetylcholine esterase, and effects on the nervous system and impacts on the hematopoietic system (Lin et al., 2016). Exposure science has been focusing on metal mixtures and other environmental factors and assessing their impact on human health outcomes (NIEHS, 2021). This investigation focused on one type of biologic change in genes that is associated with exposure to at least six metals [copper (Cu), molybdenum (Mo), arsenic (As), manganese (Mn), lead (Pb), and zinc (Zn)] in the air and soil, associated with mining.

Epigenetics, the heritable gene expression changes without altering the DNA sequence, plays essential roles in genomic imprinting, embryonic cell differentiation, postnatal development, and aging (Avgustinova & Benitah, 2016; Ferguson-Smith, 2011; Lessing et al., 2013; Lin et al., 2016; Lunyak & Rosenfeld, 2008; NIEHS, 2021). As one of the major epigenetic mechanisms, methylation of DNA is caused by the addition of a methyl (CH3) group at the carbon 5 position in the sequence 5'CpG3' to the cytosine ring, activated by the enzyme DNA methyltransferase. Methylation plays a vital role in regulating gene expression, as hypermethylation inactivate gene expression can block the bond of the transcriptional or repressor factors to their recognition sites (Razin & Cedar, 1991). Studies have shown that environmental exposures (such as to arsenic or cadmium) can disrupt gene expression, and the effects could be mediated at least partially by methylation (Andrew et al., 2008; Arita & Costa, 2009; Reichard & Puga, 2010). Importantly, both inactive and active epigenetic states of genes are inherited (Heard & Martienssen, 2014; Skvortsova et al., 2018). It is emerging that metals’ neurotoxicity can have dramatic consequences that can last a lifetime through epigenetic dysregulation (Cardenas et al., 2017; Senut et al., 2012). These consequences could affect future generations. Extensive animal models augment this premise (Skinner et al., 2013; Vassoler et al., 2014; Wolff et al., 1998).

Active mine sites and historic mining, including Superfund areas, are the nation’s most contaminated areas for metals. The residents of Butte Montana, in Silver Bow County have exposure to active copper and molybdenum mining occurring at an extremely large open pit, located just northeast of the town of Butte. Mining for copper (Cu) and molybdenum (Mo) has been ongoing for the past twenty years, with As, Mn, Pb, and zinc (Zn) also extracted as part of the process (Andrade & Aschner, 2017) Butte residents are also exposed to numerous underground historic mining sites, including the Silver Bow Creek/Butte Superfund Area (EPA, 2020; Schecher & McAvoy, 1992), which are proximal to the mine and the town of Butte. It is unknown to what degree the mine adds to or separately contributes to human health dangers, over and above exposures from the historic sites. This study focused on the biologic differences among mother child pairs, by comparing the methylation sites in mother–child pairs living in Butte and potentially exposed to pollution from active and historic mining, compared to mother child pairs who live in Anaconda where there is no active mining but there is a large historic smelting operation that is a Superfund site, and to mother–child pairs from Missoula, over 100 miles west of Butte and Anaconda, where there is no active or historic mining.

Method

Human subject considerations

This study received Expedited review, based on Categories 3 and 7, and it was approved by the University of South Carolina Institutional Review Board, where the data analysis occurred. The study recruiter discussed the study with the mother in each family unit. The mother then read and signed an informed consent form that explained the study purpose and protocol, and informed that family that the mother would receive a $20 gift card and the child would receive a $10 gift card. Participating children over the age of 5 years were read an assent script that summarized the purpose and the protocol.

Recruitment

The Montana-based recruiter identified and recruited mother–child pairs from three Montana towns. There were four pairs from Butte, four pairs from Anaconda, and four pairs from Missoula. This was a convenience sample, and the recruitment was conducted through personal contacts, flyers posted at appropriate venues, social media announcements, and at community gathering areas and events. The target population was children and their mothers who reside in either Butte, Anaconda, or Missoula, Montana. We included some children who have been placed in special education and others who were in regular education classes. The children were ages 4–11 years. We did not discriminate by race or ethnicity, but as expected, most were Caucasian, like almost 90% of the population of Montana. We collected a buccal sample from the mother of each child, with whom the child resided. The exclusion criteria were families who have moved to Anaconda or Butte or Missoula after the child was born and/or children who have a known genetic condition, such as Down’s syndrome or Fragile X syndrome. Mothers read and sign an informed consent document that clearly explained the study along with potential risks.

Data collection

For each mother–child pair, we collected the following:

  1. 1.

    Maternal survey, which included information about where the child lived from the time that the child was in utero to the current time, as well as the child’s school placement and some limited health information.

  2. 2.

    Buccal sample from mother and child.

The procedures that we used to collect the buccal cells are the standard sample collection procedures provided by the manufacturer of the swab kits, ORAcollect. We utilized their OCD-100/OCD-100A swab kit for the adult participants and the OC-175 swab kit for the child participants, as is recommended. Prior to buccal cell collection, the subjects were advised to rinse their mouth thoroughly with water to remove unwanted debris. Exfoliated buccal cells were obtained from the mother by gently rubbing the inside of both cheeks with a soft foam brush for one minute each. The brush was then placed in the collection tube, and the preservative was added. For the child, buccal cells were collected by saturation of a foam brush with saliva, and the brush was compressed to force the sample into a collection tube. The process is repeated until an adequate sample was collected, and then the preservative was added. The entire collection process took between 5 and 10 min.

The tubes containing the buccal cell samples were shipped in one batch to a commercial laboratory for DNA extraction and assessment of methylation. DNA from buccal samples was extracted and measured by AKESOgen, Inc., GA. Picogreen assay (Invitrogen™, Life Technologies Corporation, NY) and Bioanalyzer assay (Agilent Technologies, Inc. CA) were used to control DNA quality. Methylation levels of more than 850,000 methylation sites were measured using the Human Infinium MethylationEPIC BeadChip (Illumina, Inc. CA).

Data processing and analysis

We used the R package minfi (Fortin et al., 2017) to perform methylation data processing, including probe annotation by mapping probes to genome using the Illumina human methylation EPI Canno.ilm10b4.hg19 R package, quality control including removing large detection p-values (> 0.01), background correction by thresholding probes with values smaller than −3 of standard deviation from the median in log2 scale, followed by subset-quantile within array normalization where probes are stratified by region (CpG island, shore, etc.). Given that common SNPs may affect the methylation measurement, we removed sites from analysis if they contain SNPs at the CpG interrogation or at the single nucleotide extension. For each probe, Beta and M values were calculated for methylation level quantification. Specifically, for probe \(i\) with methylation signal \({\gamma }_{i,meth}\) and unmethylation signal \(\gamma_{{i,{\text{unmeth}}}}\), \(M_{i} = \log_{2} \left( {\frac{{{\text{max}}\left( {\gamma_{{i,{\text{meth}}}} ,0} \right) + 1}}{{{\text{max}}\left( {\gamma_{{i,{\text{unmeth}}}} ,0} \right) + 1}}} \right)\) and \({\text{beta}}_{i} = \frac{{{\text{max}}\left( {\gamma_{{i,{\text{meth}}}} ,0} \right)}}{{{\text{max}}\left( {\gamma_{{i,{\text{meth}}}} ,0} \right) + {\text{max}}\left( {\gamma_{{i,{\text{unmeth}}}} ,0} \right) + 1}}\).

To investigate the overall patterns inherited in samples, principal component analysis (PCA) was conducted on the centered and scaled M values of all probes. Patterns were visualized in the PCA plot of the first three principal components. Further, for each mapped probe, we compared samples from Butte to those in Anaconda and Missoula on M values, which are more statistically valid in linear model-based differential analyses than beta values (Du et al., 2010). To account for group random effects from family and site, a linear mixed model also fitted as

$$ M_{i} = \beta_{i0} + \beta_{i1} *{\text{Butte}}_{j} + \beta_{i2} *{\text{mom}}_{j} + \alpha_{i1} *{\text{family}}_{j} + \alpha_{i2} *{\text{town}}_{j} + e_{ij} $$

where \(i\) denotes probe, \(j\) denotes sample, \({\text{Butte}}_{j} = 1\) for samples from Butte and 0 for other regions. \({\text{mom}}_{j} = 1\) for moms and 0 for children. \({\text{family}}_{j}\) and \({\text{town}}_{j}\) encode which family and town sample \(j\) comes from. \(\beta_{i0} ,\beta_{i1} ,\beta_{i2}\) are fixed effects, \(\alpha_{i1} ,\alpha_{i2}\) are random effects where \(\alpha_{i1} \sim N\left( {0,\sigma_{i1}^{2} } \right)\), \(\alpha_{i2} \sim N\left( {0,\sigma_{i2}^{2} } \right)\) and \(e_{{{\text{ij}}}}\) is residual and \(e_{{{\text{ij}}}} \sim N\left( {0,\sigma_{{{\text{ie}}}}^{2} } \right)\). Differential methylation patterns were identified between the study groups: (1) Butte versus the other two sites; and (2) mothers versus children. Further, hierarchical clustering and heatmap were used to visualize the profiles (beta values, the percentage of methylated molecules) of top 50 differential methylation probes of known genes in samples. Note that the beta values were scaled and centered for each methylation probe. Also, gene set enrichment analysis (GSEA) via MSigDB (Liberzon et al., 2011) was performed to explore the involved biological pathways and functions. The enrichment of 200 genes which has top differentially methylated loci in the Hallmark and KEGG gene sets was assessed, and significantly enriched gene sets were identified with q-value < 0.05.

Collection of reported autism spectrum disorders (ASD)-associated genes

The ASD-associated genes identified from 42 genome-wide association (GWAS) studies were obtained from GWAS Catalog (Buniello et al., 2019) (trait label: autism spectrum disorder; trait ID: EFO_0003756). ASD-associated genes were also obtained from MalaCards (Rappaport et al., 2017), which inferred gene-disease connections by integrating 73 data sources.

Results and discussion

Demographic differences between sites

We were able to get buccal samples from 23 individuals. There were eleven maternal-child pairs (22 individuals) and one child (maternal sample was not available). The characteristics of the mother–child pairs are shown in the top part of Table 1. The age and grade level of the children, and the residences since birth, are not substantially different in the three locations. The medical and educational diagnoses provided by the mothers all suggest neurodevelopmental differences in 50% of the children who participated in the study. Two of the four children from Butte, one of the four children from Anaconda, and three of the four children from Missoula had a neurodevelopmental diagnosis.

Table 1 Maternal responses to survey administered for the Montana metals exposure study (n = 12 mothers) and community characteristics where the mother–child pairs resided

We also gathered data from two publicly available datasets. In the second section of Table 1, the first three rows show the 2010 US Census data for the three Montana counties. We used the 2010 census data to compare the three counties: Silver Bow county (including Butte), Deer Lodge county (including Anaconda), and Missoula county (including Missoula) since the Behavioral Risk Factor Surveillance System (BRFSS) data shown in the lower part of Table 1 represent data from 2009 for Butte and 2012 for Missoula. The three towns are very different in terms of population but their age and race distributions are similar. The BRFSS data, shown in the final section of Table 1, are based on self-report on a nationally standard telephone survey. The Montana BRFSS aggregated data for Standard Metropolitan Statistical Areas (SMSA) and since Anaconda has such a small population, it is not a SMSA, and therefore, these data are not available. The behavioral risk factors selected are shown for Butte SMSA and Missoula SMSA in Table 1. They are similar for alcohol use, smoking, overall health, and chronic disease.

Physical location of the three sites

A map of western Montana is shown in Fig. 1. The map shows the location of the three cities: Butte (in Silver Bow county), Anaconda, (in Deer Lodge county), and Missoula (in Missoula county). As observed, Missoula is approximately 100 miles to the west of Anaconda and Butte in far western Montana.

Fig. 1
figure 1

Map of Montana: cities of Butte, Anaconda and Butte

Mother–child differences in methylation

For each sample, methylated and unmethylated signals of 1,051,815 probes were acquired, in which 865,859 probes were mapped to the hg19 genome. After filtering probes with large detection p-values and background correction, 861,890 probes were kept. Further, 29,360 probes in the SNP site were removed, and M values and beta values of 832,530 probes were retained for downstream analysis.

Principal components analysis was performed on M values, and the top one PC (PC1) clearly distinguish children from mothers (Fig. 2A, C), which is well-aligned with the knowledge that aging strongly affects methylation. Moreover, the mother–child correlation of PC1 was observed showing that mothers with high PC1 scores tended to have children with high PC1 scores, reflecting the extensively reported epigenetic inheritance across generations.

Fig. 2
figure 2

Principal component analysis of methylation of maternal and child buccal samples, from Butte, Anaconda, and Missoula. Samples were colored by mothers (mom) and children (Child) in (A, C), or locations in (B, D). Families were connected by lines in (A, C)

We also compared mothers to children and identified their differentially methylated probes. A distinct methylation pattern was observed between mothers and children from the top 50 differentially methylation loci of known genes (Figure S1A). GSEA analysis found that signaling of KRAS, apical junction, TNF-α via NFκB, and TGF-β were upregulated, while UV response-related genes were downregulated (Figure S1B). This may show the underlying mechanism of aging related to the increased risk of uncontrolled cell proliferation and weaken ability of DNA repair and cell apoptosis. Involved in KRAS, TNF-α, and TGF-β pathways (Figure S1C), ID2 may play a pivotal role in the aging effect via epigenetic change, which warrant further investigation. This observation may link to the finding of a previous animal study that aging upregulated the expression of ID2 and other apoptosis inhibiting genes, and potentially enhanced the apoptosis resistance of CD8 central memory T cell. (Yang et al., 2021).

Butte differs from Anaconda and Missoula

This pilot study identified a pattern of DNA methylation that distinguished the mother–child pairs in the Butte area compared to the pairs from Anaconda and Missoula. Interestingly, the second PC (PC2) showed a significant difference between Butte compared to Missoula and Anaconda (two-sided Wilcoxon rank sum exact test, p-value = 0.027), which is shown in Fig. 2B, D. No significant difference was observed between Anaconda and Missoula samples.

Further, we identified differential methylation between Butte and other sites in all samples by fitting a random effect model described in the Methods section. The top 50 differentially methylated loci of known genes distinctly showed higher or lower methylation in samples from Butte as compared to those from other sites (Fig. 3A). The top 200 genes harboring hypermethylated loci were significantly enriched in pathways of actin cytoskeleton regulation, ABC transporters, leukocyte transendothelial migration, focal adhesion, and adherens junction (Fig. 3B). Disturbance of these pathways have been reported in ASD pathogenesis studies, (Folmsbee et al., 2016; Griesi-Oliveira et al., 2018; Iritani et al., 2018; Nazeen et al., 2016; Onore CE et al., 2012), and thus environmental exposure in Butte may link to ASD through methylation dysregulation. Involved in four of these five pathways (Fig. 3C), ROHA and RAC1 encodes proteins which play important roles in neural development. RohA is required for neuroblast proliferation (Lee et al., 2000), and Rac1 is essential in axon growth and guidance and in neuronal survival (Hua et al., 2015).

Fig. 3
figure 3

Differential methylation between samples from Butte and other two sites. A Heatmap of the top 50 differential methylation loci in samples. For each probe, the scale corresponds to the beta value which was centered and scaled. The dendrogram indicates the similarity of the samples. Mothers and children, and site are color-coded. B Enriched pathways of genes with the top 200 differential methylation loci. Dot sizes were determined by the number of detected genes involved in each pathway. For each gene, the connection to its involved pathways is illustrated in (C)

Among the 500 genes which harbors the top differentially methylated loci between Butte and other studied sites, 16 genes showed association with ASD in GWAS studies, and more associated genes (n = 29) were identified by integrating multiple lines of evidence (Fig. 4). Three genes (CACNA1C, RIMS1, and CSMD1) were found by all these three sources (this study, GWAS studies and multi-evidence integration), which imply their important roles in mediating the metal toxicity and neurological abnormalities and warrant further experimental and large-scale epidemiological studies. Indeed, studies have shown that exposure to lead (Ferreira de Mattos et al., 2017) or cadmium (Phuapittayalert et al., 2016) affect CACNA1C expression and Cav1.2 ion channels.

Fig. 4
figure 4

Overlap of detections between differential methylated genes in this study and ASD-associated genes in GWAS studies and MalaCards

We focused on specific geographic location because of our knowledge of the potential for metal exposures in the three locations. The Butte families are exposed to both historic mining sites and active open-pit mining, the Anaconda families are exposed to historic smelter tailings, and the Missoula families are exposed to neither of these specific metal sources.

Implications of findings

The anticipated variation in exposure based on geospatial factors allowed us to conduct these analyses. The residents of Anaconda and Butte have lived with the knowledge that there are harmful metals in their environment for over 30 years, and Superfund clean-up is underway. The existence of ongoing open-pit mining for Cu and Mo, located within one mile of residential Butte (Fig. 1), adds to the exposure for the families living in this area. The citizens of Butte and Anaconda have been informed about the impact on animals and waterways, but there has been limited information about human health effects. Notably, the residents have been told by the Environmental Protection Agency (EPA) that there are only three contaminants of concern—arsenic, lead, and mercury. However, it is established that other neurotoxic metals present, e.g., Cu, Mn, Zn, are associated with adverse human health outcomes.

Superfund, abandoned mining sites, and active mining are the nation’s most contaminated areas for metals. About 11 million people in the US, including three to four million children, live within 1.5 km of a federal Superfund site (EPA, 2020). There is evidence that transporting metals from Superfund sites does not reduce environmental concentrations, primarily due to associations with nanoscale mineral phases of iron oxides and mixed titanium/iron oxides (Plathe et al., 2010). Additionally, sediments are enriched above background by a factor of two to five times, even 500 km away from Superfund sites, (EPA, 2020) indicating that there is exposure to a much broader population.

The residents of Butte Montana have exposure to active copper and molybdenum mining occurring at a large open pit, located just northeast of the town. They are also exposed to the Superfund site, known as the Silver Bow Creek/Butte Area, which is proximal to the mine and town. It is unknown to what degree the mine adds to or separately contributes, to human health dangers, over and above exposures from historic mining and one of the largest Superfund sites in the US. The historically contaminated soils are in close proximity to the operations of the active Montana Resources copper and molybdenum mine in Butte. As a result, Silver Bow Creek, which flows through the town of Butte, has been shown to have contaminated local soils, sediments, ground water, and surface water with numerous metals (Plathe et al., 2010).

The Anaconda residents also live proximal to a Superfund site, in the contiguous county to the northwest of Butte. This Superfund site is a former ore processing facility/smelter. The town of Anaconda includes a number of residential neighborhoods in proximity to two large tailings ponds (the Anaconda Pond and the Opportunity Pond) and the former Anaconda smelter stack. The US Environmental Protection Agency (EPA) added the Silver Bow Creek/Butte area and the Anaconda and Opportunity areas to the National Priority List in 1983, designating them as Superfund sites. The EPA lists As, Cd, Pb, and Hg as metal contaminants of concern for residents living near these Superfund sites (USEPA. Silver Bow Creek, Butte Area Butte & MT Superfund Site., 2020). Figure 1 shows the two Superfund sites and the active mine sites.

Limitations of the study

This pilot study has substantial limitations, most importantly the small sample size for mothers and children that limits the power of our findings. In addition, we were not able to identify risk factors that could have some explanatory value, such as whether the mother or other household member worked in mining, the exact locations of the residences in relation to prevailing winds, and distance to the Superfund sites and open-pit mine. As a result of the small sample size and the inability to geocode the exact mother–child residential locations, the methylation sites that distinguished the Butte families from the others need to be confirmed by a substantially larger study.

Conclusion

We hope this pilot study will lay a foundation for more scientific inquiry about the biologic changes and health consequences of environmental exposure to heavy metals. This information is needed to understand the impact of metal exposures on biomarkers and the resulting consequences on the health of people living in proximity to both historic and active mining.