Abstract
Human NAT1 gene for N-acetyltransferase 1 modulates xenobiotic metabolism of arylamine drugs and mutagens. Beyond pharmacogenetics, NAT1 is also relevant to breast cancer. The population history of human NAT1 suggests evolution through purifying selection, but it is unclear whether this pattern is evident in other primate lineages where population studies are scarce. We report NAT1 polymorphism in 25 rhesus macaques (Macaca mulatta) and describe the haplotypic and functional characteristics of 12 variants. Seven non-synonymous single nucleotide variations (SNVs) were identified and experimentally demonstrated to compromise enzyme function, mainly through destabilization of NAT1 protein and consequent activity loss. One non-synonymous SNV (c.560G > A, p.Arg187Gln) has also been characterized for human NAT1 with similar effects. Population haplotypic and functional variability of rhesus NAT1 was considerably higher than previously reported for its human orthologue, suggesting different environmental pressures in the two lineages. Known functional elements downstream of human NAT1 were also differentiated in rhesus macaque and other primates. Xenobiotic metabolizing enzymes play roles beyond mere protection from exogenous chemicals. Therefore, any link to disease, particularly carcinogenesis, may be via modulation of xenobiotic mutagenicity or more subtle interference with cell physiology. Comparative analyses add the evolutionary dimension to such investigations, assessing functional conservation/diversification among primates.
Similar content being viewed by others
Introduction
The NAT1 gene (HGNC ID: 7645) for human arylamine N-acetyltransferase 1 (UniProt ID: P18440; E.C. 2.3.1.5) was originally characterized for its role in the metabolism of xenobiotics, particularly aromatic amine drugs and mutagens. Apart from its pharmacogenetic significance, NAT1 has more recently been implicated in cell systems relevant to carcinogenesis1,2,3. The human NAT1 gene, and its mouse orthologue Nat2, begin their expression during preimplantation development4,5,6 and remain transcriptionally active in most embryonic and adult tissues4,6,7,8,9,10,11. Both orthologues are controlled by a conserved Sp1-type housekeeping promoter, adjacent to an upstream non-coding exon11,12,13, and are potentially subject to epigenetic regulation14,15. Human NAT1 is also controlled by a second, more tissue-specific promoter16, and regulation has been shown to implicate steroid hormones17,18,19,20, micro-RNAs21,22, protein polyubiquitinylation23,24, as well as acetylation of the enzyme and allosteric modulation by adenosine triphosphate25,26. Human NAT1 and rodent NAT2 proteins have been demonstrated to acetylate the same arylamine substrates27, including p-aminobenzoylglutamate (pABGlu), a catabolic derivative of folate4,27,28,29,30. Nat2 knockout mice are apparently healthy, but unable to acetylate pABGlu31,32. By contrast, transgenic overexpression of human NAT1 in mice is not tolerated33. Recent studies propose that human NAT1 and rodent NAT2 may catalyze the folate-dependent hydrolysis of acetyl coenzyme A (CoA) in the absence of arylamine34,35, but a mechanism is not yet firmly established36. Other possible roles of human NAT1, currently under investigation, are relevant to fatty acid biosynthesis37, methionine salvage38, regulation of reactive oxygen species39 and mitochondrial bioenergetics40.
A strong association is emerging between human NAT1 overexpression and cancer, particularly estrogen/progesterone receptor positive tumours of both female and male breast (see earlier reviews2,3,41 and more recent literature42,43,44,45,46,47,48). The mechanism underlying the function and regulation of NAT1 in carcinogenesis is extensively investigated, as the gene is considered to be of prognostic and potentially therapeutic relevance2,21,22,49,50,51,52,53,54. Synthetic NAT1-selective inhibitors are also available55,56,57.
The evolutionary history of human NAT1 gene indicates a dominant role of purifying selection, consistent with the presumed endogenous function of NAT1 enzyme58,59. However, a recent study suggests that the evolutionary pattern may be different for the NAT1 orthologues of other primate species60. We have recently analysed the NAT1 sequences of 35 non-human primate species and performed recombinant expression and comparative enzymatic analysis for ten of them, elucidating their genetic evolution and demonstrating functional variability across different phylogenetic lineages61,62,63. Here, we describe a series of NAT1 variants found in a small population sample of rhesus macaques (species Macaca mulatta; taxon ID: 9544; UniProt taxon mnemonic: MACMU) and investigate the functional effects of non-synonymous single nucleotide variations (SNVs), drawing parallels with the effects of characterized NAT1 SNVs in humans.
Results
Variability in the rhesus macaque (MACMU)NAT1 gene
The intronless coding region of (MACMU)NAT1 gene was sequenced for 25 rhesus macaque individuals. A total of 12 polymorphic sites were identified, of which 5 represented synonymous and 7 non-synonymous nucleotide substitutions relative to reference allele (MACMU)NAT1*1 (Nucleotide ID: KU640969.1)61. The determined genotypes (Supplementary Table S1) corresponded to 12 haplotypes (Table 1), which were assigned allelic symbols (MACMU)NAT1*2 – (MACMU)NAT1*12 (Nucleotide IDs: KU640986 – KU640996). Allele (MACMU)NAT1*1 showed highest identity (96.45%) to (HUMAN)NAT1*4 reference allele (Nucleotide ID: AJ307007.1) and was also most frequently encountered in the rhesus macaque sample screened, representing 34% of the 50 haplotypes analysed in total. Other frequently found haplotypes were (MACMU)NAT1*3 (18%) and (MACMU)NAT1*5 (20%), while the remaining haplotypes were less common (2–6%). With the exception of (MACMU)NAT1*3 and *8, all haplotypes carried more than one nucleotide variation compared to (MACMU)NAT1*1. Haplotypes (MACMU)NAT1*3, *5, *8, *9 and *12 contained only synonymous substitutions. Haplotypes (MACMU)NAT1*2, *4, *6, *7 and *11 contained one non-synonymous, together with one or two synonymous substitutions. (MACMU)NAT1*10 allele carried three non-synonymous and one synonymous substitution (Table 1).
Of the SNVs found (Tables 1 and S1), synonymous substitution c.321C > T was most frequent (52.5%) and altered the codon for key catalytic residue p.His10764, which however remained conserved. Synonymous SNV c.540C > T was also common (20.3%), while other synonymous SNVs showed lower frequencies (up to 5%). Of the non-synonymous SNVs (Tables 1 and S1), c.152G > C (p.Gly51Ala), c.244A > G (p.Met82Val), c.267G > C (p.Leu89Phe), c.463G > C (p.Glu155Gln) and c.523T > C (p.Phe175Leu) were encountered only once in the rhesus macaque sample. Non-synonymous substitution of amino acid residue p.Gly51 (c.152G > T, p.Gly51Val; rs72466457:G > T) has been reported previously for the *6 M allele of (HUMAN)NAT2 gene (the 87.4% conserved NAT1 paralogue in human) and has been predicted to potentially affect substrate binding65. Substitution c.343G > T (p.Asp115Tyr) was found in two heterozygous rhesus macaque individuals carrying haplotypes NAT1*10 and *11. The most frequent (5%) non-synonymous SNV in the rhesus macaque sample was c.560G > A (p.Arg187Gln), found in three heterozygous individuals (Tables 1 and S1). This exact SNV (rs4986782:G > A) has been reported for (HUMAN)NAT1*14, with 1000 Genomes Project phase 3 global minor allele frequency (MAF) of 0.6%, and has important functional consequences. The same codon is also subject to another mutation (c.559C > T, p.Arg187Ter; rs5030839:C > T with global MAF of 0.3%) in the (HUMAN)NAT1*15 allele, causing premature termination of the polypeptide chain66,67,68,69,70,71.
Although non-synonymous SNVs have been reported in the literature for (HUMAN)NAT1 gene (http://nat.mbg.duth.gr/), the most common polymorphic alleles in human populations are characterized by functional variability in the 3′-untranslated region (3′-UTR). Allele (HUMAN)NAT1*10 is defined by rs1057126:T > A (c.*215T > A) which affects one active polyadenylation (polyA) signal of the gene. Allele (HUMAN)NAT1*11 is defined by a shortage of three TAA repeats in a microsatellite element spanning the same polyA signal. This particular region is highly variable in human populations (http://nat.mbg.duth.gr/), and has been demonstrated to modulate polyA usage of the gene and mRNA translation efficiency72. We show here that the rhesus macaque and other non-human primates (both Old World monkeys and apes) also carry a shorter microsatellite stretch at this position, suggesting an expansion of the ancestral wild-type allele specific to the human evolutionary lineage (Figs 1 and S1). Moreover, the genomic distance of this particular microsatellite/polyA region from the stop codon of NAT1 gene is about 300 bp longer in the rhesus macaque and other Old World monkeys, compared with human and other apes. This difference in length is due to an Alu/SINE element which is present immediately downstream of NAT1 gene in Old World monkeys, but not in apes (Figs 1 and S2). These notable differences between human and other primates suggest substantial evolutionary and functional divergence of the NAT1 3′-UTR that merits further investigation, especially given the phenotypic significance attributed to (HUMAN)NAT1*10 and *11 alleles and their investigated association with disease2,72.
Effects of polymorphisms on rhesus macaque (MACMU)NAT1 protein expression
A clone carrying the (MACMU)NAT1*1 reference allele in recombinant expression vector61 was used to generate a series of mutagenised constructs, each carrying one of the seven non-synonymous SNVs found in the rhesus macaque sample screened. One additional construct was engineered to carry both c.267G > C (p.Leu89Phe) and c.343G > T (p.Asp115Tyr), as those two SNVs were found together on the same haplotype (Supplementary Table S2).
As previously observed61, recombinant expression/purification of (MACMU)NAT1_1 reference protein was always efficient, generating preparations of high yield and purity (Fig. 2). Variant (MACMU)NAT1_p.Arg187Gln also generated sufficient yields of highly pure protein, although at considerably lower levels (by 30% on average) compared with (MACMU)NAT1_1 (Fig. 2). Recombinant expression of all other variants generated poor quality preparations that made recovery of pure proteins virtually impossible, although the expected NAT1 protein band was always visible on gels (Fig. 2). The recombinant preparations of (MACMU)NAT1_1 and (MACMU)NAT1_p.Arg187Gln were in fact differentiated from those of all other variants, throughout the entire expression/purification process (Supplementary Fig. S3). Those results were reproducible upon expression of recombinant (MACMU)NAT1 protein variants multiple (up to five) times and by different (up to three) persons, over an extended period of time. Procedures always incorporated (MACMU)NAT1_1, as reference protein, and followed a well-standardized protocol in our laboratory61,73,74 that has been successfully applied to numerous recombinant NAT homologues across a wide taxonomic spectrum of organisms, including the rhesus macaque and other primates. Our observations were consistent with protein aggregation involving misfolded recombinant (MACMU)NAT1 unstable variants, both during expression (in the form of bacterial inclusion bodies), as well as in cell lysates and during purification (in the form of soluble and insoluble complexes with native Escherichia coli proteins)75,76,77. Although a variety of approaches are available for overcoming recombinant protein aggregation75, optimizing the expression/purification conditions for individual (MACMU)NAT1 variants was beyond the scope of this study, which focused on comparison of SNV effects using a standard procedure widely employed by NAT researchers performing similar work27,35,64,78,79,80,81,82,83,84,85,86,87. As expression of mammalian proteins in E. coli could potentially confound the observed functional impact of different SNVs, future investigations of the characterized (MACMU)NAT1 variants could additionally employ eukaryotic cells. However, previous expression of (HUMAN)NAT1 and its variants in standard eukaryotic systems has been demonstrated to generate recombinant protein in very low yield that is barely detectable70,71.
To verify the presence of recombinant (MACMU)NAT1 protein in preparations of expressed variants, a series of immunoblot analyses were performed, using antibodies specific for the N-terminal, middle or C-terminal part of the polypeptide chain. All preparations were demonstrated to contain recombinant (MACMU)NAT1 protein, both in the soluble fraction of each cell lysate and in the products of subsequent chromatographic purifications (Fig. 3). However, compared with (MACMU)NAT1_1 reference protein, the amount of detected variants was substantially decreased (typically by more than 50%) in recombinant preparations, and this result was consistent across different experiments and with all three antibodies. As expected, (MACMU)NAT1_p.Arg187Gln appeared to be only moderately affected, relative to (MACMU)NAT1_1 reference protein, demonstrating up to 45% reduction in the amount of recombinant protein detected in purified chromatographic fractions. By contrast, the amount of immunodetected (MACMU)NAT1_p.Arg187Gln protein was similar to (MACMU)NAT1_1 reference protein in soluble fractions generated immediately after cell lysis (Fig. 3).
The combination of two amino acid substitutions in the same polypeptide chain had an additive, essentially detrimental, impact on the recombinant expression of (MACMU)NAT1_p.Leu89Phe/p.Asp115Tyr variant. The double mutant was barely detectable by immunoblot analyses, even in freshly prepared cell lysates following recombinant expression (Supplementary Fig. S4).
Overall, these results indicate that (MACMU)NAT1 variants are inherently unstable, when expressed in E. coli, suggesting that different non-synonymous substitutions may compromise protein folding and solubility. This effect is relatively moderate in the case of (MACMU)NAT1_p.Arg187Gln variant and most pronounced in the case of double mutant (MACMU)NAT1_p.Leu89Phe/p.Asp115Tyr. Bacteria possess complex systems to assess correct folding of expressed proteins and can reject misfolded proteins through proteolysis or the formation of insoluble aggregates76. Although application of more sophisticated methods75 for detection and monitoring of protein aggregate formation may enable more accurate assessments in the future, the present work demonstrates that the likely impact of characterized non-synonymous SNVs is to compromise the inherent stability of (MACMU)NAT1 protein, as summarized in Table 2. This is in line with previous investigations of (HUMAN)NAT1, demonstrating a similar effect on protein stability for most characterized SNVs, irrespectively of whether the expression was carried out in prokaryotic (E. coli) or eukaryotic (yeast or mammalian) cells23,70,71,88.
Effects of polymorphisms on rhesus macaque (MACMU)NAT1 enzyme activity
The recombinant preparations of (MACMU)NAT1_1 reference protein and its eight variants were assayed for enzymatic activity, using the arylamines p-aminobenzoic acid (PABA), 5-aminosalicylic acid (5AS), p-anisidine (PANS) and sulphamethazine (SMZ) as acetyl-group acceptor substrates. SMZ is a NAT2-selective substrate that was expected to provide only marginal activity with (MACMU)NAT1 protein, unless any of the assayed variants caused a switch in substrate specificity of the isoenzyme, as has been reported previously for the p.Val231Ile polymorphism of (MACMU)NAT274,89. No such effect was demonstrated for any of the (MACMU)NAT1 polymorphisms tested in the present study (Fig. 4).
Because of the apparent instability of (MACMU)NAT1 variants, and the consequently unavoidable variability in the amount and purity of recovered recombinant proteins across different preparations (Table 2), a 50% threshold was used to interpret the effects of polymorphisms on enzyme activity, relative to (MACMU)NAT1_1 reference protein which was included in each set of assays. Enzymatic activity was detected for all variants, when PABA, 5AS or PANS were used as substrates, but was decreased relative to (MACMU)NAT1_1 (Fig. 4). The smallest change was observed for variant (MACMU)NAT1_p.Glu155Gln, which was >90% active relative to (MACMU)NAT1_1, when PABA and 5AS were used. However, a 55% decrease in activity was observed with PANS, suggesting that the effects of the polymorphism could be substrate-specific; PABA and 5AS are more NAT1-selective, compared with PANS which is a substrate for both NAT1 and NAT2 isoenzymes of primates61. Most severely compromised was the double mutant (MACMU)NAT1_p.Leu89Phe/p.Asp115Tyr, demonstrating some activity only with PABA. Variants (MACMU)NAT1_p.Gly51Ala and (MACMU)NAT1_p.Met82Val demonstrated moderate (<50%) decrease in enzymatic activity relative to (MACMU)NAT1_1, while the decrease was more substantial (>50%) in the case of variants (MACMU)NAT1_p.Leu89Phe, (MACMU)NAT1_p.Asp115Tyr, (MACMU)NAT1_p.Phe175Leu and (MACMU)NAT1_p.Arg187Gln, as summarized in Table 2.
Initial attempts to assess the impact of substitutions on thermal stability of (MACMU)NAT1 protein, through measurement of enzymatic activity after exposure of recombinant preparations to temperatures from 35 to 45 °C, were complicated by the fact that most variants demonstrated very low enzymatic activities even before thermal stress. In those cases where the results were assessed as reliable, thermal stability of variants appeared to decline at different rates (Fig. 5). Loss of activity was marginal for (MACMU)NAT1_1 reference protein at temperatures close to its reported denaturation midpoint temperature (36.5 ± 1.2 °C)61, which is also similar to the body temperature (37.3 °C) of the animal (Animal Ageing and Longevity Database, http://genomics.senescence.info/species/)90, but became more evident above 39.5 °C. Variant (MACMU)NAT1_p.Leu89Phe appeared rather resilient to thermal stress, losing activity at similar rate as the reference protein, despite its demonstrated instability during expression/purification. Other variants were more sensitive to the effects of thermal stress (Fig. 5). To overcome the aforementioned limitations, we also tried differential scanning fluorimetry, a well-standardized method in our laboratory61,73,74, to assess thermal stability of (MACMU)NAT1 variants. However, with the exception of the reference protein, this approach was hampered by the insufficient purity of recombinant preparations. In the future, it may be useful to try alternative thermal denaturation assays, described in the literature91.
Combined, the observations of this and the previous section indicate inherent instability of (MACMU)NAT1 variants as the likely cause for reduced enzymatic activity. However, an effect on catalytic function may not be ruled out, particularly for variants with polymorphisms p.Asp115Tyr and p.Phe175Leu that demonstrated a decrease in enzymatic activity that was considerably more substantial relative to the decrease in the amount of recovered recombinant protein (Table 2). Impairement of catalytic function is also likely to be the main cause for the observed decrease in the enzymatic activity of variant (MACMU)NAT1_p.Arg187Gln, which appeared relatively stable compared with other variants in this study. Polymorphism p.Arg187Gln has also been demonstrated to cause decrease in activity of (HUMAN)NAT1 homologue, both when using human biological material and after recombinant expression in bacterial, yeast or mammalian cell systems68,69,70,71,92,93,94. As in the present study for (MACMU)NAT1, the effect of p.Arg187Gln on (HUMAN)NAT1 appeared to be in the form of moderate decrease in protein stability, combined with impairement of enzymatic function68,70,71. More sophisticated kinetic analyses with a range of substrates may help elucidate the exact mechanism underlying functional impairement of each characterized (MACMU)NAT1 variant, at least in those cases where stability and activity are sufficient to allow reliable enzymatic assays.
Structural effects of rhesus macaque (MACMU)NAT1 polymorphisms
Structural alignment and homology modelling of (MACMU)NAT1 protein against the published structure of (HUMAN)NAT164 was used to assess the potential functional effects of rhesus macaque polymorphisms at the molecular level (Figs 6 and S5). These predictions were further compared with the output of PolyPhen-2, a computational algorithm employing comparative sequence alignments and structural analyses to determine the impact of amino acid substitutions on proteins (Table 2).
Residue p.Gly51 is located within the α-helical domain I on the surface of the protein molecule, right before the start of α3-helix and behind p.Cys68 of the catalytic triad. No direct interactions of p.Gly51 are evident and its substitution to Ala is conservative (Figs 6a and S5). Therefore, the postulated effect is benign, which is in line with the prediction of PolyPhen-2 assigning a low impact probability score to this particular polymorphism (Table 2). Despite those computational predictions, the experimental results show p.Gly51Ala to have a moderate effect on (MACMU)NAT1 enzyme activity, presumably by compromising its intrinsic stability. The conformational flexibility of glycine makes the amino acid important for maintaining certain structural arrangements (e.g. tight turns), so its substitution could potentially affect protein stability95.
A similar impact can be attributed to polymorphism p.Met82Val which also affects a residue of domain I and is located on the surface of the protein molecule. Amino acid p.Met82 is the last amino acid on α4-helix, which has catalytic p.Cys68 as its first residue (Figs 6b and S5). Substitution from Met to Val is conservative and predicted as benign by PolyPhen-2 (Table 2). The experimental results, however, indicate a moderate effect on enzymatic activity, potentially caused by reduction in intrinsic stability of the protein. Valine has certain restrictions in adopting the α-helical conformation and is hydrophobic, usually found in the interior of protein molecules95. Such factors could contribute to the observed instability of variant (MACMU)NAT1_p.Met82Val.
Residue p.Leu89 is located inside the enzymatic core, on β2-strand of β-barrel domain II of the protein molecule64. Substitution to Phe is predicted by PolyPhen-2 as probably damaging, with the highest score (Table 2). The same tool identified p.Leu89 as essentially conserved across NAT1 homologues in the UniProt database. The residue is located directly behind β4-strand carrying p.His107 of the catalytic triad and its adjacent p.Ile106 that is important for binding of NAT1-selective aminobenzyl substrates, like p-aminosalicylic acid and PABA64. Structural modelling (Fig. 6c) supports that replacement of p.Leu89 by the much bulkier Phe causes a direct steric clash with β4-strand, leading to molecule destabilization and impairement of its catalytic function, consistent with the experimental observations of the previous sections.
Substitution of p.Asp115 by the bulkier, uncharged and more hydrophobic Tyr represents a non-conservative change that was predicted by PolyPhen-2 as possibly damaging to (MACMU)NAT1 (Table 2). Amino acid p.Asp115 is located on the surface of the protein molecule and is part of domain II (Figs 6d and S5). This residue is positioned right at the turn between anti-parallel strands β4 and β5, which determine the positioning of p.His107 and p.Asp122 in the catalytic triad. The conformation of the two β-strands is stabilized via bonding between p.Asp115 and p.Arg117. As described in the previous sections, polymorphism p.Asp115Tyr appeared to compromise the protein molecule in terms of stability and possibly catalytic function. Moreover, the combination of this polymorphism with p.Leu89Phe on the same (MACMU)NAT1 variant was demonstrated to be effectively detrimental. Apart from those two polymorphisms, haplotype (MACMU)NAT1*10 also contained a third SNV causing substitution p.Glu155Gln. However, given the severe impairement of double mutant (MACMU)NAT1_p.Leu89Phe/p.Asp115Tyr, the potential additive effect of the more benign p.Glu155Gln substitution was not examined.
Residue p.Glu155 is the last amino acid on β8-strand of domain II, which is connected to its anti-parallel β9-strand via a sharp turn. The location is away from the catalytic pocket and on the surface of the enzyme molecule (Figs 6e and S5). Its substitution to Gln is conservative and has no apparent consequences. PolyPhen-2 also predicted this change as benign, assigning a very low score (Table 2). Although our experiments detected no substantial impact on enzymatic activity, an apparent decrease in stability of variant (MACMU)NAT1_p.Glu155Gln was nevertheless observed. The underlying cause of this effect is unclear, although p.Glu155 could stabilize the conformation of anti-parallel strands β8-β9, which are part of a β-sheet forming a “wall” on one side of the active site pocket.
Residue p.Phe175 is one of 17 amino acids forming the so-called “eukaryotic loop” in domain II of mammalian NATs. This component is thought to stabilize an important β-sheet of domain III, comprising four anti-parallel β-strands that form a “wall” at the bottom of the active site pocket64. This stabilization is conferred via parallel alignment of the loop with the fourth strand (β15) of the β-sheet and, specifically, via “stacking” interaction of the aromatic rings of p.Phe175 on the loop with p.Phe255 on the β15-strand (Fig. 6f). The structural model predicts replacement of p.Phe175 by Leu to be damaging to this interaction and, consequently, to the protein molecule itself, both in terms of stability and potentially enzymatic function. PolyPhen-2 identified p.Phe175 as very conserved across NAT homologues and predicted its substitution as possibly damaging (Table 2). The in silico results are consistent with the experimental observations of the previous sections.
The final polymorphism in this study, p.Arg187Gln, has been structurally modelled before, as it is also found in (HUMAN)NAT164,88,96,97. As shown (Figs 6g and S5), residue p.Arg187 is located on β11-strand of domain II and forms hydrogen bond interactions with p.Lys188 and p.Glu182. The latter amino acid is located on the aforementioned “eukaryotic loop” of the enzyme and interacts via hydrogen bonding with two important residues, p.Tyr129 and p.Thr289. Residue p.Tyr129 is key for NAT1 acceptor substrate selectivity80 and also interacts with p.Thr289, which is the residue located just before the carboxyl terminus of the 290 amino acid polypeptide chain. This network of interactions maintains the shape of the active site pocket and of the “eukaryotic loop”, while also stabilizing the carboxyl terminus in a position deep within the catalytic core of the enzyme that enables contact with the donor substrate64. Although the rather conservative substitution of p.Arg187 to Gln could potentially maintain some of the required hydrogen bond interactions, instability in the polypeptide chain folding would be unavoidable, in line with the moderate loss of protein observed during recombinant expression/purification. Moreover, the expected change in the conformation of the active site pocket would explain the substantial decrease in enzymatic activity observed for variant p.Arg187Gln both in the rhesus macaque and human. Contrary to the solid experimental evidence, PolyPhen-2 predicted this polymorphism as benign, although the score was close to 0.5 (Table 2). Despite their value in guiding the assessment of SNV effects on protein function, computational predictions must always be interpreted with caution, as they may not be fully aligned with experimental evidence.
Discussion
The evolution of xenobiotic metabolizing enzymes is shaped by the ever-changing chemical environments organisms need to adapt to in order to survive. Genetic polymorphisms can be favourable, in that respect, as they confer a range of metabolic activities towards a plethora of potentially harmful xenobiotic substances. Moreover, population frequencies of high- vs. low-activity enzyme variants may shift to one or the other direction, depending on the prevailing xenobiotic challenges in a specific environment. In humans, this process is driven not only by natural selective pressures, but also by the lifestyle choices of our species. The NAT enzymes metabolize toxic aromatic amines found in cooked food and other byproducts of everyday human activity since prehistoric times. Variability in NAT genes has thus been studied in the context of understanding how subsistence patterns may have affected the genetic makeup of human populations58,59,98,99,100,101,102,103. Such knowledge is useful for elucidating how even apparently modest differentiations in gene-environment interactions may influence disease susceptibility or drug response.
Comparison of humans with other primates can further discriminate between variability that is ancestral or specific to our own species. Although previous phylogenetic analyses have demonstrated a dynamic pattern consistent with adaptive evolution of NAT genes in primates62,63, those studies have been limited by their entirely in silico methodology and the fact that they have examined NAT variability only between species. A new study, investigating population variability at NAT1, NAT2 and NATP loci across different hominid species, demonstrated opposite diversity patterns for the chimpanzee (Pan troglodytes) and bonobo (P. paniscus) compared with human. Genetic diversity was higher for NAT1 in the two Pan species and lower in human, with the opposite pattern observed for the NAT2 gene60. Here, we investigated NAT1 variability in a population sample of the Old World monkey rhesus macaque, also characterizing the possible functional consequences of detected variants. Our results are in accordance with the aforementioned study60, indicating high haplotypic diversity (Hd = 0.817) of (MACMU)NAT1 coding region for the specific rhesus macaque population studied, and with 16% of the haplotypes carrying at least one non-synonymous SNV with some functional impact. It appears that, in the rhesus macaque, polymorphic NAT1 alleles may modulate acetylation of arylamine xenobiotics more effectively compared with human, where population frequencies of non-synonymous SNVs in NAT1 coding region are extremely low (typically <1‰, with few exceptions)104. The low haplotypic diversity (Hd = 0.073) reported for (HUMAN)NAT1 coding region is consistent with the action of purifying selection predicted for the gene58. As mentioned above, (HUMAN)NAT1 is considered to have a more fundamental role in cells, unlike its paralogous (HUMAN)NAT2 which is higly polymorphic and exhibits the functional characteristics of a typical xenobiotic metabolizing enzyme.
Although (MACMU)NAT1 is phylogenetically orthologous to (HUMAN)NAT162, its higher degree of variability could be the outcome of environmental pressures differentiating the evolutionary lineages of the two species. However, despite this high level of diversity, all heterozygous (MACMU)NAT1 genotypes were found to contain at least one allele with no predicted phenotypic consequences. Specifically, of the 25 genotypes determined, seventeen contained only functional alleles (i.e. combinations of *1, *3, *5, *8, *9 and *12 alleles encoding for the reference amino acid sequence), while the remaining eight genotypes combined one functional with one compromised (*2, *4, *6, *7, *10, *11) allele. Genotypes consisting of two compromised alleles were not found (Supplementary Table S1). It is possible that rhesus macaque NAT1 variability may be maintained, as far as the corresponding enzymatic function is not entirely abolished in the individual. This could preserve the same endogenous function attributed to (HUMAN)NAT1, although the diversity pattern appears to differ in the evolutionary lineages of the two species. Although such observations are useful, in view of current hypotheses relating (HUMAN)NAT1 with functions beyond xenobiotic metabolism, other factors (potentially related to the small size or the particular makeup of the rhesus macaque population from which our sample was drawn) could also account for the high variability observed.
As described above, (HUMAN)NAT1 is also differentiated from other primate orthologues at the 3′-UTR of the gene. The reference allelic sequence downstream of the coding region bears a microsatellite repeat element that encompasses an active polyA signal of the gene. Variability in this element is common (~40% global frequency) in human populations, and is encountered both in the form of SNV, e.g. rs1057126:T > A in (HUMAN)NAT1*10, and in the form of simple sequence length polymorphism, as in (HUMAN)NAT1*11 and other alleles. Although those variations appear to confer some functional effects, their association with disease susceptibility has been controversial2,72. It would be interesting to examine if the functional significance attributed to the 3′-UTR of (HUMAN)NAT1 gene is corroborated by studies with NAT1 orthologues of other primates, given the substantial sequence and structural variability we report here.
Despite those differences, certain NAT1 variants are found in both the rhesus macaque and human. As discussed, NAT1 polymorphism c.560G > A (p.Arg187Gln) was the most prevalent non-synonymous SNV in our rhesus macaque sample and is relatively common in human populations too. The functional impact of this polymorphism was similar for the two species, compromising the activity of NAT1 isoenzyme. Other rhesus macaque SNVs of this study have also been reported for (HUMAN)NAT1 gene by large scale genome-wide population studies, like the 1000 Genomes Project, including synonymous SNVs at positions c.15 (rs747456038:A > G) and c.321 (rs755240678:C > T), as well as the non-synonymous SNV at position c.523 (rs762246777:T > C; p.Phe175Leu). Moreover, similarly to the characterized (MACMU)NAT1 variants, impaired intrinsic stability appears to be the main defect of low-activity (HUMAN)NAT1 variants, such as NAT1*17 (c.190C > T, p.Arg64Trp; rs56379106:C > T) and *22 (c.752A > T, p.Asp251Val; rs56172717:A > T)23,70,71,88,93.
Although pinpointing genetic aberrations that are strongly associated with disease is relatively straightforward, it is more difficult to unravel subtle genomic differences that may influence adaptability to exogenous challenges and shape the landscape of factors contributing to complex disorders. Xenobiotic metabolizing enzymes play diverse roles in cells that may not be limited merely to protection of organisms from exogenous substances. Therefore, any link to disease, particularly carcinogenesis, may be either via modulation of the mutagenic potential of xenobiotics, or via a less obvious interference with cell physiology, as has long been speculated for human NAT1 and is supported by recent studies mentioned above. Comparative genetic analyses between human and other primates add the evolutionary dimension to such investigations, particularly when combined with experimental insights into functions that may have been either conserved or diversified in the phylogenetic lineages separating our species from its closest relatives. To that end, it would be useful to conduct more comprehensive interdisciplinary investigations into the molecular, structural and enzymological aspects of variants reported for primate NAT homologues.
Methods
Amplification and sequencing of (MACMU)NAT1 gene
For this in vitro study, the 870 bp intronless coding region of (MACMU)NAT1 gene was amplified from genomic DNA samples of a rhesus macaque population described previously105,106. PCR was performed with flanking primers 1 A (5′-AGCCATAATTAGCCTACTC-3′) and 1AR (5′-GTACAGAAGATACATGATAGG-3′), and the reactions contained 1 u GoTaq DNA polymerase (Promega) in 1x reaction buffer with 1.5 mM MgCl2, 0.1 mM each dNTP, 0.1 μM each primer and 25 ng genomic DNA as template. Thermal cycling conditions were as follows: 95 °C (10 min), followed by 38 cycles of 94 °C (1 min), 53 °C (50 s) and 72 °C (2 min), with final extension at 72 °C (10 min). Specific PCR amplification was verified by agarose gel electrophoresis and the products (1350 bp) were purified and sequenced from both directions (Millegen, Labège, France).
Generation of polymorphic (MACMU)NAT1 constructs
To date, the NAT1 homologues of three macaque species (M. mulatta, M. sylvanus and M. fascicularis) have been cloned and characterized61,107, and those encode identical polypeptide chains. The (MACMU)NAT1 reference sequence (Nucleotide ID: KU640969.1) of the rhesus macaque was already available61 in pET28b(+) expression vector (Novagen) for the purposes of this study. Incorporation of each non-synonymous SNV found in the various (MACMU)NAT1 haplotypes was performed by site-directed mutagenesis using the QuikChange II kit (Agilent), as previously described74. The mutagenised constructs were verified by sequencing of the insert (GATC Biotech, Konstanz, Germany).
Recombinant protein expression and purification
Expression of polyhistidine-tagged (MACMU)NAT1 protein variants was carried out in 200 ml cultures of E. coli BL21(DE3)pLysS cells as previously described61,73. The soluble fraction of bacterial lysates was recovered and immediately subjected to purification by affinity chromatography through Nickel-charged columns (Qiagen). Recombinant proteins were eluted with increasing concentration of imidazol and analysed by quantification (280 nm) on a Nanodrop 2000 Spectrophotometer (Thermo Scientific). Protein separation was carried out by sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) and gels were photographed with BioRad ChemiDoc XRS + Imaging System. All handling of proteins took place on ice or at 4 °C, and subsequent experiments were carried out within one day from protein production (while maintaining on ice in a cold chamber), to avoid freeze-thawing.
Immunoblot analysis
Immunoblot detection was performed according to standard practices, using the BioRad Mini-Protean Tetra Cell System and Criterion Blotter. Gels were blotted onto Immuno-Blot PVDF membrane for antibody binding and subsequent chemiluminescence detection with Immun-Star Western kit (both products from BioRad). The antibodies used were: (a) commercial horseradish peroxidase (HRP) conjugated monoclonal antibody A7058 (Sigma-Aldrich) against the polyhistidine tag of recombinant proteins, diluted 1:12,000; (b) commercial non-selective NAT rabbit polyclonal antibody ARP44197 (kindly provided by Aviva Systems Biology, San Diego, CA, USA), diluted 1:2000; c) NAT1-selective rabbit polyclonal antiserum #183, raised against the bovine serum albumin (BSA) conjugated C-terminal peptide of human NAT1 (kindly provided by Professor Edith Sim, University of Oxford, UK), diluted 1:4000. The binding of antibodies ARP44197 and #183 was detected with HRP-conjugated monoclonal anti-rabbit IgG secondary antibody A1949 (Sigma-Aldrich), diluted 1:10,000. Quantification of affinity chromatography purified proteins (100 mM imidazol fractions) was carried out by spectrophotometry at 280 nm (Nanodrop). Total protein quantification in cell lysates was carried out using the Bradford reagent (Sigma-Aldrich), as instructed by the manufacturer. The ChemiDoc XRS + Imaging System was used to detect and quantify the chemiluminescent signal on immunoblot membranes.
Enzyme activity assays
The enzyme activity assays108 were performed with 1 μg of affinity chromatography purified NAT1 protein (100 mM imidazol fraction), according to well-standardized procedures in our laboratory61,73,74. The acyl-group acceptor substrates used (0.5 mM) were the NAT1-selective PABA (PubChem CID: 978) and 5AS (PubChem CID: 4075), the NAT2-selective SMZ (PubChem CID: 5327), all three pharmaceutical arylamines, and the less selective PANS (PubChem CID: 7732), a toxic arylamine. Acetyl-CoA (0.4 mM) was used as acyl-group donor substrate and the enzymatic release of CoA was monitored over time with Ellman’s reagent. Reactions were performed in duplicate.
Thermal stability assays
For comparing thermal stability of recombinant (MACMU)NAT1 variants of the rhesus macaque, protein expression/purification was carried out and the enzyme activity was recorded using PABA as substrate. The procedures were as described in the previous sections. Each recombinant protein preparation was then incubated for 10 min on a PCR DNA Engine Tetrad Gradient Cycler (MJ Research) across a temperature range spanning the denaturation midpoint temperature (Tm = 36.5 ± 1.2 °C) previously determined for (MACMU)NAT1 recombinant protein by differential scanning fluorimetry61. Following thermal stress, the activity of each recombinant variant was measured with PABA during a 5 min enzymatic assay, as described above.
Computational analyses
BioEdit version 7.1.9109 was used to inspect sequences and perform multiple alignments on ClustalW. NAT1 haplotypes were inferred from the unphased genotypes of sequenced individuals with PHASE v2.1.1 software110, using the default parameter values in the Markov chain Monte Carlo simulations. The algorithm was applied ten times with different random seed numbers to check for consistency of the results across the independent runs. We chose the results from the run displaying the best average goodness-of-fit of the estimated haplotypes to the underlying coalescent model. Primate sequences for NAT1 were retrieved from the Genome Database (https://www.ncbi.nlm.nih.gov/genome/), using BLASTn, and the search for repetitive elements downstream of NAT1 gene was carried out with RepeatMasker open version 4.0.8 (http://www.repeatmasker.org). The presentation of SNVs in the manuscript is according to the guidelines of the Human Genome Variation Society (http://www.hgvs.org/mutnomen/) and each haplotype was named following the recommendations of the NAT Gene Nomenclature Committee111 (http://nat.mbg.duth.gr/). The reference allele61 (Nucleotide ID: KU640969.1) was assigned symbol (MACMU)NAT1*1 and the remaining haplotypes were named in the order of their annotation during the study. A GenBank accession number (Nucleotide ID) was assigned per haplotype.
Structural alignment of protein sequences was carried out with T-COFFEE Expresso (http://tcoffee.crg.cat/apps/tcoffee/do:expresso) and graphically visualised with ESPript3.0 (http://espript.ibcp.fr). Modelling of (MACMU)NAT1 was performed against the known structure of human NAT1 protein (PDB ID: 2PQT)64, using the Swiss-Model online tool (http://swissmodel.expasy.org/). Graphical visualisation of models was performed on PyMOL (Schrödinger, LLC). The effect of each polymorphism on (MACMU)NAT1 protein was also predicted using PolyPhen-2 (HumDiv model, http://genetics.bwh.harvard.edu/pph2/), an online bioinformatics tool assigning impact probability scores in the scale from 0.00 (benign) to 1.00 (probably damaging). The potential consequences of amino acid substitutions were further assessed according to Russell and colleagues95 (http://www.russelllab.org/aas/).
References
Minchin, R. F. et al. Arylamine N-acetyltransferase I. Int J Biochem Cell Biol 39, 1999–2005 (2007).
Butcher, N. J. & Minchin, R. F. Arylamine N-acetyltransferase 1: a novel drug target in cancer development. Pharmacol Rev 64, 147–165, https://doi.org/10.1124/pr.110.004275 (2012).
Sim, E., Walters, K. & Boukouvala, S. Arylamine N-acetyltransferases: from structure to function. Drug Metab Rev 40, 479–510, https://doi.org/10.1080/03602530802186603 (2008).
Payton, M., Smelt, V., Upton, A. & Sim, E. A method for genotyping murine arylamine N-acetyltransferase type 2 (NAT2): a gene expressed in preimplantation embryonic stem cells encoding an enzyme acetylating the folate catabolite p-aminobenzoylglutamate. Biochem Pharmacol 58, 779–785 (1999).
Smelt, V. A. et al. Expression of arylamine N-acetyltransferases in pre-term placentas and in human pre-implantation embryos. Hum Mol Genet 9, 1101–1107 (2000).
Boukouvala, S., Price, N. & Sim, E. Identification and functional characterization of novel polymorphisms associated with the genes for arylamine N-acetyltransferases in mice. Pharmacogenetics 12, 385–394 (2002).
Pacifici, G. M., Bencini, C. & Rane, A. Acetyltransferase in humans: development and tissue distribution. Pharmacology 32, 283–291 (1986).
Chung, J. G., Levy, G. N. & Weber, W. W. Distribution of 2-aminofluorene and p-aminobenzoic acid N-acetyltransferase activity in tissues of C57BL/6J rapid and B6.A-NatS slow acetylator congenic mice. Drug Metab Dispos 21, 1057–1063 (1993).
Stanley, L. A. et al. Immunochemical detection of arylamine N-acetyltransferase during mouse embryonic development and in adult mouse brain. Teratology 58, 174–182, https://doi.org/10.1002/(sici)1096-9926(199811)58:5<174::aid-tera3>3.0.co;2-q (1998).
Boukouvala, S. & Sim, E. Structural analysis of the genes for human arylamine N-acetyltransferases and characterisation of alternative transcripts. Basic Clin Pharmacol Toxicol 96, 343–351, https://doi.org/10.1111/j.1742-7843.2005.pto_02.x (2005).
Husain, A. et al. Functional analysis of the human N-acetyltransferase 1 major promoter: quantitation of tissue expression and identification of critical sequence elements. Drug Metab Dispos 35, 1649–1656, https://doi.org/10.1124/dmd.107.016485 (2007).
Boukouvala, S., Price, N., Plant, K. E. & Sim, E. Structure and transcriptional regulation of the Nat2 gene encoding for the drug-metabolizing enzyme arylamine N-acetyltransferase type 2 in mice. Biochem J 375, 593–602, https://doi.org/10.1042/bj20030812 (2003).
Butcher, N. J., Arulpragasam, A., Goh, H. L., Davey, T. & Minchin, R. F. Genomic organization of human arylamine N-acetyltransferase Type I reveals alternative promoters that generate different 5′-UTR splice variants with altered translational activities. Biochem J 387, 119–127, https://doi.org/10.1042/bj20040903 (2005).
Wakefield, L., Boukouvala, S. & Sim, E. Characterisation of CpG methylation in the upstream control region of mouse Nat2: evidence for a gene-environment interaction in a polymorphic gene implicated in folate metabolism. Gene 452, 16–21, https://doi.org/10.1016/j.gene.2009.12.002 (2010).
Paterson, S., Sin, K. L., Tiang, J. M., Minchin, R. F. & Butcher, N. J. Histone deacetylase inhibitors increase human arylamine N-acetyltransferase-1 expression in human tumor cells. Drug Metab Dispos 39, 77–82, https://doi.org/10.1124/dmd.110.036202 (2011).
Barker, D. F. et al. Functional properties of an alternative, tissue-specific promoter for human arylamine N-acetyltransferase 1. Pharmacogenet Genomics 16, 515–525, https://doi.org/10.1097/01.fpc.0000215066.29342.26 (2006).
Butcher, N. J., Tetlow, N. L., Cheung, C., Broadhurst, G. M. & Minchin, R. F. Induction of human arylamine N-acetyltransferase type I by androgens in human prostate cancer cells. Cancer Res 67, 85–92, https://doi.org/10.1158/0008-5472.can-06-2635 (2007).
Wakefield, L. et al. Arylamine N-acetyltransferase 1 expression in breast cancer cell lines: a potential marker in estrogen receptor-positive tumors. Genes Chromosomes Cancer 47, 118–126, https://doi.org/10.1002/gcc.20512 (2008).
Butcher, N. J. & Minchin, R. F. Arylamine N-acetyltransferase 1 gene regulation by androgens requires a conserved heat shock element for heat shock factor-1. Carcinogenesis 31, 820–826, https://doi.org/10.1093/carcin/bgq042 (2010).
Bonamassa, B., Ma, Y. & Liu, D. Glucocorticoid receptor-mediated transcriptional regulation of N-acetyltransferase 1 gene through distal promoter. AAPS J 14, 581–590, https://doi.org/10.1208/s12248-012-9370-5 (2012).
Endo, Y. et al. miR-1290 and its potential targets are associated with characteristics of estrogen receptor alpha-positive breast cancer. Endocr Relat Cancer 20, 91–102, https://doi.org/10.1530/erc-12-0207 (2013).
Endo, Y. et al. Immunohistochemical determination of the miR-1290 target arylamine N-acetyltransferase 1 (NAT1) as a prognostic biomarker in breast cancer. BMC Cancer 14, 990, https://doi.org/10.1186/1471-2407-14-990 (2014).
Liu, F. et al. Arylamine N-acetyltransferase aggregation and constitutive ubiquitylation. J Mol Biol 361, 482–492, https://doi.org/10.1016/j.jmb.2006.06.029 (2006).
Liu, F., Koepp, D. M. & Walters, K. J. Artificial targeting of misfolded cytosolic proteins to endoplasmic reticulum as a mechanism for clearance. Sci Rep 5, 12088, https://doi.org/10.1038/srep12088 (2015).
Minchin, R. F. & Butcher, N. J. The role of lysine(100) in the binding of acetylcoenzyme A to human arylamine N-acetyltransferase 1: implications for other acetyltransferases. Biochem Pharmacol 94, 195–202, https://doi.org/10.1016/j.bcp.2015.01.015 (2015).
Minchin, R. F., Rosengren, K. J., Burow, R. & Butcher, N. J. Allosteric regulation of arylamine N-acetyltransferase 1 by adenosine triphosphate. Biochem Pharmacol 158, 153–160, https://doi.org/10.1016/j.bcp.2018.10.013 (2018).
Kawamura, A. et al. Eukaryotic arylamine N-acetyltransferase. Investigation of substrate specificity by high-throughput screening. Biochem Pharmacol 69, 347–359, https://doi.org/10.1016/j.bcp.2004.09.014 (2005).
Minchin, R. F. Acetylation of p-aminobenzoylglutamate, a folic acid catabolite, by recombinant human arylamine N-acetyltransferase and U937 cells. Biochem J 307(Pt 1), 1–3 (1995).
Ward, A., Summers, M. J. & Sim, E. Purification of recombinant human N-acetyltransferase type 1 (NAT1) expressed in E. coli and characterization of its potential role in folate metabolism. Biochem Pharmacol 49, 1759–1767 (1995).
Upton, A. et al. Placental arylamine N-acetyltransferase type 1: potential contributory source of urinary folate catabolite p-acetamidobenzoylglutamate during pregnancy. Biochim Biophys Acta 15, 2–3 (2000).
Cornish, V. A. et al. Generation and analysis of mice with a targeted disruption of the arylamine N-acetyltransferase type 2 gene. Pharmacogenomics J 3, 169–177, https://doi.org/10.1038/sj.tpj.6500170 (2003).
Wakefield, L., Cornish, V., Long, H., Griffiths, W. J. & Sim, E. Deletion of a xenobiotic metabolizing gene in mice affects folate metabolism. Biochem Biophys Res Commun 364, 556–560, https://doi.org/10.1016/j.bbrc.2007.10.026 (2007).
Cao, W. et al. Only low levels of exogenous N-acetyltransferase can be achieved in transgenic mice. Pharmacogenomics J 5, 255–261, https://doi.org/10.1038/sj.tpj.6500319 (2005).
Stepp, M. W., Mamaliga, G., Doll, M. A., States, J. C. & Hein, D. W. Folate-Dependent Hydrolysis of Acetyl-Coenzyme A by Recombinant Human and Rodent Arylamine N-Acetyltransferases. Biochem Biophys Rep 3, 45–50, https://doi.org/10.1016/j.bbrep.2015.07.011 (2015).
Laurieri, N. et al. From arylamine N-acetyltransferase to folate-dependent acetyl CoA hydrolase: impact of folic acid on the activity of (HUMAN)NAT1 and its homologue (MOUSE)NAT2. PLoS One 9, e96370, https://doi.org/10.1371/journal.pone.0096370 (2014).
Witham, K. L. et al. 5-methyl-tetrahydrofolate and the S-adenosylmethionine cycle in C57BL/6J mouse tissues: gender differences and effects of arylamine N-acetyltransferase-1 deletion. PLoS One 8, e77923, https://doi.org/10.1371/journal.pone.0077923 (2013).
Carlisle, S. M. et al. Untargeted polar metabolomics of transformed MDA-MB-231 breast cancer cells expressing varying levels of human arylamine N-acetyltransferase 1. Metabolomics 12, 016–1056 (2016).
Witham, K. L., Minchin, R. F. & Butcher, N. J. Role for human arylamine N-acetyltransferase 1 in the methionine salvage pathway. Biochem Pharmacol 125, 93–100, https://doi.org/10.1016/j.bcp.2016.11.015 (2017).
Wang, L., Minchin, R. F. & Butcher, N. J. Arylamine N-acetyltransferase 1 protects against reactive oxygen species during glucose starvation: Role in the regulation of p53 stability. PLoS One 13, e0193560, https://doi.org/10.1371/journal.pone.0193560 (2018).
Carlisle, S. M. et al. Knockout of human arylamine N-acetyltransferase 1 (NAT1) in MDA-MB-231 breast cancer cells leads to increased reserve capacity, maximum mitochondrial capacity, and glycolytic reserve capacity. Mol Carcinog 57, 1458–1466 (2018).
Srour, N., Reymond, M. A. & Steinert, R. Lost in translation? A systematic database of gene expression in breast cancer. Pathobiology 75, 112–118, https://doi.org/10.1159/000123849 (2008).
Andres, S. A., Brock, G. N. & Wittliff, J. L. Interrogating differences in expression of targeted gene sets to predict breast cancer outcome. BMC Cancer 13, 326, https://doi.org/10.1186/1471-2407-13-326 (2013).
Andres, S. A., Smolenkova, I. A. & Wittliff, J. L. Gender-associated expression of tumor markers and a small gene set in breast carcinoma. Breast 23, 226–233, https://doi.org/10.1016/j.breast.2014.02.007 (2014).
Andres, S. A. & Wittliff, J. L. Co-expression of genes with estrogen receptor-alpha and progesterone receptor in human breast carcinoma tissue. Horm Mol Biol Clin Investig 12, 377–390, https://doi.org/10.1515/hmbci-2012-0025 (2012).
Johansson, I., Killander, F., Linderholm, B. & Hedenfalk, I. Molecular profiling of male breast cancer - lost in translation? Int J Biochem Cell Biol 53, 526–535, https://doi.org/10.1016/j.biocel.2014.05.007 (2014).
Sieuwerts, A. M. et al. Evaluation of the ability of adjuvant tamoxifen-benefit gene signatures to predict outcome of hormone-naive estrogen receptor-positive breast cancer patients treated with tamoxifen in the advanced setting. Mol Oncol 8, 1679–1689, https://doi.org/10.1016/j.molonc.2014.07.003 (2014).
Carlisle, S. M. & Hein, D. W. Retrospective analysis of estrogen receptor 1 and Nacetyltransferase gene expression in normal breast tissue, primary breast tumors, and established breast cancer cell lines. Int J Oncol 53, 694–702, https://doi.org/10.3892/ijo.2018.4436 (2018).
Minchin, R. F. & Butcher, N. J. Trimodal distribution of arylamine N-acetyltransferase 1 mRNA in breast cancer tumors: association with overall survival and drug resistance. BMC Genomics 19, 513, https://doi.org/10.1186/s12864-018-4894-4 (2018).
Tiang, J. M., Butcher, N. J. & Minchin, R. F. Effects of human arylamine N-acetyltransferase I knockdown in triple-negative breast cancer cell lines. Cancer Med 4, 565–574 (2015).
Tiang, J. M., Butcher, N. J., Cullinane, C., Humbert, P. O. & Minchin, R. F. RNAi-mediated knock-down of arylamine N-acetyltransferase-1 expression induces E-cadherin up-regulation and cell-cell contact growth inhibition. PLoS One 6, e17031, https://doi.org/10.1371/journal.pone.0017031 (2011).
Tiang, J. M., Butcher, N. J. & Minchin, R. F. Small molecule inhibition of arylamine N-acetyltransferase Type I inhibits proliferation and invasiveness of MDA-MB-231 breast cancer cells. Biochem Biophys Res Commun 393, 95–100, https://doi.org/10.1016/j.bbrc.2010.01.087 (2010).
Stepp, M. W., Doll, M. A., Carlisle, S. M., States, J. C. & Hein, D. W. Genetic and small molecule inhibition of arylamine N-acetyltransferase 1 reduces anchorage-independent growth in human breast cancer cell line MDA-MB-231. Mol Carcinog 57, 549–558, https://doi.org/10.1002/mc.22779 (2018).
Stepp, M. W. et al. Congenic rats with higher arylamine N-acetyltransferase 2 activity exhibit greater carcinogen-induced mammary tumor susceptibility independent of carcinogen metabolism. BMC Cancer 17, 017–3221 (2017).
Zhang, X. et al. High N-Acetyltransferase 1 Expression Is Associated with Estrogen Receptor Expression in Breast Tumors, but Is not Under Direct Regulation by Estradiol, 5alpha-androstane-3beta,17beta-Diol, or Dihydrotestosterone in Breast Cancer Cells. J Pharmacol Exp Ther 365, 84–93 (2018).
Laurieri, N. et al. A novel color change mechanism for breast cancer biomarker detection: naphthoquinones as specific ligands of human arylamine N-acetyltransferase 1. PLoS One 8, e70600, https://doi.org/10.1371/journal.pone.0070600 (2013).
Russell, A. J. et al. Selective small molecule inhibitors of the potential breast cancer marker, human arylamine N-acetyltransferase 1, and its murine homologue, mouse arylamine N-acetyltransferase 2. Bioorg Med Chem 17, 905–918, https://doi.org/10.1016/j.bmc.2008.11.032 (2009).
Egleton, J. E. et al. Structure-activity relationships and colorimetric properties of specific probes for the putative cancer biomarker human arylamine N-acetyltransferase 1. Bioorg Med Chem 22, 3030–3054, https://doi.org/10.1016/j.bmc.2014.03.015 (2014).
Patin, E. et al. Deciphering the ancient and complex evolutionary history of human arylamine N-acetyltransferase genes. Am J Hum Genet 78, 423–436, https://doi.org/10.1086/500614 (2006).
Mortensen, H. M. et al. Characterization of genetic variation and natural selection at the arylamine N-acetyltransferase genes in global human populations. Pharmacogenomics 12, 1545–1558, https://doi.org/10.2217/pgs.11.88 (2011).
Vangenot, C. et al. Humans and chimpanzees display opposite patterns of diversity in arylamine N-acetyltransferase genes. G3 (Bethesda), https://doi.org/10.1534/g3.119.400223 (2019).
Tsirka, T. et al. Comparative analysis of xenobiotic metabolising N-acetyltransferases from ten non-human primates as in vitro models of human homologues. Sci Rep 8, 018–28094 (2018).
Sabbagh, A. et al. Rapid birth-and-death evolution of the xenobiotic metabolizing NAT gene family in vertebrates with evidence of adaptive selection. BMC Evol Biol 13, 62, https://doi.org/10.1186/1471-2148-13-62 (2013).
Boukouvala, S., Sabbagh, A. & Fakis, G. In Arylamine N-Acetyltransferases in Health and Disease 197–229 (2018).
Wu, H. et al. Structural basis of substrate-binding specificity of human arylamine N-acetyltransferases. J Biol Chem 282, 30189–30197, https://doi.org/10.1074/jbc.M704138200 (2007).
Teixeira, R. L. et al. Sequence analysis of NAT2 gene in Brazilians: identification of undescribed single nucleotide polymorphisms and molecular modeling of the N-acetyltransferase 2 protein structure. Mutat Res 683, 43–49, https://doi.org/10.1016/j.mrfmmm.2009.10.009 (2010).
Grant, D. M. et al. Human acetyltransferase polymorphisms. Mutat Res 376, 61–70 (1997).
Hubbard, A. L., Moyes, C., Wyllie, A. H., Smith, C. A. & Harrison, D. J. N-acetyl transferase 1: two polymorphisms in coding sequence identified in colorectal cancer patients. Br J Cancer 77, 913–916 (1998).
Hughes, N. C. et al. Identification and characterization of variant alleles of human acetyltransferase NAT1 with defective function using p-aminosalicylate as an in-vivo and in-vitro probe. Pharmacogenetics 8, 55–66 (1998).
Payton, M. A. & Sim, E. Genotyping human arylamine N-acetyltransferase type 1 (NAT1): the identification of two novel allelic variants. Biochem Pharmacol 55, 361–366 (1998).
Fretland, A. J., Doll, M. A., Leff, M. A. & Hein, D. W. Functional characterization of nucleotide polymorphisms in the coding region of N-acetyltransferase 1. Pharmacogenetics 11, 511–520 (2001).
Zhu, Y. & Hein, D. W. Functional effects of single nucleotide polymorphisms in the coding region of human N-acetyltransferase 1. Pharmacogenomics J 8, 339–348, https://doi.org/10.1038/sj.tpj.6500483 (2008).
Hein, D. W., Fakis, G. & Boukouvala, S. Functional expression of human arylamine N-acetyltransferase NAT1*10 and NAT1*11 alleles: a mini review. Pharmacogenet Genomics 28, 238–244, https://doi.org/10.1097/fpc.0000000000000350 (2018).
Karagianni, E. P. et al. Homologues of xenobiotic metabolizing N-acetyltransferases in plant-associated fungi: Novel functions for an old enzyme family. Sci Rep 5, 12900, https://doi.org/10.1038/srep12900 (2015).
Tsirka, T., Boukouvala, S., Agianian, B. & Fakis, G. Polymorphism p.Val231Ile alters substrate selectivity of drug-metabolizing arylamine N-acetyltransferase 2 (NAT2) isoenzyme of rhesus macaque and human. Gene 536, 65–73, https://doi.org/10.1016/j.gene.2013.11.085 (2014).
Lebendiker, M. & Danieli, T. Production of prone-to-aggregate proteins. FEBS Lett 588, 236–246, https://doi.org/10.1016/j.febslet.2013.10.044 (2014).
Schrodel, A. & de Marco, A. Characterization of the aggregates formed during recombinant protein expression in bacteria. BMC Biochem 6, 10, https://doi.org/10.1186/1471-2091-6-10 (2005).
Haacke, A., Fendrich, G., Ramage, P. & Geiser, M. Chaperone over-expression in Escherichia coli: apparent increased yields of soluble recombinant protein kinases are due mainly to soluble aggregates. Protein Expr Purif 64, 185–193, https://doi.org/10.1016/j.pep.2008.10.022 (2009).
Doll, M. A., Jiang, W., Deitz, A. C., Rustan, T. D. & Hein, D. W. Identification of a novel allele at the human NAT1 acetyltransferase locus. Biochem Biophys Res Commun 233, 584–591, https://doi.org/10.1006/bbrc.1997.6501 (1997).
de Leon, J. H., Vatsis, K. P. & Weber, W. W. Characterization of naturally occurring and recombinant human N-acetyltransferase variants encoded by NAT1. Mol Pharmacol 58, 288–299 (2000).
Goodfellow, G. H., Dupret, J. M. & Grant, D. M. Identification of amino acids imparting acceptor substrate selectivity to human arylamine acetyltransferases NAT1 and NAT2. Biochem J 348(Pt 1), 159–166 (2000).
Dupret, J. M., Dairou, J., Atmane, N. & Rodrigues-Lima, F. Inactivation of human arylamine N-acetyltransferase 1 by hydrogen peroxide and peroxynitrite. Methods Enzymol 400, 215–229 (2005).
Wang, H. et al. Over-expression, purification, and characterization of recombinant human arylamine N-acetyltransferase 1. Protein J 24, 65–77 (2005).
Liu, L., Wagner, C. R. & Hanna, P. E. Human arylamine N-acetyltransferase 1: in vitro and intracellular inactivation by nitrosoarene metabolites of toxic and carcinogenic arylamines. Chem Res Toxicol 21, 2005–2016, https://doi.org/10.1021/tx800215h (2008).
Liu, L., Wagner, C. R. & Hanna, P. E. Isoform-selective inactivation of human arylamine N-acetyltransferases by reactive metabolites of carcinogenic arylamines. Chem Res Toxicol 22, 1962–1974, https://doi.org/10.1021/tx9002676 (2009).
Ragunathan, N. et al. Cadmium alters the biotransformation of carcinogenic aromatic amines by arylamine N-acetyltransferase xenobiotic-metabolizing enzymes: molecular, cellular, and in vivo studies. Environ Health Perspect 118, 1685–1691, https://doi.org/10.1289/ehp.1002334 (2010).
Duval, R. et al. Identification of cancer chemopreventive isothiocyanates as direct inhibitors of the arylamine N-acetyltransferase-dependent acetylation and bioactivation of aromatic amine carcinogens. Oncotarget 7, 8688–8699, https://doi.org/10.18632/oncotarget.7086 (2016).
Xu, X. et al. Human Arylamine N-Acetyltransferase 1 Is Inhibited by the Dithiocarbamate Pesticide Thiram. Mol Pharmacol 92, 358–365, https://doi.org/10.1124/mol.117.108662 (2017).
Hein, D. W. N-acetyltransferase SNPs: emerging concepts serve as a paradigm for understanding complexities of personalized medicine. Expert Opin Drug Metab Toxicol 5, 353–366, https://doi.org/10.1517/17425250902877698 (2009).
Fakis, G., Boukouvala, S., Kawamura, A. & Kennedy, S. Description of a novel polymorphic gene encoding for arylamine N-acetyltransferase in the rhesus macaque (Macaca mulatta), a model animal for endometriosis. Pharmacogenet Genomics 17, 181–188, https://doi.org/10.1097/FPC.0b013e328011e3ad (2007).
de Magalhaes, J. P. & Costa, J. A database of vertebrate longevity records and their relation to other life-history traits. J Evol Biol 22, 1770–1774, https://doi.org/10.1111/j.1420-9101.2009.01783.x (2009).
Senisterra, G., Chau, I. & Vedadi, M. Thermal denaturation assays in chemical biology. Assay Drug Dev Technol 10, 128–136, https://doi.org/10.1089/adt.2011.0390 (2012).
Boukouvala, S. & Fakis, G. Arylamine N-acetyltransferases: what we learn from genes and genomes. Drug Metab Rev 37, 511–564, https://doi.org/10.1080/03602530500251204 (2005).
Butcher, N. J., Boukouvala, S., Sim, E. & Minchin, R. F. Pharmacogenetics of the arylamine N-acetyltransferases. Pharmacogenomics J 2, 30–42 (2002).
Butcher, N. J., Ilett, K. F. & Minchin, R. F. Functional polymorphism of the human arylamine N-acetyltransferase type 1 gene caused by C190T and G560A mutations. Pharmacogenetics 8, 67–72 (1998).
Betts, M. J. & Russel, R. B. In Bioinformatics for Geneticists (eds Michael R. Barnes & Ian C. Gray) Ch. 14, 289–316 (John Wiley & Sons, Ltd, 2003).
Walraven, J. M., Zang, Y., Trent, J. O. & Hein, D. W. Structure/function evaluations of single nucleotide polymorphisms in human N-acetyltransferase 2. Curr Drug Metab 9, 471–486 (2008).
Zhou, X., Ma, Z., Dong, D. & Wu, B. Arylamine N-acetyltransferases: A Structural Perspective. Br J Pharmacol, https://doi.org/10.1111/bph.12182 (2013).
Luca, F. et al. Multiple advantageous amino acid variants in the NAT2 gene in human populations. PLoS One 3, e3136, https://doi.org/10.1371/journal.pone.0003136 (2008).
Magalon, H. et al. Population genetic diversity of the NAT2 gene supports a role of acetylation in human adaptation to farming in Central Asia. Eur J Hum Genet 16, 243–251, https://doi.org/10.1038/sj.ejhg.5201963 (2008).
Sabbagh, A. et al. Worldwide distribution of NAT2 diversity: implications for NAT2 evolutionary history. BMC Genet 9, 21, https://doi.org/10.1186/1471-2156-9-21 (2008).
Sabbagh, A., Darlu, P., Crouau-Roy, B. & Poloni, E. S. Arylamine N-acetyltransferase 2 (NAT2) genetic diversity and traditional subsistence: a worldwide population survey. PLoS One 6, e18507, https://doi.org/10.1371/journal.pone.0018507 (2011).
Patillon, B. et al. A homogenizing process of selection has maintained an “ultra-slow” acetylation NAT2 variant in humans. Hum Biol 86, 185–214, https://doi.org/10.13110/humanbiology.86.3.0185 (2014).
Podgorna, E. et al. Variation in NAT2 acetylation phenotypes is associated with differences in food-producing subsistence modes and ecoregions in Africa. BMC Evol Biol 15, 263, https://doi.org/10.1186/s12862-015-0543-6 (2015).
Sabbagh, A., Darlu, P., Vangenot, C. & Poloni, E. S. In Arylamine N-Acetyltransferases in Health and Disease 165–193 (2018).
Bonhomme, M., Cuartero, S., Blancher, A. & Crouau-Roy, B. Assessing natural introgression in 2 biomedical model species, the rhesus macaque (Macaca mulatta) and the long-tailed macaque (Macaca fascicularis). J Hered 100, 158–169, https://doi.org/10.1093/jhered/esn093 (2009).
Blancher, A. et al. Mitochondrial DNA sequence phylogeny of 4 populations of the widely distributed cynomolgus macaque (Macaca fascicularis fascicularis). J Hered 99, 254–264, https://doi.org/10.1093/jhered/esn003 (2008).
Uno, Y., Murayama, N. & Yamazaki, H. Molecular and Functional Characterization of N-Acetyltransferases NAT1 and NAT2 in Cynomolgus Macaque. Chem Res Toxicol 31, 1269–1276, https://doi.org/10.1021/acs.chemrestox.8b00236 (2018).
Brooke, E. W. et al. An approach to identifying novel substrates of bacterial arylamine N-acetyltransferases. Bioorg Med Chem 11, 1227–1234 (2003).
Hall, T. A. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series 41, 95–98 (1999).
Stephens, M. & Donnelly, P. A comparison of bayesian methods for haplotype reconstruction from population genotype data. Am J Hum Genet 73, 1162–1169, https://doi.org/10.1086/379378 (2003).
Hein, D. W., Boukouvala, S., Grant, D. M., Minchin, R. F. & Sim, E. Changes in consensus arylamine N-acetyltransferase gene nomenclature. Pharmacogenet Genomics 18, 367–368, https://doi.org/10.1097/FPC.0b013e3282f60db0 (2008).
Acknowledgements
We thank Professor Edith Sim (University of Oxford, UK) for kindly providing NAT1-specific polyclonal antiserum #183.
Author information
Authors and Affiliations
Contributions
S.B., G.F., B.C.R. and A.S. conceived of and designed the study, supervised implementation of the experimental procedures and analysed the data; Z.C., D.G., E.K., N.M., S.V.R., I.S., T.T., C.V. and S.Z. (presented in alphabetical order) carried out the experiments and performed initial data analyses with equal contributions. S.B. and G.F. wrote the manuscript. The materials and datasets generated during the study are available from G.F. on reasonable request. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing Interests
T.T. was recipient of PhD scholarships from the Onassis Foundation and the State Scholarships Foundation of Greece. The EDB lab was supported by “Investissement d’Avenir” Grants (CEBA, ANR 10LabX-0025; TULIP, ANR-10-labX-41). S.Z. is currently an employee of Medicon Hellas S.A., Greece. None of the funding bodies had any involvement in the design or conduct of this study or preparation of the manuscript.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Boukouvala, S., Chasapopoulou, Z., Giannouri, D. et al. Population variability of rhesus macaque (Macaca mulatta) NAT1 gene for arylamine N-acetyltransferase 1: Functional effects and comparison with human. Sci Rep 9, 10937 (2019). https://doi.org/10.1038/s41598-019-47485-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-47485-x
- Springer Nature Limited