Background

Cytochrome P450 family 2 subfamily D member 6 (CYP2D6) is one of the most important drug-metabolizing enzymes expressed in humans. The enzyme metabolizes 20–30 % of all xenobiotics including numerous antidepressants, antipsychotics, anti-emetics, analgesics, and cardiovascular medications. The expression of CYP2D6 is highly polymorphic; more than 70 allelic variants have been identified, and deletion or duplication of the gene leads to variable enzyme function in individual patients [1, 2]. The CYP2D6 gene has been mapped to chromosome 22q13.1 and consists of nine exons with an open reading frame of 1491 base pairs coding for 497 amino acids [25]. Variability in enzyme expression has been associated with altered drug effectiveness and safety [2]. In fact, there are four functional genotype groups identified for the enzyme, poor metabolizers (PMs), intermediate metabolizers (IMs), extensive metabolizers (EMs), and ultra-rapid metabolizers (UMs) based upon the number and activity of the gene copies the patients express (Table 1) [1, 6]. Therefore, many researchers and clinicians have targeted genotyping CYP2D6 in efforts to improve drug therapy.

Table 1 Predicted CYP2D6 enzyme function based upon the activity score derived from SNV identification

Expanding on previously published work [79], we sought to determine the effectiveness and safety of metoprolol succinate utilizing a systems biology approach, whereby we genotyped the drug-metabolizing enzyme CYP2D6 as well as the drug target, β1-adrenoreceptor (ADRB1). Additionally, we captured demographic and clinical factors that are necessary to understand the potential for individualized therapy in patients taking metoprolol succinate. Metoprolol is a B1 selective antagonist, known as a beta-blocker. Beta-blockers are first-line treatment for heart failure, hypertension (HTN), angina, and myocardial infarction [1014]. In 2011, 34.5 million prescriptions for metoprolol were written in the USA [15]. Thus, metoprolol is a first-line therapy for several of the most common chronic diseases and is one of the most commonly prescribed drugs in the USA. CYP2D6 is the only clinically pertinent pathway of metoprolol metabolism, and polymorphisms have been associated with altered levels of metoprolol [16, 17]. ADRB1 is the drug target, and polymorphisms in this receptor have been associated with variable drug response [7, 18]. Prediction of drug response with knowledge of one without knowledge of the other could be incomplete since both contribute to the ultimate clinical effect [19]. This highlights the importance of accounting for both factors. Prior investigators have identified imperfect associations with CYP2D6 [16, 17] and ADRB1 [7, 18] genotypes, but no study has been appropriately powered to account for variability in both genotypes [19]. No single assay accounts for variants in both genes. Given that multiple single-nucleotide variants (SNVs) and the presence of gene duplication can affect CYP2D6 and ADRB1, a simple, efficient, and inexpensive method that identifies both SNVs and gene duplication is required to allow for an adequately powered systems biology approach.

CYP2D6 and ADRB1 variants have been identified using long-range polymerase chain reaction (XL-PCR) with PCR restriction fragment length polymorphism (PCR-RFLP) or microarray analysis. Microarray analysis remains expensive for large-scale genotyping. Commercially available tests cost a minimum of $500 per patient. XL-PCR with PCR-RFLP accounts for allele variants, and multiplication is inexpensive, easy, and reliable in patients regardless of ethnicity or race [6]. However, XL-PCR with PCR-RFLP requires numerous enzymatic digestions and multiple amplification steps to identify more than one polymorphism in the CYP2D6 gene. In order to conduct a large-scale clinical trial to determine the effectiveness and safety of metoprolol in treating hypertension, we sought to develop a simple, rapid, yet less expensive assay for CYP2D6 and ADRB1 that accounts for numerous polymorphisms including gene duplications. Drug effectiveness data will be presented at a later date when the trial closes.

Material and methods

Study design and setting

This was a prospective observational clinical trial (NCT02293096) that enrolled patients with uncontrolled HTN from clinics, the emergency department, and across the community at the University of Colorado Hospital in Aurora, Colorado.

Subjects

In accordance with the University of Colorado IRB approval, we enrolled subjects with uncontrolled HTN between 30 and 80 years of age. Exclusion criteria included end-stage liver disease, glomerular filtration rate < 60 ml/min/1.73 m2, pregnancy, American Society of Anesthesiologists (ASA) classification of >3, prisoners or wards of the state, decisionally challenged, heart rate < 60 beats per minute, AV block > 240 ms, active reactive airway disease, illicit drug use in the preceding 30 days (excluding marijuana), allergy to metoprolol succinate, or severe peripheral arterial circulatory disorders.

Drug intervention

Patients were initially managed according to the Eighth Joint National Committee guidelines for management of HTN [20]. Patients were started on angiotensin-converting enzyme inhibitor or angiotensin receptor blockers as first-line therapy. If blood pressure remained uncontrolled, then metoprolol succinate was added. Patients were then followed up and metoprolol titrated weekly for 4 weeks. Drug effectiveness data will be presented at a later date.

DNA isolation

Genomic DNA was extracted from whole blood via the Puregene® Blood Core kit B (Qiagen) according to the manufacturer’s instructions.

Long-range PCR analyses to determine CYP2D6 gene multiplication

We analyzed CYP2D6 duplication using long-range PCR with the primers described by Lovlie et al. [21]. This primer combination amplifies a 5.2-kb PCR fragment from the CYP2D7-CYP2D6 intergenic regions in all individuals and a 3.6-kb PCR fragment from the CYP2D6-CYP2D6 region in individuals with a duplication of the gene [22]. The amplification was done using the Phusion High-Fidelity DNA Polymerase kit (New England Biolabs). The reaction components included 2 μl of 5× Phusion HF buffer, 0.2 μl of 25 mM dNTPs, 0.5 μl of 10 μM/μl forward primer, 0.5 μl of 10 μM/μl reverse primer, 1 μl of template DNA, 0.2 μl of Phusion DNA polymerase, 0.3 μl DMSO, 2.5 μl of BETAIN, and 2.8 μl of nuclease-free water. The total reaction volume was 10 μl. Thermal cycling conditions were as follows: 98 °C for 30 s followed by 35 cycles of 98 °C for 10 s, 67 °C for 30 s, and 72 °C for 2 min 30 s. After cycling, samples were stored at 4 °C. PCR products were analyzed by electrophoresis.

Long-range PCR analyses to determine CYP2D6 gene deletion

To detect deletion of the entire CYP2D6 gene, (CYP2D6 *5) we performed a multiplex longer-PCR reaction with the primers described by Okubo et al. [23]. The reaction components included 2 μl of 5× Phusion HF buffer, 0.2 μl of 25 mM dNTPs, 0.2 μl of 10 μM F1 primer, 0.2 μl of 10 μM R2 primer, 2 μl of 10 μM F3 primer, 1 μl of template DNA, 0.2 μl of Phusion DNA polymerase, 2.5 μl of BETAIN, and 1.7 μl of nuclease-free water. The PCR conditions were as follows: initiation at 98 °C for 3 min, 35 cycles of 98 °C for 10 s and 70 °C for 30 s, termination at 72 °C for 2 min 30 s, and a final elongation at 72 °C for 10 min. The PCR products were examined by electrophoresis and include the amplification of a 3.5-kb fragment, which indicates CYP2D6 *5, and a 4.7-kb fragment, which indicates CYP2D6 wild type.

SNV selection

We detected 20 variants associated with altered CYP2D6 enzyme activity. SNVs were chosen because they are representative of major haplotypes associated with altered enzyme function without providing redundancy of rsIDs leading to the same functional activity within the same allele variant (Table 2). If none of these SNVs were identified, the allele designation was defaulted to the reference allele, CYP2D6*1. Haplotype analysis for CYP2D6 was based upon predicted enzyme activity of the SNVs identified in the genotyping stage.

Table 2 CYP2D6 and ADRB1 variant alleles with rsIDs and subsequent effect on the gene sequence

Additionally, two ADRB1 SNVs were genotyped (Table 2) because haplotypes of these alleles are known to be associated with altered clinical response to metoprolol treatment (Table 3). If neither of these SNVs were identified, the allele was defaulted to Ser49 and Gly389, the reference allele.

Table 3 ADRB1 haplotype and associated metoprolol clinical effect

Genotyping

Genotyping was performed using two separate multiplex reactions. First, we used PCR to amplify a purified template DNA fragment that included the target nucleotide. We designed a total of eight pairs of PCR primers (Table 4) that were divided into two separate pools. The PCR was performed with the AmpliTaq® Gold kit (Applied Biosystems) using a hot start/touchdown PCR assay. The 10 μl reaction included 1 μl of template DNA, 1.6 μl of 10× GeneAmp® PCR Buffer, 0.15 μl of 25 mM/each dNTPs, 0.4 μl of 1 μM/each primer mix, 2.15 μl of 50 mM MgCl2, 0.15 μl of 5 U/μl AmpliTaq Gold, and 4.55 μl of nuclease-free water. Thermal cycling conditions were as follows: 94 °C for 5 min followed by 15 cycles of 94 °C for 30 s; annealing temperature steps down every cycle by 0.5 °C (from 63 to 56.5 °C) every 30 s and then 72 °C for 1 min. The annealing temperature for the final 25 cycles was 56 °C with a denaturation temperature of 94 °C for 30 s and extension temperature of 72 °C for 1 min. After PCR, the products were first treated with shrimp alkaline phosphatase (SAP) and Exonuclease Ι (Exo Ι) to remove excess primers and dNTPs. Two units of SAP and 1 unit of Exo Ι were added to 5 μl of PCR product and incubated at 37 °C for 30 min and then 80 °C for 15 min. Second, we performed multiplex single-base extraction (SBE) reactions. We designed 19 SBE primers and divided the reactions into two pools. See Table 5. Each SBE reaction was carried out in a 10 μl final volume containing 5 μl of SNaPshot Multiplex Ready Reaction Mix, 2 μl of pooled PCR products, 1 μl of pooled SNaPshot primers (0.3 μmol/each μl), and 2 μl of deionized water. Extension was performed for 25 cycles under the following conditions: 96 °C for 10 s, 50 °C for 5 s, and 60 °C for 30 s. To remove ddNTPs, the SBE reactions were then treated with 1.0 unit of calf-intestinal phosphatase (CIP) and incubated at 37 °C for 30 min. The enzyme was deactivated by incubating at 80 °C for 15 min. The samples were run by electrophoresis on the 3130 Genetic Analyzer (Applied Biosystems). The subsequent data were analyzed with GeneScan software and GeneScan-120 LIZ size standard.

Table 4 The oligonucleotide PCR primers used for amplification of genomic DNA to obtain template
Table 5 The oligonucleotide primers for SNaPshot primer extension reactions

CYP2D6 multiple-copy allele determination

The SNaPshot Multiplex System (Applied Biosystems) is a primer extension-based method developed for the analysis of SNVs. Theoretically, with the same SNV peak in the same reacting system with the same reaction conditions, a comparatively higher density means more DNA copies, and the ratios of any two peak densities of the same sample are relatively inflexible. We identified the copy numbers and duplicated/multiplicated allele by the ratio of the peak densities. In each pool, we designed one primer to identify an ADRB1 gene’s SNV. ADRB1 is a single-copy gene which allowed us to use the peaks of ADRB1 gene’s SNVs as the inner standard peaks. We calculated the ratios of the peak density of CYP2D6 SNVs to the peak density of the ADRB1 SNV. We set the reliability value range of the ratios of each SNV based on the values of the ratio of single-copy control samples with the same SNVs. If a ratio of a SNV of a known duplicated sample was greater than the maximum of the reference reliability range, this SNV was designated a duplication. We identified the copy number based upon that value. Based on published results and the values of the ratios, we assume all duplicated samples have the total copy number of three since this was the highest number of copy number variants (CNVs) in our controls.

Activity scoring and predicted phenotype assignment

Each identified CYP2D6 SNV was assigned a predicted enzyme activity score [1, 6]. Gene deletions were designated as an activity score of zero. The predicted enzyme phenotype was determined by addition of the individual gene activity scores, accounting for gene copies yielding decreased enzyme activity and gene duplications in each patient. A score of 0 was predicted to be a PM, 0.5 was predicted to be IM, 1–2 was predicted to be an EM, and 2.5 or greater was predicted to have a UM phenotype.

Assay verification

Genotypes were confirmed with known reference genotype samples from 5 PMs, 4 IMs, and 24 EMs [8, 24]. Copy number variations were determined by Taqman Copy Number Assay (Life Technologies, CA) and then by pyrosequencing allele quantification in the known samples [24].

Results

Subjects

The demographics on the initial 79 subjects with unknown haplotypes in this cohort were as follows: the median age was 52 (IQR 45, 60), 46 (58.2 %) were males, 14 (19.9 %) were Hispanic/Latino, 37 (46.8 %) were Black or African American, 3 (3.8 %) were American Indian or Alaskan Native, 3 (3.8 %) were of Asian decent, 1 (1.3 %) was Native Hawaiian or Pacific Islander, and 38 (48.1 %) were Caucasian.

Genotypes

Our genotyping method demonstrated consistent results with all 30 reference standards. The CYP2D6 haplotype analysis revealed 7 (5.88 %) CYP2D6 PMs, 18 (15.1 %) IMs, 89 (74.78 %) EMs, and 5 (4.2 %) UMs (Table 6). We also genotyped two variants of ADRB1, rs1801253 (Gly389Arg) and rs1801252 (Ser49Gly). The Gly389Arg genotype is Gly/Gly 18 (15.1 %), Gly/Arg 58 (48.7 %), and Arg/Arg 43 (36.1 %). The Ser49Gly genotype is Ser/Ser 82 (68.9 %), Ser/Gly 32 (26.9), and Gly/Gly 5 (4.2 %). See Table 7. CYP2D6 allele frequencies are shown in Table 8. All genotypes were in the Hardy-Weinberg equilibrium (HWE) after allele designation by SNV identification and CNV determination.

Copy number variants

We identified 13 subjects with CNVs with predicted enzyme activities scores ranging from 0 to 3 (Table 9). An example CNV determination is as follows. One sample had a genotype of *4/*41 and the total gene copy number of three. In this sample, we detected four heterozygous SNVs: rs1065852, rs3892097, rs28371725, and rs16947. The ratio of rs1065852 was 0.92, higher than the maximum of the reference reliability range (0.7–0.85). The ratio of rs3892097 was 0.98, higher than the maximum of the reference reliability range (0.6–0.9). The ratio of rs28371725 was 1.97, lower than the minimum of the reference reliability range (2.4–2.9). The ratio of rs16947 was 1.09, lower than minimum of the reference reliability range (1.28–1.38). All these ratios suggest that the variant allele, *4 was duplicated in this sample. Thus, the genotype is *4xN/*41. Since *4 is a non-functional allele, the activity score is assigned 0, and *41 is a reduced function allele, assigned an activity score of 0.5, even if the *4 is duplicated, the activity score remains 0.5. Hence, the genotype for this sample was designated IM (Table 6).

Table 6 Distribution of CYP2D6 genotypes and phenotypes (n = 119, including 79 unknown subjects and 30 reference subjects)
Table 7 ADRB1 genotype
Table 8 CYP2D6 Allele frequencies
Table 9 Genotypes, activity score and predicted phenotypes for samples with gene duplications

Assay verification

Genotyping results for the UM alleles showed a high degree of concordance between the Taqman Copy Number Assay paired with pyrosequencing quantification [24] and our SNaPshot methods. In fact, the only difference between the two assays was the identification of an additional CYP2D6*5 gene deletion with our multiplex longer-PCR method. Therefore, this assay is reliable for CYP2D6 genotyping dependent upon the polymorphisms listed.

Limitations

This method should be validated in an additional cohort with known genotypes to ensure concordance with other haplotype designations. This assay has not been validated to determine more than three CNVs because the control samples did not contain subjects with more than three. However, more than three CNVs of the identified SNVs are universally considered UMs. If additional CNVs of SNVs with associated lower activity scores are found, this may become important for haplotype distinction, depending upon the genotype at the second loci. Additional SNVs with altered predicted enzyme activity will not be captured unless the additional primers are added to the reaction pool. This flexibility of the method is an advantage though the assay is limited by 10 SNVs per pool and identification of allele duplications requires samples to be tested in batches in order to establish reference data. Oversaturation of the assay with additional primers in each pool may lead inconsistent detection of SNVs. Therefore, significantly increasing the number of SNVs identified will require larger volumes of DNA. With the presented assay, less than 60 ng of DNA was necessary to genotype 10 SNVs in these two genes in each sample.

Discussion

We have demonstrated that the SNaPshot method of genotyping CYP2D6 and ADRB1 is reliable and efficient for rapidly identifying numerous SNVs. We demonstrate concordance with known reference standards using this method that requires no enzymatic digestion and can be performed at high volumes. The only difference was an additional identification of a CYP2D6*5 gene deletion not identified by the Taqman method. Less than 60-ng genomic DNA was used for each sample in our assay. The assay is flexible; it would be easy to add additional primers to cover more CYP2D6 SNVs should this be necessary. This customizable assay has advantages given the speed of discovery of CYP SNV identification. As demonstrated by the success with CYP2D6, the multiplex SNaPshot method can be used for designing genotype assays for complex genotyping circumstances rapidly and inexpensively.

This method allows genotyping patients for two genes at only $40 per sample compared to commercially available microarray assays that cost in excess of $500 per sample. While the assay is limited by 10 SNVs per pool, this provided flexibility of the assay to add additional SNVs should they be clinically important. Additional SNVs would require additional DNA, but only 30 ng is required per pool allowing for significant up-scaling of the assay. This method is well suited for pharmacogenes with well-established clinical associations and a finite number of SNVs.

Subjects in our cohort had predicted phenotype frequencies similar to European populations.

Genotypes for CYP2D6 and ADRB1 demonstrate that 25 % may have a gene-gene interaction affecting their metoprolol therapy. Results in our cohort demonstrate a non-responder haplotype in 31.9 % of subjects. Liu et al. demonstrated a non-responder phenotype based upon these same haplotypes in 45.9 % of subjects in that trial [18]. Variability in ethnic proportions may explain this discrepancy though the frequencies are similar. These genes do not co-segregate, requiring knowledge of both genotypes to determine this interaction potential. Paired with clinical outcomes, these gene-gene interactions can be further clarified as the cohort matures. Overall, the multiplex SNaPshot represents a valuable tool for systems biology studies in need of flexible genotyping methods.

Conclusion

Our multiplex SNaPshot protocol allows for specific and accurate detection of CYP2D6 and ADRB1 genotypes. This platform is flexible, simple, efficient, and suited for high throughput.