Introduction

Cytochrome P450 2D6 (CYP2D6) enzyme is a key member of the Cytochrome P450 superfamily implicated in detoxification and metabolism of a wide range of endogenous and exogenous compounds, as well as in hormone synthesis and breakdown [1, 2]. CYP2D6 is one of the most extensively investigated enzymes owing to its role in the metabolism of 25% of all clinically used drugs, including various antidepressants, β-blockers, several opioid analgesics and anticancer drugs [3]. CYP2D6 is a highly polymorphic gene located on chromosome 22q13.1 [4, 5]. So far, more than a hundred CYP2D6 allelic variants were reported in the Pharmacogene Variation Consortium (https://www.pharmvar.org/). Genetic polymorphisms are responsible for null, decreased, normal or increased functions, altering enzyme activity among individuals and populations and consequently affecting pharmacological therapy outcomes [6,7,8]. Several metabolic phenotypes have been reported depending on CYP2D6 genotypes. Actually, individuals with two non-functional alleles are considered poor metabolizers (PM) unable to metabolize drugs, while carrying at least one increased function allele confers an ultrarapid metabolizer phenotype (URM) with increased CYP2D6 activity [9]. Otherwise, individuals with at least one decreased function allele are supplying reduced CYP2D6 activity and are considered as intermediate metabolizers (IM). Normal metabolizers (NM) are individuals with wild type alleles and normal CYP2D6 enzyme activity. Clinical studies reported an association between abnormal metabolizer phenotypes and risk of undergoing dose-related adverse events or lack of treatment efficiency [10]. Furthermore, CYP2D6 genetic polymorphisms have been associated with many treatment failures in several diseases [11,12,13]. CYP2D6 is implicated in tamoxifen activation, a drug administered in hormonal breast cancer therapy, inducing an association between CYP2D6 genotype and treatment response in breast cancer disease [8, 14, 15].

Due to its implication in the susceptibility and treatment of several diseases, and the fact that polymorphic expression of CYP2D6 affects its activity, analysis of CYP2D6 genetic variability is required to develop appropriate therapy for successful treatment. Therefore, CYP2D6 has been investigated in many populations, and the finding showed a substantial variation in allele frequencies among populations [3, 16,17,18]. Indeed, to evaluate CYP2D6 genetic polymorphisms, three alleles were most studied among various populations considering their association with several disease outcomes and treatments [19,20,21]. CYP2D6*3 (rs35742686) is a single-base deletion at exon 5 (A2549del) causing null activity [22, 23]. CYP2D6*4 (rs3892097), a splice site mutation (1846G > A), yields an absence or defective protein in the liver (Sachse et al. 1997). The CYP2D6*10 (rs1065852) 100C/T polymorphism leads to substitution of proline to serine and causes a mRNA splicing defect which produces an IM phenotype [22]. The aim of the present study is to evaluate, for the first time, the distribution of the CYP2D6 gene (CYP2D6*3/*4/*10) in the Moroccan population.

Methods

Subjects and blood sample

The current population study enrolled 321 unrelated healthy volunteers recruited during a blood donation campaign. All volunteers from southern Morocco, whom were Arabic, Amazigh, or sub-Saharan Moroccans. All patients received a medical examination during the blood donation campaign and donors with any disease suspicion (diabetes, high arterial blood pressure, etc.) or cancer history were excluded from the study. Peripheral blood was collected based on the World Health Organization criteria (Blood Donor Selection Guidelines, 2012), and written informed consent was obtained from all individuals enrolled in the study. This study was performed under the approval of the Ethics Commission of Cadi Ayyad University Hospital Center (CHU) Mohammed VI, in Marrakech, Morocco.

CYP2D6 genotyping

Genomic DNA was extracted from whole blood using the conventional salting out procedure [24] with phenol chloroform purification. Extracted genomic DNA was quantified by Qubit Fluorometers (Invitrogen) and the measured values vary 50–74 ng/μl. After PCR amplifications, restriction fragment-length polymorphism analysis (RFLP) was used to genotype CYP2D6 allele analysis. Primers for DNA amplification are given in Table 1. All PCRs were performed in a total volume of 25 μl containing approximately 100 ng of DNA template, 1U of MyTaq of DNA Polymerase enzyme (Bioline, USA), MyTaq Reaction Buffer (0.5 mM NTPs, 1.5 mM MgCl2 with stabilizers and enhancers), along with 200 nM of each appropriate primer [25, 26]. Amplified PCR products were digested with BsaI, BstNI and HphI enzymes (New England Biolabs, USA) for CYP2D6*3, CYP2D6*4 and CYP2D6*10 alleles, respectively. Amplification sizes details and expected digestion results for each SNP are also shown in Table 1.

Table 1 Primers and restriction enzymes used for genotyping

Statistical analysis

Allele and genotype frequencies, as well as Hardy–Weinberg equilibrium (HWE), were assessed using SNPStats software [27]. Results are considering significant when P-value is less than 0.05. Pairwise linkage disequilibrium (LD) within the three SNP was performed in terms of Lewontin’s () defined based on normalizing coefficient of linkage disequilibrium D which measures the deviation of haplotype frequencies from expected values based on gene frequencies and informing if alleles are inherited together, and Pearson’s coefficient of correlation (r2) defined as D2 normalized by the product of all allele frequencies. Linkage disequilibrium was assessed using Haploview software [28]. Haplotypic frequencies were assessed using SNPStats software [27]. To compare CYP2D6 allele frequencies in Moroccans with other populations from different ethnic origins, we carried out a qui-square test, a Principal Component Analysis (PCA) and a dendrogram clustering, using R environment [29].

Results

The allele and genotype frequency distributions of the CYP2D6*3 (A2549del), *4 (G1846A) and *10 (C100T) variants were analyzed in the blood samples of 321 healthy volunteers. Figure 1 presents the alleles in the pattern of the fragments digested for the detection of CYP2D6*3 (A), CYP2D6*4 (B), CYP2D6*10 (C).

Fig. 1
figure 1

Pattern of the fragments digested A Electrophoretic separation of cytochrome P450 2D6*3 digested segments. B Electrophoretic separation of cytochrome P450 2D6*4 digested segments. C Electrophoretic separation of cytochrome P450 2D6*10 digested segments

According to HW equilibrium analysis (Table 2), all the studied SNPs were in HW equilibrium (P-value > 0.05).

Table 2 Hardy–Weinberg Equilibrium for CYP2D6 polymorphisms in the Moroccan population

Concerning CYP2D6 genotypes (Table 3), carriers of homozygous wild type for CYP2D6*3 were 76.4%, while 23.6% were heterozygous. The carriers of the wild homozygous genotype of the CYP2D6*4 variant were the most predominant with 80.3%, and the heterozygous genotype represented 18.3%, while the homozygous mutant genotype was 1.4%, while the wild homozygous and heterozygous of the CYP2D6*10 variant were 84.7% and 15.3%, respectively. Minor allele frequencies for CYP2D6*3, CYP2D6*4 and CYP2D6*10 were 11.8%, 10.5% and 7.7%, respectively.

Table 3 Genotypic and allelic frequencies of CYP2D6 polymorphisms in the Moroccan population

According to our results, the AGC haplotype was the most predominant within our population (77.19%) (Table 4). A strong LD (Dʹ = 0.69; r2 = 0.33; P = 0.000) was found between CYP2D6*4 and CYP2D6*10 SNP, as shown in Fig. 2.

Table 4 Haplotypes frequencies of the CYP2D6 (CYP2D6*3/CYP2D6*4/CYP2D6*10) polymorphisms
Fig. 2
figure 2

Haploview figure showing Linkage disequilibrium (LD) plots characterizing haplotype blocks in CYP2D6 gene polymorphisms, in terms of Lewontin’s coefficient (Dʹ) and Pearson’s (r2) statistics

Allele frequencies of the three CYP2D6 polymorphisms were compared with different populations including Mediterraneans, Europeans, Africans, South Americans and Asians (Table 5). The considered studies consisted in the evaluation of CYP2D6 genetic frequencies in a healthy subject (population studies), or investigation the impact of CYP2D6 genotypes on therapy outcomes, and exploring risk susceptibility of CYP2D6 (Association studies/Case–control studies). Results of the three SNPs frequencies comparison showed a significant difference in CYP2D6*3 allele frequency between Moroccans and other ethnic groups, while no difference was observed in CYP2D6*4 and CYP2D6*10 allelic frequencies within the Moroccan population and the majority of European, Mediterranean and African populations (Table 5).

Table 5 Distribution of CYP2D6 allele frequencies in the Moroccan population compared to other worldwide populations

We also performed a PCA and cluster dendrogram to compare CYP2D6 allele frequencies between Moroccans and other populations. However, CYP2D6*3 allele was excluded from PCA analyses due to the significant difference in this allele frequency between Moroccans and other ethnic groups which affect substantially the PCA results. The PCA including data of CYP2D6*4 and CYP2D6*10 alleles revealed genetic proximity between the Moroccan population and Europeans as well as Africans (Fig. 3B). The cluster dendrogram also showed genetic proximity between Moroccans and Africans as well as European populations (Fig. 3A).

Fig. 3
figure 3

A Dendrogram cluster showing relatedness between Moroccan population and Mediterranean, African and Asian populations for CYP2D6*4 and CYP2D6*10 alleles. B Principal component analysis of the CYP2D6 allele frequencies distribution on the Mediterranean, South African and Asian populations

Discussion

The CYP2D6 enzyme is implicated in the metabolism of a wide range of clinically used drugs, such as tamoxifen. Many investigations reported an association of CYP2D6 genetic polymorphisms with susceptibility and response to treatment in many diseases such as autoimmune conditions, cardiovascular diseases and several cancers [52,53,54,55,56,57]. Furthermore, clinical studies reported that CYP2D6 genetic polymorphisms are associated with adverse responses to many drugs such as opioids including codeine, antiarrhythmic drugs and anti-cancer drugs [11,12,13]. Indeed, CYP2D6 is implicated in tamoxifen transformation, a drug usually administrated in breast cancer hormonal therapy [58]. Furthermore, numerous studies reported an association between the CYP2D6 poor metabolizer genotype and recurrences of breast cancer disease with worse event-free survival rates [59]. The role of CYP2D6 includes a large number of medical specialties; indeed, the pharmacogenomics guideline committees have reviewed clinical relevance of CYP2D6, and compiled therapeutic recommendations for more than 48 drugs and developed recommendations based on CYP2D6 genotype/phenotype drugs combinations for 26 drugs (PharmGKB Clinical Guideline Annotations (https://www.pharmgkb.org/guidelineAnnotations). Thus, having information on CYP2D6 genotypes is crucial in deciding the most appropriate therapy for each patient.

Therefore, our study contributes to the determination of the genetic profile of CYP2D6 within the Moroccan population. Overall, the Moroccan population showed a predominance of the wild-type genotype (CYP2D6*3 AA (76.4%); CYP2D6*4 GG (80.3%); CYP2D6*10 CC (84.7%)). These genotypes are responsible for the NM phenotype. For each SNP, about 19% of individuals were carrying an IM phenotype. However, 1.4% of Moroccans had PM phenotype for CYP2D6*4. This latter was investigated in many populations and the results attested to its association with breast cancer susceptibility and treatment [60,61,62]. Haplotype analysis revealed a predominance of AGC haplotype in the Moroccan population (77.19% of cases).

According to our results, an increased CYP2D6*3 (Null allele metabolizer) allelic frequency was observed in Moroccan population compared to others. This allele was found to be associated with acute lymphoblastic leukemia and breast cancer disease [53, 57]. A similar CYP2D6*3 allele frequency was also observed within Brazilians. This proximity might be a consequence of numerous factors, as the fact that Brazilian population have received significant immigration from descendants of original north-African groups, including Berbers [63], previously stablished in Iberic peninsula (Spain and Portugal) during the large Arabic occupation for about 700 years (711–1492) [64]. Some of them moved directly to Brazil, when settled by Portuguese in 1500 and especially after its independence from Portugal in 1822 [65], while others made previous migration to Holland and Azoras islands [66].

The CYP2D6*4 was the most studied null allele within populations. Concerning CYP2D6*4 allelic frequencies, results revealed a significant difference between Moroccan and both Asian and African populations (Ghanaians, Ethiopians), versus a similarity with European and Mediterranean populations (Turkish, Iranian, Finnish, Spanish and Egyptians) (Table 5). This similarity is probably resulting from their closer geographical distance, inducing a certain admixture of populations which boosts a degree of genetic similarity [67]. For CYP2D6*10, a reduced-function allele, frequencies showed a great variability within populations. Indeed, CYP2D6*10 frequencies in Moroccans showed a disparity with European populations, versus a similarity with African, Iranian and Syrian populations. PCA results for CYP2D6*4 and CYP2D6*10 alleles clearly showed genetic proximity between Moroccans and both European and African populations. The dendrogram clustering of populations with such diverse ethnic backgrounds revealed three clusters, the first one represented by the Asians. This cluster is characterized by a high allelic frequency of the CYP2D6*10, and low frequency for CYP2D6*4, and it is clearly distinct from other populations. The second and the third clusters are composed of European and African populations, which are characterized by an average value for the two allele frequencies for the second cluster, and a CYP2D6*4 high allelic frequency and CYP2D6*10 low frequency for the third one. This latter includes Moroccans and other European and African populations (Spanish, Finnish, French, South African).

This study suggests that the Moroccans genetic profile was impacted by historical and demographic events, along the Mediterranean, European, and sub-African populations, leading to its current genetic diversity. Actually, this relevant genetic pool could be interpreted by demographic events, for instance, the influx of Arab populations from the Middle East, Sub-Saharans as well as populations around the Mediterranean area. All these groups contributed to the genetic patrimony of the present-day Moroccan population. [68]. Indeed, the present work and other studies of North African genetic variation, which devote attention to the history of North African and Mediterranean populations, presumed that demographic events contributed to the genetic homogeneity with the nearby regions [68, 69].

Overall, our study revealed that 1.4% of our population carried a PM phenotype for CYP2D6*4 polymorphism. Since CYP2D6 is implicated in the metabolism pathway of many drugs used in several disease treatments, carrying out of CYP2D6*4 polymorphism may affect treatment response. Therefore, pharmacogenetic screening for this gene before any therapy is crucial to avoid treatment failure, hence reduce cost-related issues [70]. Indeed, in many countries, patients take advantage of CYP2D6 screening before treatments [71]. Moroccan population should also consider this recommendation.

Conclusion

The present study attested to the genetic proximity between Moroccan, African and European populations. Furthermore, these results will help to lay a basis for clinical studies, aimed to introduce and optimize a personalized therapy for the Moroccan population.