Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers

Suarez, Philippe; Baumer, Karine; Hall, Diana

doi:10.1038/s41598-021-01940-w

Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers

Article
Open access
Published: 18 November 2021

Volume 11, article number 22530, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers

Download PDF

4893 Accesses
1 Citation
2 Altmetric
Explore all metrics

Abstract

The OCA2-HERC2 locus is responsible for the greatest proportion of eye color variation in humans. Numerous studies extensively described both functional SNPs and associated patterns of variation over this region. The goal of our study is to examine how these haplotype structures and allelic associations vary when highly variable markers such as microsatellites are used. Eleven microsatellites spanning 357 Kb of OCA2-HERC2 genes are analyzed in 3029 individuals from worldwide populations. We found that several markers display large differences in allele frequency (10% to 35% difference) among Europeans, East Asians and Africans. In Europe, the alleles showing increased frequency can also discriminate individuals with (IrisPlex) predicted blue and brown eyes. Distinct haplotypes are identified around the variants C and T of the functional SNP rs12913832 (associated to blue eyes), with linkage disequilibrium r² values significant up to 237 Kb. The haplotype carrying the allele rs12913832 C has high frequency (76%) in blue eye predicted individuals (30% in brown eye predicted individuals), while the haplotype associated to the allele rs12913832 T is restricted to brown eye predicted individuals. Finally, homozygosity values reach levels of 91% near rs12913832. Odds ratios show values of 4.2, 7.4 and 10.4 for four markers around rs12913832 and 7.1 for their core haplotype. Hence, this study provides an example on the informativeness of multiallelic markers that, despite their current limited potential contribution to forensic eye color prediction, supports the use of microsatellites for identifying causing variants showing similar genetic features and history.

Optimizing the genetic prediction of the eye and hair color for North Eurasian populations

Article Open access 10 September 2020

Skin pigmentation and genetic variants in an admixed Brazilian population of primarily European ancestry

Article 09 May 2020

Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour

Article Open access 27 February 2017

Introduction

The human eye color trait is the most variable in European populations and it was for a long time considered a simple Mendelian trait with a brown eye color dominant allele and a blue eye color recessive allele¹. Genome-wide association studies in people of European decent^2,3 have instead indicated eye color as a polygenic trait⁴ yet characterized by a limited number of major genes. The OCA2-HERC2 genes explain most of the blue and brown eye color inheritance⁵. Different polymorphisms in the regulatory and coding region of OCA2 are primarily associated with different eye, hair and skin pigmentation phenotypes^6,7,8,9. These findings increased our understanding of the genetic basis of human pigmentation, and drew attention to their potential applications, such as forensic investigations^10,11, historical and anthropological researches¹².

One SNP in particular, rs12913832 in HERC2, is responsible for the greatest proportion of eye color predictability, this SNP together with five SNPs located in other genes have been brought together in the IrisPlex eye color prediction panel¹¹. The accuracy rate of correctly predicting an individual's eye color as being blue or brown is on average 94% in Europe¹³. Additional variation has yet to be identified to account for the poor success rate for intermediate eye color predictions (73% accuracy) and in admixed populations^14,15.

The SNP rs12913832 is located in a (11 bp) conserved region of intron 86 of the HERC2 gene, 21 kb upstream of the promoter of OCA2. The rs12913832 C-derived allele is highly associated with European blue eye color as a recessive trait⁹. The ability of this conserved element to act as an enhancer of OCA2 transcription has been confirmed in experiments using melanocyte cultures carrying either rs12913832 T/T or rs12913832 C/C genotypes^16,17. The P protein, produced by OCA2, is thought to be a mature melanosomal membrane protein, with a potential role in trafficking other proteins to melanosomes^18,19.

Interestingly, the selection pressure on the OCA2-HERC2 region associated with blue eye color in Europeans has been strong^19,20. This region encompass the third longest haplotype spam of diminished heterozygosity in the genome of modern Europeans²¹ which implies intense selection at this locus in ancestral European populations. Multiple factors possibly played a role such as sexual²², the ability to overcome seasonal affective disorder^23,24,25 and associated light skin increased risk for developing melanoma and nonmelanoma skin cancer²⁶. Several lines of research indicate that selective pressure for light pigmentation acted independently in Europeans and East Asians, yet with some genes in common. The brown-eyed associated SNPs frequent in Europeans are different from that of Asians, suggesting a population specific history of the genetic component of pigmentation^19,27.

Evidences from haplotype analysis comparing Dutch and Mediterranean population samples suggest that blue eye color has only arisen once during the Neolithic period, past 6–10,000 years ago, as a founder mutation shared by diverse European populations¹⁷. During the great agriculture migration to the northern part of Europe, the mutations spread out from the Black Sea region. Newer studies have also indicated that some OCA2 missense mutations give rise to blue eye color, when the genotype rs12913832 C/T predicts brown eye color and that these are only found in the Scandinavian population and not in individuals of Southern-European descent^6,28. The OCA2-HERC2 is therefore a region of large interest due to its functional role and population genetics. Global SNP variation has been comprehensively described, yet more data are being available from ongoing projects of whole genome sequencing which indicate that multiallelic sites represent as much as 10% of the genomic variants.

In this study, we aim at further investigating the genetic variability of OCA2-HERC2 by using microsatellite markers (STRs) as an example of multiallelic variants. We are interested in exploring how the use of polymorphisms with a different mutational mechanism and higher rate, change the pattern of haplotype structures and allelic associations around functional alleles. Conveniently, the recent history of selection of OCA2 should enhance the degree of correlations of neutral variants even when highly variable polymorphisms are used. Our interest in these data is twofold: on the one hand, to provide an empirical example of the utility of STRs in studies of association mapping of similar traits, on the other hand, to search for rs12913832 proxy STR markers for a presumptive eye color DNA test to be included in conventional forensic STR multiplexes.

The density of genetic markers required for successful association mapping of complex diseases depends on linkage disequilibrium between non-functional markers and functional variants. There are few reports about the pattern or extent of linkage disequilibrium (LD) between SNPs and STRs genomewide^29,30,31. Yet, several results suggest that association studies using not only SNPs but also multiallelic STRs within or near candidate loci would be useful to search for a disease susceptibility gene, especially in populations with unknown LD structure. The rationale behind this is that in non-African samples highly significant LD between microsatellite alleles and stable markers is preserved across relatively large genetic distances (about 100 Kb range)²⁹ compared to the shorter range (about 30 Kb) for SNPs³². The average length of LD for microsatellites is ~ 100 Kb, which is considerably higher than that of SNPs^33,34. Therefore, a single microsatellite captures a larger genomic region than does a single SNP³⁵. One STR can harbor both types of alleles: those showing complete association with a SNP and alleles that show little or no association. STR outside LD blocks may still show LD with SNPs inside the LD block. In that case, for association studies we would not need an STR in each haplotype block as for SNPs. At the same time, microsatellites show a smaller interpolation variability^36,37 and a single-step expansion or contraction of the tandem repeat on the background of ancestral SNP haplotypes can break up common haplotypes, leading to greater haplotype diversity within the linkage disequilibrium block of interest. The relative performance of STR and SNP in association mapping will also depend on the frequency of disease variants. Markers alleles achieve maximal power for detecting associations when disease alleles are at similar frequencies³⁸. As a result, STR have the potential to find rare disease variants that common SNPs will miss^39,40. Several genome wide association studies used tens of thousands of STRs to investigate the genetic basis of hypertension, narcolepsy, anorexia nervosa, mandibular prognathism, rheumatoid arthritis, type 2 diabetes and prostate cancer^{41,42,43,44,45,46,47}.The known susceptibility gene for rheumatoid arthritis HLA-DRB1 was successfully confirmed and two new candidates (TNXB and NOTCH4)⁴³ were identified through a combination of STR and SNP analysis. Microsatellites have starred in association studies leading to widely replicated discoveries of type 2 diabetes (TCF7L2)⁴⁶ and prostate cancer genes (the 8q region)⁴⁵. Moreover, the ability of microsatellite markers to capture information on coding SNPs was successfully tested for the human major histocompatibility complex to predict HLA functional SNPs^48,49.

However, patterns of LD between SNPs and microsatellites markers may vary considerably between loci⁵⁰. To increase the accuracy of power studies in STR based disease mapping it is therefore important to provide empirical data for genomic regions of large interest and well characterized as the OCA2-HERC2 locus.

Here we present our results on the global variation of eleven microsatellites spanning 357 Kb in the OCA2-HERC2 region and their correlation with the functional SNP rs12913832 and the eye color trait as predicted by the IrisPlex assay. We also examine degrees of LD, related microsatellites’ haplotype structures and homozygosity levels.

Results

STR markers variability

A total of 11 microsatellites polymorphisms are the focus on this study. They span the region from intron 18 of OCA2 to intron 5 of HERC2. Seven STRs are dinucleotide repeats, three are tetranucleotide and one is a pentanucleotide repeat sequence. Markers’ distances and number of alleles are indicated in Fig. 1 and Table 1.

Table 1 STR marker list.

Full size table

Allele frequency distribution in diverse human populations

Figure 2 shows markers’ allele frequencies for the HGDP-CEPH populations grouped according to the seven major geographic regions Africa, Europe, Middle East, Central-South Asia, East Asia, Oceania and Native Americans. Two markers are not reported here because of the low genetic diversity, these are STR 3 and STR 4. They show one major allele at frequency between 94 and 100% outside Africa.

Results of Fig. 2 are consistent with other studies of genetic variability of multiallelic markers: (1) Africa appears as the most variable region with the largest number of alleles also showing similar frequency values. (2) Patterns of allele frequencies are similar among Eurasians groups: Europe, Middle-East and Central-South Asia, and different from those observed for East Asians, Oceanians and Native-Americans. Conversely, peculiar of this study, is the presence of one allele highly frequent (about 50–60%) in Eurasians and low frequent (about 10%) outside this region across several markers (red arrows). This is true for the allele − 6 of STR 2, − 2 of STR A, − 4 of STR B. Alleles − 8 of STR 5 and − 8 of STR 9, are not highly frequent but almost private to Eurasians. The allele R of STR 6 and R of STR 8 show also high frequency in Native-Americans besides Eurasians.

Allele frequency distribution in Europeans with predicted blue and brown eye colors

The HGDP-CEPH European group includes 158 individuals. Prediction of eye color is possible using the IrisPlex marker set available for all DNA samples¹¹. Based on these data we selected individuals of Europe with a probability of blue or brown eye color higher or equal to 0.8, this arbitrary cut-off was used to reduce the error rate associated to such indirect determination of the phenotype, together with eliminating all individuals with predicted “intermediate eye color”. This analysis determined two groups, 42 predicted blue eyes and 62 predicted brown eyes individuals. In the HGDP-CEPH collection, outside Europe, only two individuals of Central-South Asia and three of Middle-East had a value of blue eye color prediction higher or equal to 0.8. To avoid a population structure bias we limited our analysis to Europeans.

When STRs’ allele frequencies are compared between predicted brown-eyed versus blue-eyed Europeans (Fig. 3) several alleles show increased frequency values ranging from 12 to 36% increase in predicted blue eye color individuals (red arrows). Most of the time these alleles are the same that also showed marked differences across major population groups. The analysis of a larger group of Europeans (n = 876) using also data available from gnomAD (see methods) shows a similar pattern of allele frequency distribution (data not shown). STR 2 and STR B could not be included because of more than 50% of missing data in the database. To further explore the relationship of the functional SNP rs12913832 and surrounding STRs, we compared single STR allele frequencies between haplotypes containing the alleles C, determinant for blue eye color (blue bar, n = 1105 European haplotypes), to those containing the alternative variant T (brown bar, n = 641 European haplotypes) (Fig. 4). The observed differences are similar to those of Fig. 3 yet, values are higher going from 14 to 48% especially around rs12913832. Note that the allele -8 of STR 9 is undistinguishable from the allele R according to the sequencing data of genomAD (see “Methods”).

The statistical significance of the differences in allele frequency distribution of the data reported in Fig. 3 and 4 was assessed by Chi-square tests and Monte Carlo simulations (Table 2). We report both the results from our genotyped HGDP-CEPH European samples on the right, and the results from the larger sample set from the gnomAD database parsed based on the functional SNP rs12913832 on the left. In both sample sets STR 5, 6, 8, 9 and 10 showed marked differences with P values ranging from 10⁻³ to < 10⁻⁶ including the haplotype of STR 6–8–9–10 P value 10⁻⁶ (data not shown). Smaller P values ranging from 10⁻⁵ to < 10⁻⁶ are obtained for the same STRs in addition to STR A for the larger dataset when rs12913832 C and T containing haplotypes are compared.

Table 2 Significance of allele frequency differences between predicted blue and brown eye color HGDP-CEPH Europeans and between European haplotypes containing rs12913832 C and T.

Full size table

Haplotypes and linkage disequilibrium values

Significant linkage disequilibrium r² values for Europeans are shown in Fig. 5 by color-coded graphics. Two graphics (a, b) are shown to report alternative multiallelic haplotypes with associated alleles. The C allele of rs12913832, is associated to neighboring alleles and forms a core haplotype including STR 6–8–9–10 (RRRR) (Fig. 5a). This haplotype is highly frequent in predicted blue eyed Europeans 76% and less frequent in predicted brown eyed Europeans 30% and very low 6% in East Asians. The composing alleles all appeared more common in Europe and in predicted blue eye color individuals. This haplotype is 237 Kb long and encompasses two LD blocs previously described. The frequency of the larger haplotype (313.6 kb) from STR A to STR 10 (-2G2RRRR) is still highly frequent in predicted blue eyed Europeans (33%) while its frequency is only 11% in predicted brown eyed individuals.

The alternative allele T of rs12913832, is part of two different haplotypes (-4-1-1-1) and (-4-1-6-4) (Fig. 5b) that are 7.4% and 3.8% frequent in predicted brown eye color Europeans and absent in predicted blue eyed individuals. The haplotype (-4-1-1-1) is 28% frequent in East Asians and the shorter haplotype -4-1-T from STR 6 to rs12913832 in the same population is 87% frequent. Another haplotype around rs1800407 is highly frequent (67%) in East Asia (data from HGDP-CEPH collection only), this includes alleles (RR-2) of STRs 2, A, B (LD plot not shown). STR2 is 2′226 bp distant from rs1800414, the SNP most associated to human pigmentation in Asia. In Europe, its frequency goes down to 5% with no differences between predicted blue and brown eye colors. In East Asia, high frequency haplotypes do not show allelic associations based on r² values (Supplementary Fig. 1).

Homozygosity levels and allelic associations

Homozygosity values of STR markers in predicted blue eye Europeans are higher than in predicted brown eye individuals and increase for markers located around rs12913832, with values reaching 75% to 91% (Table 3). As before, eye color phenotype is predicted using the IrisPlex marker genotypes available in gnomAD for the large European dataset of 876 individuals. For the allele showing increased frequency in predicted blue eye individuals, odds ratios reach values of 4.2, 7.4 and 10.4 for the four STRs around rs12913832 and 7.1 for the core haplotype RRRR.

Table 3 Homozygosity and allelic associations in predicted blue and brown eye color Europeans.

Full size table

Haplotype association

Next we explored whether homozygous individuals for the core haplotype RRRR of STR 6–8–9–10 are all predicted blue eyed color similarly to homozygous individuals for rs12913832 C. Table 4 shows that indeed most of the homozygous RRRR/RRRR are predicted blue eyed (n = 167) and a very minor group is instead predicted brown eyed (n = 18). Also, most of the individuals homozygous for any other haplotype ----/---- are predicted brown eyed (n = 136) and few (n = 17) blue eyed. For these two genotypes the haplotype RRRR is a good proxy for the genotype of rs12913832 yet heterozygous individuals with one copy of RRRR are not all predicted brown eyed as rs12913832 would indicate, instead almost equally distributed between the two phenotypes (n = 109 and n = 138).

Table 4 The core haplotype RRRR of STR 6–8–9–10 in predicted blue and brown eye Europeans compared to rs12913832 genotypes in the same individuals.

Full size table

Discussion

This study focuses on the most important genomic region for eye color variation in humans, which is OCA2-HERC2. Numerous studies based on biallelic (SNP) polymorphisms, previously elucidated functional variants and patterns of allelic associations including signatures of natural selection in Europe. Here we further investigate this region with a set of microsatellites annotated from whole genome sequencing data. With respect to previous studies based on SNP data, this type of polymorphism (STR) should provide non-redundant genetic information due to the different mutational mechanisms and its effect on patterns of allelic associations and haplotype structure. Here we show for the first time the correlation between multiallelic markers and functional variants of the eye color phenotype.

Eleven microsatellites spanning 357 Kb across the OCA2-HERC2 locus were genotyped in 1,064 individuals from 52 populations around the world (HGDP–CEPH panel). Data available from whole genome sequencing projects (gnomAD) were retrieved for the same STR when possible, this allowed us to work with larger sample collections from Europe (n = 876) East Asia (n = 801) and Africa (n = 896). Haplotypes were reconstructed integrating the two SNPs of the IrisPlex assay that are located on this chromosome, rs1800407 and rs12913832 and the eye color phenotype was predicted using the 6 SNP panel of IrisPlex.

The results showed that nine STRs were polymorphic across major population groups. Allele frequency distributions showed as expected, Africa as the most variable region. Europe, Middle-East and Central-South Asia form an undistinguishable homogeneous Eurasian group different from East Asians, Oceanians and Amerindians.

Interestingly, large differences in allele frequency distributions (up to 36%) were found between predicted blue and brown eyed individuals of Europe with often one allele showing increased frequency in predicted blue eye color. It should be noted here that the actual phenotype data of the individuals was not available and the IrisPlex method was reported to have lower prediction accuracy for intermediate eye colors (non-blue and non-brown) and heavily relies on rs12913832⁵¹. Although, only samples showing a prediction value for blue or brown higher or equal to 0.8 were included, the phenotyping procedure may have an error rate estimated to be 3–4%^11,28. Because of this limitation, the subsequent analyses of allele imbalance and LD using the larger population data from gnomAD rather focused on the correlation between STRs and the functional variant rs12913832.

Differences in frequency values go up to 48% when considering two groups of haplotypes containing the variants C or T of rs12913832 in 876 Europeans, with markers STR 5, 6, 8, 9, 10 showing statistical significance. LD analysis indicated some degree of allelic associations among SNPs and STRs, these are more abundant in Europe, where the associated alleles are those that showed skewed allele frequencies between predicted blue and brown eye individuals. Previous studies also showed large difference in SNP haplotypes frequencies between blue and brown eye color individuals going from 36⁵² to 54%⁹ including four SNPs in a region from STR 4 (17 Kb upstream) to STR 9 (22.6 Kb downstream) and to rs12913832, respectively.

Two haplotype blocks were identified from STR A to rs1800407 and from STR 6 to STR 10. Linkage disequilibrium r² values are significant up to 295 Kb from rs1800407 to STR 10 and values range from 0.1 to 0.5. Note that r² values higher than 0.1 should allow identifying a disease susceptibility variation in an association study^35,53. This block includes the functional variant rs12913832, three different haplotypes are associated to the allele C and T of rs12913832, one is most frequent in predicted blue eye individuals 76% (30% in blue) and two others exclusive to predicted brown eye individuals with 7.3% and 4% frequency. Finally, homozygosity values reach levels of 75% to 91% around rs12913832 in predicted blue eye individuals, which are consistent with the recessive mode of inheritance of the blue eye color. In addition, elevated odds ratios values up to 8 and 10.4 were obtained around rs12913832 and for the core haplotype RRRR of STR 6–8–9–10.

Previous studies using SNPs genotyped in a large population of Netherland indicated three LD blocks going from the position corresponding to STR 2 to rs1800407 for 27 kb (LD1), around STR 6 for 13 Kb (LD2) and around STR 9 for 70.5 kb (LD3)⁵². These blocks are also detected by this STR based study with some allelic associations going beyond originally defined blocks LD2 and LD3 up to 237 Kb. Eiberg and colleagues¹⁷ described a 175 kb long haplotype of 13 SNPs, 97% frequent in Danish individuals with blue eye color. The corresponding position of this block with respect to our data is 27 Kb upstream rs12913832 to 22.5 Kb downstream STR 9.

This study designed around known causal variants confirms that allelic associations with functional SNPs can be detected over greater distance with STRs than with SNPs^33,34. This is because the higher mutation rates of STRs underlie a stronger statistical significance of LD. Therefore the importance of using STR also for the first screen of association study to reduce the number of initial association tests or when LD blocks are not known. Although many studies support these predictions theoretically, few have attempted to provide empirical data on the extent of LD detectable at STR and SNP sites at defined distance flanking unknown functional variants of significant effect. In the context of alcohol dependence syndrome studies, the persistence of allele association and differences in allele frequencies of a range of STR markers was investigated around the functional locus aldehyde dehydrogenase ALDH2 known to be under selective constraint in Japanese alcoholic populations³³. This study showed the persistence of LD over distances up to 400 Kb for pairs of loci including at least one STR. It follows that the comparison of allele frequency differences for the STR markers in the case (alcoholics) and control populations would have detected the ALDH2 marker as a putative susceptibility locus. It should be noted that the recent origin of the ALDH2 functional markers is very likely to be a major factor determining the strength of the association observed. With regard to the observation of the allele frequency pattern difference between the cases and controls, population demography, allele frequencies and degree of natural selection are additional important factors in determining the success of gene hunting using this type of approach. Similarly, the genetic influence of IL10 SNP allele on HIV-1 infection and AIDS progression was first deduced by observations of epidemiology associations of two STR loci within 4 Kb of the IL10 gene⁵⁴. In the same way, the data presented here show that considering the two extremes of brown and blue predicted eye color (most robust prediction by IrisPlex) and OCA2-HERC2 as candidate genes, simple parameters of skewed allele frequencies of STRs and levels of homozygosity would allow us to point out the most relevant genetic region carrying the functional variant rs12913832, even with a relatively small sample size and a degree (3–4%) of mislabeled phenotype due to the indirect ascertainment.

In addition, these data confirm the different genetic history of eye color in Asians that showed no allelic associations around rs12913832 and one frequent haplotype that is rare in Europe. The patterns of variability we observed also support previous data indicating strong natural selection in this genomic region in Europe.

Conclusions

This study further describe novel SNP-STR correlations spanning a region of large interest for human genetics, population genetics and forensic science. These results show that in this region of OCA2-HERC2, markers mutating more rapidly than SNPs also capture the effect of demographic history and selective pressure by showing marked genetic differences between predicted blue and brown eyed individuals within the same population. These data support the hypothesis that with STR markers, besides SNPs, it is possible to investigate the genetics of eye color and effectively use this type of variation as marker for mapping causing variants of similar traits, appeared recently, with large penetrance and under positive selection.

Models for detecting variants responsible of genetic traits have been poorly constrained by available data leading to large uncertainties in model predictions. Studies like our go toward the effort of providing a portion of such empirical results. We hope that these data can help validating novel statistical methods aiming at using multiallelic markers for detecting functional variants.

Finally, the idea of possibility replacing functional SNP genotyping with linked STRs for roughly predicting the eye color phenotype in Europe, is poorly supported by the data. A small set of STRs could be easily added to current forensic DNA profiling multiplex, yet none of the STRs analyzed is strongly associated to rs12913832 to work as proxy. This is not surprising since the highest OR observed with these STR data is about 10 while a single SNP in HERC2 previously showed OR of about 30⁵⁵, due to the difference in ORs and therefore effects sizes of markers, the prediction using STR markers would be lower than using functional SNPs. Conversely, STRs alleles around rs12913832, are not as randomly distributed as one could expect from highly variable loci. In particular, for individuals found to be homozygous RRRR/RRRR around rs12913832 C/C the blue eye color would be predicted in 90% of the cases. Similarly, individuals that lack the haplotype RRRR around rs12913832 T/T are predicted brown eyed color in 89% of the times. Unfortunately, no indications of the phenotype can be obtained from all the other combinations of heterozygous haplotypes. Finally, there is great potential in investigating STRs in the HERC2-OCA2 region to explain the eye color that is wrongly predicted by rs11913832. These data may provide the basis of such future studies that should rely on precise and direct eye color phenotype determination.

Methods

All methods were performed in accordance with the relevant guidelines and regulations of the journal.

Population samples for genotyping

The CEPH Human Genome Diversity panel (HGDP-CEPH) contains 1,064 individuals from African, European, North African/Middle Eastern, Central-South Asian, East Asian, Native American and Oceanian populations⁵⁶. For all data analyses purposes we considered only 952 individuals (H952 subset) after exclusion of duplicates, first- and second-degree relatives⁵⁷. Populations were combined into continental-based groups which have been previously established⁵⁸ with the following composite populations, sample sizes and labels: 6 African (n = 105 AFR), 8 European (n = 158 EUR), 4 North African/Middle Eastern (n = 162 ME), 9 Central-South Asian (n = 202 CSA), 17 East Asian (n = 230 EAS), 2 Oceanian (n = 28 OCE) and 5 Native American (n = 64 NAM). A second population sample included 84 unrelated European individuals with Caucasian appearance and Swiss parents since three generations. DNA was extracted using the QIAamp DNA Mini kit (Qiagen AG Switzerland) according to the manufacturer’s guidelines and quantified using the Quantifiler Human DNA Quantification Kit (Life Technologies).

For all subjects, blood cell samples were obtained according to protocols and informed-consent procedures approved by the institutional review board Commission cantonale d'éthique de la recherche sur l'être humain (CER-VD), and were labelled with an anonymous code number linked only to demographic information and sex.

Available genotypes from genome sequencing projects

The genotypes of additional individuals were obtained from the Genome Aggregation Database (gnomAD v3.1.1 variants, https://gnomad.broadinstitute.org/downloads) ⁵⁹. The gnomAD v3.1.1 track shows variants from 143,150 unrelated individuals sequenced as part of various population-genetic and disease-specific studies, some markers have only frequency values while others show single individual genotypes. All variants are mapped to the GRCh38/hg38 reference sequence. From this database, markers STR 2 and STR B showed low quality genotypes with more than 50% of missing data. The population samples with individual genotypes from gnomAD used in this study include 634 Europeans (Finnish ‘fin’ and Non-Finnish European ‘nfe’), 791 Africans and 571 East-Asians. In summary, the largest sample collections considered in this study include 876 individuals from Europe, 896 form Africa and 801 from East-Asia.

Marker selection and typing

STR markers were identified by searching the OCA2-HERC2 genomic region for simple tandem repeats located by Tandem Repeats Finder⁶⁰. DNA samples were genotyped for the 11 STRs selected to overlap the region from SNP rs2703969 to rs1667394, where most of linkage and association studies for eye color reported positive results. PCR reactions were performed in 20 μl final volume. This contained 1 × PCR Buffer containing 1.5 mM MgCl₂ (Thermo Fisher), 125 μM dNTP (Thermo Fisher), 1.2 U AmpliTaq Gold DNA Polymerase (Thermo Fisher) and 0.5 ng DNA. Primers’ sequences, quantities and multiplexes are indicated in Supplementary Table S1. PCR thermal cycling conditions were: 5 min at 95 °C, 1 min at 94 °C, 1 min 55 °C, 1 min at 72 °C for 30 PCR cycles and a final extension of 30 min at 72 °C.

PCR fragments were separated by capillary electrophoresis after adding 1 μl PCR amplicon to 8.5 μl deionized formamide HI-DI (Thermo Fisher) and to 0.5 μl 600 LIZ size standard (Thermo Fisher). Capillary electrophoresis was performed using an ABI PRISM 3130xl Genetic Analyzer (Thermo Fisher) according to the manufacturer's instruction and analyzed using the GeneMapper® ID v3.2.1 software (Thermo Fisher), with a minimum peak height threshold of 50 RFU. The commercial DNA CEPH 1347-02 (Thermo Fisher) was added to two empty positions in each PCR plate as positive control of amplification and internal standard for allele calls, at least one empty well per plate was used a negative control of amplification.

Markers were grouped in three multiplexes as indicated in the table. For allele names, ‘R’ indicates the number of repeats of the reference genome and additional repeats are indicated by ‘+ n’ or’-n’ consistent with gnomAD. Considering all populations, 723 HGDP-CEPH samples genotypes are also reported by gnomAD. These data were used for confirmation and harmonization of allele calls. All the genotypes produced here corresponded to the data available in gnomAD except for STR 9 where we could distinguish the alleles -8 and R, while these two are the same allele R in gnomAD. The allele − 8 was observed only in the HGDP-CEPH Eurasian populations. This is probably due to a neighboring deletion linked to the allele R coamplified by the PCR primers used in this study. It is also possible that, because of the complexity of the tandem repeat sequence, that includes both variable CA repeats and variable TA repeats, two different neighboring sites instead of one are indicated in gnomAD.

The eye color prediction for the 84 Swiss Europeans samples was done by IrisPlex typing according to the protocol described in Walsh et al. 2011¹¹ and by using the model from the online tool (https://hirisplex.erasmusmc.nl/). The IrisPlex consists of 6 SNPs, rs12913832 (HERC2), rs1800407 (OCA2), rs12896399 (SLC24A4), rs16891982 (SLC45A2/MATP), rs1393350 (TYR) and rs12203592 (IRF4).

As previously published, the protocol consists of a single multiplex two step PCR using 1 µl genomic DNA extract (concentration of 0.5 ng/µl) and primers in a 12 µl reaction which includes 1 × PCR buffer, 2.7 mM MgCl₂, 200 µM of each dNTP and uses adjusted thermocycling conditions for increased specificity: (1) 95 °C for 10 min, (2) 33 cycles of 95 °C for 30 s and 61 °C for 30 s, (3) 5 min at 61 °C. Each primers concentration used was 0.208 µM except for rs1800407 used at 0.104 µM. This was followed by product purification (2.5 µl of PCR product) using an Exo I / SAP treatment (NEB) in CutSmart 10 × Buffer for an incubation time of 90 min at 37 °C and inactivation step of 15 min at 80 °C. Further multiplex single base extension (SBE) reaction using the ABI Prism1 SNaPshot kit (Applied Biosystems) was performed, 1 µl of purified product, SNaPShot reaction mix and Sequencing Primer mix were used in a final volume of 5 µl. Thermocycling conditions used were: (1) 96 °C for 2 min, (2) 25 cycles of 96 °C for 10 s and 50 °C for 5 s, (3) 30 s at 60 °C. The SNaPShot PCR product was then treated with SAP (NEB) for 90 min at 37 °C and inactivation for 15 min at 80 °C. One microliter of cleaned products was analyzed on the ABI 3130xl Genetic Analyser (Applied Biosystems) with POP-4 on a 36 cm capillary length array. Run parameters were optimized to increase sensitivity, with an injection voltage of 6 kV for 11 s, and run time of 1000 s at 60 °C.

Data analysis

Allele frequencies were estimated by gene counting. The statistical significance of the difference in allele frequency between predicted blue and brown eye color individuals was assessed with the CLUMP program⁶¹, which implements a Monte Carlo approach by performing repeated simulations.

The EM algorithm was used to estimate maximum likelihood haplotype frequencies by using Arlequin version 3.5.1.2⁶² as well as pairwise linkage disequilibrium r² values. Odds ratios (OR) 95% confidence interval (CI) and P values of Table 2 were calculated using the MedCalc software.

Ethics approval

The current study was approved by the Centre Hospitalier Universitaire Vaudois and Université de Lausanne institutional review board. Genomic DNA samples fully-consenting individuals were collected by the Human Genome Diversity Project (HGDP), in a collaboration with the Centre Etude Polymorphism Humain (CEPH) in Paris. For all subjects, blood cell samples were obtained according to protocols and informed-consent procedures approved by institutional review boards, and were labelled with an anonymous code number linked only to demographic information and sex. Besides the HGDP-CEPH diversity panel human cell line samples, all other samples involved in the study are long lasting anonymized DNA extracts previously obtained with informed written consent from healthy individuals for research purposes.

Consent to participate

Each blood sample used was freely donated under conditions of informed consent to participate.

Consent for publication

Each blood sample used was freely donated under conditions of informed consent to publish.

Data availability

Markers information and genotypes will be available right after publication acceptance at the HGDP-CEPH database (http://www.cephb.fr/en/hgdp_panel.php#basedonnees).

References

Davenport, G. C. & Davenport, C. B. Heredity of eye-color in man. Science 26, 589–592. https://doi.org/10.1126/science.26.670.589-b (1907).
Article ADS CAS PubMed Google Scholar
Han, J. et al. A genome-wide association study identifies novel alleles associated with hair color and skin pigmentation. PLoS Genet. 4, e1000074. https://doi.org/10.1371/journal.pgen.1000074 (2008).
Article CAS PubMed PubMed Central Google Scholar
Sulem, P. et al. Two newly identified genetic determinants of pigmentation in Europeans. Nat. Genet. 40, 835–837. https://doi.org/10.1038/ng.160 (2008).
Article CAS PubMed Google Scholar
Liu, F. et al. Digital quantification of human eye color highlights genetic association of three new loci. PLoS Genet. 6, e1000934. https://doi.org/10.1371/journal.pgen.1000934 (2010).
Article CAS PubMed PubMed Central Google Scholar
Liu, F. et al. Eye color and the prediction of complex phenotypes from genotypes. Curr. Biol. 19, R192-193. https://doi.org/10.1016/j.cub.2009.01.027 (2009).
Article CAS PubMed Google Scholar
Andersen, J. D. et al. Importance of nonsynonymous OCA2 variants in human eye color prediction. Mol. Genet. Genomic Med. 4, 420–430. https://doi.org/10.1002/mgg3.213 (2016).
Article CAS PubMed PubMed Central Google Scholar
Eaton, K. et al. Association study confirms the role of two OCA2 polymorphisms in normal skin pigmentation variation in East Asian populations. Am. J. Hum. Biol. 27, 520–525. https://doi.org/10.1002/ajhb.22678 (2015).
Article PubMed Google Scholar
Mengel-From, J., Borsting, C., Sanchez, J. J., Eiberg, H. & Morling, N. Human eye colour and HERC2, OCA2 and MATP. Forensic Sci. Int. Genet. 4, 323–328. https://doi.org/10.1016/j.fsigen.2009.12.004 (2010).
Article CAS PubMed Google Scholar
Sturm, R. A. et al. A single SNP in an evolutionary conserved region within intron 86 of the HERC2 gene determines human blue-brown eye color. Am. J. Hum. Genet. 82, 424–431. https://doi.org/10.1016/j.ajhg.2007.11.005 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kayser, M. & Schneider, P. M. DNA-based prediction of human externally visible characteristics in forensics: Motivations, scientific challenges, and ethical considerations. Forensic Sci. Int. Genet. 3, 154–161. https://doi.org/10.1016/j.fsigen.2009.01.012 (2009).
Article CAS PubMed Google Scholar
Walsh, S. et al. IrisPlex: A sensitive DNA tool for accurate prediction of blue and brown eye colour in the absence of ancestry information. Forensic Sci. Int. Genet. 5, 170–180. https://doi.org/10.1016/j.fsigen.2010.02.004 (2011).
Article CAS PubMed Google Scholar
Draus-Barini, J. et al. Bona fide colour: DNA prediction of human eye and hair colour from ancient and contemporary skeletal remains. Investig. Genet. 4, 3. https://doi.org/10.1186/2041-2223-4-3 (2013).
Article CAS PubMed PubMed Central Google Scholar
Walsh, S. et al. The HIrisPlex system for simultaneous prediction of hair and eye colour from DNA. Forensic Sci. Int. Genet. 7, 98–115. https://doi.org/10.1016/j.fsigen.2012.07.005 (2013).
Article CAS PubMed Google Scholar
Freire-Aradas, A. et al. Exploring iris colour prediction and ancestry inference in admixed populations of South America. Forensic Sci. Int. Genet. 13, 3–9. https://doi.org/10.1016/j.fsigen.2014.06.007 (2014).
Article CAS PubMed Google Scholar
Yun, L., Gu, Y., Rajeevan, H. & Kidd, K. K. Application of six IrisPlex SNPs and comparison of two eye color prediction systems in diverse Eurasia populations. Int. J. Legal Med. 128, 447–453. https://doi.org/10.1007/s00414-013-0953-1 (2014).
Article PubMed Google Scholar
Visser, M., Kayser, M. & Palstra, R. J. HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter. Genome Res. 22, 446–455. https://doi.org/10.1101/gr.128652.111 (2012).
Article CAS PubMed PubMed Central Google Scholar
Eiberg, H. et al. Blue eye color in humans may be caused by a perfectly associated founder mutation in a regulatory element located within the HERC2 gene inhibiting OCA2 expression. Hum. Genet. 123, 177–187. https://doi.org/10.1007/s00439-007-0460-x (2008).
Article CAS PubMed Google Scholar
Sitaram, A. et al. Localization to mature melanosomes by virtue of cytoplasmic dileucine motifs is required for human OCA2 function. Mol. Biol. Cell 20, 1464–1477. https://doi.org/10.1091/mbc.E08-07-0710 (2009).
Article CAS PubMed PubMed Central Google Scholar
Donnelly, M. P. et al. A global view of the OCA2-HERC2 region and pigmentation. Hum. Genet. 131, 683–696. https://doi.org/10.1007/s00439-011-1110-x (2012).
Article CAS PubMed Google Scholar
Lao, O., de Gruijter, J. M., van Duijn, K., Navarro, A. & Kayser, M. Signatures of positive selection in genes associated with human skin pigmentation as revealed from analyses of single nucleotide polymorphisms. Ann. Hum. Genet. 71, 354–369. https://doi.org/10.1111/j.1469-1809.2006.00341.x (2007).
Article CAS PubMed Google Scholar
Voight, B. F., Kudaravalli, S., Wen, X. & Pritchard, J. K. A map of recent positive selection in the human genome. PLoS Biol. 4, e72. https://doi.org/10.1371/journal.pbio.0040072 (2006).
Article PubMed PubMed Central Google Scholar
Frost, P. Human skin-color sexual dimorphism: A test of the sexual selection hypothesis. Am. J. Phys. Anthropol. 133, 779–780; author reply 780–771. https://doi.org/10.1002/ajpa.20555 (2007).
Goel, N., Terman, M. & Terman, J. S. Depressive symptomatology differentiates subgroups of patients with seasonal affective disorder. Depress Anxiety 15, 34–41. https://doi.org/10.1002/da.1083 (2002).
Article PubMed Google Scholar
Terman, J. S. & Terman, M. Photopic and scotopic light detection in patients with seasonal affective disorder and control subjects. Biol. Psychiatry 46, 1642–1648. https://doi.org/10.1016/s0006-3223(99)00221-8 (1999).
Article CAS PubMed Google Scholar
Workman, L., Akcay, N., Reeves, M. & Taylor, S. Blue eyes keep away the winter blues: Is blue eye pigmentation an evolved feature to provide resilience to seasonal affective disorder. OA J. Behav. Sci. Psych. 1, 180002 (2018).
Google Scholar
Armstrong, B. K. & Kricker, A. The epidemiology of UV induced skin cancer. J. Photochem. Photobiol. B 63, 8–18. https://doi.org/10.1016/s1011-1344(01)00198-1 (2001).
Article CAS PubMed Google Scholar
Norton, H. L. et al. Genetic evidence for the convergent evolution of light skin in Europeans and East Asians. Mol. Biol. Evol. 24, 710–722. https://doi.org/10.1093/molbev/msl203 (2007).
Article CAS PubMed Google Scholar
Meyer, O. S. et al. Prediction of eye colour in Scandinavians using the eye colour 11 (EC11) SNP set. Genes https://doi.org/10.3390/genes12060821 (2021).
Article PubMed PubMed Central Google Scholar
Payseur, B. A., Place, M. & Weber, J. L. Linkage disequilibrium between STRPs and SNPs across the human genome. Am. J. Hum. Genet. 82, 1039–1050. https://doi.org/10.1016/j.ajhg.2008.02.018 (2008).
Article CAS PubMed PubMed Central Google Scholar
Payseur, B. A. & Jing, P. A genomewide comparison of population structure at STRPs and nearby SNPs in humans. Mol. Biol. Evol. 26, 1369–1377. https://doi.org/10.1093/molbev/msp052 (2009).
Article CAS PubMed PubMed Central Google Scholar
Ardlie, K. G., Kruglyak, L. & Seielstad, M. Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 3, 299–309. https://doi.org/10.1038/nrg777 (2002).
Article CAS PubMed Google Scholar
Abecasis, G. R. et al. Extent and distribution of linkage disequilibrium in three genomic regions. Am. J. Hum. Genet. 68, 191–197. https://doi.org/10.1086/316944 (2001).
Article CAS PubMed Google Scholar
Koch, H. G. et al. Allele association studies with SSR and SNP markers at known physical distances within a 1 Mb region embracing the ALDH2 locus in the Japanese, demonstrates linkage disequilibrium extending up to 400 kb. Hum. Mol. Genet. 9, 2993–2999. https://doi.org/10.1093/hmg/9.20.2993 (2000).
Article CAS PubMed Google Scholar
Horowitz, A., Shifman, S., Rivlin, N., Pisante, A. & Darvasi, A. Further tests of the association between schizophrenia and single nucleotide polymorphism markers at the catechol-O-methyltransferase locus in an Askenazi Jewish population using microsatellite markers. Psychiatr. Genet. 15, 163–169. https://doi.org/10.1097/00041444-200509000-00005 (2005).
Article PubMed Google Scholar
Pritchard, J. K. & Przeworski, M. Linkage disequilibrium in humans: Models and data. Am. J. Hum. Genet. 69, 1–14. https://doi.org/10.1086/321275 (2001).
Article CAS PubMed PubMed Central Google Scholar
Jorde, L. B. et al. The distribution of human genetic diversity: A comparison of mitochondrial, autosomal, and Y-chromosome data. Am. J. Hum. Genet. 66, 979–988. https://doi.org/10.1086/302825 (2000).
Article CAS PubMed PubMed Central Google Scholar
Sawyer, S. L. et al. Linkage disequilibrium patterns vary substantially among populations. Eur. J. Hum. Genet. 13, 677–686. https://doi.org/10.1038/sj.ejhg.5201368 (2005).
Article CAS PubMed Google Scholar
Zondervan, K. T. & Cardon, L. R. The complex interplay among factors that influence allelic association. Nat. Rev. Genet. 5, 89–100. https://doi.org/10.1038/nrg1270 (2004).
Article CAS PubMed Google Scholar
Ohashi, J. & Tokunaga, K. Power of genome-wide linkage disequilibrium testing by using microsatellite markers. J. Hum. Genet. 48, 487–491. https://doi.org/10.1007/s10038-003-0058-7 (2003).
Article CAS PubMed Google Scholar
Varilo, T. et al. The interval of linkage disequilibrium (LD) detected with microsatellite and SNP markers in chromosomes of Finnish populations with different histories. Hum. Mol. Genet. 12, 51–59. https://doi.org/10.1093/hmg/ddg005 (2003).
Article CAS PubMed Google Scholar
Kawashima, M. et al. Genomewide association analysis of human narcolepsy and a new resistance gene. Am. J. Hum. Genet. 79, 252–263. https://doi.org/10.1086/505539 (2006).
Article CAS PubMed PubMed Central Google Scholar
Nakabayashi, K. et al. Identification of novel candidate loci for anorexia nervosa at 1q41 and 11q22 in Japanese by a genome-wide association analysis with microsatellite markers. J. Hum. Genet. 54, 531–537. https://doi.org/10.1038/jhg.2009.74 (2009).
Article CAS PubMed Google Scholar
Tamiya, G. et al. Whole genome association study of rheumatoid arthritis using 27 039 microsatellites. Hum. Mol. Genet. 14, 2305–2321. https://doi.org/10.1093/hmg/ddi234 (2005).
Article CAS PubMed Google Scholar
Yatsu, K. et al. Genome-wide association mapping for essential hypertension with high-density microsatellite markers. J. Hypertens. 24, 55–56 (2006).
Google Scholar
Amundadottir, L. T. et al. A common variant associated with prostate cancer in European and African populations. Nat. Genet. 38, 652–658. https://doi.org/10.1038/ng1808 (2006).
Article CAS PubMed Google Scholar
Reynisdottir, I. et al. Localization of a susceptibility gene for type 2 diabetes to chromosome 5q34-q35.2. Am. J. Hum. Genet. 73, 323–335. https://doi.org/10.1086/377139 (2003).
Article CAS PubMed PubMed Central Google Scholar
Saito, F., Kajii, T. S., Oka, A., Ikuno, K. & Iida, J. Genome-wide association study for mandibular prognathism using microsatellite and pooled DNA method. Am. J. Orthod. Dentofac. Orthop. 152, 382–388. https://doi.org/10.1016/j.ajodo.2017.01.021 (2017).
Article Google Scholar
Malkki, M., Single, R., Carrington, M., Thomson, G. & Petersdorf, E. MHC microsatellite diversity and linkage disequilibrium among common HLA-A, HLA-B, DRB1 haplotypes: Implications for unrelated donor hematopoietic transplantation and disease association studies. Tissue Antigens 66, 114–124. https://doi.org/10.1111/j.1399-0039.2005.00453.x (2005).
Article CAS PubMed Google Scholar
Foissac, A. et al. Microsatellites in the HLA region: HLA prediction and strategies for bone marrow donor registries. Transpl. Proc. 33, 491–492. https://doi.org/10.1016/s0041-1345(00)02107-2 (2001).
Article CAS Google Scholar
Burgner, D. et al. Haplotypic relationship between SNP and microsatellite markers at the NOS2A locus in two populations. Genes Immun. 4, 506–514. https://doi.org/10.1038/sj.gene.6364022 (2003).
Article CAS PubMed Google Scholar
Pietroni, C. et al. The effect of gender on eye colour variation in European populations and an evaluation of the IrisPlex prediction model. Forensic Sci. Int. Genet. 11, 1–6. https://doi.org/10.1016/j.fsigen.2014.02.002 (2014).
Article CAS PubMed Google Scholar
Kayser, M. et al. Three genome-wide association studies and a linkage analysis identify HERC2 as a human iris color gene. Am. J. Hum. Genet. 82, 411–423. https://doi.org/10.1016/j.ajhg.2007.10.003 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kruglyak, L. Prospects for whole-genome linkage disequilibrium mapping of common disease genes. Nat. Genet. 22, 139–144. https://doi.org/10.1038/9642 (1999).
Article CAS PubMed Google Scholar
Shin, H. D. et al. Genetic restriction of HIV-1 pathogenesis to AIDS by promoter alleles of IL10. Proc. Natl. Acad. Sci. U. S. A. 97, 14467–14472. https://doi.org/10.1073/pnas.97.26.14467 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Sulem, P. et al. Genetic determinants of hair, eye and skin pigmentation in Europeans. Nat. Genet. 39, 1443–1452. https://doi.org/10.1038/ng.2007.13 (2007).
Article CAS PubMed Google Scholar
Cann, H. M. et al. A human genome diversity cell line panel. Science 296, 261–262 (2002).
Article CAS PubMed Google Scholar
Rosenberg, N. A. Standardized subsets of the HGDP-CEPH human genome diversity cell line panel, accounting for atypical and duplicated samples and pairs of close relatives. Ann. Hum. Genet. 70, 841–847. https://doi.org/10.1111/j.1469-1809.2006.00285.x (2006).
Article CAS PubMed Google Scholar
Rosenberg, N. A. et al. Genetic structure of human populations. Science 298, 2381–2385 (2002).
Article ADS CAS PubMed Google Scholar
Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443. https://doi.org/10.1038/s41586-020-2308-7 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: A program to analyze DNA sequences. Nucl. Acids Res. 27, 573–580. https://doi.org/10.1093/nar/27.2.573 (1999).
Article CAS PubMed PubMed Central Google Scholar
Sham, P. C. & Curtis, D. Monte Carlo tests for associations between disease and alleles at highly polymorphic loci. Ann. Hum. Genet. 59, 97–105. https://doi.org/10.1111/j.1469-1809.1995.tb01608.x (1995).
Article CAS PubMed Google Scholar
Excoffier, L., Laval, G. & Schneider, S. Arlequin (version 3.0): An integrated software package for population genetics data analysis. Evol. Bioinform. Online 1, 47–50 (2005).
Article CAS Google Scholar

Download references

Funding

This work was supported by funding from the University Center of Legal Medicine of the University Hospital of Lausanne and the Swiss National Foundation (Project N 310030_184705).

Author information

Authors and Affiliations

Unité de Génétique Forensique, Centre Universitaire Romand de Médecine Légale, Centre Hospitalier Universitaire Vaudois et Université de Lausanne, Lausanne, Switzerland
Philippe Suarez, Karine Baumer & Diana Hall

Authors

Philippe Suarez
View author publications
You can also search for this author in PubMed Google Scholar
Karine Baumer
View author publications
You can also search for this author in PubMed Google Scholar
Diana Hall
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.H. conceived and designed the experiments. P.S. and K.B. performed the experiments and organized the presentation of synthesized data. D.H. analyzed the data and wrote the paper. All authors reviewed the manuscript.

Corresponding author

Correspondence to Diana Hall.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure 1.

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Suarez, P., Baumer, K. & Hall, D. Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers. Sci Rep 11, 22530 (2021). https://doi.org/10.1038/s41598-021-01940-w

Download citation

Received: 15 June 2021
Accepted: 02 November 2021
Published: 18 November 2021
DOI: https://doi.org/10.1038/s41598-021-01940-w
Springer Nature Limited

Further insight into the global variability of the OCA2-HERC2 locus for human pigmentation from multiallelic markers

Abstract

Similar content being viewed by others

Optimizing the genetic prediction of the eye and hair color for North Eurasian populations

Skin pigmentation and genetic variants in an admixed Brazilian population of primarily European ancestry

Novel quantitative pigmentation phenotyping enhances genetic association, epistasis, and prediction of human eye colour

Introduction

Results

STR markers variability

Allele frequency distribution in diverse human populations

Allele frequency distribution in Europeans with predicted blue and brown eye colors

Haplotypes and linkage disequilibrium values

Homozygosity levels and allelic associations

Haplotype association

Discussion

Conclusions

Methods

Population samples for genotyping

Available genotypes from genome sequencing projects

Marker selection and typing

Data analysis

Ethics approval

Consent to participate

Consent for publication

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figure 1.

Supplementary Table S1.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation