Introduction

The gene F5 encodes for blood coagulation Factor V, and the 1691G>A F5 transition in exon 10 causes a substitution (R506Q) known as Factor V Leiden. This genetic variant is the most common hereditable risk factor for thromboembolic disease (Kujovich 2011). Clinical observations with the 1691A F5 allele include an increased risk of deep vein thrombosis (Koster et al. 1993; Ridker et al. 1998; Rosendorff and Dorfman 2007), and 1691A F5 is also associated with an increased risk for pregnancy loss and possibly other obstetric complications (Ridker et al. 1998; Foka et al. 2000), these effects being semi-dominant.

The 1691A F5 allele is thought to have originated with a single mutation event which took place 21,000–34,000 years ago (Zivelin et al. 1997). The 1691A F5 allele appears to be generally confined to Indo-Europeans: the 1691A allele is very rare or non-existent in Asia (0.6 %) and some regions of Africa (0.0 %) (Hira et al. 2002; Nasiruddin et al. 2005; They-They et al. 2010); but note that Settin et al. (2008) reported a frequency of 10.2 % in Egypt. In various parts of Europe the frequency of this variant has been described as uneven (Rees et al. 1995; Herrmann et al. 1997; Paseka et al. 2000; Adler et al. 2010), with high frequencies observed in the Czech Republic (5.1 %, n = 2819) (Prochazka et al. 2003) and Turkey (5.0 %, n = 2003) (see Fig.1 for references).

Fig. 1
figure 1

a. Average allele frequencies for 1691A F5 across Europe. b. Significant differences between countries. a. 1691A F5 frequencies: >3 %, <3 % (n.d. = no data). Slavic data (Adler et al. 2012): Poland n = 1588, Czech Republic n = 2819, Slovakia n = 152, Slovenia n = 526, Croatia n = 749, Serbia/Montenegro n = 499; Macedonia n = 130, Bulgaria n = 506, Russia n = 539, Ukraine n = 172, Belarus n = 80; Bosnia/Herzegovina n = 67 (Valjevac et al. 2013). Other data: Group 1: Albania (n = 70; Arsov et al. 2006), Austria (n = 572; Gisslinger et al. 1995, Lalouschek et al. 2005), Finland (n = 1285; Hiltunen et al. 2007), France (n = 1242; Leroyer et al. 1997; Baba-Ahmed et al. 2007; Pasquier et al. 2009), Iceland (n = 96; Rees et al. 1995), Netherlands (n = 2631; Rosendaal et al. 1995; Heijmans et al. 1998; van Dunne et al. 2006), Portugal (n = 303; Ferrer-Antunes et al. 1995; Serrano et al. 2011), Spain (n = 889; Garcia-Gala et al. 1997; Aznar et al. 2000; Frances et al. 2006), Switzerland (n = 100; Redondo et al. 1999). Group 2: Denmark (n = 20242; Larsen et al. 1998; Juul et al. 2002; Benefield et al. 2005), Germany (n = 2199; Aschka et al. 1996; Schambeck et al. 1997; Nabavi et al. 1998; Ehrenforth et al. 1999; Lichy et al. 2006; Toth et al. 2008), Greece (n = 932; Foka et al. 2000; Rees et al. 1995; Lambropoulos et al. 1997; Antoniadi et al. 1999; Ioannou et al. 2000; Aronis et al. 2002; Komitopoulou et al. 2006), Hungary (n = 1196; Tordai et al. 1997; Stankovics et al. 1998; Szolnoki et al. 2001; Shemirani et al. 2011), Italy (n = 226; De Stefano et al. 1999; Sottilotta et al. 2009), Norway (n = 807; Ulvik et al. 1998; Berge et al. 2007), Romania (n = 42; Fischer et al. 2006), Sweden (n = 1343; Svensson et al. 1997; Kjellberg et al. 2010), Turkey (n = 2003; Ozbek and Tangun 1996; Akar et al. 1997; Gurgey and Mesci 1997, Atasay et al. 2003; Agaoglu et al. 2005; Eroglu et al. 2006; Ulukus et al. 2006), U.K. (n = 491; Rees et al. 1995; Catto et al. 1999). b. Significant differences (p < 0.05) between 1691A F5 allele frequency values in European countries, as analyzed by Tukey tests. Capital letters indicate countries with high frequency values significantly different from at least one country; lower case letters indicate which countries had significantly different, and lower, allele frequencies than those with the corresponding capital letter

We provide a summary study of the frequency distribution of 1691A F5 across Europe by addition of values from 12 countries/regions which are predominantly populated by Slavic populations (note that Serbia and Montenegro separated in 2006 - data in this article precede this date).

Materials and methods

The databases Medline (U.S. National Library of Medicine, Bethesda, USA) and several websites providing local language articles (e.g., www.google.ba for Bosnia/Herzegovina, www.google.hr for Croatia; Google Inc., Mountain View, California, USA) were searched until March 2012. Keywords and/or mesh terms used were (together with country/region names): “1691A F5”, “Leiden”, “FV’, “F5”, “FVL”, “Factor 5”, “fv”. Data for Bosnia/Herzegovina were obtained from our previous study (Valjevac et al. 2013). Data were included from: groups of blood donors, those defined as “healthy” adults in case–control studies, and population groups. Summary data in Europe for NM_000130.4:c.1691G>A F5 is presented as a map.

In order to test the significance of the resulting distribution, a chi-squared test for independence (Ulukus et al. 2006) and post-hoc pair-wise proportion comparison tests were carried out between all countries by two methods: (1) as in Zar (1999): tests analogous to, and refered to here as, “Tukey tests” (critical values from R package DKT; Lau 2011); (2) z-tests with Bonferroni correction (Microsoft Excel 2007, Redmond, WA, USA; http://www.csun.edu/~vchsc006/469/z.html). The critical significance level was set at p = 0.05.

Results

A map showing average frequencies for 1691A F5 throughout Europe is given in Fig. 1a.

Comparison of the 1691A F5 allele frequency values in all populations, assessed by the chi-squared test, gave a significant value (p < 0.001) showing the presence of differences between countries/regions.

The thick line in Fig. 1 arbitrarily delimits a group of countries with low frequency (group 1) from those with high frequency (group 2), with an arbitrary cut-off at 2.8 %. Although this was an arbitrary cut-off, with post-hoc Tukey and z-tests the two groups of countries/regions were found to have only one intra-group significant difference (between Czech Republic, 5.1 %, n = 2819; and Denmark, 3.9 %, n = 20242), but many significant differences between the groups (Fig.1b).

Group 1 consisted of countries/region with 1691A F5 allele frequency <2.8 %; the average frequency in the group was 1.9 %. Countries in group 1 included: Russia, Finland, Ukraine, Poland, Slovenia, Serbia/Montenegro, Croatia, The Netherlands, France, Spain, Portugal (these countries have reasonable sample sizes, and each has at least one significant difference with another country’s value). If those countries with no significant differences at all (with smaller sample sizes) were included, then this group consisted of most (ten) Slavic populations together with another eight non-Slavic countries (Fig.1).

Group 2 consisted of countries with 1691A F5 allele frequency ≥2.8 %; the average frequency in the group was 4.0 %: Countries with at least one pair-wise significant difference by Tukey tests included: Sweden, Norway, Denmark, Germany, U.K., The Czech Republic, Austria, Hungary, Italy, Bulgaria, Greece and Turkey. Including those countries with no significant differences at all (with small sample sizes), this group consisted of 13 countries including the predominantly Slavic countries Czech Republic and Bulgaria (Fig. 1).

Tukey tests and z-tests gave similar numbers of pair-wise differences between countries (66 and 59, respectively; Tukey test results are shown in Fig.1b.; z-test results not shown). However, the Tukey tests defined statistically significant differences involving Slovenia (2.5 %, n = 526), Russia (2.4 %, n = 539) and Serbia and Montenegro (2.2 %, n = 499), which were not identified using z-tests, whereas z-tests defined differences involving Romania (8.3 %, n = 42 - note the very small sample size). Additionally the other Tukey test differences were more evenly distributed whereas with z-tests most differences were concentrated on Spain (2.0 %, n = 889), and Finland (1.1 %, n = 1285).

Discussion

In Europe the average 1691A F5 allele frequency was found to be 3.5 % (n = 44,627, references in Fig.1).

In order to highlight the distribution of high frequency populations, European countries/regions were arbitrarily divided into two groups: with low (group 1) or high frequency (group 2), with an arbitrary boundary at 2.8 % (delineated by thick lines in Fig. 1). The group of high frequency countries (i.e., group 2) was found to have a North-Southeast band distribution. A pocket of low frequency with some Southern Slavic countries (Slovenia, Croatia, Bosnia and Herzegovina, and Serbia and Montenegro) and Albania (although note low sample size) was found to be enclosed by this band of high frequency. We can therefore surmise that the band of high frequency was created by founder effect seeding, and the pocket of low frequency might have been created with the purported movement of Eastern Slavic populations with low frequency to this region in the fifth to seventh centuries. However, conclusions referring to the migration of Slavic peoples have to take into account the fact that the Czech Republic and Bulgaria have high frequencies of this allele. Especially note that the Czech Republic has the highest frequency value in Europe (5.1 %; with a reasonable sample size of 2819) but note that this value is not significantly different to values in the neighboring countries Germany and Austria. Additionally, therefore, it could be hypothesized that these result from high influx of peoples carrying the variant from neighboring populations.

It is not known to what extent positive or negative selection acts on this allele - it is thought that the negative impact of 1691A F5 before reproductive age exists but reports in children are scanty (El-Karaksy and El-Raziky 2011). From the North-Southeast band of high frequency of the 1691A F5 allele there seem to be no indications of possible environmental effects which could vary according to location (and act as selection) on this allele.

Although such a map in Fig. 1 is informative, the statistical significance of the distribution found depends directly on the sample sizes of the data (see legend to Fig.1). Additionally, there are several ways to do post-hoc pair-wise comparison tests between countries, which take into account sample sizes. By using Tukey tests or z-tests it was found that there was only one intra-group significant difference (within group 2 between Czech Republic and Denmark; Fig.1b), but many significant differences between groups. (Significance maps at the p = 0.01 level, for either test, were very similar - not shown.)

This means that inferences from individual country values should be made with caution, and the high frequency group of countries could be treated more or less as one block (and likewise for the low frequency group). The map in Fig. 1 represents the present state of knowledge concerning the distribution of the 1691G>A F5 allele, and theoretically increasing the sample sizes would increase the number of pair-wise statistical differences shown in Fig.1b. It should be noted, however, that several sample sizes are already reasonably large e.g., The Netherlands (2.2 %, n = 2631), Germany (3.8 %, n = 2199) but show no intra-group significant differences by either of the methods used.

Despite the overall similarities in the conclusions drawn from the Tukey and z-tests, it should be noted that these analyses suggest (and confirm) the appropriateness of Tukey tests, rather than z-tests with Bonferroni correction, for this type of study. The reason for this is that any inference from differences involving Romania with only 42 subjects is rather dubious (as is the rather high allele sample frequency of 8.3 %, which might not be a good representation of the frequency in the population). The z-tests, however, gave six differences, between Romania and: Finland, Poland, Ukraine, Croatia, Spain, and Portugal (data not shown), whereas the Tukey tests gave no significant differences involving Romania (Fig.1b).

The clinical effects of the 1691A F5 variant are fairly well established, with a three to eight-fold increased thrombotic risk for heterozygotes, and up to 80-fold increased risk for homozygotes, found in a pooled analysis of eight case–control studies with over 2300 cases from Western European countries (Emmerich et al. 2001). In a Croatian case–control study (with 160 cases) a high frequency of thromboembolism was also found to be associated with the 1691A F5 allele: 21 % of patients with venous thromboembolism carried the 1691A F5 allele, but only 4 % of healthy controls (Coen et al. 2001). Further studies could be done to confirm whether or not the presence of the 1691A F5 allele does indeed provide a similar risk for venous thromboembolism across Europe.

Conclusions

In summary, a North-Southeast band distribution of 1691A F5 high frequency in Europe was found by the addition of data collated and presented from Central, Eastern and South-Eastern Europe. A pocket of low frequency in some Southern Slavic countries was found to be enclosed by this band of high frequency. The reasons for this distribution are not known, but perhaps reflect founder effects following migration after the last ice age, followed by the migration of Slavic peoples from the fifth century. There seem to be no indications of variation in environmental selection due to geographical location.