Introduction

DNA barcoding, the molecular characterization of species using a standardized DNA region, is a potentially effective tool for identifying species across the eukaryotes of the world (Frezal and Leblois 2008). For animals, a 648-bp fragment of the mitochondrial gene cytochrome c oxidase subunit I (COI) has been chosen as the standard barcoding marker due to its high interspecific variation, low intraspecific variation, and relatively universal primers for taxonomic groups at the level of orders and even classes (Hebert et al. 2003). This marker features high resolution in many of the animal groups studies so far, e.g., Neotropical bats (Clare et al. 2007), North American birds (Kerr et al. 2007), Australian fishes (Ward et al. 2005), and tropical Lepidoptera (Hajibabaei et al. 2006; but see Elias et al. 2007). In other groups, resolution is more moderate, e.g., Diptera (Meier et al. 2006) and marine gastropods (Meyer and Paulay 2005). Species identification through DNA barcoding has many practical utilities such as in conservation biology (Neigel et al. 2007; Ward et al. 2008a), food security control (Wong and Hanner 2008), and birdstrike identification (Dove et al. 2008). This approach depends on a good library of COI sequences being established prior to testing the species identity of an unknown sample or specimen.

In addition to the practical utility of DNA barcoding, the method provides a means of quickly screening biodiversity to identify species or species groups with unusual genetic structure, e.g., cryptic species or species with shared COI haplotypes. The potential for discovering biologically interesting patterns should be highest among the so-called microfauna, but even in more charismatic, well-studied groups like fishes and birds, DNA barcoding has revealed cases of apparent cryptic speciation and indicated the genetic uniformity of species that are classically viewed as two or more separate species (Hebert et al. 2004; Kerr et al. 2007; Ward et al. 2005, 2008). DNA barcoding can therefore have a scientific value beyond species identification, since it can provide clues to ongoing speciation processes and direct attention to species or species complexes that deserve more detailed morphological and genetic scrutiny.

Birds are among the best-studied and taxonomically well-described animal groups, which makes them good models for testing the efficacy of DNA barcoding as a species identification tool. Two publications have summarized DNA barcoding results for the North American bird fauna (Hebert et al. 2004; Kerr et al. 2007). These studies revealed a high species identification power (94% of species resolved), and pinpointed 15 species with deep intraspecific genetic splits (termed provisional species) and 17 congeneric species pairs/complexes with little or no differentiation. Two recent studies have focused on the ability of COI barcoding to distinguish between closely related, and potentially hybridizing, sister bird species (Aliabadian et al. 2009; Tavares and Baker 2008). While Tavares and Baker (2008) found 100% resolution of COI barcoding in a sample of 60 sister species pairs based on neighbor-joining tree clustering and diagnostic fixed substitutions, Aliabadian et al. (2009) found that a high percentage (42%) of 36 Holarctic allopatric sister species pairs could not be distinguished based on the suggested threshold genetic distance for bird species at 10 times the average intraspecific distance (Hebert et al. 2004). Delimiting species based on a threshold of sequence difference is controversial (Meyer and Paulay 2005; Moritz and Cicero 2004) and may not be applicable to closely related species (Moritz and Cicero 2004; Tavares and Baker 2008).

A number of bird species inhabit both the Palearctic and the Nearctic region. Typically, such species are divided into one or more different subspecies on each continent, based primarily on morphology (del Hoyo et al. 1992–2008). Recent studies of genetic divergence in a few Holarctic species have found relatively high levels of divergence between Palearctic and Nearctic populations (e.g., Drovetski et al. 2004; Koopman et al. 2005; Zink et al. 2006). To date, no comprehensive study of the genetic divergence between populations on each side of the Atlantic Ocean has been performed. Given the presumably long history of separation and lack of current gene flow for many species, the potential for deep trans-Atlantic splits should be high.

The aims of the present study are twofold. First, we wanted to test the resolution power of DNA barcoding in species discrimination in a data set of 296 Scandinavian bird species. Second, we wanted to assess the degrees of trans-Atlantic sequence divergence for 78 species breeding both in Scandinavia and in North America.

Methods

Sampling

Blood and tissue samples originated from the tissue collections of the Swedish Museum of Natural History in Stockholm, the Natural History Museum, University of Oslo, Tromsø University Museum, Agder Museum of Natural History, Helgeland Museum, Bergen Museum, Ajtte—Swedish Mountain and Sami Museum, and Göteborg Natural History Museum. These were subsampled and subjected to DNA extraction using Omega blood/tissue extraction kits (Omega) or QIAamp DNA Mini Kit (Qiagen). All samples originated from breeding or non-breeding birds in Scandinavia (Norway, Sweden, and Denmark) and Svalbard, except five samples collected from three Holarctic species in Canada, with members of the same species collected from widely-spaced geographic localities whenever possible. The full details about these samples, including information about vouchers and e-vouchers, are available at the BOLD website (http://www.barcodinglife.com/), project “Birds of Scandinavia”. Details about the North American samples used in the cross-Atlantic comparison can be found in Kerr et al. (2007), and sequences are available in BOLD (project: “Birds of North America—Phase II”).

Species nomenclature follows the taxonomy of Clements (2007), which is also used by the “All Birds Barcoding Initiative” (http://www.barcodingbirds.org/).

DNA sequencing

DNA extraction, PCR amplification, and DNA sequencing of the COI gene was performed at the Swedish Museum of Natural History in Stockholm and the Natural History Museum in Oslo. In the two laboratories, several combinations of primers were utilized: standard primers Bird F1 (TTCTCCAACCACAAAGACATTGGCAC), Bird R1 (ACGTGGGAGATAATTCCAAATCCTG), Bird R2 (ACTACATGTGAGATGATTCCGAATCCAG), and COIbirdR2 (ACGTGGGAGATAATTCCAAATCCTGG), plus newly designed CO1-ExtF (ACGCTTTAACACTCAGCCATCTTACC) and CO1-ExtR (AACCAGCATATGAGGGTTCGATTCCT). The number of different primer pairs used in this study reflects the fact that the work was carried out in two different laboratories which happened to choose different published primer sets, plus both experienced problems with sequencing some specimens, which prompted one of us (D.Z.) to design a new set of primers located outside the standard region (CO1-ExtF and CO1-ExtR). These were used either in an initial PCR before a second PCR with standard primers, or as both PCR and sequencing primers. The reasons for our problems with obtaining COI sequences from a number of individuals are unknown, but may partly be due to primer mismatches for the standard COI primers in several taxa. A recent paper described similar problems with the standard primers and recommended newly designed primers for passerines (Lohman et al. 2008).

During the course of the analyses, we used three different DNA polymerases, DyNAzyme II (Finnzymes), AmpliTaq Gold (Applied Biosystems), and Platinum taq (Invitrogen). The standard thermal profile used in Oslo was: 94°C for 1 min (6 min for AmpliTaq Gold), 5 cycles of 94°C for 1 min, 45°C for 1.5 min, 72°C for 1.5 min, followed by 30 cycles of 94°C for 1 min, 51°C for 1.5 min, 72°C for 1.5 min, and a final extension at 72°C for 5 min. The standard thermal profile used in Stockholm was: (1) initial PCR, using CO1-ExtF and CO1-ExtR: 95°C for 5 min, 2 cycles of 95°C for 30 s, 66°C for 30 s, 72°C for 50 s, 2 cycles of 95°C for 30 s, 64°C for 30 s, 72°C for 50 s, 16 cycles of 95°C for 30 s, 62°C for 30 s, 72°C for 50 s, and a final extension at 72°C for 8 min; and (2) second PCR, using internal primers: 95°C for 5 min, 4 cycles of 95°C for 30 s, 64°C for 30 s, 72°C for 10 s, 4 cycles of 95°C for 30 s, 62°C for 30 s, 72°C for 10 s, 22 cycles of 95°C for 30 s, 60°C for 30 s, 72°C for 10 s, and a final extension at 72°C for 8 min.

Pseudogenes (numts), i.e., mtDNA fragments incorporated in the nuclear genome (Bensasson et al. 2001; Sorenson and Quinn 1998), may represent a source of error since PCR-based analyses will often amplify both the authentic mitochondrial sequence and the pseudogene. In a few cases of large intra-specific variation in COI sequences (see “Results”), we checked the mitochondrial origin of our sequences by amplifying a much longer sequence of the mitochondrial genome and thereby reducing the risk of co-amplifying any pseudogene. We used the primers L6615 and H10884 (Sorenson et al. 1999), which amplify an approximately 4,000-bp mitochondrial fragment in birds, together with the GeneAmp© XL PCR kit (Applied Biosystems), following the instructions of the manufacturer. We then performed a second PCR on the product of this long PCR, using primers COI-ExtF and Bird R2, and checked whether the resulting sequences were identical to those originally obtained.

Dataset

The original Scandinavian dataset consisted of 996 specimens from 302 species. However, we failed to obtain sequences from 27 specimens of 24 species. In addition, we excluded all sequences with >1% ambiguous nucleotides from the analyses (n = 10). The sequences from one specimen of Glaucidium passerinum and one specimen of Motacilla flava were excluded because they contained stop codons, suggesting that they were pseudogene sequences. The original sequences from 2 specimens of Bucephala clangula proved to be very different from sequences of 9 other specimens from the same species and 18 specimens from two other congeners (Bucephala albeola and B. islandica) in the BOLD database. These samples were repeatedly amplified and sequenced with different combinations of the aforementioned COI primers, with similar results. However, using the XL-PCR procedure, we were able to recover mitochondrial COI sequences from both specimens that grouped with the other sequences from the same species. This suggests that the original sequences came from a nuclear pseudogene, despite containing no stop codons or frame-shift mutations (Song et al. 2008).

High quality sequences were thus obtained from 957 (96%) specimens, representing 296 species. The average number of specimens per species was 3.2 (range 1–10), with 256 species consisting of two or more specimens. The sample included 97% of the 250 species breeding in Sweden and 98% of the 258 species breeding in Norway and Svalbard.

Statistical procedures

We used the analytical tools at the BOLD website (http://www.barcodinglife.com/) to calculate nearest neighbor distances and other summary statistics (Ratnasingham and Hebert 2007). Neighbor-joining (NJ) trees were constructed based on the Kimura 2 parameter model. Bootstrap values were calculated in Mega using 1,000 iterations. For a subsample containing 97.7% of the specimens in the NJ tree, a Bayesian Markov chain Monte Carlo (MCMC) analysis was performed using MrBayes version 3.1.2 (Huelsenbeck and Ronquist 2001) as implemented at the Bioportal at the University of Oslo (http://www.bioportal.uio.no/). The analysis was run with 4 chains for 20 million generations, using 9 million generations as burnin, taking samples every 1,000 generations. The substitution model was determined using Mr Modeltest Version 3.7 (Posada and Crandall 1998) based on the Akaike Information Criterion (AIC). The preferred model was the general time reversible model (GTR; four nucleotide frequency state parameters; six substitution rate parameters) with a proportion of invariable sites (I) and a gamma distribution for (G).

Results

DNA barcodes of Scandinavian birds

Average intraspecific distance was 0.24% (range 0–5.08%), while the average nearest neighbour distance was 7.95% (range 0–17.05%) (Fig. 1). A total of 277 species (93.6%) had unique DNA barcodes with no overlap with other species and nearest neighbor distances ranging within the interval 0.94–17.05% (average 8.0%) (see complete NJ-tree in electronic supplementary material). Bootstrap support for these monophyletic clusters averaged 99.84 (range 88–100). The Bayesian phylogenetic analysis gave virtually identical results (data not shown). Of these 277 species, 12 species had a distance to a sister species that fell below the threshold suggested by Hebert et al. (2004), which amounted to 2.4% in our study (average nearest neighbour distance for the 12 species 1.42%, range 0.94–1.74%). Nevertheless, these 12 species had between 6 and 13 diagnostic base substitutions compared to their nearest neighbors. Three species pairs (Cygnus columbianus and C. cygnus, Gavia adamsii and G. immer, and Stercorarius pomarinus and S. skua) displayed high degrees of sequence similarity (99.38, 99.22, and 99.70%, respectively); however, this observation should be treated with caution since one species in each of these pairs was only represented by one individual (see “Discussion”). Thirteen species (4.4%) in five clusters had overlapping barcodes with one or more closely related species (Table 1), with sequence similarity ranging from 99.86 to 100%.

Fig. 1
figure 1

Histograms showing the distribution of a average intraspecific distances and b the average nearest neighbor distance across 296 Scandinavian bird species

Table 1 Scandinavian species with overlapping barcode clusters

Four species showed relatively high (>2%) intraspecific sequence variation (Table 2). This is illustrated for Phoenicurus phoenicurus in Fig. 2 and for Lanius collurio in Fig. 3. For the two species with the highest intraspecific divergence (Phoenicurus phoenicurus and Periparus ater), we used the XL-PCR procedure and found support for a mitochondrial origin of the divergent haplotypes.

Table 2 Scandinavian species with large (>2%) within-species variation
Fig. 2
figure 2

Neighbor-joining tree (based on the Kimura 2 parameter) illustrating intraspecific distances in two congeneric passerine species, Phoenicurus phoenicurus and P. ochruros, with Oenanthe oenanthe as outgroup. The two divergent P. phoenicurus haplotypes are highlighted with colors/shading. For each individual, the scientific name is followed by the BOLD process ID number and the sampling locality

Fig. 3
figure 3

Neighbor-joining tree (based on the Kimura 2 parameter) for three Lanius species, with Corvus cornix as outgroup. The tree illustrates the paraphyletic relationship between Lanius excubitor from North America (upper/blue box) and Scandinavia (lower/red box). For each individual, the scientific name is followed by the BOLD process ID number and the sampling locality

Scandinavian versus North American species

The sample contained 78 species breeding both in Scandinavia and North America. The average intraspecific between-continent distance was 0.85% (range 0–4.86%) for these species (Table 3). Nineteen species (24%) showed divergent genetic clusters in the two regions, with an average between-continent distance of 2.53% (range 0.45–4.86%). For another three species (Lagopus lagopus, Fulmarus glacialis, Corvus corax), the Scandinavian birds clustered with some North American specimens in a clade that was clearly separable from a second North American clade. The remaining 56 species (72%) showed little or no divergence related to continent, with an average between-continent distance of 0.25% (Table 3). Sixteen (84%) of the species with trans-Atlantic splits were terrestrial or inland breeders, whereas more marine or aquatic species were represented among the 59 species with no distinct trans-Atlantic clustering (Table 3).

Table 3 Summary statistics for 78 species breeding both in Scandinavia and North America

Three of the trans-Atlantic species were paraphyletic, that is, one cluster was more similar to a sister species than to the other intraspecific cluster. COI barcodes of North American Northern Shrikes Lanius excubitor were more closely related to the North American Loggerhead Shrike (Lanius ludovicianus) than to the Scandinavian Northern Shrikes (Fig. 3). Similarly, North American Sandwich Terns Thalasseus sandvicensis appeared more closely related to North American Elegant Terns (Thalasseus elegans) than to their Scandinavian counterparts (Fig. 4). North American Mew Gulls Larus canus grouped together within the shallow multispecies cluster of white-headed Larus gulls, while Scandinavian L. canus clustered outside the entire group with relatively good bootstrap support (Fig. 5). A mitochondrial origin of the Scandinavian sequences was supported by XL-PCR for Lanius excubitor and Larus canus, but not tested in Thalasseus sandvicensis due to technical problems.

Fig. 4
figure 4

Neighbor-joining tree (based on the Kimura 2 parameter) for three Thalasseus species, with Sterna hirundo as outgroup. The tree illustrates the paraphyletic relationship between Thalasseus sandvicensis from North America (lower/blue box) and Scandinavia (upper/red box). For each individual, the scientific name is followed by the BOLD process ID number and the sampling locality

Fig. 5
figure 5

Neighbor-joining tree (based on the Kimura 2 parameter) for ten Larus species, with Rissa tridactyla as outgroup. The tree illustrates the paraphyletic relationship between Larus canus from North America (lower/blue box) and Scandinavia (upper/red box). For each individual, the scientific name is followed by the BOLD process ID number and the sampling locality

Discussion

Cytochrome c oxidase subunit I barcodes gave high species resolution of Scandinavian birds, with 94% of currently recognized species displaying unique barcodes. Five genetic clusters consisted of two or more congeneric species that could not be separated using the COI gene. The comparison of bird species breeding both in Scandinavia and North America revealed 19 species with well-defined haplotype groups confined to each continent, some of which had an inter-continental degree of divergence commonly seen at the between-species level. Our results thus contribute to the growing library of DNA barcodes of bird species of the world, and flag particular species and species groups in need of taxonomic reassessments.

The high resolution of COI in species identification among Scandinavian birds (94%) is similar to that found in other studies of COI performance in well-known vertebrate groups, e.g., North American birds (94%; Kerr et al. 2007), Australian fishes (100%; Ward et al. 2005) and Neotropical bats (93%; Clare et al. 2007), and contrasts with lower identification rates of COI in some invertebrate groups (e.g., Elias et al. 2007; Meier et al. 2006; Meyer and Paulay 2005). Obviously, the success of DNA barcoding as a species identification tool can only be evaluated in groups for which robust species taxonomy exists already. This is definitely the case for vertebrates, but less so for many invertebrate groups. In any case, resolution of more than 90% of all species is quite an impressive performance by a single genetic marker, and provides a cost-effective tool for screening of biodiversity (Hebert and Gregory 2005; Stoeckle and Hebert 2008). Resolution was higher when using uniqueness of NJ-clusters (i.e., monophyly) as a species delimitation criterion (94%) rather than the threshold genetic distance (89%) proposed by Hebert et al. (2004). This is consistent with the results of two recent papers on DNA barcoding in closely related bird species (Aliabadian et al. 2009; Tavares and Baker 2008).

We found five species pairs or groups that could not be distinguished by DNA barcoding. In all five cases, the species or species groups in question were closely related congeners. There are two mutually non-exclusive hypotheses to explain a lack of COI haplotype divergence between closely related, but good species. First, the species may have diverged relatively recently, with insufficient time for lineage sorting of their mtDNA haplotypes. Second, hybridization may be prevalent, resulting in introgression of the whole mitochondrial genome from one species to the other. Assuming that hybrid pairs are not biased with respect to sex and species, one would expect to find two haplotype clusters, both consisting of a mix of both species. However, there could also be a directional bias in mtDNA transfer, which has been suggested for Stercoraria skua and S. pomarinus (Andersson 1999). All five groups of species with overlapping barcodes in this study are known to hybridize occasionally today (McCarthy 2006). However, the specimens included in this study were selected for showing clear species identity based on morphology, hence current hybridization seems unlikely to explain the overlapping barcodes in most cases. Three of the five species groups were also identified as having overlapping barcodes in the previous study of North American birds (Somateria mollissima/spectabilis, Larus spp., and Carduelis flammea/hornemanni; Kerr et al. 2007). In the Larus case, our study adds L. marinus to the group of large white-headed gulls with overlapping barcodes. For the Somateria eiders and Larus gulls there is generally little doubt that the taxa in question behave as good biological species with distinct gene pools being reproductively isolated, even in sympatry. For the Carduelis flammea/hornemanni complex, the taxonomy has long been debated (Knox 1988; Marthinsen et al. 2008; Molau 1985). A recent study of mtDNA (control region) and nuclear (microsatellite) markers revealed no genetic differentiation within this species complex (Marthinsen et al. 2008). The possibility of one common gene pool, displaying distinct clinal morphological variation, cannot yet be ruled out (Marthinsen et al. 2008). The two crossbills, Loxia curvirostra and L. pytyopsittacus, were found to be morphologically distinct and to mate assortatively with respect to bill morphology, but they were not genetically distinct in either mtDNA or microsatellites (Summers et al. 2007). The Loxia species complex may be an example of recent ecological speciation caused by adaptation of beak size to different seed types (Benkman 2003; Smith and Benkman 2007).

Three species pairs (Cygnus columbianus and C. cygnus, Gavia adamsii and G. immer, and Stercorarius pomarinus and S. skua) showed non-overlapping, but very similar, barcodes. In each of these species pairs, one species was represented by a single specimen, which precludes any conclusion about consistent differences based on the Scandinavian samples alone. However, when pooling these specimens with the published North American ones (Kerr et al. 2007), well-defined monophyletic groups were observed in all six species, albeit with a very low degree of differentiation (data not shown). These and several of the species with overlapping barcodes may be young species, in which morphological divergences have established faster than neutral genetic divergences (Peters et al. 2005).

In contrast to interspecifically similar or overlapping barcodes, we also identified some Scandinavian species with large intraspecific sequence divergence, typically represented by two distinct haplotype lineages (Table 2). The magnitude of intraspecific divergence in COI exceeds typical species-level differences, particularly in Periparus ater and Phoenicurus phoenicurus, yet there is no indication that this large sequence variation represents cryptic species. There is no known morphological differentiation between these haplotype groups, and in the species with largest intraspecific distance, Phoenicurus phoenicurus, we have direct evidence for interbreeding of the haplotypes. Either these large intraspecific divergences are real and demand an evolutionary explanation, or they are analytical artifacts due to amplification of nuclear pseudogenes (Song et al. 2008). To test the latter alternative, we amplified a long fragment (>4,000 bp) using conserved mitochondrial markers (Sorenson et al. 1999), and used this as the basis for a second PCR with internal COI primers. This XL-PCR procedure supported a mitochondrial origin of the divergent haplotypes in these species. In addition, we amplified another mtDNA marker, the 16S gene, for some specimens to see if they segregated similarly as for COI. This confirmed the segregation pattern for Periparus ater and Phoenicurus phoenicurus (A. Johnsen and J.T. Lifjeld, unpublished data). Furthermore, for all four species in Table 2, we have unpublished barcode results of specimens from other parts of the Palearctic, which adhere to the same pattern of two or more separate lineages within the species range (K.C.R. Kerr et al., unpublished data; A. Johnsen and J.T. Lifjeld, unpublished data). Hence, the most parsimonious explanation for these divergences is that the current populations consist of a mixture of separate lineages that once evolved in allopatry but at some point in time merged and have since survived as two separate clusters in sympatry. Only sequencing of more specimens from a wider geographical coverage of their ranges can elucidate the phylogeographical history of these lineages.

Our comparison of sequence divergence among 78 Holarctic species occurring both in Scandinavia and North America revealed 19 species with well-defined barcode clusters for each continent, with another three species (Lagopus lagopus, Fulmarus glacialis, Corvus corax) in which Scandinavian birds clustered with one of two North American groups. For one of the latter species, Corvus corax, a similar pattern was found using the mitochondrial control region (Omland et al. 2006). The remaining 56 species had low intraspecific divergence and no consistent intercontinental groupings. Six of these species have been introduced from Europe to North America (Streptopelia decaocto, Cygnus olor, Passer domesticus, Sturnus vulgaris, and Columba livia) or vice versa (Branta canadensis), hence their lack of genetic divergence is not surprising. More than half (60%) the naturally occurring species with no divergence between the continents have a more or less continuous distribution from western Europe to eastern Siberia which continues on the other side of the Bering strait into Alaska and further into North America, while 70% of the remaining species with discontinuous distribution are marine breeders (see Table 3). In contrast, 14 of the 19 divergent species (74%) show a discontinuous distribution with little geographic connection between populations on the two continents, and the majority (84%) are inland-breeding birds. The likelihood of gene flow between the continents therefore seems very low for many of these species. Previous studies, using other genetic markers, have found strong intercontinental genetic divergence in 11 of the 19 divergent species identified in the present study: Aegolius funereus (Koopman et al. 2005), Circus cyaneus (Wink and Sauer-Gürth 2004), Falco columbarius (Wink and Sauer-Gürth 2004), Hirundo rustica (Zink et al. 2006), Lanius excubitor (Gonzalez et al. 2008; Klassert et al. 2008), Larus canus (Zink et al. 1995), Numenius phaeopus (Zink et al. 1995), Pandion haliaetus (Wink et al. 2004), Thalasseus sandvicensis (Efe et al. 2009), Troglodytes troglodytes (Drovetski et al. 2004), and Tyto alba (Wink et al. 2008). For almost all the divergent species, different subspecies have been described on each continent (Clements 2007), and in many cases the divergence matches current subspecific taxonomy. It is an important but unresolved issue in avian taxonomy what merits subspecific or specific taxonomic status in each case. If more extensive sampling confirms these patterns of monophyly, it is a highly relevant issue for avian taxonomic authorities whether these subspecies should be lifted to the species rank. As argued by Zink (2004), a high proportion of current avian subspecies are not monophyletic units and should therefore not be assigned a taxonomic name or rank. An implication of this view, following a phylogenetic species concept, is that monophyletic groups should be assigned the rank of species. We would not make specific recommendations for taxonomy here, since our analyses in most cases reflect a restricted and geographically limited sampling within the species’ ranges. However, we would like to comment more specifically on the three particular cases of trans-Atlantic splits that revealed evidence of paraphyletic groups.

First, North American Lanius excubitor (probably belonging to the subspecies borealis and invictus) formed a sister clade to L. ludovicianus, whereas European L. excubitor (subspecies excubitor) clustered outside this North American clade (Fig. 3). This pattern is supported by Gonzalez et al. (2008) and Klassert et al. (2008), who both analyzed the Lanius excubitor/meridionalis complex with the cytochrome b gene, and found L. e. excubitor on the European side and L. e.invictus on the North American side to be paraphyletic. L. excubitor is widely distributed in the northern Holarctic, but does not occur in northeastern Siberia, making the geographic distance between Palearctic and Nearctic birds very long. Gene flow between these populations should therefore be negligible, and the relatively large sequence divergence (2.5%) between them suggests that they have been separated for a long time. A mitochondrial origin of the Scandinavian sequences was supported by XL-PCR, hence the patterns found are unlikely to result from amplification of pseudogenes. Second, North American Thalasseus sandvicensis (subspecies acuflavidus), clustered together with another North American Thalasseus species, T. elegans, while the European T. sandvicensis (subspecies sandvicensis) formed a group outside this clade (Fig. 4). T. sandvicensis breeds on each side of the Atlantic Ocean and shows a 3.9% sequence divergence between the continents. This combination of large sequence difference, geographic separation, and paraphyly suggests that these populations should be considered separate species (i.e., acuflavidis be given species rank), supporting the conclusions of a recent paper documenting similar divergences in both mitochondrial and nuclear markers between Spanish and American Thalasseus sandvicensis (Efe et al. 2009). Third, Scandinavian Larus canus (subspecies canus) formed a clade outside the large clade of white-headed Larus gulls to which North American L. canus (subspecies brachyrhynchus) belong (Fig. 5). A mitochondrial origin of the Scandinavian Larus canus sequences was supported by XL-PCR. Larus canus has a more or less continuous circumpolar distribution, but the Bering Strait may act as a barrier to gene flow, as suggested by Zink et al. (1995). Even if the overall divergence between the continents (1.9%) was more moderate than for the previous two species, paraphyly in COI indicates that a taxonomic revision is needed. Other species that deserve more detailed study include Gallinula chloropus, Strix nebulosa, Numenius phaeopus, Tyto alba, and Pinicola enucleator, all of which have >3% continental divergence.

Three North American species have recently been split from their Eurasian counterparts (Anas carolinensis from A. crecca, Gallinago delicata from G. gallinago, and Picoides dorsalis from P. tridactylus) (Clements 2007). The combined Scandinavian and North American barcoding projects provide support for two of these splits (A. carolinensis and A. crecca: nearest neighbor distance 3.56%, and P. dorsalis and P. tridactylus: nearest neighbor distance 3.69%), but not for the third one, since G. delicata cannot be separated from G. gallinago based on COI barcodes (nearest neighbor distance 0%).

In conclusion, we have shown that DNA barcoding with COI has a high species resolution for Scandinavian birds. Our approach also identified several species with large intraspecific variation as well as some species groups with overlapping barcodes, which make them good candidates for further studies of phylogeography and speciation processes. In particular, our comparison of trans-Atlantic species identified several species with deep intercontinental divergences. Further phylogenetic work on these species, using other genetic markers as well as morphology, will reveal which of these highly divergent and geographically separated populations should be treated as belonging to the same species or sister species. We leave it to the taxonomic authorities to decide, but we are convinced that continued DNA barcoding of birds from various regions of the world will be of great help for avian taxonomists in the continuing process of delimiting species. Lastly, we found that the DNA barcoding approach greatly amplified the power of our regional studies by enabling easy merging of our independently obtained datasets with the established avian barcode library (http://www.barcodingbirds.org).

Zusammenfassung

DNA-barcoding skandinavischer Vögel zeigt divergente Abstammungslinien bei transatlantischen Arten

Vögel sind eine taxonomisch gut beschriebene Tiergruppe, doch DNA-Barcoding, die molekulare Charakterisierung von Arten mittels eines standardisierten genetischen Markers, hat unerwartete Muster genetischer Divergenz bei nordamerikanischen Vögeln aufgezeigt. Wir haben eine umfassende COI (Cytochrom-c-Oxidase-Untereinheit I) Barcode-Erfassung von 296 skandinavischen Vogelarten durchgeführt und genetische Divergenzen zwischen 78 transatlantischen Arten verglichen, deren Brutgebiete sowohl Skandinavien als auch Nordamerika einschließen. Vierundneunzig Prozent der skandinavischen Arten zeigten einzigartige Barcode-Cluster; die verbleibenden sechs Prozent hatten überlappende Barcodes mit einer oder mehreren Arten der gleichen Gattung, was „incomplete lineage sorting“ oder einen einzigen Genpool widerspiegeln könnte. Vier Arten zeigten große innerartliche Divergenzen innerhalb Skandinaviens, trotz fehlender offensichtlicher morphologischer Differenzierung oder Anzeichen reproduktiver Isolation. Diese Fälle könnten Beimischung zuvor isolierter Linien widerspiegeln und daher umfassendere phylogeographische Analysen rechtfertigen. Neunzehn (24%) von 78 transatlantischen Arten zeigten divergente genetische Cluster, die mit regionalen Unterarten übereinstimmen. Drei dieser transatlantischen Divergenzen waren paraphyletisch. Unsere Studie zeigt die Effektivität von COI-Barcodes für die Identifizierung skandinavischer Vögel und hebt Taxa für eine taxonomische Nachprüfung hervor. Der standardisierte DNA-Barcoding Ansatz verstärkte die Power unserer regionalen Studien, indem er es ermöglichte, unabhängig gewonnene Datensätze mit der etablierten Vogel-Barcode-Bibliothek zu fusionieren.