Introduction

Oceanic islands represent excellent systems to answer questions of biogeography, phylogeography, and evolutionary biology. Compared to isolated oceanic islands, archipelagos are much more complex, e.g. due to the mostly different ages of the islands and their relative proximity to each other and to the mainland (MacArthur & Wilson, 1967). Island radiations of animal and plant species provide the means to study the effects of ecological and evolutionary forces and to infer colonisation history, leading to currently observed patterns of differentiation (Juan et al., 2000). The variety of observed phenomena in island radiations is extraordinarily diverse. In terms of colonisation alone, possibilities include (a) single colonisation of an ancestor followed by radiation, (b) multiple independent colonisations, (c) back colonisation to the mainland from island archipelagos, (d) additional colonisations and back colonisations within an archipelago (Kim et al., 2008), and (e) colonisations between archipelagos (Gillespie et al., 2008).

The Atlantic archipelagos of Madeira and the Canary Islands are excellent models for questions of island biogeo-graphy. In contrast to the Hawaiian Archipelago or the Galápagos Islands, the Canary Islands are relatively close to the mainland, providing a prolific source of colonisation events (e.g., Fuerteventura, eastern Canary Islands: 96 km to the west coast of Africa). In addition, both the Hawaiian Archipelago (oldest island 6 Ma) and the Galápagos Islands (oldest island about 4 Ma) are much younger than Porto Santo of the Madeira Archipelago (14.3 Ma) or Fuerteventura of the Canary Islands (20.2 Ma).

Patterns of colonisation of the Madeira Archipelago and the Canary Islands have been inferred for a variety of vascular plants (e.g., Böhle et al., 1996; Francisco-Ortega et al., 1996; Francisco-Ortega et al., 1997; Kim et al., 1996; Juan et al., 2000; Francisco-Ortega et al., 2002; Mort et al., 2002; Percy & Cronk, 2002) and vertebrates (Cano et al., 1984; Gonzales et al., 1996; Carranza et al., 1999; Barahona et al., 2000; Brehm et al., 2003; Maca-Meyer et al., 2003), as well as many invertebrates such as Gastropoda (Henriquez et al., 1993), Araneae (Wunderlich, 1991), Diplopoda (Enghoff, 1992), Collembola (Fjellberg, 1992), Hemiptera (Lindberg, 1953), and especially Coleoptera (Dajoz, 1977; Lindberg et al., 1958; Machado et al., 2017). Many of these taxa have undergone radiations associated with island colonisation. Vertebrate examples on the Canary Islands include lizards (Brown & Pestano, 1998; Cox et al., 2010; Thorpe et al., 1994). Invertebrate studies have addressed, e.g., weevils (Machado et al., 2017), tenebrionid beetles (Juan et al., 1996a; Rees et al., 2001), carabid beetles (Emerson et al., 1999, 2000a), grasshoppers (Hochkirch & Görzig, 2009; Hochkirch & Husemann, 2008; Husemann et al., 2014), mites (Salomone et al., 2002), spiders (Arnedo et al., 2001), cockroaches (Oromi et al., 1991), and molluscs (Abreu & Teixeira, 2008). It is striking that to date no studies have focused on the radiation of endemic wild bees in the Macaronesian islands, though they play a key role in the pollination of many of the islands’ endemic plant species (summarised in Kratochwil & Schwabe, 2018a, Kratochwil et al., 2019, Kratochwil & Schwabe, 2020). The Madeira Archipelago is characterised by 20 wild bee species with eight endemic species and one endemic subspecies (Kratochwil et al., 2018). The Canary Islands are home to 124 wild bee species, including 64 endemic species and 25 subspecies, 24 of which are endemic (Hohmann et al., 1993; Kratochwil et al., 2018; Kratochwil & Schwabe, 2018b).

Worldwide, numerous studies of the wild bee faunas of islands have been published. Oceanic volcanic islands mostly have small numbers of wild bee species, e.g., on the Galápagos Islands, only three wild bee species exist (including two probably introduced species; Gonzales et al., 2010; Rasmussen et al., 2012). An exception are islands with rapid diversification after a colonisation event, e.g., bees of the genus Hylaeus on Hawaii, which have undergone an impressive radiation in which 63 species derived from a single ancestor (Magnacca & King, 2013). Studies on oceanic islands with a focus on the radiation of taxonomically related taxa within the Anthophila using molecular genetic methods are rare (e.g., Hawaiian Hylaeus; Magnacca & Danforth, 2006, 2007), and none at all exists for the Madeira Archipelago and the Canary Islands. Exceptions are those that focus on the honey bee Apis mellifera (De la Rúa et al., 1998, 2001) and the bumble bee Bombus terrestris (Widmer et al., 1998).

The Madeira Archipelago and the Canary Islands do not have many wild bee taxa that show obvious patterns of radiation. Local assemblages are often characterised by the presence of various unrelated genera. In the case of the Canary Islands, this may be partly due to the favourable possibilities for colonisation from the mainland, which could have limited opportunities for in situ radiation of taxa. This idea is supported by the taxonomic distribution of species on the Madeira Archipelago, where many genera are represented by only a few species (Kratochwil et al., 2018). However, one genus has undergone a radiation on the archipelagos: mining bees of the genus Andrena, subgenus Micrandrena, namely, in the Andrena wollastoni group, including A. lineolata. The group has not been covered by the worldwide molecular analysis of the Andreninae by Pisanty et al. (2021).

The Andrena wollastoni group has recently been taxonomically investigated using qualitative morphological and morphometric traits (Kratochwil, 2020). All taxa of this group of small bees are endemic to the Madeira Archipelago and the Canary Islands (body length in females: 6.8–7.6 mm; in males: 5.5–6.4 mm) and are characterised by special habitat preferences (Kratochwil & Schwabe, 2020; Text S1). The group is thought to consist of two species on the Madeira Archipelago (A. wollastoni Cockerell, 1922, Madeira Island; A. dourada Kratochwil & Scheuchl, 2013, Porto Santo) and four species on the Canary Islands (A. acuta Warncke, 1968, Tenerife; A. catula Warncke, 1968, Gran Canaria; A. gomerensis, Warncke, 1993, La Gomera, La Palma; and A. lineolata Warncke, 1968, Tene-rife). Andrena gomerensis and A. acuta are further differentiated into subspecies (La Gomera: A. g. gomerensis Warncke, 1993; La Palma: A. g. palmae Kratochwil, 2020; Tenerife, Anaga region: A. a. acuta Warncke, 1968, Tenerife, Teno region: A. a. tenoensis Kratochwil, 2020, Tenerife, Dorsal Rift region: A. a. wildpreti Kratochwil, 2020).

The hypothetical ancestor of the A. wollastoni group is assumed to be related to A. tiaretta Warncke, 1974 or one of its ancestors (Text S2). Andrena tiaretta is distributed in southern Spain, Morocco, and Algeria. This hypothesis is specified in Kratochwil & Scheuchl (2013), Kratochwil (2015), and Kratochwil (2020). Andrena tiaretta morphologically shares the highest similarity with A. lineolata (Kratochwil, 2020). No other Micrandrena species occurring in Morocco today shows even approximate morphological similarity to Andrena tiaretta (Lhomme et al., 2020). The taxonomic position of the A. wollastoni group is described in the supplemental material (Text S2). To date, any phylogenetic hypotheses of the group were solely built on distribution and morphological data. However, homology of morphological characters is in many cases difficult to assess and genetic data may provide a more neutral, additional data source of information with which to infer phylogentic relationships and colonisation patterns.

We here present the first molecular study of Canarian and Madeiran wild bees. We use mitochondrial COI sequences together with qualitative morphological and morphometric data of the Andrena wollastoni group of the Madeira Archipelago and the Canary Islands to reconstruct their biogeographic history and derive colonisation and speciation scenarios. In doing so, we also test whether the results of the classification into different taxa by means of morphological and morphometric analyses published in Kratochwil (2020) can also be supported by molecular genetic markers.

Materials and methods

Study area

The distribution of the studied taxa covers the Madeira Archipelago and the Canary Islands with the exceptions of Lanzarote and Fuerteventura (Fig. 1).

Fig. 1
figure 1

Location of the Azores, the Archipelago of Madeira, the Selvagens Islands, the Canary Islands, and Cape Verde (a). Close view to the islands of the Madeira Archipelago (b) and the Western Canary Islands (below, right) (c). Tenerife is characterised by the regions of Anaga, Teno, Las Cañadas/Teide, and Dorsal Rift. The taxa of the A. wollastoni group (Kratochwil, 2020) and the centres of their distribution areas are indicated. Modified according to Aguiar & Karsholt (2006). The colours of the islands correspond to those used to represent species in Fig. 2. Distances from the mainland: 737 km from Porto Santo (Madeira Archipelago) to Cape Sim (Morocco); 96 km from Fuerteventura (Canary Islands) to Stafford Point (Western Sahara)

The age of the Madeiran Archipelago ranges from 5.2 to 14.3 Ma (Galopim De Carvalho & Brandão, 1991; Geldmacher et al., 2000; Schmincke, 1998). Geologically, Porto Santo is the oldest island of the Madeiran Achipelago (14.3 Ma). Much younger are Madeira Island (5.2 Ma) and Desertas (5.07 Ma). Morphospecies of the Andrena wollastoni group occur on Porto Santo and Madeira Island.

Due to the large number of islands differing in geological history and geomorphology, the Canary Islands are highly diverse. Fuerteventura and Lanzarote are the oldest islands (20.2 Ma and 15.5 Ma years; Carracedo, 2011) but are not known to have bees of the Andrena wollastoni group. Species of the A. wollastoni group occur on Gran Canaria (age 14.6 Ma), La Gomera (9.4 Ma), La Palma (1.72 Ma), and Tenerife (age between 6 to 11.5 Ma; Carracedo, 2011). The geological and geomorphological structure of Tenerife is highly complex. Features of current geological structures are mainly based on three previously separated volcanic islands with different ages (Anaga area, age 6 Ma; Teno area, 8 Ma; Roque del Conde Adeje area, 11.5 Ma), brought together by various volcanic-type processes. These, over long periods, volcanic activities have markedly affected Tenerife. The volcanic events that led to the connection of its regions Teno and Anaga began around 2 Ma and resulted in the formation of the volcanic structures of Las Cañadas and later Pico Teide (Ancochea et al., 1990). The area between Anaga and Las Cañadas, in contrast, has been filled by coastal and lacustrine sediments since the last 0.9 Ma (Dorsal Rift region = Cordillera Dorsal; Ancochea et al., 1990).

Sampling of specimens for molecular analyses

Specimens were collected from the Madeira Archipelago and the Canary Islands between 2016 and 2019 (Andrena wollastoni: A. F. Aguiar; all other taxa: A. Kratochwil and A. Schwabe). Collecting permits were issued by the governments of the Canary Islands and authorities of the Madeira Archipelago. Specimens were mostly caught on flowers with a small net or with small glass tubes, and then immediately transferred into a cool box. They were then stored individually in glass vials at 4 °C in 99% ethanol until further processing. Care was taken to ensure that at least three populations with at least four specimens per species and island were sampled at a greater distance (usually 10–20 km) from each other. Specimens that were collected a few hundred meters apart were considered members of the same population. A total of 143 specimens were collected (134 females, 9 males): Andrena acuta acuta (n = 4), A. a. tenoensis (n = 16), A. a. wildpreti (n = 12), A. catula (n = 13), A. dourada (n = 21), A. g. gomerensis (n = 24), A. g. palmae (n = 14), A. lineolata (n = 22), and A. wollastoni (n = 17). The specimens and populations per taxon are listed in Table S1.

Species and subspecies concept

The taxonomic basis of this publication is the revision of the A. wollastoni group in Kratochwil (2020). All existing types of the updated subspecies of Warncke were studied, and the types of existing species (A. lineolata, A. tiaretta) analysed. In the case of A. wollastoni, it was necessary to describe lectotypes (Kratochwil, 2018). Since no molecular genetic analysis was performed in Kratochwil (2020), which is the subject of this publication, the characterisation of the taxa (species and subspecies) corresponds to the morpho-species concept following the code of the ICZN (1999). For the molecular analyses, we analysed species and subspecies to test for molecular genetic differentiation among all taxa defined in Kratochwil (2020). We therefore treated the subspecies hypothetically as rankless morphotaxa.

Molecular analyses

DNA was extracted using a high salt protocol (Paxton et al., 1996) from adult bees, including both males and females that were all morphologically determined to species and subspecific levels (Table S1). The universal animal barcoding fragment of the cytochrome oxidase I (COI) gene was sequenced using standard protocols recommended by BOLD (http://www.barcodinglife.org) and oligonucleotide primers LCO/HCO (Folmer et al., 1994). We used the standard barcoding gene in our study as it has proven to be highly informative in previous studies, especially for bees (Schmidt et al., 2015). However, it is nevertheless important to recognise the limitations of this marker type in resolving phylogenies as problems may arise from lack of variation, hybridisation, incomplete lineage sorting, and other processes related to the maternal inheritance of mitochondrial DNA. DNA sequences were used to interrogate NCBI’s database using BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi). All 140 sequences were devoid of stop codons, gave good query coverage (> 90%) and sequence identity (> 90%) to other Micrandrena species (e.g., Andrena enslinella Stoeckhert, 1924), though sequence identity was < 97% for all specimens, suggesting that databases currently lack reference sequences for the focal taxa.

Sequences were cleaned and trimmed using Geneious v. 6.1.2. (Kearse et al., 2012) and were aligned using the MUSCLE algorithm. In addition to our 140 sequences of the Andrena wollastoni group, we added eight reference sequences, two each from Andrena enslinella Stoeckhert, 1924; A. minutuloides Perkins, 1914; A. semilaevis Pérez, 1903; and A. subopaca Nylander, 1848, retrieved from the NCBI (GenBank) database as outgroups. The final alignment was unambiguous, devoid of INDELS, and, after trimming, the final dataset comprised 148 sequences of 533 bp length.

Phylogenetic analyses

We estimated the most suitable substitution model using MrModeltest v. 2.3 (Nylander, 2004). The most suitable model was determined as GTR + G based on the AIC. In order to adjust for rate heterogeneity, we used the reversible jump model with Gamma rate parameter in MrBayes v. 3.2.6 (Ronquist & Huelsenbeck, 2003). We ran the simulation for 10 million generations, sampling every 1000 generations for a total of 10,000 final sampled trees. Convergence was confirmed with average split frequencies being below 0.01. Trees were visualised with FigTree v. 1.4 (Rambaut, 2009).

Further, we ran *BEAST in BEAST v. 1.8.2. (Drummond et al., 2012) with a strict molecular clock to generate a species tree with a rough estimate for the timing of divergence. As we deal with very young divergences, we consider the strict clock a valid approach (Hein et al., 2005). We first generated the input file for BEAST using BEAUti v. 1.8.2. For *BEAST we grouped species and subspecies as units. As no bee specific rate is available, we used the rate proposed by Machado et al. (2017) for Macaronesian flightless weevils (genus Laparocerus; 225 taxa studied; confirmation of monophyly of 36 species and subspecies of the Madeira Archipelago, and 196 species and subspecies of the Canary Islands). The rate proposed by Machado et al. (2017) is intermediate between those proposed by Brower (1994) and Papadopoulou et al. (2010) and hence represents a conservative estimate. The Yule process was used as a tree prior with a piecewise linear and constant root as population size model as no coalescent models are available with *BEAST; a random tree was used as starting tree. The ploidy type was chosen as mitochondrial. We ran the analysis for 300 million generations, sampling every 30,000 iterations. Effective sampling size (ESS) values were above 100 for all parameters and above 200 for most parameters, suggesting the analyses converged. Trees were summarised with TreeAnnotator v. 1.8.2 and visualised with FigTree. We further calculated a Median Joining network using Popart (Leigh & Bryant, 2015).

Morphometric analyses

For the morphometric analyses, 279 females were available: Andrena acuta acuta (n = 15), A. a. tenoensis (n = 22), A. a. wildpreti (n = 13), A. catula (n = 50), A. dourada (n = 46), A. g. gomerensis (n = 40), A. g. palmae (n = 31), A. lineolata (n = 13), and A. wollastoni (n = 49). The analysed specimens are deposited in the following collections (with acronyms of the depository): A. Kratochwil (KR; n = 204); Upper Austrian State Museum Linz, Austria (OLML: n = 37); Überseemuseum Bremen, Germany (UMBB; n = 35); and H. R Schwenninger, Stuttgart, Germany (HS: n = 3). The specimens are individually characterised by an identity code (ID-No) with further information (locality, altitude above sea level, latitude and longitude coordinates, date of collection, collector and further remarks) (see Table S2). Only females were considered since a preliminary study revealed that morphometric analysis provided clearer differences between taxa in females (AK, unpubl. data).

Eighteen parameters were investigated (in alphabetical order; in brackets, abbreviation, and magnification used for measurement): body length (BL, 16.25); from antennal base to tip of the pygidium, clypeus length (CL, 100 ×), facial fovea maximal length (FVL, 100 ×), facial fovea maximal width (FVW, 100 ×), flagellomere length 1–3 (each flagellomere separately, FL1, FL2, FL3, 100 × ; measured on ventral surfaces of the flagellomeres with the antenna stretched forward), head length (HL, 40 × ; from the top of the vertex to the lower margin of the clypeus excluding the process of the labrum), head width (HW, 40 ×), labrum process width (apical process width) at the top (LPW, 100 ×), mesosomal width (MSW, 40 × ; between outer rims of the tegulae), metasomal width (MTW, 40 ×), maximum width of terga from dorsal view, ocelloccipital distance (OCD, 62.5 ×), ocellocular distance (OOD, 62.5 ×), postocular distance (POD, 62.5 ×), propodeum basal area length (PBAL, 100×), pterostigma length (PSL, 100×), wing length (WL, 16.25 ×), and length of the forewing including the tegula. The morphological and morphometric studies were carried out with a stereo microscope (Wild M3Z modular stereomicroscope, Heerbrugg, Switzerland, with a 25 × eyepiece with micrometer giving 16.25 × , 40 × , 62.5 × , and 100 × magnification).

Kratochwil (2020) used statistical methods for a morphometric analysis of the taxa of the A. wollastoni group according to Baur and Leuenberger (2011). These methods include univariate analysis, calculation of Pearson product-moment correlation coefficients, multivariate ratio analysis, principal component analysis, linear discriminant analysis, and allometry ratio spectrum. Of particular importance is the analysis of whether or not individual morphometric measures depend on individual body size. In Kratochwil (2020), further tools of correlation and eigenvalue analyses were developed. The morphometric results are presented in Kratochwil (2020) in 50 boxplots. In total, 225 significance tests were performed. Furthermore, the shape PC1/shape PC2 clusters and the best plotted ratio of morphometric features were presented in 60 diagrams. We summarise these results in one diagram using the following method: Pearson product-moment correlation coefficients were calculated for all measurements with the function ‘cor ()’ in R (R Core Team, 2016) using default settings. Ten of our parameters with the highest correlation coefficients were used for further analysis (BL, FL2, FL3, HL, HW, MSW, MTW, OOD, PSL, WL). The morphometric data of these parameters are presented in Table S3. For further analysis, the data were log10-transformed. A linear discriminant analysis (LDA) in R was then used to detect linear combinations of the original variables (10 morphometric parameters) that provided the best possible separation between the groups (taxa) in our dataset, including highest eigenvalues. Linear discriminant analysis was carried out using the ‘lda()’ function of the R package MASS (Venables & Ripley, 2002). LDA determines group means and computes, for each specimen, the probability of belonging to different groups. A specimen is then assigned to the group with the highest probability score. A plot of LDA scores was generated with a confidence ellipse around individuals of each taxon corresponding to the 95% confidence level, and further adjusted in EazyDraw (2020).

Qualitative morphological analyses

A total of 40 qualitative (non-meristic) parameters (colour, structure, pubescence) were measured in the same individuals (Table S4). An explanation of the parameter groups is given in Table S5, whilst the data matrix is shown in Table S6. All parameters are explained in detail in Kratochwil (2020). Principal component analysis was used to explore non-meristic character variation across individuals using the function ‘prcomp’ from R’s stats package (R Core Team, 2016). The various assumptions of the PCA approach were satisfied by the data. The results were visualised in the R package ggbiplot (Vu, 2011) and adjusted in EazyDraw (2020).

Combined analyses

In addition, we correlated the genetic and the qualitative morphological and morphometric results to understand their relationships, following the synthetic approach of Baur et al. (2014). We generated estimates of genetic differentiation between species as ΦST with DnaSP v. 6.12.03 (Rozas et al., 2017); subspecies were grouped together within species as phylogenetic analyses already suggested no genetic differences between them. Differentiation in qualitative morphological and morphometric traits were calculated as square Euclidian and Mahalanobis distances, respectively, with PAST v. 4.03 (Hammer-Muntz et al., 2001). The different distance matrices were correlated with PAST.

Results

Molecular analyses

We generated a total of 140 ingroup sequences and included eight outgroup sequences. The total alignment was 533 bp in length. The haplotype network showed little overlap between the island populations (Fig. 2). Only La Gomera and La Palma shared similar haplotypes on the Canary Islands; Madeira and Porto Santo also shared haplotypes. Individuals from La Gomera and La Palma were very distinct from all others (22 mutations). Further, Andrena lineolata from Tenerife was quite divergent from all other populations (11 mutations to the closest other haplotype from A. dourada and A. wollastoni, a maximum of 33 mutations to A. gomerensis). Andrena dourada (Porto Santo) took a central place in the haplotype network and is connected to most other species. Andrena dourada and A. wollastoni (Madeira Island) have private haplotypes, but also share two haplotypes. In turn, A. catula (Gran Canary) and A. acuta (Tene-rife) only differ by one mutation, but do not share any haplotypes. Andrena acuta is likely derived from an A. catula haplotype. Subspecies differentiation, as is found within A. acuta and within A. gomerensis using morphometric traits (Kratochwil, 2020), is not reflected in the haplotype analysis.

Fig. 2
figure 2

Median Joining network of Andrena species (Micrandrena) of the Canary Islands and the Madeira Archipelago. Circle size is relative to number of haplotype copies present in dataset. A branch re-presents a single nucleotide change (mutation); bars on branches re-present inferred missing haplotypes (single nucleotide changes). LG La Gomera, LP La Palma. The colours correspond to those used to represent the location of species on islands in Fig. 1

The phylogeny generated with MrBayes did not resolve all species (Fig. S1). As in the haplotype network, subspecies of Andrena gomerensis from La Gomera and La Palma could not be distinguished. The populations of both islands took the basal position in the ingroup and are the sister group to all other populations (PP: 1.00). Next, Andrena lineolata from Tenerife split off (PP: 1.00). The other populations were not well resolved and represent the sister group to A. lineolata. Andrena acuta (Tenerife) and A. catula (Gran Canary) grouped together with high support (PP: 0.99) but did not form monophyletic clusters. Andrena dourada (Porto Santo) and A. wollastoni (Madeira Island) were admixed.

The analysis with *BEAST provided overall similar results in terms of the relationships (Fig. 3). Again, Andrena gomerensis from La Palma and La Gomera grouped together with high support (PP: 1.0) and represented the basal lineage within the ingroup. Next, Andrena lineolata from Tenerife split off (PP: 1.0) and is the sister group to all other species. Andrena catula (Gran Canary) is the sister of A. acuta (Tenerife; PP: 0.94); the group of A. catula and A. acuta is the sister group to the species from the Madeira Archipelago (PP: 1.00). Andrena dourada is sister of A. wollastoni (PP: 0.97). The molecular clock analysis dates the basal split within the ingroup at about 2.18 Ma (1.28–3.11; 95% confidence intervals), when A. gomerensis split off from all others. Andrena lineolata split off the other populations at about 0.89 (0.41–1.41) Ma, whereas the species of the Madeira Archipelago and A. catula and A. acuta diverged about 0.27 (0.11–0.5) Ma. Andrena catula and A. acuta split about 0.14 (0.04–0.28) Ma. The species from the Madeira Archipelago diverged within the last 60 (10–160) Ka. Andrena wollastoni is the likely most recent species.

Fig. 3
figure 3

Dated species tree demonstrating the phylogenetic relationships of the different island populations calculated with *BEAST compared to the outgroup species Andrena enslinella, A. subopaca, A. minutuloides, and A. semilaevis (all Micrandrena); dating is based on a strict molecular clock with the rate proposed by Machado et al. (2017). Branch support above 0.95 is shown with red circles; posterior probabilities above 0.90 are shown with yellow circles. Times are given in million years. Blue bars represent 95% confidence intervals

Morphometric analyses

The LDA analysis, which includes ten different morphometric parameters (BL, FL2, FL3, HL, HW, MSW, MTW, OOD, PSL, WL), differentiates to a greater or lesser extent among the studied species (Fig. 4). Among the characters, HW and WL have the strongest influence on clustering, BL, FL2, and FL3 the lowest. The parameters HL, MSW, MTW, and OOD have an intermediate position.

Fig. 4
figure 4

Linear discriminant analysis (LDA) with morphometric data of all species of the A. wollastoni group (eigenvalues: LD1 19%, LD2 65%). The ellipses characterise the 95% confidence level. The colours of the specimen labels correspond to those used in Figs. 12

Andrena gomerensis is morphometrically well differentiated from A. lineolata (Fig. 4). Andrena wollastoni, the likely most recently differentiated species, is morphometrically in the LDA analysis distinguishable from all others (Fig. 4). Smaller differences exist between A. gomerensis and A. catula, as well as between A. dourada and A. acuta (Fig. 4). Andrena lineolata represents a well-defined morphometric cluster compared to all other species, with very limited overlap with A. catula and A. dourada (Fig. 4). Andrena dourada shows an almost complete separation from A. wollastoni. Andrena dourada and A. catula, as well as A. dourada and A. acuta show no clear differentiation (Fig. 4).

Qualitative morphological analyses

The PCA analysis of 40 morphological parameters (species-specific colours, structures, pubescence of different body parts) documents that the species are well differentiated and is in accord with the molecular results. The PCA shows a central position for Andrena gomerensis (Fig. 5) and a peripheral position for A. lineolata. The decisive morphological factors are shown in the vector diagram (Fig. 5, numbers 1–10). Andrena catula and A. dourada show high morphological similarity and are clearly differentiated from A. acuta and A. wollastoni, which themselves exhibit higher morphological similarity.

Fig. 5
figure 5

Principal component analysis of the morphological data. The colours of the specimen labels correspond to those used in Figs. 12 (explained variance PC1: 58.8%, PC2: 19.3%) The black arrows plotted in the centre (biplot function) characterise for each parameter the direction of increasing values. The length of the arrow indicates how accurately the parameter explains the distribution of the data

Comparative analyses

Genetic differentiation (ΦST) was generally high and varied between 1.0 between Andrena gomerensis and A. lineolata and 0.42 between A. wollastoni and A. dourada. Euclidian distances for morphometric traits varied between 9.33 between A. lineolata and A. acuta and 3.46 between A. acuta and A. gomerensis. Squared Mahalanobis distances for morphometric traits differed between 82.31 between A. gomerensis and A. wollastoni and 6.7 between A. catula and A. acuta. Both qualitative morphological and morphometric data showed limited correlation with genetic distance (r2 = 0.12 and r2 = 0.18, respectively; Fig. 6).

Fig. 6
figure 6

Correlation of genetic distance (ΦST) and A squared Mahalanobis distance of morphometric data (r2 = 0.18) and B Euclidian distance of qualitative morphological data (r2 = 0.12)

Discussion

We here provide the first molecular and taxonomic comparison of a group of bee species of the Madeira Archipelago and the Canary Islands using molecular, qualitative morphological, and morphometric data. Based on molecular clock estimates, the group seems to be young, with a common origin within the last two Ma; based on the (limited) genetic data, most, but not all, species are monophyletic, yet all are morphologically well defined. Our data suggest a complex colonisation scenario deviating from classic island hopping. We show that while the different types of data largely concur, the combination of all data provides the most complete picture of the group.

Colonisation patterns of Macaronesian Islands

For most animal groups studied in Macaronesia, taxa have generally only been found to have colonised a single archipelago, e.g., lizards (Cox et al., 2010), beetles (Emerson et al., 2000b, 2000c; Contreras-Díaz et al., 2007; Mas-Peinado et al., 2018; Rangel Lopez et al., 2018) and grasshoppers (Hochkirch & Görzig, 2009). The wild bees of the Andrena wollastoni group, on the other hand, occur with their representatives on two archipelagos (Madeira Archipelago and Canary Islands). Yet, separate colonisation of the two archipelagos from the mainland seems unlikely for the species of the Andrena wollastoni group due to the molecular results; the populations of the Madeira Archipelago are nested within the Canary Island populations.

It is remarkable that the molecular results presented here provide evidence of colonisation between the two archipelagos for the tiny bees of the Andrena wollastoni species group. Indeed, our results suggest that the sister group of all other Macaronesian species is found on La Gomera or La Palma (Canary Islands). Already Warncke (1968) considered all taxa of the Andrena wollastoni group, except A. lineolata, to be subspecies of A. wollastoni (Kratochwil, 2020); he hypothesised, but without any supporting data, (a) either the former existence of A. wollastoni on the African continent with later colonisation of the Madeira Archipelago and the Canary Islands (followed by diversification at the subspecies level) or (b) the speciation of A. wollastoni after colonisation, followed by secondary colonisation between both archipelagos, similar to our results presented here.

Colonisation models

Many colonisations within the Canary Islands follow a simple stepping-stone model (Emerson, 2002) from east to west according to the prevailing ocean currents and wind direction. This is documented for grasshoppers (Hochkirch & Görzig, 2009; López et al., 2007) and for Canarian beetles (Emerson et al., 2000c; Juan et al., 1995; Machado et al., 2017; Rees et al., 2001). In our case, successful colonisation of one island (La Gomera) seems to have been followed by subsequent radiation on the complete archipelago (Fig. 7). In Chalcides skinks (Brown & Pestano, 1998), La Gomera is also the centre of origin from which Gran Canaria and Tenerife were directly colonised.

Fig. 7
figure 7

Hypothetical colonisation model of the species of the A. wollastoni-group (continuous line). The colours of the specimen circles correspond to those used to represent constituent taxa in Figs. 12

Such colonisation patterns may have various causes. The islands of a volcanic archipelago did not all arise at the same point in time and have not exhibited equal patterns of volcanic activity. Therefore, the possible time windows for colonisations not only depend on the age of the islands but also on the habitat conditions for the coloniser (Text S1) and local extinction promoted by volcanism.

Andrena gomerensis (La Gomera, La Palma): Following our molecular clock analyses, the basal split of Andrena gomerensis (La Gomera, La Palma) is the basis for the development of all other species (2.18 Ma; Pleistocene, Gelasian) (Fig. 7). This is currently the oldest evidence of a wild bee species in the Canary Islands. It is unlikely that the ancestor of A. gomerensis colonised La Gomera directly from the mainland NW Africa. Fuerteventura and/or Lanzarote, as well as Gran Canaria, may have served as stepping-stone islands. Fuerteventura and Lanzarote are currently not inhabited by bee species of the A. wollastoni group, which may be explained by changes in the climate from more humid conditions at the time of the possible colonisation (Alonso-Zarza & Silva, 2002) to the currently extremely hot infracanarian zone (with the exception of very small subhumid summit areas of the mountains); the infracanarian zone is not a suitable habitat for this bee group (Text S1 and Kratochwil & Schwabe, 2020).

Lanzarote and Fuerteventura are also characterised by marked, recent volcanism that devastated large parts of the islands (Pallarés Padilla, 2007) and Fuerteventura had frequent periods of volcanic activity, with the most recent ones dated from about 6000 years ago (Carracedo, 2011). A reason for the apparent lack of early successful colonisation of the Andrena wollastoni group on Gran Canaria may be the island’s very long and violent volcanic activity. The marked volcanic activity in the area of the Roque Nublo complex on Gran Canaria destroyed large parts of the island 3–3.5 Ma ago (Pérez-Torrado & Mangas, 1994; Marrero, 2004). During the Quaternary, volcanic activity, which continued through to 3000 ybp, was mainly restricted to the northeast of the island (Funck et al., 1996). Extinctions or reduction in species numbers or range reductions of species as a probable consequence of volcanism have been suggested for insects, reptiles and also plants (Juan et al., 1995, Marrero & Francisco-Ortega, 2001, Emerson, 2003, Marrero, 2004, Machado et al., 2017).

The environmental conditions were completely different on La Gomera, where before 4–2 Ma, there is evidence of only insignificant, very local, and sporadic volcanic activity. In the last two million years, volcanic activity has been non-existent (Ancochea et al., 2006). The prevailing habitat conditions were likely favourable to the settlement and long-term establishment of Andrena gomerensis.

The populations of both islands (La Gomera, age 9.4 Ma; La Palma, age 1.72 Ma) could be separated at the subspecies level by morphometric methods (A. g. gomerensis, A. g. palmae; Kratochwil, 2020), but not with mDNA molecular methods. Due to the young age of La Palma, the colonisation can be hypothesised as a more recent one, as supported by the genetic data. The distance between both islands is only 58 km.

We hypothesise that A. gomerensis colonised Tenerife from La Gomera (distance about 30 km) (Fig. 7). On Tene-rife, the former palaeo-islands Anaga and Teno merged around 2 Ma ago by volcanism, forming Las Cañadas and later El Teide of Tenerife. The area between the regions Anaga and Las Cañadas had been filled by lacustrine and coastal material over the past 0.9 Ma (Ancochea et al., 1990). The young geomorphological structures of Las Cañadas were formed less than 200,000 years ago. According to our results, Andrena lineolata (Tenerife) split off at about 0.89 Ma (Pleistocene, Calabrian). We hypothesise that A. lineolata colonised primarily the lower zones and shifted later to the mountain areas. Today A. lineolata is limited to the Las Cañadas/Teide area, 2000 up to around 3000 m a.s.l., an extreme environment in the subalpine shrub zone with temperature extremes and stormy conditions.

Andrena catula (Gran Canaria) probably evolved from A. gomerensis or its ancestor, more likely through dispersal from La Gomera (direct line 125 km) than from La Palma (Fig. 7). Long-term establishment of a species of the A. wollastoni group on Gran Canaria from the mainland was probably not possible due to the long-lasting volcanic activity on that island (see above).

Andrena dourada (Porto Santo): The long colonisation distance of Porto Santo from La Palma (approximately 480 km) or La Gomera (approximately 540 km) seems astonishing (Fig. 7). Yet the most remote oceanic island archipelago Hawaii, 3200 km from the nearest land mass or island archipelago, has been colonised naturally by small insects, including those of the genus Hylaeus (Anthophila, Colletidae; Magnacca & Danforth, 2006), which are of similar body size to the species of the A. wollastoni-group studied here. Dispersal from La Gomera to Porto Santo is therefore plausible for members of the A. wollastoni group.

Violent sandstorms from the Sahara region can reach the Canary Islands, where they can cause great damage. Satellite images show that such winds can also reach the Madeira Archipelago via the Canary Islands. Impressive satellite images of such ‘Calima’ events exist (e.g., https://de.wikipedia.org/wiki/Calima#/media/Datei:Sahara_Canary_Islands.jpg). Such events could easily lead to aerial drifting of small animal species between the two archipelagos (e.g., Odonata; Weihrauch, 2011). It might be assumed that the decisive factor for successful long-distance colonisation is not so much the colonisation event per se as the ability to establish a new population once having reached a distant island, which depends on preexisting habitats for nesting and the availability of food at the recipient island.

Volcanic activity ended on Porto Santo over 8 million years ago (Cameron et al., 2006). The proposed split of Andrena dourada (Porto Santo) (0.27 Ma) falls at the Middle Pleistocene (Chibanian). Porto Santo is today an example of an old, already strongly eroded island with a mainly Mediterranean xeric oceanic climate; only small parts are exposed to subhumidic influence (Kratochwil & Schwabe, 2018b). We hypothesise that, in the time window of and after the split, the habitat conditions on Porto Santo were much more favourable for A. dourada or its ancestor than today. The island was less eroded, had more humid-subhumid parts and had larger and higher mountain areas, including sites with Laurisilva vegetation (Press & Short, 1994; Silva, 2003). We propose that in the Upper/Late Pleistocene, A. dourada or its ancestor colonised Madeira Island (Fig. 7).

The region of Madeira Island closest to Porto Santo (about 40 km) is a small peninsula (Ponta de São Lourenço). This was volcanically active in two time-windows: > 5.2–4.0 Ma and 2.4–0.9 Ma (Klügel et al., 2009).

Andrena wollastoni (Madeira Island) seemingly arose from Andrena dourada (Porto Santo) or its ancestor. Andrena wollastoni is a supergeneralist, nowadays found in large numbers on Madeira Island.

The current distance between Gran Canaria and Tene-rife is 62 km. Colonisation of Tenerife probably took place by individuals of Andrena catula from Gran Canaria about 140,000 years ago (Fig. 7), at a time when volcanic activity of the Canary Islands was much lower than earlier, at the end of the Pleistocene. On lower altitudes of Tenerife, a second endemic A. wollastoni-group species, Andrena acuta, is nowadays found in regions of the thermo-infracanarian and thermo-mesocanarian zones.

As the colonisation by A. acuta possibly occurred very recently, it would have been at a time when the former palaeo-islands were already connected. Machado (1976) and Machado et al. (2017) assumed that ‘inland islands’ covered with vegetation and surrounded by lava flows (‘kipuka scenario’) played a role in the separation of populations which were able to develop in isolation. ‘Kipukas’ means remnants of the original habitats in the area surrounded by lava flows (Vandergast & Gillespie, 2004).

Comparison between molecular, morphometric, und qualitative morphological results

The molecular, morphometric, and qualitative morphological analyses show mostly consistent results, but are only moderately correlated and differences are also apparent. All species can be separated as monophyletic group arising from Andrena gomerensis or its ancestor using all three methods. Although this is a very young radiation, the younger species can also be differentiated from each other with the exception of Andrena dourada and A. wollastoni.

As already documented in Kratochwil (2020), a taxon-taxon comparison using PCA and LDA shows that morphometry clearly separates all taxa. This method provides the highest resolution. In Kratochwil (2020) the morphometric results are presented in 50 boxplots comparing all taxa including 225 significance tests. In addition, a pairwise taxa comparison was performed and the shape PC1/shape PC2 clusters were visualised in 30 diagrams together with diagrams of the most differentiating ratios of morphometric features. Figure 4 sums this data up in a single diagram. However, it does not provide a morphometric comparison of all taxa in one diagram due to the compression of axes. The LDA, with all taxa considered, documents this through the loss of resolution. Nevertheless, the differences are clear at the 95% level and the species can be morphometrically related to each other, corresponding to the molecular analysis.

The large morphological divergence between A. gomerensis and A. lineolata can be explained by a longtime separation of the two taxa, as also supported by the sequence divergence between them. On the other hand, the morphometric analysis differentiated the two closely related species A. wollastoni and A. dourada much better than the molecular genetic data. This may suggest that morphology is under high selection, driving divergence. The weak correlation of morphological and morphometric data may also point towards selection on morphological traits (e.g., Saether et al., 2007). It may, however, also simply point to a high resolution of the morphological data or, conversely, the low resolution of COI gene sequences. However, generally caution is required when analysing qualitative morphological parameters as convergent feature complexes may also occur. In combination with the molecular results, the close morphological relationship of A. wollastoni and A. acuta (both are the youngest taxa) suggests that some morphological similarities (especially the presence of darker facial hairs, and equally darker scopa hairs) may be the result of convergence, while that of A. catula and A. dourada is more likely to be due to common ancestry.

The molecular relationships between island taxa using mitochondrial sequences do not always correspond to morphological classifications. According to Juan et al. (1996b), a discrepancy between mtDNA and morphological differentiation may result from higher morphological variability of island taxa compared to their related mainland species. One reason might be a higher evolutionary rate of morphological traits compared to molecular traits, which might be a reaction to increased selection under new environmental conditions. On the other hand, the effects of drift may be high in small founder populations and hence lead to faster shifts in traits.