Introduction

Cities worldwide vary in age, shape, size, human population, geographic context and history, and are undergoing accelerated expansion, making them highly dynamic and unstable systems. Urban ecosystems have high heterogeneity, usually characterized by mosaics of land use, including infrastructure with different purposes (residential, commercial, industrial), urban green space (parks, gardens) and natural remnants (Johnson and Munshi-South 2017; Gaertner et al. 2017), frequently organized along gradients from the city centers to suburban surrounding landscapes. In these environments, humans and wildlife experience complex interactions, affecting the evolutionary trajectory of species (Donihue and Lambert 2015; Hulme-Beaman et al. 2016; Des Roches et al. 2021; Santangelo et al. 2022). Urbanization favors the presence of closely commensal species, organisms that have changed life history features like breeding cycles, diet, behavior, or home ranges. Such changes enabled them to thrive in urbanized environments, among which invasive species are especially successful (Hulme-Beaman et al. 2016; Gaertner et al. 2017).

Invasive (non-native) species are those that have been accidentally or intentionally transported and released outside of their natural range, have dispersed from the area of introduction and have established reproductively active populations (Blackburn et al. 2011; Gaertner et al. 2017). In recent decades, global commercial exchange and transportation have facilitated and enhanced biological invasions. The invasion dynamics, i.e. propagule acquisition, introduction, establishment, proliferation, and spread, greatly determines the abundance, structure, and impact of invasive species (Lawson et al. 2011; Estoup et al. 2016). Also, the introduction and dispersal patterns of invasive species strongly influence their genetic diversity and structure, demography, and natural selection (Duglosh and Parker 2007; Konecny et al. 2013; Suárez-Atilano et al. 2019). Indeed, the success of invasion is limited when the founding populations are small, due to decreased genetic variability, genetic drift, and inbreeding resulting from founder effects. In this scenario, the ‘new population’ becomes highly differentiated from the source population, potentially leading also to the fixation of deleterious alleles (Lambrinos 2004; Lawson et al. 2011). However, genetic erosion can be avoided by the recurrent introduction of new alleles through multiple invasion events; also, if the number of initial propagules is high and the bottleneck is short, there would likely be a rapid subsequent population expansion and increased genetic diversity. Hence, the propagule pressure (i.e. relation between the propagule size and propagule frequency) is a key factor for the success or failure of an invasion (Duglosh and Parker 2007; Suarez and Tsutsui 2008; Simberloff 2009; Suárez-Atilano et al. 2019). The availability of resources in cities facilitates the establishment and proliferation of invasive species, while road networks allow the rapid dissemination of propagules (Gaertner et al. 2017; Gortat et al. 2017). In addition, multiple social, economic and ecological factors of cities influence their distribution and abundance, with varying effects on population genetic patterns, gene flow and connectivity of animals and plants (Blackburn et al. 2011; Panty-May et al. 2016; Gaertner et al. 2017; Santangelo et al. 2022). Nonetheless, our understanding of the dynamics of invasion in urban ecosystems is still limited, particularly on islands.

Among invasive species, rodents are among the most widely introduced vertebrates. Their success is associated with their high reproductive rates, short generation times and dietary plasticity; they can consume organic matter, seeds, fruits, different invertebrate and small vertebrate species, eggs and coastal resources (Miller and Webb 2001; Caut et al. 2008; Gardner-Santana et al. 2009; Clair 2011; Shiels et al. 2013; Banks and Smith 2015). Additionally, their evolutionary history is tightly linked with their commensal ecology (Blackburn et al. 2011; Siahsarvie et al. 2012; Lucaccioni et al. 2016), to such a degree that they are considered the main group introduced to oceanic islands (Pichlmueller et al. 2020). Moreover, their global dispersal is significantly associated with human colonization, expansion and urbanization, also regulated by the intensity of commercial and cultural ties established between human populations throughout their history (Stragier et al. 2022). This is the case of the Norwegian rat (Rattus norvegicus), the black rat (Rattus rattus) and the house mouse (Mus musculus).

The black rat is considered the most damaging invasive rodent on islands (Loiseau et al. 2008). It evolved in the Indian peninsula on southern Asia, spread across Europe via the Roman trade (400 BC–100 AC), and dispersed into America and the islands of the Indian and Atlantic oceans with the explorations of the sixteenth and seventeenth centuries (Calmet et al. 2001; Aplin et al. 2011). While the house mouse originated in the Iranian Plateau and is characterized by three evolutionary lineages that diverged 100,000 years ago: M. musculus castaneus in Asia, M. musculus musculus which colonized central and eastern Europe 3000 years ago, and M. musculus domesticus that established in western Europe and arrived in America, Africa and Australia associated with European expeditions beginning in the fifteenth century (Boursot et al. 1996; Jing et al. 2014; Lippens et al. 2017). There are many genetic studies of black rats and house mice focused on diversification, phylogeographic patterns and invasion routes (e.g. Hardouin et al. 2010, 2015; Aplin et al. 2011; Lack et al. 2013; López et al. 2013; Jing et al. 2014; Gabriel et al. 2015; King 2016), both in continental regions (Bastos et al. 2011; Jones et al. 2011; Lippens et al. 2017; Combs et al. 2018) and in islands (Abdelkrim et al. 2005a, 2005b, 2010; Brouat et al. 2014; Babiker and Tautz 2015). Nonetheless, few studies have investigated their genetic and ecological dynamics associated with urban ecosystems specifically on islands (Gatto-Almeida et al. 2022).

Cozumel is the largest of the Mexican Caribbean islands and has the highest number of endemics (31 taxa), including among mammals two carnivores (Procyon pygmaeus and Nasua nelsoni) and three rodents (Oryzomys couesi cozumelae, Reithrodontomys spectabilis, and Peromyscus leucopus cozumelae; the latter potentially extinct) (Vega et al. 2007; Cuarón 2009; Fuentes-Montemayor et al. 2009; Espindola et al. 2014; Flores-Manzanero et al. 2022). Nearly 75% of the island’s surface is covered with native vegetation (Cuarón 2009). Conservation efforts have rendered two terrestrial (Refugio Estatal de Flora y Fauna Isla Cozumel and Reserva Estatal Selvas y Humedales de Cozumel) and one marine (Parque Nacional Arrecifal de Cozumel) protected areas. Despite this, a growing human population and increased cruise ships tourism have accelerated urbanization, habitat loss, and fragmentation (Trejo de la Paz 2014). Cozumel’s human population was estimated in 84,519 (census of 2020), while more than 4.5 million people (average 2018–2019) visit the island yearly (INEGI 2020). Consequently, numerous exotic species have been, and continue to be, introduced to the island (e.g. domestic cats and dogs, black rat, house mouse, boa, margay, fox), which have become feral or invasive (Cuarón 2009; Vázquez-Domínguez et al. 2012; Valenzuela-Galván et al. 2022).

Originally inhabited by native Maya, Cozumel Island was abandoned around 1570, undergoing a major population decline some decades after the Spanish conquest in 1521 (Haviland 1972). More than three hundred years later, in 1846–1847, people established again in Cozumel because of the indigenous uprising know as Caste War in the Yucatán peninsula, when rebellious Maya people escaped seeking refuge on the island (Santander and Ramos-Díaz 2011). The contemporary demographic growth of Cozumel is divided in two phases, a stationary one since the recolonization until 1950, and the tourism expansion from 1950 onwards. The latter was characterized by a constant population growth due of the intensification of human migration processes and tourism infrastructure (hotels, piers), yielding a markedly disorganized urbanization expansion (Sánchez-Crispín and Propin 2003; Palafox and Zizumbo 2009; Santander and Ramos-Díaz 2011; Trejo de la Paz 2014). Currently, Cozumel is one of the main tourist destinations in Mexico and the busiest tourist port in the Caribbean.

Hence, Cozumel is a unique study system to evaluate the evolutionary and genetic patterns of invasive rodents associated with the history of urbanization pulses and human population growth of the island. To this end, we studied two invasive rodents, the black rat and the house mouse; we assessed genetic diversity and structure, connectivity, gene flow, relatedness and contemporary and ancestral bottleneck signals using microsatellites loci. In addition, given the particularly restricted distribution we found of the house mouse within the city of San Miguel Cozumel, we evaluated the species’ colonization process and connectivity based on demography and a landscape genetics framework. We integrated environmental, anthropogenic and social variables to adequately represent the urban ecosystem. Based on Cozumel’s contemporary history of population growth and extension of tourism infrastructure, we assumed a constant introduction of individuals of both invasive species via the island seaports and, accordingly, predicted that both species will exhibit moderate genetic diversity values and weak bottleneck signals. Additionally, San Miguel Cozumel preserves some remnants of natural vegetation and, despite its small size and relatively high income from the tourist industry, has high economic heterogeneity and social inequality. Hence, we expected the two rodents populations will exhibit spatial genetic structure and low connectivity associated with urban and socioeconomic features. Yet, we predicted a differential response to urbanization for each species owing to their distinct life history and social organization, with a greater genetic differentiation in the house mouse. We show that considering the genetic history of introduction and past demographic events is key for the understanding of invasion processes of commensal rodents on islands.

Materials and methods

Study site and sampling

Cozumel (ca. 486 km2) is an oceanic island of coralline origin that has never been connected to the mainland (Spaw 1978; Semarnat 2016). It is located 17.5 km off the Yucatán peninsula in the Caribbean Sea (20° 16′ 18.2″–20° 35′ 32.8″ N; 86° 43′ 23.3″–87° 01′ 31.1″ W), separated from the mainland by the Cozumel Channel (400 m deep). The island has an urbanised area of approximately 45.4 km2 and can be divided based on human-influence zones as: (i) the city of San Miguel Cozumel, home to 98% of the island’s inhabitants (INEGI 2020); it includes the airport, hotel developments on the north and south of the city, and main residential and commercial infrastructure, (ii) the sanitary landfill on the east, and (iii) several scattered small settlements, e.g. several ranches and El Cedral town (Fig. 1).

Fig. 1
figure 1

Map of sampling points for the house mouse (Mus musculus; red dots), the black rat (Rattus rattus; blue) and where no individuals were captured (yellow). The urban area of San Miguel Cozumel city is shown in purple. Other human-influence zones are indicated, the sanitary landfill and El Cedral town, the maritime terminal (Muelle Fiscal) and the ferry port (Muelle Fiscal de carga), and the Puerta Maya and Punta Langosta piers

We sampled black rats and house mice with Sherman-live traps during September 2015. Sampling was site-directed, intended to capture in sites with likely probability of presence of these invasive species, thus traps were set in 12 neighborhoods within San Miguel Cozumel, plus the sanitary landfill, El Cedral town and a few small human settlements around the island (Fig. 1), baited with a mixture of rolled oats, peanut butter, and vanilla extract. Authorization to set traps was obtained at all localities, from the local authorities (piers, landfield) and private owners (food market and establishments, hotel, nearby houses). The number of traps per site and the number of houses sampled varied by neighborhood (Table S1); traps were set before sundown and collected at sunrise during 2–3 days per site, for a total trapping effort of 984 trap/nights. Trapping was performed with the corresponding scientific collecting permit from Secretaría del Medio Ambiente y Recursos Naturales (Semarnat-FAUT-0168). Use of rodenticides or permanent animal trapping techniques cannot be performed on the island due to the high risk of killing native rodent species, despite the directed sampling scheme. Hence, all animals were euthanized, following the procedures allowed by the Mexican Official Norm (Technical specifications for the protection, care and use of laboratory animals; NOM 2001). Accordingly, each captured animal was transported within the Sherman trap to an outdoor processing site where they were introduced into an anesthetic chamber with a cotton soaked with ethyl ether, the amount estimated based on the size of each individual. Death was verified by examination of each animal. The delay between collection from the trapping site and euthanasia was less than 3 h and the complete anesthesia protocol was assessed by a wildlife veterinarian. Post-mortem ear tissue samples were immediately collected and preserved in Eppendorf tubes with 96% ethanol. We also collected tail bones of run over individuals.

Microsatellite genotyping

We extracted genomic DNA from tissue samples with the Genomic DNA Tissue Miniprep kit (ZimoReserch), following the manufacturer’s instructions. For the black rat we genotyped 10 nuclear loci, eight species-specific: Rr14, Rr17, Rr21, Rr22, Rr67, Rr68, Rr107, Rr114 (Loiseau et al. 2008) and two designed for R. norvegicus: D16Rat81, D5Rat83 (Jacob et al. 1995); we chose 10 specific microsatellites for the house mouse: D5Mit149, D6Mit309, D9Mit54, D13Mit61, D15Mit98, EGG22992, PP4A02, PP7B08, PP10A02, PP10E08 (Dietrich et al. 1992; Babiker and Tautz 2015; Hardouin et al. 2015). Microsatellites were amplified by PCR reactions (see protocol details in Table S2). Fragment analysis was performed with the capillary sequencer AbiPrism3730xl Analyzer (Roy J. Carver Biotechnology Center, University of Illinois, USA) with the molecular weight marker Rox500 size standard. Fragment analysis and genotyping were done with GeneMarker v.2.6.7.

Genetic structure, diversity, relatedness and bottlenecks

To infer genetic structure, we used a Discriminant Analysis of Principal Components (DAPC) (Jombart et al. 2010), a multivariate approach that reflects the genetic variation differences between groups while minimizing variation within them. We used adegenet in R v.2.1.5 (R Core Team 2016), applying the alpha-score method to determine the proportion of successful reassignment by individual in function of the number of retained PCs (black rat: 1 PC; house mouse: 8 PCs). We also performed a spatial genetic analysis with Geneland v.4.0.4 (Guillot et al. 2005), a spatial Bayesian method based in Monte Carlo Markov chains and Poisson-Voronoi tessellations to determine genetic discontinuities; 30 independent runs with 10,000,000 interactions and 100,000 of burn-in were performed, an uncertainty coordinate value of 30 m for the black rat based on its low dispersion in urban zones (Gardner-Santana et al. 2009), while for house mice we used a 20 m value considering the species’ home range in cities (Dean et al. 2006).

The following analyses were performed for the entire population and for the different genetic clusters obtained for each species (see genetic structure Results). We screened for null alleles and stuttering in Micro-Checker v.2.2.3 (van Oosterhout et al. 2004), using 3000 randomizations and adjusting the values for multiple comparisons with a Bonferroni sequential correction. We also tested genotypes for Hardy–Weinberg equilibrium and linkage disequilibrium with the Fisher exact test with 1000 batches and 100,000 interactions in Genepop v.4.0 (Rousset 2008) and Bonferroni correction. Genetic diversity was estimated with Genalex v.6.501 (Peakall and Smouse 2012) as the average number of alleles per locus (Na), number of effective (Ae) and private alleles (Ap), observed (Ho), expected (He) and unbiased expected heterozygosity (HNEI) (corrected for small sample sizes; Nei 1974), and inbreeding coefficient FIS. We assessed relatedness by estimating the likelihood of two individuals belonging to the same group being half-siblings, full siblings, parent–offspring or unrelated, with ML-Relate (Kalinowski et al. 2006).

To evaluate genetic contemporary bottlenecks we used two approaches, a graphical method based on the distribution of allele frequencies in which recently bottlenecked populations are likely to have lost mainly the rare alleles (Luikart and Cornuet 1998). We also tested for heterozygote excess with the program Bottleneck v.5.1.26 (Cornuet and Luikart 1996; Piry et al. 1999) by comparing observed and expected heterozygosity values estimated under three mutational models: infinite alleles (IAM), stepwise mutation (SMM) and two-phase (TPM) with two variants (90% SMM, 10% IAM and 10% variance; 70% SMM, 30% and 10% variance). Models were run with 10,000 replicates and significance was calculated with a Wilcoxon test.

Differentiation and migration

Pairwise genetic differentiation between genetic clusters for both species was estimated with FST and RST with Arlequin v.3.1 (Excoffier et al. 2005). Ancestral migration was assessed with an indirect estimator of gene flow (Nm), based on the allelic differences between genetic clusters (FST), with the formula 4Nm = (1/FST). The number of migrants was estimated both by individuals and by sexes with Arlequin v.3.1. This model assumes a constant population size and symmetric migration rates (Wilson and Rannala 2003). We also estimated contemporary bidirectional migration rate using Bayesass v.3.0. (Wilson and Rannala 2003), with 5 × 106 interactions, 3 × 105 burn-in, and sample rate of 2,000. The delta value was adjusted with various trials for each species, based on which we chose values of 0.25 for the black rat and 0.20 for the house mouse. Chain convergence was evaluated with the Faubet method (Faubet et al. 2007), which calculates the Bayesian deviation based on the posterior probabilities that a certain genotype is possible under the ancestral migration rate and the posterior probability of allocation given the same migration rate (Faubet et al. 2007; Meirmans 2014).

Mus musculus colonization patterns

For our objective to evaluate the colonization patterns and connectivity for the house mouse and explore the landscape genetics of the species, we specifically assessed if the urbanization growth pulses that occurred in San Miguel Cozumel determined the observed genetic structure and genetic diversity changes in the three genetic clusters obtained (SanMi1, SanMi2, SanMi3; see Results). First, we inferred the time, expressed as number of generations, and patterns of colonization with the Approximate Bayesian computation method (ABC) (Beaumont et al. 2002), implemented in DiyABC v2.04 (Cornuet et al. 2014). We evaluated three simple scenarios built based on the history of human population presence on Cozumel, using datasets simulated by coalescent from prior distribution parameters (Bertorelle et al. 2010): (i) Scenario1, we hypothesized a sequential colonization where SanMi1 is the ancestral (introduced) Cozumel population, from which SanMi2 originated and then SanMi3 from this one; (ii) Scenario2, we tested if SanMi2 and SanMi3 originated either as two independent populations from SanMi1 or from two colonization pulses at different times; and (iii) Scenario3, we evaluated if SanMi1 and SanMi3 originated independently from the same unsampled introduced population that colonized Cozumel, while SanMi2 originated from SanMi1 (Fig. 2a). For all introduction and colonization events we assumed short genetic bottlenecks. The house mouse has a generational time of 2–3 months in captivity, but in the wild it has been observed to have on average two litters per year, depending on resource availability (Nachman and Searle 2014; Phifer-Rixey and Nachman 2015).

Fig. 2
figure 2

a Scenarios tested with DiyABC v.2.04 (Cornuet et al. 2014) for the house mouse in San Miguel Cozumel. Scenario 1: sequential colonization where SanMi1 is the ancestral (introduced) population-SanMi2-SanMi3; Scenario 2: SanMi2 and SanMi3 originated either as two independent populations from SanMi1 or from two colonization pulses at different times; Scenario3: SanMi1 and SanMi3 originated independently from the same unsampled introduced population, and SanMi2 originated from SanMi1. b Scenario model comparison performance considering the direct and logistic approaches. c Posterior predictive test of model fit corresponding to Scenario 1

We defined the priors of the divergence time between populations based on the elapsed times between the historical dates when the island was abandoned around 1570 and when people again established in 1846–1847 until 2015 and multiplied by the number of generations in one year. Hence, we defined priors accordingly for introduction of SanMi1, between 336 and 1008 generations. In 1980 Cozumel experienced a first population growth that promoted further urbanization, based on which the prior for SanMi2 was set between 70 and 210 generations; finally, another population growth wave started in 2000, so the prior for SanMi3 was between 30 and 90 generations (see below). We modeled the effective population size applying the default values (Brouat et al. 2014; Cornuet et al. 2014) and applied the generalized stepwise mutation model for microsatellite data (Estoup et al. 2012). We generated 6 × 106 simulated datasets per scenario, and scenarios were compared with the logistic and direct approaches using the following summary statistics: mean of number of alleles, genetic diversity and size variance within and between populations (Estoup et al. 2012; Suárez-Atilano et al. 2019). Confidence in the most supported scenario was evaluated by modeling 500 pods (pseudo-observed data) from the prior distribution to estimate the false allocation rate, where Type I error (i.e. probability in which the true scenario is rejected) is the proportion of pods for which the scenario does not have the highest posterior probability; and Type II error (i.e. probability of accepting a scenario when it is not true), is the proportion of pods of which the scenario with the highest posterior probability is not the scenario evaluated (Estoup et al. 2012). Based on the number of generations that coalesce supported by the best model (see Results), we estimated the dates of colonization using 4 and 6 years for the rodent’s generation time, considering that high food availability favours a high reproductive rate.

Landscape genetics and connectivity

We generated environmental and anthropogenic variables at a resolution of 30 × 30 m. For the environmental variables, we used Landsat 8 images to calculate the normalized vegetation index (NDVI) in QGis 3.8, where low NDVI values indicate impervious land surface (i.e. man-made constructions, roads; Zhang and Ye 2021). For the anthropogenic variables we used the National Housing Inventory 2015 data (INEGI 2015) to generate three indices that measure marginalization, overcrowding and commercial businesses. The marginalization index was based on the number of essential services to which the population has access (concrete floors, running water, sewer system, sanitation services and street lighting); the overcrowding index represents the percentage of housing where more than three people share a room; and the commercial businesses denote the percentage of neighborhoods that have informal sector businesses in none, some, or all of their surrounding roads.

To obtain a measure of genetic differentiation for the house mouse across its sampled distribution, we estimated the proportion of shared alleles, an individual-based genetic distance that has no biological assumptions and can be used for inbreeding populations (Shirk et al. 2017), using adegenet. To generate resistance values for our environmental and anthropogenic surfaces we followed the optimization framework developed by Peterman (Peterman et al. 2014), which uses maximum-likelihood population effects mixed models. We optimized each resistance surface with the function commuteDistance in gstudio v.1.5.2 in R (Dyer 2014), with three independent runs to verify parameters convergence. We determined the best-supported features by AICc (Akaike’s information criterion corrected for small/finite sample size; Akaike 1974). After running each surface individually, we applied a multivariate approach for which we built a set of hypotheses considering socioeconomic features most likely relevant for the distribution of the house mouse (Table 1).

Table 1 Composite hypotheses tested of features that potentially affect connectivity of the house mouse (Mus musculus) within the city of San Miguel Cozumel

Results

A total of 48 black rat individuals (25 male and 17 females; six undetermined) and 59 house mice (37 males and 17 females; five undetermined) were captured (Table S1). The microsatellite locus D5Rat83 in black rats was monomorphic and was excluded from analyses. There was no evidence of linkage disequilibrium in the black rat while in the house mouse it was only significant when the entire island was evaluated as a population (D5Mit149-D9Mit54 and D5Mit149-PP7B08) and for the genetic cluster SanMi2 (D13Mit61-EG22992). Five loci for black rats (Rr14, Rr17, Rr22, Rr68, Rr107) and four for house mice (D5Mit149, D15Mit98, PP7B08, PP10E08) deviated significantly from Hardy–Weinberg equilibrium after Bonferroni correction. Only Rr14 in black rats and eight loci in house mice (except D6Mit309 and D13Mit61) showed evidence of null alleles, though no locus was shared between genetic clusters, thus all loci were considered for further analyses. Genepop files for the black rat (Table S3) and the house mouse (Table S4) microsatellite genotypes are included in Supplementary information.

Genetic patterns

Geneland results identified three genetic clusters (K = 3) for the black rat (Fig. 3), which correspond to San Miguel Cozumel (Coz1), the sanitary landfill and El Cedral (Coz2), and the hotel development on the southwest (Coz3); in contrast, with the DAPC we found two distinctive groups, where Coz2 and Coz3 form a single cluster. All subsequent analyses for the black rat considered the three genetic clusters from Geneland. For the house mouse both Geneland and DAPC identified three clusters, all distributed within San Miguel Cozumel, where SanMi1 is located on the northeast and includes the sanitary landfill, SanMi2 in the northwest, and SanMi3 in the south (Fig. 4). Genetic diversity results showed a total of 56 alleles for the black rat (Table 2), with an average of 6.5 alleles per locus and moderate heterozygosity (0.601). Coz3 was the cluster with less genetic diversity. Regarding the house mouse, diversity was markedly higher with a total of 100 alleles (10 per locus), where SanMi3 exhibited the lowest diversity values (Table 2). All clusters for both species showed positive FIS values indicating inbreeding. Notwithstanding, relatedness results showed a high percentage of unrelated individuals (83.4% and 86.7%, black rat and house mouse respectively) (Table 2). The ancestral bottleneck test was not significant under any mutational model, while the contemporary bottleneck analysis showed rare alleles in low proportion for Coz3 and SanMi3, suggesting a bottleneck signal or a result of low sample size (Table S5, Fig. S1).

Fig. 3
figure 3

a Venn diagram for the black rat showing private and shared alleles between genetic clusters identified with Geneland v.4.0.4 (Guillot et al. 2005). b Scatter plot based on the Geneland clusters as priors and the a-score plot used to determine the number of PCs (1 in this case) and discriminant functions to retain for the DAPC. c Map of the study site showing the distribution of the three genetic clusters considering the isocline = 0.5. d Posterior probability maps belonging to each Geneland genetic cluster where yellow colors depict the high probability isoclines

Fig. 4
figure 4

a Venn diagram showing the house mouse private and shared alleles between genetic clusters identified with Geneland v.4.0.4 (Guillot et al. 2005). b Map of the study site showing the distribution of the three genetic clusters considering the isocline = 0.5. c Scatter plots based on the Geneland clusters as priors and the a-score plot used to determine the number of PCs (8 in this case) and discriminant functions to retain for the DAPC. d Posterior probability maps belonging to each Geneland genetic cluster where yellow colors depict the high probability isoclines

Table 2 Summary of genetic diversity and relatedness for the black rat and the house mouse by genetic clusters and global

Ancestral and contemporary migration

Pairwise genetic differentiation between clusters was moderate to high (black rat FST = 0.107–0.309, RST = 0.086–0.52; house mouse FST = 0.094–0.184, RST = 0.122–0.274 p < 0.05) (Fig. 5a); the higher differentiation was observed between Coz2-Coz3 and SanMi1-SanMi3, respectively. Ancestral migration showed the highest exchange of individuals between Coz1-Coz2 (2.084) and SanMi1-SanMi2 (2.39) (Fig. 5b). Distinct patterns were observed for each species regarding differential migration between sexes, where black rat females disperse more and, contrastingly, house mouse males showed higher migration than females (Fig. 5a). Whereas contemporary bidirectional migration results showed extremely low migration rates in black rats where at least 95% of individuals are permanent residents; the same was observed for house mice, where only the SanMi2 showed a comparatively lower permanent resident rate (0.68), while receiving a high proportion of migrants from SanMi3 (0.30) (Fig. 5c).

Fig. 5
figure 5

a Pairwise genetic differentiation between genetic clusters based on FST (red) and RST (blue) estimators for the house mouse and the black rat. b Ancestral migration rates (Nm) based on the allelic differences between genetic clusters (FST). The thickness of the lines and arrows proportional to the intensity of migration between clusters. Color lines indicate the values for females (pink), males (blue), and global (black). c Bidirectional contemporary migration rates estimated with Bayesass v.3.0 (Wilson and Rannala 2003); proportion of non-migrant individuals by generation per genetic cluster (within the ovals) and direction of migrant individuals by generation between genetic clusters (next to each arrow) are shown

House mice colonization patterns and landscape connectivity

Model comparisons resulting from the Approximate Bayesian computation showed that Scenario1, in which the sequential colonization was tested, had the highest probability based on both the direct (P = 0.468, 95% confidence interval (CI) 0.031–0.905) and the logistic (P = 0.964, (CI) 0.929–0.994) approaches (Fig. 2b). The confidence test for the best scenario showed that type 1 error was 0.134 for the logistic approach. We performed a pre-evaluation of simulated scenarios via PCA, and results indicated that the observed data fall within the cluster of simulated data points. In addition, the goodness-of-fit test using the posterior predictions exhibited that summary statistics from the simulated datasets based on Scenario 1 closely matched the empirical data (Fig. 2c). Hence, based on the best model results and the generation times (4–6 reproductive events by year), we estimated the date of colonization events; accordingly, SanMi1 arrived at the island 676 generations ago (1846–1902), the divergence between SanMi1 and SanMi2 occurred 247 generations ago (1954–1974), and SanMi3 was founded 93 generations ago (1991–1999).

Results of model selection of the optimization resistance surfaces (environmental and anthropogenic) showed that the best-supported univariate model was geographic distance, identified as the top model 99% of the time, followed by NDVI (Table 3). The optimized NDVI surface assigned high resistance at values above zero that correspond to vegetated areas, and low resistance to bare landscape or urbanizations (Fig. S2). The linear mixed-effects models test of the composite resistance hypotheses showed that the best-supported multivariate model was Development (100% of the time; Table 4), where NDVI contributed the most to the model (88%).

Table 3 Univariate optimized model selection for the house mouse from the city San Miguel Cozumel, based on maximum-likelihood population effects mixed models
Table 4 Results for the optimized multivariate surfaces model selection for the house mouse

Discussion

Genetic diversity, invasion models and the genetic paradox of biological invasion

As a result of the reduced number of individuals with which introduced populations are established (founder effect), populations are expected to exhibit low genetic diversity, an effect that is theoretically stronger on islands (Frankham 1999, 2005). However, in most invasive species this does not happen, which is known as the genetic paradox of biological invasion (Frankham 2005; Lawson et al. 2011; Estoup et al. 2016). In agreement with this paradox, the black rat showed moderate genetic diversity levels (Ho = 0.44, He = 0.59, Na = 6.5), with higher levels in the house mouse (Ho = 0.56, He = 0.76, Na = 10). These values are within the range found in other islands, like the black rat in Guadeloupe (He = 0.69, Na = 5), Madagascar (He = 0.72) and New Zealand (He = 0.75, Na = 5.7) (Abdelkrim et al. 2005a, 2005b, 2010; Brouat et al. 2014), as well as the house mouse on islands of New Zealand (He = 0.51–0.56), Kerguelend (He = 0.40, Na = 2.8), Madagascar (He = 0.67, Na = 9.7) and Cyprus (He = 0.77, Na = 9.7) (Hardouin et al. 2010, 2015; Pichlmueller et al. 2020), despite the comparatively small size of Cozumel island. Interestingly, continental populations, some within cities, show similar diversity values (Konecny et al. 2013; Mangombi et al. 2016; Stragier et al. 2022).

Such levels of genetic diversity can be associated with the historical and demographic processes experienced by both the source and founder populations (Estoup and Guillemaud 2010), namely the likely short ancestral bottleneck supported by our analyses, followed by rapid population growth, and coupled with multiple introduction events facilitated by the constant arrival of commercial and, more recently, cruise ships. In fact, a high number of private alleles as observed in the Cozumel populations may be the product of constant introductions of individuals of different origin, incorporating different genotypes into the genetic pool of the resident population and increasing allelic diversity (Estoup and Guillemaud 2010; Lawson et al. 2011). This incorporation of new genotypes is favored by the ‘invasive meltdown model’, which establishes that ecosystems are more easily invaded as they are modified by previously arrived invasives, increasing population density (Simberloff and Von Holle 1999; Prenter et al. 2004; Braga et al. 2018). In addition, the ‘invasive bridgehead effect’ refers to secondary invasions stemming from a particularly successful invasive population, which serves as a source of colonists for potentially new territories, increasing their probability of successful establishment (Estoup and Guillemaud 2010; Lawson et al. 2011). These scenarios apply for most human settlements, as is the case in Cozumel, where the likely continued introduction of individuals of different geographic sources into the island entails a strong propagule pressure, rendering high allelic diversity and the effective establishment of migrants. Similar examples include the house mouse in the Azores islands, which shows high allelic diversity resulting from multiple origins identified from southern Spain, Portugal, Finland and the Canary Islands (Gabriel et al. 2015).

Independent colonization events from different lineages can be another factor influencing allelic differentiation and structure, as shown by the house mouse in the Kerguelen archipelago and the Canary Islands, where the observed genetic diversity derives from the invasion of two distinct lineages (López et al. 2013; Hardouin et al. 2015). Likewise, the black rat in Madagascar comprises two genetic groups product of two independent introduction events from the same ancestral population (Brouat et al. 2014). Our results do not support an independent colonization origin for the Cozumel clusters; instead, more recent social and demographic processes may also be contributing to the currently observed genetic diversity and structure.

Genetic structure, urbanization and human population growth

As we predicted, the two rodent species exhibit a clear genetic structure. In our fine-scale genetic study, the three clusters identified for the black rat correspond with three areas of the island with different anthropogenic impact (Fig. 3), Coz1 comprises San Miguel Cozumel, Coz2 on the south of the island includes the sanitary landfill and the small El Cedral town; and Coz3 is located on the southwestern hotel zone, adjacent to the city, characterized by a combination of highly developed infrastructure surrounded by partially preserved forest remnants. These structure patterns contrast with what has been observed in other studies, for instance black rats did not show genetic differentiation in the Puketi Forest, New Zealand despite the marked habitat differentiation and significant geographic distances between samples sites (Abdelkrim et al. 2010), neither in Franceville, Gabon, a small African tropical city that has undergone a recent growth, regardless of the composite landscape including urban forest remnants and savannah (Mangombi et al. 2016).

Three genetic groups were also identified for the house mouse, tightly differentiated within San Miguel Cozumel (although SanMi1 also includes the sanitary landfill; Fig. 4). These genetic clusters are limited both by roads and high traffic avenues and by the margins of the urbanized area. There are numerous examples in which avenues or highways constitute important barriers, for example in Salvador, Brazil where gene flow of the Norway rat is limited by a main avenue (Richardson et al. 2017). Furthermore, the house mouse genetic structure is likely associated with the human population growth pulses accompanying the urbanization of San Miguel Cozumel. The spatial distribution of SanMi1 coincides with the limits of the city before 1970, when the tourism development began, and the first large-scale contemporary human migration took place. Also, the site where the sanitary landfill is currently located was initially an open-air dump, thus the constant transport of waste trucks from the city to the dump prior to 1970 (Trejo de la Paz 2014) can explain why the SanMi1 cluster extends to this area. As the urban sprawl grew, the house mouse was able to disperse and establish new populations, so that SanMi2 includes all the new neighborhoods built between 1970 and 2000. Last, SanMi3 is distributed in the neighborhoods most recently established since 2000 onwards. Outstandingly, this relationship between the species genetic structure and urban growth coincides with the ancestral migration rate; that is, gene flow between the oldest clusters (SanMi1 and SanMi2) is greater compared to that between them and the most recent cluster (SanMi3). This sequential pattern also matches the genetic diversity of each cluster. Indeed, the ABC analysis supported our Scenario1 hypothesis of sequential colonization, where SanMi1 is the ancestral Cozumel population, which harbors the highest genetic diversity; from which SanMi2 originated and, more recently, SanMi3 that has the lowest genetic diversity. Loss of genetic variability through sequential colonization routes due to successive bottlenecks is frequent, as evidenced by the house mouse invasion in Senegal (Lippens et al. 2017).

The demographic growth of Cozumel is divided in two phases, a stationary one since the recolonization until 1950, and the tourism expansion from 1950 onwards, characterized by a constant population growth due of the intensification of human migration processes (Sánchez-Crispín and Propin 2003; Palafox and Zizumbo 2009). In agreement with the spatial location of the genetic clusters, our best demographic model indicated that the initial colonization of Cozumel Island occurred in SanMi1 between 1846 and 1902. In 1846, when the Mayan farming refugees from the east of the peninsula moved to Cozumel and founded San Miguel and El Cedral. The arrival of Mayan people to the island continued until 1913, thus these historical processes could be linked to the initial introduction of both invasive rodents. The origins of SanMi2 occurred between 1954 and 1974 coinciding with the initial urbanization pulse driven by tourism. Diving and fishing tourism were promoted in 1950 but is not until 1970 that a trust is created to promote tourist and economic development, initiating the construction of the northern hotel zone coupled with a strong demographic growth (Santander and Ramos-Díaz 2011). Finally, the colonization of SanMi3 occurred between 1991 and 1999, when Cozumel joined the Caribbean cruise network (Trejo de la Paz 2014).

With this in mind and considering that Cozumel is the main port of the Mexican Caribbean, the high number of exclusive alleles in the different genetic clusters is highly relevant. Coz1 had the highest number of private alleles (17) for black rats, and SanMi1 (34) and SanMi2 (20) for house mice. A note of caution is that the low sample sizes for Coz3 and SanMi3 might underestimate the number of private alleles. Nonetheless, the clusters with highest number of private alleles occur in strategic areas of commercial exchange and tourism, supporting the hypothesis of a constant introduction of individuals to Cozumel. SanMi1, located in the oldest city square, also includes the maritime terminal (Muelle fiscal) and the cargo ferry port, the main passenger and commercial connections between the mainland and the island until the 1980s, when it started to receive cruises and two grand cruise piers were built, Puerta Maya (1997) and Punta Langosta (2000) (PRORED 2008), the latter also within SanMi1. Puerta Maya is adjacent to the southwestern hotel zone where SanMi3 and Coz3 are situated (Fig. 1). Further phylogeographic and genetic assessments are needed to more finely reconstruct the routes of invasion of these rodents into Cozumel, but we hypothesize that Coz1 and SanMi1 share a genetic ancestry related to mainland individuals, particularly from Playa del Carmen that has historically been the region with the highest interchange with the island. More recent introductions would certainly include lineages from the various Caribbean ports that constitute the main Caribbean cruise route, i.e. Belize, Roatán (Honduras), Limón (Costa Rica), Colón (Panama), Santa María (Curaçao), Montego, Falmouth, Ocho Rios (Jamaica), and Grand Cayman ports (Hernández-González and Villaseñor-Franco 2020).

The lack of ancestral and contemporary bottleneck signals in most of the genetic clusters for both species suggests a brief bottleneck after the introduction, a common pattern in species with high reproductive rate, short generation time and strong propagule pressure. In addition, the continuous introduction of individuals, from the same or different geographic origin, contributes to erasing the initial bottleneck signal and increases the global genetic diversity (Suárez-Atilano et al. 2019). The house mouse can establish rapidly, as evidenced by the experimental introduction of two founders (one female and one male) on Saddle Island, New Zealand where the system’s load capacity was reached in only six months (Nathan et al. 2015). Likewise, the Norway rat could rebound population sizes after strong bottlenecks (Richardson et al. 2019). Yet, contemporary bottleneck signals were detected in SanMi3 and Coz3. As mentioned, the colonization of SanMi3 is relatively recent, thus likely still echoing the bottleneck signal, whereas such signal in Coz3 (located along the southwestern hotel zone) can be associated with the permanent pest control carried out by hotels, causing population size reductions and decreasing genetic variation. Indeed, significant loss of genetic variation related to control strategies has been documented in Poirier Island and in Salvador, Brazil, because of an eradication plan for black rats (Abdelkrim et al. 2010) and after urban rat control campaigns (Richardson et al. 2019), respectively. However, this would require further assessment because the SanMi3 and Coz3 bottleneck signals could also be associated with their low sample size.

Ancestral and contemporary migration patterns

The dispersal capacity of a species depends, among others, on body size, morphology and physiology, as well as on biotic and abiotic factors like resource availability, habitat quality and geographic barriers. Dispersal can also vary between sexes as seen in birds and mammals, where the differential response of males and females relies on population density, competitive ability and habitat quality, among others, as a strategy to avoid inbreeding (Globert et al. 2012).

Rattus species are considered mostly philopatric, i.e. tending to return to or remain near a particular site or area, observed in females as it is more common (Gardner-Santana et al. 2009). In our study in Cozumel, the black rat showed differential ancestral migration with higher values in females, which could be associated with their social structure, where males have a strong hierarchy and control territories through which the females ‘freely’ roam; in fact, female dispersion is food-determined and shapes male dispersion (Hooker and Innes 1995). Accordingly, in the past females could have dispersed to sites with high resource availability, in agreement with the high gene flow between Coz1 and Coz2. However, contemporary gene flow showed extremely low migration between clusters. If the system is saturated (e.g. low resources, high competition) that would prevent the arrival of new migrating individuals. Only Coz3 receives some migrants from both Coz1 and Coz2, which could be explained by the population control measures implemented by hotels in this area, temporarily reducing competition for resources and allowing new migrants to arrive.

House mouse individuals tend to be nomads if population densities are low, but at high densities males are territorial (Hulme-Beaman et al. 2016). This behavior patterns might have facilitated the presence of the three genetic groups within San Miguel Cozumel, with overall low ancestral migration. Contrary to the black rat, migration in the house mouse was higher for males, concordant with their social system where dominant males control and monopolize feeding sites, shared with several females and some subordinate males (Boursot et al. 1996; Dean et al. 2006). However, because population densities are usually high, such social hierarchies do not last long and males migrate constantly. As a result, populations behave panmictically on a regional scale but are structured on a finer scale (Morgan et al. 2022). Notably, as females are philopatric and do not move freely between territories, mitochondrial DNA-based studies have revealed that populations can be resistant to secondary female invasions (Gabriel et al. 2015), also explaining the observed low migration rate in this species.

Urban drivers of connectivity in the house mouse

Connectivity analyses allow identifying key landscape elements needed to maintain genetic diversity and evolutionary processes. In commensal species they also help to elucidate dispersal and distribution patterns of invasive species (Pichlmueller et al. 2020; Stragier et al. 2022). The most significant environmental feature influencing house mice connectivity was geographic distance, while the next best supported model was vegetation (NDVI). High and continuous vegetation cover is a good predictor of connectivity in natural environments, for example in rodents like Peromyscus melanotis (Borja-Martínez et al. 2022), and in cities with non-commensal species like Peromyscus leucopus (Munshi-South 2012; Munshi-South and Nagy 2014) and the lizard Podarcis muralis (Beninde et al. 2018).

Contrastingly, as we predicted for this commensal species, we found high connectivity for the house mice along impervious land surfaces, i.e. the urban landscape, indicated by the low NDVI values recovered by the optimization surfaces. Similarly, Stragier et al. (2022) found low gene flow in this species associated with highly vegetated areas in Dakar. Additionally, Fusco et al. (2021) highlight that the variables analyzed in urban landscapes under a landscape genetics approach generally include human population density, roadways, infrastructure, impervious surfaces and industrial development, while social, cultural or political factors are rarely used. Indeed, incorporating socioeconomic variables was key in our approach. The best-ranked model in the multivariate analysis was our Development hypothesis, which considers the interaction between NDVI and Commercial businesses, further supporting that house mice connectivity is tightly linked with urbanization. The information used to build the Commercial businesses layer included all the informal commerce across the city, most of which is dedicated to street food selling. Since they produce abundant waste, we used this as a proxy of resource availability for the house mouse. Interestingly, Combs et al. (2017) show that low food waste availability decreased the genetic diversity and gene flow in another invasive rat, the Norway rat in New York.

Genetic information and invasive rodent management

The lack of biosecurity measures to prevent the arrival of invasive species to Cozumel is evidenced by our results suggesting a constant introduction of invasive rodents. This, together with the environmental and social complexities of large, inhabited islands, make rodent control (sustained reduction), rather than eradication (complete removal, feasible on smaller islands) realistic (Holmes et al. 2023).

Information regarding genetic clusters can help defining meaningful spatial units for control management (Combs et al. 2018; Gatto-Almeida et al. 2022). Indeed, we detected different genetic groups and low connectivity for each species, determined by urban and socioeconomic features. Accordingly, we located areas that could be targeted with different management strategies. Constant monitoring should be performed in high-risk locations like piers and docks (both off and on island), and in urban sites (sanitary landfill, informal commerce); routinely examining the terrestrial and maritime transport vehicles should also be implemented. Importantly, the significant association we identified of the house mouse with impervious surfaces likely limits the chances of a secondary invasion to natural environments. We highlight that in our studies of rodent population ecology and genetics on Cozumel, performed in natural vegetation and conserved areas over many years, we have not trapped black rats nor house mice (see Vega et al. 2007; Fuentes-Montemayor et al. 2009; Espindola et al. 2014). Whether these invasive rodents are indeed absent from natural areas or present in low abundance is not clear. Nonetheless, improving urban planning to regulate future expansions of San Miguel Cozumel is of the outmost importance to prevent these invasive species to disperse further on the island. Likewise, it is imperative to make authorities and people aware of the negative effects of these invasives on the native biota and of the associated high potential health risks.