Introduction

Freshwater systems in arid and semi-arid zones present a unique richness of fishes, with different abundance, habitat selection, and distribution (Ceballos et al. 2018), that depend on their spatial ecology (Winemiller et al. 2008). In these ecosystems, the ichthyofauna is usually geographically isolated, often restricted to a single spring (Contreras-Balderas and Lozano-Vilano 1993; Palacio-Núñez et al. 2010a), presenting adaptations to the ecological conditions of the habitat (Albrecht and Gotelli 2001; Poff et al. 2003; Wisz et al. 2012). This leads to particular geographic patterns in response to the physical, chemical, and biological variables of the site (Contreras-B and Lozano-V 1994; Olden et al. 2002). However, natural and unnatural hydrological disturbances (e.g., drought, tourism pressure, underwater vegetation composition) change the ecosystem conditions in space and time and the dynamic patterns of the fishes (Ruetz III et al. 2005). Therefore, it is of utmost importance to understand the ecological and spatial dynamics of species populations (Engen et al. 2018).

In the arid and semi-arid zones of Mexico, springs with variable habitats have been recorded harboring a singular richness of endemic fishes with specific ecological affinities and spatial patterns that depend on habitat conditions (Contreras-Balderas and Lozano-Vilano 1993; CONABIO 2020). A clear example is the Rioverde plain, located in the northeastern portion of the country, in which there are scattered springs, with a richness of endemism. The Media Luna spring stands out for its larger water surface and popularity (Miller et al. 2005). This spring has a special landscape, with crystalline thermal water that attracts thousands of tourists each year. Thus, attention must be drawn both to the possible structural changes of this freshwater ecosystem and the pressure suffered by fish populations (Galván-Meza et al. 2018), as has been indicated for other sites (e.g., Hickley et al. 2004). This need for attention has motivated studies of the spatial ecology and conservation of the endemic ichthyofauna at this site (e.g., De la Vega Salazar 2009; Miller et al. 2005; Palacio-Núñez et al. 2010a).

In particular, the Media Luna spring is the habitat of four exclusive endemic fishes of the Rioverde plain (Ceballos et al. 2018) and is the main reservoir of the endemic goodeid, Ataeniobius toweri (Meek, 1904) (Palacio-Núñez et al. 2010b). This species is listed as endangered (IUCN 2021) because its population has been declining in the present century (Contreras-MacBeath et al. 2014; Lyons et al. 2019). In terms of its affinity for in-spring habitat, this species swims in open water above dense underwater vegetation or among stems (Palacio-Núñez et al. 2010b). Its greatest activity is recorded at a depth gradient of less than 1.0 m (Miller et al. 2005). However, it has been observed at depths slightly greater than 2.0 m (pers. obs.). Despite this information, there is a lack of understanding of the degree of affinity for habitat conditions and the sites of greatest occurrence of A. toweri. Also, the affinity for particular habitat conditions and sites can vary by life stages (Brosse and Lek 2002; Zhang et al. 2020), such as adults and juveniles, for this and other fish species (e.g., Brandt 1980).

The ecological response curves and spatial distribution models are suitable for correlating species occurrence with habitat variables (i.e., Miller 2010; Mateo et al. 2011). They are also useful for predicting spatial patterns and selection of habitat conditions (Benito and Peñas 2007; Lobo et al. 2015); especially when grouping by life stages. This, together with estimating relative abundance indexes from the records and analyzing different periods, makes it possible to study the variability of detection of the species (Martin-Garcia et al. 2022).

The studies that take into account wide-ranging periods to record species occurrence and habitat variables are fundamental to explaining the ecological and geographic scenario of the species (Ruetz III et al. 2005; Elith and Leathwick 2009). This makes it possible to establish exhaustive biomonitoring of site variables and species of interest, particularly endemic species, to generate ecological-spatial models (Joy and Death 2002; Maloney et al. 2013) that closely match the ecological reality of the species. This supports leading conservation management decisions based on scientific evidence (Pullin et al. 2004).

With this approach, we present a contribution to the ecological-spatial knowledge of A. toweri in the Media Luna spring in Mexico. We aimed to estimate its relative abundance and to model A. toweri’s ecological response curves and spatial distribution by life stage, in the summers of 1999, 2009, and 2019, in the Media Luna spring, Mexico. Based on the literature reviewed (e.g., Miller et al. 2005; Palacio-Núñez et al. 2010b), we hypothesized that A. toweri exhibits temporal variation in abundance, habitat selection, and distribution patterns under structural and compositional changes within its natural habitat. Therefore, we evaluated the response variable of occurrence to determine the temporal variation of this endemic species. We used water depth and underwater cover as predictor variables for modeling. This approach was necessary to understand the suitable habitat conditions, geographic patterns, and population status for this species, in specific ecological and spatial scenarios, on a temporal scale.

Methods

Study site

The Media Luna spring is located in the semiarid plain of Rioverde, municipality of Rioverde, in the state of San Luis Potosí, northeastern Mexico. This spring rises from a main lagoon, where six craters are located at different depths (36.2 m is the maximum recorded; Fig. 1), from which an average of 4.35 m3 s−1 of very crystalline thermal water flows (Galván-Meza et al. 2018). Three main canals continue from this lagoon, where the depth does not usually exceed 2.5 m. In this natural system, there is a wide extension of underwater vegetation coverage, dominated by the water lily Nymphaea ampla (Salisb.) DC. 1806. Due to the ecological singularity of the spring and the concern about damage caused by increased tourism, it was decreed as a natural protected area under the modality of a State Park on June 07, 2003 (Periódico Oficial del Estado Libre y Soberano de San Luis Potosí 2003).

Fig. 1
figure 1

Geographic location of the Media Luna spring, Mexico. a shows the distribution of Ataeniobius toweri inside the Rioverde plain (Koeck 2019) and the location of the study site; there are marks for the main lagoon, where the six craters are located, and the three main canals (above square). The digital bathymetric model is used as a base map for the water spring surface; it was based on Palacio-Núñez et al. (2010b). b shows a general view of the study site location at the country and state levels

Occurrence records and relative abundance

We assessed the temporal variation in the relative abundance, habitat selection, and spatial distribution of A. toweri over 20 years, from 1999, when the first sampling was carried out, until 2019. For this purpose, we selected three study replicates (years 1999, 2009, and 2019), separated by 10 years. This avoids assumptions of uniform animal distribution and biased temporal variation (Ruetz III et al. 2005). In addition, we selected only records from summer periods, because we studied variations when tourism increased at the site, which we consider to influence the natural habitat of this endemic fish.

The records of A. toweri were obtained from two sources. For the summer of 1999, data were obtained from the literature (Palacio-Núñez et al. 2010b), and for 2009 and 2019, data was obtained by our research group (Rössel-Ramírez et al. 2023a). These records were categorized into adult and juvenile life stages. For correct identification of life stage, based on body size, the dataset was verified with information from previous studies on the life history of A. toweri (e.g., Miller et al. 2005; Palacio-Núñez et al. 2010b; Ceballos et al. 2018). It should be mentioned that all data in each period was recorded from 66 underwater transects, evenly distributed along the spring surface (e.g., Palacio-Núñez et al. 2010b). For the positioning of these transects, the system was sectorized (14 sectors; Fig. S1), as proposed by Palacio-Núñez et al. (2010b). The criteria for sector boundaries were based on underwater coverage within the water depth, as well as the anthropic impact degree (Table S1). Since 1999, sectors 1 (S1) and 10 to 14 (S10 to S14) showed little or no underwater vegetation, mainly due to tourism activity, such as diving or swimming. These activities damage or eliminate the vegetation, in addition to dispersing native ichthyofauna (Maloney et al. 2013; Galván-Meza et al. 2018). In contrast, sectors 2 to 9 (S2 to S9) had a higher density of macrophyte coverage of N. ampla, which grows as emergent, sub-emergent, and floating vegetation (Palacio-Núñez et al. 2010b; pers. obs.).

Using a standardized method, in 1999, 3 or 6 transects were designed for each sector, as a function of the length and width of the canal; each transect replicate occurred at random habitat conditions and was placed from the shore to the center of the canal. In the following two summer periods, the number of transects was the same and the location coordinates were kept unchanged to carry out this spatio-temporal study. In each transect (dimensions: 2 m width × 10 m length), all occurrence records were obtained by direct observation, which included photographic documentation since 2009. Thus, no specimens of this species were handled, damaged, or captured (e.g., Prchalová et al. 2009).

Subsequently, using databases for summer periods, we first performed some filtering, following the method of García-Roselló et al. (2014), to identify spatial outliers or incomplete data (which were not found) or to exclude records with duplicate coordinates. With the remaining records, we also performed spatial filtering with the QGIS® software (i.e., excluding records outside of the study area or with spatial unlikelihood) and environmental filtering (i.e., normality test) in Rstudio®. The number of records in the final database for 1999 consisted of 407 adults and 189 juveniles; in 2009, there were 28 adults and 11 juveniles in the database, and in 2019, 80 adults and 71 juveniles (Fig. S2). From these records, we estimated the relative abundance (RA) to describe the temporal variation of the population by summer period and life stage in each sector. We plotted these estimations in Rstudio® using the ggplot2 package (Wickham et al. 2019).

Predictor variables selection

As predictor variables, we used water depth (WDp) and underwater coverage (UC) from previous literature reviews on freshwater fish distribution and habitat selection (e.g., Rozas and Odum 1988; Joy and Death 2002; Prchalová et al. 2009) and from our field observations. Particularly, WDp has a significant effect on the richness, abundance, and distribution of fish populations (Prchalová et al. 2009). In addition, UC conditions influence food and shelter for the fish (Rozas and Odum 1988; Brosse and Lek 2002), mainly in early life stages. Both variables are also related to water temperature (Brandt 1980; Eby et al. 2003), dissolved oxygen, and primary biomass productivity (Brosse et al. 1999; Del Río et al. 2019).

Given the information from both WDp and UC, we obtained a digital bathymetric model for WDp and a raster with the classification of UC in the three summer periods (Rössel-Ramírez et al. 2023b). Each raster was processed and rasterized with different spatial information by the summer period. The WDp variable was interpolated by the Inverse Distance Weighted method of Bartier and Keller (1996), with a P distance coefficient of 2.0. All raster layers had a range from 0.0 to 36.2 m depth. UC was reclassified using a supervised method, according to NASA’s accuracy methodology for coverage classification (Kalluri et al. 2003). This reclassification included the categories of UC proposed by Palacio-Núñez et al. (2010b) and Rössel-Ramírez et al. (2023b) for the Media Luna spring, based on the presence of macrophyte vegetation or bare soil. The vegetation coverage, formed by N. ampla, was classified as (1) small carpet (Sc), plants with a height below 0.3 m and heart-shaped leaves; (2) big carpet (Bc), plants with stems of variable length and ovoid leaves that do not reach the water surface; and (3) mature shape (Ms), with floating leaves and characteristic white flowering. Also, the isolated remnants of vegetation were (1) big carpet patch (Bcp) and (2) mature shape patch (Msp). The bare soil was classified as (1) natural bare soil (Nbs), (2) bare soil by tourism (Bst), and (3) bare soil by depth (Bsd).

For both variables, we downloaded the raster layers with the same geographic projection in WGS 84/Pseudo-Mercator (EPSG: 3857) and the same extent for each summer period. Also, we retained the Ground Sample Distance of 0.1 m (e.g., Joy and Death 2002; Guisan et al. 2007).

Relative contribution and multicollinearity of predictor variables

Using the statistical software R® and Rstudio® (Rstudio team 2020), we extracted the pixel values of each variable according to the location of the occurrence points (i.e., response variable) and generated a new database with this information. Then, we estimated the contribution percentage for both predictors in each summer period using a Gradient Boost Machine model with the gbm package (Freund and Schapire 1997; Friedman 2002). This model allows us to estimate the relative influence of explanatory variables, as a function of the response variables (Dedman et al. 2017). Before running the model, the variables and the filtered occurrence records were loaded into Rstudio®. Then, we followed this parameterization: (1) Gaussian distribution with a cross iteration of 5000 repetitions, (2) random training (80%) and test (20%) groups from the occurrence records, and (3) occurrence dataset as response variable (e.g., Friedman 2002; Hijmans 2012).

In the three summer periods, both variables had a contribution percentage ≥ 2% (Fig. 2), so both were selected for the model (e.g., Awan et al. 2021). It should be mentioned that, in general, we obtained a value over 80% for WDp, while for UC, it was between 2 and 20%. Subsequently, using the same database, to avoid redundancy and high correlation between the variables, the collinearity was evaluated using Pearson’s correlation coefficient (Guisan and Hofer 2003) in Rstudio®. In this process, we considered a bivariate correlation below ± 0.70 for the two variables, following the criteria of Awan et al. (2021). Thus, WDp and UC were independent and were included in the models for the three summer periods (Table 1).

Fig. 2
figure 2

Partial dependence blocks of the contribution percentage of the variables water depth (WDp) and underwater cover (UC) by study period (summers of years 1999, 2009, and 2019) and life stage (adult and juvenile), for their first selection for the models of ecological response and spatial distribution of Ataeniobius toweri in the Media Luna spring, Mexico

Table 1 Bivariate correlations between habitat variables, water depth (WDp), and underwater coverage (UC), by life stage and summer period (in the years 1999, 2009, and 2019) for Ataeniobius toweri in the Media Luna spring, Mexico. It includes the significance (P < 0.05) by bivariate correlation for the final variable selection to use in models of ecological response and spatial distribution

Ecological response curves

We estimated the occurrence probability (OP) of A. toweri adult and juvenile as a function of changing conditions. Therefore, we ran a Generalized Regression Model (GLM) by the summer period to evaluate the quadratic and linear response by individual and aggregated variables. This approach is widely used to avoid overfitting in the prediction of the ecological response during logistic regression (Fielding and Bell 1997; Allouche et al. 2006). The following packages were used in Rstudio®: raster (Hijmans and Elith 2017), sp (Pebesma 2012), RColorBrewer (Neuwirth 2014), pander (Farlane 2013), and rgdal (Bivand et al. 2015). Similarly, to Gradient Boost Machine models, we loaded the correspondent predictor variables and trained the models with the corresponding occurrence records by life stage and study period. Also, based on the literature (e.g., Barbet-Massin et al. 2012), we randomly generated and included 500 background points to evaluate the correct prediction of habitat selection in each stage on the entire surface of the Media Luna spring. Subsequently, both occurrence and background records were grouped into a training set (80%) and a test set (20%).

We ran the GLM models, using the R codes provided by Rössel-Ramírez et al. (2023c), which were two individual linear models (one for WDp and one for UC) and one to explore the interaction between predictor variables. The statistical prediction dataset of the individual models was used to plot the ecological response curves that estimated the OP according to the conditions of the continuous variable WDp and the categorical variable UC. We presented this probability in a polynomial line of quadratic order for WDp and probability bars for UC. In addition, in each graph, we also included the mean–variance relationship (i.e., residual variance) and the training points group for occurrence and background from the prediction dataset. For WDp, we clipped the graphs from 0.0 to 3.5 m of depth, because in all cases an OP with P > 0.01 was predicted at this depth range. Subsequently, we evaluated the performance of the models using the GLM for both variables, from a 2 × 2 contingency table with the observed versus projected data. To minimize the percentage of type I (omission) or II (commission) errors, we followed the method of Allouche et al. (2006): we selected a threshold cutoff of 0.5 and divided the points corresponding to both presence and background into 80% training and 20% testing datasets. The metrics estimated were overall performance (OAC) and true statistical skills (TSS); we decided to use TSS because it gives equal importance to the prevalence of the predictions in the presence of random performance corrections (Fielding and Bell 1997; Allouche et al. 2006).

Spatial distribution models

In the geographic scenario, Carpenter et al. (1993), Naoki et al. (2006), and Pandit et al. (2017) propose DOMAIN as a suitable distribution model for threatened and endemic species, such as A. toweri. For this, we ran DOMAIN distribution models to estimate the distribution probability in the Media Luna spring, by fish life stage and summer period. We used the R code provided by Rössel-Ramírez et al. (2023c). In addition to the packages used for GLM, we added dismo and maptools (Hijmans and Elith 2017), Rcpp (Eddelbuettel and Francois 2011), lattice (Sarkar 2008), caret (Kuhn et al. 2020), pROC (Robin et al. 2011,) and sdm (Naimi and Araujo 2016) R packages. We also loaded the two predictor variables and the occurrence records in the statistical software and trained the modeling with the occurrence records and 500 background points (e.g., Barbet-Massin et al. 2012); each database was divided into a training (80%) and test (20%) group. After this parameterization, only the stack with WDp and UC and the training group set were included in the DOMAIN function. The final predictive raster, which showed the distribution probability of A. toweri adult and juvenile by summer period, was loaded into QGIS® software to plot the spatial distribution maps.

Finally, we evaluated the prediction performance of these models using the partial receiver operating characteristic (ROC) curve (Fielding and Bell 1997) and the area under the curve (AUC) of the ROC curve (Fawcett 2006). For both assessments, we included the two test groups (i.e., occurrence, and background), the statistical data of the DOMAIN model metric, and the stack with the variables. The values of the partial ROC were evaluated, following the criteria of Deshpande (2020), who argues that as long as the bandwidth ranges between 0.2 and 0.6, between true positive and true negative groups, the model has excellent predictive performance. For the ROC curve, we used the Pearce and Ferrier (2000) independent method for calculating AUC, where a value > 0.5 indicates better prediction performance, away from a random classifier.

Results

Spatio-temporal variation of RA

For both stages of A. toweri (i.e., adults and juveniles), there was a different proportion of RA between S3 and S9, in the three summer periods (Fig. 3). In 1999, we also calculated relative abundance for both stages (juvenile RA = 3.70%; adult RA = 0.74%) in S10 and also for adults in S12 (RA = 0.49%); no relative abundance was calculated in the rest of the sectors. In 2009, the juveniles in S2 had a RA of 9.09%. In 2019, the juvenile’s RA was 2.82% and the adult’s RA was 11.25%. For both life stages, the highest abundance was observed in the summer of 1999, and the lowest in the summer of 2009. In 2019, the RA increased to over 10% in some sectors (i.e., adults: S2, S3, S8, S9, and juveniles: S3, S6, S7, S9). Particularly, between 1999 and 2019, for the adult stage, RA increased in S9 from 4.91 to 45%. In juveniles in S3, RA increased from 5.29 to 23.94%. In general, the trend with the polynomial line was similar in the different life stages and summer periods.

Fig. 3
figure 3

Relative abundance (%) of Ataeniobius toweri in the Media Luna spring, Mexico, in the summer periods of years 1999 (red), 2009 (green), and 2019 (blue), in each sector (S1-S14), by life stage. The second-degree polynomial trend line (black line with dashes) shows the temporal variation of abundance in each case. Also, it shows the standard deviation bars with the range of variability associated with the mean value

Modeling evaluation and validation

In general, in the GLM models for A. toweri (Table S2), the overall predictive performance (OAC) was > 0.60, except for the adult stage in 2009 with OAC = 0.51. Concerning the TSS metrics, the values were over 0.40 in all cases. On the other hand, in the evaluation of the DOMAIN models, we estimated a higher proportion of true negative cases compared to true positive cases in the partial ROC curves, for both life stages in the three summer periods (Fig. S3). Each curve’s density had a bandwidth < 0.1 concerning the predicted value in the models and an AUC ratio > 1.0; only the juvenile stage in 2009 had a bandwidth of 0.17. For the ROC curve, in the adult stage, the AUC values were 0.72 (1999), 0.79 (2009), and 0.86 (2019); in the juvenile stage, the AUC values were 0.75 (1999), 0.85 (2009), and 0.76 (2019). These AUC values on the ROC curve meant higher sensitivity and low specificity (i.e., high power of discrimination) in all models (Fig. S4).

Suitable habitat selection

For both life stages, the highest OP, estimated as a function of the WDp range, was from 0.3 to 1.0 m depth in the three summer periods (Fig. 4). In all three periods, for the adult stage, the OP (P > 0.01) was from ~ 1.6 m, although it was highest between 0.6 and 1.5 m depth. In 1999, the maximum OP was P = 0.65, while it was below P = 0.20 in 2009 (P = 0.13) and 2019 (P = 0.19). For the juvenile stage, in 1999 and 2009, the OP increased from ~ 1.0 m (1999, P = 0.08; 2009, P = 0.01) to 0.3 m depth, with maximum P = 0.67 and P = 0.08, respectively. In 2019, the OP was estimated from ~ 3.4 m (P = 0.02) to exceed 0.3 m (P > 0.21) following a linear trend, where the probability increased at shallower depths.

Fig. 4
figure 4

Ecological response curves for Ataeniobius toweri in adult and juvenile stages, as a function of water depth (WDp), in three summer periods (in the years 1999, 2009, and 2019) in the Media Luna spring, Mexico. The polynomial line of order two shows the probability of occurrence, as a function of the WDp range, from 0.0 to 3.5 m depth. Also, it shows the confidence interval between the upper and lower values of the fit prediction along the probability line. It included the three fitted values of probability and water depth, and the training set of the presence and background points are shown in gray color

In the case of UC categories (Fig. 5), in 1999, we estimated the highest OP (P > 0.40) for the adult stage in Bc, Ms, and Nbs; the trend gradually decreased in the rest of the coverages. In the next two summer periods, the first three coverages with high probability (2009, P > 0.01; 2019, P > 0.10) were the same as in 1999. In 2009, the maximum OP was estimated in Bc (P ≈ 0.07) and in 2019 in Ms (P ≈ 0.16). Similarly, in these two periods, the OP decreased in Bst, Bcp, Sc, Bsd, and Msp conditions (2009, P < 0.01; 2019, P < 0.10). For the juvenile stage, only in 1999, an OP trend was maintained with P > 0.20 between coverages, with the highest probability (P ≈ 0.48) in Bc. In 2009, the highest OP was in Bc (P ≈ 0.04) and the probability was null in the rest of the coverages. In 2019, the highest OP (P > 0.10) was estimated also in Bc (P ≈ 0.15), as well as in Ms (P ≈ 0.16) and Nbs (P ≈ 0.14). Similarly, to the adult stage, the OP decreased in Bst, Bcp, Sc, Bsd, and Msp conditions (P < 0.10).

Fig. 5
figure 5

Ecological response curves for Ataeniobius toweri in adult and juvenile stages, as a function of underwater coverage (UC), in three summer periods (in the years 1999, 2009, and 2019) in the Media Luna spring, Mexico. The bars show the probability of occurrence between the following eight UC categories: natural bare soil (Nbs), bare soil by tourism (Bst), bare soil by depth (Bsd), small carpet (Sc), big carpet (Bc), mature shape (Ms), big carpet patch (Bcp), and mature shape patch (Msp). The confidence interval between the upper and lower values of the fit prediction along the probability bars is also shown. It includes the mean-fitted values of probability per UC. The training set of the presence and background points are shown in gray color

Spatial distribution models

In the three summer periods, for the adult stage of A. toweri, the models had a P > 0.70, between S2 and S9 (Fig. 6). Only in 1999, its distribution was extended to S10 and included the maximum probability (P ≈ 0.77). For the summer of 1999, the maximum probability of distribution (P ≈ 0.55) of the juvenile stage was also from S2 to S10. In contrast, in 2009 and 2019, the highest probability (P ≥ 0.80) was between S2 and S9 (Fig. 7). In general, for both life stages, the range between S2 and S9 kept the highest probability of distribution throughout the three summer periods.

Fig. 6
figure 6

Spatial distribution models for adult Ataeniobius toweri in the three summer periods (years 1999, 2009, and 2019) in the Media Luna spring, Mexico. The models were generated based on the variables of water depth and underwater coverage. Regarding the probability of distribution, in the three cases, it was higher in sectors between S2 and S9, preferentially towards the shores of the natural system

Fig. 7
figure 7

Spatial distribution models of juvenile Ataeniobius toweri in the three summer periods (years 1999, 2009, and 2019) in the Media Luna spring, Mexico. The models were generated based on the variables of water depth and underwater coverage. The probability of distribution, in the three cases, was higher in sectors between S2 and S9, preferentially towards the shores of the natural system

Discussion

In the Media Luna spring, in the three summer periods, the different proportions of relative abundance (RA) and highest probability of distribution (P ≥ 0.50) of adult and juvenile A. toweri were in sectors 2 to 9 (Figs. 3, 6, and 7). In these sectors, the water depth (WDp) below 2.5 m (Fig. 4) and the underwater coverage (UC) corresponded mainly to the big carpet (Bc) and mature shape (Ms) (Fig. 5), where the highest probability of occurrence was observed for both juvenile and adult stages. It is worth mentioning that, as documented by Palacio-Núñez et al. (2010b) (and according to our observations), in all the sectors the depth was below ~ 2.9 m, except in S1. In addition, between S2 to S9, UC had extensive underwater vegetation coverage, both in the form Bc and Ms. In contrast, from S10 to S14, the vegetation corresponded to isolated remnants of big carpet patches (Bcp) and mature shape patches (Msp), or completely bare soil by tourism (Bst).

Considering the wide existing gradient of WDp (from 0.0 to 36.2 m depth), fish show higher occurrence and spatial dynamics in shallower areas and towards the water’s shore (McClain and Etter 2005; Prchalová et al. 2009). In addition, according to our results, both stages had the highest OP between 0.1 and 1.5 m depth, in all summer periods. Furthermore, based on our field data, juveniles and adults were always observed at this depth range; occasionally, in the deeper portions of the canals, the adult stage only overtopped the cover given by Bc, which moved it away from the bottom of the system.

Regarding UC, as summer periods progressed, we observed a decrease in the extent of underwater vegetation cover, especially in 2009, to which we attribute the increase of RA from 1999 to 2019, in S9 for adults and S3 for juveniles. In these sectors, Bc and Ms coverages have been conserved with a high density (pers. obs.). Even so, the highest OP was always in Bc and Ms coverages, which are used for fish food, fitness, and shelter (Brandt 1980; Rozas and Odum 1988; Brosse and Lek 2002). Therefore, taking into account WDp and UC, we considered that the selected habitat conditions were conserved for adults and juveniles between the three study periods. However, the temporal variation in OP was related to changes in the composition of the UC, caused by anthropogenic interference in the site (Galván-Meza et al. 2018) between summer periods, which implies a change in other underwater environmental variables (e.g., temperature or water chemistry; Rozas and Odum 1988; Brosse and Lek 2002).

We confirmed a good prediction performance of the OAC metric (Fielding and Bell 1997) and the TSS prediction (Allouche et al. 2006) for our GLM models and response curves, which also reflected a good quality of fieldwork (Lobo et al. 2015). Thus, the OP modeled for the selection of habitat conditions, using both WDp and UC as predictors, was suitable for A. toweri at both stages. These findings are important for directing conservation management for the threatened endemic species and their habitat (Sánchez-Cordero et al. 2005), as is the case of A. toweri.

We also had a good prediction performance of the DOMAIN models, being able to discriminate between the true presence and absence of A. toweri in at least 72% of the cases, for juvenile and adult stages. Based on DOMAIN predictions, the probability of spatial distribution of both stages of A. toweri remained highest in sectors S2 to S9, where suitable conditions of WDp and UC were present (based on ecological response curves). In these sectors, the distribution was more marked towards the water’s shore with a depth of less than 1.5 m. Furthermore, according to Palacio-Núñez et al. (2010b) and our observations at these sites, the UC was dense at Bc and Ms. In this geographic scenario, there was a clear continuous distribution of both stages in 1999 and 2019, in the sectors S2 to S9. Only in 2009, probably due to the scarcity of records, the predictions indicated an adjusted distribution. There was also a probability of distribution in sectors S11 to S14 and in S1 for both stages (except for adult in 2009); the prediction (P < 0.50) in each of these sectors also showed very marked stripes towards the water’s shore. We considered that the modifications suffered by the natural system, particularly in UC, were responsible for projecting these variations in distribution between periods.

It should be mentioned that the distribution models generated with DOMAIN had a good predictive performance considering the ROC/AUC curve (Fielding and Bell 1997) and partial ROC curve (Manzanilla-Quiñones 2020). This corroborated the statistical significance of the modeling to explain the spatial distribution patterns of A. toweri, both for adult and juvenile stages.

Regarding the partial ROC curves, we obtained a higher proportion of true negative than true positive cases, suggesting the existence of the spatial ecology of a threatened species (Fielding and Bell 1997), as is the case of A. toweri (Koeck 2019). Regarding the ROC curve, Deshpande (2020) indicates that an AUC between 0.70 and 0.80 reflects better presence-variable correlation, which was the case for our results for both stages. In general, our models had the power to discern between areas with a low, medium, or high probability of A. toweri distribution, as a function of the selected variables, through the three summer periods.

Both the ecological response and spatial distribution models, together with the RA plots, were representative of the temporal variation in the spatial ecology of adult and juvenile A. toweri, during the three periods. All these modeling results were consistent with our field data, and we consider them to be adequate. On the other hand, DOMAIN was efficient for modeling the distribution and habitat affinity of this endemic and specialist fish, which is in agreement with Lyons et al. (2019). This ecological and spatial information increases knowledge about the spatial ecology of A. toweri and thus enables us to identify suitable habitat conditions and population status to make pertinent recommendations for the conservation of its natural population (e.g., Ferrier 2002; Joy and Death 2002; Benito and Peñas 2007) in the Media Luna spring. In addition, we considered that the incorporation of variables to local and temporal scales in the modeling was crucial to know the natural system and its modifications, as it happened with UC. These modifications can affect the population sensitivity of this and other endemic and threatened species (e.g., Ferrier 2002; Benito and Peñas 2007). Therefore, particularly for springs and wetlands, especially in arid and semi-arid areas, we highlight the need to obtain prior knowledge about the biotic and abiotic conditions of the study site. This was the case of Media Luna, which led the process of generating predictor variables from in situ information.

On the other hand, for ichthyofaunal recording in freshwater systems, our sampling designs, particularly underwater sampling, must be adapted to each situation; species and habitat information must be included in each record (Ruetz III et al. 2005; Prchalová et al. 2009), which can ensure a better understanding of population variation and life stage distribution (Engen et al. 2002; Gamelon et al. 2016).

Finally, we associated this RA, as well as the OP and distribution probability with tourism pressure that affects the vegetation and alters the habitat of A. toweri. As a consequence, in sectors S11 to S14 and S1, the OP was low, scarce, or null in conditions of remnants of vegetation (Bcp and Msp) and Bst. In contrast, we argued that A. toweri was found mainly between sectors S2 and S9, due to its affinity for high vegetation cover. Therefore, a natural habitat with a high density of emergent, sub-emergent, and floating vegetation cover, as well as a shallow depth, is important for the conservation of A. toweri in general. This avoids causing abrupt temporal variations in the occurrence and distribution of this and other endemic fish populations. Based on these results, we proposed an urgent prioritization of sectors S2 to S9 as areas subject to conservation, and better management, to avoid the extinction of the species that inhabit it.