Background

The epidemiological importance of the Triatominae, a subfamily of blood-sucking bugs, derives from its capacity to transmit Trypanosoma cruzi, the etiological agent of Chagas disease, one of the most important vector borne diseases affecting around 8 million people in Latin America [1]. At present, 148 triatomine species are recognized, whose taxonomic knowledge reflects a balance between the use of traditional approaches and a wide variety of evolutionary interpretations [28].

Triatoma brasiliensis sensu lato Neiva, 1911 is one of the most important species for Chagas disease transmission and the main vector of T. cruzi in semi-arid areas of Northeastern Brazil, colonizing both natural and artificial ecotopes [9, 10]. This species shows remarkable chromatic variation across its geographic distribution, and indeed, it was concluded that T. brasiliensis is a complex of species.

Multidisciplinary studies have indicated that the Triatoma brasiliensis complex is a monophyletic group [11, 12], comprising four species, one of which includes two distinct subspecies. The relevant taxa in the group are T. brasiliensis brasiliensis Neiva, 1911; T. brasiliensis macromelasoma Galvão, 1956; Triatoma juazeirensis Costa & Felix, 2007; Triatoma melanica Neiva & Lent, 1941 and Triatoma sherlocki Papa et al. 2002. Each member of the group can be identified by external morphological characteristics and a taxonomic key was recently published [12].

Correct taxonomic identification of these insects is important because triatomine bugs behave differently in terms of their adaptation to artificial environments. Some species are primarily found in sylvatic ecotopes, where they live associated with small nest-building mammals or birds. Others invade and can even colonize the peridomestic environment and/or the human dwellings, exposing domestic mammals and humans to the infection of T. cruzi [1317]. Knowledge about the environmental space occupied by the triatomine species is an important factor for the development and/or improvement of vector control and surveillance strategies [18].

Currently, complementary methods to the classical taxonomy have been used in the comparative study of Triatominae species, such as geometric morphometric and molecular techniques. Wing shape is increasingly utilized in species identification and characterization [19] and in distinguishing sibling species [2022].

Wings are excellent structures for studying morphological variation because the intersections of the wing veins provide many well-defined landmarks suitable for morphometrics, and the metric properties of the wing provide precise quantitative information for the identification of species complexes [23, 24] and within-species variations [25]. In triatomines, the study of morphological variation by geometric morphometrics has been helpful to solve taxonomic problems [20, 26, 27], to analyze the house reinfestation phenomenon [28, 29] and to distinguish cryptic species [23, 30, 31].

Variations that are not expressed phenotypically ascribes importance to molecular studies for the comprehension of systematic issues. In this sense, the DNA sequencing has been widely used for identification, determination of phylogenetic relationships and inferences. The DNA Barcoding method is a system of universal taxonomic identification, fast with free access, based on the variation of a part of the sequence of the mitochondrial gene cytochrome c oxidase subunit I (CO1) [32]. This method remains in constant evaluation and, among other groups, the use of the CO1 sequences has shown promise for species-level identification in insects, including cryptic species of triatomines [3336] and other insects [3741].

After the confirmation of the species composition of the T. brasiliensis complex, we started exploration about the causes of the speciation process within the complex. Guided by the general question on speciation causes, this study aims at verifying the hypothesis that T. brasiliensis and T. melanica occupy different environmental niche spaces as a consequence of the speciation process within the T. brasiliensis complex. For the first time, we report findings on DNA barcoding as a tool for the identification of these species, and to complement, we also report the analysis of wing geometric morphometrics.

Methods

Insects

The study was based on a dataset that combined localities where sylvatic specimens of T. brasiliensis and T. melanica were collected in the municipalities of Ceará and Minas Gerais, respectively, by our study group, and localities listed in recent bibliographic records (Table 1) [4250]. Specimens were collected among rocky crevices by manual capture performed by ourselves, assisted by the personnel of the Chagas Disease Control Programme (PCDCh). Species identification was based on diagnostic characters [12] and supported by geometric morphometric and DNA barcoding.

Table 1 Triatoma brasiliensis and Triatoma melanica collection sites

Geometric morphometrics

Right hemelytra (forewings) from 36 insects of each species (18 females and 18 males), from the municipalities of Tauá (CE) and Espinosa (MG), were mounted between microscope slides. The wing images were captured using a digital camera (Carl Zeiss®Axiocam MRC 3.0) coupled to a Zeiss® Stemi SV6 stereomicroscope. Seven type I landmarks (venation intersection, according to Bookstein, 1991) were digitized, as shown in Fig. 1. In order to reduce the measurement error, landmark coordinates were recorded three times on each wing and the average of the results was calculated.

Fig. 1
figure 1

Right wing of Triatoma melanica with the seven type I landmarks used in analysis. The same configuration was taken for T. brasiliensis. Scale bar = 300 mm

Centroid size (CS), an isometric estimator derived from coordinate data, was computed as a measure of overall wing size. It is defined as the square root of the sum of the squared distances between the centre of the configuration of landmarks and each individual landmark [51]. The centroid size values of different species and sexes were compared by multiple linear regression using the “lm function” of R. The presence of atypical points (outliers) was verified and a 95 % confidence interval for the estimated regression coefficients was constructed. In this analysis, males and females were examined separately to check for the presence of sexual dimorphism.

Generalized Procrustes Analysis (GPA) superimposition algorithm [52] was used to produce shape variables. Here, both non-uniform (‘Partial Warps’) and the uniform component were used as shape variables, which describe respectively the local and global variation as deviations from a consensus shape [53]. In order to explore the wing shape variation among species, a multivariate discriminant analysis was performed on shape variables and the main results were shown by a factorial map of the first two discriminant factors. Mahalanobis distances were also computed from the same variables and used to illustrate morphological divergence among groups (species and sexes) by a dendrogram constructed according to the UPGMA algorithm. Statistical significance of these distances was assessed by non-parametric analyses based on permutation tests (1000 runs), and corrected by the Bonferroni method for P ≤ 0.05.

The residual relationship between shape and size variables was evaluated using a multivariate regression analysis of the first two shape discriminant factors against the centroid size values. This procedure allowed measurement of the contribution of size to shape variation (allometry), where statistical significance was assessed according to the non-parametric method based on permutation (1000 runs) [54].

We used the modules COO, TET, MOG, COV and PAD from the CLIC package (freely available at http://mome-clic.com/) to collect landmarks, to compute centroid size and shape variables, as well as to evaluate the residual relationship between shape and size variables, and to generate discriminant models and Mahalanobis distances. For comparing overall wing size among groups and to elaborate quantile-plots of CS values, the lm function of R was used. The UPGMA dendrogram based on Mahalanobis distances was constructed using the neighbor module from the PHYLIP package (by Joe Felsenstein, available at http://evolution.genetics.washington.edu/phylip.html).

Molecular analysis

Genomic DNA was extracted from individual specimen legs (stored at −70 °C) using the Wizard Genomic DNA Purification System (Promega, Madison, WI), re-suspended in 30 μl of DNA Rehydration Solution (10 mM Tris pH 8.0, 1 mM EDTA pH 8.0) and then stored at −80 °C until amplification by standard polymerase chain reaction (PCR). A fragment of 658 bp in the 5′ end of the CO1 gene was amplified using the primers LCO 1490 (5′-GGTCAACAAATCATAAAGATATTGG-3′) and HCO 2198 (5′-TAAACTTCAGGGTGACCAAAAAATCA-3′) [55], following the thermal cycling conditions: an initial denaturation step at 94 °C for 3 min, 30 cycles at 94 °C for 1 min, 45 °C for 1 min and 72 °C for 1 min, with a final extension at 72 °C for 10 min. The 20 μl PCR reaction mixes included 10 μl of ultrapure water, 2 μl of 10x PCR buffer, 0,8 μl of each primer (10 μM), 2 μl of dNTP (2,5 mM), 2 μl of MgCl2 (50 mM), 0,4 μl of Taq polymerase (5 U/μl) and 2 μl of the DNA template (10 ng/μl). The PCR products were visualized on a 2 % agarose gel and selected for direct sequencing, where both DNA strands were sequenced using the BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems, Inc., Foster City, California, USA), following the manufacturer’s protocol on an ABI 3730XL Automated DNA Analyzer (Applied Biosystems) available at the Oswaldo Cruz Institute, FIOCRUZ (Rio de Janeiro, Brazil).

All sequences were aligned with the Clustal W and edited using the Geneious R6 software (Biomatters Ltd, Auckland, New Zealand®). The consensus sequences analysed in this study are deposited on GenBank under the accession numbers KJ580486 to KJ580495. Afterwards, we used MEGA v. 5.2 [56] to carry the sequence analyses. Intraspecific and interspecific genetic distances (sequence divergence) were calculated using the Kimura 2-Parameter (K2P) nucleotide substitution model [57]. In this approach, we also quantified the number of variables sites among sequences, as well as the nucleotide frequencies. A neighbor-joining (NJ) tree was constructed based on the K2P distances for representation of relationships among the analysed sequences, and the reliability of each node was estimated using the bootstrap method with 2,000 replicates. The tree was rooted using sequences of Triatoma sordida Stål, 1851 from Minas Gerais, Brazil (G. H. Campolina-Silva, unpublished data) as an outgroup.

All barcoded specimens have been deposited in the Coleção de Vetores da Doença de Chagas (COLVEC), Centro de Pesquisas René Rachou/FIOCRUZ.

Ecological niche modelling

Ecological niche modelling methods were used for the study of the geographic distribution of biological species in general and triatomines in particular [17, 58, 59], using environmental variables as proxies of species niche dimensions. Two main approaches were used as environmental data sources: the Worldclim database [60] and remotely sensed variables recorded by earth observation satellites.

Previously published studies on geographic distribution of the T. brasiliensis species complex [10, 12, 17] used occurrence coordinates of specimens mainly collected during the 1980s and 1990s, before the species identification within the T. brasiliensis complex. To avoid confusion of occurrence points of possibly misclassified specimens, the present study of the geographic distribution of T. brasiliensis and T. melanica was based on an estimation of the environmental space from occurrence points where each species was collected either by our study group or taken from recent bibliographic records (15 for T. melanica and 140 for T. brasiliensis). The number of occurrence points in this study is smaller than the set analysed in previous studies (for example Costa et al. 2002) [61], as those studies used several occurrence points that in our consideration were initially identified as T. brasiliensis, but afterwards were synonymized to other specific names within the group complex. We used a set of occurrence points that have no possibility of taxonomic misidentification as T. brasiliensis brasiliensis. We consider that this conservative selection of occurrence points should show the distribution area of the species without errors. No absence points were used, but rather a random selection of 1000 background points. No biotic or abiotic barriers are known or can be imagined for the dispersion of the studied species, so that the accessible area for them is the whole study area. Although flying and/or walking are the most frequent mechanisms for active (and short ranged) dispersion in triatomines, passive dispersion (especially through human migrations) is the main mechanism for long range dispersion, as a number of very early studies demonstrated and recently confirmed using genetic markers [62].

A great variety of methods exist to estimate species distribution models. The maximum entropy is one of the most frequently used, because of the simplicity of use of the Maxent software. However, based on the simplicity criteria [63], the possibility of testing explicit assumptions, and on the equivalence between the results obtained using Maxent and Poisson point process models [64], we used a generalized linear model with a binomial link through the glm function of the R package.

A major objective of studies examining niche overlap is to determine if two species occupy the same niche. Even if two or more niches are identical, there will be some differences in the data purely by chance. To rule out detecting two niches as different, that actually only differ due to sampling variation, niche comparisons between two species must be done statistically to determine whether the same probability distribution describes the niche of two species, or whether there is evidence of some difference [65]. The estimated distribution models for the T. melanica and T. brasiliensis were compared using the Schoener index of niche overlap D, an index varying between 0–1 (no niche overlap and identical niche, respectively), and used to perform niche equivalency and similarity tests [66]. We followed the framework proposed by Broennimann et al. (2012) [67] (based on Warren et al. 2008) [68], and specifically an ordination method (PCA-env) to estimate niche overlap, as the method disentangles the dependence of species occurrence from different climatic conditions (correcting for relative availability of environments) and from environmental data resolution. For the niche equivalency test all occurrences were pooled and randomly split into two datasets, maintaining the number of occurrences as in the original datasets, and the niche overlap statistic D was calculated. The process was repeated 100 times and a histogram of simulated values was constructed. If the observed value of D falls within the density of 95 % of the simulated values, the null hypothesis of niche equivalency cannot be rejected. For the niche similarity test we randomly shift the entire observed density of occurrences in one range and calculate the overlap of the simulated niche with the observed niche in the other range. The test of niche similarity was also based on 100 repetitions. If the observed overlap is greater than 95 % of the simulated values, the entity occupies environments in both of its ranges that are more similar to each other than expected by chance. All calculations were carried out using an R script produced by Broennimann et al. (2012) [67], and appropriately modified.

Environmental variables included in this study were day and night land surface temperature (LST) and vegetation index (NDVI) estimated from raster images produced by the MODIS sensor during the period 2009 – 2013 (downloaded as granules h12v09, h13v09, h13v10, h13v11, h14v09, h14v10, h14v11 from the http://e4ftl01.cr.usgs.gov/MOLT/ site); rainfall estimated by the Tropical Rainfall Measuring Mission (TRMM) during the period 2005 – 2013 (downloaded from the Goddard Earth Science site http://mirador.gsfc.nasa.gov/collections/TRMM_3B43__007.shtml) and elevation (produced by the Shuttle RT Mission and downloaded as a digital elevation model (DEM) from the CGIAR Consortium for Spatial Information at http://www.cgiar-csi.org/data/srtm-90m-digital-elevation-database-v4-1). Temporal resolution was 8, 16 and 30 days for LST, NDVI and rainfall, respectively. The spatial resolution at the data sources were 1 km for LST, 250 metres for NDVI, 0.5° (nominally 25 kms at the Equator) for rainfall and 90 metres for the DEM. All imagery was reprojected to a common geographic projection with WGS84 datum and resampled to 250 metres spatial resolution. As some points of triatomine collection were closer than 250 m apart, the number of occurrence points considered for T. melanica decreased to 8 and 132 for T. brasiliensis. A total of 200 images of LST (day and night) dates, 100 of NDVI dates and 108 monthly rainfall time series were processed calculating average and standard deviation of the time series for the period 2009–2013. The ecological niche model was thus based on 9 environmental predictors. A separate model for each species was calculated using the glm function of R version 3.0.2. Variables were selected to minimize the AIC (Akaike Information Criteria) using the stepAIC package. Colinearity among predictor variables was evaluated using the variance inflation factor (using vif function of the R car package) and a variable remained in the model if vif <10. Area under the receiver operation curve was calculated for model evaluation estimating partial AUC using the pROC package of R [69] that take into account the criticisms raised by Lobo et al. 2007 on the use of AUC [70]. Cross-validation (through the cv.glm function) was used to measure the robustness of model estimation, using K = 100 for T. brasiliensis and K = 8 for T. melanica.

Results

Morphometric analysis

As expected, there was a significant difference in centroid size values in both sexes and species (P < 0.001) (Fig. 2). A consistent sexual size dimorphism was observed, where the female wings were larger than the males. Wing size was largest in T. brasiliensis, as observed by Costa and colleagues [42].

Fig. 2
figure 2

Wing centroid size variation among males and females of T. brasiliensis and T. melanica. This shows the sexual size dimorphism. The group mean is shown as a box between the quartiles (25th and 75th percentiles) and the standard deviations are shown as lines. Units are pixels

The first two discriminant factors (DF1 and DF2) explained 94 % and 4 % of wing shape variation, respectively. T. brasiliensis and T. melanica were significantly different (P < 0.01) and well separated in a factorial map of discriminant analysis along the DF1 axis (Fig. 3a). T. melanica wings were narrow when compared with T. brasiliensis, and the landmarks 3 and 7 better contributed to the conformational changes in wing venation (Fig. 3b), and are important in the discrimination of these species. Significant Mahalanobis distances could be found between species and sexes after the permutation test corrected by Bonferroni (P <0.05), except for males and females of T. brasiliensis. The UPGMA dendrogram derived from these distances showed a clear separation between the species (Fig. 4). The multivariate regression analyses of the first two shape discriminant factors (DF1 and DF2) against the centroid size values revealed no significant contribution of size to shape variation (no allometric content, p = 0.19).

Fig. 3
figure 3

Shape variation in T. brasiliensis and T. melanica. (A) Factorial map showing the distribution of specimens in the plane of the two discriminant factors derived from discriminant analysis of the shape variables. Polygons correspond to each group under study, where continuous polygons enclose males and dotted polygons enclose females. DF1 and DF2 are the first and second discriminate factors, and their corresponding percentage contribution to the total shape variation is shown in parentheses. (B) Wing consensus shape obtained after the Generalized Procrustes Analyses (GPA). Arrows indicate the major differences in wing shape of T. brasiliensis and T. melanica

Fig. 4
figure 4

UPGMA dendogram based on Mahalanobis distances derived from shape analyses. Observe a clear separation between T. melanica (T.me) and T. brasiliensis (T.br)

DNA Barcoding analysis

A total of 10 CO1 sequences were generated. There were no detected insertions, deletions or presence of stop codons. The DNA barcode region analysed in this study comprised 564 bp, where the nucleotide composition between T. brasiliensis and T. melanica showed themselves to be similar, with an overall average of nitrogenous bases being 30.4 % for Thymine (T), 28.1 % for adenine (A), 24.4 % for Cytosine (C), and 17.1 % for Guanine (G). A total of 56 variation sites was observed (mainly at the third codon position) between the sequences, 55 (98.2 %) of these were partially informative and in 43 (76.8 %) there were fixed nucleotide substitutions, i.e. specific sites that are common to all sequences where there was exact replacement of a certain nucleotide between the two species. Transitions were more frequent than transversions (Table 2), with the transition C ↔ T being the most commonly observed.

Table 2 Summary of the variation along of the CO1 sequences analyzed

In order to evaluate the interspecific genetic divergence, the pairwise intra- and interspecific genetic distance (K2P) was calculated. The average intraspecific K2P distance was 0.006, ranging from 0 to 0.009 in T. brasiliensis (0.005 ± SD0.002) and from 0.002 to 0.013 in T. melanica (0.008 ± SD 0.004), whereas the average K2P distance between these two closely related species was fifteen times higher (0.099 ± SD 0.003, ranging from 0.094 to 0.103), supporting the barcoding gap, as proposed by Hebert and colleagues [71]. The values of the K2P distances are shown in Table 3.

Table 3 Pairwise intra and interspecific genetic distances, based on the Kimura 2-Parameter model

A neighbor-joining (NJ) tree constructed based on K2P distances (Fig. 5) shows two distinct clades strongly supported by a bootstrap value of 100 %, comprising a group formed by all T. brasiliensis individuals and another by T. melanica. Hence, our results indicate that these species might be recognized by the barcode region evaluated in this study.

Fig. 5
figure 5

Neighbor-joining tree based on K2P distances of CO1 sequences from T. brasiliensis and T. melanica. The NJ tree was rooted using sequences of Triatoma sordida and the bootstrap value of 2,000 replicates is shown on each node. Scale bar = 0.02

Geographic distributions

The model estimating the distribution of T. brasiliensis (n = 132) showed the average of LSTd and accumulated rainfall, and the variability of LSTd and LSTn and rainfall were all highly significant (P < 0.001) in describing the species occurrence sites (Table 4). The occurrence chance of this species is positively related with LSTd and variability of LSTn, and negatively related with accumulated rainfall and variability of LSTd. Model fit is very good, with high partial AUC (pAUC) = 95 % (CI95 of 91.3 – 97.2) and low cross validation error (0.041).

Table 4 Generalised linear model fit

The model for T. melanica (n = 8) showed that elevation, annual averages of LSTd, NDVI and variability of accumulated rainfall were the significant variables (Table 4). The occurrence chance of this species is positively related with elevation and averages of LSTd and NDVI, and negatively related with the variability of accumulated rainfall. Model fit was high, with pAUC = 98 % (CI95 of 97.2 – 99.4) and low cross validation error = 0.0085.

The model that predicts the potential distribution of T. brasiliensis identifies the main caatinga region as an area with similar environmental properties as the one where this specie was collected. The model for T. melanica predicts a sparse area towards the west (Goiás and Tocantins) and the northeast into the main caatinga region. Although map prediction identified different areas for the species, there are wide overlapping areas of model occurrence prediction. Figure 6 shows the prediction for the T. brasiliensis and T. melanica models, together with the area where the two models overlap.

Fig. 6
figure 6

Modeling Disease Vector Occurrence. Geographic distribution of T. brasiliensis (orange), T. melanica (light blue) estimated by the generalized linear model for each species, and area where both models overlap (red). T. brasiliensis occurrence points (pink circles) and T. melanica occurrence points (blue circles)

Niche comparison using principal component analysis showed that the two first principal components explain 58 % of the variation included in the 9 environmental variables used for the study. Niche overlap was estimated as D = 0.291. This value is larger than expected by chance, thus leading to acceptance of niche similarity (P > 0.05), although leading to rejection of niche equivalency (P < 0.05).

Discussion

Based on similarities in morphology, geographical distribution profile, epidemiological significance and phylogenetic relationships, a review [2] analyzing the systematics and evolution of the Triatominae subfamily, list the probable existence of 8 species complexes within the genus Triatoma Laporte, 1832. Among these, the T. brasiliensis complex has been continuously evaluated and is currently composed by T. melanica, T. juazeirensis, T. sherlocki and T. brasiliensis, the latter having two subspecies, T. brasiliensis brasiliensis and T. brasiliensis macromelasoma [12].

Among the five members of this complex, the subspecies T. b. brasiliensis has been considered as the main concern in terms of Chagas disease transmission, because this subspecies is the most widespread and shows the highest rate of domestic capture and of natural infection by T. cruzi [9, 10]. On the other hand, to date, T. melanica has been found almost exclusively in wild environments, and collected specimens have been shown to be restricted to municipalities in the northern region of Minas Gerais and the southern region of Bahia [12].

Analysis involving Triatominae mtDNA sequences are regularly used in order to elucidate phylogenetic relationships among closely related Triatominae species. In this respect, the most widely used markers include fragments of gene encoding proteins such as cytochrome b [11, 34, 36, 72, 73] and subunits of ribosomal RNA (12S and 16S) [33, 7476]. The present study shows the effectiveness of DNA barcoding in identifying species of the T. brasiliensis complex and reinforces its contribution to classic taxonomy. The analyzed CO1 fragment was sufficient to discriminate T. brasiliensis and T. melanica, showing that these species are separated by a large genetic distance. The discrimination of these species can be well represented by the NJ tree with bootstrap values of 100 % for the two identified clades. This shows that our analysis of pair wise distance-based K2P within the T. brasiliensis complex supported a clear barcoding gap between the intra- and interspecific variation [76].

In a recent study, Justi et al. conclude that DNA barcoding is not applicable for the identification of Southern American Triatoma species, which may have diverged recently, because they observed at least one intraspecific genetic distance greater than interspecific distance in assessed species [77]. The authors mention that to be considered appropriate to identify species, intraspecific distances must always be lower than interspecific ones. Unlike the findings of Justi et al., the results of this study show that the CO1 barcode region is a robust marker for differentiating this species complex, and it may constitute a valuable tool for both epidemiologic studies and for Chagas disease control, including subspecies differentiation, as observed by Vendrami et al. [49]. Furthermore, this methodology can be especially useful for identifying immature stages, whose traditional taxonomic method is made impossible in smaller nymphs, especially considering the species complexes that exhibit wide morphological and chromatic variation.

The barcode sequences result found is congruent with the geometric morphometric analysis. Centroid size variation was as significant in the interspecific analysis as it was in the separation between males and females. Size variation related to gender, with females of both species larger than males, was expected with the presence of sexual dimorphism being a known characteristic in triatomines [78]. The morphometric analyses of wing was able to discriminate the two members of the complex (DF1 = 94 %), allowing assignment without error of each single wing to its corresponding taxon, and producing a pattern of relationships in agreement within known genetic distances between populations, as observed by Costa and colleagues [42]. As a result of there being no allometric trends for wing shape comparisons between T. brasiliensis and T. melanica, it is suggested that the change in this structure’s shape should mainly reflect the variation of genetic origin [69]. Furthermore, the recorded sexual dimorphism suggests stability and adaptation of both species to the sylvatic ecotope, without evidence of exposure to environmental stress conditions, which may lead to reduction in size of females, as proposed by Dujardin et al. 1999 [79].

The ecological niche model using elevation and current data on land surface temperature, vegetation cover and rainfall showed that elevation, the average of LSTd, accumulated rainfall and variability of LSTd and LSTn are the environmental variables that best describe the geographic distribution of T. brasiliensis, whereas elevation, the average of LST, NDVI and variability of accumulated rainfall are the ones best describing the distribution of T. melanica. The result for T. melanica is probably less reliable, because of the small sample size of occurrence points, and restricted to a small area. Conversely, the prediction map for T. brasiliensis is very reliable, and indicates a potential distribution particularly associated with the caatinga biome. Of particular importance for the objective of this study is the niche comparison between the species. Results show that niches are similar but not equivalent, suggesting that T. brasiliensis and T. melanica share similar environmental constraints but that a different set of variables within the environmental niche space restrict their distribution. It is interesting that the model for T. brasiliensis predicts occurrence where T. melanica was collected, and the model of T. melanica predicts occurrence (although loosely) where T. brasiliensis was collected. Both species are associated with the caatinga region and have rupicolous habits. No difference between the species at the microscale is expected within the macro landscape of the caatinga, but this is an aspect that should be verified under field conditions. In evolutionary terms, the results of this study suggest that the separation in two different species should be attributed to a factor that does not include the current environmental conditions. However, as the caatinga is a biome that has existed in the area for at least the last 18,000 years [80, 81], past conditions might have had an influence in the speciation process.

The potential geographic distribution maps obtained differed with previously published studies [17, 82]. The differences are attributed to a) our use of a different set of occurrence points of specimens, based on criteria of minimum chance of species misidentification, a weakness that old records, previous to the species identification within the T. brasiliensis complex might possess, b) our use of current environmental conditions of temperature, vegetation and rainfall (measured by sensors onboard earth observation satellites, reflecting the land cover situation within the last 5 years of an area that shows an important landscape modification within the last 20 years), instead of the Bioclim database [60] that reflect past conditions (averaged between 1950–2000), through an interpolation process of ground meteorological stations, that in the area were extremely sparse during the considered period and might produce low reliability for the temperature and rainfall variables for the area, and c) our approach used for the whole study area as potentially accessible by the species, without imposing political boundary limits to map predictions.

Despite the successful vector control programme of Triatoma infestans in most parts of the southern cone countries of South America, native vectors persistently reinfest insecticide-treated households, and sylvatic triatomines maintain disease transmission without colonizing human dwellings. In these scenarios, fine-scale vector studies are essential to define epidemiological risk patterns and clarify the involvement of the little-known triatomine taxa in disease transmission. These eco-epidemiological investigations, as well as the planning and monitoring of control interventions, rely on accurate taxonomic judgments. The problems of cryptic speciation and phenotypic plasticity illustrate this need.

Conclusion

Although DNA barcoding is a straightforward approach, it is applicable for identifying species of the T. brasiliensis complex and contributes to classic taxonomy. The analyses of wing shape was able to differentiate the two members of the complex, and showed that the species are not under stressful conditions. As a result of there being no allometric trends in the wing shape comparison between T. brasiliensis and T. melanica, it is suggested that the change in this structure’s shape should mainly reflect the variation of genetic origin. In evolutionary terms, the result of the environmental niche space occupied by the studied species suggests that their separation as two different entities should be attributed to a factor that does not include the current environmental conditions. However, as the caatinga is a biome that has existed in the area for at least the last 18,000 years, past conditions might have had an influence in the speciation process.