Background

There is a global renewed interest in area-wide integrated mosquito management strategies based on the mass production and release of sterile males to suppress target populations [1,2,3,4]. These techniques are usually referred to as genetic control methods and include, among others, the sterile insect technique (SIT), the incompatible insect technique (IIT) and the release of insects carrying a dominant lethal gene (RIDL) [1, 4,5,6,7]. Several small-scale projects have demonstrated the high potential of these strategies to suppress mosquito populations [6, 8,9,10]. The scaling-up of these projects from pilot to operational has been hampered by several problems, the most significant one being the lack of an efficient sex-sorting method [1, 11, 12]. Given that only the female mosquitoes bite and transmit the human pathogens, those methods must be capable of ensuring a predefined acceptable level of female contamination while maximizing the male pupae recovery.

The successful use of genetic sexing strains (GSS) for the sex sorting of Ceratitis capitata and other fruit fly species [13,14,15,16,17] has encouraged researchers to develop similar GSS strains for mosquitoes. GSS strains that can be sex-sorted at early developmental stages (eggs or L1) are generally accepted to be the optimal solution for mass-scale SIT and related techniques [11, 12]. A genetic sexing strain based on the tolerance to dieldrin has been developed for Anopheles arabiensis; however, this strain presents several problems and has limited potential for SIT applications [18, 19]. In addition, several transgenic genetic sexing strains developed for different mosquito vector species are also of limited applied potential due to either lack of stability, low male performance or subject to extensive regulation [11, 12].

The current lack of a functional GSS has led to the mosquito population suppression projects to use alternative ways in the sex-sorting process. For the mosquito species with strong sexual size dimorphism (SSD), mainly Aedes and Culex species, mechanical methods have been generally adopted for sorting [6, 8, 20]. Although several designs and proposals for sex sorting on a mass scale have been suggested in the past [21, 22], all mosquito genetic control programmes currently use either plate separators [23] or sieves [8] for sex sorting that have been devised for small-scale rearing conditions. The development of new designs with automation capability for unattended sorting would increase the efficiency of those projects and allow their upgrade to large operational programmes [12, 24].

The efficiency of SSD-sorting methods in terms of male recovery, female contamination and speed depends on technical and biological factors. The technical features basically affect the rate of separation per time unit, and differ between methods. The main biological determinant is the size distribution between sexes and their overlap as well as the effect of rearing conditions on this characteristic. All the SSD-sorting methods rely on the same principle: the separation in two samples by a threshold size. It should be noted that an analysis of the biological determinants of the distribution of size will in principle be applicable to all SSD-sorting methods

In order to improve the performance of new designs of sex-sorting methods based on SSD, a previous biometric analysis is required, specifically dealing with the analysis of the frequency distribution of the size of sexes. However, there is scarce information regarding the distribution of size in mosquitoes. Usually, the scope of the biometric studies in mosquitoes has been to find correlations between the body size and other biological traits [25,26,27,28,29], providing only point and variance estimates, and only a limited number of studies have included detailed frequency distributions [30,31,32]. For insects, most of the frequency distributions of the size can fit to normal probability functions. When a strong SSD is present, each sex can fit to an independent normal curve [33]. SSD is generally assumed as a species-specific (or population-specific) trait with a narrow degree of variation caused by complex interactions of factors [34,35,36,37].

Several experiments have shown that variations in the mosquito larval rearing conditions can increase or reduce the average size of the resulting pupae [38,39,40,41]. The SSD is slightly influenced by intraspecific competition [40, 42], but not by food availability [38] or the pollution by conspecifics [43] as size of both sexes is equally affected by these parameters and the difference between the average size of each sex remains constant. The objective of the present study is to optimize the utilization of the SSD-sorting methods through the understanding of the frequency distribution in the pupal size of different mosquito species and strains, with the ultimate goal to: (i) understand the performance of the current sex-sorting methods in different conditions; (ii) assess the relationship between the parameters of importance for sex-sorting devices: female contamination and male recovery; (iii) evaluate the suitability of size-based sex sorting methods for different species and strains of mosquitoes; and (iv) propose features that will optimize the performance of SSD-sorting methods.

To achieve these objectives, we developed a general model for the size distribution of mosquito at the pupal stage. The frequency distribution can be modelled as a mixture of two normal probability density functions. We analyzed the frequency distribution in size of four important mosquito vector species that are currently the target of area-wide integrated vector control projects using SIT-based methods: Aedes aegypti, Ae. albopictus, Ae. polynesiensis and Anopheles arabiensis. The intraspecific variation is also evaluated for Ae. aegypti, since three different laboratory strains were included in the analysis. The use of an automated pupae size estimator system based on artificial vision developed by Grupo Tragsa, Spain, allowed the collection of a large amount of size measurements thus facilitating the achievement of our objective.

Methods

Laboratory strains

The Aedes aegypti Sri Lanka strain originates from mosquitoes collected from the Narahenpita area, District of Colombo, Western Province, Sri Lanka. This strain was kindly provided by Ms. Asha Wijegunawardana (University of Kelaniya, Sri Lanka) and has been maintained in the Insect Pest Control Laboratory of the Joint Food and Agriculture Organization and International Atomic Energy Agency (IPCL-Joint FAO/IAEA) laboratories since 2017. F28 mosquitoes from this strain were analyzed in the present study.

The Ae. aegypti GSS has been developed by classical genetic approaches and has been maintained in the IPCL-Joint FAO/IAEA since 2017. F5 mosquitoes from this strain were used in the present study.

The Ae. aegypti WB2 line was recently generated by transfer of Wolbachia wAlbB from Aedes albopictus into Aedes aegypti via embryonic microinjection at Michigan State University (personal communication, Zhiyong Xi), and has been maintained in the IPCL laboratories since 2016. This strain was introgressed into the genomic background of an Ae. aegypti strain from Brazil, provided by Professor Margareth Capurro (University of Sao Paolo, Brazil), through a series of seven backcrosses using in every generation Wolbachia-infected females mated with Ae. aegypti Brazil males. This resulted in the construction of the Ae. aegypti WB2-BRA strain used. F12 mosquitoes from this strain were analyzed in the present study.

The incompatible Ae. polynesiensis “Aito” (BC9) strain carries Wolbachia B from Ae. riversi. This strain was generated through multiple backcrosses between Aedes riversi females and Aedes polynesiensis aposymbiotic males (Hapairai, 2013). This strain which has been maintained at Institut Louis Malardè (ILM), Tahiti since 2010 was recently used in a pilot IIT field study on the atoll of Tetiaroa, French Polynesia (Bossin et al. manuscript in preparation).

The Ae. albopictus Rimini strain was originated from field collections in northern Italy. It has been maintained in the IPCL since 2010.

The An. arabiensis Dongola strain was originated from the Northern State of Sudan. It has been maintained in the IPCL since 2005. It is also available at the Malaria Research and Reference Reagent Resource Center, MR4, as MRA-856.

Mosquito rearing

Standard rearing conditions have been used for the maintenance of experimental colonies, egg collection and hatching of Ae. albopictus and Ae. aegypti [44, 45], Ae. polynesiensis [39], and An. arabiensis [46, 47] colonies.

Pupae production

For each species or strain, three larval containers were prepared for pupae production as described below. These three replicates represented a random sample of the different rearing units found in a mass rearing facility.

For Ae. aegypti and Ae. albopictus, 2000 first-instar larvae were introduced in white acrylonitrile-butadiene-styrene (ABS) plastic trays (41 × 30 × 8 cm) with 1.5 l of deionized water. Since larvae of Ae. polynesiensis must be reared under lower densities [39], 1200 first-instar larvae were introduced in 40 × 60 × 15 cm containers with 4 l of water. The larvae were fed with the standard Aedes IPCL diet [48, 49] at a concentration of 75 g per liter of diet. The diet regime ranged from 0.2 mg of dry weight per larvae on the first day to 0.8 mg on the last days.

Different batches of eggs of Ae. arabiensis were hatched in white plastic trays (41 × 30 × 8 cm). Two days after the hatching, approximately 500-1000 larvae were visually isolated in the same kind of trays with 1.5 l of deionized water. The larvae were fed with the standard Anopheles IPCL diet [46], in a concentration of 10 g/l ranging from 5ml on the first day to 20 ml on the last days.

All pupae in each container were collected on a daily basis starting at 24 hours from the beginning of the pupation. A batch of pupae was defined as all the pupae produced in 24 hours for a specific species/strain and replicate(s). The selected batches were sex-sorted under a binocular microscope. All the pupae in a batch were classified into males and females groups, and the resulting samples are referred to as batch-sex groups. The batch where the proportion of male and female was closer to 50%, usually on the 2nd or 3rd day from the beginning of pupation, was selected for the analysis.

Measurement of pupal size

The lateral profile area of the pupae was automatically measured by means of a computer vision system (Fig. 1), which comprises: (i) a translucent rigid surface with circular uniform movement, acting as conveyor on which the mosquito pupae are arranged; (ii) a uniform and high intensity led white light backlight system and (iii) a high resolution/high speed camera placed in top position. In this way, the mosquito pupae pass continuously under the camera while being backlit by the lighting system. The backlighting of the pupae allows photograph of them with a high contrast, which facilitates their subsequent extraction and isolation from the background (segmentation). In order to increase the precision and accuracy of the measurement of the areas, avoiding errors due to the effects of refraction of light by water droplets, the size of the pupae was measured in dry conditions for each session.

Fig. 1
figure 1

Continuous image capture size

A factor of special importance is that the backlight system must guarantee an illuminated area with a uniform intensity, at least in the interest area of capture of the camera. This is because the light intensity directly affects the size of the areas extracted in the segmentation process, and variations in the intensity could result in errors in the relative measurements. However, even with a uniform backlighting system, minor errors may occur in the measurement due to the position of each pupa with respect to the position of the camera (different projected areas) and electrical noise in the silicone sensor of the camera. To minimise these phenomena, several pictures of each pupa are recorded while in the camera capture area (around 15 shots per pupa), and then the median of all the measurements of each individual is chosen as the value of size. So, each individual has to be identified and its path has to be tracked. For this task, we have developed a predictive tracking algorithm, based on Kalman filters [50], which is able to identify and track individuals in their rotational displacement under the field of view of the camera. Additionally, the algorithm is robust enough to follow the characteristic rapid movements of the pupae in dry conditions, which are quite active.

The median size in square pixels is then transformed to a unidimensional parameter by the square root to linearize the measure of size. The measure of size presented in this study is then the square root of the pixel area. The outcome of the analysis is scale-independent, and the conclusions are valid regardless of the actual value of this magnitude. Since the pupae were sex sorted manually, we assume that a certain degree of identification error occured. Errors in the manual sex identification can affect the estimation of the statistical parameters, especially those pupae with size far larger or smaller than the corresponding sex average value. In order to minimize this effect, we considered each value that exceeded two standard deviations from the average as an error in sex identification. These values were subsequently excluded from the analysis.

Model for the frequency distribution of size

The proposed model relies on the basic assumption that the probability density function for the pupal size of each sex follows a Gaussian distribution, with the mean and standard deviation as the characteristic parameters. The mixture distribution for this situation is:

$$ f(x)={\alpha}_m\mathcal{N}\left(x;{\mu}_m,{\sigma}_m\right)+{\alpha}_f\mathcal{N}\left(x;{\mu}_f,{\sigma}_f\right) $$
(1)

where N(x; μi, σi) is the normal probability density function for size (x), with mean μi and standard deviation σi. The scalars αi are the proportions of each sex in the model, being αm + αf = 1. The subscripts m and f denote males and females respectively.

Predictions from the model

One of the goals of our model is to provide a statistical tool to estimate the theoretical outcomes of male recovery and female contamination. In order to quantify them we introduce the performance (PER) and purity (PUR) functions defined as follows:

$$ PER(x)=\frac{recovered\ males\ from\ the\ original\ sample\ for\ a\ given\ x}{males\ in\ the\ original\ sample} \times 100 $$
(2)
$$ PUR(x)=\frac{recovered\ males\ from\ the\ original\ sample\ for\ a\ given\ x}{recovered\ pupae\ from\ the\ original\ sample\ for\ a\ given\ x} \times 100 $$
(3)

For any given size threshold (x), PER provides the male recovery percentage and PUR the percentage of males on the sorted sample. Assuming the model given by Equation 1, it is easy to show that both functions can be estimated in terms of the normal distribution function:

$$ PER(x)=\varPhi \left(\frac{x-{\mu}_m}{\sigma_m}\right)\times 100 $$
(4)
$$ PUR(x)=\frac{\alpha_m\Phi \left(\frac{x-{\mu}_m}{\sigma_m}\right)}{\alpha_m\Phi \left(\frac{x-{\mu}_m}{\sigma_m}\right)+\left(1-{\alpha}_m\right)\Phi \left(\frac{x-{\mu}_f}{\sigma_f}\right)}\times 100 $$
(5)

where Φ denotes the standard normal cumulative distribution function. In order to determine both functions, the parameters {αm, μi, σi} of the model must be known. Purity and performance are inversely linked. A decrease in female contamination can be achieved by reducing the value of the threshold, but this unavoidably produces a reduction in the performance (Fig. 2). The features of PER(x) and PUR(x) depend on the chosen set of parameters. However many sets give rise to functions that are related by simple symmetry transformations like translations or scaling. As long as both functions are transformed in the same way, the purity versus performance curve remains invariant (Fig. 2c). Therefore, parameters {αi, μi, σi} are not suitable to classify unequivocally the different samples as they can lead to the same purity-performance curve. In order to find a more appropriate space parameter, we introduce two new dimensionless parameters: the sexual dimorphism index (SDI) and the sexual homoscedasticity index (SHI) defined by

Fig. 2
figure 2

Depiction of the purity-performance relationship under a mixture of two Gaussian distributions applied to the analysis of sex sorting by size. The graphs consider αm = 0.5 and three different sets of parameters s = {μm, σm, μf, σf}, s1 ={10, 1, 11, 1}, s2 = {8, 1, 9, 1} and s3 = {12, 2, 14, 2}. The performance of sets 2 and 3 is obtained by translating and scaling the performance of set 1: PER2(x) = PER1(x + 2), PER3(x) = PER1(x−2/2). The same transformations are applied to purity functions. a Purity versus size (X). b Performance versus size (X). c Purity versus performance

$$ SDI=\frac{\mu_f-{\mu}_m}{2\sqrt{\sigma_m{\sigma}_f}} $$
(6)
$$ SHI=\sqrt{\frac{\sigma_m}{\sigma_f}} $$
(7)

Combining definitions (6) and (7) with Equations 4 and 5, we can rewrite performance and purity functions as

$$ {\displaystyle \begin{array}{cc} PER(z)& =\Phi \left(\frac{z+ SDI}{\mathrm{SHI}}\right)\times 100\\ {}& \end{array}} $$
(8)
$$ PUR(z)\kern0.5em =\frac{\alpha_m\varPhi \left(\frac{z+ SDI}{SHI}\right)}{\alpha_m\varPhi \left(\frac{z+ SDI}{SHI}\right)+\left(1-{\alpha}_m\right)\varPhi \left(\frac{z- SDI}{SHI^{-1}}\right)}\times 100 $$
(9)

z being a dimensionless variable defined by \( z=\frac{x-\left(\frac{\mu_m+{\mu}_f}{2}\right)}{\sqrt{\sigma_m{\sigma}_f}} \)

Equations 8 and 9 reveal that PER(z) and PUR(z) only depend on three dimensionless parameters: αm, SDI and SHI. In other words, given αm, all combinations of the original parameters {μm, μf, σm , σf} giving the same pair {SDI, SHI} have exactly the same performance and purity functions in terms of the dimensionless variable z. Consequently, the dimensionless parameters {SDI, SHI} are more suitable than classical measures of center and spread to classify unequivocally different pupae samples with regard to purity and performance.

Basic statistics and model fit

The mean and standard deviation was estimated for every batch-sex group dataset, and its deviation from the normal distribution was tested by means of the Shapiro-Wilk test. The fit of the data to the probability density function of the mixture model was tested by means of the Kolmogornov-Smirnov test. All the statistic computations were done using the base package of R [51]. The significance level was set to α = 0.05.

Partitioning of the variance

A number of pupae equal to the minimal sample size was randomly selected for each batch-sex group per species/strain. This was performed in order to get a balanced factorial design dataset. A linear model was fit for each batch by means of ordinary least squares. The model included sex and batch as fixed factors, and their interaction. The partitioning of variance was assessed through ANOVA. All statistical analyses were done using the base package of R [51].

Results

Table 1 summarizes the statistics of the pupal size of the species and strains used in the present study. In total, 7733 pupae were analyzed. The majority of the batches were not significantly different from a normal distribution, and the mixture of two Gaussian distributions fitted well to all batches of all the species/strains used when male and female pupae data are combined. Table 2 presents the significance of the goodness-of-fit between the models and the data. None of the samples was significantly different from a Gaussian mixture distribution. Only two batches separated by sex showed significant departures from normality: An. arabiensis females of the batch 3, and Ae. albopictus Rimini males of batch 3.

Table 1 Descriptive statistics for the batches of pupae
Table 2 Results for the goodness-of-fit of the data to a probability distribution function. P-values are provided for each test

Figure 3 shows the histograms and fitted models for the three batches of each species/strain studied. After the parameters of the model have been estimated, the purity-performance characteristic curve is computed and the quality of sorting can be analyzed theoretically. Each value of pupal size is linked to a pair of purity and performance values, which are inversely related (Fig. 4). The fitted models allow simulating the output of SSD-sorting methods under different circumstances. Table 3 shows the main descriptors for the predicted output from the fitted model for each batch in the experimental data. These results should not be considered as a general prediction of how a particular strain will perform with SSD-sorting methods, since they are only applicable for the specific rearing conditions of this experiment. However, they show the potential of SSD-based sex-sorting procedures when standard rearing procedures are applied. Table 4 provides examples for one of the strains of Ae. aegypti (GSS) of how the performance-purity output varies under different simulated conditions. The simulation a describes the performance when the three batches are mixed and a threshold size is determined by keeping constant the level of purity of 99.5 %. It is shown how SDI takes lower values than any of the individual batches, and an average reduction of 17 % in performance is predicted for the same level of purity (male recovery of 74.5 % with a female contamination of 0.5 % after mixing the three batches). Simulations b and c assess the effect in the performance of the size heterogeneity and the variations in the SSD respectively. For simulation b the variance of both sexes is scaled by the same factor while the distance between means remains constant. Taking batch 1 as a reference, the standard deviation of both sexes is multiplied by 0.8 and 1.2 respectively. For simulation c, taking again batch 1 as reference, the average size of males and females is increased or decreased by 0.5 √pixels but the standard deviations are not modified in this case. It is worth mentioning that the variation of the statistical parameters in simulations b and c only affect the dimensionless parameter SDI, while SHI remains unaltered. Results show that changes in size heterogeneity and SSD have contrary effects in the performance; an increase in intra-batch variance produces a significant drop in performance whereas an increase in SSD improves the quality of sorting. Simulation d describes how the output of SSD-sorting systems vary with the election of a predefined constant threshold for all the batches.

Fig. 3
figure 3

Frequency distribution of size (√pixel) for pupae of different mosquito species and strains reared under small-scale laboratory conditions. Males are represented in blue and females in red. Three replicates are presented for each mosquito species/strain

Fig. 4
figure 4

Depiction of the SSD-sorting methods functioning simulated by a mixture of two normal distribution functions. Different threshold of sizes separates the sample in two subsamples. The subsample of smaller size has a different male proportion depending on the chosen threshold. Purity = % males on the sorted sample. Performance = % males recovery. The dotted lines depict a value of threshold. a Probability density function of male and female pupal size. b Purity versus performance

Table 3 Predictions of the fitted models with the experimental parameters for different measures of suitability to sexual size dimorphism sorting methods. Performance (% males recovery) at different levels of purity (% males in the sorted sample)
Table 4 Predicted values for the descriptive parameters of the sex sorting of the Ae. aegypti GSS strain simulated under different conditions

The contribution of different factors to the variability in size has been assessed by means of ANOVA. The results for the partitioning of the variance are shown in Table 5. For all the Aedes species, the biggest source of variation is the SSD. For An. arabiensis, there are significant differences in size between sexes, but this factor explained only a relatively small portion of the total variance.

Table 5 ANOVA tables for each species/strain. The factors included are batch (rearing container) and sex (male or female)

Discussion

For all the species and strains examined in this study, the joint frequency distribution of pupal size included an area of overlap between the individual male and female distributions. This essentially means that a complete separation of sexes according to a given threshold of size is not possible, and every threshold that separates the sample in two will leave a certain proportion of each sex in the batch of the other group: smaller females in the male group and/or bigger males in the female group. This limitation of the SSD-based sorting methods is commonly recognized, altogether with the general observation that rearing conditions have a strong effect over the performance of the methods [8, 20,21,22, 52]. However, the mechanisms under these observations have not been investigated in depth, which may affect the optimization of new sorting methods based on SSD.

For the four species analyzed, including the three Ae. aegypti laboratory strains, the distribution of the size of each sex considered apart followed a normal distribution, as commonly observed in insects [33]. For the three Aedes species studied here, the joint frequency distribution for both sexes is noticeably bimodal, and can be modeled through a mixture of two normal probability density functions. This model is rather simple, with only five parameters that can be easily estimated directly from a population sample. It is likely that this approach can be generalized to other Aedes species as well as to culicine mosquitoes with a marked dimorphism in size [31, 38, 53]. In addition, this method of analysis could be generally applied to all known SSD-based sorting methods, since all of them rely on separating batches of pupae in two groups through the definition of a threshold size.

The features of the distribution in sizes of the individuals determine the differences between samples/ strains/ species in the suitability for any SSD-sorting method. These differences are reflected in two main parameters: the performance (% males recovery) and sample purity (% males on the sorted sample). Both can be predicted from the probability density function. For a given set of the model parameters (αm; μi, σi), each size threshold has a pair of values of performance and purity associated. Under these model assumptions, purity and performance in each sample of pupae are unequivocally linked; each value of purity corresponds to a single value of performance. Since the evaluation of the quality of a given sorting through the predicted values of purity-performance depends on the chosen value of threshold, the relationship of both parameters in a dimensionless space has been analyzed theoretically. This analysis has provided two useful indices that describe the applicability of SSD-sorting methods for a given sample of pupae, i.e. the quality of the biological material and the rearing conditions. As SDI increases, the purity-performance function becomes more optimal (better performance with higher purity). The SDI index has two components, the SSD and the sample variance. Consequently, an increase in SSD and a reduction in variance increase the efficiency of any SSD-sorting method, as will be discussed later. Index SHI modifies the slope of the curve. The higher the SHI value, the more flattened purity-performance curve is obtained. This parameter describes the difference in variance between males and females, which is difficult to control during the rearing process. SDI and SHI can be used for long-term monitoring of the quality control of the sorting process.

The purity and performance are inversely correlated. From the applied point of view, any sorting system must choose a size threshold considering the trade-off between performance and purity. From Equations 8 and 9, it is possible to estimate, for a given sample, the threshold of size needed to obtain a desired value of purity or performance. Unfortunately, it is not possible to calculate a single constant size threshold for sex sorting that keeps constant the purity and performance levels across different batches. It is known that there is heterogeneity in size in the production units (rearing containers) that affects the outcome of the sorting methods [8, 20, 21, 52]. For instance, the three Ae. aegypti GSS SSD batches varied in purity (99.2-100%) and performance (66-94%) when separated by the same threshold value (see Table 4).

Keeping the purity as a constant parameter, and assuming heterogeneity in size, the outcome of the sorting will have a variable percent recovery of males. Since this heterogeneity is important in the output of the sex sorting, we analyzed and quantified the sources of variation in size in the experimental sample. Then, we used the parameters directly estimated from the samples to predict the expected values of purity and performance for each batch of pupae. Changing the value of these parameters in the fitted models allowed us to simulate the outcome of SSD-sorting methods under different scenarios. The partitioning of variance showed two different patterns of relative importance in respect to the source of heterogeneity in size. For the Aedes species, the main source of variation was the sexual difference, followed by the residual, the differences between batches and finally the interaction sex-batch. For Anopheles, the effect of sex was of less importance, and the residual accounted for most of the variation.

The SSD, as the absolute difference between mean size of each sex, is the main factor that explains the interspecific differences in the applicability of SSD-sorting methods. A higher SSD will produce higher performance independently of the scale, and for all the size thresholds considered, yielding a higher SDI. The two species with higher SSD (Ae. aegypti and Ae. polynesiensis) are known to yield better results than Ae. albopictus when separated with plate separators. On the other hand, An. arabiensis, as expected [11, 12], showed a poor suitability for the SSD-sorting methods. The experimental samples of An. arabiensis showed an average SSD of 0.52 √pixels, while the Aedes species ranged from 3.3 to 3.9 √pixels. Likely, even achieving a reduction in the heterogeneity would not be enough to make SSD-sorting methods suitable for An. arabiensis or related species. For example, a SSD-sorting method was used to sort An. albimanus in a trial in El Salvador [54], and resulted in 14% of female contamination in the released mosquitoes which would be currently unacceptable.

The residual variance is the second important source of variation in the Aedes group. It accounts for the unexplained variation due to other factors, mainly the natural heterogeneity in size that can be found in any pupal batch. The heterogeneity in size in a given batch (rearing container) has a strong effect on the performance. As an example, our simulations (Table 4) with the GSS strain predict that a 20 % increase in the standard deviation of the experimental value reduces the performance by about 17 %. Conversely, a reduction of the same magnitude produces an increase in male recovery of 7.6 %. The heterogeneity in size could be due to genetic and/or environmental factors [55, 56]. It is not expected that the genetic heterogeneity of laboratory populations has a major effect given that it has been drastically reduced by the colonization process [55]. On the other hand, it is known that intraspecific asymmetric density-dependent factors can increase the variability in size in other insects which have an aquatic larval developmental stage [57, 58]. Given that the mosquito larvae are usually kept at high densities in the artificial rearing containers, the heterogeneity in size could potentially be reduced by adjusting the larval density.

The variance between rearing containers is also an important factor to consider in SSD-sorting. In a real mass production context, there is variation in size between batches that is present in our experiments as well. This variation affected mainly the average size of the pupae, but also at some extent the absolute SSD magnitude (Table 5, Interaction term Batch:Sex). It has been reported that the food availability or the water pollution by conspecifics does not affect the absolute SSD magnitude in Aedes [38, 43] while larval competition could produce some degree of sexual allometry in size [40, 42]. The intraspecific variation in SSD is a complex issue [35,36,37] out of the scope of this article, but worth to be investigated in mosquitoes in the context of SSD-sorting methods. The variability between rearing containers is of major applied significance because the threshold of size needed to obtain a desired degree of purity is specific for each batch. The use of a common fixed threshold for all the production batches would produce a variable output in respect to purity and performance, and it is therefore not recommended (Table 4). In a mass production context, it can be sometimes useful to mix the pupae production of different rearing containers before the sex sorting, but this would likely increase the size heterogeneity. This is clearly shown in the simulations presented in Tables 3 and 4. For example, mixing the Ae. aegypti GSS pupae production from three rearing trays reduced the performance in about 17 % (with purity level of 99.5 %).

Two main strategies are presently used for SSD sorting methods. First, sieves [8], rows of slots [21] or openings between plates [22], which are based on fixed size thresholds, were developed through a trial and error process. This means that the purity and performance are not controlled and they entirely depend on the rearing conditions. On the other hand, plate separators [23], which rely on a visual adaptive size threshold election system, exhibit better performance [52] at the expense of productivity [21]. Both strategies can be optimized using the appropriate analytical tool. For the fixed threshold methods, a more optimal threshold based on the actual range of variation in size between batches of pupae may be required. The plate separator could be optimized by the determination of less subjective threshold election criteria. Finally, the analytical framework proposed here can be integrated in large scale mechanized sex-sorters of high precision.

Conclusions

The distribution of size in mosquito pupae can be modeled by a mixture of two Gaussian distribution functions. This approach, combined with the parameters obtained from laboratory samples, can be useful to understand and optimize the mechanisms of the SSD-sorting methods. Purity and performance, which are the most relevant features of sex sorting devices, can be directly calculated from the presented model. Two additional dimensionless parameters, SDI and SHI, which are good descriptors of the suitability of a species/strain under given rearing conditions for its sorting with SSD-based methods are proposed. This approach can be applied to all the SSD-sorting methods. The output of the SSD-sorting methods can be improved by reducing the heterogeneity in size within the rearing containers. The heterogeneity between batches can affect the quality of sex sorting when different batches are mixed before the sorting or when a common separation threshold is determined for a series of batches. For new designs of sex-sorting devices based on SSD, we recommend the following: (i) use of an adaptive and precise threshold selection method based on automatic measurement systems and the proposed formulas; and (ii) a specific threshold size for each batch to maintain the purity at a constant level. In this way, the heterogeneity in size will be resulting to a variable male recovery (performance). From the practical point of view, this study shows that enhanced SSD-based sex sorting methods can be applied to Aedes mosquito mass-rearing facilities that depend on lateral area to distinguish sexes to efficiently produce batches of male-only pupae with a male recovery ranging between 70% and 99% and female contamination under 0.5%, with the lower values of male recovery being obtained when different batches are mixed or when larval rearing conditions are not standardized.