Skip to main content
Log in

Evaluation of four modelling techniques to predict the potential distribution of ticks using indigenous cattle infestations as calibration data

  • Original Paper
  • Published:
Experimental & Applied Acarology Aims and scope Submit manuscript

Abstract

Efficient tick and tick-borne disease control is a major goal in the efforts to improve the livestock industry in developing countries. To gain a better understanding of the distribution and abundance of livestock ticks under changing environmental conditions, a country-wide field survey of tick infestations on indigenous cattle was recently carried out in Tanzania. This paper evaluates four models to generate tick predictive maps including areas between the localities that were surveyed. Four techniques were compared: (1) linear discriminant analysis, (2) quadratic discriminant analysis, (3) generalised regression analysis, and (4) the weights-of-evidence method. Inter-model comparison was accomplished with a data-set of adult Rhipicephalus appendiculatus ticks and a set of predictor variables covering monthly mean temperature, relative humidity, rainfall, and the normalised difference vegetation index (NDVI). The data-set of tick records was divided into two equal subsets one of which was utilised for model fitting and the other for evaluation, and vice versa, in two independent experiments. For each locality the probability of tick occurrence was predicted and compared with the proportion of infested animals observed in the field; overall predictive success was measured with mean squared difference (MSD). All models exhibited a relatively good performance in configurations with optimised sets of predictors. The linear discriminant model had the least predictive success (MSD ≥ 0.210), whereas the accuracy increased in the quadratic discriminant (MSD ≥ 0.197) and generalised regression models (MSD ≥ 0.173). The best predictions were gained with the weights-of-evidence model (MSD ≥ 0.141). Theoretical as well as practical aspects of all models were taken into account. In summary, the weights-of-evidence model was considered to be the best option for the purpose of predictive mapping of the risk of infestation of Tanzanian indigenous cattle. A detailed description of the implementation of this model is provided in an annex to this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Araújo MB, Pearson RG, Thuiller W, Erhard M (2005) Validation of species-climate impact models under climate change. Global Change Biol 11:1504–1513

    Article  Google Scholar 

  • Aspinal R (1992) An inductive modelling procedure based on Bayesȁ9 theorem for analysis of pattern in spatial data. Int J Geogr Inf Syst 6:105–121

    Article  Google Scholar 

  • Austin MP (2002) Spatial prediction of species distribution: an interface between ecological theory and statistical modelling. Ecol Model 157:101–118

    Article  Google Scholar 

  • Beutel TS, Beeton RJS, Baxter GS (1999) Building better wildlife-habitat models. Ecography 22:219–223

    Article  Google Scholar 

  • Bonham-Carter GF (1997) Geographic information systems for geoscientists—modelling with GIS. Oxford, Pergamon

    Google Scholar 

  • Branagan D (1973) The developmental periods of the Ixodid tick Rhipicephalus appendiculatus Neumann, 1901, under quasi-natural conditions in Kenya. Bull Entomol Res 63:155–168

    Article  Google Scholar 

  • Branagan D (1978) Climate and east coast fever. In Gibson TE (ed) Weather and parasitic animal disease. World Meteorological Organisation, Geneva, pp 126–140

    Google Scholar 

  • Chatfield C (1995) Model uncertainty, data mining and statistical inference (with discussion). J R Statist Soc A 158:419–466

    Article  Google Scholar 

  • Coolbaugh MF, Bedell R (2006) A simplification of weights of evidence using a density function, fuzzy distributions, and geothermal systems. In Harris JR (ed) GIS for the Earth Sciences, Geological Association of Canada, Special Publication 44, pp 115–130

  • Cumming GS (1999) Host distribution does not limit the species ranges of most African ticks (Acari: Ixodidae). Bull Entomol Res 89:303–327

    Article  Google Scholar 

  • Cumming GS (2000) Using between-model comparisons to fine-tune linear models of species ranges. J Biogeogr 27:441–455

    Article  Google Scholar 

  • Cumming GS (2002) Comparing climate and vegetation as limiting factors for species ranges of African ticks. Ecology 83:255–268

    Article  Google Scholar 

  • Daniel M, Kolar J, Zeman P (2004) GIS tools for tick and tick-borne disease occurrence. Parasitology 129:S329–S352

    Article  PubMed  Google Scholar 

  • Davenport ML, Nicholson SE (1989) On the relation between rainfall and Normalized Difference Vegetation Index for diverse vegetation types in East Africa. Int J Remote Sensing 12:2369–2389

    Google Scholar 

  • Estrada-Peña A (1998) Geostatistics and remote sensing as predictive tools of tick distribution: a cokriging system to estimate Ixodes scapularis (Acari: Ixodidae) habitat suitability in the United States and Canada from Advanced very high radiometer satellite imagery. J Med Entomol 35:989–995

    PubMed  Google Scholar 

  • Estrada-Peña A (2001) Forecasting habitat suitability for ticks and prevention of tick-borne diseases. Vet Parasitol 98:111–132

    Article  PubMed  Google Scholar 

  • Estrada-Peña A (2003) Climate change decreases habitat suitability for some species (Acari: Ixodidae) in South Africa. Onderstepoort J Vet Res 70:79–93

    PubMed  Google Scholar 

  • Fielding AH, Bell JF (1997) A review of methods for the assessment of prediction errors in conservation presence/absence models. Environ Conserv 24:38–49

    Article  Google Scholar 

  • Good IJ (1950) Probability and the weighing of evidence. Charles Griffin, London/Hafner Press, New York

    Google Scholar 

  • Guerra M, Walker E, Jones C, Paskewitz S, Cortinas MR, Stancil A, Beck L, Bobo M, Kitron U (2002) Predicting the risk of Lyme disease: habitat suitability for Ixodes scapularis in the North Central United States. Emerg Inf Dis 8:289–297

    Article  Google Scholar 

  • Guisan A, Zimmermann NE (2000) Predictive habitat distribution models in ecology. Ecol Model 135:147–186

    Article  Google Scholar 

  • Guisan A, Thuiller W (2005) Predicting species distribution: offering more than simple habitat models. Ecol Lett 8:993–1009

    Article  Google Scholar 

  • Hand DJ (1981) Discrimination and classification. Wiley, Chichester

    Google Scholar 

  • Härdle W, Steiger W (1995) Optimal median smoothing. Appl Stat 44:258–264

    Article  Google Scholar 

  • Hastie T, Tibshirani RJ (1990) Generalised additive models. Chapman & Hall, London

    Google Scholar 

  • Hirzel A, Guisan A (2002) Which is the optimal sampling strategy for habitat suitability modelling. Ecol Model 157:331–341

    Article  Google Scholar 

  • Hugh-Jones ME, Barre N, Nelson G, Wehnes K, Warner J, Garvin J, Garris G (1992) Landsat-TM identification of Amblyomma variegatum (Acari: Ixodidae) habitats in Guadeloupe. Remote Sens Environ 40:43–55

    Article  Google Scholar 

  • Ihaka R, Gentleman R (1996) R: a language for data analysis and graphics. J Comput Graph Stat 5:299–314

    Article  Google Scholar 

  • Isaaks E, Srivastava RM (1989) An introduction to applied geostatistics. Oxford University Press, New York

    Google Scholar 

  • Kelsall JE, Diggle PJ (1995) Kernel estimation of relative risk. Bernoulli 1:3–16

    Article  Google Scholar 

  • Lachenbruch PA (1975) Discriminant analysis. Hafner Press, New York

    Google Scholar 

  • Lehmann A, Overton JMC, Leathwick JR (2002) GRASP: Generalised regression analysis and spatial prediction. Ecol Model 157:189–207

    Article  Google Scholar 

  • Lynen G, Bakuname C, Mtui P, Sanka P, (2000) Tick and TBD survey in the northern regions in Tanzania. Proceedings of Tanzania Veterinary Association Scientific Conference, vol 19. pp 76–85

  • Kitron U, Bouseman JK, Jones CJ (1991) Use of the ARC/INFO GIS to study the distribution of Lyme disease ticks in an Illinois county. Prev Vet Med 11:243–248

    Article  Google Scholar 

  • Merler S, Furlanello C, Chemini C, Nicolini G (1996) Classification tree methods for analysis of mesoscale distribution of Ixodes ricinus (Acari, Ixodidae) in Trentino, Italian Alps. J Med Entomol 33:888–893

    PubMed  CAS  Google Scholar 

  • McCulloch B (1968) A study of the life history of the tick Rhipicephalus appendiculatus – the main vector of East Coast fever – with reference to its behaviour under field conditions with regard to its control in Sukumaland, Tanzania. Bulletin Epizootic Diseases of Africa 16:477–500

    CAS  Google Scholar 

  • Olwoch JM, Rautenbach CJ de W, Erasmus BFN, Engelbrecht FA, van Jaarsveld AS (2003) Simulating tick distributions over sub-Saharan Africa: the use of observed and simulated climate surfaces. J Biogeogr 30:1221–1232

  • Pearce J, Ferrier S (2000) An evaluation of alternative algorithms for fitting species distribution models using logistic regression. Ecol Model 128:127–147

    Article  Google Scholar 

  • Pebesma EJ, Wasseling CG (1998) GSTAT: a program for geostatistical modelling, prediction and simulation. Comp Geosci 24:17–31

    Article  Google Scholar 

  • Perry BD, Lessard P, Norval RAI, Kundert K, Kruska R (1990) Climate, vegetation and the distribution of Rhipicephalus appendiculatus in Africa. Parasitol Today 6:100–104

    Article  PubMed  CAS  Google Scholar 

  • Randolph SE (1994) Population dynamics and density-dependent seasonal mortality indices of the tick Rhipicephalus appendiculatus in eastern and southern Africa. Med Vet Entomol 8:351–368

    Article  PubMed  CAS  Google Scholar 

  • Randolph SE (2000) Ticks and tick-borne diseases systems in space and from space. In Baker JR, Muller R, Rollinson D (eds) Remote sensing and geographical information systems in epidemiology, advances in parasitology vol 47. Academic Press, pp 217–243

  • Reichert P, Omlin M (1997) On the usefulness of overparametrized ecological models. Ecol Model 95:289–299

    Article  Google Scholar 

  • Robinson T, Rogers D, Williams B (1997) Univariate analysis of tsetse habitat in the common fly belt of Southern Africa using climate and remotely sensed vegetation data. Med Vet Entomol 11:223–234

    Article  PubMed  CAS  Google Scholar 

  • Robinson TP (2000) Spatial statistics and geographical information systems in epidemiology and public health. In Baker JR, Muller R, Rollinson D (eds) Remote sensing and geographical information systems in epidemiology, Advances in parasitology, vol 47. Academic Press, pp 81–127

  • Rushton SP, Ormerod SJ, Kerby G (2004) New paradigms for modelling species distributions. J Appl Ecol 41:193–200

    Article  Google Scholar 

  • Segurado P, Araújo MB (2004) An evaluation of methods for modelling species distributions. J␣Biogeogr 31:1555–1568

    Article  Google Scholar 

  • Sutherst RW, Maywald GF (1985) A computerised system for matching climates in ecology. Agric Ecosyst Environ 13:281–299

    Article  Google Scholar 

  • Tucker K, Rushton SP, Sanderson RA, Martin EB, Blaiklock J (1997) Modelling bird distributions – a combined GIS and Bayesian rule-based approach. Landscape Ecol 12:77–93

    Article  Google Scholar 

Download references

Acknowledgements

This study was partly supported by the INCO-DEV programme of EU through the project No. ICA4-CT-2000-30006, the grant No. 201/98/0090 of the Grant Agency of the Czech Republic, and the Ministry of Water and Livestock Development of the United Republic of Tanzania.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Petr Zeman.

Annex 1. Implementation of the weights-of-evidence model

Annex 1. Implementation of the weights-of-evidence model

For a comprehensive description of this method we refer to Bonham-Carter (1997). Denoting with t and the respective tick presence and absence and with \({\cal T}\equiv \{{\cal T}_{1}\ldots {\cal T}_{n}\}\) a set of evidential themes (GIS maps) and assuming their values τ≡ {τ1 ... τ n } at a place of interest, the logit posterior probability of the tick occurrence given τ was calculated as

$$ {L}\{t\vert {\cal T}=\tau\}=L\{t\}+\sum\limits^{n}_{{i}=1} {W}[\tau_{\rm i}], $$

where L{t} is the logit of prior probability of the tick occurrence (estimated e.g. as a prevalence of infested animals), and W[·] is the ȁ8weight of evidenceȁ9 defined as the natural logarithm of the ratio between (conditional) probabilities that the evidential theme \({\cal T}_{\rm i}\) takes the value τi given the tick presence or absence, respectively:

$$ {W}[\tau_{\rm i}]=\log[{P}\{{\cal T}_{\rm i}=\tau_{\rm i} \vert t\}/{P}\{{\cal T}_{\rm i}=\tau_{\rm i}\vert \bar{t}\}]. $$

Making allowance for the spatial uncertainty of the true origin of ticks attached to pastoral livestock, the conditional probabilities were related to a broader neighbourhood around sampling sites conceived as a set of (overlapping) discs of a predefined radius centred on each tick sample setting:

\({P}^{\ast}\{{\cal T}_{\rm i}=\tau_{\rm i}\vert \hbox{t}\}\) = proportion of total area of the neighbourhood of tick-presence sites occupied by the pattern where \({\cal T}_{\rm i}=\tau _{\rm i}\), and analogically

\({P}^{\ast}\{{\cal T}_{\rm i}=\tau_{\rm i}\vert \bar{t}\}\) = proportion of total area of the neighbourhood of tick-absence sites occupied by the pattern where \({\cal T}_{\rm i}=\tau_{\rm i}\).

This allows for home ranges of the sentinel animals within which each point had the same chance to be the origin of the attached tick(s), and ensures that all potential habitat types in the vicinity of the sampling sites were taken into account. A radius of 12.5 km was selected as a way of minimising the MSD criterion.

Although evidential themes \({\cal T}\) can in principle be of logical (binary) as well as categorical or numerical type, in typical WofE applications both categorical and numerical themes are binary-reclassified prior to estimating the conditional probabilities (Bonham-Carter 1997). In this study we opt for an alternative density function approach to numerical themes (Coolbaugh and Bedell 2006) that is more appropriate in this context. Following Robinson et al. (1997), two probability density functions were considered, one for tick presence (f{x| t}) and one for tick absence (f{x| t¯}), and the weights viewed as the probability density ratio

$$ {W}[x] ={f}\{x\vert t\})/{f}\{x\vert \bar{t}\}). $$

The probability density functions were estimated analogously to simple probabilities shown above from proportions of total neighbourhood areas assigned to appropriate histogram bins of an evidential themeȁ9s data distribution. Each histogram was fitted with a curve using a running medians smoother (Härdle and Steiger 1995) and the area under the curve was normalised. In order to ensure that empty bins would not cause numerical problems, a small constant was added to each bin: as small as a goodness-of-fit test (e.g. Kolmogorov–Smirnov) indicates no significant departure from the initial distribution (generally, an equivalent of 0.01% of total neighbourhoodsȁ9 area was sufficient). For values completely absent from a calibration set, it guarantees the neutral weight 0.

Monte Carlo test was used to assess confidence bands of the probability density functions. Briefly, under the null hypothesis of complete spatial randomness (H0) both tick-presence and tick-absence sites originate in a common spatial process and the division into the two sub-sets is random. Thus, by randomly redistributing the sites between two pseudo-presence/absence sub-sets and by submitting them to the same analysis as the actual data, further estimates of the probability density functions under H0 were obtained. The 2.5 and 97.5 percentiles registered in each bin in a series of such simulations defined 95-% point-wise tolerance intervals under H0 (Kelsall and Diggle 1995) (Fig. 3).

Fig. 3
figure 3

An illustration of probability density functions employed in the WofE model with the R. appendiculatus even sub-set and mean temperature in November as an example: (a) plots probability density functions associated with the tick presence (thick line) and absence (thin line), (b) shows 95%-confidence limits (dashed) of a random model, (c) plots smoothed probability density functions, and (d) shows the corresponding diagram of weights of evidence. Notice that a preference of the tick for colder habitats manifests itself as a skew of the presence density function towards lower temperatures and that in a range of ca. 20–23°C it reaches the significance limit; this tendency is cleared of statistical noise, and appears as a peak of the weights positively discriminating habitats within this temperature range. Multiple such non-correlated pieces of evidence can put together a picture of suitable habitats, which is the principle on which the weights-of-evidence method works

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zeman, P., Lynen, G. Evaluation of four modelling techniques to predict the potential distribution of ticks using indigenous cattle infestations as calibration data. Exp Appl Acarol 39, 163–176 (2006). https://doi.org/10.1007/s10493-006-9001-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10493-006-9001-x

Key words

Navigation