Skip to main content
Log in

Applying XGBoost and SHAP to Open Source Data to Identify Key Drivers and Predict Likelihood of Wolf Pair Presence

  • Published:
Environmental Management Aims and scope Submit manuscript

Abstract

Wolves have returned to Germany since 2000. Numbers have grown to 209 territorial pairs in 2021. XGBoost machine learning, combined with SHAP analysis is applied to predict German wolf pair presence in 2022 for 10 × 10 km grid cells. Model input consisted of 38 variables from open sources, covering the period 2000 to 2021. The XGBoost model predicted well, with 0.91 as the AUC. SHAP analysis ranked the variables: distance to the closest neighboring wolf pair was the main driver for a grid cell to become occupied by a wolf pair. The clustering tendency of related wolves seems to be an important explanatory factor here. Second was the percentage of wooded area. The next eight variables related to wolf presence in the preceding year, except at fifth, eighth and tenth position in the total order: human density (square root) in the grid, percentage arable land and road density respectively. Other variables including the occurrence of wild prey were the weakest predictors. The SHAP analysis also provided crucial added value in identifying a variable that had threshold values where its contribution to the prediction changed from positive to negative or vice versa. For instance, low density of people increased the probability of wolf pair presence, whereas a high density decreased this probability. Cumulative lift techniques showed that the model performed almost four times better than random prediction. The combination of XGBoost, SHAP and cumulative lift techniques is new in wolf management and conservation, allowing for the focusing of educational and financial resources.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

Download references

Acknowledgements

First and foremost, we would like to express our gratitude to the members of our research team, Simone Spierings, Kjeld Vissers, Philip Vermeij and Erik van Nistelrooij, who provided valuable input, insights, and assistance at every stage of the project. Their contributions were critical to the success of this research, and we are deeply grateful for their hard work and dedication. We also thank dr Markus Ritz for allowing us to use the DBBW (Dokumentations- und Beratungsstelle des Bundes zum Wolf) data, that are provided to the DBBW by the German federal states.

Author information

Authors and Affiliations

Authors

Contributions

JS: Methodology, Validation, Formal analysis, Writing—Original draft, Visualization, Project administration. JN: Methodology, Validation, Formal analysis, Writing—Review & editing. TS: Formal analysis, Writing—Review & editing. NO: Conceptualization, Writing—Review & editing. DvanL: Conceptualization, Writing—Original draft All authors reviewed the manuscript.

Corresponding author

Correspondence to Jeanine Schoonemann.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Schoonemann, J., Nagelkerke, J., Seuntjens, T.G. et al. Applying XGBoost and SHAP to Open Source Data to Identify Key Drivers and Predict Likelihood of Wolf Pair Presence. Environmental Management 73, 1072–1087 (2024). https://doi.org/10.1007/s00267-024-01941-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00267-024-01941-1

Keywords

Navigation