Abstract
The American marten is a species that is dependent on old conifer forest at middle to high elevations and is highly sensitive to habitat loss and fragmentation. Our goal was to compoare logistic regression and random forest in multi-scale optimized predictive model of occurrence of the American marten (Martes americana) in northern Idaho USA. There have been relatively few formal comparisons of the performance of multi-scale modeling between logistic regression and random forest, but those that have been conducted have found that random forest out-performs logistic regression. There was substantial similarity in the qualitative interpretation of the logistic regression and the random forest model and both found that occurrence was strongly predicted by a unimodal function of elevation, a non-linear function of canopy cover, a non-linear function of patch density, and the extent of the landscape in large conifer forest. Visual inspection of the predicted occurrence probability maps shows that random forest produces predictions that are more discriminatory, with higher range of predicted probability and higher spatial heterogeneity than logistic regression. The logistic regression model has an AUC of 0.701, while the random forest model had an AUC of 0.981, indicating very high predictive ability, and a much stronger ability to predict presences and absences in the training dataset than the logistic regression model. Expressed as a percentage, the random forest model had 28% higher performance, leading to much better prediction of habitat suitability, better inferences about habitat variables influencing marten occurrence, improved identification of scale dependency, and ultimately, therefore, better guidance to conservation and management.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Baccini A, Goetz SJ, Walker WS et al (2012) Estimated carbon dioxide emissions from tropical deforestation improved by carbon-density maps. Nat Clim Chang 2(3):182–185
Blaszczynski JS (1997) Landform characterization with geographic information systems. Photogramm Eng Remote Sens 63(2):183–191
Breiman L (2001a) Random Forests. Mach Learn 45(1):5–32
Breiman L (2001b) Statistical modeling: the two cultures (with comments and a rejoinder by the author). Stat Sci 16:199–231
Buskirk SW, Ruggiero LF (1994) The American marten. In: Ruggiero LF, Aubry KB, Buskirk SW, Lyon LJ, Zielinski WJ (eds) American marten, fisher, lynx, and wolverine in the western United States. Gen. Tech. Rep. RM-254. U.S. Department of Agriculture, Forest Service, Rocky Mountain Forest and Range Experiment Station, Fort Collins
Chambers CL, Cushman SA, Medina-Fitoria A, Martinez-Fonesca J (2016) Influences of scale on bat habitat relationships in a forested landscape in Nicaragua. Landsc Ecol 31:1299–1318
Chapin TG, Hamson DJ, Katnik DD (1998) In audience FH Is that the correct title? of landscape pattern on habitat use by American marten in an industrial forest. Conserv Biol 12:1327–1337
Cohen J (1968) Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. Psychol Bull 70(4):213–220
Crookston NL, Finley AO (2008) yaImpute: An r package for knn imputation. J Stat Softw 23(10):1–16
Cushman SA, Gutzwiller K, Evans JS, McGarigal K (2010) The gradient paradigm: a conceptual and analytical framework for landscape ecology. In: Cushman SA, Huettman F (eds) Spatial complexity, informatics, and wildlife conservation. Springer, Tokyo, pp 83–108
Cushman SA, Macdonald EA, Landguth EL, Halhi Y, Macdonald DW (2017) Multiple-scale prediction of forest-loss risk across Borneo. Landsc Ecol 32:1581–1598
Cushman SA, Raphael MG, Ruggiero LF, Shirk AJ, Wasserman TN, O’Doherty EC (2011) Limiting factors and landscape connectivity: The American marten in the rocky mountains. Landsc Ecol 26:1137–1149
Cushman SA, Shirk AJ, Landguth EL (2013) Landscape genetics and limiting factors. Conserv Genet 14:263–274
Cutler DR, Edwards TC, Beard KH et al (2007) Random forests for classification ecology. Ecology 88(11):2783–2792
De'ath G, Fabricius KE (2000) Classification and Regression Trees: A powerful yet simple technique for ecological data analysis. Ecology 81(11):3178–3192
Drew CA, Wiersma YF, Huettmann F (eds) (2010) Predictive species and habitat modeling in landscape ecology: concepts and applications. Springer Science & Business Media, New York
Evans JS, Cushman SA (2009) Gradient modeling of conifer species using random forests. Landsc Ecol 24(5):673–683
Evans JS, Murphy MA, Holden ZA, Cushman SA (2011) Modeling species distribution and change using random forest. In: Drew CA (ed) Predictive species and habitat modeling in landscape ecology: concepts and applications. Springer, New York
Evans JS, Oakleaf J (2012) Geomorphometry & Gradient Metrics Toolbox (ArcGIS 10.0)
Evans JS, Oakleaf J, Cushman SA, Theobald DM (2014) An ArcGIS toolbox for surface gradient and geomorphometric modeling, version 2.0-0. Accessed:2015 Dec 2nd. http://evansmurphy.wix.com/evansspatial
Fuller AK, Harrison DJ (2005) Influence of partial timber harvesting on American martens in north-central Maine. J Wildl Manag 69:710–722
Godbout G, Ouellet JP (2008) Habitat selection of American marten in a logged landscape at the southern fringe of the boreal forest. Ecoscience 15:332–342
Grand J, Buonaccorsi J, Cushman SA, Griffin CR, Neel MC (2004) A multiscale landscape approach to predicting bird and moth rarity hotspots in a threatened pitch pine–scrub oak community. Conserv Biol 18(4):1063–1077
Grimm R, Behrens T, Märker M, Elsenbeer H (2008) Soil organic carbon concentrations and stocks on Barro Colorado Island—digital soil mapping using random forests analysis. Geoderma 146(1):102–113
Hargis CD, Bissonette JA, Turner DL (1999) The influence of forest fragmentation and landscape pattern on American martens. J Appl Ecol 36:157–172
Hargis CD, McCullough DR (1984) Winter diet and habitat selection of marten in Yosemite National Park. J Wildl Manag 48:140–146
Hegel TM, Cushman SA, Evans J, Huettmann F (2010) Current state of the art for statistical modelling of species distributions. In: Cushman SA, Huettman F (eds) Spatial complexity, informatics and wildlife conservation. Springer, Tokyo, pp 273–312
Liaw A, Wiener M (2002) Classification and regression by random. Forest R news 2(3):18–22
McGarigal K, Cushman SA, Ene E (2012) FRAGSTATS v4: Spatial Pattern Analysis Program for Categorical and Continuous Maps. Computer software program produced by the authors at the University of Massachusetts, Amherst. Available at the following web site: http://www.umass.edu/landeco/research/fragstats/fragstats.html
McGarigal K, Wan HY, Zeller KA, Timm BC, Cushman SA (2016) Multi-scale habitat modeling: A review and outlook. Landsc Ecol 31:1161–1175
Mi C, Huettmann F, Guo Y, Han X, Wen L (2017) Why to choose Random Forest to predict rare species distribution with few samples in large undersampled areas? Three Asian crane species models provide supporting evidence. Peerj 5:e2849
Murphy MA, Evans JS, Storfer A (2010) Quantifying Bufo boreas connectivity in Yellowstone National Park with landscape genetics. Ecology 91(1):252–261
Pontius RG Jr, Milones M (2011) Death to Kappa: Birth of quality disagreement and allocation disagreement for accuracy assessment. Int J Remote Sens 32:4407–4429
Pontius RG Jr, Parmentier B (2014) Recommendations for using the relative operating characteristic (ROC). Landsc Ecol 29:367–382
Pontius RG Jr, Si K (2014) The total operating characteristic to measure diagnostic ability for multiple thresholds. Int J Geogr Inf Sci 28:570–583
Pontius RG Jr, Walker R, Yao-Kumah R, Arima E, Aldrich S, Caldas M, Vergara D (2014) Accuracy assessment for a simulation model of Amazonian deforestation. Ann Assoc Am Geogr 97:677–695
R Development Core Team (2008) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Robinson L, Cushman SA, Lucid M (2017) Winter bait stations as a multi-species survey tool. Ecol Evol 7:6826–6838
Rodriguez-Galiano VF, Ghimire B, Rogan J, Chica-Olmo M, Rigol-Sanchez JP (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogramm Remote Sens 67:93–104
Samuel A. Cushman, Nicholas B. Elliot, Dominik Bauer, Kristina Kesch, Laila Bahaa-el-din, Helen Bothwell, Michael Flyman, Godfrey Mtare, David W. Macdonald, Andrew J. Loveridge (2018). Prioritizing core areas, corridors and conflict hotspots for lion conservation in southern Africa. July 5, https://doi.org/10.1371/journal.pone.0196213
Schneider A (2012) Monitoring land cover change in urban and peri-urban areas using dense time stacks of Landsat satellite data and a data mining approach. Remote Sens Environ 124:689–704
Shirk AS, Raphael MG, Cushman SA (2014) Spatiotemporal variation in resource selection: Insights from the American Marten (Martes americana). Ecol Appl 24:1434–1444
Svetnik V, Liaw A, Tong C, Wang T (2004) Application of breiman’s random forest to modeling structure-activity relationships of pharmaceutical molecules. In: Roli F, Kittler J, Windeatt T (eds) Multiple classifier systems, lecture notes in computer science. Springer, Berlin/Heidelberg, pp 334–343
Thompson CM, McGarigal K (2002) The influence of research scale on bald eagle habitat selection along the lower Hudson River, New York. Landsc Ecol 17:569–586
Tomson SD (1999) Ecology and summer/fall habitat selection of American marten in northern Idaho. University of Montana. Thesis, Missoula, p 80
Wasserman TN, Cushman SA, Schwartz MK, Wallin DO (2010) Spatial scaling and multi-model inference in landscape genetics: Martes americana in northern Idaho. Landsc Ecol 25:1601–1612
Wasserman TN, Cushman SA, Wallin DO, Hayden J (2012a) Multi scale habitat relationships of Martes americana in northern Idaho, USA. Research Paper RMRSRP-94. USDA Forest Service, Rocky Mountain Forest and Range Experimental Station, Fort Collins
Wasserman TN, Cushman SA, Shirk AS, Landugth EL, Littell JS (2012b) Simulating the effects of climate change on population connectivity of American marten (Martes americana) in the northern Rocky Mountains, USA. Landsc Ecol. https://doi.org/10.1007/s10980-011-9653-8
Wiens JA (1989) Spatial scaling in ecology. Funct Ecol 3(4):385–397
Wilbert CJ, Buskirk SW, Gerow KG (2000) Effects of weather and snow on habitat selection by American martens (Martes americana). Can J Zool 78:1691–1696
Wynne KM, Sherburne JA (1984) Summer home range use by adult marten in northwestern Maine. Can J Zool 62:941–943
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Cushman, S.A., Wasserman, T.N. (2018). Landscape Applications of Machine Learning: Comparing Random Forests and Logistic Regression in Multi-Scale Optimized Predictive Modeling of American Marten Occurrence in Northern Idaho, USA. In: Humphries, G., Magness, D., Huettmann, F. (eds) Machine Learning for Ecology and Sustainable Natural Resource Management. Springer, Cham. https://doi.org/10.1007/978-3-319-96978-7_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-96978-7_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-96976-3
Online ISBN: 978-3-319-96978-7
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)