Abstract
Context
The contribution of forest ecosystem services to human well-being varies over space following the dynamics in forest cover. Use of machine learning models is increasing in projecting forest cover changes and investigating the drivers, yet references are still lacking for selecting machine learning models for spatial projection of forest cover patterns.
Objectives
We assessed the ability of nonparametric machine learning techniques to project the spatial distribution of forest cover and identify its drivers using a case study of Tasmania, Australia.
Methods
We developed, evaluated, and compared the performance of four nonparametric machine learning models: support vector regression (SVR), artificial neural networks (ANN), random forest (RF), and gradient boosted regression trees (GBRT).
Results
The results demonstrated that RF far outperformed the other three models in both fitting and projection accuracy, and required less computional costs. GBRT outperformed SVR and ANN in projection accuracy. However, RF exhibited serious overfitting due to the full growth of its decision trees. The influence rankings of explanatory variables on spatial patterns of forest cover were different under the four models. Land tenure type and rainfall were identified among the top four most influential variables by all four models. The ranking produced by the RF model was significantly different with topographic factors associated with land clearing and production costs (elevation and distance to timber facilities) being the two most influential variables.
Conclusions
We encourage practitioners to consider nonparametric machine learning methods, especially RF, when facing problems of complex environmental data modelling.
Similar content being viewed by others
References
ABARES (2014) Tenure of Australia's Forest (2013) v2.0. Australian Bureau of Agricultural and Resource Economics and Sciences, Canberra
ACLEP (2014) National soil data provided by the Australian Collaborative Land Evaluation Program
Andam KS, Ferraro PJ, Pfaff A, Sanchez-Azofeifa GA, Robalino JA (2008) Measuring the effectiveness of protected area networks in reducing deforestation. Proc Natl Acad Sci 105(42):16089–16094
Australian Bureau of Meteorology (2015b) Climate Data Online
Baskent EZ, Kadiogullari AI (2007) Spatial and temporal dynamics of land use pattern in Turkey: a case study in Inegol. Landsc Urban Plan 81(4):316–327
Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn Res 13:281–305
Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, Belmont
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Breiman L, Friedman J, Olshen R, Stone C (1984) Classification and regression trees, Wadsworth Int. Group 37(15):237–251
Brereton RG, Lloyd GR (2010) Support vector machines for classification and regression. Analyst 135(2):230–267
Bryan BA (2013) High-performance computing tools for the integrated assessment and modelling of social–ecological systems. Environ Modell Softw 39:295–303
Caccetta P, Furby S, Wallace J, Wu X, Richards G, Waterworth R (2012) Long-term monitoring of australian land cover change using landsat data. In: Proceedings of the Global Forest Monitoring from Earth Observation, pp 243–258
Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol (TIST) 2(3):1–27
Cohen WB, Maiersperger TK, Spies TA, Oetter DR (2001) Modelling forest cover attributes as continuous variables in a regional context with Thematic Mapper data. Int J Remote Sens 22(12):2279–2310
Cracknell MJ, Reading AM (2014) Geological mapping using remote sensing data: a comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information. Comput Geosci 63:22–33
Cutler DR, Edwards TC Jr, Beard KH, Cutler A, Hess KT, Gibson J, Lawler JJ (2007) Random forests for classification in ecology. Ecology 88(11):2783–2792
Department of Agriculture and Water Resources (2013) Australia's indigenous forest estate (2013)
Department of the Environment (2014) Collaborative Australian Protected Areas Database (CAPAD)
Dong M, Bryan BA, Connor JD, Nolan M, Gao L (2015) Land use mapping error introduces strongly-localised, scale-dependent uncertainty into land use and ecosystem services modelling. Ecosyst Serv 15:63–74
Dormann CF, Elith J, Bacher S, Buchmann C, Carl G, Carré G, Marquéz JRG, Gruber B, Lafourcade B, Leitão PJ, Münkemüller T, McClean C, Osborne PE, Reineking B, Schröder B, Skidmore AK, Zurell D, Lautenbach S (2013) Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography 36(1):27–46
Du G, Shin KJ, Yuan L, Managi S (2018) A comparative approach to modelling multiple urban land use changes using tree-based methods and cellular automata: the case of Greater Tokyo Area. Int J Geograph Inf Sci 32(4):757–782
Fan RE, Chen PH, Lin C-J (2005) Working set selection using second order information for training support vector machines. J Mach Learn Res 6:1889–1918
Ferretti-Gallon K, Busch J (2014) What drives deforestation and what stops it? A meta-analysis of spatially explicit econometric studies. A Meta-Analysis of Spatially Explicit Econometric Studies
Foley JA, DeFries R, Asner GP, Barford C, Bonan G, Carpenter SR, Chapin FS, Coe MT, Daily GC, Gibbs HK, Helkowski JH, Holloway T, Howard EA, Kucharik CJ, Monfreda C, Patz JA, Prentice IC, Ramankutty N, Snyder PK (2005) Global consequences of land use. Science 309(5734):570–574
Forest Practices Authority (2017) State of the forests Tasmania 2017, booklet by the Forest Practices Authority, Hobart, Tasmania
Freeman EA, Moisen GG, Coulston JW, Wilson BT (2015) Random forests and stochastic gradient boosting for predicting tree canopy cover: comparing tuning processes and model performance. Can J For Res 46(3):323–339
Freitas SR, Hawbaker TJ, Metzger JP (2010) Effects of roads, topography, and land use on forest cover dynamics in the Brazilian Atlantic Forest. For Ecol Manag 259(3):410–417
Freund Y (1996) Schapire RE Experiments with a new boosting algorithm. Icml 96:148–156
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 25:1189–1232
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38(4):367–378
Gallant J, Wilson N, Dowling T, Read A, Inskeep C (2011) SRTM-derived one second digital elevation models version 1.0. Geoscience Australia, Canberra
Gao L, Bryan BA (2016) Incorporating deep uncertainty into the elementary effects method for robust global sensitivity analysis. Ecol Modell 321:1–9
Gao L, Bryan BA (2017) Finding pathways to national-scale land-sector sustainability. Nature 544(7649):217–222
Gao L, Bryan BA, Liu J, Li W, Chen Y, Liu R, Barrett D (2017) Managing too little and too much water: robust mine-water management strategies under variable climate and mine conditions. J Clean Prod 162:1009–1020
Gao L, Bryan BA, Nolan M, Connor JD, Song X, Zhao G (2016) Robust global sensitivity analysis under deep uncertainty via scenario analysis. Environ Modell Softw 76:154–166
Gao L, Ding Y-S, Ren L-H (2004) A novel ecological network-based computation platform as grid middleware system. Int J Intell Syst 19(10):859–884
Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63(1):3–42
Giriraj A, Irfan-Ullah M, Murthy MSR, Beierkuhnlein C (2008) Modelling spatial and temporal forest cover change patterns (1973–2020): a case study from South Western Ghats (India). Sensors 8(10):6132–6153
GISCA (2001) Accessibility/Remoteness Index of Australia (ARIA). Canberra
Gleason CJ, Im J (2012) Forest biomass estimation from airborne LiDAR data using machine learning approaches. Remote Sens Environ 125:80–91
Hansen MC, DeFries RS (2004) Detecting long-term global forest change using continuous fields of tree-cover maps from 8-km advanced very high resolution radiometer (AVHRR) data for the years 1982–99. Ecosystems 7(7):695–716
Hastie T, Tibshirani R, Friedman J (2009) Unsupervised learning. The elements of statistical learning. Springer, New York, pp 485–585
Hu X, Wu C, Hong W, Qiu R, Li J, Hong T (2014) Forest cover change and its drivers in the upstream area of the Minjiang River, China. Ecol Indic 46:121–128
Huang C, Song K, Kim S, Townshend JR, Davis P, Masek JG, Goward SN (2008) Use of a dark object concept and support vector machines to automate forest cover change analysis. Remote Sens Environ 112(3):970–985
Huang X, Gao L, Crosbie RS, Zhang N, Fu G, Doble R (2019) Groundwater recharge prediction using linear regression, multi-layer perception network, and deep learning. Water 11(9):1879
Jain RK, Smith KM, Culligan PJ, Taylor JE (2014) Forecasting energy consumption of multi-family residential buildings using support vector regression: investigating the impact of temporal and spatial monitoring granularity on performance accuracy. Appl Energy 123:168–178
Jiang Z, Mallants D, Peeters L, Gao L, Soerensen C, Mariethoz G (2019) High-resolution paleovalley classification from airborne electromagnetic imaging and deep neural network training using digital elevation model data. Hydrol Earth Syst Sci 23(6):2561–2580
Jones DA, Wang W, Fawcett R (2009) High-quality spatial climate data-sets for Australia. Aust Meteorol Oceanogr J 58(4):233–248
Keerthi SS, Shevade SK, Bhattacharyya C, Murthy KRK (2001) Improvements to Platt's SMO algorithm for SVM classifier design. Neural Computat 13(3):637–649
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980
Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the Ijcai. vol 14. Montreal, Canada, pp 1137–1145
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Kumar R, Nandy S, Agarwal R, Kushwaha S (2014) Forest cover dynamics analysis and prediction modeling using logistic regression model. Ecol Indic 45:444–455
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
LeCun YA, Bottou L, Orr GB, Müller K-R (2012) Efficient backprop. Neural networks: Tricks of the trade. Springer, New York, pp 9–48
Lehmann EA, Wallace JF, Caccetta PA, Furby SL, Zdunic K (2013) Forest cover trends from time series Landsat data for the Australian continent. Int J Appl Earth Observ Geoinf 21:453–462
Leinenkugel P, Wolters ML, Kuenzer C, Oppelt N, Dech S (2014) Sensitivity analysis for predicting continuous fields of tree-cover and fractional land-cover distributions in cloud-prone areas. Int J Remote Sens 35(8):2799–2821
Lin S, Jiang Y, He J, Ma G, Xu Y, Jiang H (2010s) Changes in the spatial and temporal pattern of natural forest cover on Hainan Island from the 1950s to the 2010s: implications for natural forest conservation and management. PeerJ 5:e3320
Lin Y-P, Chu H-J, Wu C-F, Verburg PH (2011) Predictive ability of logistic regression, auto-logistic regression and neural network models in empirical land-use change modeling–a case study. Int J Geographys Inf Sci 25(1):65–87
Ludwig M, Morgenthal T, Detsch F, Higginbottom TP, Valdes ML, Nauß T, Meyer H (2019) Machine learning and multi-sensor based modelling of woody vegetation in the Molopo Area, South Africa. Remote Sens Environ 222:195–203
Makinano-Santillan MM, Santillan JR, Paringit EC (2001) Merging landsat image information with georeferenced biophysical and socio-economical datasets to describe forest cover change in a philippine province
Marcos-Martinez R, Bryan BA, Connor JD, King D (2017) Agricultural land-use dynamics: assessing the relative importance of socioeconomic and biophysical drivers for more targeted policy. Land Use Policy 63:53–66
Marcos-Martinez R, Bryan BA, Schwabe KA, Connor JD, Law EA (2018) Forest transition in developed agricultural regions needs efficient regulatory policy. For Policy Econ 86:67–75
Marcos-Martinez R, Bryan BA, Schwabe KA, Connor JD, Law EA, Nolan M, Sánchez JJ (2019) Projected social costs of CO2 emissions from forest losses far exceed the sequestration benefits of forest gains under global change. Ecosyst Serv 37:100935
Marinoni O, Navarro Garcia J, Marvanek S, Prestwidge D, Clifford D, Laredo LA (2012) Development of a system to produce maps of agricultural profit on a continental scale: an example for Australia. Agric Syst 105(1):33–45
Mas J-F, Puig H, Palacio JL, Sosa-López A (2004) Modelling deforestation using GIS and artificial neural networks. Environ Modell Softw 19(5):461–471
Mayfield H, Smith C, Gallagher M, Hockings M (2017) Use of freely available datasets and machine learning methods in predicting deforestation. Environ Modell Softw 87:17–28
McMichael A, Scholes R, Hefny M, Pereira E, Palm C, Foale S (2005) Linking ecosystem services and human well-being. In: Capistrano D, Samper K, Cristián L, Marcus J, Raudsepp-Hearne C (eds) Ecosystems and human well-being: multi-scale assessments, MIllenium Ecosystem Assessment Series 4. Island Press, Washington DC, pp 43–60
Micheletti N, Foresti L, Robert S, Leuenberger M, Pedrazzini A, Jaboyedoff M, Kanevski M (2014) Machine learning feature selection methods for landslide susceptibility mapping. Math Geosci 46(1):33–57
Nahib I, Suryanta J (2017) Forest cover dynamics analysis and prediction modelling using logistic regression model (case study: forest cover at Indragiri Hulu Regency, Riau Province). In: IOP Conference Series: Earth and Environmental Science. vol 54. IOP Publishing, p 012044
Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. J Artif Intell Res 11:169–198
Palmate S, Pandey A, Kumar D, Pandey R, Mishra S (2017) Climate change impact on forest cover and vegetation in Betwa Basin, India. Appl Water Sci 7(1):103–114
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830
Peeters LJM, Pagendam DE, Crosbie RS, Rachakonda PK, Dawes WR, Gao L, Marvanek SP, Zhang YQ, McVicar TR (2018) Determining the initial spatial extent of an environmental impact assessment with a probabilistic screening methodology. Environ Modell Softw 109:353–367
Pijanowski BC, Brown DG, Shellito BA, Manik GA (2002) Using neural networks and GIS to forecast land use changes: a land transformation model. Comput Environ Urban Syst 26(6):553–575
Platt JC (1999) 12 fast training of support vector machines using sequential minimal optimization. In: Schölkopf B, Burges CJC, Smola AJ (eds) Advances in kernel methods. MIT Press, Cambridge, pp 185–208
Pressey R, Whish G, Barrett T, Watts M (2002) Effectiveness of protected areas in north-eastern New South Wales: recent trends in six measures. Biol Conserv 106(1):57–69
Russell SJ, Norvig P (2016) Artificial intelligence: a modern approach. Pearson Education Limited, Malaysia
Schwieder M, Leitão P, Suess S, Senf C, Hostert P (2014) Estimating fractional shrub cover using simulated EnMAP data: a comparison of three machine learning regression techniques. Remote Sens 6(4):3427–3445
Simard M, Pinto N, Fisher JB, Baccini A (2011) Mapping forest canopy height globally with spaceborne lidar. J Geophys Res 116(G4):G04021
Smola AJ, Schölkopf B (2004) A tutorial on support vector regression. Stat Comput 14(3):199–222
Sola J, Sevilla J (1997) Importance of input data normalization for the application of neural networks to complex industrial problems. IEEE Trans Nucl Sci 44(3):1464–1468
Sun H, Wang Q, Wang G, Lin H, Luo P, Li J, Zeng S, Xu X, Ren L (2018) Optimizing kNN for mapping vegetation cover of arid and semi-arid areas using landsat images. Remote Sens 10(8):1248
Taieb SB, Hyndman RJ (2014) A gradient boosting approach to the Kaggle load forecasting competition. Int J Forecast 30(2):382–394
Wang X, Huang H, Gong P, Biging GS, Xin Q, Chen Y, Yang J, Liu C (2016) Quantifying multi-decadal change of planted forest cover using airborne LiDAR and Landsat imagery. Remote Sens 8(1):62
Wei B, Hao K, Gao L, Tang X (2020) Bio-inspired visual integrated model for multi-label classification of textile defect images. IEEE Trans Cognit Dev Syst. https://doi.org/10.1109/TCDS.2020.2977974
Were K, Bui DT, Dick ØB, Singh BR (2015) A comparative assessment of support vector regression, artificial neural networks, and random forests for predicting and mapping soil organic carbon stocks across an Afromontane landscape. Ecol Indic 52:394–403
Wheeler D, Hammer D, Kraft R, Dasgupta S, Blankespoor B (2013) Economic dynamics and forest clearing: a spatial econometric analysis for Indonesia. Ecol Econ 85:85–96
Ye L, Gao L, Marcos-Martinez R, Mallants D, Bryan BA (2019) Projecting Australia's forest cover dynamics and exploring influential factors using deep learning. Environ Modell Softw 119:407–417
Zhou W, Huang G, Pickett STA, Cadenasso ML (2011) 90 years of forest cover change in an urbanizing watershed: spatial and temporal dynamics. Landsc Ecol 26(5):645
Acknowledgements
B. Liu and B. Li were supported by the Fundamental Research Funds for the Central Universities (20CX05006A). L. Gao was supported by a CSIRO Julius award and a CSIRO 2018/19 Land and Water Appropriation Project. The authors thank Dr. Xin Huang for his technical support in the map visualization work. The authors would also like to thank two anonymous reviewers for their constructive comments, which have been very helpful for improving this manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Liu, B., Gao, L., Li, B. et al. Nonparametric machine learning for mapping forest cover and exploring influential factors. Landscape Ecol 35, 1683–1699 (2020). https://doi.org/10.1007/s10980-020-01046-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10980-020-01046-0