Skip to main content

Soil Classification Based on Physical and Chemical Properties Using Random Forests

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11804))

Abstract

Soil classification is a method of encoding the most relevant information about a given soil, namely its composition and characteristics, in a single class, to be used in areas like agriculture and forestry. In this paper, we evaluate how confidently we can predict soil classes, following the World Reference Base classification system, based on the physical and chemical characteristics of its layers. The Random Forests classifier was used with data consisting of 6 760 soil profiles composed by 19 464 horizons, collected in Mexico. Four methods of modelling the data were tested (i.e., standard depths, n first layers, thickness, and area weighted thickness). We also fine-tuned the best parameters for the classifier and for a k-NN imputation algorithm, used for addressing problems of missing data. Under-represented classes showed significantly worse results, by being repeatedly predicted as one of the majority classes. The best method to model the data was found to be the n first layers approach, with missing values being imputed with k-NN (\(k=1\)). The results present a Kappa value from 0.36 to 0.48 and were in line with the state of the art methods, which mostly use remote sensing data.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://soilgrids.org.

  2. 2.

    https://www.isric.org/.

  3. 3.

    https://scikit-learn.org/.

References

  1. Arrouays, D., McKenzie, N., de Forges, A.R., et al.: GlobalSoilMap: Basis of the Global Spatial Soil Information System. CRC Press, Leiden (2014)

    Book  Google Scholar 

  2. Batjes, N.H., Ribeiro, E., Oostrum, A.v., et al.: Wosis: providing standardised soil profile data for the world. Earth Syst. Sci. Data 9(1), 1–14 (2017)

    Article  Google Scholar 

  3. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  Google Scholar 

  4. Brungard, C.W., Boettinger, J.L., Duniway, M.C., et al.: Machine learning for predicting soil classes in three semi-arid landscapes. Geoderma 239, 68–83 (2015)

    Article  Google Scholar 

  5. Congalton, R.G.: A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 37(1), 35–46 (1991)

    Article  Google Scholar 

  6. Crookston, N.L., Finley, A.O.: yaimpute: an R package for KNN imputation. J. Stat. Softw. 23(10), 16 (2008)

    Article  Google Scholar 

  7. Hengl, T., de Jesus, J.M., Heuvelink, G.B., et al.: Soilgrids250m: global gridded soil information based on machine learning. PLoS ONE 12(2), e0169748 (2017)

    Article  Google Scholar 

  8. Hengl, T., Nussbaum, M., Wright, M.N., et al.: Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables. PeerJ 6, e5518 (2018)

    Article  Google Scholar 

  9. Heung, B., Ho, H.C., Zhang, J., et al.: An overview and comparison of machine-learning techniques for classification purposes in digital soil mapping. Geoderma 265, 62–77 (2016)

    Article  Google Scholar 

  10. Hounkpatin, K.O., Schmidt, K., Stumpf, F., et al.: Predicting reference soil groups using legacy data: a data pruning and random forest approach for tropical environment (Dano catchment, Burkina Faso). Sci. Rep. 8(1), 9959 (2018)

    Article  Google Scholar 

  11. IUSS Working Group WRB: World reference base for soil resources 2014, update 2015 international soil classification system for naming soils and creating legends for soil maps. World Soil Resources Reports No. 106, p. 192 (2015)

    Google Scholar 

  12. Jeune, W., Francelino, M.R., de Souza, E., et al.: Multinomial logistic regression and random forest classifiers in digital mapping of soil classes in Western Haiti. Rev. Bras. Cienc. Solo 42, e0170133 (2018)

    Article  Google Scholar 

  13. Meier, M., Souza, E.d., Francelino, M.R., et al.: Digital soil mapping using machine learning algorithms in a tropical mountainous area. Revista Brasileira de Ciência do Solo 42, e0170421 (2018). http://dx.doi.org/10.1590/18069657rbcs20170421

  14. Soil Survey Staff USA: Soil taxonomy: a basic system of soil classification for making and interpreting soil surveys. US Government Printing Office (1999)

    Google Scholar 

Download references

Acknowledgments

This research was supported through Fundação para a Ciência e Tecnologia (FCT), through the project grant with reference PTDC/CCI-CIF/32607/2017 (MIMU), as well as through the INESC-ID (UID/CEC/50021/2019) and NOVA LINCS (UID/CEC/04516/2019) multi-annual funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Didier Dias .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dias, D., Martins, B., Pires, J., de Sousa, L.M., Estima, J., Damásio, C.V. (2019). Soil Classification Based on Physical and Chemical Properties Using Random Forests. In: Moura Oliveira, P., Novais, P., Reis, L. (eds) Progress in Artificial Intelligence. EPIA 2019. Lecture Notes in Computer Science(), vol 11804. Springer, Cham. https://doi.org/10.1007/978-3-030-30241-2_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-30241-2_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-30240-5

  • Online ISBN: 978-3-030-30241-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics