Skip to main content
Log in

Combination of machine learning and VIRS for predicting soil organic matter

  • Soils, Sec 2 • Global Change, Environ Risk Assess, Sustainable Land Use • Research Article
  • Published:
Journal of Soils and Sediments Aims and scope Submit manuscript

Abstract

Purpose

Visible-near-infrared spectroscopy (VIRS) is one of the most promising alternative techniques for soil organic matter (SOM) due to its direct response. In this study, partial least squares regression (PLSR), support vector machine (SVM), artificial neural networks (ANNs), and Cubist combined with VIRS were utilized to develop the calibration model and evaluate the ability of machine learning models to predict soil organic matter content.

Materials and methods

A total of 190 surface soil samples (earth-cumulic-orthic anthrosols) were collected from the Weihe Plain of Shaanxi Province, China. The Kennard–Stone (KS) algorithm was employed to divide them into calibration and validation data. Moreover, the successive projections algorithm (SPA), competitive adaptive weight weighting algorithm (CARS), and their combination (SPA + CARS) were utilized to select characteristic wavelengths and improve the predictive ability of the model. Different evaluation indices, including root mean square error (RMSE), coefficient of determination (R2), the ratio of the performance to deviation (RPD), and the ratio of performance to interquartile range (RPIQ), were adopted to evaluate the accuracy of the model.

Results and discussion

In all cases, the AFS-SPA + CARS-Cubist method outperformed the PLSR, SVM, and ANN. For the Cubist model, the Rv2, RPD, and RPIQ ranged from 0.8629 to 0.9782, 0.8720 to 3.0203, and 2.005 to 4.4164, respectively. According to the results, combining VIRS with Cubist could accurately determine the SOM of earth-cumulic-orthic anthrosol soils of the Weihe Plain, China. Furthermore, SPA + CARS provided more precise calibration–validation models than SPA and CARS.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Baldrian P (2014) Distribution of extracellular enzymes in soils: spatial heterogeneity and determining factors at various scales. Soil Sci Soc Am J 78(1):11

    Article  Google Scholar 

  • Bo S, Rossel RAV, Mouazen AM, Wetterlind J (2010) Chapter five – visible and near infrared spectroscopy in soil science. Adv Agron 107(107):163–215

    Google Scholar 

  • Buondonno A, Amenta P, Viscarra-Rossel RA, Leone AP (2012) Prediction of soil properties with plsr and vis-nir spectroscopy: application to mediterranean soils from southern italy. Curr Anal Chem 8(2):283–299

    Article  Google Scholar 

  • Chang CW, Laird DA, Mausbach MJ, Hurburgh CR (2001) Near-infrared reflectance spectroscopy–principal components regression analyses of soil properties. Soil Sci Soc Am J 65:480–490

    Article  CAS  Google Scholar 

  • Conforti M, Castrignanò A, Robustelli G, Scarciglia F, Stelluti M, Buttafuoco G (2015) Laboratory-based vis–nir spectroscopy and partial least square regression with spatially correlated errors for predicting spatial variation of soil organic matter content. CATENA 124:60–67

    Article  CAS  Google Scholar 

  • Dotto AC, Dalmolin RSD, Grunwald S, Ten Caten A, Pereira Filho W (2017) Two preprocessing techniques to reduce model covariables in soil property predictions by vis-nir spectroscopy. Soil Tillage Res 172:59–68

    Article  Google Scholar 

  • Fontán J, Lópezbellido L, Garcíaolmo J, Lópezbellido R (2011) Soil carbon determination in a mediterranean vertisol by visible and near infrared reflectance spectroscopy. J near Infrared Spectrosc 19(4):253–263

    Article  Google Scholar 

  • Gao Y, Cui L, Lei B, Zhai Y, Shi T, Wang J et al (2014) Estimating soil organic carbon content with visible-near-infrared (vis-nir) spectroscopy. Appl Spectrosc 68(7):712–722

    Article  CAS  Google Scholar 

  • El Haddad J, Villot-Kadri M, Ismaël A, Gallou G, Michel K, Bruyère D et al (2013) Artificial neural network for on-site quantitative analysis of soils using laser induced breakdown spectroscopy. Spectrochim Acta Part B 79:51–57

    Article  Google Scholar 

  • Hagemann N, Joseph S, Schmidt HP, Kammann CI, Harter J, Borch T et al (2017) Organic coating on biochar explains its nutrient retention and stimulation of soil fertility. Nat Commun 8(1):1089

    Article  Google Scholar 

  • Hassan-Esfahani L, Torres-Rua A, Jensen A, Mckee M (2015) Assessment of surface soil moisture using high-resolution multi-spectral imagery and artificial neural networks. Remote Sens 7(3):2627–2646

    Article  Google Scholar 

  • Hong Y, Liu Y, Chen Y, Liu Y, Yu L, Liu Y et al (2019) Application of fractional-order derivative in the quantitative estimation of soil organic matter content through visible and near-infrared spectroscopy. Geoderma 337:758–769

    Article  CAS  Google Scholar 

  • Ji WJ, Li X, Li CX, Zhou Y, Shi Z (2012) Using different data mining algorithms to predict soil organic matter based on visible-near infrared spectroscopy. Spectrosc Spectral Anal 32(9):2393

    CAS  Google Scholar 

  • Kennard RW, Stone LA (1969) Computer aided design of experiments. Technometrics 11(1):137–148

    Article  Google Scholar 

  • Khaled AY, Aziz SA, Bejo SK, Nawi NM, Seman IA (2017) Spectral features selection and classification of oil palm leaves infected by basal stem rot (BSR) disease using dielectric spectroscopy. Comput Electron Agric 144:297–309

    Article  Google Scholar 

  • Khayamim F, Wetterlind J, Khademi H, Robertson J, Faz Cano A, Stenberg B (2015) Using visible and near infrared spectroscopy to estimate carbonates and gypsum in soils in arid and subhumid regions of Isfahan, Iran. J near Infrared Spectrosc 23(3):155–165

    Article  CAS  Google Scholar 

  • Kopačková V, Bendor E, Carmon N, Notesco G (2017) Modelling diverse soil attributes with visible to longwave infrared spectroscopy using PLSR employed by an automatic modelling engine. Remote Sens 9(2):1–21

    Article  Google Scholar 

  • Kwiatkowska-Malina J (2017) Qualitative and quantitative soil organic matter estimation for sustainable soil management. J Soils Sediments 18(8):2801–2812

    Article  Google Scholar 

  • Liu J, Han J, Zhang Y, Wang H, Kong H, Shi L (2018) Prediction of soil organic carbon with different parent materials development using visible-near infrared spectroscopy. Spectrochim Acta Part A 204:33–39

    Article  CAS  Google Scholar 

  • Mangi LJ, Stirling CM, Jat HS, Tetarwal JP, Jat RK, Singh R, Lopez-Ridaura S, Shirsath PB (2018) Soil processes and wheat cropping under emerging climate change scenarios in South Asia. Adv Agron 148:111–171

    Article  Google Scholar 

  • Mcbratney A, Fernandez-Ahumada E, Palagos B, Roger JM (2010) Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy. Trac TrAC Trends Anal Chem 29(9):1073–1081

    Article  Google Scholar 

  • Milne E, Aladamat R, Batjes NH, Bernoux M, Bhattacharyya T, Cerri CC et al (2007) National and sub-national assessments of soil organic carbon stocks and changes: the GEFSOC modelling system. Agric Ecosyst Environ 122(1):3–12

    Article  CAS  Google Scholar 

  • Moreira MM, Lemercier B, Dupas R, Michot D, Gascuel C (2017) High-resolution mapping of soil phosphorus concentration in agricultural landscapes with readily available or detailed survey data. Eur J Soil Sci 68(68):281–294

    Article  Google Scholar 

  • Navarro-Noya YE, Gómez-Acata S, Montoya-Ciriaco N, Rojas-Valdez A, Suárez-Arriaga MC, Valenzuela-Encinas C et al (2013) Relative impacts of tillage, residue management and crop-rotation on soil bacterial communities in a semi-arid agroecosystem. Soil Biol Biochem 65:86–95

    Article  CAS  Google Scholar 

  • Nawar S, Mouazen AM (2017) Predictive performance of mobile vis-near infrared spectroscopy for key soil properties at different geographical scales by using spiking and data mining techniques. CATENA 151:118–129

    Article  CAS  Google Scholar 

  • Peigné J, Cannavaciuolo M, Gautronneau Y, Aveline A, Giteau JL, Cluzeau D (2009) Earthworm populations under different tillage systems in organic farming. Soil Tillage Res 104(2):207–214

    Article  Google Scholar 

  • Ramirez-Lopez L, Behrens T, Schmidt K, Stevens A, Demattê JAM, Scholten T (2013) The spectrum-based learner: a new local approach for modeling soil vis–nir spectra of complex datasets. Geoderma 195(1):268–279

    Article  Google Scholar 

  • Russell RS, Russell EW, Marais PG (2010) Factors affecting the ability of plants to absorb phosphate from soils. J Soil Sci 8(2):248–267

    Article  Google Scholar 

  • Ruth EV, Kumpiene J, Gunneriusson L, Holmgren A (2005) Changes in soil organic matter composition and quantity with distance to a nickel smelter - a case study on the Kola Peninsula NW Russia. Geoderma 127(3):216–226

    Article  Google Scholar 

  • Sharma D, Banerjee S, Pati SK, Jaggi N (2020) Effect of conjugation on the vibrational modes of a carbon nanotube dimer. Spectrochim Acta Part A 246:118985

    Article  Google Scholar 

  • Shi Z, Ji W, Viscarra Rossel RA, Chen S, Zhou Y (2015) Prediction of soil organic matter using a spatially constrained local partial least squares regression and the Chinese vis-NIR spectral library. Eur J Soil Sci 66(4):679–687

    Article  CAS  Google Scholar 

  • Thielebruhn S, Emmerling C, Harbich M, Ludwig M, Vohland M (2016) Using variable selection and wavelets to exploit the full potential of visible-near infrared spectra for predicting soil properties. J near Infrared Spectrosc 24(3):255–269

    Article  Google Scholar 

  • Tsenkova R, Meilina H, Kuroki S, Burns DH (2010) Near infrared spectroscopy using short wavelengths and leave-one-cow-out cross-validation for quantification of somatic cells in milk. J near Infrared Spectrosc 17(6):345–351

    Article  Google Scholar 

  • Vohland M, Ludwig M, Thiele-Bruhn S, Ludwig B (2014) Determination of soil properties with visible to near- and mid-infrared spectroscopy: effects of spectral variable selection. Geoderma 223:88–96

    Article  Google Scholar 

  • Vohland M, Harbich M, Ludwig M, Emmerling C, Thiele-Bruhn S (2016) Quantification of soil variables in a heterogeneous soil region with VIS–NIR–SWIR data using different statistical sampling and modeling strategies. IEEE J Sel Top Appl Earth Obs Remote Sens 9(9):4011–4021

    Article  Google Scholar 

  • Wang X, Zhang F, Ding J, Kung HT, Latif A, Johnson VC (2018) Estimation of soil salt content (SSC) in the Ebinur Lake Wetland National Nature Reserve (ELWNNR), Northwest China, based on a bootstrap-BP neural network model and optimal spectral indices. Sci Total Environ 615:918–930

    Article  CAS  Google Scholar 

  • Wu W, Li AD, He XH, Ma R, Liu HB, Lv JK (2018) A comparison of support vector machines, artificial neural network and classification tree for identifying soil texture classes in Southwest China. Comput Electron Agric 144:86–93

    Article  Google Scholar 

  • Xing Z, Du C, Tian K, Ma F, Shen Y, Zhou J (2016) Application of FTIR-PAS and Raman spectroscopies for the determination of organic matter in farmland soils. Talanta 158:262–269

    Article  CAS  Google Scholar 

  • Xu S, Zhao Y, Wang M, Shi X (2018) Comparison of multivariate methods for estimating selected soil properties from intact soil cores of paddy fields by Vis–NIR spectroscopy. Geoderma 310:29–43

    Article  CAS  Google Scholar 

  • Ye S, Wang D, Min S (2008) Successive projections algorithm combined with uninformative variable elimination for spectral variable selection. Chemom Intell Lab Syst 91(2):194–199

    Article  CAS  Google Scholar 

  • Yu X, Liu Q, Wang Y, Liu X, Liu X (2016) Evaluation of MLSR and PLSR for estimating soil element contents using visible/near-infrared spectroscopy in apple orchards on the Jiaodong Peninsula. CATENA 137:340–349

    Article  CAS  Google Scholar 

  • Zornoza R, Guerrero C, Mataix-Solera J, Scow KM, Arcenegui V, Mataix-Beneyto J (2008) Near infrared spectroscopy for determination of various physical, chemical and biochemical properties in Mediterranean soils. Soil Biol Biochem 40(7):1923–1930

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We thank the anonymous reviewers and editor for their helpful comments.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 51979221), Xi’an University of Technology Doctoral Dissertation Innovation Fund (Grant No. 310-252072018), National Key Research and Development Program of China (Grant No. 2016YFC0401409), and Shaanxi Provincial Land Engineering Construction Group Project (Grant No. DJNY2021-15).

Author information

Authors and Affiliations

Authors

Contributions

We thank Zhenyu Dong, Ni Wang, and Jinbao Liu for contributing the soil samples and spectroscopic measurements, formal analysis, writing—original draft preparation, and writing—review and editing; Jichang Han and Jiancang Xie for their help with the data curation, article writing; Ni Wang, Jinbao Liu and Jiancang Xie for project administration. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Jinbao Liu.

Additional information

Responsible editor: Xiuping Jia

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dong, Z., Wang, N., Liu, J. et al. Combination of machine learning and VIRS for predicting soil organic matter. J Soils Sediments 21, 2578–2588 (2021). https://doi.org/10.1007/s11368-021-02977-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11368-021-02977-0

Keywords

Navigation