Soil Organic Carbon Prediction Using Vis-NIR Spectroscopy with a Large Dataset
Visible and near-infrared reflectance spectroscopy based soil properties estimation is an alternative to traditional laboratory analysis. The calibration model is a main factor influencing predictive performance. In this study, a large scale soil database, which contains 19,036 soil samples, was compared to its subsets to validate the effect of sample size on predictive performance. Four regression techniques based on linear model, namely, multiple linear regression (MLR), principal components regression (PCR), partial least squares regression (PLSR), and stepwise regression (SR) were compared to identify suitable models to predict the content of organic carbon in soil samples. The impact of derivatives or the raw spectra as predictor variables, and the interval of spectra were also studied. The best predictions were obtained using SR and MLR on raw spectra, yielding root mean square of error of cross validation (RMSECV) and coefficient of determination (R2) values of 25.3912, 25.4254 and 0.9227, 0.9225, indicating excellent models.
KeywordsSoil organic carbon MLR PCR PLSR Stepwise regression
This work was supported by National Science Foundation of China under Grant No. 31671586.
- 15.Montanarella, L., Tóth, G., Jones, A.: Soil component in the 2009 lucas survey. Land quality and land use information in the European Union. JRC, Office for Official Publications of the European Communities, Luxembourg, pp. 209–220 (2011)Google Scholar
- 17.Tóth, G., Jones, A., Montanarella, L.: LUCAS Topsoil Survey: Methodology, Data and Results. Publications Office (2013)Google Scholar
- 23.R Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria (2016). https://www.R-project.org/
- 24.Stevens, A., Ramirez-Lopez, L.: An introduction to the prospectr package (2013), r package version 0.1.3Google Scholar