1 Introduction

The chlorophylls are the most important pigment for photosynthesis (Gitelson et al. 1999; Singhal et al. 2019), which provides energy to the biosphere (Qi et al. 2021). Since the symptoms of plant response to stress and N deficiencies are often associated with plant chlorophyll content (Houles et al. 2007), timely determination of plant chlorophyll content can be important indicators of plant nutrient status and environmental effects, which is very useful for agricultural field management (Argenta et al. 2004).

Conventionally, the determination of chlorophyll content involves outdoor sampling and indoor chemical analysis. These methods are not only time-consuming and laborious, but also destructive and lagging. As a non-destructive technique, the portable chlorophyll meters (SPAD-502) have been widely used to estimate chlorophyll content in agricultural studies (Markwell et al. 1995; Martínez et al. 2004). However, the SPAD meter is a leaf-based measurement, and infeasible for large-scale regional monitoring (Uddling et al. 2007). Therefore, there is a need to develop the techniques to monitor plant chlorophyll content in real time on a regional scale.

In recent years, remote sensing technology allows to monitor plant chlorophyll content on a large regional scale (Yu et al. 2014). UAV-based multispectral imagery has been widely used to monitor the nutrient status in crops (Théau et al. 2020), e.g., chlorophyll content and N (Clevers et al. 2017; Verrelst et al. 2012). UAV image acquisition is not only fast, less consuming and flexible (Bendig et al. 2013), but also with higher spatial and temporal resolution than the satellite and aerial images to facilitate further image analysis (Bareth et al. 2015). Although satellite and aerial images are able to assess crop nutrient status on a large scale, such images possess the low spatial and temporal resolution. Rafael et al. (2014) have shown that there was difficulty of differentiating the content of chlorophyll in forest using multispectral satellite imagery. Over the past decades, however, the use of UAV images has rapidly narrowed the gap between satellite or aerial and ground-based sensing (Prado Osco et al. 2019).

However, the combination of remote sensing techniques with multispectral imagery requires more reliable assessment techniques due to the huge amount of data generated. Compared to individual spectral bands, spectral indices can be applied at different scales and mitigate to a certain extent the adverse effects due to anisotropic reflection, background shadows and soil brightness contributions (Inoue et al. 2012; Kooistra and Clevers 2016). For example, Qi et al. (2021) found NDVI and GNDVI had a much higher degree of fit and precision than the other indices. Tahir et al. (2018) found MSAVI2 and TNDVI were proved to be more robust indices to estimate the chlorophyll content in the orchard with the highest coefficients of determination (R2) 0.89 and 0.85, respectively. Although spectral indices can improve the robustness in predicting chlorophyll content to some degree, the saturation effect and multi-collinearity are still a problem for remotely estimating chlorophyll content in crops (Cao et al. 2017; Li et al. 2010).

Recently, machine learning algorithms have been induced in remote sensing (Wu et al. 2019; Lu et al. 2021). Support vector machine (SVM), random forests (RF), partial least squares regression (PLSR), ridge regularization (RR) are powerful tools to assist in UAV image computation. These algorithms performed well in current applications predicting plant conditions, such as N content (Colorado et al. 2020), chlorophyll content (Qi et al. 2021), and biomass (Vilijanen et al. 2018). Using spectral indices as input variables is a new approach, since these models ensure good performance in spite of only a few variables as input feature (Moghimi et al. 2020). Mutanga et al. (2012). found that predicting chlorophyll content using the RF algorithm and three NDVIs calculated from the red-edge and near-infrared bands produced low prediction errors. Shah et al. (2019) used the established spectral indices as an input variable combined with the RF algorithm to improve the accuracy of estimating chlorophyll content in wheat. However, most studies have only compared a single machine learning algorithm combined with spectral indices to a linear regression model and did not compare the differences between different machine learning algorithms. Therefore, the main objectives of this study were: (1) to evaluate the performances of published spectral indices in estimating chlorophyll content of potato plants, (2) to compare the performance of SVR, RF, PLSR and RR models in predicting potato chlorophyll content, (3) to generate chlorophyll content maps based on the optimal models, and (4) to verify whether the UAV images were able to detect the variation of potato chlorophyll content at different growth stages.

2 Materials and Methods

2.1 Experimental Sites

Experiments were carried out in Zhuozi Country, which is located in the middle of Inner Mongolia (extending from 110° 51′ E, 40° 38′ N to 112° 56′ E, 41 °16′ N), China (see Fig. 1). The major crops are potato, corn, and beans. This area is characterized with a middle temperate arid and semi-arid continental monsoon, cold winters and cool summers. The annual average precipitation is 544.5 mm, i.e., 90–95% between April and October. The average temperature is > 20 °C during the potato growing season.

Fig. 1
figure 1

Geographical location of the experimental site (a) in 2020 and site (b) in 2021

2.2 Experimental Design

Two experiments involving different potato cultivars were conducted at two different sites of Zhuozi county from June to September in 2020 and 2021, respectively. Experiment 1 in 2020 was a randomized complete block design with four replications. The potato cultivar was Yingniweite. There were five N treatments: four optimization of N application management, and conventional N rate fertilization (Con). The plot size was 10 × 10 m. There were six N treatments with four replications for Experiment 2 in 2021. The potato cultivar was Mingfeng 16. Five N treatments were same as those in experiment 1, but control (no N was applied) was added. The plot size was 10 × 12 m. For both experiments, the irrigation method of the study site was drip irrigation, and the field management was unified according to the local large field requirements.

2.3 Field Data Acquisition

A Minolta brand chlorophyll meter (Model SPAD-502; Spectrum Technologies Inc. Plainfield, IL) was used to measure the chlorophyll content of potato functional leaves (the 4th compound leaf from apex) before the UAV flight was carried out. Fifteen sampling points were randomly selected from each treatment. Each measured point represented the growth of a potato plant. The average of chlorophyll content of each plot can be precisely acquired by averaging the fifteen samples of data.

2.4 Acquisition of UAV Images

During the experimental period, the multispectral images were taken by the Parrot Bluegrass drone with Parrot Sequoia multispectral camera on 20 July, 3 August, and 20 August in 2020, whereas the images were acquired on 15 July, 6 August, and 25 August in 2021. UAV flight was conducted between 10:00 a.m. and 2:00 p.m. at 30 m high, resulting in a 10 cm ground sample distance (GSD). Multispectral images were taken from the following spectral regions: green (530–570 nm), red (640–680 nm), red-edge (730–740 nm), and near-infrared (770–810 nm). Because, during the flight, UAV images were affected by partly cloudy and the changing illumination in light intensity, a radiometric calibration white board and an onboard irradiance sensor were integrated in the multispectral camera to correct images for dash area and illumination differences the multispectral camera with (Kopačková-Strnadová et al. 2021; Franzini et al. 2019). Additional sensors such as GPS and laser sensor are also mounted in the UAV (see Fig. 2). The properties of the UAV camera are presented in Table 1.

Fig. 2
figure 2

Unmanned aerial vehicle (UAV)-based robotic system for chlorophyll content estimation in potato

Table 1 Specifications of parrot sequoia camera and flight conditions

2.5 UAV Image Processing

Figure 2 illustrates the flow chart of this study. Pix4Dmapper Version 4.5.6 was used to import all images taken in the same period according to the location coordinates in its properties, and to align the overlapping images using the feature point matching algorithm. Before the UAV flight, 4 ground control points were evenly arranged in the field to obtain accurate geographical reference positions. The plane precision of ground control points system is ± (8 + 1 × 10–6 × D) mm, and the elevation accuracy is ± (15 + 1 × 10–6 × D) mm. The area marked by ground control points is 40 cm × 40 cm. The UAV-based images were first aligned using Pix4Dmapper, and the camera parameters were estimated from the coordinates of their photographs and GCPs. A depth sharpening filter to further improve images edge detail was selected, and then dense point clouds were built. Finally, meshes and textures based on the camera’s own parameters were created and the processed images were exported as TIFF images. ENVI 5.1 was used to mark the region of interest (ROI) (see Fig. 1) in the experimental plot and to extract the average reflectance values in the four bands of ROI for each images.

2.6 Spectral Indices

Four canopy reflectance values (Green, Red, Red edge, Near-infrared) were then used to computer 12 spectral indices that were evaluated with their correlations with chlorophyll content (see Table 2). The indices were enhanced by the contribution of vegetation optical properties according to the spectral response of the canopy by combining several bands. And thus, spectral indices were able to reduce the effects of disturbing factors, such as soil background and atmosphere, particularly at low canopy coverage.

Table 2 Spectral indices used in the study

The relationship between 12 spectral indices and the chlorophyll content was calculated by Pearson correlation analysis. The correlation coefficient (r) values were used to evaluate the correlations between indices and chlorophyll content. Their performance was further evaluated by statistical comparison of the returned individual spectral indices correlation.

2.7 Machine Learning Algorithms

2.7.1 Partial Least Squares Regression and Ridge Regression (PLSR and RR)

Before the estimation model was constructed, Pearson correlation analyses between chlorophyll content measurements and 12 spectral indices were conducted. The indices with higher correlation coefficients (|r|> 0.4, and p values less than 0.01) were selected for machine learning algorithms. Compared to simple linear regression, PLSR and RR models are able to improve model accuracy and simplify the complexity of the model (Prado Osco et al. 2019). For the PLSR and RR methods, the grid-search method was used to find the optimal number of factors and regularization factor (α), and found that the optimal number of factors was 2 at tuber formation, tuber bulking and starch accumulation, 9 at the combined stages and (α) were 10 in all growth stages.

2.7.2 Support Vector Regression (SVR)

The support vector machine (SVM) was used to avoid having the problems of classification and regression (Wang et al. 2020). Since the loss function gamma (g), the error penalty factor (C), and the choice of kernel function affect the performance of the SVM, the “Support Vactor Regression” function in “sklearn” package was sued to implement the SVR algorithms in the Python 3.10 environment in this study. The commonly used radial basis kernel function (RBF) was applied in this study. Finally, a grid search was used to retrieve different combinations of g and C parameters. The best combination of g and C was 0.001 and 10.001 at tuber formation and tuber bulking, and 0.001 and 20.001 at starch accumulation and the combined stages for our data set, respectively.

2.7.3 Random Forest Regression Algorithm (RF)

The RF is an ensemble learning, that allows high accuracy and generalization performance from the overall model (Shi et al. 2020). Furthermore, the RF reduces overfitting by repeatedly putting back samples. In addition, RF takes into account for the influence of input variables (name: mtry) to attenuate the autocorrelation between variables effectively (Peng et al. 2021). The importance of each variable is expressed by the mean squared error (MSE) at the time of model calculation (Tan et al. 2019). Subsequently, the predictors are ranked according to the strength of the relationship between the input and response variables.

Spectral indices extracted from UAV images were used as input of variables for RF regression. To determine the optimal number of trees (ntree) for different stages of potato ntree values were tested from 100 to 400 with an increment of 100, and the value of 200 at tuber formation, 100 at tuber bulking, 400 at starch accumulation, and 400 at the combined stages were selected due to stable lower RMSE. The number of variables (mtry) was set to 4 at tuber formation, 9 at tuber bulking, 4 at starch accumulation, and 1 at the combined stages, as it yielded lower RMSE. In this study, the RF algorithms were implemented in the Python 3.10 environment, and the ‘Random Forest Regression” function in “sklearn” package was used. Other parameters were set as default values.

2.8 Data Analysis

In this study, there were 132 sample points. Around 70% of them were selected to train the four machine learning algorithms, and 30% data were for validation. The performances of the different models of machine learning were evaluated by comparing the coefficients of determination (R2), root square error (RMSE) and mean square error (MSE) in predictions. The higher the R2 and the lower the RMSE and MSE, the better the precision and accuracy of the models. The R2, RMSE and MSE were calculated according to Eqs. (1, 2, 3):

$$\begin{array}{c}{R}^{2}=\sum {\left({\widehat{y}}_{i}-\overline{y}\right)}^{2}/\sum {\left({y}_{i}-\overline{y}\right)}^{2}\end{array}$$
(1)
$$\begin{array}{c}RMSE=\sqrt{\frac{1}{n}\sum\limits_{i=1}^{n}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}}\end{array}$$
(2)
$$\begin{array}{c}MSE=\frac{1}{n}\sum\limits_{i=1}^{n}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}\end{array}$$
(3)

where \({y}_{i}\), \({\widehat{y}}_{i}\), and \(\overline{y}\) are the observed, predicted, and observed mean values, respectively, and n is the sample size.

3 Results

3.1 Variations in Plant Chlorophyll Content in Different N Treatments and Growth Stages

The results in this study showed that the chlorophyll content increased from tuber formation to starch accumulation (see Fig. 3). The chlorophyll content in the training data set ranged from 38.5 to 53.9 with a mean value of 45 and a CV value of 8.8%, while this in the testing data set ranged from 39.5 to 52.7 with a mean value of 45.5 and a CV value of 8.8%. The training and testing data sets showed a similar statistical distribution, indicating that there were no potentially biased estimations for model calibration and validation.

Fig. 3
figure 3

Variation of SPAD value of potato at tuber formation, tuber bulking and starch accumulation stages of tuber formation (T1), tuber bulking (T2) and starch accumulation (T3)

3.2 Relationships Between Spectral Indices and Chlorophyll Content of Potato Plants

The results showed that the most selected spectral indices were significantly correlated with the plant chlorophyll content regardless of growth stage (see Table 2 and Fig. 4). However, the results showed that there was the significant effect of the growth stages on the performances of the selected spectral indices (see Fig. 4). This study also found that the best spectral indices were GNDVI and CIgreen, especially at the growth stages of tuber formation and tuber bulking. At the starch accumulation and the combination of three stages, Ctr2 and CIred-edge showed significant correlations with potato chlorophyll content, suggesting that chlorophyll content can be effectively estimated using these spectral indices.

Fig. 4
figure 4figure 4

Correlation analysis of spectral indices with SPAD value at different stages

3.3 Estimation of Potato Chlorophyll Content Using Machine Learning Algorithms

In this study, the chlorophyll content was predicted from highly correlated spectral indices at different growth stages using four machine learning algorithms (SVR, RF, PLSR, and RR) (see Fig. 5). The results showed that RF performed the highest accuracy in estimating chlorophyll content with R2 ranging from 0.96 to 0.97, RMSE ranging from 0.62 to 0.81, and MSE ranging from 0.38 to 0.65 for all growth stages. Except for the PLSR, all other machine learning algorithms showed the highest estimation accuracy at the starch accumulation stage and followed by the tuber formation stage.

Fig. 5
figure 5

Relationships between the predicted value and observed value of training data set using four models at tuber formation stage (T1), tuber bulking stage (T2), starch accumulation stage (T3), and the combination of growth stages (T1 + T2 + T3)

The results in Fig. 6 showed that RF was the optimal model for estimating potato chlorophyll content with R2 of 0.61–0.83 that was greater than other models regardless of growth stage, whereas RMSE (1.75–2.31) and MSE (3.06–5.37) were smaller than other machine learning algorithms. Compared to the RF, other machine learning algorithms showed the least performance, especially PLSR and RR algorithms due to the growth stage effect.

Fig. 6
figure 6

Validation of the performance of SVR (a), RF (b), PLSR (c), and RR (d) at tuber formation stage, tuber bulking stage, starch accumulation stage and the combination of growth stages using the testing data set

3.4 Potato Chlorophyll Content Predictive Map

The RF model was used to map the predicted chlorophyll content of plants based on UAV multispectral images at the different growth stages (see Fig. 7). Figure 7 shows that the estimated value of chlorophyll content was relatively high in the tuber formation stage, whereas this gradually decreased with increasing plant growth. During the tuber bulking stage, the number of pixels of medium chlorophyll content (see Fig. 7b, e) increased due to the rapid growth over potato tuber stage. During late reproduction, the nutrient delivery to the crop is at its highest level, leading to the decrease in number of pixels due to low plant chlorophyll content in Fig. 7c, f. Meanwhile, chlorophyll content is significantly affected by the amount of N applied, and the use of UAV images mapping allows for real-time monitoring of the N status of the potato plant so that nutrient management of the crop can be adjusted.

Fig. 7
figure 7

Chlorophyll content map as indicated in SPAD values were derived by applying the RF model to the 8 spectral indices images for tuber formation (T1), tuber bulking (T2), and starch accumulation (T3)

4 Discussion

4.1 Feasibility of Using UAV to Estimate Potato Chlorophyll Content

The results in this study demonstrated that some spectral indices calculated from multispectral images were highly correlated with potato chlorophyll content, suggesting chlorophyll content of plants can be estimated by UAV multispectral image. Due to the different prediction performance of spectral indices in different growth stages, however, it was difficult to establish general models between different spectral indices and chlorophyll content, and the accuracy of simple linear regression models was still poor (Lee et al. 2020). Therefore, machine learning algorithms are promising as they can combine various spectral indices to improve the estimation accuracy (Qiu et al. 2021). Numerous studies have already used RF, SVR, PLSR, and RR algorithms for estimations of chlorophyll content (Tahir et al. 2018). However, it is still unclear whether these algorithms are most suitable to fulfill the requirement (Li et al. 2020). Therefore, this study compared four algorithms for precision and stability. The RF algorithm had higher R2 and lower RMSE and MSE than others algorithms for chlorophyll content estimates in different growth stages, indicating that RF algorithm can provide accurate chlorophyll content estimations. Our study also found that, since the model was trained using data set only from 2021, resulting in poor models validation (see Fig. 8), there is a need to have a large amount of experimental data for training the models in future studies for the model stability. Recently, there were many research showed that using UAVs combined the machine learning algorithms could be monitored of important crop growth parameters, such as biomass, N, and leaf area index (Prado Osco et al. 2019; Abdelbaki et al. 2021; Zhou et al. 2016), which further confirms the feasibility our method and results. Compared with conventional wet chemical method, our method demonstrated the advantages of convenience and speed, and compared with the chlorophyll meters (SPAD-502), using UAVs could achieve over a large area. Therefore, it was potential to estimate potato chlorophyll content using spectral indices calculated from UAV multispectral images.

Fig. 8
figure 8

Validation of the performance of RR, PLSR, SVR, and RF models at tuber formation stages (T1), tuber bulking stages (T2), starch accumulation (T3), and the combined stages (T1 + T2 + T3) using 2020 site data set

4.2 Analysis of Sensitive Bands

Selection of the appropriate spectral bands from all waveband to construct the spectral indices plays an important role in improving the model accuracy. At the tuber formation and the tuber bulking stages, the green wavelength is closely related to the potato chlorophyll content, which is in agreement with previous study of the relationship between the spectral reflectance in the green region and plant chlorophyll content (Zhao et al. 2018). Hensen and Schjoerring (2003) explained that chlorophyll a and b were more reflective in the green region, the green light region was often used to monitor chlorophyll content. However, this study suggests that the red-edge (680–800 nm) regions are important to potato chlorophyll content estimation at starch accumulation and the combination growth stages as well. Similar to the findings of Zheng et al. (2018b), the red-edge was a sensitive region for potato chlorophyll content. Chang-Hua et al. (2010) constructed a new red-edge spectral indices (RES) to estimate the chlorophyll content of sugar beet with high accuracy. Therefore, the influence of growth stages on the selection of sensitive bands should be taken into account when spectral indices are applied. In future studies, the optimization of the selected spectral indices should be further investigated at different growth stages to improve the accuracy of estimation of chlorophyll content in potato crops.

The spectral indices enhance the spectral characteristics sensitive to the biochemical properties of the vegetation by combining several bands and a specific formula format compared to the four bands. The wavebands of spectral indices contain some reference bands that are insensitive to chlorophyll content, and those bands are helpful to increase the signal-to-noise ratio (Yu et al. 2013). The spectral region used by the multispectral UAV is consistent with some satellite multispectral bands, such as the Sentinel-2. Therefore, the estimation framework constructed in this study can also be applied to satellite images for large-scale crop chlorophyll content estimation (see Fig. 2).

4.3 The Effect of the Number of Spectral Indices on the Model

Previous studies have also shown that predicting chlorophyll content was affected by the use of variable importance plot in RF that eliminates spectral variable inputs. The contribution of each spectral index to the RF model indicates the extent to which each of the spectral indices could predict the chlorophyll content variations (see Table 2 and Fig. 9). However, there is a need to analyze the performance of the model for the reduction in the number of input variables and increase computational efficiency. By implementing the best 1 to 3 spectral indices into the RF model, only a slight decrease was observed in its performance (see Fig. 10). Similarly, Prado Osco et al. (2019) used the variable importance graph in RF model to eliminate spectral variable inputs to reduce the amount of spectral data computation and keep the model with better performance.

Fig. 9
figure 9

Importance scores for spectral indices in the best RF model for predicting potato SPAD value at tuber formation stage (T1), tuber bulking stage (T2), starch accumulation stage (T3), and combined growth stages (T1 + T2 + T3)

Fig. 10
figure 10

Optimizing the number of predictors of spectral indices according to important scores at tuber formation stage (T1), tuber bulking stage (T2), starch accumulation stage (T3) and the combination of growth stages (T1 + T2 + T3) for training data set and testing data set

5 Conclusions

An integrated method combined with spectral indices and random forest model was proposed to estimate chlorophyll content in potato crop. Accuracy and robustness of the RF model were verified for each growth stage, such as tuber formation, tuber bulking, starch accumulation, and the combination of growth stages. The results of comparison of the RF model with SVR, PLSR, and RR showed that the accuracy of RF outperformed at each stage. The correlation analysis demonstrated that most of spectral indices were significantly correlated with the potato chlorophyll content, whereas the spectral indices with the strongest correlation varied at different period. The CIgreen and the green band highly contributed to the estimation model during the tuber formation and tuber bulking stage. However, the CIred edge and the red edge band provided higher contribution to the prediction model during the starch accumulation and the combination of growth stages.