Keywords

1 Introduction

High temporal resolution time series (TS) vegetation indices (VIs) have been well used in the research of global environment with respect to vegetation phenology. Large scale environmental parameters, such as global land use/cover change, could be derived with the knowledge of vegetation phenology. However, the high temporal resolution time series VIs often have coarse spatial resolution, which often make the time series VIs noisy and fluctuating due to the complex atmospheric condition within one coarse pixel (such as undetected cloud) and varying sun-sensor-surface geometries (Duggin 1985). Therefore, a number of noise-reduction functions, including maximum value composite (Huete et al. 2002), Fourier transform (Jentsch and Subba Rao 2014), wavelet transform (Sakamoto et al. 2010), Gaussian transform (Hird and McDermid 2009), etc., were proposed or utilized to solve this problem.

As for crop mapping, some studies (Sakamoto et al. 2010; Brown et al. 2013; Zhang et al. 2014) have used TS fitting function before mapping, while many others have not (Chang et al. 2007; Wardlow and Egbert 2008; Gusso et al. 2014; de Souza et al. 2015). Some of those which have used time series smoothing functions often conducted more than one smoothing function to the time series images. For example, researchers used MODIS Level 3 VI products always conducted smoothing function upon the already composited VIs by maximum value composite (MVC) (Huete et al. 2002). .

A scanty few studies have been conducted regarding the benefit of noise reduction for vegetation phenology (Hird and McDermid 2009), or regarding empirical information of noise of the smoothed images themselves, such as Atzberger and Eilers (2011), who have compared the effectiveness of four fitting methods. Fewer studies have investigated the necessity of fitting TS VIs for crop classification. Questions are raised that whether smoothing functions are necessary for crop type mapping. This goals of this paper are: (1) to study the effectiveness of four different time series fitting methods, and then with the best fitting results, (2) to compare the classification accuracy.

2 Materials and Methodology

2.1 Materials

MODIS 16 days composite NDVI of the year 2013 were used for time series analysis. The time series started at 2013001 as band 1, and ends at 2013353 as band 23, with an interval of 16 days. The value in band 1 represents the max NDVI from the date Jan. 1st to the date Jan. 16th, 2013, and so on. The MODIS NDVI 16 days composite NDVI named “MOD13Q1-Level 3 16-Day Vegetation Indices-250 m”, is one of the MODIS 16-Day Tiled Land Products (collection 5). The data was downloaded from the U.S. National Aeronautics and Space Administration (NASA). The “MOD13Q1-Level 3 16-Day Vegetation Indices-250 m” included both NDVI and enhanced VI, EVI. The NDVI was extracted for research as it was a normalized value. MODIS data are distributed from NASA on a near real-time basis at no cost. It is available at two satellite platforms, Terra and Aqua, which were launched in December 1999 and May 2002, respectively. The spatial resolutions of the MODIS-250 m (MOD13Q1, MYD13Q1) on the sinusoidal projections are 231.7 m. The downloaded tiles were mosaicked, re-projected to Albers projection and resampled to 250 m.

Crop data layer (CDL) was used for training and accuracy check. CDL is a specific crop type classification data with resampled resolution of 30 m, and is produced and distributed in the name of NASS (national agricultural statistics service), USDA. The data is available through the web of USDA.

One U.S. state, the Ohio, was taken as case study to classify corn and soybeans from others, as in large area mapping, the large area was often divided into small regions. Corn and soybeans are the dominant crop in Ohio, where the yield of corn takes up 4 % of the whole states total, the yield of soybeans takes up 7 %. Other parts of the state are mainly forest and shrub lands. The spatial distribution of corn and soybeans from CDL 2013 was shown in Fig. 1.

Fig. 1.
figure 1

Corn and soybeans distribution in 30 m resolution crop data layer (CDL) data in 2013

2.2 Time Series Smoothing Function and Result Check

Four popular (Hird and McDermid 2009; Zhang and Ren 2014) smoothing functions were conducted, including Harmonic analysis of TS (HANTS) (Zhou et al. 2015), S-G filtering function short for adaptive Savitzky-Golay filtering (Chen et al. 2004), asymmetric Gaussians (referred to as Gauss herein after) (Jönsson and Eklundh 2002) and double logistic functions (referred to as Logistics herein after) (Beck et al. 2006). The basic idea of HANTS is to calculate a Fourier series to model a TS of pixel-wise data, while meanwhile recognizing abnormal data value relative to the TS of Fourier model. HANTS deletes abnormal data value and substitutes it with the value given by the model. It is developed with the interactive data language (IDL). The other three smoothing algorithms are embedded in the free software TIMESAT (Jönsson and Eklundh 2004).

Visual and statistical check were conducted for the comparison of the four smoothing functions. Visual check was done first by the screenshot of the original images and all the smoothed images, and then by compared the time series NDVI profiles of two pixels, one of which was the corn pixel, and the other was soybeans.

Statistical check included the comparison of correlation coefficient R of one single band image (Band 14) and 20 sample points, local SNR and RMSE in Band 14, as Band 14 represents the NDVI value in late June and early July, when the NDVI corn and soybeans were near the max value (Wardlow and Egbert 2008).

Correlation coefficient R (whole image and sample point) was calculated with Eq. 1

$$ {\text{R}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} (xi - \bar{X})(yi - \bar{Y})}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} (xi - \bar{X})^{2} (yi - \bar{Y})^{2} )} }} $$
(1)

Where, xi, yi represent the pixel value of the whole non-smoothed image and the smoothed image, and \( \bar{X} \) and \( \bar{Y} \) are mean value of the whole non-smoothed image and the smoothed image in Band 14, respectively. For sample points, xi, yi represent the pixel value of samples from the non-smoothed image and the smoothed image in Band 14, and \( \bar{X} \) and \( \bar{Y} \) are the mean value of samples from the non-smoothed image and the smoothed image in Band 14 respectively.

As global SNR requires a homogeneous underlying surface, which is always not easy to find, the local SNR was used, which calculated local SNR using the mode of several grids. The local SNR is also used for the determination of HJ-images (Zhu et al. 2010). Sixteen fishnet grid with edge of 10 km was used to calculate SNR, and the mode, mean, min and max values were given. Root mean square error (RMSE), local signal noise ratio SNR (Zhu et al. 2010) of 16 fishnet grid of the whole time series images were calculated through the following equations (Eq. 2 to Eq. 6).

RMSE was calculated with Eq. 2

$$ {\text{RMSE }} = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{N} (xi - yi)^{2} }}{N}} , $$
(2)

Local SNR is calculated by Eq. 3.

$$ {\text{SNR }} = \bar{X}/\upsigma , $$
(3)

where, \( \bar{X} \) is the mean value of the grid, and \( \upsigma \) is the standard error, which are calculated by Eqs. 4 and 5.

$$ \bar{X} = \frac{{\mathop \sum \nolimits_{i = 1}^{N} xi}}{N}, $$
(4)
$$ \upsigma = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{N} (xi - \bar{X})^{2} }}{N - 1}} , $$
(5)

where, xi represents the pixel value of i in the analyzing grid, and N is the total pixel numbers in the grid.

For the calculated R, RMSE, and SNR, the criteria for effectiveness is higher R, lower RMSE and lower SNR.

2.3 Crop Mapping and Result Check

Spectral Angle Mapping (SAM) in the commercial software ENVITM was used for classification. SAM “uses an n dimension angle to identify pixels to reference spectra” (Kruse et al. 1993). The spectral angle is reversed by Eq. 6.

$$ \cos \theta = \frac{XY}{\left| X \right||Y|} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} xiyi}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} xixi} \sqrt {\mathop \sum \nolimits_{i = 1}^{n} yiyi} }}\quad\uptheta \in (0,\frac{\pi }{2}), $$
(6)

where, n is the total band number. It is 23 in this study, xi and yi are the data value (usually reflectance) of each time series images, and \( \theta \) is the spectral angle which the mapper would be based on. The algorithm determines the spectral likeness between two spectrum profiles by figuring out the angle between the spectrum profiles in a space with dimensionality equaling to the number of bands. The SAM method is relatively less sensitive to illumination and albedo effects than other methods such as maximum likelihood.

End-member spectra (training data spectra) used in SAM mapping procedure was collected with the aid of CDL. The CDL was resampled to a resolution of 250*250 m, the same as the MODIS NDVI data, with each 250 * 250 m pixel assigned a value of percent of corn or soybeans. The pure corn and soybeans were extracted for random sampling. Ten percent of Random samples (points) of corn and soybeans were extracted from the pure corn or soybeans layer, respectively. One third of these random samples were used for training of SAM, which was done by performing a MOD function, going by like MOD(ID, 3) equaling to 0, and the left were used for accuracy check. The tolerance angle for SAM was set as 0.5 rad.

For the accuracy check of mapping result, three groups were compared. Overall, producer and user accuracy were all derived for comparison. The first group classified all the 23 original and S-G fitted time series images, the second group classified the 12 images from Band 7-18, and the third group classified 7 images including band 9, and Band12-17. Percent and pixel accuracy of corn and soybeans were given, as well as the overall accuracy. The mapping results with best overall accuracy was compared by ratio to the acreage estimated by CDL and acreage from national statistics released by NASS, USDA.

3 Results and Discussion

3.1 Time Series Fitting Results

3.1.1 Visual Check

The screen shots of one sub region of the original image and four smoothed images in Band 14 were shown in Fig. 2. The result of HANTS was different from others to the most extent, which has brought more noise to the original image.

Fig. 2.
figure 2

Screenshot of one random pixel in Band 14 before and after fitting

Time series NDVI profiles of one corn pixel and one soybeans pixel from the original images and four smoothed images were shown in Figs. 3 and 4, which indicated that HANTS tended to change the max value position, and obscured the start and end of season date. S-G has well kept the original curve.

Fig. 3.
figure 3

NDVI time series profiles of corn pixel before and after smoothing

Fig. 4.
figure 4

NDVI time series profiles of soybeans pixel before and after smoothing

3.1.2 Correlation Coefficient, RMSE and SNR

Correlation coefficient R of the whole image and the 20 sample points in Band 14 between the smoothed and the non-smoothed images were given in Table 1.

Table 1. Correlation coefficient between each smoothed and the non-smoothed images

The correlation coefficient R showed that the S-G smoothing functions got the best results both for the whole image and for sample points among the four smoothing functions conducted in this study.

Correlation analysis between smoothed data and the original data for the 16 fishnet grids also showed that S-G smoothing function presented higher median and mean correlation coefficient than Logistics, Gauss and HANTS as shown in Table 2:

Table 2. Correlation coefficient of each time series smoothing function of 16 fishnet grids

S-G smoothing function presented the lowest RMSE than Logistics, Gauss and HANTS as shown in Table 3:

Table 3. RMSE of each time series smoothing function of 16 fishnet grid

Local SNR analysis showed that the original time series had the highest SNR either in mode, median or in mean value. S-G smoothing function presented lower SNR than HANTS smoothing function, but higher SNR than Logistics and Gauss, as shown in Table 4:

Table 4. Local SNR of the original and fitted time series images

3.2 Classification Results

3.2.1 Visual Check

Corn and soybeans distribution mapped from time series Band 7 to Band 18 of the original images and S-G smoothed images were shown in Figs. 5 and 6. The coarse resolution of MODIS Vis has made the mapping results less specific compared to the CDL distribution of corn and soybeans as shown in Fig. 1. However, the trend of distribution were generally right where there was corn or soybeans. The red ellipse circle on the map showed slightly different details between the results from the original images and S-G smoothed images.

Fig. 5.
figure 5

Spectral angle mapper result from the original time series images (Color figure online)

Fig. 6.
figure 6

Spectral angle mapper result from the S-G algorithm smoothed time series images (Color figure online)

3.2.2 Mapping Accuracy

Overall accuracy, producer and user accuracy of classification were compared in three groups, the first of which classified all 23 original and S-G fitted images, the second classified the 12 images from Band 7-18, and the third group classified 7 images including band 9, and Band 12-17. Percent and pixel accuracy of corn and soybeans were given, as well as the overall accuracy, as shown in Table 5:

Table 5. Overall, producer, and user accuracy of classification of three group

The ratio of mapped acreage from MODIS time series NDVI images to the acreage of CDL and to the statistics reported by ASD released June 28, 2013, by NASS, USDA were given in Table 6.

Table 6. Mapped MODIS result acreage compared to pixel counted from CDL and statistical acreage released June 28, 2013, by NASS, USDA

The mapped acreage of corn was very near to the acreage of both CDL and state statistics released in June 28, 2013, by NASS, USDA.

4 Discussion

4.1 Time Series Fitting Effectiveness

Visual checking showed that HANTS altered the original data value massively, and thus has shown higher SNR than S-G, Gauss and Logistics. Visual checking by time series profiles showed that S-G, Gauss, and Logistics were similar in the growing season, but differed from each other in other part of the year. Generally, S-G smoothing function got the best results, especially with NDVI peak value neither obviously lagging nor postponing.

Statistical check showed that S-G smoothing algorithms had the highest correlation coefficient either for the whole image or for sample points. S-G smoothing algorithms also presented the lowest RMSE and relatively lower SNR. With these results, S-G was thought to be the best and stably effective smoothing function for crop mapping.

4.2 Crop Mapping Accuracy

With the same methods, the same input data bands, and the same training data, the overall accuracy of the original time series was generally better than the S-G smoothed time series, but the accuracy was not obviously high, no higher than 5 %. The ratio of mapped acreage to CDL pixel counted acreage was near 100 % for corn, and near 80 % for soybeans, and so was the ratio of mapped acreage to national statistical acreage. Besides, the detail in the distribution map also led to different conclusion compared to the accuracy report. Therefore, it was concluded here that the smoothing functions, the mapping method, the training data and the accuracy check method all had effect on the comparison. It is the goal the matters for the choosing of mapping method.

4.3 Limitations

Both time series fitting method and the selection of input bands have effect on the mapping result. This paper only investigated the result of spectral angle mapper mapping method. Other method may got different results as well.

5 Conclusion

This paper compared the effectiveness of four time series smoothing algorithms through a series of statistical indicators, and used the most effective smoothed result to conduct crop mapping. S-G smoothing was found to be the most effective fitting method. However, the most effective smoothed results did not provide the most accurate mapping results with the same method, the same training data and the same checking data. This findings imply that both mapping methods and the goals or purposes the user require matter more for a better result of crop mapping. It was not necessary to conduct a time series smoothing before a better mapping scheme was founded.