Is Time Series Smoothing Function Necessary for Crop Mapping? — Evidence from Spectral Angle Mapper After Empirical Analysis

Chen, Ailian; Zhao, Hu; Pei, Zhiyuan

doi:10.1007/978-3-319-48357-3_33

Ailian Chen¹⁷,
Hu Zhao¹⁷ &
Zhiyuan Pei¹⁷

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 478))

Included in the following conference series:

International Conference on Computer and Computing Technologies in Agriculture

1811 Accesses

Abstract

Time series smoothing functions have been frequently applied to fit multi-temporal vegetation index for better extraction of plant seasonal/growing parameters. Questions are raised that whether the smoothing is necessary for crop mapping. Four time series smoothing functions, namely, HANTS, Savitzky-Golay (S-G), double logistics and asymmetric Gaussian, were used to smooth 23 MODIS 16-days composite NDVI images in one year. The effectiveness were compared through visual check, correlation coefficient R, root mean square error (RMSE), and local signal noise ratio (SNR). The best smoothing time series NDVI images, along with the original time series images, were then used to map corn and soybeans by spectral angle mapper (SAM) method and their mapping accuracies were compared. Comparison of smoothing results showed that S-G fitted data got the strongest correlation coefficient R, the lowest RMSE and lower local SNR. Comparison of mapping results further showed that time smoothing function does not improve the classification accuracy obviously with the same training sample and same temporal bands. The whole analysis indicates that it is the mapping method that matters more than time series smoothing function for classification precision.

You have full access to this open access chapter, Download conference paper PDF

Modeling grass yields in Qinghai Province, China, based on MODIS NDVI data—an empirical comparison

Article 17 January 2020

Jianhong Liu, Clement Atzberger, … Lei Wang

Estimation of rice phenology date using integrated HJ-1 CCD and Landsat-8 OLI vegetation indices time-series images

Article 14 October 2015

Jing Wang, Jing-feng Huang, … Xiao-dong Song

Assimilation of Remote Sensing Data into Crop Growth Model for Yield Estimation: A Case Study from India

Article 25 March 2021

Murali Krishna Gumma, M. D. M. Kadiyala, … Anthony M. Whitbread

Keywords

1 Introduction

High temporal resolution time series (TS) vegetation indices (VIs) have been well used in the research of global environment with respect to vegetation phenology. Large scale environmental parameters, such as global land use/cover change, could be derived with the knowledge of vegetation phenology. However, the high temporal resolution time series VIs often have coarse spatial resolution, which often make the time series VIs noisy and fluctuating due to the complex atmospheric condition within one coarse pixel (such as undetected cloud) and varying sun-sensor-surface geometries (Duggin 1985). Therefore, a number of noise-reduction functions, including maximum value composite (Huete et al. 2002), Fourier transform (Jentsch and Subba Rao 2014), wavelet transform (Sakamoto et al. 2010), Gaussian transform (Hird and McDermid 2009), etc., were proposed or utilized to solve this problem.

As for crop mapping, some studies (Sakamoto et al. 2010; Brown et al. 2013; Zhang et al. 2014) have used TS fitting function before mapping, while many others have not (Chang et al. 2007; Wardlow and Egbert 2008; Gusso et al. 2014; de Souza et al. 2015). Some of those which have used time series smoothing functions often conducted more than one smoothing function to the time series images. For example, researchers used MODIS Level 3 VI products always conducted smoothing function upon the already composited VIs by maximum value composite (MVC) (Huete et al. 2002). .

A scanty few studies have been conducted regarding the benefit of noise reduction for vegetation phenology (Hird and McDermid 2009), or regarding empirical information of noise of the smoothed images themselves, such as Atzberger and Eilers (2011), who have compared the effectiveness of four fitting methods. Fewer studies have investigated the necessity of fitting TS VIs for crop classification. Questions are raised that whether smoothing functions are necessary for crop type mapping. This goals of this paper are: (1) to study the effectiveness of four different time series fitting methods, and then with the best fitting results, (2) to compare the classification accuracy.

2 Materials and Methodology

2.1 Materials

MODIS 16 days composite NDVI of the year 2013 were used for time series analysis. The time series started at 2013001 as band 1, and ends at 2013353 as band 23, with an interval of 16 days. The value in band 1 represents the max NDVI from the date Jan. 1st to the date Jan. 16th, 2013, and so on. The MODIS NDVI 16 days composite NDVI named “MOD13Q1-Level 3 16-Day Vegetation Indices-250 m”, is one of the MODIS 16-Day Tiled Land Products (collection 5). The data was downloaded from the U.S. National Aeronautics and Space Administration (NASA). The “MOD13Q1-Level 3 16-Day Vegetation Indices-250 m” included both NDVI and enhanced VI, EVI. The NDVI was extracted for research as it was a normalized value. MODIS data are distributed from NASA on a near real-time basis at no cost. It is available at two satellite platforms, Terra and Aqua, which were launched in December 1999 and May 2002, respectively. The spatial resolutions of the MODIS-250 m (MOD13Q1, MYD13Q1) on the sinusoidal projections are 231.7 m. The downloaded tiles were mosaicked, re-projected to Albers projection and resampled to 250 m.

Crop data layer (CDL) was used for training and accuracy check. CDL is a specific crop type classification data with resampled resolution of 30 m, and is produced and distributed in the name of NASS (national agricultural statistics service), USDA. The data is available through the web of USDA.

One U.S. state, the Ohio, was taken as case study to classify corn and soybeans from others, as in large area mapping, the large area was often divided into small regions. Corn and soybeans are the dominant crop in Ohio, where the yield of corn takes up 4 % of the whole states total, the yield of soybeans takes up 7 %. Other parts of the state are mainly forest and shrub lands. The spatial distribution of corn and soybeans from CDL 2013 was shown in Fig. 1.

2.2 Time Series Smoothing Function and Result Check

Four popular (Hird and McDermid 2009; Zhang and Ren 2014) smoothing functions were conducted, including Harmonic analysis of TS (HANTS) (Zhou et al. 2015), S-G filtering function short for adaptive Savitzky-Golay filtering (Chen et al. 2004), asymmetric Gaussians (referred to as Gauss herein after) (Jönsson and Eklundh 2002) and double logistic functions (referred to as Logistics herein after) (Beck et al. 2006). The basic idea of HANTS is to calculate a Fourier series to model a TS of pixel-wise data, while meanwhile recognizing abnormal data value relative to the TS of Fourier model. HANTS deletes abnormal data value and substitutes it with the value given by the model. It is developed with the interactive data language (IDL). The other three smoothing algorithms are embedded in the free software TIMESAT (Jönsson and Eklundh 2004).

Visual and statistical check were conducted for the comparison of the four smoothing functions. Visual check was done first by the screenshot of the original images and all the smoothed images, and then by compared the time series NDVI profiles of two pixels, one of which was the corn pixel, and the other was soybeans.

Statistical check included the comparison of correlation coefficient R of one single band image (Band 14) and 20 sample points, local SNR and RMSE in Band 14, as Band 14 represents the NDVI value in late June and early July, when the NDVI corn and soybeans were near the max value (Wardlow and Egbert 2008).

Correlation coefficient R (whole image and sample point) was calculated with Eq. 1

$$ {\text{R}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} (xi - \bar{X})(yi - \bar{Y})}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} (xi - \bar{X})^{2} (yi - \bar{Y})^{2} )} }} $$

(1)

Where, xi, yi represent the pixel value of the whole non-smoothed image and the smoothed image, and $ \bar{X} $ and $ \bar{Y} $ are mean value of the whole non-smoothed image and the smoothed image in Band 14, respectively. For sample points, xi, yi represent the pixel value of samples from the non-smoothed image and the smoothed image in Band 14, and $ \bar{X} $ and $ \bar{Y} $ are the mean value of samples from the non-smoothed image and the smoothed image in Band 14 respectively.

As global SNR requires a homogeneous underlying surface, which is always not easy to find, the local SNR was used, which calculated local SNR using the mode of several grids. The local SNR is also used for the determination of HJ-images (Zhu et al. 2010). Sixteen fishnet grid with edge of 10 km was used to calculate SNR, and the mode, mean, min and max values were given. Root mean square error (RMSE), local signal noise ratio SNR (Zhu et al. 2010) of 16 fishnet grid of the whole time series images were calculated through the following equations (Eq. 2 to Eq. 6).

RMSE was calculated with Eq. 2

$$ {\text{RMSE }} = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{N} (xi - yi)^{2} }}{N}} , $$

(2)

Local SNR is calculated by Eq. 3.

$$ {\text{SNR }} = \bar{X}/\upsigma , $$

(3)

where, $ \bar{X} $ is the mean value of the grid, and $ \upsigma $ is the standard error, which are calculated by Eqs. 4 and 5.

$$ \bar{X} = \frac{{\mathop \sum \nolimits_{i = 1}^{N} xi}}{N}, $$

(4)

$$ \upsigma = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{N} (xi - \bar{X})^{2} }}{N - 1}} , $$

(5)

where, xi represents the pixel value of i in the analyzing grid, and N is the total pixel numbers in the grid.

For the calculated R, RMSE, and SNR, the criteria for effectiveness is higher R, lower RMSE and lower SNR.

2.3 Crop Mapping and Result Check

Spectral Angle Mapping (SAM) in the commercial software ENVI^TM was used for classification. SAM “uses an n dimension angle to identify pixels to reference spectra” (Kruse et al. 1993). The spectral angle is reversed by Eq. 6.

$$ \cos \theta = \frac{XY}{\left| X \right||Y|} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} xiyi}}{{\sqrt {\mathop \sum \nolimits_{i = 1}^{n} xixi} \sqrt {\mathop \sum \nolimits_{i = 1}^{n} yiyi} }}\quad\uptheta \in (0,\frac{\pi }{2}), $$

(6)

where, n is the total band number. It is 23 in this study, xi and yi are the data value (usually reflectance) of each time series images, and $ \theta $ is the spectral angle which the mapper would be based on. The algorithm determines the spectral likeness between two spectrum profiles by figuring out the angle between the spectrum profiles in a space with dimensionality equaling to the number of bands. The SAM method is relatively less sensitive to illumination and albedo effects than other methods such as maximum likelihood.

End-member spectra (training data spectra) used in SAM mapping procedure was collected with the aid of CDL. The CDL was resampled to a resolution of 250*250 m, the same as the MODIS NDVI data, with each 250 * 250 m pixel assigned a value of percent of corn or soybeans. The pure corn and soybeans were extracted for random sampling. Ten percent of Random samples (points) of corn and soybeans were extracted from the pure corn or soybeans layer, respectively. One third of these random samples were used for training of SAM, which was done by performing a MOD function, going by like MOD(ID, 3) equaling to 0, and the left were used for accuracy check. The tolerance angle for SAM was set as 0.5 rad.

For the accuracy check of mapping result, three groups were compared. Overall, producer and user accuracy were all derived for comparison. The first group classified all the 23 original and S-G fitted time series images, the second group classified the 12 images from Band 7-18, and the third group classified 7 images including band 9, and Band12-17. Percent and pixel accuracy of corn and soybeans were given, as well as the overall accuracy. The mapping results with best overall accuracy was compared by ratio to the acreage estimated by CDL and acreage from national statistics released by NASS, USDA.

3 Results and Discussion

3.1 Time Series Fitting Results

3.1.1 Visual Check

The screen shots of one sub region of the original image and four smoothed images in Band 14 were shown in Fig. 2. The result of HANTS was different from others to the most extent, which has brought more noise to the original image.

Time series NDVI profiles of one corn pixel and one soybeans pixel from the original images and four smoothed images were shown in Figs. 3 and 4, which indicated that HANTS tended to change the max value position, and obscured the start and end of season date. S-G has well kept the original curve.

3.1.2 Correlation Coefficient, RMSE and SNR

Correlation coefficient R of the whole image and the 20 sample points in Band 14 between the smoothed and the non-smoothed images were given in Table 1.

Table 1. Correlation coefficient between each smoothed and the non-smoothed images

Full size table

The correlation coefficient R showed that the S-G smoothing functions got the best results both for the whole image and for sample points among the four smoothing functions conducted in this study.

Correlation analysis between smoothed data and the original data for the 16 fishnet grids also showed that S-G smoothing function presented higher median and mean correlation coefficient than Logistics, Gauss and HANTS as shown in Table 2:

Table 2. Correlation coefficient of each time series smoothing function of 16 fishnet grids

Full size table

S-G smoothing function presented the lowest RMSE than Logistics, Gauss and HANTS as shown in Table 3:

Table 3. RMSE of each time series smoothing function of 16 fishnet grid

Full size table

Local SNR analysis showed that the original time series had the highest SNR either in mode, median or in mean value. S-G smoothing function presented lower SNR than HANTS smoothing function, but higher SNR than Logistics and Gauss, as shown in Table 4:

Table 4. Local SNR of the original and fitted time series images

Full size table

3.2 Classification Results

3.2.1 Visual Check

Corn and soybeans distribution mapped from time series Band 7 to Band 18 of the original images and S-G smoothed images were shown in Figs. 5 and 6. The coarse resolution of MODIS Vis has made the mapping results less specific compared to the CDL distribution of corn and soybeans as shown in Fig. 1. However, the trend of distribution were generally right where there was corn or soybeans. The red ellipse circle on the map showed slightly different details between the results from the original images and S-G smoothed images.

3.2.2 Mapping Accuracy

Overall accuracy, producer and user accuracy of classification were compared in three groups, the first of which classified all 23 original and S-G fitted images, the second classified the 12 images from Band 7-18, and the third group classified 7 images including band 9, and Band 12-17. Percent and pixel accuracy of corn and soybeans were given, as well as the overall accuracy, as shown in Table 5:

Table 5. Overall, producer, and user accuracy of classification of three group

Full size table

The ratio of mapped acreage from MODIS time series NDVI images to the acreage of CDL and to the statistics reported by ASD released June 28, 2013, by NASS, USDA were given in Table 6.

Table 6. Mapped MODIS result acreage compared to pixel counted from CDL and statistical acreage released June 28, 2013, by NASS, USDA

Full size table

The mapped acreage of corn was very near to the acreage of both CDL and state statistics released in June 28, 2013, by NASS, USDA.

4 Discussion

4.1 Time Series Fitting Effectiveness

Visual checking showed that HANTS altered the original data value massively, and thus has shown higher SNR than S-G, Gauss and Logistics. Visual checking by time series profiles showed that S-G, Gauss, and Logistics were similar in the growing season, but differed from each other in other part of the year. Generally, S-G smoothing function got the best results, especially with NDVI peak value neither obviously lagging nor postponing.

Statistical check showed that S-G smoothing algorithms had the highest correlation coefficient either for the whole image or for sample points. S-G smoothing algorithms also presented the lowest RMSE and relatively lower SNR. With these results, S-G was thought to be the best and stably effective smoothing function for crop mapping.

4.2 Crop Mapping Accuracy

With the same methods, the same input data bands, and the same training data, the overall accuracy of the original time series was generally better than the S-G smoothed time series, but the accuracy was not obviously high, no higher than 5 %. The ratio of mapped acreage to CDL pixel counted acreage was near 100 % for corn, and near 80 % for soybeans, and so was the ratio of mapped acreage to national statistical acreage. Besides, the detail in the distribution map also led to different conclusion compared to the accuracy report. Therefore, it was concluded here that the smoothing functions, the mapping method, the training data and the accuracy check method all had effect on the comparison. It is the goal the matters for the choosing of mapping method.

4.3 Limitations

Both time series fitting method and the selection of input bands have effect on the mapping result. This paper only investigated the result of spectral angle mapper mapping method. Other method may got different results as well.

5 Conclusion

This paper compared the effectiveness of four time series smoothing algorithms through a series of statistical indicators, and used the most effective smoothed result to conduct crop mapping. S-G smoothing was found to be the most effective fitting method. However, the most effective smoothed results did not provide the most accurate mapping results with the same method, the same training data and the same checking data. This findings imply that both mapping methods and the goals or purposes the user require matter more for a better result of crop mapping. It was not necessary to conduct a time series smoothing before a better mapping scheme was founded.

References

Atzberger, C., Eilers, P.H.: Evaluating the effectiveness of smoothing algorithms in the absence of ground reference measurements. Int. J. Remote Sens. 32, 3689–3709 (2011)
Article Google Scholar
Beck, P.S.A., Atzberger, C., Høgda, K.A., Johansen, B., Skidmore, A.K.: Improved monitoring of vegetation dynamics at very high latitudes: a new method using MODIS NDVI. Remote Sens. Environ. 100, 321–334 (2006)
Article Google Scholar
Brown, J.C., Kastens, J.H., Coutinho, A.C., de Castro Victoria, D., Bishop, C.R.: Classifying multiyear agricultural land use data from Mato Grosso using time-series MODIS vegetation index data. Remote Sens. Environ. 130, 39–50 (2013)
Article Google Scholar
Chang, J., Hansen, M.C., Pittman, K., Carroll, M., DiMiceli, C.: Corn and soybean mapping in the United States using MODIS time-series data sets. Agron. J. 99, 1654–1664 (2007)
Article Google Scholar
Chen, J., Jönsson, P., Tamura, M., Gu, Z., Matsushita, B., Eklundh, L.: A simple method for reconstructing a high-quality NDVI time-series data set based on the Savitzky-Golay filter. Remote Sens. Environ. 91, 332–344 (2004)
Article Google Scholar
de Souza, C.H.W., Mercante, E., Johann, J.A., Lamparelli, R.A.C., Uribe-Opazo, M.A.: Mapping and discrimination of soya bean and corn crops using spectro-temporal profiles of vegetation indices. Int. J. Remote Sens. 36, 1809–1824 (2015)
Article Google Scholar
Duggin, M.: Review article: factors limiting the discrimination and quantification of terrestrial features using remotely sensed radiance. Int. J. Remote Sens. 6(1), 3–27 (1985)
Article Google Scholar
Gusso, A., Arvor, D., Ducati, J.R., Veronez, M.R., Da Silveira, L.G.: Assessing the MODIS crop detection algorithm for soybean crop area mapping and expansion in the Mato Grosso State, Brazil. Sci. World J. 2014 (2014)
Google Scholar
Hird, J.N., McDermid, G.J.: Noise reduction of NDVI time series: an empirical comparison of selected techniques. Remote Sens. Environ. 113, 248–258 (2009)
Article Google Scholar
Huete, A., Didan, K., Miura, T., Rodriguez, E.P., Gao, X., Ferreira, L.G.: Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 83, 195–213 (2002)
Article Google Scholar
Jönsson, P.K., Eklundh, L.: Seasonality extraction by function fitting to time series of satellite sensor data. IEEE Trans. Geosci. Remote Sens. 40, 1824–1832 (2002)
Article Google Scholar
Jönsson, P., Eklundh, L.: TIMESAT—a program for analyzing time-series of satellite sensor data. Comput. Geosci. 30, 833–845 (2004)
Article Google Scholar
Jentsch, C., Subba Rao, S.: A test for second order stationarity of a multivariate time series. J. Econometrics 185(1), 124–161 (2015)
Article MathSciNet MATH Google Scholar
Kruse, F.A., Lefkoff, A.B., Boardman, J.W., Heidebrecht, K.B., Shapiro, A.T., Barloon, P.J., Goetz, A.F.H.: The spectral image processing system (SIPS)—interactive visualization and analysis of imaging spectrometer data. Remote Sens. Environ. 44, 145–163 (1993)
Article Google Scholar
Sakamoto, T., Wardlow, B.D., Gitelson, A.A., Verma, S.B., Suyker, A.E., Arkebauer, T.J.: A two-step filtering approach for detecting maize and soybean phenology with time-series MODIS data. Remote Sens. Environ. 114, 2146–2159 (2010)
Article Google Scholar
Wardlow, B.D., Egbert, S.L.: Large-area crop mapping using time-series MODIS 250 m NDVI data: an assessment for the US Central Great Plains. Remote Sens. Environ. 112, 1096–1116 (2008)
Article Google Scholar
Zhang, J., Feng, L., Yao, F.: Improved maize cultivated area estimation over a large scale combining MODIS–EVI time series data and crop phenological information. ISPRS J. Photogrammetry Remote Sens. 94, 102–113 (2014)
Article Google Scholar
Zhou, J., Jia, L., Menenti, M.: Reconstruction of global MODIS NDVI time series: performance of Harmonic ANalysis of Time Series (HANTS). Remote Sens. Environ. 163, 217–228 (2015)
Article Google Scholar
Zhang, H., Ren, Z.Y.: Comparison and application analysis of several NDVI time-series reconstruction methods. Scientia Agricultura Sinica 47(15), 2998–3008 (2014). (in Chinese)
MathSciNet Google Scholar
Zhu, B., Wang, X.H., Tang, L.L., Li, C.R.: Review on methods for SNR estimation of optical remote sensing imagery. Remote Sens. Technol. Appl. 25(1), 303–309 (2010). (in Chinese)
Google Scholar

Download references

Acknowledgment

We express our deep gratitude to the free data and software providers: NASA for MODIS, USDA for CDL, GDSC (Geospatial Data Service Centre) for HANTS package, and Lars Eklundh for the software TIMESAT. The whole work was funded by the Civil Space Project in the 12th Five-Year Plan of China (2011–2015)

Author information

Authors and Affiliations

Chinese Academy of Agricultural Engineering, Beijing, 100125, People’s Republic of China
Ailian Chen, Hu Zhao & Zhiyuan Pei

Authors

Ailian Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hu Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyuan Pei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ailian Chen .

Editor information

Editors and Affiliations

China Agricultural University, Beijing, China
Daoliang Li
China Agricultural University, Beijing, China
Zhenbo Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, A., Zhao, H., Pei, Z. (2016). Is Time Series Smoothing Function Necessary for Crop Mapping? — Evidence from Spectral Angle Mapper After Empirical Analysis. In: Li, D., Li, Z. (eds) Computer and Computing Technologies in Agriculture IX. CCTA 2015. IFIP Advances in Information and Communication Technology, vol 478. Springer, Cham. https://doi.org/10.1007/978-3-319-48357-3_33

Download citation

DOI: https://doi.org/10.1007/978-3-319-48357-3_33
Published: 12 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48356-6
Online ISBN: 978-3-319-48357-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Is Time Series Smoothing Function Necessary for Crop Mapping? — Evidence from Spectral Angle Mapper After Empirical Analysis

Abstract

Similar content being viewed by others

Modeling grass yields in Qinghai Province, China, based on MODIS NDVI data—an empirical comparison

Estimation of rice phenology date using integrated HJ-1 CCD and Landsat-8 OLI vegetation indices time-series images

Assimilation of Remote Sensing Data into Crop Growth Model for Yield Estimation: A Case Study from India

Keywords

1 Introduction