Introduction

Dynamic variation of regional crop phenology is an important component of agriculture monitoring (Eerens et al., 2014), and is of increasing relevance for environmental monitoring, e.g. changes in the phenological period and length of the growing season may be caused by climate variability (Brown and de Beurs, 2008; Pan et al., 2015). China has about one fifth of the world’s paddy rice land, and it is of critical importance to extract the phenology information of rice on a large spatial scale to serve related research in agricultural modeling, yield estimation, and climate change (FAOSTAT, 2012; Li S.H. et al., 2014).

Remote sensing techniques offer a feasible tool for delineating spatiotemporal patterns of crop status on a per-pixel basis. Various kinds of optical remote sensing data and techniques have been applied in agricultural monitoring, e.g. the Advanced Very High Resolution Radiometer (AVHRR), the Moderate Resolution Imaging Spectroradiometer (MODIS), and the Systeme Probatoire d’Observation de la Terre (SPOT) VEGETATION (Wu et al., 2010; Peng et al., 2011; Cong et al., 2012; Lanorte et al., 2014). However, the spectral characteristics of the data may be seriously affected by a mixed-pixel problem due to the relative coarse resolutions (≥250 m) (Sakamoto et al., 2005). The Landsat remote sensing data have relatively high spatial resolution but, hampered by its low-temporal resolution, are not suitable for phenological information extraction purposes in agriculture (Anderson et al., 2011). When considering the influence of climate, the situation may be more serious. Most of the paddy fields located in the subtropical regions of China are influenced by the monsoon climate; as a consequence, the qualifying cloud-free remote sensing data during this monsoon period are relatively few (Pyongsop et al., 2010; Cai et al., 2012). To accurately extract the phenological information of rice in this region, a high spatial and temporal resolution remote sensing dataset is required, especially during the biologically sensitive growing period (Wu et al., 2010).

The small sun-synchronous satellites for environment and disaster monitoring and forecasting (HJ-1A and HJ-1B) from China were launched in 2008. The HJ-1A/B satellites have swath width of 700 km, four spectral bands with a spatial resolution of 30 m in the visible bands, and a revisit cycle of four days (the revisit cycle of the constellation is two days) as shown in Table 1 (She et al., 2015). The sensor characteristics of HJ-1 CCD and Landsat-8 operational land imagers (OLIs), including the visible bands composition and spatial resolution, are very comparable. The Landsat-8 was launched on February 2013, and our intent is to combine HJ-1 CCD and Landsat-8 OLI data and verify its capability for extracting the phenological information of the single-cropped rice in southern China.

Table 1 Sensor characteristics of HJ-1 CCD and Landsat-8 OLI

The time-series vegetation indices (VIs), such as the normalized difference vegetation index (NDVI) (Rouse et al., 1974) and the 2-band enhanced vegetation index (EVI2) (Jiang et al., 2007), are widely used in the studies of crop land classification, plant productivity, phenology, and crop growth monitoring (Panda et al., 2010; Gao et al., 2013; Shi et al., 2013; Zhang et al., 2013). It has been shown that VIs are relatively insensitive to the differences in angular viewing factors and atmospheric disturbances and thereby can be used as a benchmark for direct comparison between sensors (Steven et al., 2003).

Regression analysis has been common practice for the intercalibration of different remote sensing data. The ordinary least-squares (OLS) model is a primary tool for comparing two datasets and predicting one dataset from another (Ji et al., 2008). Anderson et al. (2011) compared the ResourceSat-1 NDVI values and Landsat-5 NDVI values by building an intercalibration equation. Li P. et al. (2014) used OLS in cross-comparison of VIs derived from Landsat-8 OLI and Landsat-7 Enhanced Thematic Mapper Plus (ETM+) sensors, and showed that OLI and ETM+ data are complementary. Agreement assessment is another important research topic in image intercalibration. The non-dimensional and symmetrical features of some agreement measures are more suitable for intercalibrating the VIs acquired from different sensors (Mielke and Berry, 2001; Ji and Gallo, 2006). In agreement analysis, the two datasets are treated equally and symmetrically, and the systematic and unsystematic differences between two datasets can be quantified. The geometric mean (GM) regression, orthogonal regression, and OLS bisector regression are widely used symmetrical regression models (Sprent and Dolby, 1980; Isobe et al., 1990; Rawlings et al., 1990; Valsami and Macheras, 1995). Ji et al. (2008) used agreement analysis using symmetrical regression models and compared the AVHRR and MODIS 16-d composite NDVI products. GM regression-based agreement analysis was also used in a cross-sensor comparison study between Landsat-5 TM and IRS-P6 AWiFS (Chen et al., 2013).

The goal of this study was to generate a new VI time-series data with a higher temporal resolution by integrating HJ-1 CCD and Landsat-8 OLI images, in order to extract the phenological information of the single-cropped rice more accurately, and verify that VIs derived from HJ CCD and OLI images can be used as complementary data after proper intercalibration. We used the OLS method to integrate the two remote sensing datasets with the aid of a field campaign, and compared the efficiencies of NDVI and EVI2 using the agreement analysis technique. We also evaluated the integrated time-series VIs in rice phenological parameter extraction using field survey data.

Data and methods

Study area

Deqing County located in North Zhejiang Province was selected as the study area. It has a mean annual temperature ranging from 13 to 16 °C and annual precipitation of 1379 mm (Fig. 1). This region is dominated by the tropical monsoon climate and more than 91% of the crop area is single-cropped rice, according to local statistical data.

Fig. 1
figure 1

Map of the study area

Two overlaid images are HJ-1B CCD1 (path 451 row 80) and Landsat-8 OLI (path 119 row 39), respectively. Dashed line refers to the HJ-1 CCD scene and the solid line refers to the Landsat-8 OLI scene (the swath width of each constellation)

According to a field survey conducted during 2013–2014, the land cover types were classified as rice, trees, water bodies, artificial surfaces, and others. Four typical testing sites were selected with different land cover structures to intercalibrate the VIs derived from the two sensors. All testing sites were larger than 4 km×4 km. Furthermore, total 2281 independent randomly selected sample points were used for comparison based on the field survey and the Second National Soil Survey Vector Map, in which there were 484 for rice, 434 for trees, 709 for water bodies, 448 for artificial surfaces, and 206 for other areas. Sixteen site-based observations of rice phenology in 2013 provided by the Deqing agro-meteorological station were used as validation.

Satellite data

The HJ-1 CCD sensors have similar characteristics to the Landsat-8 OLI, including a nominal descending orbit at 10:30 a.m. local crossing time, the same spatial resolution of 30 m, and overlapped bands in the visible spectrum (Fig. 1; Table 1). The prominent short revisit interval of HJ-1 CCD makes it an ideal candidate for crop status monitoring.

We followed three basic principles in selecting remote sensing images of HJ-1 CCD and OLI in the study area to meet the requirement of calibration and time-series VI construction. They were that (1) the acquisition time difference of HJ-1 CCD and OLI should be within one day; (2) the time span of the remote sensing dataset should cover the whole growing season of the single-cropped rice; and (3) all the four testing sites should be cloud-free in the image pairs for calibration.

Six pairs of archived HJ-1 CCD and Landsat-8 OLI data less than one day apart covering the study area in 2013 and 2014 were collected for intercalibration purposes (Table 2). To extract the rice phenological information, all the 32 relatively cloud-free HJ-1 CCD and OLI images which covered the rice growing period (from late May to early December) in 2013 were also collected to construct the time-series VIs (Table 3).

Table 2 Image pairs of HJ-1 CCD and Landsat-8 OLI used for calibration
Table 3 Images of HJ-1 CCD and Landsat-8 OLI used for phenology extraction

The Level 2 HJ-1 CCD images were downloaded from the China Centre for Resources Satellite Data and Application (CRESDA). The Landsat-8 OLI standard level-one terrain-corrected (L1T) products were collected from the US Geological Survey (USGS). The OLI L1T products were geometrically corrected and orthorectified products by the data provider using standard systematic correction methods.

Pro-processing and VI calculation

The images used were all processed by the radiometric calibration first. The formula and coefficients were collected from the raw data package. The atmospheric correction was performed using the Moderate Resolution Transmission (MODTRAN) 4 model integrated in FLAASH module in the ENVI package. The images were geometrically corrected using geographic information system (GIS) data to an accuracy less than 0.5 pixel (15 m).

Two widely used VIs, i.e. NDVI and EVI2, were selected as the candidates to construct the time-series VI for extracting the phenological information of the single-cropped rice. The NDVI has been proved to be one of the best indicators for vegetation status monitoring (Xie et al., 2008). While for EVI2, compared with EVI, it is more stable across sensors because only 2 bands are used (Jiang et al., 2007; 2008; Kim et al., 2010; Wang et al., 2015). The corresponding formulas are given as follows:

$${\rm{NDVI}} = {{{\rho _{{\rm{nir}}}} - {\rho _{{\rm{red}}}}} \over {{\rho _{{\rm{nir}}}} + {\rho _{{\rm{red}}}}}},$$
(1)
$${\rm{EVI2}} = 2.5 \times {{{\rho _{{\rm{nir}}}} - {\rho _{{\rm{red}}}}} \over {{\rho _{{\rm{nir}}}} + 2.4 \times {\rho _{{\rm{red}}}} + 1}},$$
(2)

where ρred and ρnir refer to the surface reflectance values of Bands 3 and 4 in HJ-1 CCD sensors, and Bands 4 and 5 in Landsat-8 OLI sensors, respectively (Table 1).

Regression and agreement analysis

For intercalibration, we built the OLS regression functions and transformed Landsat-8 OLI-VIs to HJ-1 CCD-VIs based on the data of the four typical testing sites. The GM regression results were used in the following agreement analysis.

The agreement metric, i.e. the agreement coefficient (AC), proposed by Ji and Gallo (2006) was adopted to evaluate the agreement of the two datasets. The AC has the properties to distinguish between systematic and unsystematic differences. The AC is defined as:

$${\rm{AC}} = 1 - {{\sum\limits_{i = 1}^n {{{({X_i} - {Y_i})}^2}} } \over {\sum\limits_{i = 1}^n {(\vert\overline X - \overline Y \vert + \vert{X_i} - \overline X \vert)(\vert\overline X - \overline Y \vert + \vert{Y_i} - \overline Y \vert)} }},$$
(3)

where \(\overline{X}\) and \(\overline{Y}\) are the mean values of datasets X and Y, respectively. X i and Y i are the VI values of each pixel (i=1, 2, 3, …, n). The agreement between X and Y increases as AC approaches 1, and vice versa.

The mean square difference (MSD), a metric used to measure the systematic and unsystematic differences between two dataset, is defined as:

$${\rm{MSD}} = {1 \over n}\sum\limits_{i = 1}^n {{{({X_i} - {Y_i})}^2}}.$$
(4)

The MSD can be further divided into the unsystematic mean product-difference (MPDu) and systematic mean product-difference (MPDs). The MPDu is as follows:

$${\rm{MP}}{{\rm{D}}_{\rm{u}}} = {1 \over n}\sum\limits_{i = 1}^n {(\vert{X_i} - {{\hat X}_i}\vert)(\vert{Y_i} - {{\hat Y}_i}\vert)},$$
(5)

where \(\hat{X}_{i}\) and \(\hat{Y}_{i}\) are the predicted X and Y values, estimated by the GM and inverse GM regressions, respectively. The MPDs is then defined as follows:

$${\rm{MP}}{{\rm{D}}_{\rm{s}}} = {\rm{MSD}} - {\rm{MP}}{{\rm{D}}_{\rm{u}}}.$$
(6)

To compare the difference properties, i.e. the systematic and unsystematic differences, between the two datasets, we also calculated MPDs/MSD and MPDu/MSD, expressed as percentages.

Rice phenological parameters extracted by time-series VIs

To evaluate the performance of time-series VIs produced by HJ-1 CCD and OLI sensors in rice phenological parameter extraction, we constructed two VI time-series for NDVI and EVI2, respectively. The 1st time-series VI was produced using HJ-1 CCD alone; and the 2nd one was generated by integrating HJ-1 CCD and OLI data. The Savitzky-Golay (S-G) filters were applied to smooth the VI time-series data to reduce the undesirable noise caused by unfavorable fluctuations (Jönsson and Eklundh, 2004; Hird and McDermid, 2009; Pan et al., 2015). Readers interested in the S-G filters are referred to these studies (Jönsson and Eklundh, 2004; Hird and McDermid, 2009) for more details. The S-G filters were applied to the image-based VI time-series data composed by the HJ-1CCD and Landsat-8 OLI data from late May to early December in 2013 as listed in Table 3.

Before phenological parameters extraction, the planting area of single-cropped rice was estimated using a stepwise classification strategy proposed by Wang et al. (2015). The total acreage of the single-cropped rice was about 94.0 km2 in 2013, mainly distributed in the eastern part of the study area.

Individual phenological stages of crops are identified by considering the VI increment or decrement between consecutive images over a certain period of time (Xin et al., 2002; Boschetti et al., 2009) (Fig. 2). Divided by the heading date, the growth phases of single-cropped rice can be classified into vegetative growth and reproductive growth. Because the maximum VI usually appears around the heading date, it is convenient to define the heading date as the date of the maximum VI on the VI profile. In general, the rice fields are flooded before transplanting and the VI of rice fields decreases during this period and then increases after rice planting. Therefore, it is reasonable to define the transplanting date of rice as the minimal point along the VI profile. Due to the etiolation and senescence of the rice leaves, the VI decreases after the heading, and the maturation date of rice is identified by the maximum slope method (Yu et al., 2003; You et al., 2013). The results were validated with the field survey data collected by the local agro-meteorological station in 2013.

Fig. 2
figure 2

VI profile of the single-cropped rice

Smooth curve was generated using S-G filters

Results

Intercalibration of VIs

We first compared the VI values of different land cover types of the sample points derived from HJ-1 CCD and Landsat-8 OLI images. It showed that EVI2 was more stable and comparable between the two sensors compared with NDVI (Fig. 3). This was further illustrated by the relatively small standard deviation of EVI2. The fluctuations of mean values of EVI2 were less than 0.04 while those of NDVI were around 0.05.

Fig. 3
figure 3

Mean values and standard deviations of the differenced vegetation indices

dNDVI: differenced NDVI; dEVI2: differenced EVI2. dNDVI (a) and dEVI2 (b) are derived from Landsat-8 OLI and HJ-1 CCD images (image pair No. represents the six images pairs listed in Table 2)

The linear regression relationships between Landsat-8 OLI-VIs and HJ-1 CCD-VIs for all samples (pixels) in the testing sites are shown in Fig. 4. The two datasets are highly correlated for both NDVI and EVI2. The coefficients of determination of NDVI and EVI2 were 0.871 and 0.891, respectively.

Fig. 4
figure 4

Regression analysis of HJ-1 CCD-VIs versus Landsat-8 OLI-VIs using data of the four testing sites within six image pairs

Black solid lines are the 45° lines and red solid lines are the linear regression trendlines (Note: for interpretation of the references to color in this figure legend, the reader is referred to the web version of this article)

The agreement analysis showed that NDVI and EVI2 both had relatively high AC values (0.87 for NDVI and 0.89 for EVI2; Table 4). It’s notable that the MPDu values were much higher than the MPDs values for both VIs, indicating that the unsystematic difference was greater than the systematic difference. Additionally, the MPDu/MSD values were much higher than MPDs/MSD values for both VIs, an indication that the unsystematic difference was the primary difference between the two datasets. The MPDs and MPDu values of EVI2 were lower than those of NDVI, and it appeared that HJ-1 CCD and Landsat OLI may have more consistency in using EVI2.

Table 4 Agreement analysis for HJ-1 CCD-VIs (variable Y ) and Landsat-8 OLI-VIs (variable X )

The systematic difference between HJ-1 CCD-VIs and Landsat-8 OLI-VIs can be minimized using OLS. By taking Landsat-8 OLI-VIs as the dependent variables and HJ-1 CCD-VIs as the response, two linear regression functions for NDVI and EVI2 derived from the four testing sites within six image pairs could be used to rebuild the time-series data in the following phenological parameter extraction.

Extraction of phenological parameters using integrated HJ-1 and Landsat-8 VI time-series images

We compared the smoothed NDVI and EVI2 time-series data at pixel level using S-G filters (Fig. 5). The phenology stages could be clearly identified from the VI profiles. Most of the noise was successfully eliminated from the VI time-series. The single-cropped rice usually is transplanted during late May to early July, and reaches the heading stage from late August to early September according to the weather and agricultural scheduling. By adding the Landsat-8 OLI data, the updated VI curves for both NDVI and EVI2 moved to the right.

Fig. 5
figure 5

VI time-series generated by Savitzky-Golay filters

The transplanting, heading, and maturation dates of the single-cropped rice extracted by HJ-1 VI time-series and the integrated HJ-1 and Landsat-8 VI time-series images in Deqing County are presented in Figs. 68 separately. For transplanting date, the dates extracted by NDVI time-series concentrated in late May to middle June; after integrating Landsat-8 OLI data, with more pixels moved into the 160–169 group. Similarly, the transplanting dates extracted by the EVI2 time-series data of HJ-1 concentrated in late May to early June, and the dates also delayed after integrating Landsat-8 OLI data. Since there were different rice varieties planted in Deqing, the heading dates were more dispersive from early August to early September, but the dates extracted by the NDVI time-series showed an advancing trend compared with the EVI2 time-series. For maturation dates, both of NDVI and EVI2 showed less difference between the original dataset and the integrated dataset compared with other phenological stages.

Fig. 6
figure 6

Transplanting date of rice extracted from NDVI and EVI time-series data in 2013

(a) HJ-CCD derived NDVI time-series; (b) Integrated NDVI time-series; (c) HJ-CCD derived EVI2 time-series; (d) Integrated EVI2 time-series

Fig. 7
figure 7

Heading date of rice extracted from NDVI and EVI2 time-series data in 2013

(a) HJ-CCD derived NDVI time-series; (b) Integrated NDVI time-series; (c) HJ-CCD derived EVI2 time-series; (d) Integrated EVI2 time-series

Fig. 8
figure 8

Maturation date of rice extracted from NDV I and EVI2 time-series data in 2013

(a) HJ-CCD derived NDVI time-series; (b) Integrated NDVI time-series; (c) HJ-CCD derived EVI2 time-series; (d) Integrated EVI2 time-series

The accuracy of the phenological dates extracted as above was evaluated using field-based phenology observations (Table 5; Fig. 9). The estimated dates for each phenological stage were generally within ±6 d compared with the observations. The differences between the two dates decreased after using the integrated VI time-series as source data for both NDVI and EVI2. Table 5 showed that the root mean square error (RMSE) of the estimated phenological dates using HJ-1 CCD images alone was larger than that of the integrated NDVI and EVI2 images. The RMSE of EVI2-derived phenology dates was smaller than that of NDVI in transplanting and heading dates. It showed that after integrating HJ-1 CCD and Landsat-8 OLI data, the VI time-series provided a more accurate estimation of rice transplanting, heading, and maturation dates than using single data sources.

Fig. 9
figure 9

Key phenology stages (transplanting, heading, and maturation) of rice extracted from time-series data (original and integrated data) compared with the field observed data

(a) Transplanting date extracted by NDVI time-series; (b) Transplanting date extracted by EVI2 time-series; (c) Heading date extracted by NDVI time-series; (d) Heading date extracted by EVI2 time-series; (e) Maturation date extracted by NDVI time-series; (f) Maturation date extracted by EVI2 time-series. DOY: day of year

Table 5 Root mean square error (RMSE) of the estimated phenological dates compared with the field observed data

Discussion

Where remote sensing is used as a tool to facilitate the extraction of crop phenology information, “more data” is always the better policy. The crucial point in this practice is to combine the readily available remote sensing data, if possible, and generate a more dense time-series of data. In this study, we took full advantage of HJ-1 CCD, characterized by its high temporal resolution, and combined this with Landsat-8 OLI data to generate integrated time-series VIs and extract the key phenology information of the single-cropped rice.

We selected NDVI and EVI2 in this study and compared their efficiencies in extracting the phenological information of rice. The comparison of VIs between HJ-1 CCD and Landsat-8 OLI showed slight differences due to the spectral band differences (Teillet and Ren, 2008; Anderson et al., 2011; Li P. et al., 2014). As we had pointed out, the inconsistency between the two sensors could be attributed to systematic and unsystematic differences quantitatively described by agreement analysis. The systematic difference between the two sensors could be minimized by OLS, while the unsystematic difference could be effectively attenuated using VI, which could minimize the influence of the environmental factors.

We built the empirical relationships between HJ-1 CCD-VIs and Landsat-8 OLI-VIs datasets at pixel level using OLS. By using the linear regression models, HJ-1 CCD-VIs and Landsat-8 OLI-VIs could be transformed interchangeably. Scatter plots of HJ-1 CCD-VIs versus Landsat-8 OLI-VIs for four testing sites within six image pairs demonstrated that the VIs obtained from HJ-1 CCD and Landsat-8 OLI were highly correlated, and EVI2 had a relatively high R2 compared with NDVI (Fig. 4). Not surprisingly, the agreement analysis also showed that EVI2 between the two sensors had significant lower systematic and unsystematic differences compared with NDVI.

To extract the phenological parameters of rice, the S-G filters were applied to smooth the VI time-series data, and we defined criteria to judge the dates of transplanting, heading, and maturation of the single-cropped rice by considering the VI increment or decrement between consecutive images over a certain period of time according to the field campaign work. Compared with the observed data, EVI2 showed a significantly lower RMSE than NDVI in the estimation of transplanting and heading dates, and this was testified by the relative higher agreement coefficient of EVI2 (Table 4). We noticed that NDVI showed the largest RMSE at heading date, and the heading dates estimated by NDVI advanced compared with the observations (Table 5; Fig. 9). Because NDVI is easily saturated in well-vegetated areas (Huete et al., 2002), it can be seen that NDVI becomes insensitive at high values of leaf area index (LAI) (Gu et al., 2013). Considering that the single-cropped rice usually reaches the peak of LAI between the ear differentiation and early heading stages, the advanced heading date estimated by using NDVI is not unexpected. Compared with NDVI, EVI2 is more robust in capturing the difference in well-vegetated areas and thus has more potential in phenology extraction.

We combined extremum and maximum slope methods in rice phenology estimation. Due to the different methodologies applied in estimating rice phenology in field and remote sensing, an error between the two kinds of estimation is unavoidable. However, EVI2, especially the integrated EVI2, showed a significant consistency and agreement in phenology dates estimation.

However, it should be noted that the empirical equation built between HJ-1 CCD-VI and Landsat-8 OLI-VI datasets was partially determined by the sample plots and dates of the image pairs. So, there are inevitable uncertainties in the estimation. If extrapolating this kind of integrating methodology and results to other regions or years and considering the variations of environmental factors and vegetation status, necessary re-calibration steps must be taken.

Conclusions

Accurate extraction of the phenology information of rice in large spatial scale is crucial to agricultural management and related ecological studies. High quality time-series remote sensing data are of critical importance in identifying the key phenology dates as close as possible to the “true” dates. Most of the paddy fields in southern China are influenced by the monsoon climate, and qualified remote sensing data are scarce resources in these regions.

In this study, we showed that by integrating HJ-1 CCD and Landsat-8 OLI data using OLS, the phenological parameters of the single-cropped rice can be estimated more accurately. Two widely used VIs, namely NDVI and EVI2, were adopted, and not surprisingly, the two indices obtained from HJ-1 and Landsat-8 showed high correlation and agreement (R2>0.87, AC>>0.86). However, compared with NDVI, EVI2 was more stable and comparable between the two sensors. Compared with the observed data, the integrated VI time-series had a relatively low RMSE, in which EVI2 was superior to NDVI. We also demonstrated the application of phenology extraction of the single-cropped rice in spatial scale in the study area. This work is of general value and can be extrapolated to other regions where qualified remote sensing data are the bottleneck and where complementary data are occasionally available.

Compliance with ethics guidelines

Jing WANG, Jing-feng HUANG, Xiu-zhen WANG, Meng-ting JIN, Zhen ZHOU, Qiao-ying GUO, Zhe-wen ZHAO, Wei-jiao HUANG, Yao ZHANG, and Xiao-dong SONG declare that they have no conflict of interest.

This article does not contain any studies with human or animal subjects performed by any of the authors.