Introduction

Digital Elevation Models (DEM) is widely used as a digital representation of terrain. Terrain models include important factors for soil erosion and hydrology, etc. [13]. DEM resolution is one of the most important factors that influence the ability of a DEM to represent terrain. That is because as resolution becomes coarser, many of the terrain indexes that derived from the DEM will change. Elevation and Slope are two of the most important terrain factors in many study fields [4, 5]. Many researchers have paid attention to the change of terrain factors with DEM resolution reduction such as Chang and Tsai (1991) [6], Gao (1997) [7], Zhang (1999) [8], Wolock (2000) [9] and Wu et al. (2008) [10]. Their results showed that slope tended to become smaller in most of areas with resolution reduction.

But there is not yet sufficient study on why there are differences between terrain factors derived from fine resolution DEM and from coarse resolution DEM and which part of the information has been lost with resolution reduction. Geostatistics has been widely applied to study fields such as vegetation investigation, soil characteristic analyses, etc. Some researchers have used geostatistical analyses to study spatial patterns in topography [11]. In this research the authors studied the structure of terrain and the change in terrain structure with resolution reduction by investigating changes in each component of the semi-variogram. The study uses Xian' Nan watershed as the study area. This watershed is located in Loess Hilly area in Loess Plateau in China. The authors studied the differences between elevation and slope derived from DEMs with resolutions of 5 m, 25 m and 50 m.

The aim of this research is to show which information "disappears" or "reduces" when resolution becomes coarser. This may help explain why terrain factors change with resolution and understand the terrain characteristics which change with resolution.

Material and methods

Study area

The study area is the Xian' Nan watershed located in the Hilly area of the Loess Plateau (E109°11′-109°22′, N36°42′-36°47′) (Figure 1). This watershed covers an area of 44 km2 with an average altitude of 1220 m. It is a typical hilly landform in the Loess Plateau with a complex surface configuration and crossing gullies. The ground slope is steep with an average gradient of 28°.

Figure 1
figure 1

Location of study area

Base data and data processing

The base data is a topographic map at 1:10,000 scale issued by China's National Bureau of Surveying and Mapping in 1981 with a contour interval of 5 m which covers the whole study area. The data processing includes topographic map digitizing, projection transformation, check in elevation values, check in river directions and Lake Boundaries. The projection for the base data is Krasovsky_1940_Albers. Longitude of the Central Meridian is E105°. The first Standard Parallel is N25° and the second Standard Parallel is N47°. Latitude of projection Origin is 0°. False Easting and False Northing are both 0 m. The projection is an equal-area projection. The grid cells are "square" and in meters.

DEMs with resolution of 5 m, 25 m and 50 m were built using ANUDEM which were developed in Australia by Hutchinson [12]. ANUDEM interpolates the data from the topographic maps to finer resolutions in down scaling steps from coarser resolutions. The ANUDEM software interpolates the DEM where there is no source data using minimum curvature in order to make the surface as smooth as possible. The finest resolution used here is 5 m because 5 m is found to be the most suitable finest resolution according to the ANUDEM rules for topographic maps with scale of 1:10,000. The coarser DEMs seem to be similar with those obtained by up scaling the finer DEMs using low pass filtering to some extent, however it is not fully the same. Research is under way into the nature of the differences. The parameter values used in ANUDEM are showed in Table 1.

Table 1 Parameters of ANUDEM

Information statistics for elevation and slope

Slope calculation

In this research slope is calculated using the following definition:

θ s = T a n - 1 p 2 + q 2 1 / 2
p , q = z = z x , z y

Where θ s refers to slope; p and q refer to gradient field components. In a digital image, p and q are both calculated using the high pass filters as following.

- 1 0 1 - 1 0 1 - 1 0 1 / 6 h 1 1 1 0 0 0 - 1 - 1 - 1 / 6 h  p filter  q filter

Where "h" is the resolution step or grid cell size for the data.

Histogram intersection

Swain and Ballard [13] efficiently recognized objects by matching their color histograms using the histogram intersection (HI) method. In this research the author used Histogram Intersection (HI) to evaluate histogram similarity.

HI is calculated by:

HI ( X , Y ) = i min ( x i , y i )

Where HI(X, Y) is Histogram Intersection of two histograms X and Y; x i and y i are frequency values of X and Y at slope value of i. The values for HI(X, Y) range from 0% to 100%. Histograms are more similar to each other if HI(X, Y) is larger. If two histograms are totally the same, HI(X, Y) equals to 100%.

Reduction rate of mean value and STD

Mean value Reduction Rate and STD reduction rate for both elevation and slope are calculated here to refer to absolute value of mean value and STD change per reduction of resolution in meter. It will be used to help quantitatively analyze mean value and STD change with resolution. It is calculated by:

M v = M r - M r r - r
S v = S r - S r r - r

Where M v is Mean value Reduction Rate; M r is mean value with coarser resolution of r ; M r is mean value with finer resolution of r ; S v is STD Reduction Rate; S r is STD value with coarser resolution of r ; S r is STD value with finer resolution of r .

Geostatistical analysis

Covariance and semi-variogram

In geostatistics, the mathematical expression for covariance and the semi-variance function is:

C ( h x , h y ) = E ( ( Z x + h x , y + h y - m ) ( Z x , y - m ) )
γ ( h x , h y ) = 1 2 E ( ( Z ( x + h x , y + h y ) - Z ( x , y ) ) 2

C ( h x h y ) is the covariance function and γ ( h x , h y ) is the semi-variance function; (x, y) stands for the spatial coordinators of the tested point of the slope; Z(x, y) stands for the data of the tested point; h x and h y stands for the interval in x and y direction between two tested points (in 1D cases h = ( h x 2 + h y 2 ) 1 / 2 ), m is the mean value over the image. We will assume that the covariance is stationary and the mean (m) is a constant over the image.

In this case the relationship between the two expressions is:

γ ( h x , h y ) = C ( 0 , 0 ) - C ( h x h y )
C ( 0 , 0 ) = σ 2

σ 2 is the variance of the image data which needs to be spatially stationary [14].

Modeling covariance and semi-variogram

In this article a form of model for the semi-variogram called the "Independent Structures" model [14] (sometimes it is called "Nested Model") is assumed. In this model the original field Z ( x , y ) is considered to be composed of N "independent" fields, the covariance between the N fields is zero.

Z ( x , y ) = j = 1 n σ j Y j ( x , y )

So the total variance of Z is:

σ Z 2 = j = 1 n σ j 2

In the "Independent Structures" Model, the covariance function and semi-variogram fuction for Z is given by:

C Z ( h x , h y ) = j = 1 N σ j 2 c j ( R j ; h x , h y )
γ Z ( h x , h y ) = j = 1 N σ j 2 γ j ( R j ; h x , h y ) = j = 1 N σ j 2 ( 1 - c j ( R j ; h x , h y )

R j refers to the ranges of the component semi-variograms and:

0 < R 1 < R 2 < R 3 < R N

In this way, the first component is the one with greatest "roughness" or high spatial frequency content effect and the last is the one with greatest low frequency (regional) effect. This representation models the data with different "scale" components with Y 1 Y 2 Y 3 Y N representing scale components with decreasing map scale.

In this paper the authors assume that the base semi-variogram model is the radial 2D Gaussian, the covariance for each component is:

c j ( h ) = σ j 2 e - ( h / R j ) 2

The radial distance is h = h x 2 + h y 2 ; σ j 2 is the variance of the component j; The quantity R j is taken as the range of component j and the component has the form of a correlation function.

Effect of filtering on the covariance and semi-variogram

Assuming that the function Z that is being filtered has mean zero and finite variance, the covariance function for Z is:

C Z ( h ) = ( Z * Z ) ( h )

If Z is filtered by a filter ϕ , then:

γ F Z ( h ) = C F Z ( 0 ) - C F Z ( h )
C F Z ( h ) = ( ( ϕ * Z ) * ( ϕ * Z ) ) ( h ) = [ ( ϕ * ϕ * C Z ] ( h ) = ( C ϕ * C Z ) ( h )

Where C F Z ( h ) the covariance of the filtered functions and γ F Z ( h ) is the semivariogram of the filtered function.

If the base semi-variogram model is radial 2D Gaussian, and the smoothing kernel is a normalized radially symmetric 2D Gaussian with range S, then:

c j ( h ) = σ j 2 e - ( h / R j ) 2
ϕ ( h ) = 1 π S 2 e - ( h / S ) 2
S = F W H M π
F W H M = ϕ max ( ϕ )

FWHM (Full Width Half Maximum) is the width of the filter at half height. H is the height of the filter.

In this case the covariance of filtered function is:

C F j ( h ) = ( C ϕ * C j ) ( h ) = σ j 2 ( 1 + 2 S 2 R j 2 ) e - h 2 / R j 2 ( 1 + 2 S 2 R j 2 ) = σ ̃ j 2 e - ( h / R ̃ j ) 2

The covariance of the filtered function is the same type as before however it has reduced variance and increased range.

Model efficiency analysjs

In order to evaluate the result of the Independent Structures model of the semi-variogram, the model efficiency coefficient (ME) which was proposed by Nash and Sutcliffe [15] was used in this research.

ME is calculated by:

M E = 1 - ( Y o b s - Y p r e d ) 2 ( Y o b s - Y m e a n ) 2

Where ME is the model efficiency, Y o b s is the observed value, Y p r e d is the predicted value, Y m e a n is the mean observed value. Values for ME range from -∞ to 1. The closer ME is to 1, the better the model will predict individual values.

Results and analysis

Mean elevation and slope change with resolution

Table 2 shows the mean value and Standard deviation (STD) of both elevation and slope. Elevation in Xian'nan Watershed does not change greatly with resolution reduction but slope seems to be largely influenced by resolution. Mean elevation and STD of elevation tend to decrease slightly with resolution reduction with a reduction rate of 0.03 m/m. Mean slope and STD of slope tend to decrease with resolution reduction more intensively than elevation and the reduction rates are different with different resolution changes. The mean slope reduction rate with resolution from 5 m to 25 m of 0.3°/m is the largest (Table 3). This means that when resolution becomes 1 meter coarser, the mean slope will decrease by 0.3° which is large.

Table 2 Mean and Std of elevation and slope
Table 3 Mean value and STD reduction rate

Histograms of elevation and slope changes with resolution

Histograms of elevation with varied resolutions of 5 m, 25 m and 50 m are shown in Figure 2a. Histograms for elevation mix together because they are quite similar to each other. Elevation histogram distribution changes little because of the resolution reduction. HI in Table 4 clearly shows that the HI values between elevation histograms with 25 m, 50 m resolution and elevation histogram with 5 m resolution are both larger than 90%.

Figure 2
figure 2

a Histograms of elevation with varied resolution. b Histograms of slope with varied resolution

Table 4 HI comparing with 5 m data (%)

There seems to be a great difference between slope histograms with 25 m, 50 m resolution and the slope histogram with 5 m resolution (Figure 2b). As resolution reduces, the histograms of slope move to the left side which is lower. HI values between the slope histograms with 25 m and 50 m resolution and the slope histogram with 5 m resolution are 73.8% and 55.7% which shows the great influence of resolution on the histogram of slope.

Changes in semi-variogram of elevation and slope with resolution

Independent structures of semi-variogram

Semi-variogram of elevation with resolution of 5 m, 25 m and 50 m are calculated and modeled using the Gaussian model. These are plotted in Figure 3a. The elevation semi-variogram in the Xian'nan watershed has been modeled with 4 components which are labeled as Y1, Y2, Y3, and Y4. Sill and range for each component are shown in Table 5. The ranges are ordered by Y1<Y2<Y3<Y4 as in the independent model. Y4 indicates information that of longer wave-length. Y4 seems to be a trend component and has not reached a sill inside the maximum lag of semi-variance. This component is "unstable" and cannot be treated in the same way as the other components. The sills are ordered as: Y1<Y3<Y2. Y1 indicates information relating to short wave-length. In the elevation data in the study area, the short wave-length information accounts for much less variance than the long wave-length information.

Figure 3
figure 3

Independent model of semi-variogram

Table 5 Sill and range of elevation semi-variogram

Semi-variograms of slope with resolution of 5 m, 25 m and 50 m were calculated and modeled. These are plotted in Figure 3b. The slope semi-variogram in Xian'nan watershed has been modeled using 3 components which are labeled as Y1, Y2, and Y3. Sill and range of each component are shown in Table 6. The range is ordered by Y1<Y2<Y3 in the independent model. All of the three components have reached a sill inside the maximum lag of semi-variance. The sill values are ordered as: Y3<Y2<Y1 with 5 meter resolution. In slope data in the study area, the short wave-length information accounts for more variance than the long wave-length information.

Table 6 Sill and range of slope semi-variogram

The effect of resolution on the independent structures of semi-variogram

The effect of resolution reduction is quite similar to low pass filter. If we treat the resolution reduction as a Gaussian low pass filter, the semi-variograms with coarser resolution can be predicted from the fine resolution using the method discussed in this paper. The sill and range in each component can be predicted. In this paper, the authors predicted the coarser resolution variograms of each component from the 5 m resolution variograms. The ME of the model is shown in Table 7. The ME values are all above 0.65.

Table 7 ME of independent structure model

The influence of resolution reduction is smallest on long wave-length components and is largest for Y1 in both elevation and slope cases. Table 5 and Figure 3a show that in the elevation case, the sill value of Y1 reduced to 72.6% of the 5 m case when resolution is 50 m. Y2 and Y3 seem to have be stable and the reduction of sill when the resolution is 50 m is less than 5% of the 5 m case. Table VI and Figure 3b show that in slope case, the sill value of Y1 reduced to 18.1% of the 5 m case when resolution is 50 m. The sill value of Y2 reduced to 82.2% of the 5 m case when the resolution is 50 m and the sill value of Y3 reduced to99.5% of the 5 m case when the resolution is 50 m.

Comparing the elevation and slope cases, the influence of resolution reduction on slope is much larger than on elevation especially in the Y1 components. The Sill (Model sum) of elevation reduced only to 94.4% of the 5 m case but the sill (Model sum) of slope reduced to 52.6% of the 5 m case. The reason is that the process of slope calculation acts as a high pass filter. More short wave-length information is left in the image after the slope is calculated. Therefore there is much more short wave-length information in slope data than in elevation data. The low pass filter and the resolution reduction effect greatly influence the short wave-length information.

Discussion

ME of slope is lower than ME of elevation. It may because that the Gaussian Model is not the most appropriate one for slope. In this paper the main purpose is to show the scale effect which has been done. Work on appropriate model for slope is investigated by the authors.

In this paper the authors modeled the effects of resolution reduction on the semi-variogram using a Gaussian filter. Although the way ANUDEM changes with scale do not behave exactly the same way as a Gaussian filter, the model can work quite well. In the future, more effort is being put into the study of the differences between the two.

The semi-variograms in this paper has been modeled using Independent structure model. The model allows us to clarify the way resolution influence DEM data. In further study this would be applied to different terrain types.

In further study, the semi-variogram of each component will be mapped out using kriging to show the nature of different information in each component.

Conclusions

In this paper the independent structure model was used to model the semi-variograms of elevation and slope data with resolution of 5 m, 25 m and 50 m. The results showed how the short wave-length information disappeared or weakened as resolution reduced. These results can explain which component of the information has changed with resolution reduction. By calculating the mean value, STD and histogram of both elevation and slope at different resolutions, it is clear that the influence of resolution on elevation is less than on slope. The reason is that the short wave-length information accounts for more variance in slope data than in elevation data. Resolution reduction or low pass filtering influence the short wave-length information the most.