1 Introduction

Due to its robustness, the Most Frequent Value (MFV) method (Steiner 1991) is well suited to processing noisy datasets in geophysical (Dobróka et al. 1991; Szabó and Balogh 2016; Szabó and Balogh 2018) and hydrogeological research (Szűcs and Zákányi 2007) and in various other fields (Zhang 2017). Such data systems, especially those with extreme noise, are also present in the field of spatial informatics, including digital elevation modelling and satellite transmission. A similarly widely used, but less robust and sophisticated technique is median filtering (Stone 1995; Huang et al. 1979).

In this paper, a new median filtering method, improved by histogram operations and weighted averaging, is presented and compared to the original median filter and the MFV-based procedure. A modified version of the method, which aims to eliminate zero-mean noise, is also presented; it contains Steiner's MFV filter as a core component.

Both versions of the proposed method aim to eliminate scattered noise from digital elevation models in a moving-window manner (i.e. the procedure corrects the central element of the current window). The study was conducted with noise applied to four different percentages of the data points.

2 Input dataset

The analysed data consisted of three digital elevation models of different areas with 25 m spatial resolution, created using Topo to Raster interpolation in ArcGIS by digitizing the contour lines, elevation points and water network of 1:10,000 scale EOTR map sheets. The histograms of the three datasets are shown in Fig. 1. The first dataset has a mean of 211.99 and a standard deviation of 8.440; the second has a mean of 191.31 and a standard deviation of 8.752; the third has a mean of 96.39 and a standard deviation of 0.457.

Fig. 1 Histograms of input data matrices

For each of the resulting digital elevation models, normally distributed noise was first added to the data matrices, with the standard deviation chosen so that the average noise amplitude is around 1% of the mean of the data matrix. After this, additional outlier, non-zero mean impulse noise was added randomly to 10, 15, 20 and 25 percent of the points. To achieve this, a normally distributed noise vector is generated for every row, with its mean equal to the mean of the given data row and its standard deviation chosen so that the average noise amplitude is around 100% of the mean of the data row.

Then the elements of the noise vector generated for a given row were randomly scattered across the row with a multiplier between 0.1 and 0.7, giving an additional ~ 10–70% noise to the data in the different test cases (referred to as 0.1–0.7 noise amplitude below).
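
As an illustration, the following minimal NumPy sketch reproduces this noise-generation scheme under our reading of the description; the function name, the seed and the exact standard deviation scaling are our assumptions, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(42)  # illustrative seed

def add_outlier_noise(dem, outlier_fraction=0.15, amplitude=0.3):
    """Background ~1% Gaussian noise plus scattered non-zero mean impulse noise."""
    noisy = dem.astype(float)
    # Background noise: zero mean, std set so the typical amplitude is around
    # 1% of the matrix mean (an approximation of the paper's wording).
    noisy += rng.normal(0.0, 0.01 * dem.mean(), size=dem.shape)
    for i in range(dem.shape[0]):
        mu = dem[i].mean()
        n_out = int(round(outlier_fraction * dem.shape[1]))
        cols = rng.choice(dem.shape[1], size=n_out, replace=False)
        # Impulse noise: mean and std tied to the row mean, scaled by the
        # 0.1-0.7 amplitude multiplier and scattered randomly across the row.
        noisy[i, cols] += amplitude * rng.normal(mu, mu, size=n_out)
    return noisy
```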

3 Introduction of the weighted median (WM) method

The method produces the corrected value of the central element at each position of a moving window passing through the image matrices, using a weighted mean. The version presented below is intended mainly for eliminating non-zero mean noise (caused, for example, by measurement device problems or long-distance data transfer).

The weighted mean is calculated for each data point with two weights (\(w_{1}\), \(w_{2}\)) defined below. As an initial step, two independent window-narrowing processes are carried out before calculating the weights. These narrowed windows are created from the current (5 × 5) data window at every window position. Window sizes of 9 (3 × 3) and 49 (7 × 7) elements were also tested, but they did not prove optimal for the problem.

The first window narrowing proceeds as follows.

The range of the values of the elements in the moving window is divided into two and then into three bins of equal width, and two ratios are generated:

  • \(\lambda_{1}\): the ratio of the element counts of the larger and the smaller bin of the two,

  • \(\lambda_{2}\): the ratio of the element counts of the largest and second largest bin of the three.

If \(\lambda_{1}>\lambda_{2}\), the new set (\(D\)) is defined as the bin with the largest element count of the two bins, otherwise as the bin with the largest element count of the three bins. Then the value of \(m_{s}\) is defined as

\(m_{s} = {\text{median}}\left( {\text{D}} \right)\).

Thus, a higher \(\lambda\) value is used as an indication of a sharper cut. The numbers 2 and 3 were chosen as bin counts because with a split into 4 (or more) bins, a bin might not contain a sufficient number of elements from the initial 5 × 5 window for the further steps described below.
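
A minimal NumPy sketch of this first narrowing step, under our reading of the description; the function name is ours, values on shared bin edges are handled approximately, and the guard against empty bins is our addition.

```python
import numpy as np

def narrow_ms(window):
    """Histogram-based narrowing of a flattened 5x5 window; returns m_s."""
    v = np.asarray(window, dtype=float).ravel()
    counts2, edges2 = np.histogram(v, bins=2)
    counts3, edges3 = np.histogram(v, bins=3)
    c2, c3 = np.sort(counts2)[::-1], np.sort(counts3)[::-1]
    lam1 = c2[0] / max(c2[1], 1)   # larger / smaller count of the two bins
    lam2 = c3[0] / max(c3[1], 1)   # largest / second largest count of the three
    counts, edges = (counts2, edges2) if lam1 > lam2 else (counts3, edges3)
    k = np.argmax(counts)          # bin with the largest element count
    D = v[(v >= edges[k]) & (v <= edges[k + 1])]
    return np.median(D)
```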

Then another, independent window-narrowing process is carried out to determine the value of parameter \(m_{e1}\). For this, the elements of the original moving window are sorted by value and divided into two and then into three ranges of equal width (based on the set of values).

For example, in the case of splitting into three, if the ordered vector is \({\varvec{v}}\), \(\mathrm{max}\left({\varvec{v}}\right)\) is its highest element and \(\mathrm{min}\left({\varvec{v}}\right)\) the lowest, then the third with the lowest values will contain the values lower than \(\mathrm{min}\left({\varvec{v}}\right)+(\mathrm{max}\left({\varvec{v}}\right)-\mathrm{min}\left({\varvec{v}}\right))/3\).

The ratio \(\lambda_{3}\) is calculated from the two-bin split: its value is \(1/n\), where \(n\) is the element count of the two-bin split excluding the highest-valued bin (i.e. the count of the lower-valued bin). The ratio \(\lambda_{4}\) is then calculated from the three-bin split: its value is \(1/m\), where \(m\) is the total element count of the three-bin split excluding the highest-valued bin.

If \(\lambda_{3}>\lambda_{4}\), the highest-valued bin of the two-bin split is taken as the chosen set (\(E\)); otherwise the highest-valued bin of the three-bin split is taken. Finally, \(m_{e1}\) is the average of the elements of the chosen set.

A similar value, \(m_{e2}\), is determined with the same method, but by splitting the original window into 3 and 5 parts (instead of 2 and 3). With this step, a higher division number is reflected in the result if the current window's value set allows it (i.e. if the element counts of the new intervals are not zero).
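
The second narrowing can be sketched in the same spirit; one function covers both \(m_{e1}\) (2/3 splits) and \(m_{e2}\) (3/5 splits). The function name and the zero-count guard are ours.

```python
import numpy as np

def narrow_me(window, splits=(2, 3)):
    """Returns the mean of the highest-valued bin of the sharper split."""
    v = np.asarray(window, dtype=float).ravel()
    tops, lams = [], []
    for b in splits:
        counts, edges = np.histogram(v, bins=b)
        below_top = counts[:-1].sum()            # bins other than the highest-valued one
        lams.append(1.0 / max(below_top, 1))     # lambda_3 resp. lambda_4
        tops.append(v[v >= edges[-2]])           # elements of the highest-valued bin
    E = tops[0] if lams[0] > lams[1] else tops[1]
    return E.mean()

# m_e1 = narrow_me(w, (2, 3));  m_e2 = narrow_me(w, (3, 5))
```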

As \(m_{e1}\) and \(m_{e2}\) are both calculated on subsets chosen from the largest values of the window's value set, both are related to maximums. An important difference between \(m_{e1}\), \(m_{e2}\) and \(m_{s}\) is that for \(m_{e1}\) and \(m_{e2}\) the mean is calculated, while for \(m_{s}\) the median is calculated, with different bin numbers. In addition, for \(m_{e1}\) and \(m_{e2}\) the narrowed set contains the largest-valued elements, while for \(m_{s}\) it contains the largest number of elements (which are not necessarily the largest values).

Now we can calculate the first weight (\({w}_{1}\)) of the current point’s weight vector as

$$w_{1} = (m_{s} /m_{e1} )*\alpha,$$
(1)

where \(\alpha\) is a scaling factor ensuring that the value of \(w_{1}\) falls within the same range as \(w_{2}\), described below. Its value (\(\alpha = 1/3\)) was determined experimentally for this purpose.

We can also define \(m_{w }\) as the median of the values in the original moving window.

Using the values described above, three sub-weights are produced as follows (all of which play a role in determining the weight \(w_{2}\)).

$$w_{a} = \frac{1}{{max_{1} }}*\left( {m_{e1} - max_{1} } \right) + 1,$$
(2)

where \(max_{1}\): \({\text{max}}\left( {m_{w } ,m_{e1} } \right)\),

$$m_{as} = \left| {m_{e1} - m_{e2} } \right|,$$
(3)
$$w_{p} = \frac{\beta }{{max_{2} }}*\left( {m_{as} - max_{2} } \right) + 1,$$
(4)

where \(max_{2} :\max \left( {m_{w } ,m_{e1} ,m_{e2} } \right),\)

$$w_{p2} = \frac{\gamma }{{max_{3} }}*\left( {m_{as} - max_{3} } \right) + 1,$$
(5)

where \(max_{3} :\max \left( {m_{w } ,mean\left( {window} \right)} \right).\)

The max value applied in a given sub-weight (\(w_{a}\), \(w_{p}\), \(w_{p2}\)) always includes the median of the window, as well as the mean (\(m_{e1}\), \(m_{e2}\)) or median (\(m_{s}\)) value of the narrowed window.

In Eqs. 4 and 5, fewer elements are omitted from the original window (because they contain \(m_{e2}\) via \(m_{as}\)), so the sub-weights \(w_{p}\) and \(w_{p2}\) are both calculated with a smaller multiplier than in Eq. 2. The values of the constants \(\beta\) and \(\gamma\) are chosen as 0.5 (the adjustment procedure's results can be seen in Sect. 6).

Both \(w_{p}\) and \(w_{p2}\) have a corrective role. Their values can be high if there is a large difference between the averages of the subsets obtained by splitting the window into 3 and 5 parts (\(m_{e1}\) and \(m_{e2}\)). As can be seen in Fig. 2, a large difference between the values of \(m_{e1}\) and \(m_{e2}\) results in a large \(L_{1}\) norm error. Thus, a large difference indicates that the histogram operations at the current window position may distort the result, so an increasing difference increases the weight \(w_{2}\) (i.e. that of the conventional median without histogram operations).

Fig. 2 \(L_{1}\) norm and \(m_{as}\) value relation

The sub-weight \(w_{a}\) takes a value different from 1 if the median of the original, unnarrowed window (\(m_{w}\)) is greater than the average of the elements of the narrowed window (\(m_{e1}\)). Because the narrowed window contains the largest values of the subsets, an outstanding difference between the median of the original window and the mean of this narrowed window indicates that the histogram operations at the current window position are distorting the result. As before, a large difference between \(m_{e1}\) and \(max_{1}\) results in a large \(L_{1}\) norm error.

Therefore, this should be reflected in the final weight vector \(\mathbf{w}\) either as a reduction of \(w_{1}\) or as an increase of \(w_{2}\) (i.e. an increase in the weight of the traditional median result). The latter is achieved by using \(w_{a}\) in the weight \(w_{2}\). Since \(w_{a}\) is the most important of the three correction factors (\(w_{a}\), \(w_{b}\), \(w_{c}\)), its square is included in the formula of \(w_{2}\). Squaring achieves the stronger effect safely: the maximum value of the weight before adding one is 1, so squaring will not produce an extreme weight value even at the maximum.

The + 1 in the formulas of the partial weights \(w_{p}\) and \(w_{p2}\) is included because both contain a subtraction of a maximum value, which in most cases results in a negative value, so the constant shifts them into the positive range. In the formula of \(w_{a}\), the role of adding 1 is to shift its minimum into the positive range (so that its squared value can scale the weight \(w_{2}\)).

Finally, the following two weights (\(w_{b}\) and \(w_{c}\)) are produced using \(w_{p}\) and \(w_{p2}\), respectively.

$$w_{b} = 1 + \frac{{w_{p} }}{2},$$
(6)

Since the maximum value of the partial weight \(w_{c}\) is not a function of the different narrowed windows, but of the median or average value of the original window, this partial weight is taken with a smaller constant:

$$w_{c} = 0.5 - \frac{{w_{p2} }}{2}.$$
(7)

With the components defined above, the weight \(w_{2}\) takes the following form

$$w_{2} = w_{a}^{2} *w_{b} *w_{c}.$$
(8)

At this point, we know the weight vector \({\mathbf{w}}\) of the current data point

$${\mathbf{w}} = \left[ { w_{1} w_{2} } \right].$$
(9)

In the weight vector \({\mathbf{w}}\) (Eq. 9), the weights act on the median of the current data window (\(m_{w}\)) and on the median of the reduced set of the same window (\(m_{s}\)): \(w_{1}\) weights the latter and \(w_{2}\) the former, as follows (for example, for the k-th element of the data matrix):

$$res_{WMk} = (w_{1} *m_{s} + w_{2} *m_{w} )/(w_{1} + w_{2} ).$$
(10)

As described above, the median of the narrowed window (\(m_{s}\)) and the original window's median (\(m_{w}\)) are weighted at every window position to produce the final result for the current point. The weight of \(m_{s}\) is \((m_{s}/m_{e1})*\alpha\) (i.e. the median divided by the average of the maximal values of the narrowed window). If this ratio is low for a given window position due to noise (a high average of maximums \(m_{e1}\)), then the weight of \(m_{s}\) should be proportionally low, because otherwise the high values of the outlier maximums would degrade the final result. In such cases the weight of \(m_{w}\) will be proportionally high, not only because of the low weight of \(m_{s}\), but because \(m_{w}\) is weighted by \(w_{a}\), \(w_{b}\) and \(w_{c}\), all of which contain the \(m_{e1}\) or \(m_{e2}\) values.
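
The full weighting of Eqs. (1)–(10) can then be assembled schematically as follows, reusing the narrow_ms and narrow_me helpers sketched above. The constant values follow Sects. 3 and 6; everything else is our interpretation, not the authors' code.

```python
import numpy as np

ALPHA, BETA, GAMMA = 1.0 / 3.0, 0.5, 0.5

def wm_correct(window):
    """Corrected value of the central element for one 5x5 window position."""
    v = np.asarray(window, dtype=float).ravel()
    m_w = np.median(v)
    m_s = narrow_ms(v)                # first narrowing (median)
    m_e1 = narrow_me(v, (2, 3))       # second narrowing (mean of maxima)
    m_e2 = narrow_me(v, (3, 5))
    w1 = (m_s / m_e1) * ALPHA                          # Eq. (1)
    max1 = max(m_w, m_e1)
    w_a = (m_e1 - max1) / max1 + 1.0                   # Eq. (2)
    m_as = abs(m_e1 - m_e2)                            # Eq. (3)
    max2 = max(m_w, m_e1, m_e2)
    w_p = BETA / max2 * (m_as - max2) + 1.0            # Eq. (4)
    max3 = max(m_w, v.mean())
    w_p2 = GAMMA / max3 * (m_as - max3) + 1.0          # Eq. (5)
    w_b = 1.0 + w_p / 2.0                              # Eq. (6)
    w_c = 0.5 - w_p2 / 2.0                             # Eq. (7)
    w2 = w_a**2 * w_b * w_c                            # Eq. (8)
    return (w1 * m_s + w2 * m_w) / (w1 + w2)           # Eq. (10)
```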

4 Most Frequent Value Method

A much more reliable statistical characteristic than the arithmetic mean is the weighted mean, obtained by assigning a small weight (\(w_{k}\)) to data points (\(X_{k}\)) far from the majority of the data and a larger weight to points in the region of highest data density (Eq. 11).

$$M = \mathop \sum \limits_{k = 1}^{N} X_{k} w_{k} \left[ {\mathop \sum \limits_{k = 1}^{N} w_{k} } \right]^{ - 1} (k = 1,2, \ldots ,N ).$$
(11)

The k-th weight is chosen by Steiner (1991) as follows:

$$w_{k} = \varepsilon^{2} /\left[ {\varepsilon^{2} + \left( {X_{k} - M} \right)^{2} } \right].$$
(12)

In the above, \(N\) is the number of data and \(\varepsilon\) is the dihesion, a scalar parameter. If \(\varepsilon\) is large, all data are given nearly equal weights and outliers spoil the estimate; if \(\varepsilon\) is too small, some of the data may effectively be ignored.

The weighted mean defined by Eq. (11), called the most frequent value (\(M\)), should be known in advance in order to assign weights with maximum values at its location and smaller and smaller weights away from it. Therefore, the procedure requires an iterative algorithm in which \(M\) and \(\varepsilon\) are determined jointly. In the first iteration step, the dihesion can be estimated from the sample using the following formula:

$$\varepsilon_{1} = \sqrt{\frac{3}{2}} \left[ {\max \left( {X_{k} } \right) - \min \left( {X_{k} } \right)} \right],$$
(13)

while the initial value \(M_{1}\) is preferably chosen as the sample mean or median. In this study, the median was used.

In subsequent iteration steps, \(M\) and ε can be derived from each other according to the following procedure:

$$\varepsilon_{j + 1}^{2} = \frac{{3\mathop \sum \nolimits_{k = 1}^{N} \frac{{\left( {X_{k} - M_{j} } \right)^{2} }}{{\left[ {\varepsilon_{j}^{2} + (X_{k} - M_{j} )^{2} } \right]^{2} }}}}{{\mathop \sum \nolimits_{k = 1}^{N} \frac{1}{{\left[ {\varepsilon_{j}^{2} + (X_{k} - M_{j} )^{2} } \right]^{2} }}}} \leftrightarrow M_{j + 1} = \frac{{\mathop \sum \nolimits_{k = 1}^{N} \frac{{\varepsilon_{j + 1}^{2} }}{{\varepsilon_{j + 1}^{2} + (X_{k} - M_{j} )^{2} }}X_{k} }}{{\mathop \sum \nolimits_{k = 1}^{N} \frac{{\varepsilon_{j + 1}^{2} }}{{\varepsilon_{j + 1}^{2} + (X_{k} - M_{j} )^{2} }}}}.$$
(14)
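
A minimal sketch of the iteration of Eqs. (11)–(14); the fixed iteration count and the median start follow Sects. 4 and 7, while the function name and the stopping rule are our choices.

```python
import numpy as np

def mfv(x, n_iter=20):
    """Most Frequent Value of a sample by joint iteration of M and epsilon."""
    x = np.asarray(x, dtype=float).ravel()
    eps = np.sqrt(3.0 / 2.0) * (x.max() - x.min())   # Eq. (13), initial dihesion
    M = np.median(x)                                 # initial M_1 (median, per the paper)
    for _ in range(n_iter):
        r2 = (x - M) ** 2
        d = (eps**2 + r2) ** 2
        eps = np.sqrt(3.0 * np.sum(r2 / d) / np.sum(1.0 / d))  # Eq. (14), left
        w = eps**2 / (eps**2 + r2)                   # Eq. (12)
        M = np.sum(w * x) / np.sum(w)                # Eqs. (11)/(14), right
    return M
```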

5 Description of the resulting numerical values

The following metrics are used in the paper to compare the results of the different filtering methods. The RMSE (Root Mean Square Error) values are calculated for the three procedures as follows (where \(inp\) is the noise-free data matrix):

$$RMSE_{St} = \sqrt {\frac{{\sum\limits_{i = 1}^{N} {(res_{St{i}} - inp_{i} )^{2} } }}{N}},$$
(15)

where \(res_{St}\): matrix corrected by Steiner’s MFV,

$$RMSE_{Med} = \sqrt {\frac{{\sum\limits_{i = 1}^{N} {(res_{Med{i}} - inp_{i} )^{2} } }}{N}},$$
(16)

where \(res_{Med}\): matrix corrected by median method,

$$RMSE_{WM} = \sqrt {\frac{{\sum\limits_{i = 1}^{N} {(res_{WM{i}} - inp_{i} )^{2} } }}{N}},$$
(17)

where \(res_{WM}\): matrix corrected by weighted median.

Standard deviation of the error for the three procedures:

$$Std_{St} = \overline{{std\left( {res_{St} - inp{ }} \right)}},$$
(18)
$$Std_{Med} = \overline{{std\left( {res_{Med} - inp{ }} \right)}},$$
(19)
$$Std_{WM} = \overline{{std\left( {res_{WM} - inp{ }} \right)}},$$
(20)

where \(res_{St}\), \(res_{Med}\), \(res_{WM}\), \(inp\): as above.

\(L_{1}\) norm:

$$L_{1St} = \parallel res_{St} - inp\parallel _{1},$$
(21)
$$L_{1Med} = \parallel res_{Med} - inp \parallel _{1},$$
(22)
$$L_{1WM} =\parallel res_{WM} - inp \parallel _{1},$$
(23)

where \(res_{St}\), \(res_{Med}\), \(res_{WM} ,\) inp: as above.
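
For completeness, the three metrics in NumPy form; interpreting the overline in Eqs. (18)–(20) as the mean of column-wise standard deviations (MATLAB-style std of a matrix) and the \(L_{1}\) norm as an entrywise sum are our assumptions.

```python
import numpy as np

def rmse(res, inp):
    return np.sqrt(np.mean((res - inp) ** 2))     # Eqs. (15)-(17)

def std_metric(res, inp):
    return np.std(res - inp, axis=0).mean()       # Eqs. (18)-(20)

def l1_dist(res, inp):
    return np.abs(res - inp).sum()                # Eqs. (21)-(23)
```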

6 Adjusting constants

Regarding the 0.5 constant in \(w_{p}\), it was tested in some randomly chosen test cases how the \(L_{1}\) norm distance between the noise-free matrix and the weighted-median-corrected matrix changes for different values of the constant. The value of the norm decreased monotonically in every such case, as shown in the example of Table 1 (10% noise rate, 0.3 noise amplitude):

Table 1 Example of adjusting constant value used for calculating wp

Similarly, the 0.5 constant of \(w_{p2}\) was also tested and produced similar results, as shown in the example of Table 2 (with the same parameters as in the previous case):

Table 2 Example of adjusting constant used for calculating \(w_{p2}\)

All constants were examined with a few chosen values; however, their global optimisation is beyond the scope of this paper.

7 Comparative results

In all cases, a 5 × 5 window was moved through the matrices, and both the Steiner MFV method (used as a filter; Dobróka 2021) and the weighted median method described here corrected the value of the window's central element, with all elements of the window as input.

In all cases, the number of iterations of the Steiner filter was 20 and the initial value (\(M_{1}\)) was the median of the window elements.

In all cases, the results were also compared with those obtained using the classical median filter in MATLAB. The median filter was run on the same noisy data matrices with the same window size as the Steiner and weighted median methods.

Table 3 shows the values of the result metrics on the first data series, with noise on 15% of the data points and a noise amplitude multiplier of 0.3.

Table 3 Example of comparative results (10% additional outlier noise on 15% of the data points)

Another example can be seen in Fig. 3, again with noise on 15% of the data points, now with a noise amplitude multiplier of 0.5. Figure 3a shows the original data, (b) the noisy data, (c) the result of the Steiner method, (d) the result of the weighted median method and (e) the result of the classical median method.

Fig. 3 Visual example of results on the first dataset

Table 4 shows the \(L_{1}\) norm distances from the noise-free input matrix and their ratios for the weighted median method (\(L_{1WM}\)) and the Steiner method (\(L_{1St}\)), with 25% of the points contaminated with noise, as a function of the different noise amplitudes (0.1, …, 0.7) on the first data set. The values show that in two cases the Steiner method gives better results, by about 6%, while in the other cases the weighted median method proved better, by 6.3% on average (since the average of the \(L_{1WM}/L_{1St}\) ratios is 0.937).

Table 4 L1 norm values at 25% noise ratio

Table 5 shows the results of the weighted median procedure and the standard median filtering at the same noise level. In this case, the weighted median procedure proved to be better on the data set by 26.4% on average.

Table 5 L1 norm values at 25% noise ratio

Tables 6 and 7 show the same comparison for the \(L_{1}\) norm with noise affecting 20% of the points. In this case, there are noise amplitude values where the Steiner method gives a smaller distance to the noise-free matrix than the weighted median procedure, but in none of the cases could the standard median procedure achieve this. The weighted method is on average 23% better than the latter.

Table 6 L1 norm values at 20% noise ratio
Table 7 L1 norm values at 20% noise ratio

Tables 8 and 9 show the case with 15% noisy points.

Table 8 L1 norm values at 15% noise ratio
Table 9 L1 norm values at 15% noise ratio

The results of Table 8, comparing the Steiner method with the weighted median procedure, show that for a noise amplitude multiplier of 0.1 the Steiner method is superior.

The weighted median procedure outperforms the unweighted median procedure by an average of 21.2% on the first data set for 15% noisy points (Table 9). Tables 10 and 11 show the data distances of the three procedures for the case where 10% of data is contaminated by noise according to the \({L}_{1}\) norm.

Table 10 L1 norm values at 10% noise ratio
Table 11 L1 norm values at 10% noise ratio

In this case (with noise on 10% of the points), the Steiner method outperforms the weighted median method at most of the noise amplitudes (in 4 of the 7 cases, by 5.6% on average), while in the remaining three cases the WM method is better, by 13%.

Table 12 shows the RMSE values obtained by the three procedures and their ratios, for a noise exposure of 25% on the left and 20% on the right, for different noise amplitude multipliers (0.1,…,0.7) in both cases.

Table 12 RMSE values at 25% and 20% noise ratio

Table 13 shows the RMSE values as before, now for noise at 15% and 10% of the data points.

Table 13 RMSE values at 15% and 10% noise ratio

It can be seen that the weighted median method performs worse at the highest noise amplitude multiplier (0.7), but this also holds for the other methods, so the ratio to them does not deteriorate. The RMSE of the MFV method is closest to that of the weighted median method at the smallest noise amplitude multiplier (0.1). The RMSE of the weighted median method is on average 79.5% of that of the conventional median method.

As the weighted median procedure was found to be least efficient when 10% of the data points were contaminated with noise in the above tests, the standard deviations were also examined for this case. An example, compared with the standard median method, is shown in Table 14.

Table 14 Standard deviation values at 10% noise regarding the two median methods

Table 15 shows the averaged results for the first and second data sets: the minimum, maximum and average of the \(L_{1}\) norm data distance ratios as a function of the different noise levels, for the Steiner method and the weighted median method. The mean of the minima (i.e. of the cases where the largest difference favours the weighted median method) is 0.82, i.e. in these cases the method is 18% better. The average of the maxima is 1.032, i.e. in the opposite cases the Steiner method is on average 3.2% better for the two data sets combined.

Table 15 L1 norm ratios for two data sets combined

Table 16 shows the \(L_{1}\) norm results grouped in the same way, here for the two median procedures. The average of the minima is 0.692, so in the cases where the weighted median procedure is best, it is better by more than 30%. The average of the maxima is 0.86, so even in the worst cases the weighted median procedure is on average 14% better than standard median filtering for the two data sets combined.

Table 16 L1 norm ratios for two data sets combined

A less detailed comparison was carried out on the third data set (examining only the \(L_{1}\) norm ratios). This study showed characteristics similar to the previous ones; however, the Steiner method was found to be the best of the three in more cases than before. For 10% and 15% noisy data points (both with 7 different noise amplitudes, as before), the Steiner method was superior to the weighted median method in 11 of 14 cases, by 14.6% on average (regarding the \(L_{1}\) norm ratios). In the 20% and 25% cases, the weighted median method gave better results in 14 of 14 cases, by 19.04% on average.

Comparing the two median methods on this data set (again with \(L_{1}\) norm ratios), the weighted median method proved better in 27 of 28 cases, by 14.32% on average.

8 Handling zero mean noises

The previously presented version of the proposed method was developed mainly for non-zero mean noise. In the following, a second, modified version of the method is introduced, whose purpose is mainly the handling of zero-mean, normally distributed noise. This version calls upon Steiner's MFV values to correct the central element of the given data window.

8.1 Noise generation

In the first step of the noise generation process, general zero-mean noise was added to the data matrix. To achieve this, normally distributed noise was generated for every row of data, with zero mean and standard deviation 1.

To add outlier noise to a given percentage of points for the examination, additional zero-mean, normally distributed noise was added randomly to 20, 15, 10 and 5 percent of the points, with a 0.1–0.7 amplitude multiplier in all such cases (as in the previously introduced version of the method). The noise's standard deviation was always the mean of the current data row.
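
A minimal sketch of this zero-mean noise generation, under the same assumptions as before (names and seed are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)  # illustrative seed

def add_zero_mean_noise(dem, outlier_fraction=0.10, amplitude=0.3):
    noisy = dem.astype(float) + rng.normal(0.0, 1.0, size=dem.shape)  # N(0, 1) background
    for i in range(dem.shape[0]):
        n_out = int(round(outlier_fraction * dem.shape[1]))
        cols = rng.choice(dem.shape[1], size=n_out, replace=False)
        # Outlier noise: zero mean, std equal to the row mean, scaled by the multiplier.
        noisy[i, cols] += amplitude * rng.normal(0.0, dem[i].mean(), size=n_out)
    return noisy
```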

8.2 Modified version of the weighted median method

As with the method’s previously introduced version, the first steps are histogram filterings.

First a median and then a mean value is generated from filtered windows, here with a two-step filtering for both. For producing the median value \(m_{s}\), the histogram-based filtering is the following.

Based on the set of values, the elements of the current data window are divided into two and three ranges (bins) with equal range widths, and then two ratios are generated:

  • \(\lambda_{1}\): the ratio of the largest and second largest domains out of 2,

  • \(\lambda_{2}\): ratio of the largest and second largest domains out of 3 (in both cases regarding the element count).

If \(\lambda_{1} > \lambda_{2}\), the new set (\({\text{D}}\)) will be the bin with the largest element count of the two bins, otherwise the bin with the largest element count of the three bins. Finally, the value of \(m_{s}\) is \(median\left( D \right)\).

To determine the value of \(m_{e3}\), a second narrowing process is performed. The window elements are first sorted by value and then divided into three equal-width ranges based on the set of values. Here the ratio \(\lambda_{3}\) is calculated as the ratio between the element count of the whole window and the total element count of the thirds other than the highest-valued third. Then \(\lambda_{4}\) is calculated as the ratio between the element counts of the bins with the largest and second largest element counts. Thus, \(\lambda_{3}\) and \(\lambda_{4}\) are calculated differently than in the previous version of the method.

If \(\lambda_{3} > \lambda_{4}\), the highest-valued third (the bin with the highest values) is taken as the chosen set (E); otherwise the bin with the largest element count is taken. Again, a higher \(\lambda\) value is used as an indication of a sharper cut.
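
As we read it, the modified selection of E can be sketched as follows (the function name is ours; values on shared bin edges are handled approximately, as before):

```python
import numpy as np

def choose_E(window):
    """Modified lambda_3 / lambda_4 selection of the truncated set E."""
    v = np.asarray(window, dtype=float).ravel()
    counts, edges = np.histogram(v, bins=3)
    lam3 = v.size / max(counts[:-1].sum(), 1)  # window count / count below the top third
    c = np.sort(counts)[::-1]
    lam4 = c[0] / max(c[1], 1)                 # largest / second largest bin count
    if lam3 > lam4:
        return v[v >= edges[-2]]               # the highest-valued third
    k = np.argmax(counts)                      # otherwise: the most populated bin
    return v[(v >= edges[k]) & (v <= edges[k + 1])]
```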

We take this truncated set E and distribute its values into bins. The bin width is determined with Scott's rule (Scott 1979, 1992):

$$3.5*std\left( E \right)/numel\left( E \right)^{1/3}.$$
(24)

We must also determine the number of bins in order to distribute all the values into them (a trivial step, since the bin width and the data values are known). We take the bin with the largest element count, and \(m_{e3}\) is the average of the elements of that bin.
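
The Scott's-rule binning that produces \(m_{e3}\) might then look like this (the guard for a degenerate E with zero spread is our addition):

```python
import numpy as np

def me3(E):
    """Mean of the fullest Scott's-rule bin of the truncated set E (Eq. 24)."""
    E = np.asarray(E, dtype=float).ravel()
    h = 3.5 * E.std() / E.size ** (1.0 / 3.0)   # Scott's rule bin width
    if h == 0.0:                                # all values equal: nothing to bin
        return E.mean()
    n_bins = max(int(np.ceil((E.max() - E.min()) / h)), 1)
    counts, edges = np.histogram(E, bins=n_bins)
    k = np.argmax(counts)                       # bin with the largest element count
    return E[(E >= edges[k]) & (E <= edges[k + 1])].mean()
```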

Having the values of \(m_{s}\) and \(m_{e3}\), we can replace the current window's middle element with \(m_{s}\) (forming \(w_{r\_ms}\)) and, similarly, with \(m_{e3}\) (forming \(w_{r\_me}\)). Finally, let \(w_{r\_St}\) be the window with the MFV method's result at its centre.

In the next step, we concatenate \(w_{r\_ms}\), \(w_{r\_me}\) and \(w_{r\_St}\) one by one with the original (noisy) current window, forming \(w_{u1}\), \(w_{u2}\) and \(w_{u3}\).

Now we can calculate three gradient measures in the following way:

$$\mathbf{G}\left( {x,y} \right) = \sqrt {\left(\frac{\partial w_{u}}{\partial x}\right)^{2} + \left(\frac{\partial w_{u}}{\partial y}\right)^{2}},$$
(25)
$$g = \frac{1}{\left| X \right|*\left| Y \right|}\mathop \sum \limits_{x \in X} \mathop \sum \limits_{y \in Y} G\left( {x,y} \right).$$
(26)

In the formula, \(g_{1}\) is the value of \(g\) when using \(w_{u3}\) (the window containing the MFV result), \(g_{2}\) when using \(w_{u1}\) (containing \(m_{s}\)), and \(g_{3}\) when using \(w_{u2}\) (containing \(m_{e3}\)).

If \(\min(g_{1}, g_{2}, g_{3})\) is \(g_{1}\), then in the current weight vector \({\mathbf{w}} = \left[ { w_{1} w_{2} } \right]\) the value of \(w_{1}\) is 0 and that of \(w_{2}\) is 1; in this case only the MFV method's result counts in the correction of the given data window. If the minimum is \(g_{2}\), then \(w_{1}\) is 0.15 and \(w_{2}\) is 0.85. If the minimum is \(g_{3}\), then \(w_{1}\) is 0.4 and \(w_{2}\) is 0.6, i.e. the weight of the MFV method's result is 0.6 for the given window.

As can be seen, in all of these cases we weight the Steiner MFV method's result, increasing or decreasing its weight in the correction of the current data window's central element.

As in the previous version of the method, we get poor results if the value of \(m_{as}\), i.e. \(\left| {m_{e1} - m_{e2} } \right|\), is large. To handle this, both \(m_{e1}\) and \(m_{e2}\) are calculated here as well (in the same way as in the previous version), and if the difference is greater than 2% of the mean of the raw data, then \(w_{1}\) is set to 0 and \(w_{2}\) to 1.
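
Putting the pieces of Sect. 8.2 together, the gradient-based weight selection of Eqs. (25)–(26), including the \(m_{as}\) override, can be sketched as below. np.gradient stands in for the partial derivatives, and the concatenation axis and function names are our assumptions.

```python
import numpy as np

def mean_gradient(w_u):
    gy, gx = np.gradient(w_u)                # Eq. (25): partial derivatives
    return np.sqrt(gx**2 + gy**2).mean()     # Eq. (26): averaged gradient magnitude

def select_weights(window, m_st, m_s, m_e3, m_e1, m_e2, raw_mean):
    """Returns (w1, w2); w2 weights the MFV result, per Sect. 8.2."""
    if abs(m_e1 - m_e2) > 0.02 * raw_mean:   # m_as override described above
        return 0.0, 1.0
    g = []
    for val in (m_st, m_s, m_e3):            # candidates giving g1, g2, g3
        w_r = window.copy()
        w_r[2, 2] = val                      # replace the centre of the 5x5 window
        g.append(mean_gradient(np.concatenate([w_r, window], axis=0)))
    return [(0.0, 1.0), (0.15, 0.85), (0.4, 0.6)][int(np.argmin(g))]
```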

8.3 Results of the modified version of WM filtering procedure

In Table 17, \(L_{1}\) norm ratios are shown for the first data set, comparing the WM method's results with both the MFV method's and the original median method's. In the former comparison, the WM method performed better in 26 of the 28 test cases. In those cases, the average \(L_{1}\) norm ratio was 0.985, i.e. the method gave 1.4% lower \(L_{1}\) norm values on average. In the remaining two cases, the mean ratio was 1.005, i.e. the WM method performed 0.5% worse. The best \(L_{1}\) norm ratio was 0.97, i.e. the WM method gave a 3% lower \(L_{1}\) norm value in that particular case.

Table 17 L1 norm ratios on first data set

In the comparison with the original median method, the WM method performed better in all 28 cases, by 16.4% on average (0.836 average \(L_{1}\) norm ratio). Here the best result was a 29.5% improvement (0.705 \(L_{1}\) norm ratio).

Table 18 shows results in the same structure as the previous one, here for the second data set. Compared with Steiner's MFV, the WM method gave better results according to the \(L_{1}\) norm in 23 cases (by 1.25% on average), and performed worse in the remaining 5 cases (by 0.21% on average). At its best, the WM method gave a 3.12% lower \(L_{1}\) norm value.

Table 18 L1 norm ratios on second data set

In comparison with the other median method, WM performed better in all of the cases (16% avg., 29.4% max.).

Table 19 shows the third data set's \(L_{1}\) norm ratios. Comparing the WM results with the MFV method's, the former performed better in 23 of the 28 cases (1% avg., 2.4% max.), while the MFV method was superior in 5 cases (1.4% avg.).

Table 19 L1 norm ratios on third data set

In the comparison with the conventional median method, the WM method performed better in all cases, by 10.7% on average (20.2% max.).

9 Conclusions

The effectiveness of the histogram-based weighted median procedure described above has been demonstrated for noise elimination in digital elevation model data. The method's main purpose is the elimination of outlier noise in data matrices, especially when a high percentage of the matrix points is contaminated with outlier noise.

Averaged over the investigated noise amplitudes and noise exposure percentages, the WM method outperformed the standard median filtering procedure on the different data sets by 14–23% in terms of the \(L_{1}\) norm data distance when eliminating non-zero mean noise. The version of the method developed for filtering zero-mean noise performed better than the conventional median filter by 14.3% on average.

Beyond general refinement and optimisation of the method, there is room for improvement particularly in the more effective handling of low noise exposure cases.