1 Introduction

In order to interpret synthetic aperture radar (SAR) images, the statistical modeling often be employed. In fact, the multiplicative model using \( \mathcal{G} \) family of distributions often is utilized to describe the speckled data, which is characterized by three parameters: the number of looks, the scale parameter, and the roughness parameter. By this distribution, it is able to describe rough and extremely rough areas, which is better than other distributions [1, 2]. Under the \( \mathcal{G} \) model, regions with different degree of roughness can be characterized by the parameters, thus, the accuracy of the estimation of these parameters becomes very important.

Several kinds of approaches for estimating roughness parameters with different number of looks have been proposed. For example, Gambini et al. gave an analogy estimator based on moments of order 1/2 and 1 [3, 4]. Vasconcellos et al. proposed an analytic change for improving performance with respect to bias and mean-squared error, and the bias in the estimation of the roughness parameter of the \( {\mathcal{G}}_I^0 \) distribution by maximum likelihood (ML) was quantified [5]. In fact, the processing and understanding of SAR image is the problem of small samples, for instance, image filtering where with a few observations within a window a new value is computed. The parameter estimation with small samples is subjected to many problems, mainly including bias, large variance, and sensitivity to deviations from the hypothesized model. On the one hand, \( {\mathcal{G}}_I^0 \) distribution is heavy-tailed distributions. Hence, dealing with SAR data is essentially difficult because samples from the tail of the distribution will have a strong influence on parameter estimation, and bias will be introduced if we decline the weights of them [6]. Therefore, a common issue in all the aforementioned estimation procedures, including ML, and those based on fractional moments and log-cumulants [7] is the need of iterative algorithms for which there is no granted convergence to global solutions. Frery et al. [8] and Pianto and Cribari-Neto [9] proposed techniques to aim at alleviating such issue at the cost of additional computational load.

Random weighting, proposed by Zheng (1987) [10], is an emerging computational method in statistics, and has been used to solve different problems [11]. The random weighting method has following advantages: (1) it is sample in computation; (2) it does not require the previous knowledge on the distribution, and the estimation results are unbiased; (3) the estimation error of the random weighting method is smaller than that of Bootstrap in the case of small samples; (4) it is independent and identically distributed, and robust; (5) statistic determined by the random weighting method has the density function, so it is particularly suitable for the problem described in the density function. This paper is to develop a robust estimation method of combining analogy with random weighting method for the \( {\mathcal{G}}_I^0 \) model, which has the good properties of unbiased, the small mean-squared error and its ability to resist contamination. Even in small samples and low computational cost, its performance is still very robust.

The paper is organized as follows: Section II presents the random weighting estimation for parameters of \( {\mathcal{G}}_I^0 \) distribution. In Section III, we will present and discuss the main numerical results. Finally, Conclusions and future work are presented in Section IV.

Random weighting estimation for \( {\mathcal{G}}_I^0 \) distribution

1.1 The \( {\mathcal{G}}_I^0 \) model

The multiplicative statistical model is extensively used in SAR image data analyses. Depending on this model, the data are described by a random variable X, which can be viewed as the product of the independent random variables U and V, that is X = U ⋅ V, where U models the properties of the imaged area (backscatter), and V models the multiplicative noise (speckle) introduced by the use of coherent illumination, which degrades the image quality. In this manner, speckle noise (V) in intensity L-looks format follows a gamma distribution, denoted by \( {\mathcal{G}}_I^0\left(\alpha, \gamma, L\right) \), whose probability density is given by

$$ {f}_X(x)=\frac{L^L\varGamma \left( L-\alpha \right)}{\gamma^{\alpha}\varGamma (L)\varGamma \left(-\alpha \right)}\frac{x^{L-1}}{{\left(\gamma + Lx\right)}^{L-\alpha}},\kern0.5em x>0 $$
(1)

The backscatter (U) exhibits different degrees of homogeneity and can be modeled using the inverse gamma distribution, denoted by U ~ Γ − 1(α, γ), whose probability density is

$$ \gamma >0 $$
(2)

Then X = U ⋅ V follows a \( {\mathcal{G}}_I^0\left(\alpha, \gamma, L\right) \) distribution [12], whose probability density is

$$ {f}_X(x)=\frac{L^L\varGamma \left( L-\alpha \right)}{\gamma^{\alpha}\varGamma (L)\varGamma \left(-\alpha \right)}\frac{x^{L-1}}{{\left(\gamma + Lx\right)}^{L-\alpha}},\kern0.5em x>0 $$
(3)

where − α > 0 is the roughness parameter, γ > 0 is the scale parameter and L ≥ 1 is the number of looks [1].

The r-order moments are given by

$$ E\left({X}^r\right)={\left(\frac{\gamma}{L}\right)}^r\frac{\varGamma \left(-\alpha - r\right)\varGamma \left( L+ r\right)}{\varGamma \left(-\alpha \right)\varGamma (L)} $$
(4)

The \( {\mathcal{G}}_I^0 \) distribution is very attractive for modeling data with speckle noise, due to its mathematical tractability and ability to describe information from most types of areas, for given α < − 1 and L. These densities are presented in semi-logarithmic scale, showing that they have heavy (linear) tails with respect to the Gaussian distribution which displays quadratic behavior. It is noticeable that the larger values of α, the larger the variances have; in fact, the variance is not finite when α ≥ − 1.

1.2 The random weighting estimator of parameter

Let X 1, …, X n be independently and identically distributed random variables with distribution \( X\sim {\mathcal{G}}_I^0\left(\alpha, \gamma, L\right) \), α < − 1/2, γ > 0 and L known. \( {\overline{X}}_n={n}^{-1}{\displaystyle \sum_{i=1}^n{X}_i} \) and \( {\overline{X}}_n^r={n}^{-1}{\displaystyle \sum_{i=1}^n{X}_i^r} \) are the mean of samples and rth sample moment, respectively, then the random weighting estimation of the \( {\overline{X}}_n \) and \( {\overline{X}}_n^r \) can be defined as

$$ {H}_n={\displaystyle \sum_{i=1}^n{\xi}_i{X}_i} $$
(5)

and

$$ {H}_n^r={\displaystyle \sum_{i=1}^n{\xi}_i{X}_i^r} $$
(6)

where ξ 1, …, ξ n are independent and identically distributed random variable with distribution function of random variable ξ 1 = η − 2, where η is from gamma distribution function G(4, 2), i.e., the density function of η is \( \varGamma (4)=\frac{2}{3!}{\left(2 x\right)}^3 \exp \left\{-2 x\right\} I\left\{ x>0\right\} \), I(A) is the indicator function of set A, and X 1, …, X n and ξ 1, …, ξ n are mutually independent.

By formula (4), we can know that

$$ E\left({X}^{\frac{1}{2}}\right)={\left(\frac{\gamma}{L}\right)}^{\frac{1}{2}}\frac{\varGamma \left(-\alpha -\left(1/2\right)\right)\varGamma \left( L+\left(1/2\right)\right)}{\varGamma \left(-\alpha \right)\varGamma (L)},-\alpha >1/2 $$
(7)
$$ E(X)=\left(\frac{\gamma}{L}\right)\frac{\varGamma \left(-\alpha -1\right)\varGamma \left( L+1\right)}{\varGamma \left(-\alpha \right)\varGamma (L)},-\alpha >1 $$
(8)

Replacing the population moments by their sample counterparts, and the parameters by the corresponding estimators, we arrive at the following system of two equations:

$$ {\overline{X}}_n=\left(\frac{\widehat{\gamma}}{L}\right)\frac{\varGamma \left(-\widehat{\alpha}-1\right)\varGamma \left( L+1\right)}{\varGamma \left(-\widehat{\alpha}\right)\varGamma (L)},-\widehat{\alpha}>1 $$
(9)
$$ {\overline{X}}_n^{\frac{1}{2}}={\left(\frac{\widehat{\gamma}}{L}\right)}^{\frac{1}{2}}\frac{\varGamma \left(-\widehat{\alpha}-\left(1/2\right)\right)\varGamma \left( L+\left(1/2\right)\right)}{\varGamma \left(-\widehat{\alpha}\right)\varGamma (L)},-\widehat{\alpha}>1/2 $$
(10)

which lead to the following equation that can be solved numerically in order to obtain an estimator for α:

$$ \frac{{\overline{X}}_n\varGamma \left(-\widehat{\alpha}\right)\varGamma (L) L}{\varGamma \left(-\widehat{\alpha}-1\right)\varGamma \left( L+1\right)}=\frac{{\left({\overline{X}}_n^{\frac{1}{2}}\right)}^2{\varGamma}^2\left(-\widehat{\alpha}\right){\varGamma}^2(L) L}{\varGamma^2\left(-\widehat{\alpha}-1/2\right){\varGamma}^2\left( L+\left(1/2\right)\right)} $$
(11)

By formulas (5) and (6), \( {\overline{X}}_n \), \( {\overline{X}}_n^{\frac{1}{2}} \) in (11) is replaced by its random weighting estimation H n , \( {H}_n^{\frac{1}{2}} \), respectively. We have

$$ \frac{H_n\varGamma \left(-\widehat{\alpha}\right)\varGamma (L) L}{\varGamma \left(-\widehat{\alpha}-1\right)\varGamma \left( L+1\right)}=\frac{{\left({H}_n^{\frac{1}{2}}\right)}^2{\varGamma}^2\left(-\widehat{\alpha}\right){\varGamma}^2(L) L}{\varGamma^2\left(-\widehat{\alpha}-1/2\right){\varGamma}^2\left( L+\left(1/2\right)\right)} $$
(12)

Therefore, the random weighting estimator \( {\widehat{\alpha}}^{*} \) for α can be obtained via solving equation (12). By substituting the value of \( {\widehat{\alpha}}^{*} \) into formula (9), we can obtain random weighting estimator \( {\widehat{\gamma}}^{*} \).

2 Performance evaluations

In this section, the random weighting estimation of \( {\mathcal{G}}_I^0 \) distribution parameters is comprehensively evaluated on SAR computational simulations data and real SAR images.

2.1 Simulations data and analysis

The results of double bounce in SAR system is that a high return value exists in some pixels. The presence of such outliers may provoke big errors in the estimation. Since robustness is the ability to perform well when the data obey the assumed model and to not provide completely useless results when the observations do not exactly follow it, moreover, estimators in SAR signal and image processing are always used in various robustness algorithms, thus, the robustness is of highest importance.

In order to assess the behavior of the proposed estimators, computational simulations have been performed to verify random weighting (RW) estimation of parameters α and γ. Let 0 < ε ≤ 1, C ∈ R +. In these simulation studies, we consider uncontaminated data and three contaminated models, where samples {X 1, …, X n } are identically distributed random variables with one of uncontaminated and three contaminated models. The three patterns of the contamination are defined as follows:

  • $$ \mathrm{Case}\ 1:\left(1-\varepsilon \right){\mathcal{G}}_I^0\left({\alpha}_1,{\gamma}_1, L\right)+\varepsilon {\mathcal{G}}_I^0\left({\alpha}_2,{\gamma}_2, L\right) $$
  • $$ \mathrm{Case}\ 2:\left(1-\varepsilon \right){\mathcal{G}}_I^0\left({\alpha}_1,{\gamma}_1, L\right)+\varepsilon C $$
  • $$ \mathrm{Case}\ 3:\left(1-\varepsilon \right){\mathcal{G}}_I^0\left({\alpha}_1,{\gamma}_1, L\right)+\varepsilon {\mathcal{G}}_I^0\left({\alpha}_1,{10}^{\mathrm{k}}{\gamma}_1, L\right) $$

The sample lengths are 9, 25, 49, 81, 121, and 1000, respectively. The parameter α is evaluated by using different real values such as α = {−1.5, − 3, − 5}, which are used on behalf of areas with intense and moderate texture. As compared, we simulate the analogy estimator (ANA) [7], maximum likelihood (ML) [8], log-cumulant (LCum) [7] and 1/2-moment (Mom12) [1]. Figures 1, 2, 3, 4, 5, 6, 7, and 8 show a graphical comparison of the mean and variance estimation values for parameter α after testing 1000 times under uncontaminated data with cases 1, 2, and 3, respectively.

Fig. 1
figure 1

Sample mean of estimates under uncontaminated data. The black line is the true parameter

Fig. 2
figure 2

Sample mean-squared error of estimates under uncontaminated data

Fig. 3
figure 3

Sample mean of estimates, case 1 with ε = 0.01 and α 2 = − 15

Fig. 4
figure 4

Sample mean-squared error of estimates, case 1 with ε = 0.01 and α 2 = − 15

Fig. 5
figure 5

Sample mean of estimates, case 2 with c = 100 and ε = 0.001

Fig. 6
figure 6

Sample mean-squared error of estimates, case 2 with c = 100 and ε = 0.001

Fig. 7
figure 7

Sample mean of estimates, case 3 with k = 2 and ε = 0.005

Fig. 8
figure 8

Sample mean-squared error of estimates, case 3 with k = 2 and ε = 0.005

From Figs. 1 and 2, we can observe that only three methods of five estimators are very close to the true value in mean, namely, \( {\widehat{\alpha}}_{\mathrm{ML}} \), \( {\widehat{\alpha}}_{\mathrm{ANA}} \) and \( {\widehat{\alpha}}_{\mathrm{RW}} \), and the farthest estimators from the true value are \( {\widehat{\alpha}}_{LC\mathrm{um}} \) and \( {\widehat{\alpha}}_{M\mathrm{om}12} \), when α = − 5. In most cases, all methods have very similar mean-squared error in the lager sample data, thus, we cannot decide which one is the best. However, our proposed method is better than other four methods especially when L = 3 and L = 8.

Figures 3 and 4 show the influence of the perturbation in contaminated cases. As expected, it is more obvious. With the increase of the number of sample, the proposed estimation is further away from the contamination, and the mean-squared errors of \( {\widehat{\alpha}}_{\mathrm{RW}} \) is smaller than that of \( {\widehat{\alpha}}_{\mathrm{ANA}} \), \( {\widehat{\alpha}}_{\mathrm{ML}} \), \( {\widehat{\alpha}}_{LC\mathrm{um}} \), \( {\widehat{\alpha}}_{M\mathrm{om}12} \) for L = 3, 8. But all methods do not have clear distinction for L = 1 except that \( {\widehat{\alpha}}_{\mathrm{RW}} \) is at least very competitive in the case of α = − 3.

From Figs. 5 and 6, our proposed estimator is approximately close to the true mean value, and its mean-square error is smaller than that of other methods, except for L = 1 and α = − 3. \( {\widehat{\alpha}}_{\mathrm{RW}} \) produces the closest estimates to the true value with reduced mean-squared error from Figs. 7 and 8.

2.2 Real SAR image and analysis

In order to further prove the performance of our proposed method, practical experiments have been conducted to estimate α and γ parameters in \( {\mathcal{G}}_I^0\left(\alpha, \gamma, L\right) \), which is employed to model the real SAR image by random weighting. In these experiments, SAR data from a three-look intensity format SAR image are used. Figure 9 shows the regions used for estimating parameter. Tables 1 and 2 show the results of estimating α and γ parameters for each rectangular region. Figure 10 shows the histogram and the line of the estimated probability density function by each estimation method for each rectangular region. The Kolmogorov–Smirnov test (KS test) is then performed between X from the image and Y for the null hypothesis H 0 “both samples come from the same distribution”, and the complementary alternative hypothesis. Table 3 shows the sample p values. It is observed that the null hypothesis is not rejected with a significance level of 5% in any of the cases. The results justify that the model is adequate for the data, and the performance of our proposed method is better than the others.

Fig. 9
figure 9

Real SAR image and the regions used to estimate the parameters

Table 1 Estimations of the α parameter using the samples shown in Fig. 9
Table 2 Estimations of the γ parameter using the samples shown in Fig. 9
Fig. 10
figure 10

The histogram and the line of the estimated probability density function by each estimation method for blue and red rectangular regions. a For blue rectangular region. b For red rectangular region

Table 3 Sample p values of the K–S test with samples from the image in Fig. 9

3 Conclusions

A new estimator for the roughness parameter of the c = 100 distribution based on the random weighing method is proposed. Moreover, three models of contamination inspired in real situations are defined to assess the impact of outliers in the performance of the estimators. By the experiments and analysis under the contamination, we can observe that: (1) do not consider the intensity of the contamination, the bigger the number of looks, the smaller the percentage of no convergence. (2) Under the contamination cases, the convergence with the increase of the level of contamination or with the reduction of α. In conclusion, our proposed method is much closer to the real mean value, and its mean-squared error is lower than that of other methods in the case of small samples.