1. Introduction

Hyperspectral images contain abundant spatial, spectral, and radiometric information of earth surfaces, which makes earth observation and information acquisition much more effective and efficient for some applications [1, 2]. Hyperspectral remote sensing images can be acquired through airborne or spaceborne sensors. There are two remote sensing satellites carrying hyperspectral imagers in China: the moon exploration satellite CE-1 launched during 2007 and the small satellite constellation for environment and disaster mitigation (HJ-1A) launched during 2008. The hyperspectral imager carried on HJ-1A satellite (HJ1A-HSI) is the first spaceborne hyperspectral imager in China, which is also one of few international spaceborne imaging spectrometers [3]. Instead of adopting the traditional dispersion element to acquire spectrometry, this spectrometer employs an advanced interference spectrometry technique. HJ1A-HSI first acquires an interference curve for each pixel, and then uses the Fourier transform to convert interference curves to spectral curves. The interference spectrometer is modulated across space using the Sagnac interference approach. Its successful operation and application open a new era of Chinese earth observation technology. Although HJ-1A has been operated for several years, new developments for HJ-1A data processing are important for further research and application due to the new interference spectrometry technique used in the sensor.

In general, a hyperspectral image contains hundreds of bands with high spectral resolution, which brings about difficulty in data processing due to data redundancy and complexity. Furthermore, the special characteristic of HJ-1A data (i.e., the distribution of noise and interference present in spectral and spatial domain) makes its application more difficult. Therefore, efficient dimensionality reduction remains as one of the key issues for HJ-1A data processing. Principal component analysis (PCA), which is widely used for dimensionality reduction in hyperspectral image processing [4, 5], transforms raw hyperspectral data into a new feature space with mutually orthogonal coordinates. It preserves most information of original data in a low-dimensional space. However, the performance of PCA strongly depends on noise characteristics. When noise variance is larger than signal variance in one band or when the noise is not uniformly distributed between each band, PCA cannot guarantee that image quality decreases for principal components with lower ranking [6]. This drawback limits its application for hyperspectral images, which generally have very different types of noise. For example, in hyperspectral unmixing, noisy pixels might be extracted as endmembers which normally correspond to known and macroscopically pure materials.

Recently, some new approaches have been proposed to deal with noise. One of the most popular ones is the maximum noise fraction (MNF) transform [6, 7]. Similar to PCA, MNF also transforms the original data to a feature space; however, features are arranged in terms of image quality, which is measured with signal-to-noise ratio (SNR). In MNF, noise covariance matrix (NCM) needs to be estimated [810], which is a key step. In the original MNF, only spatial information is used for NCM estimation, which may not effectively handle special noise with regular pattern, such as interference in HJ-1A data.

There are many methods developed for noise estimation in image analysis [2]. Some traditional approaches use homogeneous area (HA) selection and spatial character analysis, such as HA method [11], Geo-statistical method [11], and local mean and local standard deviation method [12]. However, these methods are easily affected by land cover types in the image. To solve this problem, Roger and Arnold [13] proposed Spectral and Spatial De-Correlation (SSDC) method. Compared with traditional methods, SSDC is more stable, and is widely used. However, SSDC also has some limitations. For example, the noise estimation may be inaccurate when hyperspectral image mainly contains one earth object with absorption feature in some bands, or when the image has a specific complex texture [14]. Residual-scaled Local Standard Deviations method (RLSD) [15] estimates signals according to high spectral correlation and eliminates the influence of complex texture and absorption feature by statistical analysis of sub-blocks. Thus, it is more stable than the SSDC and Homogeneous Regions Division and Spectral De-Correlation method [14], especially for hyperspectral images mainly covered by water. In this article, based on the characteristics of HJ-1A data, we use RLSD for SNR estimation and noise distribution analysis. Then, we propose an optimized MNF (OMNF) transform for dimensionality reduction, which contains two steps: the NCM is calculated via the SSDC, followed by the OMNF transform. Moreover, we propose an assessment framework to evaluate the performance of OMNF via spectral unmixing and classification.

The remainder of this article is organized as follows. Section 2 introduces OMNF method. Section 3 describes experimental image database, performance assessment framework, and comparative analysis methods. In Section 4, noise characteristics of HJ-1A image are analyzed, and evaluation of OMNF using spectral unmixing and classification is presented. Section 5 draws conclusions.

2. OMNF transform

Let X be a hyperspectral image data, S and N are signal component and noise component contained in image data, respectively. Assume S and N are uncorrelated, then X follows a linear model:

X = S + N
(1)

Then, data covariance matrix can be represented as

= S + N
(2)

where ∑ S and ∑ N are the covariance matrix of S and N, respectively. The MNF transform is expressed as

Y = AX
(3)

where Y is the MNF result of X, A is the MNF transform matrix. SNR of each component in Y can be analyzed as

Var a i T X Var a i T N = a i T a i a i T N a i
(4)

where Var{} computes the variance, a i is the i th component in A. Then we can obtain

N - 1 A = Λ A
(5)

where Λ and A are the eigenvalue matrix and eigenvector matrix of N - 1 , respectively. MNF is also called noise-adjusted principal components analysis which contains two steps [16, 17]. The first step is noise whitening of the hyperspectral image, then PCA is applied to noise whitened data. The main difference between conventional PCA and MNF is that MNF has a prior step of noise whitening, which needs to estimate the NCM. The original MNF method mainly adopts the spatial feature of image to estimate ∑ N , such as minimum/maximum autocorrelation factor (MAF), causal simultaneous autoregressive, and quadratic surface [18]. However, as shown in some studies [810], space-based noise estimation method is data-selective and unstable. This is because when hyperspectral image has low spatial resolution, the difference between pixels may mainly contain signal. Sometimes, noise with regular pattern, e.g., interference in spatial domain, may be considered as signal when only spatial features are used in noise estimation for MNF.

In hyperspectral images, correlation between bands generally is very large. Therefore, high correlations between bands can also be used for noise estimation, such as SSDC, which is a very useful method for hyperspectral noise estimation [13]. SSDC makes use of the high correlation of hyperspectral data in spatial and spectral domain together, where the radiation value of adjacent bands and pixels is used to estimate the radiation signal value of current pixel through multiple linear regression. Then the estimated radiation signal value is deducted from the actual radiation value of the current pixel, and the remaining value is considered as noise. However, since hyperspectral images do not completely meet the hypothesis adopted in SSDC, we cannot estimate the noise images following model (1). For example, there still is some correlation between noise images. Therefore, implementation of SSDC in Greco et al.’s study [9] is not feasible to estimate NCM for hyperspectral data.

2.1. NCM estimation

The most difference between MNF and OMNF is that OMNF adopts more accurate noise covariance estimation. In the proposed OMNF, noise image computed by SSDC can be used to estimate NCM. In order to control the influence of spatial feature, the image is divided into non-overlapping small sub-blocks, where noise image estimated by residual of each sub-block can be used to calculate NCM. SSDC adopted in this article uses multiple linear regression to estimate noise image:

x i , j , k = a + b x i , j , k - 1 + c x i , j , k + 1 + d x p , k
(6)

where x i,j,k is the pixel value of band k at (i, j) in a certain sub-block, x i,j,k-1 and x i,j,k+1 are pixel values in band k – 1 and k + 1, x p,k is the pixel value spatially near x i,j,k in band k, a, b, c, and d are parameters which need to be estimated through multiple linear regression. In (6), x p,k is defined as

x p , k = { x i - 1 , j , k ; i > 1 , j = 1 x i , j - 1 , k ; j > 1
(7)

where the pixel located at (1,1) of the sub-block is not considered. SSDC estimates possible signal value x ^ i , j , k at band k from the obtained parameters. Then, noise of each pixel can be obtained through r i , j , k = x i , j , k - x ^ i , j , k . Finally, the NCM for OMNF can be calculated as follows:

σ k 2 = i = 1 H j = 1 W r i , j , k - r ¯ i , j , k 2 , C kl = i = 1 H j = 1 W r i , j , k - r ¯ i , j , k r i , j , l - r ¯ i , j , l , 1 k , l N
(8)

where (i,j) ≠ (1,1), W, H are the width and height of image, respectively, and N is the total number of bands.

2.2. OMNF transformation

After noise variance is estimated through (8), noise correlation is removed through noise whitening with (5). Finally, dimensionality reduction can be performed through (3).

3. Experiments design and assessment methods

3.1. HJ-1A hyperspectral data

The InterFerometric Imaging Spectrometer (IFIS) installed on HJ-1A is the first hyperspectral earth observation sensor in China [3]. Its spectrum ranges from 0.45 to 0.95 μm with 115 spectral bands. The average spectral resolution is about 5 nm. The nominal ground sample distance is 100 m with an image swath of about 60 km. IFIS is a typical Sagnac imaging Fourier transform spectrometer featured by a compact structure, small volume, and light weight. Table 1 lists the parameters of the IFIS on HJ-1A comparing to those of HYPERION on Earth Observing 1 (EO-1) and Compact High Resolution Imaging Spectrometer (CHRIS) on PRoject for On-Board Autonomy (PROBA). This hyperspectral imaging sensor has excellent specifications for practical applications. However, the data quality is degraded by severe noise. The new interferometric spectrometer technique is used in this sensor, which makes that the noise characteristics of IFIS are very different from noise contained in normal dispersion spectrometry used in HYPERION and CHRIS. For instance, regular striping noise is still present after calibration, and cannot effectively be removed by Fourier transform and notch filter method. Therefore, effective noise removal is crucial.

Table 1 Parameters of the HJ-1A IFIS, HYPERION, and CHRIS

In this article, HJ-1A images at Bohai Sea area are chosen for experiments (Figure 1). The HJ-1A hyperspectral images used here contain 115 bands, and 400 × 400 pixels.

Figure 1
figure 1

HJ-1A hyperspectral data at Bohai Sea area used in this article.

3.2. Noise characteristics analysis

Image noise may be periodic noise or random noise. Periodic noise can effectively be eliminated through frequency domain filtering, such as notch or bandpass filtering. However, it is more complex to effectively remove random noise, which is generally assumed to be additive Gaussian white noise [19, 20]. In this article, we propose to use RLSD [15] for noise estimation. In summary, RLSD procedure is described as follows:

Step 1: We divide the image into many small rectangular sub-blocks, and then calculate parameters a, b, and c of signal component in each sub-block through multiple linear regression:

x k = a + b x k - 1 + c x k + 1
(9)

Here, x k is a vector with N × 1 values, where N = w × h with w and h being width and height of a certain sub-block at band k, respectively. Then, the estimated signal value x ^ k is calculated through the obtained parameters and pixel value at adjacent bands. Finally, the residuals are obtained by: r = x k - x ^ k . Since the predictable signal information between bands is removed, the remained ‘unexplained’ residuals can approximate noise.

Step 2: The Local Standard Deviation (LSD) is calculated at each sub-block as follows:

LSD = M - 3 - 1 S 2 1 / 2
(10)

where M is the number of pixels of this sub-block and S 2 is the variance of residuals of the sub-block. As there are three parameters used in multiple linear regression shown in (9), the unbiased estimation requires the term of M – 3.

Step 3: After LSDs of all sub-blocks are calculated, we extract maximum and minimum values of the obtained LSDs. Then, several bins with equal interval are set between these two values. The numbers of sub-blocks in each bin can be counted according to its LSD value. Finally, the mean LSD value of the bin with the most number of blocks is calculated, which can be considered as the noise of the whole image.

3.3. Assessment framework and methods

We consider several dimensionality reduction methods, i.e., PCA, MAF, MNF, and OMNF, for evaluation. Since the spatial resolution of HJ-1A hyperspectral data is 100 m, mixed pixels generally exist in the image. Therefore, we consider spectral unmixing for evaluation from full-pixel scale to sub-pixel scale. Similarly, image classification is also considered for evaluation as it has important applications. Figure 2 shows the flowchart of the proposed scheme.

Figure 2
figure 2

Schematical description of the approach to assess performance of OMNF.

3.3.1. Spectral unmixing method

Spectral unmixing mainly obtains endmember extraction and abundance estimation [21, 22]. Endmember extraction extracts pure pixels. Abundance estimation estimates the proportion of each endmember in a mixed pixel. In spectral unmixing, abundance estimation generally adopts a least squares method (constrained or unconstrained). Many endmember extraction methods are developed, such as, Pixel Purity Index (PPI), N-FINDR, Vertex Component Analysis (VCA), Iterative Error Analysis (IEA), and so on [22].

In the aforementioned methods, N-FINDR is one of the most widely used algorithms [23]. Its aim is to find a set of pixels that can construct a simplex with the maximum volume. These pixels can be considered as endmembers. Due to the requirement of a square matrix used in volume calculation in N-FINDR, the original image must be transformed to a (p – 1)-dimensional subspace by a dimensionality reduction method.

3.3.2. Image classification method

Hyperspectral classification can be supervised or unsupervised, parametric or non-parametric, and hard or soft (fuzzy). Traditional pixel-based classification methods, such as Maximum Likelihood Classifier (MLC), Spectral Angle Mapper (SAM), Minimum Distance Classifier (MDC), analyze data without incorporating spatial information. However, spatial information can play an important role in hyperspectral image classification [24]. Classification accuracy can greatly be improved when spatial and spectral features are effectively combined [25]. In this study, we propose to use a Homogenous Objects Extraction (HOE)-based method to combine spectral and spatial information for classification. Meanwhile, the HOE method can efficiently deal with the special noise present in HJ-1A data.

In homogenous object-oriented image classification, such as HOE, the key issue is to extract the objects with high homogeneity. Non-uniform radiation response increases spectral variation, which is common with the high degree of spectral heterogeneity in complex landscape. Thus, in the HOE-based classification approach used in this study, all pixels inside a homogeneous object can be considered belonging to the same class. Furthermore, since homogeneous regions are extracted through spectral similarity between pixel and neighborhoods, integration of spectral feature and a series of spatial features (such as shape, size, texture, and context relationship) can be applied in classification. This approach mainly includes three steps: image segmentation, feature extraction, and classification.

Image segmentation

In this article, fuzzy K-means clustering is used for image segmentation. Fuzzy K-means is a soft clustering algorithm which determines the subordination degree of each pixel in each type according to that of its vector value between [0, 1]. This algorithm is an iterative process, where each type of centroids (ci) and pixel subordination matrix (uij) are adjusted using (11) until the convergence of objection function J = i = 1 k j = 1 n u ij m d ij 2 .

c i = j = 1 n u ij m x j j = 1 n u ij m , where u ij = k = 1 c d ij d kj 2 / m - 1 - 1 ,
(11)

where m∈[1, ∞] is a weighted index, d ij is dissimilarity measurement, such as Euclidean distance.

Feature extraction

After image segmentation, the features of homogeneous objects can be extracted, which may include the spatial position, spectra of the homogenous object, and its class label. Since all the pixels in the same segment belong to the same class, the class label of the segment can be obtained by tracking the margin through a contour-based object tracking method. Moreover, the mean spectrum of all pixels in each homogeneous object is used as the spectral feature for this object.

Classification

In general, traditional pixel-based method performs classification by comparing the spectral similarity of each pixel with prior knowledge of the training samples. In the HOE-based method, pixel-wise training samples need to be transformed to objects according to the relationship between a given pixel and its corresponding homogenous object. Such classification model parameters can be estimated by training the objects at different homogeneous regions. As shown in (12), the Mahalanobis Distance (MD) is considered:

D x i , z l = x i - z l T l - 1 x i - z l ,
(12)

where z l and l are the mean vector and covariance matrix of training samples, respectively.

4. Experiments and results analysis

4.1. Noise characteristics analysis of HJ-1A hyperspectral data

The diagnostic spectral features of earth materials are required for image classification and information extraction of hyperspectral images. However, hyperspectral sensor acquires data with very small spectral interval. Thus, there is insufficient optical energy for each band. It is much more difficult to improve SNR of hyperspectral data than panchromatic or multispectral images. Absorption feature of the spectrum can be detected only when spectral absorption depth is one magnitude greater than the noise level [26]. During data acquisition, the spectral feature of earth object, however, is easily distorted by noise.

In this study, the size of each sub-block is 8 × 8 pixels. In order to handle the interval bin division (see step 3 in the RLSD procedure), we estimate the noise based on the parameters estimated by the technique proposed in [12], where the bins are set in the range between the minimum LSD and 1.2 times of LSD mean value, and 150 bins are recommended.

Figure 3 illustrates some bands of the considered HJ-1A data at Bohai Sea area. It is noticeable that image quality of these bands is significantly different. This is reasonable, since, according to the interference device used in HJ-1A sensor, spectral information is acquired in a way different from the dispersive spectrometer. It receives interference data modulated and interfered by target spectral information. The ordinary data with spectral radiation information can be obtained through spectral restoration. For the errors produced by interference device and spectral restoration, HJ-1A hyperspectral data are disturbed with periodical strip at spatial domain, which is difficult to be eliminated by traditional radiance calibration methods [3]. Therefore, in order to guarantee the precision and accuracy of image classification and spectral unmixing, this special noise requires specific method to remove.

Figure 3
figure 3

HJ-1A hyperspectral data at Bohai Sea area. The central wavelength, respectively, is 460, 480, 559, 719, 838, and 957 nm from left to right.

Figure 4 shows the SNR estimates from RLSD. It can be observed that noise distribution is non-uniform. Furthermore, it is well known that noise condition is more realistic when the image mainly covers water area [27]. Therefore, dimensionality reduction with effective noise-elimination is important for real applications.

Figure 4
figure 4

SNR estimation results of HJ-1A hyperspectral data at Bohai Sea area.

4.2. Dimensionality reduction results

Figure 5 shows the six components of the considered dimensionality reduction methods. It can be seen that the first two components obtained from PCA (see Figure 5a) have most information of the data. However, it is possible that these principle components contain noise which is non-uniformly distributed in the spectral domain. Therefore, PCA is not well suitable for dimensionality reduction of HJ-1A data. Furthermore, as shown in Figure 5b, the first three components of MAF have most spatial correlations of the image, which means those components have most volume of the signal. The fourth and fifth components are almost noise. However, the sixth component contains information. This brings difficulty for determining the number of components. Therefore, MAF is also not suitable for dimensionality reduction of HJ-1A hyperspectral images. Moreover, Figure 5c shows the components obtained from MNF. It can be seen that the first two components also have the highest image quality. However, the fifth component contains more information than the third and fourth components, which are interfered by periodic strips. Thus, although traditional MNF takes noise into account and can solve the influence of non-uniform noise distribution in spectral domain, it is easily affected by the hybrid distributions of earth objects and periodic interference; thus, its components may not be arranged in descending order of image quality. Finally, Figure 5d shows the components obtained by the proposed OMNF method. It can be observed that OMNF reduces data dimensionality more effectively where all components are arranged in descending order of image quality.

Figure 5
figure 5

Results of dimensionality reduction through PCA (a), MAF (b), traditional MNF (c), and OMNF (d). In each row, the image from first to sixth components in transformed data is represented from left to right.

4.3. Comparative performance analysis

In this section, the first three components of PCA, MAF, and OMNF are used for spectral unmixing and image classification, and the first, second, and fifth components of traditional MNF are used for spectral unmixing and image classification.

4.3.1. Spectral unmixing

Endmember extraction and abundance estimation by N-FINDR and unconstrained least squares methods are applied to the dimensionality-reduced data obtained from PCA, MAF, MNF, and OMNF. Based on the obtained components, only four endmembers are extracted, and Figure 6 shows the obtained abundance and error maps. Several conclusions can be obtained from Figure 6. First of all, the results obtained from the PCA components are only the salt area is appropriate and all the other endmembers are greatly affected by noise. Furthermore, it can be seen that the results obtained from MAF, MNF, and OMNF components are better than those of PCA, where OMNF obtains the best results. This is because all these methods take noise into account. Moreover, it can be observed that the abundance estimations of vegetation, salt area, muddy water, and water body are more reasonable in geographical distribution than results from other dimensionality reduction methods. This is because OMNF eliminates noise during the dimensionality reduction procedure. For example, distribution of vegetation in abundance map of OMNF is better than others. Another example is distribution of salt area is repeated in the abundance maps of salt area and water body in both MAF and MNF results.

Figure 6
figure 6

Abundance estimation results of extracted endmembers by N-FINDR. Abundance retrieval is processed based on dimensionality reduced HJ-1A hyperspectral data through PCA (a), MAF (b), traditional MNF (c), and OMNF (d). In each row, vegetation, salt area, muddy water, water body, and error are represented from left to right.

4.3.2. Image classification

Figure 7 shows the reference data on the false color composite. In the dataset, 20% of samples are used for training, and the rest for testing. Training and testing samples were randomly selected from the reference data.

Figure 7
figure 7

Training and testing samples used in classification experiments.

Figure 8 shows the classification results obtained from the raw data, the dimensionality-reduced data from the MNF and OMNF, respectively. It can be seen that the pixel-based classification using OMNF features obtained the best result, especially for the water and sea beach classes.

Figure 8
figure 8

Classification results through MD method for HJ-1A hyperspectral data. (a) Applied on raw data, and (b) applied on dimensionality-reduced data using traditional MNF, and (c) applied on dimensionality-reduced data using OMNF.

Figures 9 and 10 illustrate the classification results obtained from the MNF and OMNF features by the HOE method, respectively. Figures 9a and 10a present the segmentation of the hyperspectral image. Two steps are involved in this process. The first step is the fuzzy K-means clustering, followed by edge tracking to obtain the boundaries of the ground objects in the second step. As can be seen from Figures 9b and 10b, the classification results are better than those in Figure 8. Overall, classification result using OMNF and HOE is the best.

Figure 9
figure 9

Classification result through MNF and HOE for HJ-1A hyperspectral data. (a) The result of image segmentation, and (b) the result of image classification.

Figure 10
figure 10

Classification result through OMNF and HOE for HJ-1A hyperspectral data. (a) The result of image segmentation, and (b) the result of image classification.

The producer’s accuracy is used for further assessment [28]. Figure 11 shows the classification accuracies, where five methods are considered: pixel-based classification on the raw data, pixel-based classification on reduced data from MNF, pixel-based classification on reduced data from OMNF, HOE-based classification on reduced data from MNF, and HOE-based classification on reduced data from OMNF. It can be observed that HOE-based classification is better than the pixel-based method for water body in most parts of the study area (including water in sea water and salt area in saltern) and for sea beach, vegetation, and salt area in saltern. The obtained Kappa coefficients are 0.4076, 0.6229, 0.6740, 0.7011, and 0.8704 for pixel-based classification on the raw data, pixel-based classification of the MNF-reduced data, pixel-based classification of the OMNF-reduced data, HOE-based classification of the MNF-reduced data, and HOE-based classification of the OMNF-reduced data, respectively. It can be seen that the proposed HOE with OMNF method leads to excellent classification performance, which produced the highest accuracy for the considered HJ-1A hyperspectral image.

Figure 11
figure 11

Producer’s accuracy comparison of pixel-based classification and HOE-based classification, where classified earth objects are water (W), sea beach (SB), vegetation (V), salt area (SA), and dike (D).

5. Conclusion and discussion

Hyperspectral imager carried on HJ-1A satellite indicates a new development stage of hyperspectral remote sensing technology in China. However, due to the new interference spectrometry technique implemented in this sensor, noise characteristics of HJ-1A images are more complex than images acquired by other hyperspectral sensors, such as HYPERION and CHRIS. This article presents an OMNF method for dimensionality reduction for HJ-1A images, which estimates the NCM using SSDC method. The proposed approach is evaluated by a real HJ-1A hyperspectral data at Bohai Sea. Both unmixing and classification results indicate that the proposed method can offer excellent performance for HJ-1A data applications.