1 Introduction

With the continuing expansion of the World Wide Web and social networks, enforcing copyright and preventing illegal copying in a way that allows ownership of a copyrighted work to be proven has become increasingly important. Digital watermarking can be a good solution to this problem [1,2,3]. With the development of the Internet and digital media, creating multimedia works has become easier than ever before, and the security of this volume of information raises concerns [3]. Watermarking can be called the art of hiding information so that it is not recognizable to others. This process should introduce minimal alteration to the host image [2].

The most damaging attacks are geometric attacks, which do not actually remove the watermark but manipulate the watermarked object so that the extractor can no longer recover the watermark from the watermarked data [4]. Therefore, it is important to assess the stability of the host image before inserting the watermark into it. Watermarking methods are generally divided into two categories: spatial domain and frequency domain. The former offers no effective protection against repetitive attacks [5]. In the frequency domain, transforms such as the DFT, DCT, and DWT are used to insert the message. In this group of methods, the watermarking operation is carried out in the transform domain: first the whole image, or each of its blocks, is transferred to another domain, the watermark is embedded there, and the result is transformed back to the image domain to obtain the watermarked image. What distinguishes the methods of this group from one another is the transform chosen and how the information is inserted in the transform domain. In general, transform-domain watermarking has less embedding capacity than image-domain watermarking, but in return exhibits greater resistance to any tampering aimed at the image [6, 7]. DWT-based watermarking algorithms insert watermarks in areas of lower sensitivity. In DCT-based methods, the host signal is broken into different frequency bands and the watermark can be inserted into any of them [8]. In [9], an algorithm based on the combination of DWT and DCT is presented.

The contourlet transform, another frequency-domain transform, captures boundaries with great accuracy. It is multi-scale and, unlike other transforms, offers a variety of directions [10]. The authors of [12] presented two algorithms that insert the watermark into coefficients with larger absolute values. Zaboli et al. [13] proposed a new method for non-blind watermarking of gray-scale images that uses features of the human visual system and a new entropy-based procedure for the watermarking process. It decomposes the main image into four levels, and the watermark is an image mixed with random noise sequences stored in the cover image. In Ref. [14], a new contourlet transform called the sharp frequency localized contourlet is introduced; it proposes a new contourlet structure that is claimed to solve the poor frequency localization of the original contourlet transform. The construction combines the Laplacian pyramid, directional filter banks (DFB), and multi-resolution image representation. According to that article, the new contourlet transform uses more filters than the original one, and the upsampling and downsampling operations are retained. The authors used the new transform in combination with SVD, another frequency-domain decomposition. GA is a population-based meta-heuristic algorithm [15]. Ref. [16] proposes a DCT-DWT-SVD-based algorithm that uses PSO, another meta-heuristic algorithm, together with Genetic Programming (GP) to modify the singular values. In our simulations we use NSCT, an optimized version of the contourlet transform that provides more directions and finer detail in image decomposition than other watermarking transforms; in addition, the contourlet excels at capturing smooth contours and preserves edges robustly in high-frequency areas without introducing distortion. We therefore expected to obtain better results, and since one of our goals was to increase the PSNR, the redundant DWT, a frequency-domain transform, is also used. To assess our method fully, MSSIM and MSE were used in addition to PSNR. The SF (scaling factor), which indicates the stability of the method against attacks, was computed for the standard test images in four ways, using PSO, PSO-GA, PSOGA-AI, and PSO-AI, to determine which variant gives the best result. We also tested the scheme with different dimensions of the host and watermark images to evaluate it accurately against attacks. These results are presented in Sect. 4.

This paper develops an intelligent watermarking method that combines the contourlet transform with SVD and uses the scaling factor (SF), found by a PSO optimized with neural networks and GA, to identify the most robust image for watermark insertion. It evaluates watermark imperceptibility through the peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) and tests robustness with the normalized cross-correlation (NCC). In addition, the SWT is used to increase the PSNR value. These parameters are compared before and after applying the algorithm.

The experiments showed that this scheme achieves better watermark imperceptibility as well as better information security. We also used the algorithm to hide an entire image in a second image, rather than inserting a few bits or a logo. The rest of this article is organized as follows: Sect. 2 introduces the relevant principles, Sect. 3 describes the proposed model, Sect. 4 presents the results, Sect. 5 discusses them in comparison with related work, and Sect. 6 concludes.

2 Background

2.1 Contourlet transform

The directions available for embedding the watermark are limited in the conventional wavelet transform, which has only three directions (horizontal, vertical, and diagonal), while the contourlet transform lets the user choose the number of directional bands at any given resolution [10]. In fact, this transform provides a multi-resolution representation of the signal. It is also a genuinely two-dimensional directional transform that describes fine curves and details in images and efficiently represents the smooth contours that are key components of images. The contourlet expansion is a multi-resolution directional expansion over basis functions. One of its most important features is that it can specify several directional analyses at each level of the multi-resolution pyramid [11]. The contourlet transform goes from the spatial to the frequency domain through a two-stage filter bank: the first stage uses the Laplacian pyramid (LP) to capture point discontinuities, a directional filter bank (DFB) then links these points into linear structures, and the resulting representation captures the main elements of the contour segments [17].

CT can be divided into two main parts: Laplacian pyramid (LP) decomposition and the directional filter bank (DFB). The original image is decomposed into a low-pass image and a band-pass image via the LP; each band-pass image is then decomposed with the DFB. Repeating the same steps on the low-pass image yields a multi-directional, multi-resolution decomposition. Figure 1 shows the CT structure. NSCT is obtained by coupling a nonsubsampled pyramid (NSP) with a nonsubsampled DFB (NSDFB); its structure is shown in Fig. 2. Owing to its anisotropy and directionality, properties not shared by wavelets, the contourlet is superior to other transforms in image processing [10]. Because it offers a richer set of directions and shapes than wavelets, the contourlet also captures smooth contours better. Given its greater capacity for hiding edges, the contourlet is more suitable for hiding data in high-frequency areas without disturbing the original image [18].

Fig. 1 CT filter bank structure

Fig. 2 NSCT filter bank structure

For the mathematical formulation of the process, suppose \( x\left[ n \right] = \left\langle f, \phi_{L,n} \right\rangle \) is an image signal for some \( f \in L^{2}\left( \mathbb{R}^{2} \right) \), where \( \phi_{L,n} \) is an orthogonal scaling function. Then the discrete contourlet transform of x is:

$$ x\,{\mathop{\Rightarrow}\limits^{\text{contourlet transform}}}\,\left( a_{J}, d_{j,k}^{\left( l \right)} \right),\quad j = 1, \ldots, J;\; k = 0, \ldots, 2^{l_{j}} - 1 $$
(1)

in which \( a_{J} \) and \( d_{j,k}^{\left( l \right)} \) are the approximation and directional detail coefficients, respectively:

$$ a_{J} \left[ n \right] = \left\langle f, \phi_{L+J,n} \right\rangle $$
(2)
$$ d_{j,k}^{\left( l \right)} \left[ n \right] = \left\langle f, \rho_{j,k,n}^{\left( l \right)} \right\rangle $$
(3)

in which \( \rho_{j,k,n}^{\left( l \right)} \) is the basis function of the directional filter bank and \( \phi_{L+J,n} \) is the basis function of the LP [19].
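As a concrete illustration, the decomposition above can be computed with the nonsubsampled contourlet toolbox (the same nsctdec/nsctrec functions used in Sect. 3); this is a minimal sketch assuming that toolbox is on the MATLAB path, with the filter names adopted later in our method:

I = double(imread('cameraman.tif'));                   % 256 x 256 grayscale test image
nlevels = [0, 1, 3];                                   % directional levels per pyramid stage
coeffs = nsctdec(I, nlevels, 'dmaxflat7', 'maxflat');  % coeffs{1} is the low-pass sub-band
Irec = nsctrec(coeffs, 'dmaxflat7', 'maxflat');        % inverse NSCT
max(abs(I(:) - Irec(:)))                               % reconstruction error, near zero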

2.2 SVD function in watermarking

SVD decomposes a matrix (not necessarily symmetric) into three matrices: the singular matrices U and V and the diagonal matrix S of singular values. If Y is such a matrix, its SVD is given by the following equation:

$$ {\text{USV}}^{\text{T}} = {\text{SVD}}\left( {\text{Y}} \right) $$
(4)

In Eq. (4) we have \( UU^{T} = I_{n} \) and \( VV^{T} = I_{n} \). The columns of U are the orthonormal eigenvectors of \( YY^{T} \) and the columns of V are the orthonormal eigenvectors of \( Y^{T}Y \). If r is the rank of the matrix Y, the elements of the diagonal matrix S satisfy the relation in Eq. (5), and the matrix Y can be rewritten as Eq. (6), in which \( \mu_{i} \) and \( V_{i} \) are the i-th columns of U and V and \( \delta_{i} \) is the i-th singular value.

$$ \delta_{1} \ge \delta_{2} \ge \cdots \ge \delta_{r} \ge \delta_{r + 1} = \delta_{r + 2} = \cdots = \delta_{n} = 0 $$
(5)
$$ Y = \mathop \sum \limits_{i = 1}^{r} \delta_{i} \mu_{i} V_{i}^{T} $$
(6)

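For concreteness, the following sketch decomposes an image matrix as in Eq. (4) and rebuilds a rank-r approximation as in Eq. (6); the image file and the value of r are arbitrary choices for illustration:

Y = double(imread('cameraman.tif'));        % any real matrix; an image is used here
[U, S, V] = svd(Y);                         % Y = U*S*V', singular values ordered as in Eq. (5)
r = 50;                                     % keep the r largest singular values
Yr = U(:, 1:r) * S(1:r, 1:r) * V(:, 1:r)';  % rank-r reconstruction of Y
psnr(uint8(Yr), uint8(Y))                   % quality of the approximation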

2.3 Stationary wavelet transform

The stationary wavelet transform (SWT) is a wavelet transform algorithm designed to overcome the translation variance of the discrete wavelet transform (DWT). Translation invariance is achieved by removing the downsamplers and upsamplers of the DWT and instead upsampling the filter coefficients by a factor of \( 2^{j-1} \) at the j-th level of the algorithm. The SWT is intrinsically redundant, since the output of each level contains the same number of samples as the input; for an N-level decomposition, there is N-fold redundancy in the wavelet coefficients [20, 21].

The 2D SWT divides the image into four sub-bands. LL is the approximation of the input image, known as the low-frequency sub-band. The LH, HL, and HH sub-bands represent the horizontal, vertical, and diagonal features of the original image, respectively. In our experiments, we found that this transform increased the PSNR. Figure 3 illustrates the SWT filter bank up to three levels, and Fig. 4 shows a three-level SWT applied to the Barbara image.

Fig. 3 SWT filter bank

Fig. 4 SWT applied to the Barbara image
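A minimal example of the one-level 2D SWT described above, using MATLAB's Wavelet Toolbox (the Haar wavelet is our choice for illustration; the image size must be divisible by 2):

X = double(imread('cameraman.tif'));     % 256 x 256 grayscale image
[LL, LH, HL, HH] = swt2(X, 1, 'haar');   % all four sub-bands keep the input size (redundant)
Xrec = iswt2(LL, LH, HL, HH, 'haar');    % inverse SWT restores the original image
max(abs(X(:) - Xrec(:)))                 % error near machine precision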

2.4 PSO combined with GA

As mentioned, the genetic algorithm is a search-based algorithm grounded in natural selection and genetics [22]. It is well suited to problems where the objective function is non-differentiable and the design variables are continuous or discrete. The genetic algorithm follows the Darwinian principle of evolution, based on the struggle for existence and the survival of the fittest. Each member of the population is treated as a chromosome, and its fitness is obtained from the objective function to be optimized. Operators such as mutation, crossover, and selection are used to evolve the initial population; fitter members of the population have a better chance of reproducing. After several iterations, the population reaches a steady state: the algorithm converges and most population members become the same, indicating a near-optimal answer to the problem. The genetic algorithm is controlled by three parameters: mutation rate, crossover rate, and population size. As in other search algorithms, the optimal response is obtained after many iterations, the number of which is determined by the chromosome length and population size. Evolutionary algorithms, particularly GA and PSO, which are both used in our design, have advantages and disadvantages [23]. The operators used in GA are random, the algorithm is very sensitive to the initial values selected by the user, and its convergence rate is low. The PSO algorithm performs more accurately, although, like GA, it is sensitive to the initial value. To overcome the limitations of PSO, combining it with GA has been proposed, on the premise that such a hybrid enjoys the benefits of PSO and GA simultaneously. One advantage of PSO over GA is its algorithmic simplicity; another obvious difference between the two is the ability to control convergence [24].

Considering the advantages and disadvantages of the two algorithms, the PSO-GA combination was first proposed by Angeline and Eberhart as a new algorithm that outperforms each individual algorithm [25, 26]. In the hybrid algorithm, the speed of finding a response increases significantly, the accuracy of the response is more acceptable, and the method can be applied to many optimization problems. The hybrid algorithm is single-objective. In this scheme, we first run PSO and then GA: the best-found positions are updated for all members of the population, the children inherit the best memory of their parents, the velocity of the first child is randomly taken from one of the two parents, and the remaining parent's velocity goes to the second child.
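The following sketch shows the flavor of one iteration of such a hybrid on a toy objective; it is our own illustrative formulation, not the exact update of [25, 26]: a standard PSO velocity/position update followed by a GA-style crossover in which children inherit the better parent's memory and a randomly assigned parent velocity.

f = @(x) sum(x.^2, 2);                       % toy objective (sphere function)
n = 10; d = 2;                               % population size and dimension
w = 0.73; c1 = 1.4962; c2 = 1.4962;          % typical PSO coefficients
X = rand(n, d); Vel = zeros(n, d);           % positions and velocities
P = X; pCost = f(P);                         % personal best positions and costs
[~, g] = min(pCost); G = P(g, :);            % global best
% PSO phase: standard velocity and position update
Vel = w*Vel + c1*rand(n,d).*(P - X) + c2*rand(n,d).*(repmat(G, n, 1) - X);
X = X + Vel;
% GA phase: arithmetic crossover; children inherit the better parent's memory,
% and child velocities are swapped at random between the parents
for i = 1:2:n-1
    a = rand; xi = X(i,:); xj = X(i+1,:);
    X(i,:) = a*xi + (1-a)*xj;
    X(i+1,:) = (1-a)*xi + a*xj;
    if pCost(i) <= pCost(i+1), b = i; else, b = i+1; end
    P([i i+1], :) = repmat(P(b,:), 2, 1);    % inherit the better parent's memory
    pCost([i i+1]) = pCost(b);
    if rand < 0.5, Vel([i i+1], :) = Vel([i+1 i], :); end
end
cost = f(X); better = cost < pCost;          % refresh personal and global bests
P(better, :) = X(better, :); pCost(better) = cost(better);
[~, g] = min(pCost); G = P(g, :);

In the full algorithm this iteration would be repeated until the best cost stops improving.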

2.5 Artificial neural networks (ANN)

The artificial neural network (ANN) attempts to model the nervous systems of living organisms, especially the human brain. An ANN is made up of a large number of highly interconnected processing elements, the neurons, that work together. ANNs can be trained and, like the brain, learn from examples. A neural network consists of interconnected parallel processing units; each unit receives input from other units, computes the sum of its inputs, calculates an output, and sends that output to the units it is connected to [2]. Artificial neural networks are thus a powerful technique for capturing the information contained in data and generalizing from it. Training a neural network is not done through explicit programming; programming is generally time-consuming for the analyst and forces them to examine and specify the exact behavior of the model, whereas neural networks learn the patterns in the data [11]. Neural networks are much more flexible in changing environments, and they can capture very complex interactions, easily modeling data that would be very difficult to model with inferential statistics or programming logic [12].

2.6 Neuronal identifiers

This procedure begins with the selection of a neural model defined by its structure and associated learning algorithm. Since neural networks are capable of learning, training can begin once the neural model and the input and output data are available. Different structures are trained and compared using the learning set, the simulation data set, and the criterion (target error).

The best structure for us is the one with the smallest number of units (neurons). Such artificial neural networks have an input (buffer) layer, one or more nonlinear hidden layers, and a linear or nonlinear output layer [25, 26]. Hybrid identifiers can identify simple nonlinear systems but not complex ones [26,27,28].

Figure 5 shows the structure of the NID, a feed-forward multilayer neural network identifier with two nonlinear hidden layers [27]. The size of the neural network is crucial in the design of the entire structure; there is no mathematical formula for calculating the optimal size of such networks, but with more free units the NID learns faster. The fundamental limitation on increasing the size of the hidden layers is the hardware of the system used in experimental work, which must be powerful. In the proposed scheme, we use a multilayer neural network to optimize PSO and PSO-GA.

Fig. 5 Multilayer neural network
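As an illustration, and not the exact network of the original design, a two-hidden-layer feed-forward identifier can be set up with MATLAB's Deep Learning Toolbox as follows (the layer sizes and toy identification data are our assumptions):

x = rand(3, 500);                 % 500 input samples with 3 features each (toy data)
t = sin(sum(x, 1));               % target signal to identify
net = feedforwardnet([20 10]);    % two nonlinear (tansig) hidden layers, linear output
net = train(net, x, t);           % Levenberg-Marquardt training by default
y = net(x);                       % network response
perform(net, t, y)                % identification error (MSE by default)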

3 Method

In this section we present our proposed algorithm, which comes in two variants, with and without the SWT. In the results section, results are reported both with and without the SWT. The flowchart of the algorithm is given in Fig. 6. The algorithm consists of two parts: the SF calculation step and watermark insertion.

Fig. 6 Flowchart of the proposed algorithm

3.1 Watermark insertion

Step 1

Reading the host image I and the watermark image W.

Step 2

Applying the contourlet transform (here NSCT is used, a newer and improved contourlet transform) to image I to obtain a low-pass image and directional band-pass images.

coeffs = nsctdec(double(I), nlevels, dfilter, pfilter)

nlevels = [0, 1, 3]   % decomposition levels

The pyramidal and directional filter parameters are set to the following values:

pfilter = 'maxflat'

dfilter = 'dmaxflat7'

Step 3

Obtaining the 2D SWT of coeffs{1}

(coeffs was obtained in Step 2)

Step 4

Applying the redundant DWT

Obtaining the SWT2 transform of the coefficients from the previous step (the low-pass sub-band coeffs{1}):

[LL, LH, HL, HH] = swt2(coeffs{1}, 1, 'haar')   % swt2 requires a wavelet name; 'haar' is our assumption, since the original does not specify one

Step 5

Extracting singular values and Watermark embedding [28].

An SVD is performed on each sub-band of the host image, \( A^{k} = U_{k} \Sigma_{k} V_{k}^{T} \), where k indexes the frequency sub-bands. We also perform SVD on the watermark image, \( W = U_{W} \Sigma_{W} V_{W}^{T} \), which yields the principal components of the watermark image.

We add the principal components of the watermark to the singular values of the host image in each sub-band: \( \Sigma_{I}^{k} = \Sigma_{k} + \Delta\, A_{Wa} \), where the scaling factor Δ is obtained from the PSO algorithm. Here, the product between the principal components and the scaling factor (explained below) is an element-wise product, and the corrected coefficients for each sub-band are:

$$ A_{w}^{k} = U_{k} \Sigma_{I}^{k} V_{k}^{T} $$
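A minimal sketch of this embedding step for one sub-band, assuming A holds a host sub-band, W a watermark image of the same size, and delta the scaling factor from the PSO stage:

[Uk, Sk, Vk] = svd(A);        % SVD of the host sub-band, A = Uk*Sk*Vk'
[Uw, Sw, Vw] = svd(W);        % SVD of the watermark image
Awa = Uw * Sw;                % principal components of the watermark
SI = Sk + delta .* Awa;       % modified singular values (element-wise scaling)
Awk = Uk * SI * Vk';          % watermarked sub-band coefficients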

Step 6

Calculating the scaling factor Δ for embedding using PSO, as described below.

Step 7

We take the inverse discrete wavelet transform (IDWT) of the modified coefficients \( A_{w}^{k} \) for each sub-band [28].

The scaling factor Δ is found by PSO using the objective function: in each PSO iteration, the candidate Δ is evaluated under several attacks, and at the end of the iterations a near-optimal scaling factor is obtained [28].
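The objective evaluated inside PSO can be sketched as follows; this is our own illustrative form, in which embed_watermark and extract_watermark are hypothetical stand-ins for the insertion and extraction procedures of this section, corr2 is used as a stand-in for NCC, and images are assumed to be uint8:

function cost = sf_objective(delta, host, wmark)
    % Embed with the candidate delta (hypothetical helper covering Steps 2-8)
    marked = embed_watermark(host, wmark, delta);
    % Representative attacks applied to the watermarked image
    attacks = {@(x) imnoise(x, 'salt & pepper', 0.01), ...
               @(x) medfilt2(x, [3 3])};
    ncc = zeros(1, numel(attacks));
    for a = 1:numel(attacks)
        attacked = attacks{a}(marked);
        ncc(a) = corr2(double(wmark), double(extract_watermark(attacked, delta)));
    end
    % Reward imperceptibility (PSNR) and robustness (mean NCC); PSO minimizes cost
    cost = -(psnr(marked, host)/100 + mean(ncc));
end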

Step 8

Taking the inverse SWT of the coefficients obtained in the previous step.

We have evaluated our proposed algorithm with all the stated criteria. Four standard image processing test images were used in our experiments, as shown in Fig. 7. All four images were used as input (host) images; besides serving as a host, image D was also used as the watermark image. Both of our proposed algorithms accept 255 × 255 and 512 × 512 images as both the host and the watermark image, demonstrating the increased capacity of our method. All experiments were performed on a computer with a Core i3 CPU, and all coding and result extraction were done in MATLAB 2018.

This article uses NCC to examine watermark robustness; a larger NCC indicates a more reliable watermark. To evaluate the robustness of the proposed model, the watermarked image was tested against common image processing operations such as the mean filter, median filter, cropping, and salt-and-pepper noise, as listed in the tables below. Figure 7 shows the host images.
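For reference, the scalar NCC between the original watermark W and the extracted watermark \( W^{\prime} \) is commonly defined as (the exact variant can differ between papers):

$$ \mathrm{NCC}\left( W, W^{\prime} \right) = \frac{\mathop \sum \nolimits_{i} \mathop \sum \nolimits_{j} W\left( {i,j} \right) W^{\prime}\left( {i,j} \right)}{\sqrt{\mathop \sum \nolimits_{i} \mathop \sum \nolimits_{j} W\left( {i,j} \right)^{2}} \sqrt{\mathop \sum \nolimits_{i} \mathop \sum \nolimits_{j} W^{\prime}\left( {i,j} \right)^{2}}} $$

which in MATLAB is a single line:

ncc = sum(W(:) .* We(:)) / (norm(W(:)) * norm(We(:)));   % W, We: original and extracted watermarks as double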

Fig. 7 Images used: a Barbara, b Baboon, c Pepper, d Lena

In both algorithms, the contourlet transform parameters were defined as follows: the decomposition level was 3, and the pyramidal and directional filters were 9-7 and PKVA, respectively. The GA-PSO parameters were c1 = c2 = 1.4962, with w1 in the update formula set to 5. The population size was 100, with 10 iterations.

In addition to the standard MATLAB images commonly used by researchers, we decided to use images from other databases to evaluate our proposed scheme further. Figure 8 shows the images selected from other databases as host images.

Fig. 8 Images used: a Moon, b Lighthouse, c Sunrise, d zoneplate

4 Results

SSIM and PSNR were used to test watermark visibility, with values reported before the attacks. Table 1 shows the PSNR, NCC, and SSIM values after attacks when the Barbara watermark image was 255 × 255 and the host image was 255 × 255, using the first proposed algorithm (without the SWT). Table 2 shows the NCC values extracted after significant geometric attacks.

Table 1 Values of PSNR, NCC and SSIM with 255 × 255 watermarked image
Table 2 NCC values for the extracted watermarked image 255 × 255 under common noise application

As shown in Table 1, all values represent acceptable results for the corresponding algorithm, and the algorithm performed well against geometric attacks. The best results were for the 3 × 3 median filter. The vertical + horizontal + vertical flip attack rotates the image about one axis at a time: the image is flipped vertically, then horizontally, then vertically again. The weakest result was for the crop attack on the Baboon image; remarkably, the best result was for the same attack on the Lena image.

To test the capacity of the proposed algorithm, we avoided logo images as watermarks and instead used large images. Table 3 shows the NCC values for the extracted watermark when the Barbara watermark image was 512 × 512 and the host image was 512 × 512, using the first proposed algorithm (without the SWT). Table 4 shows the PSNR, NCC, and SSIM values for the same setting.

Table 3 NCC values for the extracted watermarked image 512 × 512 under common noise attacks
Table 4 PSNR, NCC and SSIM values with watermarked image dimensions 512 × 512

It can be deduced from Tables 3 and 4 that by increasing the size of the watermarked image, algorithm 1 has shown acceptable results. The weakest result was related to the vertical flip + horizontal + vertical flip attack for the Lena image and the best result was again for this image when it was under crop attack showing that this image was more robust under this attack in our algorithm.

The results presented in the following tables were extracted by applying algorithm 2 (i.e., with the SWT applied). The MSE, PSNR, NCC, and SSIM values are shown in Table 5 for a 512 × 512 Barbara watermark image and a 512 × 512 host image. As can be seen, all results under this algorithm are much better; the PSNR values in particular have improved significantly.

The SSIM index, which relates to watermark visibility and is shown in Table 5, was also better than with algorithm 1; that is, this algorithm minimized the visibility of the watermark. The values listed in Table 6 likewise show a large improvement over algorithm 1. The weakest NCC result was 0.8608, for the vertical + horizontal + vertical flip attack on the Lena image, which is still a good result.

Table 5 MSE, PSNR, NCC and SSIM values for 512 × 512 watermarked image
Table 6 NCC values for the 512 × 512 extracted watermarked image under common noises

Tables 7 and 8 show the values for 512 × 512 host images and 255 × 255 watermark images. The reconstructed (extracted) image, shown in Fig. 9, is clearly of high visual quality. Next, the results of the first part of the algorithm, the SF calculation, are presented. Figure 10 plots best cost against iteration for the PSO algorithm on the Lena host image with the Barbara watermark image. The number of iterations was set to 10, c1 = c2 = 1.4, and the population size was 10. The graph shows that the algorithm converges after 4 iterations.

Table 7 NCC values for the 255 × 255 extracted watermarked image under common noises
Table 8 MSE, PSNR, NCC and SSIM values when host images are 512 × 512 and watermarked images 255 × 255
Fig. 9 The reconstructed image

Fig. 10 SF results for the PSO algorithm

Figure 11 shows the results of the PSO algorithm optimized with artificial intelligence (AI) on the same images, with a swarm size of 10. Table 9 shows the run-time, best solution, and best fitness of PSO-AI in the proposed scheme.

Fig. 11 Cost diagram for PSO-AI

Table 9 Results for PSO-AI

Figure 12 shows the results of the PSO optimized with GA. Here the number of iterations is 10 and c1 = c2 = 2. The blue curve represents the cost. The best fitness in the table indicates a decrease in this parameter relative to PSO-AI.

Fig. 12 Cost diagram for PSO-GA

Figure 13 shows the results of the PSO optimized with both neural networks and the genetic algorithm. The number of iterations is 10 and c1 = c2 = 2. The blue curve represents the cost. The best fitness shown in Table 10 indicates a decrease in this parameter compared to PSO-AI. In this algorithm, PSO is run once first, followed by GA.

Fig. 13 Cost diagram for PSOGA-AI

Table 10 Results related to PSO-GA

Table 11 shows the results for PSOGA-AI. According to the results, the best fitness for plain PSO was 8.2279, higher than for the other algorithms, so the optimized variants outperformed the plain PSO; in other words, PSO alone had the weakest results. Given the high computational cost of PSOGA-AI relative to its gain in fitness, its application was not cost-effective. As mentioned earlier, our experiments indicate that the PSO-AI algorithm gives the best results overall, so the SF is estimated for the other test images with it, where a higher cost implies a higher SF. Figures 14, 15, and 16 show the results, taking into account the tables associated with applying attacks to the images. In order of stability, the images are: (1) Baboon, (2) Peppers, (3) Lena, and (4) Barbara.

Table 11 Results for PSOGA-AI
Fig. 14 Result of “Peppers” host image

Fig. 15 Result of “Barbara” host image

Fig. 16 Result of “Baboon” host image

There are two ways to obtain the false positive rate:

1. Calculating the false positive rate between the watermark image and the extracted watermark image:

In this case, we first compute the difference between the original and extracted watermark images and count the pixels that are not zero (i.e., pixels mistakenly detected as correct). This count divided by the total number of pixels gives the false positive rate (see the sketch after this list).

2. Calculating the false positive rate in the extraction step:

After the watermarking algorithm, a set of input pixels changes. In the extraction step, pixels mistakenly identified as watermarked are counted as false positives. Table 12 shows the false positive rate.
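A minimal sketch of the first method, assuming W and We are the original and extracted watermark images:

D = double(W) - double(We);   % difference image
fp = nnz(D) / numel(D);       % fraction of pixels that differ, i.e., the false positive rate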

Table 12 Values of false positive rate with 512 × 512 watermarked image

As mentioned at the beginning of this section, for the second part of our tests we also used images from other databases. We used two 512 × 512 images as host images, together with a Baboon image of the same size as the watermark.

To keep the paper concise, we used only equal sizes for the four images selected from other databases. Note that in this step, the second algorithm was applied to the images. The results are listed in Tables 13 and 14. The Moon image is weak against the histogram attack; the best results belong to the Sunrise image.

Table 13 PSNR, NCC and SSIM values when host images are 512 × 512 and watermarked images are 512 × 512
Table 14 NCC values for the 512 × 512 extracted watermarked image under common noises

The best cost results over 10 iterations are given in Fig. 17. As can be seen in this figure, the best results belong to the Sunrise image and the weakest to the Moon image. Table 15 shows the false positive rates.

Fig. 17 Results of host image a Moon, b zoneplate

Table 15 Values of false positive

5 Discussions

In this section we compare the results of our method with several other references; their results are presented in Tables 16 and 17. Refs. [29, 30] used 512 × 512 host images and a 32 × 32 logo as the watermark. Reference [32] used 512 × 512 host images and a 64 × 64 logo as the watermark. Reference [33] used 512 × 512 images and a 64 × 64 logo for the watermark, and Ref. [33] used 512 × 512 host images and a 512 × 512 logo for the watermark.

Table 16 PSNR and SSIM values in other references
Table 17 NCC values in other references

As noted above, SSIM and PSNR, which are key watermarking metrics, are used to test watermark visibility, and the comparison with other references indicates the favorable standing of our design. The PSNR value has increased substantially even as the watermark and cover image sizes increase. It should also be noted that our watermark image was not a logo. SSIM is also competitive. As can be seen across all references, with respect to watermark and host image size, better results are obtained against attacks, especially for PSNR.

Table 16 shows the PSNR and SSIM values and Table 17 the NCC values from other references. In [30], two logo images were used as watermarks, so two values are given for PSNR and SSIM. We selected references close to our proposed design for comparison; comparing all the cases tested in our design would require several more references and further increase the page count.

As shown in the tables in Sect. 4, the experiments show that the NCC of our design remains acceptable even as the cover image and watermark sizes increase compared to the references, and the PSNR of our design is much larger. For brevity, we repeat only the SSIM and PSNR for the Lena image. Compared with the other references, the robustness of the proposed design is evidently acceptable, especially the PSNR value.

For further comparison, Tables 18 and 19 give results for similar factors from two other works, from 2018 and 2019. Table 18 shows the results of Ref. [33]; for similar images, the PSNR value in that reference is 38.5365. Table 19 shows the results of Ref. [34]. Both references also used the SVD transform.

Table 18 NCC values in Ref. [33]
Table 19 NCC values in Ref. [34]

In Tables 20 and 21, we cite two recent works that evaluated their proposed algorithms with different structures. Table 20 shows the results of Ref. [35], which uses an entropy-based logarithmic measure of information in the wavelet domain; our method outperformed it on the SSIM criterion, and our PSNR value is closer to the desired value. Table 21 shows the results for the cases shared with [36]; compared with this reference, our results are generally better. That reference states that existing schemes are not suitable for highly complex applications such as multimedia data security and medical imaging, and presents a new robust and blind watermarking scheme based on DCuT and RDWT.

Table 20 Reference results [35]
Table 21 Ref. [36] NC and SSIM values with 512 × 512 watermarked image and the size of the watermark image is 64 × 64

6 Conclusion

In combination with the contourlet transform, this paper proposes a watermarking scheme based on PSO and SVD decomposition that includes insertion and extraction algorithms. This work shows that the method not only satisfies the principal requirements of digital watermarking but also offers good resistance to common image attacks, namely filtering, noise, and cropping. Furthermore, using the two proposed algorithms, we showed that the presented algorithms have a large capacity, and that one of them gave considerably better results in terms of the stability factors studied as well as PSNR and SSIM. We also showed that improving the meta-heuristic algorithms with neural networks gives better results, compared the algorithms in this regard, and concluded that the PSO algorithm optimized by neural networks yields lower computational cost and higher performance; it is therefore the better choice for estimating the SF.