Fast colored video encryption using block scrambling and multi-key generation

Hosny, Khalid M.; Zaki, Mohamed A.; Lashin, Nabil A.; Hamza, Hanaa M.

doi:10.1007/s00371-022-02711-y

Fast colored video encryption using block scrambling and multi-key generation

Original article
Open access
Published: 18 November 2022

Volume 39, pages 6041–6072, (2023)
Cite this article

Download PDF

You have full access to this open access article

The Visual Computer Aims and scope Submit manuscript

Fast colored video encryption using block scrambling and multi-key generation

Download PDF

Khalid M. Hosny ORCID: orcid.org/0000-0001-8065-8977¹,
Mohamed A. Zaki¹,
Nabil A. Lashin¹ &
…
Hanaa M. Hamza¹

2780 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

Multimedia information usage is increasing with new technologies such as the Internet of things (IoT), cloud computing, and big data processing. Video is one of the most widely used types of multimedia. Videos are played and transmitted over different networks in many IoT applications. Consequently, securing videos during transmission over various networks is necessary to prevent unauthorized access to the video's content. The existing securing schemes have limitations in terms of high resource consumption and high processing time, which are not liable to IoT devices with limited resources in terms of processor size, memory, time, and power consumption. This paper proposed a new encryption scheme for securing the colored videos. The video frames are extracted, and then, the frame components (red, green, and blue) are separated and padded by zero. Then, every frame component (channel) is split into blocks of different sizes. Then, the scrambled blocks of a component are obtained by applying a zigzag scan, rotating the blocks, and randomly changing the blocks' arrangements. Finally, a secret key produced from a chaotic logistic map is used to encrypt the scrambled frame component. Security analysis and time complexity are used to evaluate the efficiency of the proposed scheme in encrypting the colored videos. The results reveal that the proposed scheme has high-level security and encryption efficiency. Finally, a comparison between the proposed scheme and existing schemes is performed. The results confirmed that the proposed scheme has additional encryption efficiency.

Even symmetric chaotic and skewed maps as a technique in video encryption

Article Open access 06 April 2023

A robust and lossless DNA encryption scheme for color images

Article 16 June 2017

Block-Permutation-Based Encryption Scheme with Enhanced Color Scrambling

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

With the fast evolution of network technology and multimedia applications, video applications such as video-on-demand (VOD), video meetings, pay-tv, and video surveillance have been widely used. Because the video transmission depends on different networks, the video content may be captured because of the anatomy of the public channels. Securing colored videos during transmission and storage has become a challenging topic in recent years. The general video security objectives are availability, integrity, and confidentiality [1]. In general, three methods, i.e., video encryption (cryptography) [2,3,4], video steganography [5,6,7,8], and video watermarking [9, 10] could be used to achieve security. Cryptography is the most efficient technique to provide security to the colored videos by converting the raw video into an unintelligible video form using a secret key. The plain video can be restored only with the knowledge of the secret key. Video encryption techniques use two building blocks proposed by Shannon diffusion and confusion [11].

In general, image and video encryption algorithms are divided into full encryption and compression-combined encryption (selective encryption) [12]. Each of them has advantages and limitations. In full encryption [13,14,15,16,17,18,19], the whole image or video content is encrypted with a novel method directly, as shown in Fig. 1b. The full encryption algorithms are applied to uncompressed or compressed videos using any compression method [20]. The full encryption algorithms provide high-security encryption but take a long processing time. They are used in significant applications such as military and medical applications. In selective encryption [21,22,23], the video data are partially distorted by the encryption process, and the encrypted video is still partially intelligible after the encryption, as shown in Fig. 1c. They are used in applications that require low processing time. The proposed scheme wants to combine the advantages of the two mentioned methods to achieve good encryption and low processing time.

Because of the high correlation of the video frame neighboring pixels and the strong relationship between the video frames, traditional algorithms such as AES and DES could not guarantee high performance and low time processing for video encryption. The AES and DES are also unsuitable for encrypting colored video in real time [24]. Therefore, several algorithms for multimedia encryption were proposed [25,26,27,28,29,30,31,32,33,34]. These algorithms introduced by several academicians and researchers use different techniques such as DNA encoding and chaotic maps to encrypt images and videos securely and robustly. The most recent multimedia encryption methods are summarized in this section.

In [13], Li et al. presented a video encryption scheme that uses different chaotic algorithms and depends on the amount of information in each channel of a video frame. The video file is divided into a video stream and an audio stream. The video file stream is converted into YCbCr color space. The Arnold map and DNA encoding algorithm encrypt the Y channel, and the Lorenz hyperchaotic map is used to encrypt the other channels, where this scheme requires high-time processing. Yasser et al. proposed a multimedia encryption scheme based on hybrid-chaotic [19]. The proposed cryptosystem includes different media types such as videos, images, speech, and text. Alarifi et al. [16] developed a new hybrid cryptosystem for compressed video files based on chaotic maps, DNA sequences, and a modified Mandelbrot set. The scheme uses the Arnold map to generate three keys, and then, the encoding of the keys is performed with DNA sequences. The Hamming distance between the keys and a compressed YCbCr video frame is applied, encoding the result, and confusion and diffusion principles are applied. Valli and Ganesan[35] implemented a video encryption system that uses a substitution box to achieve diffusion and uses two different schemes. The first scheme is the higher-dimensional 12D chaos structure, and the other uses the Ikeda delay differential equation. The proposed drawbacks are the complexity of the key and the time that the encryption process takes. Kumar et al. [36] suggested a secure scheme based on chaos for video encryption. The algorithm provides a three-level of security: random selection of the frame, permutation order of the frame, and diffusion of the frame. In [37], Song et al. proposed a secure scheme to encrypt quantum videos. The proposed consists of three steps. First, permutation of the inter-frame position based on keys generated from an improved logistic map. Second, geometric transformation and improved logistic map change intra-frame pixels position. Finally, the quantum controlled-XOR operations and improved logistic map were used to encrypt the high 4-intra-frame-qubit-planes. In [38], Ye et al. used frequency domain encryption. First, the original image is transformed with the discrete wavelet transform and then compressed. Then, the carrier image is processed by lifting the wavelet transform and discrete cosine transform together with a Schur decomposition. Visually meaningful image encryption is achieved by embedding operation at the end. The encryption in the frequency domain improves encryption efficiency, but the implementation of frequency domain transformation leads to data loss. In [39], each channel of the color image was encrypted by the multi-parameter fractional discrete Tchebyshev moments. In [40], Gong et al. studied four-dimensional chaotic systems for image encryption applications. A new opto-digital color picture encryption scheme based on a compound chaotic map, the reality-preserving fractional Hartley transformation, and the piecewise linear chaotic map for image pixel replacement, optical processing, and permutation is suggested [41]. The proposed technique has a high sensitivity to keys and greater protection.

An overview of different schemes for securing colored video is introduced. Still, they have some drawbacks and vulnerabilities: (1) the running time of the related algorithms is high and does not meet the real-time applications. (2) Some related algorithms are complex and unsuitable for IoT devices. (3) Some related algorithms evaluate their proposed work based on test images and do not investigate the test videos. (4) Some related algorithms do not investigate the effect of different noises in the security performance analysis. Motivated by previous points, this paper introduces a new scheme for securing the colored video with high-quality encryption to improve such shortcomings. The proposed scheme consists of a video preprocessing step plus four main steps: colored video components extraction and padding, frame components splitting, frame components scrambling, key generation, and diffusion step. The input-colored video is preprocessed to extract individual frames. The three video components (channels), red, green, and blue, are separated from each frame and padded by zeros. The four main steps are applied to each frame channel independently. First, the plain video frame channel is split into blocks, and the blocks are further split into sub-blocks by applying a new frame channel dividing scheme. Second, a scrambled frame channel is obtained by applying a zigzag scan in the blocks and the sub-blocks; then, a counterclockwise rotation by a 90° is applied to all blocks, and then, the blocks are shuffled randomly. Third, a key is generated based on the logistic map. Finally, the encrypted frame channel is obtained by applying the XOR function between the generated key and the scrambled frame channel.

The paper's contributions are summarized as follows:

1.
A novel splitting method is introduced for each frame channel.
2.
Random shuffling is performed between blocks to get a scrambled frame channel.
3.
Diffuse the scrambled component using the logistic map, where the initial value of the logistic map is based on the first input frame component, making the proposed method robust against differential attacks.
4.
The results show that the proposed scheme takes low processing time to encrypt the colored videos compared to the literature.

The rest of this paper is coordinated as follows. The proposed scheme is demonstrated in Sect. 2 in detail. Section 3 presents the simulation results and security analysis. Eventually, the work is concluded in Sect. 4.

2 The proposed video encryption method

This section describes the proposed method in detail. The raw colored video is preprocessed and encrypted in an unintelligible format. The decryption process is applied to get the original colored video. Figure 2 shows an illustrative diagram of the total steps.

2.1 Preprocessing the video

A.
Video components extraction the proposed method is applied to each frame channel independently, so the input colored video is preprocessed to extract individual frames. Then, the frame channels are separated from each frame.
B.
Frame components padding the encryption and decryption process needs the input video frame's size to be multiple of the block size. So, after the frame components are separated, it is needed to pad them by zeros according to the size of these components.

2.2 Encryption process

Here, the proposed scheme for encrypting colored video consists of four phases. These phases are performed on each channel independently. In the first phase, channel splitting is performed. In the second phase, channel scrambling (permutation) is applied. Key streams are generated from the logistic map in the third phase. The channel diffusion process is performed in the last phase.

2.2.1 Channel splitting

A raw frame channel is partitioned into blocks of equal size. The block size dimensions that the users can select from and are suitable for the scheme are 16, 32, and 64. Then, a random vector with a length equal to the number of blocks is generated. The blocks are further partitioned into sub-blocks or kept without partition based on the generated vector.

2.2.2 Channel scrambling

The arrangements of the frame channel's pixels are changed in this phase as follows:

(a)
The zigzag scan is used to permute the positions of the pixels in each block (undivided and subdivided blocks) of the divided channel.
(b)
Each block (undivided block and subdivided block) is rotated by 90°.
(c)
For every block in the divided channel, a random number is generated to create a vector $ r$.
(d)
Depending on the vector $ r$, a random permutation between blocks is performed to obtain the permuted frame channel.

2.2.3 Key generation

A new key vector $K$ from the logistic map is generated for every frame channel. The mathematical equation of the logistic map is:

$$ Y_{n + 1} = bY_{n} \left( {1 - Y_{n} } \right) $$

(1)

where 0 < $b$ ≤ 4, and a starting value 0 < $Y_{0}$ < 1. When $b$ ∈ [3.57, 4], the map is chaotic. The starting value $Y_{0}$ depends on the input colored video. The key generation steps for every frame channel are:

(a)
The starting value of the logistic map is computed.
- For the first key vector (for the first channel of the first frame), $Y_{0}$ is calculated by:
  $$ Y_{0} = \frac{{\mathop \sum \nolimits_{i = 1}^{M} \mathop \sum \nolimits_{j = 1}^{N} C\left( {i,j} \right)}}{M \times N \times 255 \times 3} + 10^{ - 20} $$
  (2)
  where $C $ is the input frame channel, and M and N are the input size.
- For other key vectors (for the other channels in the same frame or other frames), $Y_{0} $ value equals the last value of the previous key vector $ K\left( {MN} \right)$ (in the previously processed channel).
(b)
Get a sequence $S_{{{\text{temp}}}} $ by iterating Eq. (1) $ N_{0} + MN $ times, then generate a new sequence $S$ with size $ MN$ by discarding the first $N_{0}$ values of $ S_{temp}$.
(c)
Generate the key vector $K$ by equation (3):
$$ K\left( i \right) = mod\left( {floor\left( {S\left( i \right) \times 10^{14} } \right), 256} \right), \quad i = 1\;{\text{ to}} MN $$
(3)

2.2.4 Channel diffusion

In this phase, a bit-wise exclusive OR function is applied between every value in the generated key vector and the corresponding value in the permuted frame channel vector. After the channel pixels values are changed, an encrypted frame channel is generated. Algorithm 1 presents the steps of the encryption process. Also, Fig. 3 shows the flowchart of the scheme phases.

2.3 Decryption process

The decryption process can be constructed by inverting the encryption phases with the original keys to get the plain channels of each frame. The decryption steps are:

(1)
The bit-wise exclusive OR function is performed between every value in the key vector and the corresponding value in the encrypted frame channel vector.
(2)
Reordering the channel blocks placements to their original placements based on the random vector.
(3)
Apply a rotation by -90° and inverse zigzag pattern to all blocks to rearrange the original placements of the pixels.

3 Simulation results and security analysis

This section examines the colored video encryption scheme for privacy and robustness. The colored videos used for testing are Train.avi (192 × 352 × 3), Rhinos.avi (240 × 320 × 3), Viptrain.avi (240 × 360 × 3) and Flamingo.avi (192 × 352 × 3) taken from Valli and Ganesan [35], and Foreman.avi (352 x 288 x 3) downloaded from YUV Sequences [42]. Figure 4 shows the test video samples. The proposed scheme is executed using MATLAB (R2015a) on a laptop that has the subsequent specifications: Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz 2.21 GHz, 16 GB memory, and Windows 11 OS. The algorithm's initial parameters are: In the channel splitting step, the dimension of the blocks is 16 (where n = 4), $b = 3.9$ for the logistic map, and $N_{0}$ = 1000 for the skipped elements.

3.1 Visual analysis

Different evaluation metrics have been used with the proposed scheme. The first metric used to evaluate the scheme is the visual inspection. The encryption/decryption results of the videos are displayed in Fig. 4. The results indicate that the scheme hides all details within the test videos, and the receiver side restores the original videos successfully.

3.2 Histogram analysis

A histogram is an essential tool in evaluating the efficiency of the encryption scheme. It represents the number of occurrences of each pixel value in a frame channel. The flat histogram indicates that the frame channel can resist different types of statistical attacks [43]. Figures 5, 6, 7, 8 and 9 show the histograms for various videos' 10th original, encrypted, and decrypted frames. It is observed that the encrypted frames histograms have a uniform distribution form and are not similar to their corresponding original frames histograms.

Consequently, the proposed scheme hides any pattern in the frames of the test videos. Additionally, the decrypted frames histograms and their corresponding original frames are the same. So, the scheme can recover the original frame from the encrypted one successfully.

3.3 Correlation analysis

Principally in each video frame, there is a high correlation between neighboring pixels as the intensity values are nearly the same. These relationships must be reduced to protect the video frame against different attacks. The adjacent pixels pair's correlation can be calculated using the following equations.

$$ r_{A,B } = \frac{{E\left( {\left( {A - E\left( A \right)} \right)\left( {B - E\left( B \right)} \right)} \right)}}{{\sqrt {D\left( A \right)D\left( B \right)} }} $$

(4)

$$ E\left( A \right) = \frac{1}{s} \mathop \sum \limits_{i = 1}^{s} A_{i} $$

(5)

$$ D\left( A \right) = \frac{1}{s} \mathop \sum \limits_{i = 1}^{s} \left( {A_{i} - E\left( A \right)} \right)^{2} $$

(6)

where $A$ and $B$ represent the two adjacent pixel values, and $s$ is the total number of selected pairs. Figures 10, 11 and 12 show the horizontal (H), vertical (V), and diagonal (D) correlation distributions of 6000 random pairs of neighboring pixels selected for the 10th original and encrypted frame of the Flamingo test video. The correlation values of 6000 random pairs of adjacent pixels for the 10th original and encrypted frame of various videos, along with H, V, and D directions, are presented in Table 1. From the results, the values of the original frames are close to one. On the contrary, the values of the encrypted frames are very low and very close to zero. So, there is no correlation between pixels in the frames encrypted by the proposed scheme. Therefore, the proposed scheme can resist statistical attacks.

Table 1 Correlation coefficients for various videos

Fast colored video encryption using block scrambling and multi-key generation

Abstract

Similar content being viewed by others

Even symmetric chaotic and skewed maps as a technique in video encryption

A robust and lossless DNA encryption scheme for color images

Block-Permutation-Based Encryption Scheme with Enhanced Color Scrambling

Explore related subjects

1 Introduction

2 The proposed video encryption method

2.1 Preprocessing the video

2.2 Encryption process

2.2.1 Channel splitting

2.2.2 Channel scrambling

2.2.3 Key generation

2.2.4 Channel diffusion

2.3 Decryption process

3 Simulation results and security analysis

3.1 Visual analysis

3.2 Histogram analysis

3.3 Correlation analysis

3.4 Entropy analysis

3.5 Differential attack

3.6 Encryption quality analysis

3.6.1 Histogram deviation (\({\varvec{D}}_{{\varvec{H}}}\))

3.6.2 Irregular deviation (\({\varvec{D}}_{{\varvec{I}}}\))

3.7 PSNR, SSIM, and FSIM analysis

3.8 Chosen-plaintext and known-plaintext attacks analysis

3.9 Edges detection analysis

3.10 Keyspace analysis

3.11 Key sensitivity analysis

3.12 Channel noises attack analysis

3.12.1 Salt & peppers noise

3.12.2 Gaussian noise

3.13 Occlusion attack analysis

3.14 Execution time

3.15 Time complexity analysis

3.16 Comparison with existing methods

4 Conclusion

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation