A new color image encryption technique using DNA computing and Chaos-based substitution box

Masood, Fawad; Masood, Junaid; Zhang, Lejun; Jamal, Sajjad Shaukat; Boulila, Wadii; Rehman, Sadaqat Ur; Khan, Fadia Ali; Ahmad, Jawad

doi:10.1007/s00500-021-06459-w

A new color image encryption technique using DNA computing and Chaos-based substitution box

Focus
Open access
Published: 01 December 2021

Volume 26, pages 7461–7477, (2022)
Cite this article

Download PDF

You have full access to this open access article

Soft Computing Aims and scope Submit manuscript

A new color image encryption technique using DNA computing and Chaos-based substitution box

Download PDF

Fawad Masood^1,9,
Junaid Masood²,
Lejun Zhang¹,
Sajjad Shaukat Jamal³,
Wadii Boulila^4,5,
Sadaqat Ur Rehman⁶,
Fadia Ali Khan^7,8 &
…
Jawad Ahmad⁹

3505 Accesses
25 Citations
Explore all metrics

Abstract

In many cases, images contain sensitive information and patterns that require secure processing to avoid risk. It can be accessed by unauthorized users who can illegally exploit them to threaten the safety of people’s life and property. Protecting the privacies of the images has quickly become one of the biggest obstacles that prevent further exploration of image data. In this paper, we propose a novel privacy-preserving scheme to protect sensitive information within images. The proposed approach combines deoxyribonucleic acid (DNA) sequencing code, Arnold transformation (AT), and a chaotic dynamical system to construct an initial S-box. Various tests have been conducted to validate the randomness of this newly constructed S-box. These tests include National Institute of Standards and Technology (NIST) analysis, histogram analysis (HA), nonlinearity analysis (NL), strict avalanche criterion (SAC), bit independence criterion (BIC), bit independence criterion strict avalanche criterion (BIC-SAC), bit independence criterion nonlinearity (BIC-NL), equiprobable input/output XOR distribution, and linear approximation probability (LP). The proposed scheme possesses higher security wit NL = 103.75, SAC ≈ 0.5 and LP = 0.1560. Other tests such as BIC-SAC and BIC-NL calculated values are 0.4960 and 112.35, respectively. The results show that the proposed scheme has a strong ability to resist many attacks. Furthermore, the achieved results are compared to existing state-of-the-art methods. The comparison results further demonstrate the effectiveness of the proposed algorithm.

A novel and Fast hybrid design of cryptosystems for Image via 5-D chaos based random keys and DNA

Article 22 December 2023

A novel RGB image encryption algorithm based on DNA sequences and chaos

Article 05 November 2020

Lossless chaotic color image cryptosystem based on DNA encryption and entropy

Article 03 August 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The trend of transmitting digital information over the Internet is growing exponentially. While this increases convenience and accessibility, extra challenges also increase with every development and improvement in technology. One of the inevitable issues is providing adequate security for data transmission that utilizes insecure communication networks. The number of connected users grows every day, as does their diverse Internet activity. As a result, the numbers and types of potential cybersecurity assaults have increased as well. This creates further challenges because data is an organization's most essential asset in today's world. Protecting sensitive data from unauthorized access has become a critical priority because attackers may use open public Internet for exploitative or malicious purposes. To avoid such attacks, sensitive data requires modification into cipherable forms before being transmitted via unencrypted channels. Confidential information requires a speedy, reliable, and robust cryptosystem to prevent information leakage.

Both researchers and academics have been exploring multiple alternative approaches to protecting transmitted data. With recent developments in communication technology, many encryption algorithms are designed for the security of real-time communication. Cryptography also plays an essential role in providing security for sensitive information. A wide range of algorithms has been presented to this end, including advanced encryption standard (AES), data encryption standard (DES), Elliptic curve cryptography (ECC), and so on. Many attempts have also been made to break down specific algorithms based on advanced encryption standards (AES) and data encryption standards (DES), which have been successful in some instances since 1993.

Regardless of these outliers, cryptography is still one of the most effective methods for preserving sensitive data. With expanding growth of new Internet channels and technologies, more sophisticated cryptanalysis and more robust and efficient image encryption techniques have become necessary for secure data communication. This is because cryptography encodes and transmits data in a specific format that can only be read and processed by those authorized to use advanced mathematical concepts. Encryption, or the act of encoding a communication in a format that unauthorized users cannot read or understand, is a crucial part of cryptography. Encryption in its various forms has been used since the Romans and even earlier, but increasingly complex versions are needed to keep up with new needs. A plain text can be encrypted into ciphertext and then the data can be sent over an insecure transmission medium. Depending on the security of the algorithm, the ciphertext may not be accessed by an unauthorized person.

A variety of symmetric and asymmetric image cryptographic algorithms have also been developed. In symmetric key cryptography, for instance, both users (i.e., sender and receiver) use a single key for the process of ciphering and deciphering. By contrast, asymmetric key cryptography utilizes two keys, a public key and a secret one, at each point to achieve additional security. In this approach, the private key is always kept secure because it decrypts the information. In contrast, the public key is always made publicly available to everyone because it does not help us decrypt the secret information.

In addition, most modern encryption designs are based on chaotic systems. Symmetric key cryptographic algorithms are significant because they produce a strong key for these cryptosystems and are very cheap. The keys are considerably smaller for the degree of security they provide, and running these algorithms is relatively inexpensive. Chaotic maps have also garnered a great deal of attention over the past few decades as another means of protecting cryptographic algorithms. Chaotic cryptography can secure communication further in a shorter duration of time. Quantum image processing is another method that is also becoming more popular to ensure information confidentiality. However, there are multiple proposals for data encryption in the literature. Chaotic systems, quantum encryption, and substitution boxes (S-boxes) are all often used.

Several nonlinear methods have been proposed to combat cryptographic attacks. In past, many image encryption schemes are mainly based on a chaotic dynamical system. The behavior of dynamical systems are pseudorandom and hence suited for multimedia encryption. The output of chaos maps is based on initial conditions. For this reason, chaos-based systems are known as deterministic systems. Their nature of randomness, sensitivity to original conditions, and ergodicity are unique characteristics (Stallings 2006; Chuang et al. 2011; Al-Najjar 2012; Banthia and Tiwari 2013; Rivest 1990; Matthews 1989; Wheeler and Matthews 1991; Chen and Liao 2005; Masood et al. 2020a, 2021, 2020b; Ahmad et al. 2020; Hanouti et al. 2020; Butt et al. 2020; Munir et al. 2020). These characteristics lead to a reliable cryptosystem, while chaotic maps and dynamical systems help to generate long-term chaotic sequences. Here, even a small change in initial conditions will significantly shift the chaotic sequence initially developed. These properties make these options some of the best choices for constructing secure algorithms in cryptography. By contrast, many techniques based on cryptanalysis are offered as a means of securing cryptographic algorithms, in turn depicting weakness in existing cryptosystems (Munir et al. 2021a, 2021b; Hanouti et al. 2021a, 2021b).

DNA computing and its intrinsic properties have been used extensively in the field of cryptography. Massive parallelism, high-level computational capacity, and storing large amounts of data are among these inherent properties. Research in this area often utilizes publicly accessible biological data to encrypt plaintext data in DNA computing applications. Adleman (Adleman 1994, 1998; Jiao and Goutte 2008) was the first to propose cryptographic DNA computing in 1994, initiating a new era of data processing that provides DNA-based encryption algorithms with tangible advantages over conventional cryptographic techniques. However, encrypting images with DNA encoding alone are inefficient. As a result, the underlying vulnerability problems are often solved using encryption techniques utilizing DNA computing and chaotic sequences (Enayatifar et al. 2014; Naskar and Chaudhuri 2016; Hanouti and Fadili 2021). For example, Clelland et al. (1999) have developed an innovative approach to protect secret communications using human genomic DNA. Meanwhile, Xiuli et al. (Chai et al. 2017) created a unique encryption method by adding chaotic maps and DNA sequences. A matrix based on DNA is created initially, and then, a plaintext image is stored before the circulation permutation process of row and column-wise is added. Yueping et al. (Li et al. 2017) have also offered a secure cryptographic technique. These proposed cryptosystems take high-dimensional chaotic maps to get robust security. Yueping et. al’s. systems could withstand various assaults based on chosen plain text and cipher text methods, and their proposed scheme works rapidly and efficiently.

Many other researchers (Mondal and Mandal 2017) have also developed effective and lightweight encryption schemes that use DNA and chaotic approaches. Here, the unencrypted image utilizes confusion with randomly generated numbers obtained from a chaotic logistic map (employed cross-linked). In one approach, the pixels are then distorted with the computational method of DNA. For instance, Chen et al. (2018) have presented a cryptosystem based on the pixel’s permutation and distortion process, which works on the self-adaptive process and is an efficient method due to its randomized but reusable variables. The last stage takes the DNA encoding method. In the Rijndael cipher (Daemen and Rijmen 1998), the work of well-known Belgian cryptographers Vincent Rijmen and Joan Daemen was selected as the advanced encryption standard (AES) in October 2000. The S-box based on AES is often regarded as the highest benchmark in this field. The optimum highly nonlinear value is 120, and the most significant value obtained by the AES S-box is 112 (Rijndael).

Following this lead, many more S-boxes have been developed to provide even stronger alternatives. For example, S-boxes with cryptographic features, such as the AES S-box, can be employed. Our work proposes a novel method for designing a robust substitution box (S-box) with better cryptography features. This S-box helps to substitute original data into plain text while maintaining its entropy level. We used one-dimensional (1D) and two-dimensional (2D) chaotic maps and DNA sequencing to construct this S-box. The sequence generated is filtered to unique random elements of a 256 count. The entropy value of 8 approximates an ideal value that satisfies the complete randomness needed from our proposed S-box.

Following its construction, our new S-box is investigated using multiple randomness and performance analysis tests, whose results show that our constructed S-box is exceptional for implementing real-time communication. Today, most cryptographers work in advanced encryption standard (AES) because of its highly robust cryptographic algorithm. In modern cryptography, block encryption algorithms play an essential role in providing security, such as international data encryption standards (IDES) and advanced encryption standards (AES). Due to their prominent chaos features, S-boxes are a superior choice for designing cryptosystems. At the same time, several security tests of S-boxes support the proposed cryptosystem's strength against both differential and linear attacks (Sani et al. 2021; Azam et al. 2021; Qayyum et al. 2020; Zahid et al. 2021; Liu et al. 2021). Thus, S-boxes form one of the fundamental nonlinear components used to provide security for cryptographic schemes.

1.1 Contribution

The following are the key contribution of our research study:

Presenting efficient cryptosystem that uses combined effect of DNA and chaotic dynamical system for the development of initial S-box.
The system uses multiple stages that help to generate highly random sequencing that exhibit minimum correlation.
The proposed system uses both substitution and permutation for an extra layer of security. Both substitution and permutation ensure higher image security.
Investigation of various existing state-of-the-art methods and comparing them with the proposed scheme.
The proposed scheme is investigated thoroughly using various tests, i.e., nonlinearity (NL), strict avalanche criterion (SAC), bit independence criterion (BIC), bit independence criterion strict avalanche criterion (BIC-SAC), bit independence criterion nonlinearity (BIC-NL), equiprobable input/output XOR distribution, and linear approximation probability.

2 Fundamental concepts

2.1 Arnold transformation (AT)

Shuffling the pixels of an initial image is one of the essential elements used to provide image security. Here, the security of an image can be accomplished by applying this one image transformation method. There are various shuffling methods; however, Arnold transformation (AT) is one of the methods utilized most extensively. The map of an Arnold transformation was discovered in the 1960s by Vladimir Arnold using a cat image (Arnold and Avez 1968); the map is described in Eq. 1:

$$ \left[ {\begin{array}{*{20}c} {x^{\prime}} \\ {y^{\prime}} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} 1 & 1 \\ 2 & 2 \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} x \\ y \\ \end{array} } \right]\text{mod } 1 $$

(1)

where $x$ and $ \, y \in \left\{ {0,1} \right\}$. The formula illustrated above is defined for a unit square though which the existing matrix can be extended upon image pixels, i.e., if $x, \, y \in \left\{ {0,1,2, \, 3, \ldots ., \, N} \right\}.$ With the increase in image pixels, there will also be an increase of elements in the matrix, and Eq. 1 can be rewritten as:

$$ \left[ {\begin{array}{*{20}c} {x^{\prime}} \\ {y^{\prime}} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} 1 & 1 \\ 2 & 2 \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} x \\ y \\ \end{array} } \right]\text{mod } N $$

(2)

An Arnold map (AM) utilizes linear algebra concepts on the positioning of pixels to change their values (Ye and Wong 2012). An AM can shuffle image pixels of any size and is generalizable. The generalized Arnold map (AMg) is expressed in matrix notation, as demonstrated by Eq. 3:

$$ \left[ {\begin{array}{*{20}c} {x^{\prime}} \\ {y^{\prime}} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} 1 & a \\ b & {ab + 1} \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} x \\ y \\ \end{array} } \right]\text{mod } N $$

(3)

In this equation, $a$ and $b$ are the two control parameters that aid in changing the position of pixels $x$ and $y$, making new coordinates of pixels $x^{\prime}$ and $y^{\prime}$ in the shuffled image. The pixels of original image $(x,y)$ will then transform into shuffled pixels of $(x^{\prime},y^{\prime})$.

On the other hand, the distinctive exponents of the Lyapunov exponent are calculated as shown by Eq. 4:

$$ \lambda = 1 + \frac{{ab + \sqrt {a^{2} b^{2} + ab} }}{2} > 1 $$

(4)

This map will behave chaotically if the Lyapunov exponent (LE) is greater than 1 (Ye 2011). This implies that if the $a$ and $b$ are each greater than 0, i.e., ($a > 0$) and ($b > 0$), then the system will be in a chaotic state.

The Arnold map generalized (AMg) equation is the discrete system that works on two effects, namely stretching and folding. These effects can be attained using the phase space system, which helps in creating confusing image encryption schemes. However, to obtain the randomly confused image, the confusion process is repeated several times. As a result, utilizing AMg as part of an image encryption scheme will take a long time. Furthermore, a digital image's finite gray levels may cause the original image to reemerge after several rounds of confusion (Wang et al. 2010).

2.2 Logistic may system

A logistic and may map (LOMAS) is a discrete time 1D chaotic system (Nkandeu and Tiedeu 2019) that can be achieved using Eq. 5:

$$ y_{m + 1} = (y_{m} e^{\wedge}\,((r^{\prime} + 9)(1 - y_{m} )) - (r^{\prime} + 5)y_{m} (1 - y_{m} )){\text{mod }}\,1 $$

(5)

where $y_{m}$ $\in$ [ 0 1] and $r{^{\prime}}$ $\in$ [0, 5]. This modified system will behave with chaotic randomness.

3 DNA system

This section will discuss gene expression, DNA basics—i.e., the four nucleotides—and their application in image encryption.

3.1 DNA and gene expression

Gene expression is the continuous process by which the genome receives and decrypts information that the living organism can utilize and process using a DNA code (Tefferi 2006). The fundamental dogma of living organisms is responsible for gene expression. A DNA molecule is fed into the central dogma process, which is then synthesized into a polypeptide chain that possesses many amino acids bonded together. Molecular biology has also demonstrated that proteins are retrieved using DNA (Hollenbach 2020). Transcription and translation are the two critical stages of the central dogma process (Cooper 1981). Transcription turns DNA into RNA, while polypeptide chains can be obtained by converting RNA through translation. The core dogma process is depicted in Fig. 1.

3.2 DNA composition

DNA is composed of four nucleic acid bases. The human genome is enormously long and sophisticated, is comprised of approximately 3.2 billion base-paired nucleotides. These are the four most essential nucleotide bases (Watson and Crick 1953), which are adenine (A), cytosine (C), thymine (T), and guanine (G). These four nucleic acids are complementary pairs, i.e., like the binary ‘0’ and ‘1,’ they complement to each other. When seeking pairwise combinations, we can easily find that ‘00,’ ‘01,’ ‘10,’ and ‘11’ are complementary binary pairs. Thus, it is easy to encode binary numbers of ‘00,’ ‘01,’ ‘10,’ ‘11’ using four bases, i.e., ‘A,’ ‘C,’ ‘G,’ ‘T.’ Using 4! = 24, we can also get the maximum possible number of schemes. Eight out of 24 schemes have satisfied the complementary base pair principle shown in Table 1 (Watson and Crick 1993). DNA sequences have better encryption properties and meet all tests for constructed S-boxes, which in turn means these qualify for real-time communication.

Table 1 The relationship of four nucleotides with $P_{(i,j)}$

A new color image encryption technique using DNA computing and Chaos-based substitution box

Abstract

Similar content being viewed by others

A novel and Fast hybrid design of cryptosystems for Image via 5-D chaos based random keys and DNA

A novel RGB image encryption algorithm based on DNA sequences and chaos

Lossless chaotic color image cryptosystem based on DNA encryption and entropy

1 Introduction

1.1 Contribution

2 Fundamental concepts

2.1 Arnold transformation (AT)

2.2 Logistic may system

3 DNA system

3.1 DNA and gene expression

3.2 DNA composition

3.3 Transcription process

3.4 DNA operation

3.4.1 DNA coding

3.4.2 DNA chosen operations

3.4.3 Number of rounds

3.4.4 DNA joined operation

4 Anticipated algorithm for the construction of S-box

5 Randomness tests for the constructed S-box

5.1 NIST test

5.2 Histogram uniformity analysis

6 S-box fundamental characteristics and experimentation process

6.1 Nonlinearity test analysis

6.2 Strict avalanche criterion (SAC) test analysis

6.3 Bit independence criterion (BIC) test analysis

6.4 The Equiprobable input/output XOR distribution

6.5 Linear approximation probability

7 Conclusion, discussion, and future prospects

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Human and animals rights

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation