Abstract
Use of sophisticated image editing tools and computer graphics makes easy to edit, transform, or eliminate the significant features of an image without leaving any prominent proof of tampering. One of the most commonly used tampering techniques is image splicing. In image splicing, a portion of image is cut and paste it on the same image or different image to generate a new tampered image, which is hardly noticeable by naked eyes. In the proposed method, enhanced Markov model is applied in the block discrete cosine transform (BDCT) domain as well as in discrete Meyer wavelet transform (DMWT) domain. To classify the spliced image from an authentic image, the crossdomain features play the role of final discriminative features for support vector machine (SVM) classifier. The performance of the proposed method through experiments is estimated on the publicly available dataset (Columbia dataset) for image splicing. The experimental results show that the proposed method performs better than some of the existing state of the art.
1 Introduction
In recent years, due to immediacy and the easily understandable image content, images have become the prime source of information exchange. It is being used as evidence in legal matters, proof of an experiment, media, realworld events etc. At the same time, availability of sophisticated image manipulation software and pervasive imaging devices gave rise to the need for forensic toolboxes which can access authenticity of images without knowing the original source information. Hence, numerous forensic methods are proposed which focuses on detection of such malicious postprocessing of images. On the basis of method used for manipulation of an image, image forgery is divided into three categories: copymove, image splicing, and image resampling. Copymove forgery is also called as image cloning, where a subpart of an image is copied and pasted to the other part of the same image, to hide important information, whereas image splicing uses cut and paste technique, in which part of one or more images are pasted to different or same image. Image resampling is done on an image by geometric transformation like scaling, stretching, rotation, skewing, flipping. Figure 1 depicts few example of image manipulation which is downloaded from Internet.
In the course of this paper, we shall focus on image splicing, which is one of the most used techniques for image tempering. It involves combining or composition of two or more images to produce a forged image. Splicing detection uses passive approach where no prior information of image is known. In recent years, researchers have proposed several methods on image splicing forgery detection. Shi et al. [1] proposed a image model, which reduce statistical moments by treating the neighboring differences of block discrete cosine transform of an image as 1D signal and the dependencies between neighboring nodes along certain directions have sculpted as Markov model. These features are considered as discriminative feature for support vector machine classifier. Xuefang Li et al. [2] proposed an approach based on Hilbert–Huang transform (HHT) and moment of wavelet transform characteristic function. They used SVM as a classifier for spliced image classification with an accuracy of 85.86%.
Method proposed by Zhao et al. [3] is based on gray level run length (RLRN) feature and chroma channel. Features are extracted using gray level RLRN vectors along four different directions from decorrelated chroma channel. Extracted features are introduced to SVM for classification.
Pevny et al. [4] proposed a method which is based on SPAM feature and modeled it as secondorder Markov matrix along certain directions, which is treated as discriminative feature for SVM classifier. Later, Kirchner and Fridrich [5] used SPAM and extended it to detect median filter of JPEG compressed image which is supportive for image tempering detection. Other than above proposed method Markov modelbased approach which utilizes local transition feature has shown promising splicing detection accuracy. He et al. [6] introduced Markov model in DCT domain as well as DWT domain. The difference coefficient array and transition probability matrix are modeled as feature vector and crossdomain Markov feature are considered as discriminative feature for SVM classifier. However, the proposed approach requires up to 7290 features. An enhanced state selection method is proposed by B. Su et al. [7]. In this approach, author considers some already proposed function model and maps the large number of coefficients extracted from transform domain to specific states. However, by reducing the number of features, this method sacrifices the detection performance. X. Zhao et al. [8] proposed a model in which an image is modeled as a 2D noncasual signal and captures the dependencies between the current node and its neighbors. This model is applied on BDCT and DMWT, and combined extracted features are used for classification. It is found that their method has better detection rate with the cost of higher dimension of 14,240.
As per the above discussion, it is concluded that Markov modelbased approach suffers from information loss and higher feature dimension, which is directly proportional to the threshold election. Larger threshold value can minimize information loss, but it will increase the feature dimension too, which can create overfitting problem, and detection capability will get reduced. Therefore, the choice of threshold becomes a tradeoff between the detection performance and computational cost. In this paper, an enhanced threshold method is proposed which gives much lesser dimension of features even with large threshold value, which improves the computational cost as well as the detection rate as discussed in step 3 in proposed work.
The rest of the paper is organized as follows. Section 2 shows proposed work algorithm framework. The experimental results and the comparison with other methods are depicted in Sect. 3, followed by conclusion in Sect. 4.
2 Proposed Method
In this paper, we proposed a model in which features are extracted from discrete cosine transform (DCT) and discrete Meyer wavelet transform (DMWT) domain and an enhanced threshold method is used to reduce the information loss as well as the computational cost, which results in improved detection capability. After all the related features are generated, SVM is used as classifier to distinguish the authentic and spliced image. The proposed algorithm framework is shown in Fig. 2.
2.1 Algorithm Flow

Divide the input image into \(8 \times 8\) nonoverlapping blocks.

DCT is applied on each subblock.

Round the coefficient and difference array is obtained in horizontal and vertical direction.

Enhanced threshold method is applied to calculate Markov matrix.

Above process is applied in DMWT domain also; considering dependency among wavelet coefficient, more Markov features are extracted.

Combine all the features, extracted from DCT and DWT domain.

SVM classifier is used to distinguish authentic and spliced image.
2.2 Extracting Splicing Artifacts
Feature Extraction in DCT Domain: The Markov feature in DCT and DWT is proposed in [6], in which correlation of neighboring coefficients is considered to differentiate authentic and spliced images. The process involved in calculation of difference arrays followed by transition probability matrix. Threshold value T introduced in [6] is to minimize the computational cost, which achieved a feature dimension of \((2\mathrm{T}+1) \times (2\mathrm{T}+1) \times 4\), but it is still on higher side. To minimize the dimension of feature vector and limit the overfitting problem, we introduced an enhanced threshold method which achieves much lesser feature dimension of \((\mathrm{T}+1) \times (\mathrm{T}+1) \times 4\). Proposed approach is explained in step 3 of this section. Markov features in DCT domain are computed as follows:
Step 1: In the first step, DCT coefficient is obtained by applying nonoverlapping \(8\times 8\) block discrete cosine transform (BDCT) on the input image and denoted as S. We used BDCT in our proposed model due to its energy compaction and decorrelation capability.
Step 2: In the second step, round the DCT coefficient to the nearest integer value. Then, horizontal \((F_{h})\) and vertical \((F_{v})\) difference array is calculated using the following equations:
where \(i \in \left[ 1,S_{m}1 \right] , j \in \left[ 1,S_{n}1 \right] \), and \(S_{m}\) and \(S_{n}\) is the dimension of input source image.
Step 3: Enhanced threshold method: Considering threshold T\(\left( T \in N_{+} \right) \), it is replaced with T or −T, if the value of an element in difference array is either >T or \({<}\)T, respectively, and the range of threshold we considered is \((u,v)\in \left\{ T,T+2, \ldots ,T+2, T \right\} \). Under given range, we calculate the horizontal and vertical Markov matrices using Eqs. (3), (4), (5), (6), which minimize the feature dimension to \(4\times (\mathrm{T}+1) \times (\mathrm{T}+1)\).
where (u, v) \(\in \left\{ T,T+2,T+4, \ldots T4,T2,T \right. \left. \right\} \), and S\(_m\) and S\(_n\) denote the dimension of original source image and
Finally, all the captured elements of the Markov matrix can be used as features for image splicing detection.
Similarly, interblock correlation is considered to extract more Markov features. Here, interblock difference 2D array is calculated using Eqs. (8) and (9).
where \(i \in \left[ 1,S_{m}1 \right] ,j \in \left[ 1,S_{n}1 \right] \) and, \(S_{m}\) and \(S_{n}\) is the dimension of original input image.
Now, enhanced threshold method is applied to the interblock difference array \(E_{h}(i,j)\) and \(E_{h}(i,j)\) as explained in step 3 where \(S_{m}1\), \(S_{m}2\), \(S_{n}1\), and \(S_{n}2\) are replaced with \(S_{m}8\), \(S_{m}16\), \(S_{n}8\), and \(S_{n}16\), respectively. Hence, by considering interblock correlation \(4\times (T+1) \times (T+1)\), more features have been extracted. Thus, a total of \(2 \times 4 \times (T+1) \times (T+1)\) features are extracted from DCT domain which can be used to distinguish the authentic image from spliced one.
Feature Extraction in DMWT Domain: Most of the previously proposed approach based on DWT [9, 10] deals with all the subbands independently after wavelet decomposition, but [6] shows that there is dependency among wavelet components across position, scales, and orientation. However, it is observed that among the three dependencies contribution of position and orientation is more than scale in splicing detection. So, in this paper, we only consider dependency across position and orientation. Hence, Markov features with different dependencies are extracted as follows.
Step 1: We apply twolevel discrete Meyer wavelet transform on the input image and round the coefficient of eight subbands to absolute value. Processed subbands are denoted as \(\left\{ W_{a}^{b},W_{h}^{b},W_{v}^{b},W_{d}^{b} \right\} \), where \(\text {b}=\left\{ 1,2 \right\} \).
Step 2: Consider dependency across position in DMWT domain, which is similar to characterize correlation between neighboring coefficients in DCT domain. Hence, by replacing F in Eqs. (1) and (2) with each of the eight subbands of DMWT domain followed by using Eqs. (3), (4), (5), and (6), we captured a total of \((T+1)\times (T+1) \times 32\) more Markov features.
Step 3: Now, considering the dependency among orientation, more features can be extracted using the following difference arrays.
where b = \(\left\{ 1,2 \right\} \) and \( W_{a}^{b}, W_{h}^{b}, W_{v}^{b}, W_{d}^{b}\) denote bth level approximation, horizontal, vertical, and diagonal subbands, respectively.
Now, \(F_{h}\) in Eqs. (3) and (4) is replaced by each of the difference arrays obtained in (10), (11), and (12) to capture more Markov matrix. Hence, \((T+1)\times (T+1)\times 12\) more Markov features are obtained.
By combining \((T+1)\times (T+1)\times 8\) Markov features captured in DCT domain and \((T+1)\times (T+1)\times 44\) Markov features captured in DMWT domain, resultant feature vector is used to differentiate spliced image from an authentic one. We choose threshold T = 6. So, we got a total of 2548 features.
3 Experimental Results and Performance Analysis
3.1 Dataset and Classifier
We use Columbia image dataset [11] provided by DVMM. It consists of 933 authentic and 912 spliced images without any postprocessing enhancement. All the forged images are spliced image. This dataset is designed to test the blind image splicing detection method. Some images from the DVMM dataset are shown in Fig. 3, in which first row shows the set of authentic images and second row shows the set of spliced images.
To classify the images, support vector machine (SVM) is used in our experiments. In this experiment, SVM classifier is trained to solve the binary decision problem (classification of authentic and spliced images).
To evaluate the performance, all the experiments are performed on Columbia image splicing dataset [11] using same classifier. In each experiment, 80% randomly selected images are used to train the SVM classifier and remaining 20% images are used for testing.
3.2 Performance Analysis of the Proposed Model
Some experiments are carried out to verify and compare the detection accuracy of the proposed approach. T is set to 6 in these experiments. Feature vectors from DCT and DMWT domain are captured and effect on the detection performance of the proposed method with Z. He et al. [6] is evaluated in both the domain. The obtained results are shown in Table 1 and Table 2, respectively. In Table 2, level 1 and level 2 represent the firstlevel DMWT and secondlevel DMWT, respectively. It can be observed from Table 1 and Table 2 that our method has improved the detection rate by approximately 1.0%–3.1% and 2.1%–2.3 % in BDCT and DMWT domains, respectively. Further, it is observed that by combining feature vectors from DCT and DMWT domains, we are getting much better accuracy.
Table 3 shows the comparison and detection rate of proposed work and some previous splicing detection methods [7, 12]. The complete implementation of the proposed method has achieved an accuracy of 88.17%, which makes a significant progress in splicing detection. In Table 3, true positive (TP) and true negative (TN) are calculated as:
where \(N_{ca}\) = number of correct authentic classification, \(N_{cs}\) = number of correct spliced classification, \(N_{a}\) = total number of authentic images, and \(N_{s}\) = total number of spliced images.
The experimental results of proposed and other methods are shown in Table 3. It can be observed that our proposed method performs best out of the three presented splicing detection scheme in Table 3.
3.3 Recognizing Real Images
In Fig. 4, publicly available on Internet, we have given three original images (b), (c), and (e) and their associated altered images (a) and (d). To test these five images (three authentic and two spliced), we trained the classifier using experiments mentioned in Sect. 3.1. The test has been performed 20 times, and the results are shown in Table 4. It can be observed that there are only four cases in which images are wrongly classified.
3.4 Threshold Selection
Selecting a threshold is an issue because in general for a smaller T value, information loss will be higher; in that case, Markov matrices may be insufficient to distinguish authentic and forged images, whereas a larger T value can reduce information loss but a larger number of features can generate an overfitting problem, which results in low detection performance. Therefore, the choice of T and size of Markov matrix have an important impact on detection performance and computational cost.
The performance analysis of proposed approach for different thresholds \(\left( T = 4, 6,\,\mathrm{and}\,8\right) \) is shown in Table 5. From Table 5, it can be observed that T = 6 is the best choice which balances the detection rate and computational cost with the accuracy of 88.43%.
4 Conclusion
In this paper, an enhanced threshold method is proposed to extract Markov feature which generates reduced feature set without any feature loss which improves the detection rate. Reduced feature sets are extracted from DCT and DMWT domains by performing difference operation followed by enhanced threshold method. Features extracted from DCT domain consider the correlation between the DCT coefficients, while DMWT domain distinguishes the dependency among coefficients across orientations and positions. Finally, the combined reduced feature vector from both the domain is considered as distinguished feature for classification. SVM is used as a classifier in our experiments. Our experimental results are encouraging, yielding the accuracy of over 88.43% correct classification which outperforms some stateoftheart methods.
References
Shi YQ, Chen C, Chen W. A natural image model approach to splicing detection. In Proceedings of the 9th workshop on Multimedia & security 2007 Sep 20, ACM, 51–62.
Li X, Jing T, Li X.: Image splicing detection based on moment features and HilbertHuang Transform. In Information Theory and Information Security (ICITIS), 2010 IEEE International Conference on 2010 Dec 17, IEEE, 1127–1130.
Zhao X, Li J, Li S, Wang S.: Detecting digital image splicing in chroma spaces. In International Workshop on Digital Watermarking 2010 Oct 1, Springer Berlin Heidelberg, 12–22.
Pevny T, Bas P, Fridrich J.: Steganalysis by subtractive pixel adjacency matrix. IEEE Transactions on information Forensics and Security. 2010 Jun, 5(2):215–224.
Kirchner M, Fridrich J.: On detection of median filtering in digital images. In IS&T/SPIE Electronic Imaging, International Society for Optics and Photonics, 2010 Feb 4, 754110–754110.
He Z, Lu W, Sun W, Huang J.: Digital image splicing detection based on Markov features in DCT and DWT domain. Pattern Recognition. 2012 Dec 31, 45(12):4292–4299.
Su B, Yuan Q, Wang S, Zhao C, Li S.: Enhanced state selection Markov model for image splicing detection. EURASIP Journal on Wireless Communications and Networking. 2014 Dec 1, 2014(1):1–10.
Zhao X, Wang S, Li S, Li J.: Passive imagesplicing detection by a 2D noncausal Markov model. IEEE Transactions on Circuits and Systems for Video Technology. 2015 Feb;25(2):185–199.
Chen W, Shi YQ, Su W.: Image splicing detection using 2d phase congruency and statistical moments of characteristic function. In Society of photooptical instrumentation engineers (SPIE) conference series 2007 Feb 15, (6505), 26.
Lu W, Sun W, Chung FL, Lu H.: Revealing digital fakery using multi resolution decomposition and higher order statistics. Engineering Applications of Artificial Intelligence. 2011 Jun 30;24(4):666–672.
Ng TT, Chang SF, Sun Q.: A data set of authentic and spliced image blocks. Columbia University, ADVENT Technical Report. 2004 Jun:203–204.
He Z, Sun W, Lu W, Lu H.: Digital image splicing detection based on approximate run length. Pattern Recognition Letters. 2011 Sep 1;32(12):1591–1597.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, A., Prakash, C.S., Maheshkar, S., Maheshkar, V. (2019). Markov Feature Extraction Using Enhanced Threshold Method for Image Splicing Forgery Detection. In: Panigrahi, B., Trivedi, M., Mishra, K., Tiwari, S., Singh, P. (eds) Smart Innovations in Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 670. Springer, Singapore. https://doi.org/10.1007/9789811089718_2
Download citation
DOI: https://doi.org/10.1007/9789811089718_2
Published:
Publisher Name: Springer, Singapore
Print ISBN: 9789811089701
Online ISBN: 9789811089718
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)