Abstract
This work presents a novel method for assessing audio authenticity. Assuming that the electric network frequency is embedded in audio signals, the evaluation of audio integrity is carried out by detecting phase discontinuities. This is conducted by using causal and anti-causal filters, in order to avoid the mix of past and future phase information related to the time of analysis. This local phase change is then post-processed and thresholded to obtain the editing times. One remarkable property of the proposed method is its ability to withstand MP3 compression, an audio format widely used in practice. A more accurate evaluation metric is also introduced in this work. For that purpose, the databases used for evaluating the algorithm were automatically labeled indicating the editing times. The procedure to generate the ground truth is presented, as well as a discussion on the proposed metric. The performance of the technique presented promising results when evaluated on digitally edited and original audio signals.
Similar content being viewed by others
Notes
The tolerance parameter \(\tau \) was chosen to avoid losing editing points. Our assumption is that the duration of a short word is at least 500 ms, so in the case of the insertion of short words, the value \(\tau =250\) ms permits an evaluation of the system’s performance without losing any editing point.
References
I.F. Apolinário , C.A. Rossi, Time-stamp of digital audio recording based on the ENF estimated from another audio signal, in I IEEE Latin American Symposium on Circuits and Systems (LASCAS) (2010)
T. Bianchi, A.D. Rosa, M. Fontani, G. Rocciolo, A. Piva, Detection and localization of double compression in MP3 audio tracks. EURASIP J. Inf. Secur. 2, 1–14 (2014)
M. Bosi, R.E. Goldberg, Introduction to Digital Audio Coding and Standars (Kluwer Academic Publishers, Alphen aan den Rijn, 2003)
B. D’Alessandro, Y.Q. Shi, MP3 bit rate quality detection through frequency spectrum analysis, in Proceedings of the 11th ACM Workshop on Multimedia and Security, pp. 57–62. ACM, New York, (2009). https://doi.org/10.1145/1597817.1597828
P.A.A. Esquef, J.A. Apolinário Jr., L.W.P. Biscainho, Edit detection in speech recordings via instantaneous electric network frequency variations. IEEE Trans. Inf. Forensics Secur. 9(12), 2314–2326 (2014). https://doi.org/10.1109/TIFS.2014.2363524
P.A.A. Esquef, J.A. Apolinário Jr., L.W.P. Biscainho, Improved edit detection in speech via ENF patterns, in 2015 IEEE International Workshop on Information Forensics and Security (WIFS), pp. 1–6 (2015). https://doi.org/10.1109/WIFS.2015.7368585
H. Farid, Image forgery detection. IEEE Signal Process. Mag. 26(2), 16–25 (2009)
FFmpeg: A complete, cross-platform solution to record, convert and stream audio and video. http://www.ffmpeg.org. Accessed 10 June 2016
M. Fuentes, P. Zinemanas, P. Cancela, J.A. Apolinário Jr., Detection of ENF discontinuities using PLL for audio authenticity, in VII IEEE Latin American Symposium on Circuits and Systems (LASCAS), pp. 79–82 (2016). https://doi.org/10.1109/LASCAS.2016.7451014
R. Garg, A.L. Varna, A. Hajj-Ahmad, M. Wu, Seeing ENF: power-signature-based timestamp for digital multimedia via optical sensing and signal processing. IEEE Trans. Inf. Forensics Secur. 8(9), 1417–1432 (2013)
C. Grigoras, Forensic analysis of digital recordings—the electric network frequency criterion, in Forensic Science International (2003)
C. Grigoras, Digital audio recording analysis, the electric network frequency (ENF) criterion. Int. J. Speech Lang. Law 12, 43–49 (2005). https://doi.org/10.1558/sll.2005.12.1.63
C. Grigoras , Applications of ENF criterion in forensic audio, video, computer and telecommunication analysis (Selected Articles of the 4th European Academy of Forensic Science Conference (EAFS2006) June 13–16, 2006 Helsinki, Finland), pp. 136–145 (2007). https://doi.org/10.1016/j.forsciint.2006.06.033. http://www.sciencedirect.com/science/article/pii/S0379073806004312
A. Hajj-Ahmad, R. Garg, M. Wu, Spectrum combining for ENF signal estimation. IEEE Signal Process. Lett. 20(9), 885–888 (2013)
A. Hajj-Ahmad, R. Garg, M. Wu, ENF-based region-of-recording identification for media signals. IEEE Trans. Inf. Forensics Secur. 10(6), 1125–1136 (2015). https://doi.org/10.1109/TIFS.2015.2398367
M. Huijbregtse, Z. Geradts, Using the ENF criterion for determining the time of recording of short digital audio recordings, in Springer Series Lecture Notes in Computer Science vol. 5718, pp. 116–124 (2009)
R. Korycki, Authenticity investigation of digital audio recorded as MP3 files. Probl. Kryminal. 283(1), 54–67 (2014)
Q. Liu, A.H. Sung, M. Qiao, Detection of double MP3 compression. Cogn. Comput. 2(4), 291–296 (2010). https://doi.org/10.1007/s12559-010-9045-4
R.C. Maher, Audio forensic examination: authenticity, enhancement, and interpretation. IEEE Signal Process. Mag. 26(2), 84–94 (2009). https://doi.org/10.1109/MSP.2008.931080
D.P. Nicolalde-Rodríguez, J.A. Apolinário Jr., Evaluating digital audio authenticity with spectral distances and ENF phase change, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1417–1420 (2009)
D.P. Nicolalde-Rodríguez, J.A. Apolinário Jr., L.W.P. Biscainho, Audio authenticity: detecting ENF discontinuity with high precision phase analysis. IEEE Trans. Inf. Forensics Secur. 5(3), 534–543 (2010). https://doi.org/10.1109/TIFS.2010.2051270
D.P. Nicolalde-Rodríguez, J.A. Apolinario Jr., L.W.P. Biscainho, Audio authenticity based on the discontinuity of ENF higher harmonics, in Proceedings of the 21st European Signal Processing Conference (EUSIPCO), IEEE, pp. 1–5 (2013)
J. Ortega-Garcia, J. Gonzalez-Rodriguez, V. Marrero-Aguiar, AHUMADA: A large speech corpus in Spanish for speaker characterization and identification. Speech Commun. 31(2), 255–264 (2000). https://doi.org/10.1016/S0167-6393(99)00081-3. http://www.sciencedirect.com/science/article/pii/S0167639399000813. Accessed 20 July 2017
M. Qiao, A.H. Sung, Q. Liu, Revealing real quality of double compressed MP3 audio, in Proceedings of the 18th ACM International Conference on Multimedia, MM ’10, pp. 1011–1014. ACM, New York (2010). https://doi.org/10.1145/1873951.1874137
M. Qiao, A.H. Sung, Q. Liu, Improved detection of MP3 double compression using content-independent features, in 2013 IEEE International Conference on Signal Processing, Communication and Computing (ICSPCC 2013), pp. 1–4 (2013). https://doi.org/10.1109/ICSPCC.2013.6664121
P.M.G.I. Reis, J.P.C.L. da Costa, R.K. Miranda, ESPRIT-Hilbert-based audio tampering detection with SVM classifier for forensic analysis via electrical network frequency. IEEE Trans. Inf. Forensics Secur. 12(4), 853–864 (2017). https://doi.org/10.1109/TIFS.2016.2636095
H. Sakoe, S. Chiba, Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 26, 43–49 (1978)
R.W. Sanders, Digital audio authenticity using the electric network frequency, in Audio Engineering Society Conference: 33rd International Conference: Audio Forensics-Theory and Practice (2008). http://www.aes.org/e-lib/browse.cfm?elib=14403. Accessed 20 July 2017
M.C. Stamm, M. Wu, K.J.R. Liu, Information forensics: an overview of the first decade. IEEE Access 1, 167–200 (2013)
R. Yang, Z. Qu, J. Huang, Exposing MP3 audio forgeries using frame offsets. ACM Trans. Multimed. Comput. Commun. Appl. 8(2S), 35:1–35:20 (2012). https://doi.org/10.1145/2344436.2344441
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Zinemanas, P., Fuentes, M., Cancela, P. et al. An ENF-Based Audio Authenticity Method Robust to MP3 Compression. Circuits Syst Signal Process 37, 4973–4992 (2018). https://doi.org/10.1007/s00034-018-0793-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00034-018-0793-9