An ENF-Based Audio Authenticity Method Robust to MP3 Compression

Circuits, Systems, and Signal Processing

Abstract

This work presents a novel method for assessing audio authenticity. Assuming that the electric network frequency (ENF) is embedded in the audio signal, audio integrity is evaluated by detecting phase discontinuities in the ENF component. Detection relies on causal and anti-causal filters, which avoid mixing past and future phase information relative to the time of analysis. The resulting local phase change is then post-processed and thresholded to obtain the editing times. A notable property of the proposed method is its ability to withstand MP3 compression, an audio format widely used in practice. A more accurate evaluation metric is also introduced. For that purpose, the databases used for evaluating the algorithm were automatically labeled with the editing times. The procedure to generate the ground truth is presented, along with a discussion of the proposed metric. The technique shows promising results when evaluated on digitally edited and original audio signals.
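The idea of keeping past and future phase information separate can be illustrated with a small sketch. This is not the filter design of the paper: as a stand-in for the causal and anti-causal filters, it assumes two one-sided rectangular windows, one covering only samples before the analysis time and one covering only samples from the analysis time onward, and estimates the phase at the nominal ENF with a single-bin correlation. All names (`single_bin`, `phase_change_curve`, `make_edited_tone`) and parameter values are hypothetical.

```python
import cmath
import math


def single_bin(x, start, stop, f0, fs):
    """Correlate x[start:stop] with a complex exponential at f0.

    The exponent uses the absolute sample index k, so two windows over an
    unedited tone yield the same phase.
    """
    acc = 0j
    for k in range(start, stop):
        acc += x[k] * cmath.exp(-2j * math.pi * f0 * k / fs)
    return acc


def phase_change_curve(x, f0, fs, win):
    """Absolute phase difference between future-only and past-only estimates.

    Entry i of the result corresponds to analysis sample n = i + win.
    Assumption: one-sided rectangular windows replace the paper's filters.
    """
    curve = []
    for n in range(win, len(x) - win + 1):
        past = single_bin(x, n - win, n, f0, fs)    # samples before n only
        future = single_bin(x, n, n + win, f0, fs)  # samples from n onward only
        curve.append(abs(cmath.phase(future * past.conjugate())))
    return curve


def make_edited_tone(n_samples, f0, fs, edit_sample, jump):
    """Synthetic ENF-like tone whose phase jumps by `jump` at `edit_sample`."""
    return [
        math.cos(2 * math.pi * f0 * k / fs + (jump if k >= edit_sample else 0.0))
        for k in range(n_samples)
    ]
```

On a synthetic 50 Hz tone with a phase jump, the curve stays near zero away from the edit and peaks close to it; thresholding that curve would then give candidate editing times. Real recordings would need band-pass filtering around the ENF and the post-processing described in the abstract, neither of which is modeled here.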

Notes

  1. http://speechpro-usa.com/product/forensic_analysis/editracker.

  2. The tolerance parameter \(\tau \) was chosen to avoid losing editing points. Our assumption is that the duration of a short word is at least 500 ms, so in the case of the insertion of short words, the value \(\tau =250\) ms permits an evaluation of the system’s performance without losing any editing point.
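The role of the tolerance in note 2 can be sketched as follows. This is an illustrative matching rule under stated assumptions, not the metric defined in the paper: each labeled editing time is paired with at most one detection lying within \(\tau \) of it, and the function name `match_edits` and its return values are hypothetical.

```python
def match_edits(detected, truth, tau=0.25):
    """Match detected editing times to labeled ones within a tolerance.

    Times are in seconds; tau=0.25 corresponds to the 250 ms of note 2.
    Assumption: greedy one-to-one matching, nearest detection first.
    Returns (hits, misses, false_alarms).
    """
    remaining = sorted(detected)
    hits = 0
    for t in sorted(truth):
        candidates = [d for d in remaining if abs(d - t) <= tau]
        if candidates:
            best = min(candidates, key=lambda d: abs(d - t))
            remaining.remove(best)  # each detection may match only once
            hits += 1
    misses = len(truth) - hits
    false_alarms = len(remaining)
    return hits, misses, false_alarms
```

With two labeled editing points 500 ms apart, as at the boundaries of an inserted short word, a tolerance of 250 ms keeps the two acceptance intervals from overlapping beyond their shared endpoint, so a detection near one point is not credited to the other.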

Author information

Corresponding author

Correspondence to Pablo Zinemanas.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (wav 2422 KB)

Supplementary material 2 (wav 2646 KB)

Cite this article

Zinemanas, P., Fuentes, M., Cancela, P. et al. An ENF-Based Audio Authenticity Method Robust to MP3 Compression. Circuits Syst Signal Process 37, 4973–4992 (2018). https://doi.org/10.1007/s00034-018-0793-9

  • DOI: https://doi.org/10.1007/s00034-018-0793-9
