Environment Sound Recognition for Digital Audio Forensics Using Linear Predictive Coding Features

Obaid AlQahtani, Mubarak; Al mazyad, Abdulaziz S.

doi:10.1007/978-3-642-22410-2_26

Environment Sound Recognition for Digital Audio Forensics Using Linear Predictive Coding Features

Mubarak Obaid AlQahtani³ &
Abdulaziz S. Al mazyad⁴

Conference paper

1178 Accesses
1 Citations

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 189))

Abstract

Linear Predictive Coding coefficients are of the main extraction feature in digital forensic. In this paper, we perform several experiments focusing on the problems of environments recognition from audio particularly for forensic application. We investigated the effect of temporal Linear Predictive Coding coefficient as feature extraction on environment sound recognition to compute the Linear Predictive Coding coefficient for each frame for all files. The performance is evaluated against varying number of training sounds and samples per training file and compare with Zero Crossing feature and Moving Picture Experts Group-7 low level description feature. We use K-Nearest Neighbors as classifier feature to detect which the environment for any audio testing file. Experimental results show that higher recognition accuracy is achieved by increasing the number of training files and by decreasing the number of samples per training file.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Delp, E., Memon, N., Wu, M.: Digital Forensics. IEEE Signal Process. Magazine, 14–15 (2009)
Google Scholar
Broeders, A.P.A.: Forensic Speech and Audio Analysis: the State of the Art in 2000 AD. Actas del I Congreso de la Sociedad Espanola de Acustica Forense, March, 13-24 (2000)
Google Scholar
Campbell, W., et al.: Understanding Scores in Forensic Speaker Recognition. In: ISCA Speaker Recognition Workshop, June, 1-8 (2006)
Google Scholar
AES AES43-2000: AES Standard for Forensic Purposes - Criteria for the Authentication of Analog Audio Tape Recordings. Journal of the Audio Engineering Society 48(3), 204–214 (2000)
Google Scholar
Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs (1993)
MATH Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Willey, New York (2001)
MATH Google Scholar
Eronen, A.J., et al.: Audio-Based Context Recognition. IEEE Trans. Audio, Speech and Language Process 14(1), 321–329 (2006)
Article Google Scholar
Selina, C., Narayanan, S., Kuo, J.: Environmental sound recognition using MP-based features. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), pp. 1–4 (2008)
Google Scholar
Mallat, S., Zhang, Z.: Matching pursuits with time-frequency dictionaries. IEEE Trans. on Signal Processing 41(12), 3397–3415 (1993)
Article MATH Google Scholar
Selina, C., et al.: Where am I? Scene recognition for mobile robots using audio features. In: Proc. IEEE ICME 2006, pp. 885–888 (2006)
Google Scholar
Malkin, R.G., Waibel, A.: Classifying user environment for mobile applications using linear autoencoding of ambient audio. In: Proc. ICASSP 2005, pp. 509–512 (2005)
Google Scholar
Wang, J.C., et al.: Environmental sound classification using hybrid SVM/KNN classifier and MPEG-7 audio low-level descriptor. In: Proc. IEEE International Joint Conference on Neural Networks, Canada, pp. 1731–1735 (July 2006)
Google Scholar
Ntalampiras, S., Potamitis, I., FakotaKis, N.: Automatic recognition of urban environmental sounds events. In: Proc. CIP 2008, pp. 110–113 (2008)
Google Scholar
Kraetzer, C., et al.: Digital audio forensics: a first practical evaluation on microphone and environmental classification. In: Proc. ACM MultiMedia Security (MM&Sec), pp. 63–73 (2007)
Google Scholar
Alqahtani, M.O., Muhammad, G., Alotibi, Y.: Environment Sound Recognition using Zero Crossing Features and MPEG-7. In: ICADIWT 2010 (2010)
Google Scholar
http://en.wikipedia.org/wiki/Linear_prediction
Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
MATH Google Scholar
Yang, Y., Liu, X.: A Re-examination of Text Categorization Methods. In: Proc. 22nd Annual International ACM SIGIR Conf. Research and Development in Inform. Retrieval, pp. 42–49 (1999)
Google Scholar
Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: Proc. Euro. Conf. Machine Learn (1998)
Google Scholar
Baoli, L., et al.: A Comparative Study on Automatic Categorization Methods for Chinese Search Engine. In: Proc. Eighth Joint International Computer Conference Hangzhou, pp. 117–120 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Center of Excellence in Information Assurance, King Saud University, Riyadh, Saudi Arabia
Mubarak Obaid AlQahtani
College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
Abdulaziz S. Al mazyad

Authors

Mubarak Obaid AlQahtani
View author publications
You can also search for this author in PubMed Google Scholar
Abdulaziz S. Al mazyad
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Electrical Engineering and Computer Science, VSB-Technical University of Ostrava, VŠB-TUO, 17. listopadu 15, 708 33, Ostrava, Poruba, Czech Republic
Vaclav Snasel & Jan Platos &
Information Systems Department, King Saud University, 11543, Riyadh, Saudi Arabia
Eyas El-Qawasmeh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Obaid AlQahtani, M., Al mazyad, A.S. (2011). Environment Sound Recognition for Digital Audio Forensics Using Linear Predictive Coding Features. In: Snasel, V., Platos, J., El-Qawasmeh, E. (eds) Digital Information Processing and Communications. ICDIPC 2011. Communications in Computer and Information Science, vol 189. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22410-2_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-22410-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22409-6
Online ISBN: 978-3-642-22410-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics