Abstract
Linear Predictive Coding coefficients are of the main extraction feature in digital forensic. In this paper, we perform several experiments focusing on the problems of environments recognition from audio particularly for forensic application. We investigated the effect of temporal Linear Predictive Coding coefficient as feature extraction on environment sound recognition to compute the Linear Predictive Coding coefficient for each frame for all files. The performance is evaluated against varying number of training sounds and samples per training file and compare with Zero Crossing feature and Moving Picture Experts Group-7 low level description feature. We use K-Nearest Neighbors as classifier feature to detect which the environment for any audio testing file. Experimental results show that higher recognition accuracy is achieved by increasing the number of training files and by decreasing the number of samples per training file.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Delp, E., Memon, N., Wu, M.: Digital Forensics. IEEE Signal Process. Magazine, 14–15 (2009)
Broeders, A.P.A.: Forensic Speech and Audio Analysis: the State of the Art in 2000 AD. Actas del I Congreso de la Sociedad Espanola de Acustica Forense, March, 13-24 (2000)
Campbell, W., et al.: Understanding Scores in Forensic Speaker Recognition. In: ISCA Speaker Recognition Workshop, June, 1-8 (2006)
AES AES43-2000: AES Standard for Forensic Purposes - Criteria for the Authentication of Analog Audio Tape Recordings. Journal of the Audio Engineering Society 48(3), 204–214 (2000)
Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, Englewood Cliffs (1993)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Willey, New York (2001)
Eronen, A.J., et al.: Audio-Based Context Recognition. IEEE Trans. Audio, Speech and Language Process 14(1), 321–329 (2006)
Selina, C., Narayanan, S., Kuo, J.: Environmental sound recognition using MP-based features. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), pp. 1–4 (2008)
Mallat, S., Zhang, Z.: Matching pursuits with time-frequency dictionaries. IEEE Trans. on Signal Processing 41(12), 3397–3415 (1993)
Selina, C., et al.: Where am I? Scene recognition for mobile robots using audio features. In: Proc. IEEE ICME 2006, pp. 885–888 (2006)
Malkin, R.G., Waibel, A.: Classifying user environment for mobile applications using linear autoencoding of ambient audio. In: Proc. ICASSP 2005, pp. 509–512 (2005)
Wang, J.C., et al.: Environmental sound classification using hybrid SVM/KNN classifier and MPEG-7 audio low-level descriptor. In: Proc. IEEE International Joint Conference on Neural Networks, Canada, pp. 1731–1735 (July 2006)
Ntalampiras, S., Potamitis, I., FakotaKis, N.: Automatic recognition of urban environmental sounds events. In: Proc. CIP 2008, pp. 110–113 (2008)
Kraetzer, C., et al.: Digital audio forensics: a first practical evaluation on microphone and environmental classification. In: Proc. ACM MultiMedia Security (MM&Sec), pp. 63–73 (2007)
Alqahtani, M.O., Muhammad, G., Alotibi, Y.: Environment Sound Recognition using Zero Crossing Features and MPEG-7. In: ICADIWT 2010 (2010)
Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)
Yang, Y., Liu, X.: A Re-examination of Text Categorization Methods. In: Proc. 22nd Annual International ACM SIGIR Conf. Research and Development in Inform. Retrieval, pp. 42–49 (1999)
Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: Proc. Euro. Conf. Machine Learn (1998)
Baoli, L., et al.: A Comparative Study on Automatic Categorization Methods for Chinese Search Engine. In: Proc. Eighth Joint International Computer Conference Hangzhou, pp. 117–120 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Obaid AlQahtani, M., Al mazyad, A.S. (2011). Environment Sound Recognition for Digital Audio Forensics Using Linear Predictive Coding Features. In: Snasel, V., Platos, J., El-Qawasmeh, E. (eds) Digital Information Processing and Communications. ICDIPC 2011. Communications in Computer and Information Science, vol 189. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22410-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-22410-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22409-6
Online ISBN: 978-3-642-22410-2
eBook Packages: Computer ScienceComputer Science (R0)