Abstract
We proposed a manipulation detection method for interrogation speech. We used a robust fingerprinting method optimized for speech since our intended target is interrogation speech recorded during a police investigation. The fingerprint uses line spectral pairs (LSP) to measure the spectral envelope of the speech, and is coarsely quantized so that the fingerprint will not be altered by small degradation in the signal, but will be altered enough by malicious modifications to the speech content. This fingerprint is embedded in the speech signal using conventional spread-spectrum watermarks. To detect manipulation, the watermarked fingerprint is detected, and compared to the fingerprint extracted from the speech itself. If the fingerprints match within the predetermined tolerance, it can be authenticated to be unaltered. Otherwise, manipulation should be suspected. We conducted initial experiments to verify the feasibility of the proposed method, and confirmed that at the utterance level, we can identify all substitution manipulated speech utterances successfully.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Advisory Panel on White House Tapes: Report on a technical investigation conducted for the U.S. District Court for the District of Columbia by the advisory panel on White House tapes. Technical report, U.S. District Court for the District of Columbia, May 1974
Boney, L., Tewfik, A.H., Hamdy, K.N.: Digital watermarks for audio signals. In: Proceedings of IEEE International Conference on Multimedia Computing and Systems. IEEE, Hiroshima (1996)
Itakura, F.: Line spectrum representation of linear prediction coefficients of speech signals. J. Acoust. Soc. Am. 57, 535 (1975)
Kukucka, J.: Lights, camera, justice: the value of recording police investigations. The Huffington Post online article, July 2014. http://www.huffingtonpost.com/jeff-kukucka/lights-camera-justice-the_b_5404579.html
Kyodo News: Japanese police to tape all interrogations of suspects facing lay judge trials. The Japan Times online article, September 2016. http://www.japantimes.com/news/2016/09/16/crime-legal/japanese-police-tape-interrogations-suspect-facing-lay-judge-trials/
NII Speech Resources Consortium: ASJ continuous speech corpus for research. http://research.nii.ac.jp/src/en/ASJ-JIPDEC.html. Accessed 2 Mar 2016
Sugamura, N., Itakura, F.: Speech data compression by LSP analysis-synthesis technique. Trans. Inst. Electron. Inf. Commun. Eng. J64-A(8) (1981). (in Japanese)
Tousignant, L.: The secret of Nixon tape’s 18-minute gap revealed. New York Post online article, August 2014. http://nypost.com/2014/08/03/after-40-years-john-dean-re-examines-nixon-tapes-18-minute-gap
Acknowledgments
This work was supported in part by the Cooperative Research Project Program of the Research Institute of Electrical Communication, Tohoku University (H26/A14).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Takahashi, S., Kondo, K. (2018). Towards an Interrogation Speech Manipulation Detection Method Using Speech Fingerprinting. In: Pan, JS., Tsai, PW., Watada, J., Jain, L. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. IIH-MSP 2017. Smart Innovation, Systems and Technologies, vol 82. Springer, Cham. https://doi.org/10.1007/978-3-319-63859-1_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-63859-1_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63858-4
Online ISBN: 978-3-319-63859-1
eBook Packages: EngineeringEngineering (R0)