Skip to main content
Log in

Cancelable speaker identification based on cepstral coefficients and comb filters

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

This paper presents a proposed approach for cancelable biometric recognition related to speaker identification. Both comb and inverse comb filters are used to distort speech signals intentionally prior to and after feature extraction. The objective of this approach is to generate protected templates of speech signals representing speakers without subjecting the original speech signals or features to violations of attackers. Both comb and inverse comb filters are used for this purpose. In addition, a satisfactory performance represented in the speaker identification rate is retained. In case the database of speaker features is compromised, it is possible to change the comb or inverse comb filter orders to generate new features for the same speakers. Simulation results reveal the possibility to identify speakers from their deteriorated speech signals, which proves the robustness of the proposed cancelable speaker identification system. The reason of choosing the comb filter is that it is a multi-band filter with multiple nulls in its frequency response. Hence, its inversion is difficult due to the nulling effect. This characteristic can induce non-invertible distortion in speech signals.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25

Similar content being viewed by others

References

  • Abd El-Samie, F. E. (2017). Information security for automatic speaker identification. International Journal of Speech Technology., 20, 977–1004.

    Article  Google Scholar 

  • Al-Nuaimy, W., El-Bendary, M. A. M., Shafik, A., Shawki, F., Abou-El-azm, A. E., El-Fishawy, N. A., et al. (2011). An SVD audio watermarking approach using chaotic encrypted image. Digital Signal Process, 21(6), 764–779.

    Article  Google Scholar 

  • Asaker A. A., Sharkawy, Z. F. E. L., Nassar, S., Ayad, N., Zahran, O. & Abd EL-Samie F. E. (2020). A novel cancellable Iris template generation based on salting approach. Multimedia Tools and Applications.

  • Chauhan, N. & Chandra, M. (2017). Speaker recognition and verification using artificial neural network. In International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET).

  • Chen, H., & Chen, H. (2010). A hybrid scheme for securing fingerprint templates. International Journal of Information Security, 9, 353–361.

    Article  Google Scholar 

  • Chuang, C. T., Chang, T., Chiang, Y. T., & Chang, F. R. (2016). Heart rate monitoring using a slow–fast adaptive comb filter to eliminate motion artifacts. Journal of Medical and Biometric Engineering, 36, 833–842.

    Google Scholar 

  • Dolecek, G. J. (2017). Advances in multirate systems. Springer, pp. 59–81.

  • Fredrickson, S. E. & Tarassenko L. (1995). Text-independent speaker recognition using neural network techniques. In 4th International Conference on Artificial Neural Networks, pp. 13 18.

  • Furui, S. An overview of speaker recognition technology. Automatic Speech and Speaker Recognition, pp. 31–56.

  • Furui, S. (1996). An overview of speaker recognition technology. The Kluwer International Series in Engineering and Computer Science, 355, 31–55.

    Article  Google Scholar 

  • Hossain, M., Ahmed, B., Asrafi, M. (2007). A real Time Speaker Identification using artificial Network. In 10th international conference on Computer and Information Technology 2007, ICCIT2008, pp. 1–5, Dec. 2007, Dhaka.

  • Jimenez, M. G. C., Romero, D. E. T & Dolecek, G.J. (2015). Comb filters characteristics and applications. Encyclopedia of Information Science and Technology, Third Edition, pp. 4062–4071.

  • Kuo, S. M., Lee, B. H. & Tain, W. Real-time digital signal processing implementations and applications. Second Edition.

  • Nair, P. G. & Nair, R. (2015). Efficient speaker identification using artificial neural network. IJECT l6(1).

  • Nikolic, M., & Lutovac, M. (2011). Sharpening of the multistage modified comb filters. Serbian Journal of Electrical Engineering, 8(2), 281–291.

    Article  Google Scholar 

  • Park, J.S. (2013). Advanced technologies, embedded and multimedia for human-centric computing. pp. 31–36.

  • Pilt, K., Meigas, K., Karai, D. & Kaik, J. (2009). PPG signal processing for pulse delay computing by using adaptive comb filter. World Congress on Medical Physical and Biometric Engineering, pp. 1653–1656.

  • Poostchi, M., Kamkar, I. & Mohebbi, J. (2010). Soft computing in industrial applications. 75:165–173.

  • Kumar. J., Prabhakar, OP. & Sahu, N. S. (2014). Comparative analysis of different feature extraction and classifier techniques for speaker identification systems: a review. 2(1).

  • Prochazka, A., Uhlir, J., Rayner, P. J. W., & Kingsbury, N. J. (1998). Signal Analysis and Prediction. New York: Birkhauser Inc.

    Book  Google Scholar 

  • Saha, G., Kumar, P. & Chakroborty, S. (2004). A comparative study of feature extraction algorithms on ANN based speaker model for speaker recognition applications. In International Conference on Neural Information Processing, ICONIP 2004:Neural Information Processing, pp. 1192–1197.

  • Shafik, A., Elhalafawy, S. M., Diab, S. M., Sallam, B. M., & Abd El-samie, F. E. (2009). A wavelet based approach for speaker identification from degraded speech. International Journal of Communication Networks and Information Security (IJCNIS), 1(3), 53–60.

    Google Scholar 

  • Soliman, R. F., Amin, M., & Abd El-Samie, F. E. (2020). Cancelable Iris recognition system based on comb filter. Multimedia Tools and Applications, 79, 2521–2541.

    Article  Google Scholar 

  • Soliman, N. F., Mostfa, Z., Abd El-Samie, F. E., & Abdalla, M. I. (2017). Performance enhancement of speaker identification systems using speech encryption and cancelable features. International Journal of Speech Technology, 20(4), 977–1004.

    Article  Google Scholar 

  • Tirumala, S. S., Shahamiri, S. R., Garhwal, A. S., & Wang, R. (2017). Speaker identification features extraction methods a systematic review. Expert Systems with Applications, 90, 250–271.

    Article  Google Scholar 

  • Trang, H., Loc, T.H., Nam, H.B.H. (2014). Proposed combination of PCA and MFCC feature extraction in speech recognition system. International Conference on Advanced Technologies for Communications (ATC 2014).

  • Wang, W., Li, S. Yang, J. Liu, Z. & Zhou, W.O. (2016). Feature extraction of underwater target in auditory sensation area based on MFCC., IEEE/OES China Ocean Acoustics (COA).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohamed Monir.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Monir, M., Kareem, M., El-Dolil, S.M. et al. Cancelable speaker identification based on cepstral coefficients and comb filters. Int J Speech Technol 25, 471–492 (2022). https://doi.org/10.1007/s10772-021-09804-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-021-09804-4

Keywords

Navigation