Text-Independent Speaker Authentication with Spiking Neural Networks
This paper presents a novel system that performs text-independent speaker authentication using new spiking neural network (SNN) architectures. Each speaker is represented by a set of prototype vectors that is trained with standard Hebbian rule and winner-takes-all approach. For every speaker there is a separated spiking network that computes normalized similarity scores of MFCC (Mel Frequency Cepstrum Coefficients) features considering speaker and background models. Experiments with the VidTimit dataset show similar performance of the system when compared with a benchmark method based on vector quantization. As the main property, the system enables optimization in terms of performance, speed and energy efficiency. A procedure to create/merge neurons is also presented, which enables adaptive and on-line training in an evolvable way.
KeywordsSpiking Neural Network Speaker Authentication Brain-like Pattern Recognition Similarity Domain Normalization
Unable to display preview. Download preview PDF.
- 4.Delorme, A., Gautrais, J., van Rullen, R., Thorpe, S.: SpikeNet: a simulator for modeling large networks of integrate and fire neurons. Neurocomputing, 26–27, 986–996 (1999)Google Scholar
- 5.Wysoski, S.G., Benuskova, L., Kasabov, N.: On-line learning with structural adaptation in a network of spiking neurons for visual pattern recognition. In: Kollias, S., Stafylopatis, A., Duch, W., Oja, E. (eds.) ICANN 2006. LNCS, vol. 4131, pp. 61–70. Springer, Heidelberg (2006)CrossRefGoogle Scholar
- 6.Kuroyanagi, S., Iwata, A.: Auditory pulse neural network model to extract the inter-aural time and level difference for sound localization. Trans. of IEICE E77-D 4, 466–474 (1994)Google Scholar
- 7.Loiselle, S., Rouat, J., Pressnitzer, D., Thorpe, S.: Exploration of Rank Order Coding with spiking neural networks for speech recognition. IJCNN, Montreal, pp. 2076–2080 (2005)Google Scholar
- 8.Rouat, J., Pichevar, R., Loiselle, S.: Perceptive, non-linear speech processing and spiking neural networks. In: Chollet, G., Esposito, A., Faúndez-Zanuy, M., Marinaro, M. (eds.) Nonlinear Speech Modeling and Applications. LNCS (LNAI), vol. 3445, pp. 317–337. Springer, Heidelberg (2005)Google Scholar
- 10.Gold, B., Morgan, N.: Speech and Audio Signal Processing. John Wiley & Sons, Chichester (2000)Google Scholar
- 11.Rabiner, L., Juang, B.: Fundamentals of Speech Recognition. Prentice Hall, New Jersey (1993)Google Scholar
- 13.Bothe, S.M., La Poutre, H.A., Kok, J.N.: Unsupervised clustering with spiking neurons by sparse temporal coding and multi-layer RBF networks. IEEE Trans. Neural Networks 10(2), 426–435 (2002)Google Scholar