Abstract
The performance of applying voice control in home automation can significantly drop under multi-resident situations and noisy environments. It therefore requires some appropriate approaches for smart home applications to address the problem of resident identification. Voice recognition, which explores characteristics of voice, is a potential biometric modality for such problem in smart home. In this paper, the power-normalized cepstral coefficient (PNCC) of voice biometrics is applied to identify individuals in smart home. A new technique of power-law nonlinearity and an algorithm of noise suppression based on asymmetric filtering are used to enhance feature extraction and reduce environmental noise. This proposed approach extremely reduces error rate and achieves high performance on different data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Benmansour, A., Bouchachia, A., Feham, M.: Multioccupant activity recognition in pervasive smart home environments. ACM Comput. Surv. 48(3), 34:1–34:36 (2015)
Cook, D.J.: Learning setting-generalized activity models for smart spaces. IEEE Intell. Syst. 27, 32–38 (2012)
Alemdar, H., Ertan, H., Incel, O.D., Ersoy, C.: Aras human activity datasets in multiple homes with multiple residents. PervasiveHealth’13. ICST, Brussels, Belgium, pp. 232–235 (2013)
Chen, R., Tong, Y.: A two-stage method for solving multi-resident activity recognition in smart environments. Entropy 16(4), 2184 (2014)
Alemdar, H., Ersoy, C.: Multi-resident activity tracking and recognition in smart environments. J. Ambient Intell. Hum. Comput. 8 (2017)
Rabiner, L.R.: Readings in speech recognition. A tutorial on hidden markov models and selected applications in speech recognition. Morgan Kaufmann Publishers Inc., San Francisco, pp. 267–296 (1990)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of 18th International Conference on Machine Learning. Morgan Kaufmann. pp. 282–289 (2001)
Son, N.T., Dung, N., Tung, S.N., Son, V.X., Long, H., Qing, Z., Mohan, K.: On multi-resident activity recognition in ambient smart-homes. In: Artificial Intelligence Review, Springer (2019)
Kim, C., Stern, R.M.: Power-normalized cepstral coefficients (PNCC) for robust speech recognition. IEEE/ACM Trans. Audio Speech Lang. 24(7), 1315–1329 (2016)
Cornaz, C., Hunkeler, U., Velisavljevic, V.: An Automatic Speaker Recognition System. Lausanne, Switzerland (2003)
Kumar, P., Vardhan, K., Krishna, K.: Performance evaluation of MLP for speech recognition in noise environments using MFCC & wavelets. Int. J. Comput. Sci. Commun. (IJCSC) 1(2), 41–45 (2010)
Muda, L., Began, M., Elamvazuthi, M.: Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. J. Comput, 139–140 (2010)
Dave, N.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Eng. Technol. 1–4 (2013)
Reynolds, D.A.: An overview of automatic speaker recognition technology. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2002)
Fazel, A., Chakrabartty, S.: An overview of statistical pattern recognition techniques for speaker verification. IEEE Circ. Syst. Mag. 62–81 (2011)
Togneri, R., Pullella, D.: An overview of speaker identification: accuracy and robustness issues. IEEE Circ. Syst. Mag. 11(2), 23–61 (2011)
Xihao, S., Miyanaga, Y.: Dynamic time warping for speech recognition with training part to reduce the computation. in: International Symposium on Signals, Circuits and Systems ISSCS2013, pp. 1–4 (2013)
Pawar, R.V, Kajave, P.P., Mali, S.N: Speaker identification using neural networks. World Acad. Sci. Eng. Technol. (2005)
Bhushan, C.K.: Speech recognition using artificial neural network. A Rev. Int. J. Comput. Commun. Instrum. Eng. 3(1) (2016)
Shahin, I., Botros, N.: Text-dependent speaker identification using hidden Markov model with stress compensation technique. in: Proceedings IEEE Southeastcon 98 Engineering for a New Era, Orlando, pp. 61–64 (1998)
Maesa, A., Garzia, F., Scarpiniti, M., Cusani, R.: Text independent automatic speaker recognition system using mel-frequency cepstrum coefficient and gaussian mixture models. J. Inf. Secur. 335–340 (2012)
Campbell, G.P.: Speaker recognition: a tutorial. Proc. IEEE 85(9), 1437–1462 (1997)
Acknowledgements
This work is supported by the University of Information Technology - Vietnam National University Ho Chi Minh City under grant No. D1-2020-09.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Duy, D., Dat, N.H., Tram, H.T.M., Son, N.H., Son, N.M. (2022). An Approach of Enhanced PNCC for Resident Identification Applications. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) ICT with Intelligent Applications. Smart Innovation, Systems and Technologies, vol 248. Springer, Singapore. https://doi.org/10.1007/978-981-16-4177-0_35
Download citation
DOI: https://doi.org/10.1007/978-981-16-4177-0_35
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-4176-3
Online ISBN: 978-981-16-4177-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)