An Approach of Enhanced PNCC for Resident Identification Applications

Duy, D.; Dat, N. H.; Tram, H. T. M.; Son, N. H.; Son, N. M.

doi:10.1007/978-981-16-4177-0_35

D. Duy⁷,
N. H. Dat⁷,
H. T. M. Tram⁷,
N. H. Son⁷ &
…
N. M. Son⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 248))

673 Accesses
1 Citations

Abstract

The performance of applying voice control in home automation can significantly drop under multi-resident situations and noisy environments. It therefore requires some appropriate approaches for smart home applications to address the problem of resident identification. Voice recognition, which explores characteristics of voice, is a potential biometric modality for such problem in smart home. In this paper, the power-normalized cepstral coefficient (PNCC) of voice biometrics is applied to identify individuals in smart home. A new technique of power-law nonlinearity and an algorithm of noise suppression based on asymmetric filtering are used to enhance feature extraction and reduce environmental noise. This proposed approach extremely reduces error rate and achieves high performance on different data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 229.00; Price excludes VAT (USA)

Softcover Book: USD 299.99; Price excludes VAT (USA)

Hardcover Book: USD 299.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Benmansour, A., Bouchachia, A., Feham, M.: Multioccupant activity recognition in pervasive smart home environments. ACM Comput. Surv. 48(3), 34:1–34:36 (2015)
Google Scholar
Cook, D.J.: Learning setting-generalized activity models for smart spaces. IEEE Intell. Syst. 27, 32–38 (2012)
Article Google Scholar
Alemdar, H., Ertan, H., Incel, O.D., Ersoy, C.: Aras human activity datasets in multiple homes with multiple residents. PervasiveHealth’13. ICST, Brussels, Belgium, pp. 232–235 (2013)
Google Scholar
Chen, R., Tong, Y.: A two-stage method for solving multi-resident activity recognition in smart environments. Entropy 16(4), 2184 (2014)
Google Scholar
Alemdar, H., Ersoy, C.: Multi-resident activity tracking and recognition in smart environments. J. Ambient Intell. Hum. Comput. 8 (2017)
Google Scholar
Rabiner, L.R.: Readings in speech recognition. A tutorial on hidden markov models and selected applications in speech recognition. Morgan Kaufmann Publishers Inc., San Francisco, pp. 267–296 (1990)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of 18th International Conference on Machine Learning. Morgan Kaufmann. pp. 282–289 (2001)
Google Scholar
Son, N.T., Dung, N., Tung, S.N., Son, V.X., Long, H., Qing, Z., Mohan, K.: On multi-resident activity recognition in ambient smart-homes. In: Artificial Intelligence Review, Springer (2019)
Google Scholar
Kim, C., Stern, R.M.: Power-normalized cepstral coefficients (PNCC) for robust speech recognition. IEEE/ACM Trans. Audio Speech Lang. 24(7), 1315–1329 (2016)
Article Google Scholar
Cornaz, C., Hunkeler, U., Velisavljevic, V.: An Automatic Speaker Recognition System. Lausanne, Switzerland (2003)
Google Scholar
Kumar, P., Vardhan, K., Krishna, K.: Performance evaluation of MLP for speech recognition in noise environments using MFCC & wavelets. Int. J. Comput. Sci. Commun. (IJCSC) 1(2), 41–45 (2010)
Google Scholar
Muda, L., Began, M., Elamvazuthi, M.: Voice recognition algorithms using mel frequency cepstral coefficient (MFCC) and dynamic time warping (DTW) techniques. J. Comput, 139–140 (2010)
Google Scholar
Dave, N.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Eng. Technol. 1–4 (2013)
Google Scholar
Reynolds, D.A.: An overview of automatic speaker recognition technology. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2002)
Google Scholar
Fazel, A., Chakrabartty, S.: An overview of statistical pattern recognition techniques for speaker verification. IEEE Circ. Syst. Mag. 62–81 (2011)
Google Scholar
Togneri, R., Pullella, D.: An overview of speaker identification: accuracy and robustness issues. IEEE Circ. Syst. Mag. 11(2), 23–61 (2011)
Google Scholar
Xihao, S., Miyanaga, Y.: Dynamic time warping for speech recognition with training part to reduce the computation. in: International Symposium on Signals, Circuits and Systems ISSCS2013, pp. 1–4 (2013)
Google Scholar
Pawar, R.V, Kajave, P.P., Mali, S.N: Speaker identification using neural networks. World Acad. Sci. Eng. Technol. (2005)
Google Scholar
Bhushan, C.K.: Speech recognition using artificial neural network. A Rev. Int. J. Comput. Commun. Instrum. Eng. 3(1) (2016)
Google Scholar
Shahin, I., Botros, N.: Text-dependent speaker identification using hidden Markov model with stress compensation technique. in: Proceedings IEEE Southeastcon 98 Engineering for a New Era, Orlando, pp. 61–64 (1998)
Google Scholar
Maesa, A., Garzia, F., Scarpiniti, M., Cusani, R.: Text independent automatic speaker recognition system using mel-frequency cepstrum coefficient and gaussian mixture models. J. Inf. Secur. 335–340 (2012)
Google Scholar
Campbell, G.P.: Speaker recognition: a tutorial. Proc. IEEE 85(9), 1437–1462 (1997)
Article Google Scholar

Download references

Acknowledgements

This work is supported by the University of Information Technology - Vietnam National University Ho Chi Minh City under grant No. D1-2020-09.

Author information

Authors and Affiliations

VNUHCM -University of Information Technology, Ho Chi Minh City, 70000, Vietnam
D. Duy, N. H. Dat, H. T. M. Tram, N. H. Son & N. M. Son

Authors

D. Duy
View author publications
You can also search for this author in PubMed Google Scholar
N. H. Dat
View author publications
You can also search for this author in PubMed Google Scholar
H. T. M. Tram
View author publications
You can also search for this author in PubMed Google Scholar
N. H. Son
View author publications
You can also search for this author in PubMed Google Scholar
N. M. Son
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to D. Duy .

Editor information

Editors and Affiliations

University of the Ryukyus, Okinawa, Japan
Tomonobu Senjyu
Sinhgad Technical Education society, SKNCOE, Pune, India
Parikshit N. Mahalle
Computer Science, Faculty of CS and IT, Universiti Putra Malaysia, Seri Kembangan, Malaysia
Thinagaran Perumal
Global Knowledge Research Foundation, Ahmedabad, India
Amit Joshi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Duy, D., Dat, N.H., Tram, H.T.M., Son, N.H., Son, N.M. (2022). An Approach of Enhanced PNCC for Resident Identification Applications. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) ICT with Intelligent Applications. Smart Innovation, Systems and Technologies, vol 248. Springer, Singapore. https://doi.org/10.1007/978-981-16-4177-0_35

Download citation

DOI: https://doi.org/10.1007/978-981-16-4177-0_35
Published: 06 December 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-4176-3
Online ISBN: 978-981-16-4177-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

An Approach of Enhanced PNCC for Resident Identification Applications