An Approach of Enhanced PNCC for Resident Identification Applications

  • Conference paper
ICT with Intelligent Applications

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 248)

Abstract

The performance of voice control in home automation can drop significantly in multi-resident settings and noisy environments. Smart home applications therefore need appropriate approaches to the problem of resident identification. Voice recognition, which exploits characteristics of the voice, is a promising biometric modality for this problem. In this paper, power-normalized cepstral coefficients (PNCC) are applied as voice-biometric features to identify individuals in a smart home. A new power-law nonlinearity technique and a noise-suppression algorithm based on asymmetric filtering are used to enhance feature extraction and reduce environmental noise. The proposed approach substantially reduces the error rate and achieves high performance on different data sets.
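The two signal-processing ideas named in the abstract can be illustrated with a minimal sketch. It follows the general PNCC formulation of Kim and Stern (asymmetric lowpass tracking of the noise floor, followed by a power-law compression) and is not the enhanced variant proposed in this paper. The function names, the smoothing constants `lam_a` and `lam_b`, the initial value, the flooring factor, and the exponent 1/15 are assumptions chosen for illustration, and the input is taken to be the per-frame power of a single filterbank channel.

```python
import numpy as np

def asymmetric_lowpass(q_in, lam_a=0.999, lam_b=0.5):
    """Track the lower envelope (noise floor) of a per-frame power sequence.

    The output rises slowly (lam_a) when the input exceeds the previous
    estimate and falls quickly (lam_b) when it drops below it.
    """
    q_in = np.asarray(q_in, dtype=float)
    q_out = np.empty_like(q_in)
    prev = 0.9 * q_in[0]  # initial estimate (assumption)
    for m, q in enumerate(q_in):
        lam = lam_a if q >= prev else lam_b
        prev = lam * prev + (1.0 - lam) * q
        q_out[m] = prev
    return q_out

def suppress_noise(power, floor=1e-3):
    """Subtract the tracked noise floor and keep a small positive floor."""
    power = np.asarray(power, dtype=float)
    noise = asymmetric_lowpass(power)
    return np.maximum(power - noise, floor * power)

def power_law(power, exponent=1.0 / 15.0):
    """Compress power with a power-law nonlinearity instead of a logarithm."""
    return np.power(np.asarray(power, dtype=float), exponent)

# Hypothetical channel power: steady background with one speech-like burst.
frames = np.array([1.0, 1.0, 9.0, 1.0, 1.0])
features = power_law(suppress_noise(frames))
```

In this sketch the burst survives the subtraction almost unchanged, while the steady background is reduced to a small residual before compression. A full PNCC front end would also include gammatone filtering, medium-time power averaging, temporal masking, and a DCT, which are omitted here.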

Acknowledgements

This work is supported by the University of Information Technology - Vietnam National University Ho Chi Minh City under grant No. D1-2020-09.

Corresponding author

Correspondence to D. Duy.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Cite this paper

Duy, D., Dat, N.H., Tram, H.T.M., Son, N.H., Son, N.M. (2022). An Approach of Enhanced PNCC for Resident Identification Applications. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) ICT with Intelligent Applications. Smart Innovation, Systems and Technologies, vol 248. Springer, Singapore. https://doi.org/10.1007/978-981-16-4177-0_35
