Robustness Analysis of Feature Extractors for Ethnic Identification of Malaysian English Accents Database

  • Conference paper

Abstract

Accent is a fascinating aspect of human speech behavior that can mark the personal identity and social characteristics of its bearer. However, it can also bias social interaction, with consequences for prestige, perceived job competency, and ethnic discrimination. Although many methods have successfully identified speaker accents in the past, their success rates are most likely database-dependent. This study investigates the identification of Malaysian English (MalE) accents arising from the country's ethnic diversity. A robustness analysis was conducted at seven noise levels by corrupting the speech signals with additive white Gaussian noise (AWGN), in order to investigate the performance of four schemes of feature extractors under clean and noisy conditions. The first scheme is filter bank analysis, which consists of mel-frequency cepstral coefficients (MFCC) and a newly formulated feature set named descriptors of mel-bands spectral energy (MBSE); principal component analysis (PCA) was applied to MBSE to obtain a transformed feature set called PCA-MBSE. The second is vocal tract analysis, which consists of linear prediction coefficients (LPC) and formant frequencies (formants). The third is hybrid analysis, which combines the discrete wavelet transform (DWT) with LPC. The last scheme is fusions of spectral features (SFFs): MFCC with formants and LPC with formants. Experimental results showed that the SFFs were more noise-resistant than the MFCC, LPC, MBSE, and DWT-derived LPC features. Similarly, PCA-transformed MBSE was only moderately affected compared to the original features. PCA-MBSE suffered an average performance drop of 15 %, and the SFFs were only slightly affected by AWGN, with drops of 8 to 13 %, whereas the performance drop of the other feature sets was above 30 %.
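The noise-corruption step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the noise level is specified as a signal-to-noise ratio (SNR) in dB, and the seven SNR values, the 440 Hz test tone, and the helper names `add_awgn` and `measured_snr_db` are all hypothetical, since the abstract does not state them.

```python
import numpy as np


def add_awgn(signal, snr_db, rng=None):
    """Return `signal` corrupted with white Gaussian noise at a target SNR in dB."""
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=float)
    signal_power = np.mean(signal ** 2)
    # SNR_dB = 10 * log10(P_signal / P_noise)  =>  P_noise = P_signal / 10^(SNR_dB / 10)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def measured_snr_db(clean, noisy):
    """Estimate the SNR actually achieved, given the clean and the noisy signal."""
    noise = noisy - clean
    return 10.0 * np.log10(np.mean(clean ** 2) / np.mean(noise ** 2))


# Hypothetical stand-in for a speech recording: one second of a 440 Hz tone at 16 kHz.
fs = 16000
t = np.arange(fs) / fs
clean = np.sin(2.0 * np.pi * 440.0 * t)

# Assumed noise levels; the abstract says seven levels were used but does not list them.
snr_levels_db = [40, 30, 20, 15, 10, 5, 0]

rng = np.random.default_rng(0)
achieved = {snr: measured_snr_db(clean, add_awgn(clean, snr, rng))
            for snr in snr_levels_db}
```

In a robustness study of this kind, each noisy copy would then be passed through the feature extractor and classifier, and the accuracy at each level compared with the clean-speech accuracy to obtain the percentage drop.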

Author information

Corresponding author

Correspondence to Mohd Ali Yusnita.

Copyright information

© 2016 Springer Science+Business Media Singapore

About this paper

Cite this paper

Yusnita, M.A., Paulraj, M.P., Yaacob, S., Shahriman, A.B., Yusuf, R., Fadzilah, M.N. (2016). Robustness Analysis of Feature Extractors for Ethnic Identification of Malaysian English Accents Database. In: Yacob, N., Mohamed, M., Megat Hanafiah, M. (eds) Regional Conference on Science, Technology and Social Sciences (RCSTSS 2014). Springer, Singapore. https://doi.org/10.1007/978-981-10-0534-3_5
