Abstract
Accent is a fascinating aspect of human speech behavior that can mark the personal identity and social characteristics of its bearer. However, it can also bias social interaction, for example through judgments of prestige and job competency and through ethnic discrimination. Although many methods have successfully identified speaker accents in the past, their success rates are most likely database-dependent. This study investigates the identification of Malaysian English (MalE) accents that arise from the ethnic diversity of Malaysia. A robustness analysis was conducted by corrupting the speech signals with additive white Gaussian noise (AWGN) at seven noise levels, in order to compare the performance of four schemes of feature extractors under clean and noisy conditions. The first scheme is filter bank analysis, consisting of mel-frequency cepstral coefficients (MFCC) and a newly formulated feature set named descriptors of mel-bands spectral energy (MBSE); principal component analysis (PCA) was used to transform these into a further feature set called PCA-MBSE. The second is vocal tract analysis, consisting of linear prediction coefficients (LPC) and formant frequencies (formants). The third is hybrid analysis, consisting of the discrete wavelet transform (DWT) and LPC. The last scheme is the fusion of spectral features (SFFs): MFCC with formants and LPC with formants. Experimental results showed that the SFF techniques are more robust to noise than the MFCC, LPC, MBSE, and DWT-derived LPC features. Similarly, PCA-transformed MBSE was only moderately affected compared with the original features. PCA-MBSE suffered a performance drop of 15 % on average and the SFFs dropped by only 8 to 13 % under AWGN, whereas the drops for the other feature sets were above 30 %.
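The noise-corruption step described above can be illustrated with a minimal sketch. It assumes the noise level is specified as a signal-to-noise ratio (SNR) in decibels; the seven SNR values, the function names `add_awgn` and `measured_snr_db`, and the synthetic test tone are illustrative assumptions, not details taken from the study.

```python
import numpy as np

# Hypothetical set of seven noise levels in dB (the abstract does not list them).
SNR_LEVELS_DB = [30, 25, 20, 15, 10, 5, 0]


def add_awgn(signal, snr_db, rng=None):
    """Return `signal` corrupted by white Gaussian noise at a target SNR (dB)."""
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=float)
    # Average power of the clean signal.
    signal_power = np.mean(signal ** 2)
    # Noise power required so that 10*log10(Ps/Pn) equals snr_db.
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


def measured_snr_db(clean, noisy):
    """Empirical SNR (dB) of `noisy` relative to the known clean signal."""
    clean = np.asarray(clean, dtype=float)
    noise = np.asarray(noisy, dtype=float) - clean
    return 10.0 * np.log10(np.mean(clean ** 2) / np.mean(noise ** 2))


if __name__ == "__main__":
    # Synthetic one-second 220 Hz tone at 16 kHz stands in for a speech signal.
    t = np.arange(16000) / 16000.0
    clean = np.sin(2.0 * np.pi * 220.0 * t)
    rng = np.random.default_rng(0)
    for snr in SNR_LEVELS_DB:
        noisy = add_awgn(clean, snr, rng)
        print(f"target {snr:>2} dB, measured {measured_snr_db(clean, noisy):.2f} dB")
```

In a robustness experiment of this kind, each feature extractor would be run on the clean signal and on every noisy version, and the classification accuracies compared across noise levels.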
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Yusnita, M.A., Paulraj, M.P., Yaacob, S., Shahriman, A.B., Yusuf, R., Fadzilah, M.N. (2016). Robustness Analysis of Feature Extractors for Ethnic Identification of Malaysian English Accents Database. In: Yacob, N., Mohamed, M., Megat Hanafiah, M. (eds) Regional Conference on Science, Technology and Social Sciences (RCSTSS 2014). Springer, Singapore. https://doi.org/10.1007/978-981-10-0534-3_5
DOI: https://doi.org/10.1007/978-981-10-0534-3_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0532-9
Online ISBN: 978-981-10-0534-3
eBook Packages: Business and Management (R0)