Noise Robustness of Spectrum Delta (SpD) Features in Malay Vowel Recognition

Shahrul Azmi, Mohd Yusof; Nor Idayu, M.; Roshidi, D.; Yaakob, A. R.; Yaacob, Sazali

doi:10.1007/978-3-642-35594-3_38

Mohd Yusof Shahrul Azmi⁶,
M. Nor Idayu⁶,
D. Roshidi⁶,
A. R. Yaakob⁶ &
…
Sazali Yaacob⁷

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 350))

Included in the following conference series:

International Conference on Future Generation Communication and Networking

1827 Accesses
2 Citations

Abstract

In Malaysia, there is increasing number of speech recognition researchers focusing on developing independent speaker speech recognition systems that uses Malay Language which are noise robust and accurate. The performance of speech recognition application under adverse noisy condition often becomes the topic of interest among speech recognition researchers regardless of the languages in use. This paper present a study of noise robust capability of an improved vowel feature extraction method called Spectrum Delta (SpD). The features are extracted from both original data and noise-added data and classified using three classifiers; (i) Multinomial Logistic Regression (MLR), (ii) K-Nearest Neighbors (k-NN) and (iii) Linear Discriminant Analysis (LDA). Results show that the proposed SpD is robust towards noise and LDA performs the best in overall vowel classification compared to MLR and k-NN in terms of robustness capability especially with signal-to-noise (SNR) above 20dB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Rosdi, F., Ainon, R.: Isolated malay speech recognition using Hidden Markov Models. In: International Conference on Computer and Communication Engineering (ICCCE 2008), Kuala Lumpur, Malaysia, pp. 721–725 (2008)
Google Scholar
Devore, S., Shinn-Cunningham, B.G.: Perceptual consequences of including reverbera-tion in spatial auditory displays. In: 2003 International Conference on Auditory Display, Boston, MA, USA, pp. 75–78 (2003)
Google Scholar
Uhl, C., Lieb, M.: Experiments with an extended adaptive SVD enhancement scheme forspeech recognition in noise. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, UT, USA, pp. 281–284 (2001)
Google Scholar
Al-Haddad, S., Samad, S., Hussain, A., Ishak, K.: Isolated Malay Digit Recognition Using Pattern Recognition Fusion of Dynamic Time Warping and Hidden Markov Models. American Journal of Applied Sciences 5, 714–720 (2008)
Article Google Scholar
Huang, X., Acero, A., Hon, H.: Spoken language processing: A guide to theory, algorithm, and system development. Prentice Hall PTR, Upper Saddle River (2001)
Google Scholar
Kyriakou, C., Bakamidis, S., Dologlou, I., Carayannis, G.: Robust Continuous Speech Recognition in the Presence of Coloured Noise. In: Proceedings of 4th European Conference on Noise Control (EURONOISE 2001), Patra, pp. 702–705 (2001)
Google Scholar
Shahrul Azmi, M.Y.: Feature Extraction and Classification of Malay Speech Vowels, in School of Mechatronics. Ph.D, Kangar, Perlis. Universiti Malaysia Perlis, Malaysia (UniMAP) (2010)
Google Scholar
Lim, C.P., Woo, S.C., Loh, A.S., Osman, R.: Speech Recognition Using Artificial Neural Networks. In: 1st International Conference on Web Information Systems Engineering (WISE 2000), Hong Kong, China, pp. 419 (2000)
Google Scholar
Salam, M., Mohamad, D., Salleh, S.: Neural network speaker dependent isolated Malay speech recognition system: handcrafted vs genetic algorithm. In: 6th International Symposium on Signal Processing and its Applications (ISSPA 2001), Kuala Lumpur, Malaysia (2001)
Google Scholar
Tan, C., Jantan, A.: Digit Recognition Using Neural Networks. Malaysian Journal of Computer Science 17, 40–54 (2004)
Google Scholar
Ting, H.N., Mark, K.M.: Speaker-dependent Malay Vowel Recognition for a Child with Articulation Disorder Using Multi-layer Perceptron. In: 4th Kuala Lumpur International Conference on Biomedical Engineering 2008, pp. 238–241 (2008)
Google Scholar
Yusof, S.A.M., Yaacob, S., Murugesa, P.: Improved Classification of Malaysian Spoken Vowels using Formant Differences. Journal of ICT (JICT) 7 (December 2008)
Google Scholar
Nazari, M., Sayadiyan, A., Valiollahzadeh, S.M.: Speaker-Independent Vowel Recognition in Persian Speech. In: 3rd International Conference on Information and Communication Technologies: From Theory to Applications (ICTTA 2008), Umayyad Palace, Damascus, Syria, pp. 1–5 (2008)
Google Scholar
Carvalho, M., Ferreira, A.: Real-Time Recognition of Isolated Vowels. In: André, E., Dybkjær, L., Minker, W., Neumann, H., Pieraccini, R., Weber, M. (eds.) PIT 2008. LNCS (LNAI), vol. 5078, pp. 156–167. Springer, Heidelberg (2008)
Chapter Google Scholar
Bresolin, A., Neto, A., Alsina, P.: Brazilian Vowels Recognition using a New Hierarchical Decision Structure with Wavelet Packet and SVM (2007)
Google Scholar
Muralishankar, R., Kaushik, L.N., Ramakrishnan, A.G.: Time-scaling of speech and music using independent subspace analysis. In: International Conference on Signal Processing and Communications (SPCOM 2004), pp. 310–314 (2004)
Google Scholar
Merkx, P., Miles, J.: Automatic Vowel Classification in Speech, Department of Mathematics, Duke University, Durham, NC, USA, Final Project for Math 196S2005
Google Scholar
Ting, H., Yunus, J.: Speaker-independent Malay vowel recognition of children using multi-layer perceptron. In: IEEE Region 10 Conference, TENCON 2004 (2004)
Google Scholar
Al-Haddad, S., Samad, S., Hussain, A., Ishak, K., Noor, A.: Robust Speech Recognition Using Fusion Techniques and Adaptive Filtering. American Journal of Applied Sciences 6, 290–295 (2009)
Google Scholar
Hawley, M.: Structure out of Sound, in School of Architecture and Planning. PhD, p. 185. Massachusetts Institute of Technology, Massachusetts (1993)
Google Scholar
Scheirer, E., Slaney, M.: Construction and evaluation of a robust multifeature speech/music discriminator. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, pp. 1331–1334 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Arts and Sciences, Universiti Utara Malaysia, Malaysia
Mohd Yusof Shahrul Azmi, M. Nor Idayu, D. Roshidi & A. R. Yaakob
School of Mechatronics, Universiti Malaysia Perlis, Malaysia
Sazali Yaacob

Authors

Mohd Yusof Shahrul Azmi
View author publications
You can also search for this author in PubMed Google Scholar
M. Nor Idayu
View author publications
You can also search for this author in PubMed Google Scholar
D. Roshidi
View author publications
You can also search for this author in PubMed Google Scholar
A. R. Yaakob
View author publications
You can also search for this author in PubMed Google Scholar
Sazali Yaacob
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

GVSA and University of Tasmania, Hobart, TAS, Australia
Tai-hoon Kim
Mokwon University, Daejeon, Korea
Dae-sik Ko
University of Western Macedonia, Kozani, Greece
Thanos Vasilakos
Jet Propulsion Laboratory/Caltech, NASA, 4800 Oak Grove Drive, 91109, Pasadena, CA, USA
Adrian Stoica
Deakin University, 75 Pigdons Road, 3216, Waurn Ponds, VIC, Australia
Jemal Abawajy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shahrul Azmi, M.Y., Nor Idayu, M., Roshidi, D., Yaakob, A.R., Yaacob, S. (2012). Noise Robustness of Spectrum Delta (SpD) Features in Malay Vowel Recognition. In: Kim, Th., Ko, Ds., Vasilakos, T., Stoica, A., Abawajy, J. (eds) Computer Applications for Communication, Networking, and Digital Contents. FGCN 2012. Communications in Computer and Information Science, vol 350. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35594-3_38

Download citation

DOI: https://doi.org/10.1007/978-3-642-35594-3_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35593-6
Online ISBN: 978-3-642-35594-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics