Speech Recognition Using Feed Forward Neural Network and Principle Component Analysis

Momo, Nusrat; Abdullah; Uddin, Jia

doi:10.1007/978-3-319-67934-1_20

Nusrat Momo²⁰,
Abdullah²⁰ &
Jia Uddin²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 678))

Included in the following conference series:

International Symposium on Signal Processing and Intelligent Recognition Systems

1654 Accesses
2 Citations

Abstract

Various models have been proposed with many dimension reduction techniques and classifiers in the field of pattern recognition by using audio signal processing. In this paper, an effective model has been proposed for pattern recognition using PCA as the sole dimension reduction technique and Feed forward Neural network as the classifier. Twenty-eight Parkinson’s disease affected patients’ audio recordings consisting of the pronunciation of the vowels ‘A’ and ‘O’ have been used as the dataset. From these audio recordings twenty features were extracted and PCA was run on those features. PCA rearranged the feature vector matrix in a more optimized manner. Thus the optimal features were arranged in order of their significance. From this rearranged and optimized feature vector matrix, the first eight optimal features were chosen which were later used to train and test the classifier Feed forward Neural network. Experimental results demonstrate that the model can predict the occurrence and pattern of the vowels ‘A’ and ‘O’ from the audio files with very high accuracy compared to the swarm search for feature selection in classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisc. Rev. Comput. Stat. 2(4), 433–459 (2010). doi:10.1002/wics.101
Article Google Scholar
Asadi, S., Rao, C., Saikrishna, V.: A Comparative study of face recognition with principal component analysis and cross-correlation technique. Int. J. Comput. Appl. 10(8), 17–21 (2010). doi:10.5120/1502-2019
Google Scholar
Cybenko, G.: Continuous Valued Neural Networks with Two Hidden Layers are Sufficient, pp. 303–314 (1988)
Google Scholar
Fong, S., Yang, X., Deb, S.: Swarm search for feature selection in classification. In: 2013 IEEE 16th International Conference on Computational Science and Engineering (2013). doi:10.1109/cse.2013.135
Funahashi, K.: On the approximate realization of continuous mappings by neural networks. Neural Networks 2(3), 183–192 (1989). doi:10.1016/0893-6080(89)90003-8
Article Google Scholar
Hagan, M.T., Demuth, H.B., Jesús, O.D.: An introduction to the use of neural networks in control systems. Int. J. Robust Nonlinear Control 12(11), 959–985 (2002). doi:10.1002/rnc.727
Article MATH Google Scholar
Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Networks 2(5), 359–366 (1989). doi:10.1016/0893-6080(89)90020-8
Article Google Scholar
Howard, W.: Pattern recognition and machine learning. Kybernetes 36(2), 275 (2007). doi:10.1108/03684920710743466. i‐xx, pp. 740. Springer, Heidelberg (2006). ISBN 0‐387‐31073‐8, $74.95 Hardcover
Article Google Scholar
Li, C., Diao, Y., Ma, H., Li, Y.: A statistical PCA method for face recognition. In: 2008 Second International Symposium on Intelligent Information Technology Application (2008). doi:10.1109/iita.2008.71
Hornik, K., Stinchcombe, M., White, H.: Multilayer feedforward networks are universal approximators. Neural Networks 2(5), 359–366 (1989). doi:10.1016/0893-6080(89)90020-8
Article Google Scholar
Mcculloch, W.S., Pitts, W.: A logical calculus of the ideas immanent in nervous activity. Bull. Math. Biophys. 5(4), 115–133 (1943). doi:10.1007/bf02478259
Article MathSciNet MATH Google Scholar
Meruelo, A.C., Simpson, D.M., Veres, S.M., Newland, P.L.: Improved system identification using artificial neural networks and analysis of individual differences in responses of an identified neuron. Neural Networks 75, 56–65 (2016). doi:10.1016/j.neunet.2015.12.002
Article Google Scholar
Murali, M.: (2015). Principal component analysis based feature vector extraction. Indian J. Sci. Technol. (2015)
Google Scholar
Phillips, P., Flynn, P., Scruggs, T., Bowyer, K., Chang, J., Hoffman, K., Worek, W.: Overview of the Face Recognition Grand Challenge. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005) (2005). doi:10.1109/cvpr.2005.268
Rosenblatt, F.: The perceptron: a probabilistic model for information storage and organization in the brain. Psychol. Rev. 65(6), 386–408 (1958). doi:10.1037/h0042519
Article Google Scholar
Sakar, B.E., Isenkul, M., Sakar, C.O., Sertbas, A., Gurgen, F., Delil, S., Kursun, O.: Collection and analysis of a parkinson speech dataset with multiple types of sound recordings. IEEE J. Biomed. Health Inform. 17(4), 828–834 (2013). doi:10.1109/jbhi.2013.2245674
Article Google Scholar
Schmidhuber, J.: Deep learning in neural networks: an overview. Neural Networks 61, 85–117 (2015). doi:10.1016/j.neunet.2014.09.003
Article Google Scholar
Tamura, S., Tateishi, M.: Capabilities of a four-layered feedforward neural network: four layers versus three. IEEE Trans. Neural Networks 8(2), 251–255 (1997). doi:10.1109/72.557662
Article Google Scholar
Hori, T., Kubo, Y., Nakamura, A.: Real-time one-pass decoding with recurrent neural network language model for speech recognition. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

BRAC University, Dhaka, 1212, Bangladesh
Nusrat Momo, Abdullah & Jia Uddin

Authors

Nusrat Momo
View author publications
You can also search for this author in PubMed Google Scholar
Abdullah
View author publications
You can also search for this author in PubMed Google Scholar
Jia Uddin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jia Uddin .

Editor information

Editors and Affiliations

School of CS/IT, Indian Institute of Information Technology and Management, Trivandrum, Kerala, India
Sabu M. Thampi
Department of Electrical and Computer Engineering, Ryerson University, Toronto, Ontario, Canada
Sri Krishnan
Department of Computer Science, University of Salamanca, Salamanca, Salamanca, Spain
Juan Manuel Corchado Rodriguez
Electronics and Communication Sciences Unit, Indian Statistical Institute, Kolkata, West Bengal, India
Swagatam Das
Department of Systems and Computer Networks, Wroclaw University of Science and Technology, Wroclaw, Poland
Michal Wozniak
Faculty of Engineering and Technology, Liverpool John Moores University, Liverpool, United Kingdom
Dhiya Al-Jumeily

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Momo, N., Abdullah, Uddin, J. (2018). Speech Recognition Using Feed Forward Neural Network and Principle Component Analysis. In: Thampi, S., Krishnan, S., Corchado Rodriguez, J., Das, S., Wozniak, M., Al-Jumeily, D. (eds) Advances in Signal Processing and Intelligent Recognition Systems. SIRS 2017. Advances in Intelligent Systems and Computing, vol 678. Springer, Cham. https://doi.org/10.1007/978-3-319-67934-1_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-67934-1_20
Published: 27 September 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67933-4
Online ISBN: 978-3-319-67934-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics