Vocal Mood Recognition: Text Dependent Sequential and Parallel Approach

Agarwal, Gaurav; Maheshkar, Vikas; Maheshkar, Sushila; Gupta, Sachi

doi:10.1007/978-981-13-1819-1_14

Vocal Mood Recognition: Text Dependent Sequential and Parallel Approach

Gaurav Agarwal¹⁸,
Vikas Maheshkar¹⁹,
Sushila Maheshkar¹⁸ &
…
Sachi Gupta²⁰

Conference paper
First Online: 29 September 2018

1175 Accesses
14 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 698))

Abstract

The speech is the most famous and essential method of communication among persons. The communication between human being and machine is called human PC interface. Speech has the capability of being the most essential method of communication with PC. Speech is also an acoustically rich flag that gives impressive individual data about talkers. The statement of emotions in speech sounds and relating capacities to visualize such emotions are both essential parts of human–machine communication. Discoveries from ponders trying to portray the acoustic properties of emotions present in vocal. The speech shows that discourse acoustics give an outside signal to the level of nonspecific excitement related with passionate procedures and, to a lesser degree, the relative charm of experienced feelings. A brief overview on bifurcation of speech, either into single or series of words with persistent or unconstrained speech is also taken into consideration within this paper. This paper represents the methodology that converts the vocal signals into corresponding text. As soon as the signal is converted into text, then the same will be analyzed with the sequential as well as with the parallel sum algorithm works for the parallel random access machine. Using both algorithms, the mood of the speech will be identified as romantic, sad or rock. Comparative studies of sequential and parallel approaches are also discussed in this paper. This paper concludes that the time complexity of the sequential executed algorithm can be reduced to a particular extent by using the parallel approaches.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

P.B. Dasgupta, Detection and analysis of human emotions through voice and speech pattern processing. Int. J. Comput. Trends Technol. 52(1) (2017)
Article MathSciNet Google Scholar
R.B. Lanjewar, D.S. Chaudhari, Speech emotion recognition: a review. Int. J. Innov. Technol. Explor. Eng. (IJITEE) 2(4) (2013)
Google Scholar
S. Shinde, S. Pande, A survey on: emotion recognition with respect to database and various recognition techniques. Int. J. Comput. Appl. 58(3) (2012)
Google Scholar
M. Sarode, D.G. Bhalke, Automatic music mood recognition using support vector regression. Int. J. Comput. Appl. 163(5) (2017)
Article Google Scholar
Mathieu Barthet, Gyorgy Fazekas, Mark Sandler, Music emotion recognition: from content to content based models (Springer, Berlin, Heidelberg, 2013)
Google Scholar
S.B. Davis, P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28(4) (1980)
Article Google Scholar
S.G. Koolagudi, K. Sreenivasa Rao, Emotion recognition from speech: a review. Int. J. Speech Technol. (2012)
Google Scholar
D.D. Joshi, M.B. Zalte, Speech emotion recognition: a review. IOSR J. Electron. Commun. Eng. (IOSR-JECE) 4(4) (2013)
Article Google Scholar
S. Swamy, K.V. Ramakrishnan, An efficient speech recognition system. Comput. Sci. Eng.: Int. J. (CSEIJ) 3(4) (2013)
Google Scholar
S. Arora, M. Goel, Survey paper on scheduling in Hadoop. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 4(5) (2014)
Google Scholar
S.K. Gaikwad, B.W. Gawali, P. Yannawar, A review on speech recognition technique. Int. J. Comput. Appl. 10(3) (2010)
Article Google Scholar
S. Sharma, R.S. Jadon, Mood based music classification. Int. J. Innov. Sci. Eng. Technol. (IJISET) 1(6) (2014)
Google Scholar
S. Karpagavalli, E. Chandra, A review on automatic speech recognition architecture and approaches. Int. J. Signal Process. Image Process. Pattern Recogn. 9(4) (2016)
Google Scholar
Y.-H. Cho, H. Lim, D.-W. Kim, I.-K. Lee, Music emotion recognition using chord progressions, in IEEE International Conference on Systems, Man, and Cybernetics SMC (2016). https://doi.org/10.1109/smc.2016.7844628
W.M. Campbell, D.E. Sturim, D.A. Reynolds, Support vector machines using GMM super vectors for speaker verification. IEEE Signal Process. Lett. (2015)
Google Scholar
M. Vyas, A Gaussian mixture model based speech recognition system using Matlab. Int. J. Speech Image Process. (SIPIJ) 4(4) (2013)
Google Scholar
H. Shen, F. Pétrot, Using Amdahl’s law for performance analysis of many-core SoC architectures based on functionally asymmetric processors, in 24th International Conference Camo, Italy, ARCS 2011, LNCS 6566 (Springer, Berlin, Heidelberg, 2011), pp. 38–49
Chapter Google Scholar

Download references

Acknowledgements

The author would like to thank the Department of Computer Science and Engineering, Indian Institute of Technology (ISM), Dhanbad, Jharkhand, India for providing a platform and all necessary requirement for generating the dataset used in this study. This dataset has been created by the author in the lab. The authors are also thankful to the organizer to provide a nice platform for presenting the research.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology (ISM), Dhanbad, Jharkhand, India
Gaurav Agarwal & Sushila Maheshkar
Division of Information Technology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, India
Vikas Maheshkar
Department of Computer Science and Engineering, Raj Kumar Goel Institute of Technology, Ghaziabad, India
Sachi Gupta

Authors

Gaurav Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Vikas Maheshkar
View author publications
You can also search for this author in PubMed Google Scholar
Sushila Maheshkar
View author publications
You can also search for this author in PubMed Google Scholar
Sachi Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gaurav Agarwal .

Editor information

Editors and Affiliations

Department of Instrumentation and Control Engineering, Netaji Subhas Institute of Technology, New Delhi, Delhi, India
Hasmat Malik
Department of Instrumentation and Control Engineering, Netaji Subhas Institute of Technology, New Delhi, Delhi, India
Smriti Srivastava
National Institute of Technology, Puducherry, India
Yog Raj Sood
Perceiving Systems Department, Max Planck Institute for Intelligent Systems, Tübingen, Germany
Aamir Ahmad

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Agarwal, G., Maheshkar, V., Maheshkar, S., Gupta, S. (2019). Vocal Mood Recognition: Text Dependent Sequential and Parallel Approach. In: Malik, H., Srivastava, S., Sood, Y., Ahmad, A. (eds) Applications of Artificial Intelligence Techniques in Engineering. Advances in Intelligent Systems and Computing, vol 698. Springer, Singapore. https://doi.org/10.1007/978-981-13-1819-1_14

Download citation

DOI: https://doi.org/10.1007/978-981-13-1819-1_14
Published: 29 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1818-4
Online ISBN: 978-981-13-1819-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics