International Journal of Speech Technology

, Volume 21, Issue 4, pp 967–973 | Cite as

Tamil and English speech database for heartbeat estimation

  • A. Milton
  • K. Anish Monsely


The aim of this research work is to provide an open source database containing speech signals and the corresponding heartbeat rates, so as to further widen the area of research in speech signal processing, especially estimation of heartbeat rate from speech. Tamil and English Speech Database for Heartbeat Estimation consists of 10,040 speech recordings. The speech signals were recorded from 109 persons, 52 females and 57 males with an average age of 25 years and 6 months. The informed consented volunteers were asked to perform three tasks; like answering and reading in rest state; answering and reading after physical exercise and answering after watching video clips. 24-th and 72-nd order Mel-Frequency Cepstral Coefficients and 14-th and 52-nd order Auto Regressive Reflection Coefficients are extracted from the speech signal. Prediction of heartbeat is done by linear regression using support vector machine. The statistical significance of the heartbeat prediction results are improved by 10-fold speaker-independent cross validation scheme. Experimental results show a minimum average estimation error of ± 13.


Speech database Heartbeat estimation from speech Mel-frequency cepstral coefficients Autoregressive reflection coefficients Linear regression 



We sincerely thank the Management, Principal, Students and Staff Members of St. Xavier’s Catholic College of Engineering, Nagercoil, for their valuable participation and support during the TESDHE database recording process.


  1. Barros, A. K., & Ohnishi, N. (2001). Heart instantaneous frequency (HIF): An alternative approach to extract heart rate variability. IEEE Transactions on Biomedical Engineering, 48(8), 850–855.CrossRefGoogle Scholar
  2. Bernardi, L., Wdowczyk-Szulc, J., Valenti, C., Castoldi, S., Passino, C., Spadacini, G., G., & Sleight, P. (2000). Effects of controlled breathing, mental activity and mental stress with or without verbalization on heart rate variability. Journal of the American College of Cardiology, 35(6), 1462–1469.CrossRefGoogle Scholar
  3. Hayre, H. S., & Holland, J. C. (1980). Cross-correlation of voice and heart rate as stress measures. Applied Acoustics, 13(1), 57–62.CrossRefGoogle Scholar
  4. James, A. P. (2015). Heart rate monitoring using human speech spectral features. Human-Centric Computing and Information Sciences, 5, 33. Scholar
  5. Johnson, H. J., & Campos, J. J. (1967). The effect of cognitive tasks and verbalization instructions on heart rate and skin conductance. Psychophysiology, 4(2), 143–150.CrossRefGoogle Scholar
  6. Kathol, A., & Shriberg, E. (2015). The SRI biofrustration corpus: audio, video, and physiological signals for continuous user modelling. In Proceedings of AAAI Spring Symposium Series 2015, (pp. 96–99) Palo Alto, California.Google Scholar
  7. Makhoul, J. (1975). Linear prediction: a tutorial review. Proceedings of the IEEE, 63(4), 561–580.CrossRefGoogle Scholar
  8. Mesleh, A., Skopin, D., Baglikov, S., & Quteishat, A. (2012). Heart rate extraction from vowel speech signals. Journal of computer science and technology, 27(6), 1243–1251.CrossRefGoogle Scholar
  9. Milton, A. (2015). Automatic recognition of speech emotions using class-specific multiple classifier scheme. Ph.D. Thesis, Anna University, Chennai, India.Google Scholar
  10. Rabiner, L. R., & Schafer, R. W. (2004s). Digital processing of speech signals. Delhi: Pearson Education (Singapore) Pte.Ltd.Google Scholar
  11. Ryskaliyev, A., Askaruly, S., & James, A. (2016). Speech signal analysis for the estimation of heart rates under different emotional states. In Proceedings of IEEE International Conference on Advances in Computing, Communications and Informatics, (pp. 1160–1165) Jaipur, India.Google Scholar
  12. Schnell, I., Potchter, O., Epstein, Y., Yaakov, Y., Hermesh, H., Brenner, S., & Tirosh, E. (2013). The effects of exposure to environmental factors on heart rate variability: An ecological perspective. Environmental Pollution, 183, 7–13.CrossRefGoogle Scholar
  13. Schuller, B., Friedmann, F., & Eyben, F. (2013). Automatic recognition of physiological parameters in the human voice: heart rate and skin conductance. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (pp. 7219–7223) Vancouver, BC, Canada.Google Scholar
  14. Schuller, B., Friedmann, F., & Eyben, F. (2014). The Munich biovoice corpus: effects of physical exercising, heart rate and skin conductance on human speech production. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, (pp. 1506–1510) Reykjavik, Iceland.Google Scholar
  15. Seraganian, P., Szabob, A., & Brown, T. G. (1997). The effect of vocalization on the heart rate response to mental arithmetic. Physiology & Behavior, 62(2), 221–224.CrossRefGoogle Scholar
  16. Smith, J., Tsiartas, A., Shriberg, E., Kathol, A., Willoughby, A., & Zambotti, M. D. (2017). Analysis and prediction of heart rate using speech features from natural speech. In IEEE International Conference in Acoustics, Speech and Signal Processing, (pp. 989–993) New Orleans, LA, USA.Google Scholar
  17. Tsiartas, A., Kathol, A., Shriberg, E., Zambotti, M. D., & Willoughby, A. (2015). Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system. In Proceedings of Interspeech 2015, (pp. 3175–3179) Dresden, Germany.Google Scholar
  18. Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.zbMATHGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Department of Electronics and Communication EngineeringSt. Xavier’s Catholic College of EngineeringNagercoilIndia

Personalised recommendations