Tamil and English speech database for heartbeat estimation
- 72 Downloads
The aim of this research work is to provide an open source database containing speech signals and the corresponding heartbeat rates, so as to further widen the area of research in speech signal processing, especially estimation of heartbeat rate from speech. Tamil and English Speech Database for Heartbeat Estimation consists of 10,040 speech recordings. The speech signals were recorded from 109 persons, 52 females and 57 males with an average age of 25 years and 6 months. The informed consented volunteers were asked to perform three tasks; like answering and reading in rest state; answering and reading after physical exercise and answering after watching video clips. 24-th and 72-nd order Mel-Frequency Cepstral Coefficients and 14-th and 52-nd order Auto Regressive Reflection Coefficients are extracted from the speech signal. Prediction of heartbeat is done by linear regression using support vector machine. The statistical significance of the heartbeat prediction results are improved by 10-fold speaker-independent cross validation scheme. Experimental results show a minimum average estimation error of ± 13.
KeywordsSpeech database Heartbeat estimation from speech Mel-frequency cepstral coefficients Autoregressive reflection coefficients Linear regression
We sincerely thank the Management, Principal, Students and Staff Members of St. Xavier’s Catholic College of Engineering, Nagercoil, for their valuable participation and support during the TESDHE database recording process.
- Bernardi, L., Wdowczyk-Szulc, J., Valenti, C., Castoldi, S., Passino, C., Spadacini, G., G., & Sleight, P. (2000). Effects of controlled breathing, mental activity and mental stress with or without verbalization on heart rate variability. Journal of the American College of Cardiology, 35(6), 1462–1469.CrossRefGoogle Scholar
- Kathol, A., & Shriberg, E. (2015). The SRI biofrustration corpus: audio, video, and physiological signals for continuous user modelling. In Proceedings of AAAI Spring Symposium Series 2015, (pp. 96–99) Palo Alto, California.Google Scholar
- Milton, A. (2015). Automatic recognition of speech emotions using class-specific multiple classifier scheme. Ph.D. Thesis, Anna University, Chennai, India.Google Scholar
- Rabiner, L. R., & Schafer, R. W. (2004s). Digital processing of speech signals. Delhi: Pearson Education (Singapore) Pte.Ltd.Google Scholar
- Ryskaliyev, A., Askaruly, S., & James, A. (2016). Speech signal analysis for the estimation of heart rates under different emotional states. In Proceedings of IEEE International Conference on Advances in Computing, Communications and Informatics, (pp. 1160–1165) Jaipur, India.Google Scholar
- Schuller, B., Friedmann, F., & Eyben, F. (2013). Automatic recognition of physiological parameters in the human voice: heart rate and skin conductance. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (pp. 7219–7223) Vancouver, BC, Canada.Google Scholar
- Schuller, B., Friedmann, F., & Eyben, F. (2014). The Munich biovoice corpus: effects of physical exercising, heart rate and skin conductance on human speech production. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, (pp. 1506–1510) Reykjavik, Iceland.Google Scholar
- Smith, J., Tsiartas, A., Shriberg, E., Kathol, A., Willoughby, A., & Zambotti, M. D. (2017). Analysis and prediction of heart rate using speech features from natural speech. In IEEE International Conference in Acoustics, Speech and Signal Processing, (pp. 989–993) New Orleans, LA, USA.Google Scholar
- Tsiartas, A., Kathol, A., Shriberg, E., Zambotti, M. D., & Willoughby, A. (2015). Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system. In Proceedings of Interspeech 2015, (pp. 3175–3179) Dresden, Germany.Google Scholar