Heart Rate Extraction from Vowel Speech Signals


This paper presents a novel non-contact method for extracting heart rate from vowel speech signals. The method models the relationship between vowel speech production and cardiac activity in humans: each heartbeat is observed to cause a brief increase (evolution) in the vowel formants. The short-time Fourier transform (STFT) is used to detect the formant maximum peaks, from which the heart rate is estimated. Measured against a traditional contact pulse oximeter, the average accuracy of the proposed method exceeds 95%. The method is expected to play an important role in modern medical applications.
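The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a synthetic "vowel" made of a single 700 Hz tone whose amplitude rises briefly at each heartbeat, and it uses a simple midpoint threshold to find the peaks. The names `synth_vowel`, `stft_magnitude`, and `estimate_heart_rate`, the band limits, and the frame sizes are all hypothetical choices made for illustration.

```python
import numpy as np


def synth_vowel(bpm=72.0, fs=8000, duration=10.0, formant_hz=700.0):
    """Toy signal (an assumption, not real speech): one tone at a formant
    frequency whose amplitude rises briefly at each heartbeat."""
    t = np.arange(int(fs * duration)) / fs
    beat_times = np.arange(0.4, duration, 60.0 / bpm)
    # One narrow Gaussian bump (sigma = 30 ms) per beat.
    bumps = np.exp(-0.5 * ((t[:, None] - beat_times[None, :]) / 0.03) ** 2)
    envelope = 1.0 + 0.3 * bumps.sum(axis=1)
    return envelope * np.sin(2.0 * np.pi * formant_hz * t), fs


def stft_magnitude(x, frame_len, hop):
    """Magnitude STFT with a Hann window; rows are frames, columns are bins."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames * window, axis=1))


def estimate_heart_rate(x, fs, band=(600.0, 800.0), frame_len=256, hop=64):
    """Estimate beats per minute from brief increases of the spectral peak
    inside one assumed formant band."""
    mag = stft_magnitude(x, frame_len, hop)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    in_band = (freqs >= band[0]) & (freqs <= band[1])
    track = mag[:, in_band].max(axis=1)  # formant peak magnitude per frame

    # Midpoint threshold (an illustrative choice); each rising edge of the
    # thresholded track is counted as one beat.
    threshold = 0.5 * (track.max() + track.min())
    above = track > threshold
    onsets = np.flatnonzero(above[1:] & ~above[:-1]) + 1
    if len(onsets) < 2:
        raise ValueError("fewer than two beats detected")

    frame_rate = fs / hop  # STFT frames per second
    mean_interval_s = np.mean(np.diff(onsets)) / frame_rate
    return 60.0 / mean_interval_s


if __name__ == "__main__":
    signal, fs = synth_vowel(bpm=72.0)
    print(f"estimated heart rate: {estimate_heart_rate(signal, fs):.1f} bpm")
```

On recorded speech the formant band would have to be chosen per speaker and vowel, and a fixed midpoint threshold would likely be too fragile. The sketch only shows how a time-frequency track can be turned into a beat interval.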


Author information


Corresponding author

Correspondence to Abdelwadood Mesleh.

About this article

Cite this article

Mesleh, A., Skopin, D., Baglikov, S. et al. Heart Rate Extraction from Vowel Speech Signals. J. Comput. Sci. Technol. 27, 1243–1251 (2012).


  • electrocardiogram
  • feature extraction
  • heart rate
  • short-time Fourier transform
  • vowel speech signal