Introduction

Heart rate variability (HRV) is an indirect indicator of the state of the autonomic nervous system (ANS). HRV has been increasingly used, for example, in exercise recovery assessment1 and in wellness applications for assessing sleep quality2. Many research articles have also proposed its use in clinical applications. According to a recent review on HRV applications in the medical domain by Faust et al.3, cardiology is the most studied application area for HRV, followed by mental health and sleep physiology. In cardiology, HRV is widely used for detecting cardiac arrhythmias. However, in arrhythmia applications, the purpose is not to obtain information on the state of the ANS but rather to detect and analyze ectopic heartbeats that originate elsewhere in the heart than in the sinoatrial node. Cardiological HRV applications assessing the ANS include early detection of surgical stress4, detection and monitoring of heart failure, risk prediction for sudden cardiac death, and several others3. HRV has been proposed as a clinical tool for routine risk stratification in myocardial infarction patients5, but generally more trials and research are needed for wider application in clinical practice.

One of the most widely used HRV parameters is the standard deviation of normal-to-normal beat intervals (SDNN)1. It reflects the overall activity of ANS regulation and is considered the gold standard for evaluating cardiac risk6. It has been found to be directly affected by myocardial infarction (MI)7, autonomic dysfunction8, and mental conditions such as depression and anxiety9.

In several applications, it is important to detect even small changes in HRV parameter values. However, the parameters are sensitive to uncertainty and errors in the heartbeat intervals. Given that different HRV parameters carry redundant information about the ANS, it is of interest to assess the differences in their sensitivity to heartbeat interval errors. This would help select HRV parameters that are less sensitive to such errors while still providing relevant information.

While the ECG-based RR-interval tachogram is the gold standard data for HRV estimation, using pulse intervals measured with photoplethysmography (PPG) for HRV estimation has recently attracted considerable interest. PPG-based HRV is usually referred to as pulse rate variability (PRV) to highlight the inevitably higher uncertainty in the heartbeat intervals, as well as the fundamental difference caused by variations in pulse arrival time due to changes in blood pressure10. In PRV analysis, the robustness of HRV parameters to uncertainty in beat-to-beat intervals is therefore even more important.

Most of the earlier studies on HRV sensitivity to heartbeat interval uncertainty have focused on evaluating the effect of the ECG sampling rate and the resulting uncertainty in the temporal location of the R-peak, as well as on sporadic errors in R-peak detection, missing R-peaks, and the effect of beat replacement11,12. Usually, these studies have evaluated the effects on a few of the most commonly used HRV parameters13,14. Petelczyc et al.15 performed one of the most thorough analyses, using Monte Carlo simulation to compare the sensitivity of various HRV parameters to the ECG sampling frequency and QRS complex detection, and estimated the effect of RR-interval errors on ten commonly used HRV parameters. They found that the most sensitive parameter was pNN50, whereas the short- and long-term slopes \(\alpha _1\) and \(\alpha _2\) of the detrended fluctuation analysis (DFA) were the least sensitive to RR-interval errors. In the present work, we perform extensive Monte Carlo simulations in which we artificially introduce error into real-world RR-interval data and analyze how this noise is reflected in 34 HRV parameters calculated in 5-min segments. In our simulations, we vary both the distribution of the noise (uniform, Gaussian, t-distributed) and its standard deviation from 1 to 10 ms to approximate interval errors due, for example, to a low ECG sampling rate, as well as the average errors seen in PPG16. We present our results both quantitatively and qualitatively and perform statistical tests to find significant differences. We assess the consistency of the effect of the noise distribution and evaluate which HRV parameters show consistent bias under RR-interval uncertainty and which are more or less affected by it.

The main contribution of this study is an overview of the error distributions of an extensive selection of HRV parameters. We quantify their sensitivity to noise and show that most parameters exhibit systematic biases. Based on our results, we argue that the LF/HF ratio and pNN50 should be used with caution and that researchers should generally consider the distributions of parameter values instead of only computing mean and median values.

Material and methods

In order to obtain realistic beat-to-beat intervals from healthy subjects, the “Autonomic Aging Dataset” was used17, which is a publicly available dataset within the PhysioNet Database18. It contains recordings of 1121 healthy volunteers of approximately 8–40 min duration. It is divided into six age groups (below 30, 30–39, 40–49, 50–59, 60–69, above 69 years), with 670 participants being female and 433 male. For each subject, an ECG signal (lead II) and a continuous non-invasive blood pressure signal are available. The ECG was recorded at 1000 Hz with either an MP150 (ECG100C, BIOPAC Systems Inc., Goleta, CA, USA) or a Task Force Monitor system (CNSystems Medizintechnik GmbH, Graz, Austria). All subjects were screened rigorously and remained in resting sinus rhythm. In order to obtain clean and representative heartbeat interval data and heartbeat annotations, we extracted the heartbeat locations from the ECG using a Pan-Tompkins based QRS detector19 and employed a validated beat correction algorithm20. Instead of using the corrected beats, segments in which the correction algorithm had overwritten missed or erroneous beats were discarded, reducing the number of subjects with more than 5 min of recording to 971. The processed dataset contains a total of 989,399 automatically validated heartbeats.
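The discard step can be summarized as follows: only stretches of beats that the correction algorithm did not have to modify are retained, and a subject is kept only if such a stretch still exceeds 5 min. A minimal sketch of this logic is given below; it assumes the per-subject beat times and the indices of corrected beats have already been produced by the QRS detector19 and the beat correction algorithm20, and the exact segmentation used in the original pipeline may differ.

```python
# Sketch of the beat pre-selection step. Assumption: the detector and the
# correction algorithm (Refs. 19-20) have already produced, per subject, the
# beat times in seconds and the indices of beats they had to overwrite.
import numpy as np

def longest_clean_run_s(beat_times_s, corrected_idx):
    """Longest contiguous stretch (in seconds) of beats left untouched
    by the beat correction algorithm."""
    clean = np.ones(len(beat_times_s), dtype=bool)
    clean[list(corrected_idx)] = False
    best, start = 0.0, None
    for i, ok in enumerate(clean):
        if ok and start is None:
            start = i
        if start is not None and (not ok or i == len(clean) - 1):
            end = i - 1 if not ok else i
            best = max(best, beat_times_s[end] - beat_times_s[start])
            start = None
    return best

MIN_DURATION_S = 5 * 60
# subjects: dict subject_id -> (beat_times_s, corrected_idx)
# kept = {sid: d for sid, d in subjects.items()
#         if longest_clean_run_s(*d) > MIN_DURATION_S}
```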

In our simulation, we created a representative sample \({\mathcal {S}}\) of \(n=15{,}000\) 5-min segments from the processed dataset. n was chosen as an upper bound with respect to a convergence criterion for the distribution of each HRV parameter, defined as the point where the normalized standard error of the expectation of the HRV parameter distribution (standard error \(S_e\) divided by sample mean \(\mu _s\)) drops below 2%, where \(S_e=1.96\sigma _s/\sqrt{n}\) and \(\sigma _s\) is the sample standard deviation. Sampling was performed by first choosing one of the 971 recordings at random, selecting a starting heartbeat annotation at random, and then including all beat annotations in the following 5-min window, leading to windows with partially overlapping information. The average number of heartbeats in each window was 344 with a standard deviation of 48, corresponding to an average heart rate of 69 bpm with a standard deviation of 9.6 bpm. Due to the relatively low number of sampling iterations, not all subjects were represented by exactly \(n/971\approx 15\) segments. Notably, the distribution of segments per age group matches the age distribution in the Autonomic Aging dataset, which is heavily tilted towards the younger groups.
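The convergence criterion can be checked with a few lines of code. The sketch below, written under the assumption that the criterion is evaluated per HRV parameter on the segments sampled so far, illustrates how the normalized standard error of the mean is compared against the 2% threshold; the function name and example values are illustrative.

```python
# Sketch of the convergence criterion behind n = 15,000: sampling of 5-min
# segments can stop once S_e = 1.96*sigma_s/sqrt(n) falls below 2 % of the
# sample mean for every HRV parameter (shown here for a single parameter).
import numpy as np

def has_converged(values, rel_tol=0.02):
    """values: one HRV parameter evaluated on all segments sampled so far."""
    values = np.asarray(values, dtype=float)
    n = len(values)
    if n < 2:
        return False
    se = 1.96 * values.std(ddof=1) / np.sqrt(n)   # standard error S_e
    return se / abs(values.mean()) < rel_tol      # normalized standard error < 2 %

# Illustration with synthetic SDNN-like values (mean 44 ms, sd 20 ms):
rng = np.random.default_rng(0)
print(has_converged(rng.normal(44.0, 20.0, size=5000)))   # True (~1.3 % < 2 %)
```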

HRV calculation

HRV analyses were carried out using Kubios HRV Scientific 4.0 software (Kubios Oy, Kuopio, Finland). The pre-processing features of the Kubios HRV software, including noise detection, beat correction, and detrending, were all disabled in order to observe the true influence of inter-beat interval (IBI) errors on the different HRV analysis parameters. Otherwise, the HRV parameters were derived according to the guidelines21. The extensive set of HRV parameters assessed in this study is described in Table 1.

Table 1 Descriptions of assessed time-domain, frequency-domain and nonlinear HRV parameters and related analysis settings.

Noise simulation

We added noise to the beat timings by sampling from three differently distributed random variables. We investigated 10 equally spaced noise levels with standard deviations \(\sigma \) ranging from 1 to 10 ms. The distributions in Fig. 1 (including the triangular distribution, which is discussed below but not directly applied) were parameterized to have the same standard deviation and zero mean. The analytically derived parameters were computed as follows:

$$\begin{aligned} \text {Gaussian}&: {\hat{\mu }}=0,\;{\hat{\sigma }}=\sigma , \end{aligned}$$
(1)
$$\begin{aligned} \text {uniform}&: \text {lower}=-\sqrt{3}\sigma ,\;\text {upper}=\sqrt{3}\sigma , \end{aligned}$$
(2)
$$\begin{aligned} \text {triangular}&: A=-\sqrt{6}\,\sigma ,\; B=0,\; C=-A, \end{aligned}$$
(3)
$$\begin{aligned} \text {t-distribution}&: {\hat{\mu }}=0,\; {\hat{\sigma }}=\sqrt{({\hat{\nu }}-1)\sigma ^2/{\hat{\nu }}},\;{\hat{\nu }}=3 , \end{aligned}$$
(4)

where \({\hat{\nu }}\) is chosen as the smallest value for which the second moment is defined, yielding very heavy tails.
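As an illustration, the generators below sample from the four distributions parameterized to a target standard deviation \(\sigma \). This is a minimal sketch; for the t-distribution, the scale follows the standard location-scale relation \(\text {Var}=\text {scale}^2\,\nu /(\nu -2)\), which is an assumption on how Eq. (4) is applied in practice and may differ slightly from the printed factor.

```python
# Sketch of the noise generators of Eqs. (1)-(4), each targeting the same
# standard deviation sigma (in ms) and zero mean.
import numpy as np
from scipy import stats

def noise_samples(kind, sigma, size, rng=None):
    rng = rng or np.random.default_rng()
    if kind == "gaussian":
        return rng.normal(0.0, sigma, size)                 # Eq. (1)
    if kind == "uniform":
        a = np.sqrt(3.0) * sigma                            # Eq. (2)
        return rng.uniform(-a, a, size)
    if kind == "triangular":
        a = np.sqrt(6.0) * sigma                            # Eq. (3)
        return rng.triangular(-a, 0.0, a, size)
    if kind == "t":
        nu = 3                                              # smallest dof with finite variance
        scale = sigma * np.sqrt((nu - 2) / nu)              # Var = scale^2 * nu/(nu-2)
        return stats.t.rvs(df=nu, scale=scale, size=size, random_state=rng)
    raise ValueError(f"unknown noise kind: {kind}")

# All generators target the same sigma; the empirical std of the heavy-tailed
# t-noise converges slowly because its fourth moment is infinite.
for kind in ("gaussian", "uniform", "triangular", "t"):
    print(kind, round(noise_samples(kind, sigma=5.0, size=200_000).std(), 2))
```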

Figure 1. Error distributions commonly observed, all with a standard deviation of 1 ms.

The choice of distributions is motivated by errors introduced during signal processing. Due to the finite sampling frequency, peak locations in the raw data always show a uniformly distributed error30. Besides the sampling error, the uniform distribution is chosen because typical QRS detectors with beat correction usually come with an upper and a lower threshold for discarding beats, which limits the possible error, but within these limits arbitrary errors are possible. Interestingly, the triangular distribution (not directly applied in this study) is generated when the inter-beat intervals are computed from uniformly distributed peak errors and should therefore be considered relevant for inter-beat interval analysis (see the sketch below). The Gaussian distribution models the combination of multiple unknown error sources in the signal generation and processing pipeline, such as sampling noise, quantization noise, peak uncertainty, and the influence of signal noise on the detection, and is the most common assumption. Along these lines, Petelczyc et al.15 assume that the QRS detection procedure amplifies existing errors. The t-distribution is heavy-tailed and thus allows extreme errors with non-zero probability, such as outliers from missed beats, extra beats, or beats misaligned due to motion artifacts or undetected ectopic beats.
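The propagation of uniform peak errors into triangular interval errors can be verified numerically. The short sketch below is an illustration only; it also shows that the interval-error standard deviation is \(\sqrt{2}\) times that of the per-beat error, since two noisy beat locations contribute to each interval.

```python
# Numerical check: independent, uniformly distributed per-beat timing errors
# yield triangularly distributed inter-beat interval errors.
import numpy as np

rng = np.random.default_rng(1)
sigma = 5.0                               # ms, std of the per-beat timing error
a = np.sqrt(3.0) * sigma                  # uniform on [-a, a] has std sigma
beat_error = rng.uniform(-a, a, size=1_000_000)
interval_error = np.diff(beat_error)      # error propagated into the intervals

print(round(interval_error.std(), 2))          # ~ sigma*sqrt(2) = 7.07 ms
print(np.abs(interval_error).max() <= 2 * a)   # True: triangular support [-2a, 2a]
```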

Evaluation

For every 5-min segment in \({\mathcal {S}}\), the HRV values calculated before adding noise are considered the ground truth values \(x_{i,p}\), with \(i \in \{1,\dots ,n\}, \, p \in \textrm{HRV}_{\mathrm{params}}\). After adding noise of a specific distribution and intensity, the estimated HRV value \({\hat{x}}_{i,p}\) is obtained. The difference to the ground truth that arises from the addition of noise as described above is considered the error,

$$\begin{aligned} e_{i,p} = {\hat{x}}_{i,p} - x_{i,p}. \end{aligned}$$
(5)

For the Bland–Altman analysis, we also need to calculate the average of the ground truth and the estimation, \(a_{i,p} = ({\hat{x}}_{i,p} + x_{i,p})/2\). The systematic bias is defined as the average of the error over all \(n=15000\) samples,

$$\begin{aligned} \Delta _p = \frac{1}{n} \sum _{i=1}^n e_{i,p}. \end{aligned}$$
(6)

In addition to the systematic bias, the mean absolute error (MAE)

$$\begin{aligned} \text {MAE}_p = \frac{1}{n} \sum _{i=1}^n\left| e_{i,p} \right| \end{aligned}$$
(7)

as well as the root-mean-square error (RMSE)

$$\begin{aligned} \text {RMSE}_p = \sqrt{ \frac{1}{n} \sum _{i=1}^n e_{i,p}^2 } \end{aligned}$$
(8)

are calculated. To further analyze the distribution of the error, the 5th and 95th percentiles are calculated. The Kolmogorov–Smirnov test is performed to test whether the distributions of the HRV parameter error depend on the distribution of the noise. In our analysis, we consider p-values \(p<0.05\) to be statistically significant. Finally, to allow for a comparison of the noise sensitivity of the different HRV metrics, we calculate the group mean of the ground truth of all n windows,

$$\begin{aligned} {\bar{x}}_p = \frac{1}{n} \sum _{i=1}^n x_{i,p}. \end{aligned}$$
(9)
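For reference, the error metrics defined above can be computed as in the sketch below; the function and variable names are illustrative, and the two-sample Kolmogorov–Smirnov test from SciPy is used for the distribution comparison.

```python
# Sketch of the error metrics of Eqs. (5)-(9) for one HRV parameter, given the
# ground-truth values x and the values x_hat recomputed after adding noise.
import numpy as np
from scipy import stats

def error_metrics(x, x_hat):
    e = x_hat - x                               # Eq. (5)
    return {
        "bias": e.mean(),                       # Eq. (6)
        "mae": np.abs(e).mean(),                # Eq. (7)
        "rmse": np.sqrt((e ** 2).mean()),       # Eq. (8)
        "p05": np.percentile(e, 5),
        "p95": np.percentile(e, 95),
        "group_mean": x.mean(),                 # Eq. (9), used for normalization
    }

def error_distributions_differ(e_noise_a, e_noise_b, alpha=0.05):
    """Two-sample KS test between the errors obtained with two noise types."""
    return stats.ks_2samp(e_noise_a, e_noise_b).pvalue < alpha
```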

Next, we fit a linear function to determine the dependency of bias, MAE, and RMSE on \(\sigma \),

$$\begin{aligned} \Delta _p&= \alpha _{\text {bias}}/100\% \cdot \sigma \cdot {\bar{x}}_p, \end{aligned}$$
(10)
$$\begin{aligned} \text {MAE}_p&= \alpha _{\text {MAE}}/100\% \cdot \sigma \cdot {\bar{x}}_p, \end{aligned}$$
(11)
$$\begin{aligned} \text {RMSE}_p&= \alpha _ { \text {RMSE}}/100\% \cdot \sigma \cdot {\bar{x}}_p. \end{aligned}$$
(12)
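The fitting factors \(\alpha \) of Eqs. (10)–(12) can be obtained with a least-squares fit constrained through the origin. The sketch below is one possible implementation, not the exact code used here; the Pearson correlation reported in Table 2 as a goodness-of-fit measure is computed alongside.

```python
# Sketch of the constrained linear fit of Eqs. (10)-(12): bias/MAE/RMSE are
# modeled as alpha/100% * sigma * group_mean, i.e. a line through the origin.
import numpy as np
from scipy import stats

def fit_alpha(sigmas_ms, metric_values, group_mean):
    """Return alpha in %/ms and the Pearson r of metric vs. sigma."""
    s = np.asarray(sigmas_ms, dtype=float)
    y = np.asarray(metric_values, dtype=float)
    slope = (s @ y) / (s @ s)                  # argmin_k sum (y - k*s)^2
    alpha = 100.0 * slope / group_mean         # normalized slope in percent per ms
    r = stats.pearsonr(s, y)[0]                # goodness of the linear relation
    return alpha, r

# e.g. RMSE of SDNN measured at sigma = 1..10 ms, with group mean 44.43 ms:
# alpha_rmse, r = fit_alpha(range(1, 11), rmse_per_sigma, group_mean=44.43)
```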

Results

Kolmogorov–Smirnov test

Figure 2 shows the results of the Kolmogorov–Smirnov test (KS test) when comparing the Gaussian distribution and the t-distribution (first row), the t-distribution and the uniform distribution (second row), and the Gaussian and the uniform distribution (third row), for noise levels of \(\sigma = 5\) ms and \(\sigma = 7\) ms.

Figure 2. Kolmogorov–Smirnov test comparing noise of Gaussian distribution, t-distribution, and uniform distribution for two different levels of noise, \(\sigma =5\) ms (top graph) and \(\sigma =7\) ms (bottom graph). Red color indicates a significant difference.

First, taking a p-value of 0.05 to indicate statistical significance, we see that the t-distribution results in a significantly different distribution of errors for the majority of HRV metrics when compared to the uniform distribution or the Gaussian distribution. This effect is more pronounced at the higher noise level. At the same time, when comparing the Gaussian and the uniform distribution, a significant difference at \(\sigma =7\) ms could be observed for only a few metrics (MeanRR, NN50, pNN50, and SampEn). Hence, we only examine the t-distribution and the Gaussian distribution in the following.

Error vs. sigma (mean, 5th percentile and 95th percentile)

Figure 3. Evolution of the error over the level of noise in terms of mean, 5th, and 95th percentile for all HRV metrics. The blue lines show the evolution for t-distributed noise, the red lines for Gaussian noise. The unit of the error is given in each panel title.

In Fig. 3, we plot the systematic bias, the 5th percentile, and the 95th percentile for all HRV metrics over \(\sigma \). The gray shaded area marks the region enclosed by the 5th and 95th percentiles when Gaussian noise is added.

As expected, the absolute error of all parameters, indicated by the shaded area, increases with \(\sigma \). Additionally, with increasing \(\sigma \), almost all parameters show a growing systematic bias with either a positive or a negative sign, i.e. almost all parameters are either systematically over- or underestimated when heartbeat locations are noisy. Unsurprisingly, only the mean heart rate and the mean RR interval do not show a systematic bias. Also, although the KS test revealed statistically significant differences between the error distributions obtained with t-distributed and Gaussian noise, only small differences can be made out for most parameters. However, stark differences are obvious in NN50, pNN50, TINN, SI, and all RPA metrics.

Dependency of the relative error on \(\sigma \)

As the previous analysis has demonstrated, all parameters are influenced in terms of absolute error by the addition of noise, and almost all parameters show an increase in systematic bias. Table 2 shows the distribution of the ground truth data for all HRV metrics in terms of mean, 5th percentile, and 95th percentile. Columns 5 to 7 show the results of the linear fit as described in Eqs. (10)–(12), i.e. the linear increase of bias, MAE, and RMSE, respectively, as a function of \(\sigma \). The table is sorted in ascending order of \(\alpha _{\textrm{RMSE}}\) (see Eq. 12). To visualize the dependency of the normalized error, Fig. 4 shows \(\alpha _{\textrm{bias}}\) over \(\alpha _{\textrm{RMSE}}\) for all HRV metrics.

Table 2 Distribution of all HRV metrics in terms of mean, 5th, and 95th percentile as well as the linear fitting factors \(\alpha _{\textrm{bias}}\), \(\alpha _{\textrm{MABS}}\), and \(\alpha _{\textrm{RMSE}}\) with respective Pearson correlation r for the goodness of the linear fit. The values are sorted by \(\alpha _{\textrm{RMSE}}\) in ascending order, see also Fig. 4. Interesting observations are printed in bold.

Several interesting observations can be made. First, the sensitivity of the parameters varies dramatically, ranging from close to 0 to almost 9% per ms. As expected, SDNN is very robust: an increase of the noise standard deviation by 1 ms increases the RMSE by only 0.6% relative to the group mean. This robustness is even more pronounced for absolute LF power (0.2% RMSE per ms). Particularly problematic is the LF/HF ratio, which is commonly used to assess sympathovagal balance but exhibits strong underestimation as well as a large RMSE. This is consistent with the observation that normalized LF power is underestimated, while normalized HF power is overestimated. Interestingly, Sample Entropy is also quite sensitive (\(\sim \) 2%/ms for bias/MAE/RMSE). Similarly, although often used, pNN50 and NN50 show overestimation in the range of 2%/ms and an increase in RMSE in the range of 3%/ms.
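To make the normalization concrete, consider SDNN, whose group mean in this cohort is 44.43 ms (see the Discussion). With \(\alpha _{\textrm{RMSE}}\approx 0.6\,\%/\)ms, Eq. (12) gives for a noise level of \(\sigma = 5\) ms roughly

$$\begin{aligned} \text {RMSE}_{\text {SDNN}} \approx \frac{0.6\,\%}{\text {ms}\cdot 100\,\%} \cdot 5\,\text {ms} \cdot 44.43\,\text {ms} \approx 1.3\,\text {ms}, \end{aligned}$$

i.e. about 3% of the typical SDNN value, whereas the same calculation with \(\alpha _{\textrm{RMSE}}\approx 3\,\%/\)ms for pNN50 already yields roughly 15% of its group mean.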

Bland–Altman analysis

We have seen in the previous section that the noise may create a systematic bias. It remains to be analyzed whether this bias is independent of the actual value of the parameter. If that were the case, the Bland–Altman (BA) plots would show point clouds scattered symmetrically around a horizontal line, i.e. a line parallel to the x-axis. In the following, we present the BA plots for Gaussian noise at a level of \(\sigma = 10\) ms. Moreover, we give the Pearson correlation coefficient r between the average \(a_{i,p}\) (x-axis) and the difference \(e_{i,p}\) (y-axis). In a standard BA plot, usually a small random subset of the data is plotted to avoid clutter. Here, we instead color-code the individual points to convey the point density (Fig. 5).
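A density-coded BA plot of this kind can be produced, for example, as sketched below; the kernel density estimate used for the coloring and the plotting details are illustrative choices, not the exact implementation behind Fig. 5. For the full \(n=15{,}000\) segments, a 2-D histogram is a faster alternative to the kernel density estimate.

```python
# Sketch of a density-coded Bland-Altman plot for one HRV parameter
# (x-axis: average of ground truth and estimate, y-axis: error).
import numpy as np
import matplotlib.pyplot as plt
from scipy import stats

def bland_altman_density(x, x_hat, label):
    a = (x_hat + x) / 2.0                       # average of ground truth and estimate
    e = x_hat - x                               # error, Eq. (5)
    pts = np.vstack([a, e])
    density = stats.gaussian_kde(pts)(pts)      # point density for color-coding
    r = stats.pearsonr(a, e)[0]                 # value-dependence of the bias
    order = density.argsort()                   # draw densest points on top
    plt.scatter(a[order], e[order], c=density[order], s=4, cmap="viridis")
    plt.axhline(e.mean(), color="k", lw=1)      # systematic bias
    plt.xlabel(f"average {label}")
    plt.ylabel(f"error {label}")
    plt.title(f"{label}, r = {r:.2f}")
    plt.colorbar(label="point density")
    plt.show()
```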

Figure 4. Visualization of \(\alpha _{\textrm{RMSE}}\) over \(\alpha _{\textrm{bias}}\) for all HRV metrics, see also Table 2.

Figure 5. Bland–Altman plots for all HRV metrics (Gaussian noise, \(\sigma =10\) ms). To avoid clutter, the point density is color-coded: isolated points are darker, points with many neighbouring points are brighter.

Discussion

We know from system theory that a linear operation applied to Gaussian noise will result in Gaussian noise; only its mean and standard deviation may be altered. For the general case of nonlinear operations and other noise distributions, no straightforward general description can be given. Instead, an extensive Monte Carlo-type analysis such as the one presented here can be used. In this mathematical sense, almost all HRV metrics analyzed here involve nonlinear calculations. Hence, it is not surprising that, as all figures and tables show, the commonly used HRV metrics vary greatly in their behaviour under artificially added noise.

From Fig. 2 we can see that the effect of the noise distribution on the HRV metrics is mostly negligible when either uniform or Gaussian noise is used, with the exception of NN50 and pNN50. As argued above, uniform noise applied to the locations of individual heartbeats will result in a triangular noise distribution in the intervals. A visual inspection of Fig. 1 shows the similarity of the triangular distribution and the Gaussian distribution and makes this result intuitively plausible. We do, however, learn that the heavy-tailed t-distribution results in significant differences for almost all parameters. Again, this is plausible, as the t-distribution has a higher probability of generating outliers than the Gaussian distribution (and, obviously, the uniform distribution). Still, statistical significance does not give information about the effect size. Hence, if we look at Fig. 3, we learn that for most parameters the distribution of the error is very similar whether Gaussian or t-distributed noise is added. However, notable exceptions do exist, for example TINN, NN50, pNN50, and SampEn. Hence, we would argue that, as a first step, analyzing an HRV metric’s sensitivity to noise can be achieved using Gaussian noise. For an in-depth analysis, however, it makes sense to determine the actual distribution of noise generated by the measurement system (measurement modality and beat detection algorithm) to be used and to quantify its impact on the HRV metric in question.

We can see from the BA plots in Fig. 5 that, in addition to an offset, a systematic, value-dependent error is introduced for most parameters, with very few exceptions. Again, the effect varies from parameter to parameter. For example, for SDNN and RMSSD, small values are systematically overestimated, as shown by \(r \approx 0.8\). Note that we learned from Table 2 that, in general, the bias introduced by noise is relatively small for SDNN, emphasizing that the main source of overestimation stems from comparatively small absolute parameter values. On the other hand, we could also see in Table 2 that, for example, absolute HF power is systematically overestimated with a linear factor approximately as large as that of SDNN. However, the BA plot shows that this overestimation manifests as a relatively constant offset and is not correlated with the ground truth value of absolute HF power. No correlation is found for absolute LF power and total power either. Again, this is plausible, as the addition of noise will always add signal energy, which is what these parameters measure. A common strategy for compensating uncertainty is to average over several segments for which HRV is calculated. However, due to the significant bias in most of the parameters, averaging is of limited use as a tool for error reduction. The relative power metrics show a different behavior. This can be explained by the fact that additive noise on beat locations particularly influences the higher frequencies of the beat-to-beat intervals, as the intervals are calculated via differentiation, which amplifies high frequencies. Generally, HRV parameters that reflect higher frequencies perform worse with respect to bias and RMSE. Besides the differentiation, the noise applied here is local and statistically independent between different time points, which reflects the type of errors expected from the processing pipeline. Notably, lower-frequency errors that are dependent over multiple beats can appear for specific choices of algorithms and modalities (e.g., respiratory amplitude modulation in PPG), as described in16. In conclusion, we believe that it is important for HRV analysis in general to closely examine the distribution of the parameters instead of comparing only mean/median values. Also, one has to keep in mind that different cohorts may have different baseline distributions, and hence the impact of noise may be an important factor.

In terms of individual parameters, several interesting observations were made. For example, NN50/pNN50 show inferior results in terms of bias, RMSE, sensitivity to the noise distribution, and dependency of the noise influence on the ground truth value (Fig. 5). These findings align with studies that found pNN50 to be sensitive to missing beats14. Hence, we would argue that this simple, seemingly robust parameter should be used with caution, in particular when the baseline values are expected to be low (e.g., in older subjects and in patients under physiological stress, for example due to infection). Another “underperformer” is the LF/HF ratio. It can be expected that this parameter is very sensitive to noise, as its derivation involves the ratio of two values that are themselves sensitive to noise. Nevertheless, we find it somewhat surprising how poorly this parameter performs in terms of bias and absolute error/RMSE compared to the other parameters (Fig. 4). LF/HF is a popular parameter, and we have used it in our previous works as well. In light of our current findings, we suggest exercising extreme caution when using LF/HF in future studies, as small differences in noise between the groups to be distinguished may result in severe differences that have nothing to do with the underlying physiology. The LF power in normalized units is less sensitive to noise and should be preferred over the LF/HF ratio in studies that wish to use frequency-domain HRV analysis to assess the sympathovagal balance of the autonomic nervous system.

In Table 2 we present the ground truth distribution of the HRV metrics we analyze. Given these values and the fact that the “Autonomic Aging” dataset contains data from young to old individuals of both sexes, we believe we cover a reasonable range of values and our results should generalize well. However, one hallmark of the database is that the subjects were “rigorously screened” to ensure they were healthy and that the measurements were performed in a resting condition while maintaining wakefulness. Hence, different results may be obtained if data from severely diseased and/or non-resting individuals are used, particularly if their baseline values fall completely outside the ranges analyzed here. Therefore, to make sure our results are applicable to a certain study, one should first check whether the obtained values fall within the ranges of the present study. For certain HRV parameters (especially frequency-domain variables), implementation details might also change the results slightly, and pre-processing of the intervals plays a large role.

Let us emphasize again that to make the influence of noise on RMSE and bias comparable, one has to employ some sort of normalization. In this case, the error was normalized by the population mean (see Table 2) of a broad population. Additionally, our Bland–Altman analysis shows that the over- or underestimation of parameters may depend heavily on the ground truth value (e.g., small SDNN values may be over-estimated). As a consequence, the sorting of the HRV metrics in Table 2 may differ if a specific population is analyzed. For instance, the population mean of SDNN was found to be 44.43 ms and that of HF power to be 1300.68 ms\(^2\) for this cohort of more than 1000 subjects. When analyzing the much smaller Fantasia dataset (20 subjects aged 20–34 years and 20 subjects aged 68–85 years), SDNN was similar (39.56 ms), while HF power was on average almost half of the value of the large cohort (702.47 ms\(^2\)). Hence, with respect to this population mean, HF power would be twice as sensitive. At the same time, note that the population mean only affects the normalized comparison. Still, even on the small dataset we observed the same general tendencies (small errors for MeanHR/RR and SDNN, larger errors for HF parameters compared to LF parameters, worst performance for RPA Lmax, NN50, and LF/HF).

In order to evaluate in which error range a given sensor and method M falls, we suggest the following procedure: take multiple simultaneous measurements with the test device and a validated ECG patient monitor (sampling frequency of 1000 Hz or higher) for 5 min in at least 10 participants from varying age groups. Then compute the reference intervals with a validated QRS detector, verified by a trained cardiologist. Subtract the reference intervals from the intervals estimated by M and estimate the standard deviation of the differences. Based on this standard deviation and the reference values presented here, it can be checked whether particular HRV parameters are likely to be sufficiently robust.
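A minimal sketch of this check is given below. It assumes that the reference and test intervals have already been beat-aligned, and it uses the approximate \(\alpha _{\textrm{RMSE}}\) values quoted in the Results section as examples; the full list in Table 2 should be consulted for an actual assessment.

```python
# Sketch of the suggested validation procedure: estimate the interval-error
# standard deviation of a candidate method M against reference intervals from a
# validated ECG monitor, then project the expected relative RMSE of selected
# HRV parameters via Eq. (12).
import numpy as np

ALPHA_RMSE_PERCENT_PER_MS = {        # %/ms, approximate values from the Results
    "LF power (abs.)": 0.2,
    "SDNN": 0.6,
    "SampEn": 2.0,
    "pNN50": 3.0,
}

def expected_relative_rmse(reference_ibi_ms, test_ibi_ms):
    """Inputs: beat-aligned inter-beat intervals (ms) from the reference ECG
    and from method M, recorded simultaneously over the same 5-min window."""
    diff = np.asarray(test_ibi_ms, float) - np.asarray(reference_ibi_ms, float)
    sigma_ms = diff.std(ddof=1)      # empirical interval-error level of method M
    rel_rmse = {name: alpha * sigma_ms                  # expected RMSE in percent
                for name, alpha in ALPHA_RMSE_PERCENT_PER_MS.items()}
    return sigma_ms, rel_rmse

# e.g. an error level of sigma = 4 ms would imply an RMSE of roughly 2.4 % of
# the group mean for SDNN, but already ~12 % for pNN50.
```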

Conclusion

In this paper we analysed the effect of uncertainty in the temporal heartbeat location on various HRV parameters using a Monte Carlo simulation approach. The results showed that there are large differences between the HRV parameters in their robustness against errors in the heartbeat interval tachogram: the least tolerant were the LF/HF ratio, pNN50, and NN50, and the most tolerant were RPA DET, LF power, and SDNN. Three common noise distributions were evaluated in this study, where the error range was limited to relatively small values and the influence of systematic errors spanning multiple beats was neglected. Future research should evaluate in particular the noise profiles (distributions and amplitudes) commonly seen in novel measurement modalities other than ECG, e.g., PPG, remote imaging PPG, and seismo- and ballistocardiography, which commonly have larger uncertainty and/or might be influenced by systematic, physiology-related errors. Biases in HRV parameters such as SDNN could possibly be partially compensated algorithmically if the noise distribution is known. On the other hand, simple averaging of HRV parameters over several analysis segments does not account for systematic bias. Most importantly, researchers should be mindful of the differing robustness of HRV parameters to noise in the beat-to-beat interval data. To bring HRV closer to clinical practice, application-specific recommendations should be developed based on our findings.