Accuracy of heart rate variability estimated with reflective wrist-PPG in elderly vascular patients

Hoog Antink, Christoph; Mai, Yen; Peltokangas, Mikko; Leonhardt, Steffen; Oksala, Niku; Vehkaoja, Antti

doi:10.1038/s41598-021-87489-0

Article
Open access
Published: 14 April 2021

Volume 11, article number 8123, (2021)

Scientific Reports

Christoph Hoog Antink^1,2,
Yen Mai^3,4,
Mikko Peltokangas^3,4,
Steffen Leonhardt²,
Niku Oksala^3,4,5 &
…
Antti Vehkaoja^3,4,6

Abstract

Optical heart rate monitoring (OHR) with reflective wrist photoplethysmography is a technique mainly used in the wellness application domain for monitoring heart rate levels during exercise. In the absence of motion, the OHR technique is also able to estimate individual beat-to-beat intervals relatively well and can therefore also be used, for example, in monitoring of cardiac arrhythmias, stress, or sleep quality through heart rate variability (HRV) analysis. HRV analysis also has potential in monitoring the recovery of patients, e.g. after a medical intervention. However, in order to detect subtle changes, the calculated HRV parameters should be sufficiently accurate, and very few studies exist that assess the accuracy of OHR-derived HRV in non-healthy subjects. In this paper, we present a method to estimate beat-to-beat intervals (BBIs) from a reflective wrist PPG signal and evaluate the accuracy of the proposed method in estimating BBIs in a cross-sectional study with 29 hospitalized patients (mean age 70.6 years) in 24-h recordings performed after peripheral vascular surgery or endovascular interventions. Finally, we evaluate the accuracy of more than 30 commonly used HRV parameters and find that the accuracy of certain metrics, for example SDNN and triangular index, shown in the literature to be associated with the deterioration of the status of the patients during recovery from surgical intervention, could be adequate for patient monitoring. On the other hand, the parameters more affected by the high-frequency content of the HRV and especially the LF/HF-ratio should be used with caution.

Introduction

Unobtrusive continuous monitoring and automatic analysis of physiological variables is an emerging area that has the potential to improve the effectiveness of healthcare delivery by providing early indications of changes in the patients’ status, whether they are being treated in a hospital or staying at home. However, in order to be usable in practice, the data used by the automatic analysis algorithms needs to be reliable and accurate.

Reflective photoplethysmography (PPG) measured with a wrist-worn device, also called optical heart rate (OHR) monitoring, is a technique traditionally used mainly in the wellness application domain for monitoring heart rate level during exercise. In the absence of motion, the OHR technique is also able to estimate individual beat-to-beat intervals (BBIs) relatively accurately and has therefore recently emerged as an unobtrusive method for detecting cardiac arrhythmias^1,2,3. Besides arrhythmias, the performance of wrist-worn OHR monitoring has also been studied, for example, in the assessment of psychological stress⁴ and in sleep staging through heart rate variability (HRV) and movement analysis in healthy subjects^5,6. Studies evaluating the performance in beat-to-beat heart rate monitoring and accuracy of HRV parameters have usually been performed in controlled situations during selected activities or at rest as in^7,8 or as reviewed in⁹. Further, studies evaluating the applicability of wrist-worn OHR technology in estimating HRV in hospitalised patients have rarely been reported.

A recent study reported poor performance of HRV estimation with a commercial wrist OHR device in uncontrolled conditions¹⁰, highlighting the need for improvements in the measurement technology or signal analysis methods. The most significant limitation of OHR technology is its high sensitivity to movement artefacts, which poses challenges for the signal processing algorithms to choose only those heartbeats or heartbeat intervals that are not affected by movements. In addition, factors such as poor superficial blood perfusion and skin color affect the quality of the obtained signal and consequently, the accuracy of beat-to-beat heart rate^11,12.

The predictive value of HRV parameters measured through gold standard electrocardiography (ECG) in identifying patients at risk of post-surgical complications has been studied in various patient groups such as in hip fracture¹³, digestive surgery¹⁴, abdominal aortic surgery¹⁵, and cardiac surgery patients. In cardiac patients, a popular topic has also been the evaluation of the relation of long-term mortality and HRV after myocardial infarction. A number of studies focusing on cardiac patients and HRV have been aggregated in the reviews by Nenna et al.¹⁶ and Huikuri and Stein¹⁷. HRV analysis has also been proposed for early detection of infections related to communicable diseases¹⁸, development of septic shock¹⁹, and also for several other purposes related to anesthesia and intensive care²⁰.

A large number of HRV parameters have been found in the aforementioned studies to indicate a high risk of perioperative complications and post-surgery mortality. Short descriptions of these parameters and their abbreviations are found in Table 1. Ernst et al. found significant associations between decreased RMSSD and total power of pre-operatively measured HRV and an increased probability of post-operative complications, as well as between decreased VLF power and HF/LF-ratio and post-operative infections, in hip fracture patients¹³. They did not, however, find an association between pre-operative SDNN and post-operative complications. On the other hand, SDNN and HRV triangular index measured on post-operative day 1 were found to be statistically significantly lower in digestive surgery patients developing post-operative complications¹⁴. Both Nenna et al. as well as Huikuri and Stein identified scaling exponent \(\alpha _1\) of detrended fluctuation analysis as a non-linear HRV parameter with high prognostic value in predicting long-term cardiac mortality^16,17. This parameter was also found to be the best predictor of complicated recovery after coronary artery bypass grafting²¹. In the studies that have also evaluated the heart rate level as a potential indicator, increased post-operative heart rate has been found to predict or be associated with post-operative complications.

In all studies presented above, HRV analysis for monitoring of post-surgery patients and detecting complications has been performed using ECG. However, in order to be suitable for continuous monitoring for the duration of several days, the monitoring method should be as unobtrusive as possible. ECG electrodes are usually attached to the body with adhesives and may cause medical adhesive related skin injuries^22,23, especially if worn for long periods of time, and are thus not a suitable approach.

The OHR technology would provide a convenient and unobtrusive solution for the task. However, typically the absolute amount of heart rate variation is decreased in aged and fragile patients²⁴. As these patients exhibit the highest risk of developing complications, even better accuracy is needed from the measurement method in order to obtain adequate relative accuracy for the HRV parameters of interest. Inaccuracies in estimated heartbeat intervals also affect different HRV parameters differently²⁵. Therefore, it is an interesting question whether those HRV parameters that have earlier been found to predict the development of post-treatment complications, can be estimated with adequate accuracy with OHR technology. We hypothesize that not all commonly used HRV parameters can be estimated with the same accuracy. We further hypothesize that some parameters may exhibit error levels small enough to render them possible candidates for future studies.

We performed 24-h monitoring with 29 subjects who had undergone vascular surgery or endovascular treatment. Typically, these patients have several comorbidities such as diabetes, hypertension, dyslipidemia, coronary artery disease, cerebrovascular disease and may have reduced cardiac function. For all patients, long-term recordings (mean 17.72 h, range 4.64–22.96 h) including day- and nighttime were acquired. The accuracy of a large set of HRV parameters was evaluated based on 5-min segments of BBIs estimated from reflective wrist PPG and reference ECG data.

Methods

Heartbeat interval and data quality estimation algorithm

For BBI estimation, we used the continuous local interval estimation (CLIE) algorithm²⁶ augmented with an iterative estimation approach (adaptive prior)^27,28. The algorithm was originally developed for BBI estimation from ballistocardiography (BCG) data, which is often noisy and exhibits changes in morphology, rendering standard peak-detection strategies suboptimal. In a previous study, we have shown that the methodology is suitable for accurate, unbiased estimation of BBIs from clean (finger clip) PPG²⁹. In short, intervals are estimated based on self-similarity of the underlying signal. For this, three estimators, namely short-term autocorrelation, maximum amplitude pairs, and mean absolute differences, are fused. Estimation is performed iteratively, with the first iteration resulting in a prior signal that is used in the second pass. For details, the interested reader is referred to²⁸.
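To illustrate what interval estimation from self-similarity can look like, the following minimal Python sketch implements a simplified variant of only one of the three estimators: a short-term correlation between the signal segments to the left and to the right of a chosen sample. It is not the CLIE implementation of the cited references. The fusion with the other two estimators and the adaptive prior are omitted, and the function name `local_interval`, the lag range of 0.3 to 1.5 s and the use of the correlation peak as a crude stand-in for a quality value are assumptions made for illustration.

```python
import math


def local_interval(signal, center, fs, min_s=0.3, max_s=1.5):
    """Estimate the local beat-to-beat interval around sample `center`.

    For every candidate lag, the `lag` samples before `center` are compared
    with the `lag` samples after it using the Pearson correlation. The lag
    with the highest correlation is returned in seconds, together with that
    correlation as a simple self-similarity score.
    """
    best_lag, best_r = None, -2.0
    for lag in range(int(min_s * fs), int(max_s * fs) + 1):
        if center - lag < 0 or center + lag > len(signal):
            break  # the window would run past the signal boundaries
        left = signal[center - lag:center]
        right = signal[center:center + lag]
        mean_l = sum(left) / lag
        mean_r = sum(right) / lag
        cov = sum((a - mean_l) * (b - mean_r) for a, b in zip(left, right))
        var_l = sum((a - mean_l) ** 2 for a in left)
        var_r = sum((b - mean_r) ** 2 for b in right)
        if var_l == 0.0 or var_r == 0.0:
            continue  # flat segment, correlation is undefined
        r = cov / math.sqrt(var_l * var_r)
        if r > best_r:
            best_lag, best_r = lag, r
    if best_lag is None:
        return None, 0.0
    return best_lag / fs, best_r


# Synthetic pulse-like signal: period of 160 samples (0.8 s at 200 Hz)
# with an added second harmonic.
fs = 200
period = 160
ppg = [math.sin(2 * math.pi * n / period) + 0.3 * math.sin(4 * math.pi * n / period)
       for n in range(4 * fs)]
interval_s, score = local_interval(ppg, center=2 * fs, fs=fs)
```

On this noise-free synthetic signal the estimate coincides with the true period. On real wrist PPG, a single estimator of this kind is expected to be far less robust, which is the motivation for fusing several estimators.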

As in the original approach, in addition to the estimated interval, a quality metric q is reported for each estimated interval that quantifies the level of self-similarity detected. In Fig. 1, three signal excerpts are shown: while the first row presents an excerpt of a signal with a mean quality of 0.2, the second row shows an excerpt with \({\bar{q}} = 0.3\), and the last with \({\bar{q}} = 0.4\).

If a technology is used to detect slow changes in health status, it is often less critical to get continuous estimations, but more important that the accuracy of those is sufficient for the given task. Thus, a common approach is to exclude data based on a quality metric, thereby creating a tradeoff between the so-called (temporal) coverage and the error. In the particular case of HRV estimation, measurement protocols usually require the subject to be as calm as possible. As motion artifacts are the main source of error, it is particularly advantageous to exclude segments with low quality in such a scenario. In this work, four different quality metrics were used. First, we used a fixed threshold of \(q_{\mathrm{th}}=0.3\) to only accept intervals with \(q > q_{\mathrm{th}}\) in the subsequent analysis. In addition, for each 5-min window, the following rules were applied:

Only if the median quality \(q_{\mathrm{med}}\) of the window is above a threshold value \(q_{\mathrm{th, med}}\), the window is accepted: \(q_{\mathrm{med}} > q_{\mathrm{th, med}}\)
Only if the quality inside the window exhibits small variability (i.e. a small relative standard deviation), the window is accepted: \(\mathrm{SD}(q) / {\overline{q}} < q_{\mathrm{th, var}}\)
Only if the ratio of accepted intervals inside the window \(N_{\mathrm{OK}}\) relative to the estimated mean interval \(\bar{\mathrm{BBI}}\) is large enough, the window is accepted: \(N_{\mathrm{OK}} / \bar{\mathrm{BBI}} > R_{\mathrm{th, OK}}\)

The threshold values can be chosen by the user based on the priorities of the application, i.e. large data coverage or high accuracy (as will be shown in the results section, Figs. 4 and 5). The overall process is visualized in Fig. 2.
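The interval threshold and the three window rules can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' code: the function name `accept_window` is hypothetical, the mean BBI is assumed to be given in seconds and computed over the accepted intervals, the median and the relative standard deviation are assumed to be computed over all quality values of the window, and the sample standard deviation is used. The default thresholds are the values reported in the results section.

```python
import statistics


def accept_window(bbi_s, q, q_th=0.3, q_th_med=0.46, q_th_var=2.11, r_th_ok=171.22):
    """Apply the per-interval threshold and the three per-window rules.

    bbi_s: estimated beat-to-beat intervals of one 5-min window (seconds)
    q:     quality value of each interval, same length as bbi_s
    Returns (accepted, kept_intervals).
    """
    # Per-interval rule: keep only intervals with q > q_th.
    kept = [b for b, qi in zip(bbi_s, q) if qi > q_th]
    if len(kept) < 2 or len(q) < 2 or statistics.mean(q) == 0:
        return False, kept
    # Rule 1: median quality of the window must exceed q_th_med.
    rule_median = statistics.median(q) > q_th_med
    # Rule 2: relative standard deviation of the quality must be small.
    rule_variability = statistics.stdev(q) / statistics.mean(q) < q_th_var
    # Rule 3: number of accepted intervals relative to the mean interval.
    rule_ratio = len(kept) / statistics.mean(kept) > r_th_ok
    return rule_median and rule_variability and rule_ratio, kept


# A calm window with consistently high quality is accepted ...
ok, kept = accept_window([0.98, 1.02] * 150, [0.6, 0.7] * 150)
# ... while a window dominated by low-quality intervals is rejected.
bad_ok, bad_kept = accept_window([1.0] * 300, [0.6] * 100 + [0.1] * 200)
```

Raising `q_th_med` or `r_th_ok`, or lowering `q_th_var`, rejects more windows, which is the coverage versus accuracy tradeoff described above.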

HRV parameters

To calculate HRV parameters from both the reference ECG and the PPG-derived BBIs, the analysis software “Kubios HRV premium” by Kubios Oy, Finland, is used. Kubios allows the calculation of more than 30 HRV parameters³⁰ and has been used in more than 4500 scientific publications according to the manufacturer. In this study, the parameters presented in Table 1 were evaluated.

Table 1 HRV parameters calculated using the “Kubios HRV Premium” software.

In particular, the analysis includes several time-domain parameters (SDNN, SDHR, RMSSD), frequency-domain parameters associated with low-frequency components (LF Abs., LF Log.), frequency-domain parameters associated with high-frequency components (HF Abs., HF Log.), as well as relative (Poincaré \(\hbox {SD}_2\)/\(\hbox {SD}_1\), LF/HF), statistical (NN50, pNN50), and nonlinear (ApEn, SampEn, PC \(\hbox {SD}_{1,2}\), DFA \(\alpha _{1,2}\)) parameters. The 5-min analysis window was shifted in steps of 60 seconds and HRV parameter calculation repeated. HRV parameters estimated with Kubios from PPG-based BBI and corresponding ECG-based reference RR-interval windows were exported and comparisons were performed in MATLAB.
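For readers unfamiliar with the time-domain and statistical parameters, the sketch below shows their textbook definitions together with the 5-min window shifted in 60-s steps. It is only an illustration and does not reproduce Kubios, which applies additional processing. The function names are hypothetical, the sample standard deviation is assumed for SDNN, and pNN50 is assumed to be normalized by the number of successive differences.

```python
import math
import statistics


def time_domain_hrv(bbi_ms):
    """SDNN, RMSSD, NN50 and pNN50 from a list of intervals in milliseconds."""
    diffs = [b - a for a, b in zip(bbi_ms, bbi_ms[1:])]
    nn50 = sum(1 for d in diffs if abs(d) > 50.0)
    return {
        "SDNN": statistics.stdev(bbi_ms),
        "RMSSD": math.sqrt(sum(d * d for d in diffs) / len(diffs)),
        "NN50": nn50,
        "pNN50": 100.0 * nn50 / len(diffs),
    }


def sliding_windows(beat_times_s, bbi_ms, length_s=300.0, step_s=60.0):
    """Yield (start_time, intervals) for windows of length_s shifted by step_s."""
    start = beat_times_s[0]
    while start + length_s <= beat_times_s[-1]:
        yield start, [b for t, b in zip(beat_times_s, bbi_ms)
                      if start <= t < start + length_s]
        start += step_s


# Alternating intervals give successive differences of +/- 60 ms.
params = time_domain_hrv([800.0, 860.0, 800.0, 860.0, 800.0])

# Ten minutes of one beat per second yield six overlapping 5-min windows.
beat_times = [float(i) for i in range(601)]
windows = list(sliding_windows(beat_times, [1000.0] * 601))
```

Because RMSSD, NN50 and pNN50 are built on successive differences, a single badly estimated interval changes two differences at once, which is consistent with the larger errors reported for these parameters in the results.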

For each 5-min window of the data of each subject, a ground-truth HRV parameter and an estimated value exist. For evaluation, we performed per-subject analyses (Figs. 6 and 8) as well as combined gross analyses (all other figures and tables). In the per-subject analysis, error metrics are calculated for each subject individually. This implies that the relative error is calculated by normalization with the average of the ground truth of all windows of that subject. In the gross analysis, all data of all subjects are aggregated. Here, the relative metrics are based on the average ground truth values of all windows of all subjects. Thus, the gross analysis is biased towards subjects with more accepted datapoints. By comparing gross and per-subject analysis, information about inter-subject variability is obtained.
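The difference between the two normalizations can be made concrete with a small, entirely synthetic example. The helper names and the numbers are hypothetical and chosen only to show the effect. Both subjects have the same absolute error, but the subject with low parameter values and few accepted windows dominates the per-subject view while barely influencing the gross result.

```python
def relative_mae(truth, estimate):
    """Mean absolute error normalized by the mean ground truth, in percent."""
    mae = sum(abs(e - t) for t, e in zip(truth, estimate)) / len(truth)
    return 100.0 * mae / (sum(truth) / len(truth))


def per_subject_and_gross(data):
    """data maps subject id -> (truth, estimate), with one value per window."""
    per_subject = {s: relative_mae(t, e) for s, (t, e) in data.items()}
    all_truth = [v for t, _ in data.values() for v in t]
    all_estimate = [v for _, e in data.values() for v in e]
    return per_subject, relative_mae(all_truth, all_estimate)


data = {
    "A": ([10.0, 10.0], [11.0, 11.0]),   # low values, few accepted windows
    "B": ([100.0] * 8, [101.0] * 8),     # high values, many accepted windows
}
per_subject, gross = per_subject_and_gross(data)
# per_subject["A"] is 10 %, per_subject["B"] is 1 %, gross is about 1.2 %
```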

To provide information on the distribution of the error, several error metrics are used. For all metrics, the difference of estimation and ground truth (\(\Delta =\) estimation − ground truth) of the respective HRV parameter forms the basis. The “Bias” is defined as the average of all \(\Delta \), i.e. it indicates over- and under-estimation in absolute units. The relative bias “Bias (%)” is the bias normalized by the average of the ground truth. “P05” and “P95” are defined as the 5th and 95th percentile of \(\Delta \) and give an indication of the spread of the error in absolute units. Similarly, the standard deviation of \(\Delta \), “SD” is a metric for the spread of the error assuming a Gaussian distribution. The mean absolute error “MAE” is defined as the mean of the absolute values of \(\Delta \), while the root mean square error “RMSE” is determined by calculating the root of the mean value after squaring all individual \(\Delta \). As the RMSE is more sensitive to outliers, the comparison of MAE and RMSE gives information about their presence. The relative quantities “MAE (%)” and “RMSE (%)” are calculated analogously to the relative bias, with the selection of the normalization depending on per-subject or gross analysis as described above. Finally, we analyze the “Relative Error”, which is \(\Delta \) divided by the mean of the ground truth. While “Bias (%)” is equivalent to the mean of “Relative Error”, we additionally provide median, 25th, and 75th percentile to obtain further information of the spread of the error in relative terms (Fig. 7).
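The error metrics defined above translate directly into code. The following sketch is an illustration with a hypothetical function name, not the MATLAB code used in the study. It assumes linearly interpolated percentiles and the sample standard deviation, and it needs at least two windows.

```python
import math
import statistics


def error_metrics(truth, estimate):
    """Error metrics for one HRV parameter over a set of windows."""
    delta = [e - t for t, e in zip(truth, estimate)]
    mean_truth = statistics.mean(truth)
    bias = statistics.mean(delta)
    mae = statistics.mean([abs(d) for d in delta])
    rmse = math.sqrt(statistics.mean([d * d for d in delta]))
    # 99 cut points; index 4 is the 5th and index 94 the 95th percentile.
    cuts = statistics.quantiles(delta, n=100, method="inclusive")
    return {
        "Bias": bias,
        "Bias (%)": 100.0 * bias / mean_truth,
        "P05": cuts[4],
        "P95": cuts[94],
        "SD": statistics.stdev(delta),
        "MAE": mae,
        "MAE (%)": 100.0 * mae / mean_truth,
        "RMSE": rmse,
        "RMSE (%)": 100.0 * rmse / mean_truth,
        "Relative Error (%)": [100.0 * d / mean_truth for d in delta],
    }


metrics = error_metrics([50.0, 60.0, 70.0, 80.0], [52.0, 59.0, 73.0, 80.0])
```

Passing the windows of a single subject gives the per-subject normalization, and passing the windows of all subjects gives the gross normalization. The RMSE can never be smaller than the MAE, so a large gap between the two points to outliers.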

Evaluation data

The patient recordings were performed at the vascular surgery ward of Tampere University Hospital between April and October 2018. Inclusion criteria for the study were at least 50 years of age and admission to peripheral arterial bypass operation, endarterectomy, aortic surgery, or carotid surgery. Patients with a cardiac pacemaker were excluded from the study. The study was a descriptive pilot study for gaining initial knowledge about the performance and suitability of new sensor technology for patient monitoring, and 30 successful patient recordings were determined to be a suitable sample size. Altogether 36 patients were recruited for the study, but seven subjects had to be discarded due to technical problems or due to short duration of the recording. In four cases the problem was with the reference device (poor quality ECG), in one case with the study device (recording had not started), and in two cases the patient was admitted to re-operation shortly after the beginning of the recording. Thus, the data of 29 postoperative vascular patients were included in the analysis. The subjects were monitored for approximately 24 h with a wrist-worn OHR prototype device manufactured by PulseOn Oy, Finland (Fig. 3). Long-term recordings were obtained to eliminate bias potentially arising from, for example, monitoring only sleeping subjects.

The PulseOn wrist device uses green color LEDs with peak wavelength of 573 nm and 25 Hz sampling rate. Before interval estimation, the PPG data were upsampled to 200 Hz using linear interpolation and bandpass-filtered using a 2nd-order Butterworth bandpass-filter with a passband of 0.5 to 15 Hz.
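The preprocessing chain can be sketched with the Python standard library alone. The code below is an illustration under assumptions, not the authors' implementation. "2nd-order" is interpreted here as a band-pass of total order two (a first-order Butterworth prototype transformed to a band-pass with the bilinear transform), the filter is applied in a single causal pass, and all function names are hypothetical. The source does not state whether zero-phase filtering was used.

```python
import math


def upsample_linear(x, fs_in, fs_out):
    """Linearly interpolate x (sampled at fs_in) onto a grid sampled at fs_out."""
    factor = fs_out / fs_in
    n_out = int((len(x) - 1) * factor) + 1
    y = []
    for k in range(n_out):
        pos = k / factor                   # position in input-sample units
        i = min(int(pos), len(x) - 2)      # left neighbour, clamped at the end
        frac = pos - i
        y.append(x[i] * (1.0 - frac) + x[i + 1] * frac)
    return y


def bandpass_coefficients(f_low, f_high, fs):
    """Band-pass of total order two from a first-order Butterworth prototype."""
    k = 2.0 * fs
    w1 = k * math.tan(math.pi * f_low / fs)    # pre-warped corner frequencies
    w2 = k * math.tan(math.pi * f_high / fs)
    bw, w0_sq = w2 - w1, w1 * w2
    a0 = k * k + bw * k + w0_sq
    b = [bw * k / a0, 0.0, -bw * k / a0]
    a = [1.0, (2.0 * w0_sq - 2.0 * k * k) / a0, (k * k - bw * k + w0_sq) / a0]
    return b, a


def iir_filter(b, a, x):
    """Causal direct-form I filtering with a single second-order section."""
    y, x1, x2, y1, y2 = [], 0.0, 0.0, 0.0, 0.0
    for xn in x:
        yn = b[0] * xn + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y


# Ten seconds of a 1.2 Hz pulse wave with a constant offset, sampled at 25 Hz.
fs_in, fs_out = 25, 200
raw = [5.0 + math.sin(2 * math.pi * 1.2 * n / fs_in) for n in range(10 * fs_in)]
upsampled = upsample_linear(raw, fs_in, fs_out)
b, a = bandpass_coefficients(0.5, 15.0, fs_out)
filtered = iir_filter(b, a, upsampled)
```

The numerator coefficients sum to zero, so the constant offset of the raw signal is removed once the start-up transient has decayed.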

The reference ECG was recorded with a Faros 360 five-lead Holter monitor manufactured by Bittium Biosignals using 1 kHz sampling frequency. Ambu Blue sensor L-00-S electrodes were used for the ECG recording. The average age of the subjects was 70.6 years (SD: 8.5 years, range 50–87 years). Seven of the subjects were female. The subjects had undergone different vascular and endovascular procedures such as lower limb percutaneous transluminal angioplasty and/or stenting, abdominal aortic aneurysm endovascular repair, carotid or femoral artery endarterectomy, or femoropopliteal bypass surgery. Approval for this study was obtained from the Regional Ethical Committee of Pirkanmaa Hospital District (R17027). Informed consent was obtained from all subjects. The guidelines of the Declaration of Helsinki were followed in the study. The study was registered at ClinicalTrials.gov, identifier NCT03572751.

In this study, ECG and PPG were recorded with two independent, unconnected wearable devices. Devices that have independent system clocks and are not synchronized can exhibit drifts in sampling rate³¹. In long-term recordings, even small drifts can amount to large offsets, which would lead to the comparison of non-corresponding windows in this study. Thus, we adopted the same alignment process as in our previous work on BCG data²⁸: The algorithm calculates a time-varying offset that minimizes the median BBI estimation error in a moving window with the size of 1500 beats, which corresponds to approximately 25 min. The offset-vector was additionally median filtered with the same filter size to remove outliers. Finally, an offset-vector is obtained that ensures that each window of the PPG data is compared to the matching ECG window.
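The core of such an alignment, finding the offset that minimizes the median BBI error within one window, can be sketched as a simple grid search. This is a simplified illustration and not the algorithm of the cited work: the function names, the candidate grid, the linear interpolation of the BBI series and the sign convention (the offset is added to the reference beat times) are assumptions. A time-varying offset would follow from repeating the search in a moving window and median filtering the resulting offset vector.

```python
import bisect
import math
import statistics


def interp_bbi(times, bbis, t):
    """Linearly interpolate a BBI series (beat times, intervals) at time t."""
    if t <= times[0]:
        return bbis[0]
    if t >= times[-1]:
        return bbis[-1]
    i = bisect.bisect_right(times, t) - 1
    frac = (t - times[i]) / (times[i + 1] - times[i])
    return bbis[i] + frac * (bbis[i + 1] - bbis[i])


def best_offset(ref_times, ref_bbi, ppg_times, ppg_bbi, candidates):
    """Return the candidate offset (s) with the smallest median absolute BBI error."""
    best, best_err = None, float("inf")
    for offset in candidates:
        err = statistics.median(
            [abs(interp_bbi(ppg_times, ppg_bbi, t + offset) - b)
             for t, b in zip(ref_times, ref_bbi)])
        if err < best_err:
            best, best_err = offset, err
    return best, best_err


# Synthetic reference with varying intervals; the "PPG" clock runs 3 s ahead.
ref_bbi = [0.9 + 0.1 * math.sin(0.37 * i) + 0.05 * math.sin(1.3 * i)
           for i in range(200)]
ref_times = [0.0]
for interval in ref_bbi[:-1]:
    ref_times.append(ref_times[-1] + interval)
ppg_times = [t + 3.0 for t in ref_times]
candidates = [0.5 * k for k in range(-10, 11)]   # -5 s to +5 s in 0.5 s steps
offset, err = best_offset(ref_times, ref_bbi, ppg_times, ref_bbi, candidates)
```

Using the median rather than the mean makes the search tolerant of the sporadic large BBI estimation errors that are discussed in the results.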

Results and discussion

Figures 4 and 5 demonstrate the effect of the threshold-parameters on the coverage of accepted 5-min windows and the mean absolute error of the HRV parameter SDNN, respectively.

As expected, the coverage decreases monotonically with an increase in \(R_{\mathrm{th, OK}}\), an increase in \(q_{\mathrm{th, med}}\), and a decrease in \(q_{\mathrm{th, var}}\) (Fig. 4). The same general tendency holds for the estimation error. Fig. 5 shows the mean absolute error for SDNN as one example. Note that the distribution of the coverage is the same for all HRV parameters, whereas we observed varying dependencies on the three thresholding parameters for the different error metrics and the different HRV parameters (not shown). In the following, a target coverage of 30% was arbitrarily set. Out of the several combinations of thresholds that would result in this target coverage, the following set was chosen:

\(R_{\mathrm{th, OK}} = 171.22\)
\(q_{\mathrm{th, med}} = 0.46\)
\(q_{\mathrm{th, var}} = 2.11\)

This choice of thresholds is visualized with a red dot in Figs. 4 and 5. Note that while these parameters lead to an average coverage of 30 %, the coverage for individual subjects may vary. Further note that this choice is fixed for all following calculations and arbitrary in the sense that different combinations of thresholds would also lead to the same average coverage of 30% as can be inferred from Fig. 4. At the same time, it would lead to different error levels, as can be seen in Fig. 5. As a consequence, the arbitrarily chosen parameters mark the upper bounds of the errors of the individual HRV parameters.

Table 2 shows the numeric comparison of 38 different HRV parameters calculated for all accepted 5-min windows and aggregated for all 29 (out of the original 36) patients included in the analysis (i.e. the so-called “gross analysis”, see the “HRV parameters” section). In Fig. 6, per-subject results are presented as boxplots, where each datapoint represents the result for one individual subject. Note that the graph is clipped at 100 % for better readability.

Table 2 Accuracy of the estimated HRV parameters aggregated over all analyzed 5-min windows of all subjects.

Several observations can be made from the results. For one, the (relative) RMSE tends to be significantly higher than the (relative) MAE for many of the parameters, indicating the tendency for large outliers in the estimation results. Moreover, estimation quality varies greatly from patient to patient and strongly depends on the parameter. While mean, maximum, and minimum heart rate can be estimated with an average relative error below 1.2 %, other parameters show inferior results. In particular, the patient-wise relative error in NN50 and pNN50 has a 75th percentile at about 280 % (not shown in Fig. 6 due to clipping). For one, the nature of this parameter makes it relatively susceptible to outliers. For another, the population in this study group has extremely low NN50/pNN50 values close to zero. As can be seen in Table 2, the mean absolute error in pNN50 is only 1.55 percentage points (which would obviously still result in an infinite error for patients with a true pNN50 of 0 %).

Figure 7 visualizes bias and spread of the error aggregated over all subjects and normalized by the overall mean of the ground truth in percent. The bias is given as median and the spread as 25th/75th percentiles. It can be seen that the time-domain estimations (SDNN, SDHR) and the nonlinear Poincaré SD2 have been estimated with only small biases. On the other hand, the components RMSSD and Poincaré SD1 exhibit a systematic overestimation in the range of 10 % with an interquartile range of approximately \(\pm 10\,\%\). This tendency is largely a result of arbitrary/sporadic large beat-to-beat interval estimation errors, which tend to affect parameters containing differentiations in a more severe way. Note that the same systematic overestimation can also be seen in the frequency-domain parameters associated with high-frequency components.

In the frequency domain, both absolute as well as relative power are estimated with large biases and spreads. If comparison is performed in the logarithmic domain, however, biases/spreads are small for VLF, LF, and HF components (\(-4\pm 5\,\%\), \(0\pm 3\,\%\), \(3\pm 7\,\%\)). As a general tendency, we observe that high-frequency associated parameters HF Abs. and HF Log. as well as PC \(\hbox {SD}_1\) and RMSSD are systematically overestimated. While these biases may be small in absolute numbers (\(4.86\,\hbox {ms}^2\) and 0.27 log, 1.71 ms and 2.42 ms, respectively), they do lead to a comparatively large relative bias in the range of 10 % for our patient group. Consequently, parameters that analyze the ratio of LF to HF components (LF/HF, PC \(\hbox {SD}_2\)/\(\hbox {SD}_1\)) show severe systematic underestimation.

Note that the per-subject analysis (Fig. 6) calculates the relative error as “average of the error of subject n divided by average of the ground truth of subject n”. The aggregated gross analysis, on the other hand, calculates “average of the error of all windows of all subjects divided by average of the ground truth of all windows of all subjects” (Table 2). Interesting observations can be made when comparing the two. First, strong inter-individual differences can be observed in Fig. 6, and the optical measurement clearly seems to be more suitable for certain individuals than for others. For the “best” 25 % of the subjects, the relative MAE of most of the HRV parameters is less than 10 %. Second, as the relative per-subject errors far exceed those of the gross analysis in Table 2, we can assume that estimation errors in subjects with very low HRV parameter values have a strong negative impact on the relative accuracy seen in Fig. 6.

These assumptions can be supported further by Fig. 8. Here, each ‘x’ marks the mean over all windows where an estimation from PPG was available (‘Estimation’, right) and the mean over all corresponding reference windows obtained from ECG (‘Ground Truth’, left).

The graph shows data from 28 patients, as the coverage for one patient was zero (see also Fig. 6, rightmost column). Indeed, several patients exhibit a pNN50 of (close to) zero. Nevertheless, five patients show pNN50 values well above 10 % that can clearly be distinguished also in the PPG-based estimation. The same tendency holds for SDNN and LF Log. and, in part, for RMSSD and HF Log., although for these parameters the aforementioned over-estimation of small values becomes obvious. Finally, neither LF/HF nor PC \(\hbox {SD}_2\)/\(\hbox {SD}_1\) can be estimated with confidence due to strong and, more importantly, varying under-estimations, as indicated by several crossing lines.

As can be seen in our evaluation of the HRV parameter estimation accuracy, of the aforementioned parameters, SDNN was estimated on average with 9 % and the triangular index with 12 % relative MAE from the wrist device PPG signal. The relative MAE of both the DFA \(\alpha _1\) and RMSSD parameters was approximately 17 %, and that of the absolute power of the LF and HF components was between 16 % and 19 %. The LF/HF ratio performed the worst with 36 % relative MAE. The relative MAE of the ApEn parameter was only approximately 5 %, which is partially explained by the distribution of the parameter values, i.e. small deviation compared with the average. Further, for parameters such as the triangular index and approximate entropy that show low biases, increasing the length of the estimation window could improve the accuracy.

Previous studies have proposed that the changes in HRV parameters caused by postoperative complications and deterioration of the patient status can be seen on post-operative day 1, for example, in SDNN and triangular index¹⁴ as well as in DFA scaling exponent \(\alpha _1\)²¹. In²¹, Laitio et al. reported mean ± standard deviation of DFA \(\alpha _1\) of \(0.85\pm 0.17\) vs. \(0.69\pm 0.18\) for two groups of patients, “ICU Stay \(\le 48\,\hbox {h}\)” and “ICU Stay \(>48\,\hbox {h}\)”, respectively. Based on the results obtained in the present study, it is uncertain whether or not these two groups could be separated using OHR: For DFA \(\alpha _1\), we obtained an MAE of 0.150 and an RMSE of 0.204 across all data, which lies in the range of the differences between the two groups. On the other hand, Ushiyama et al. found mean ± standard deviation of SDNN for the “complicated” group being \(48.7\pm 24.4\,\hbox {ms}\), whereas \(71.2\pm 19.6\,\hbox {ms}\) was found for the “uncomplicated” group of digestive surgery patients, see¹⁴. Based on the observed accuracy for SDNN (1.71 ms MAE, 2.86 ms RMSE), we would argue that these groups could clearly be separated with the OHR technology. The same argument can be made for the Triangular Index (\(13.3\pm 6.7\) “complicated” group, \(19.9 \pm 6.5\) “uncomplicated” group¹⁴), for which we found an MAE of 0.659 and an RMSE of 0.866.

In¹⁹, the development of septic shock was most clearly predicted by RMSSD but also by absolute and normalized LF power as well as HF power and LF/HF-ratio. However, median values of (as well as differences between) both groups were extremely low: For example, comparing “No septic shock” with “Septic shock”, the median values of RMSSD for both groups were 3.8 ms and 7.3 ms, respectively¹⁹. It thus remains questionable if the accuracy obtained in the present study with OHR (3.55 ms MAE and 4.55 ms RMSE for RMSSD) would suffice for this application scenario. Approximate entropy has also been found to predict the onset of atrial fibrillation (AFib) after coronary artery bypass grafting³². The “control patients” exhibited mean ± standard deviation values of \(1.04\pm 0.05\), whereas a decreased ApEn of \(0.93\pm 0.05\) during 1 h preceding the AF onset was measured in³². Our observed error levels for ApEn of 0.0496 MAE/0.07354 RMSE indicate that ApEn measured with OHR could potentially be directly used for AFib prediction.

Although the presented study is, to the best of our knowledge, the most comprehensive one in terms of assessing the accuracy of HRV estimations obtained via OHR in hospitalized patients, all-encompassing statements about the most reliable parameters may not be possible. Nevertheless, we believe our findings in terms of parameter recommendations can be summarized as follows:

The time-domain parameter SDNN can be estimated with a relative error/absolute error/relative bias of 9%/2 ms/2% and we thus recommend its use. We also recommend the Triangular Index with 12%/0.66/1%. The parameters RMSSD and pNN50 exhibit large relative errors in this cohort (17% and even 28%) and systematic biases in the range of 12%, but also low absolute estimation errors (4 ms and 2%). The visualization in Fig. 8 further suggests that patients with low and high RMSSD/pNN50 might very well be separated using OHR technology. Thus, we believe these parameters should be investigated further.
In the frequency domain, the relative errors for LF Abs. and HF Abs. were 16% and 19%, while the corresponding values for LF Log. and HF Log. were only 5% and 9%. We thus recommend the use of the logarithmic quantities, although one needs to analyze whether they possess the same discriminative power as the absolute ones. Again, Fig. 8 suggests that separation of patients would be possible.
In their current implementation, the relative quantities (LF/HF, PC \(\hbox {SD}_2\)/\(\hbox {SD}_1\)) show strong relative errors (36% and 19%) and biases (− 30% and − 18%) and cannot be recommended without improving their estimation.

Finally, the patient cohort consisted predominantly of white caucasian subjects. Thus, the generalisability of results has to be validated with a more diverse group. Although we do not expect a fundamentally different outcome in terms of parameter feasibility, overall estimation errors may be higher in subjects with darker skin tones.

Conclusion

In conclusion, the accuracy of HRV parameters estimated from the PPG signal of a wrist-worn OHR monitoring device varies significantly between parameters and subjects. The accuracy of certain parameters, for example SDNN and triangular index, shown in the literature to be associated with the deterioration of the status of the patients during recovery from surgical intervention, could be adequate for patient monitoring. On the other hand, the parameters more affected by the high-frequency content of the HRV and especially the LF/HF-ratio should be used with caution. It should also be emphasized that the proposed data analysis method tries to discard those segments of the PPG signal that are likely to produce less reliable beat-to-beat interval data, resulting in HRV parameters being obtained only for approximately 30% of the time. This may limit the usability of the approach in some applications. To further improve the applicability of wrist-worn OHR monitoring in patient surveillance through HRV, more robust methods for beat-to-beat interval estimation and especially methods for mitigating the effect of estimation uncertainty on the HRV parameter values, e.g. robust spectral estimation techniques, should be investigated.


Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Biomedical Engineering, KIS*MED, TU Darmstadt, Darmstadt, Germany
Christoph Hoog Antink
Medical Information Technology, RWTH Aachen University, Aachen, Germany
Christoph Hoog Antink & Steffen Leonhardt
Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
Yen Mai, Mikko Peltokangas, Niku Oksala & Antti Vehkaoja
Finnish Cardiovascular Research Center, Tampere, Finland
Yen Mai, Mikko Peltokangas, Niku Oksala & Antti Vehkaoja
Center for Vascular Surgery and Interventional Radiology, Tampere University Hospital, Tampere, Finland
Niku Oksala
PulseOn Oy, Espoo, Finland
Antti Vehkaoja

Contributions

C.H.A., A.V., and M.P. wrote the main manuscript text. C.H.A. performed the majority of the analysis. Y.M. performed the measurements. A.V. and N.O. supervised the project. S.L. provided analysis tools and additional feedback. All authors reviewed the manuscript.

Corresponding author

Correspondence to Christoph Hoog Antink.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Hoog Antink, C., Mai, Y., Peltokangas, M. et al. Accuracy of heart rate variability estimated with reflective wrist-PPG in elderly vascular patients. Sci Rep 11, 8123 (2021). https://doi.org/10.1038/s41598-021-87489-0


Received: 05 November 2020
Accepted: 30 March 2021
Published: 14 April 2021
DOI: https://doi.org/10.1038/s41598-021-87489-0
Springer Nature Limited
