1 Introduction

The increasing number of people suffering from hypertension constitutes a social problem. Hypertension is a major risk factor for cardiovascular diseases and is associated with unhealthy lifestyle behaviors [1]. Daily blood pressure monitoring is important for prevention or early detection of these diseases. The conventional blood pressure measurement method applied pressure using a cuff attached to the finger or upper arm. This method has several disadvantages, such as personal discomfort and inability to continuously monitor blood pressure variations. A non-contact blood pressure measurement technique can reduce these problems and facilitate daily blood pressure monitoring [2].

Visible and thermal images are physiological indices that reflected skin hemodynamic and non-contact measurable using a web camera or an infrared thermography device. Pulse waves can be measured from visible images based on the relationship between hue and blood flow fluctuation [3]. Skin temperature obtained from thermal images is related to skin blood flow, which is controlled by the sympathetic nervous system. The skin temperature variations depend on the heat conduction from the skin blood [4]. In particular, facial skin temperature shows a capacitive variation that reflects a dermovascular capacitance and the heat capacity of skin tissue [5]. This capacitive variation is spatially dependent on differences in blood vessel distribution and skin tissue structure in the facial skin. The techniques of non-contact blood pressure measurement have been developed using facial visible and thermal images.

In cardiovascular physiology, the Windkessel model is a hemodynamics model including blood pressure [6]. This model is an electric circuit model representing the cardiovascular system that consists of blood pressure as power, the peripheral vascular resistance as resistance, flexibility as capacitance, and blood flow as current. Blood pressure can be obtained from these parameters. Kato et al. estimated blood pressure based on the Windkessel model using electric current and charge from facial visible and thermal images, respectively [7]. This study was expanded to attempt a blood pressure estimation using only facial thermal images (FTIs).

In previous studies using FTIs, blood pressure was estimated when it was artificially elevated by acute stress response using deep learning algorithms [8] or independent component analysis (ICA) [9]. Blood pressure is generally measured in the resting state without inducing acute stress. However, to the best of our knowledge, a non-contact method for resting blood pressure estimation using FTIs has not been established.

Humans have a physiological response that varies over a 24-h cycle known as the circadian rhythm. Physiological indices vary throughout the day without acute stress because of the effects of the circadian rhythm [10, 11]. Therefore, the day-long variation in physiological indices includes short-term variations related to acute stress and long-term variations related to circadian rhythms. Applying ICA to FTIs can extract independent components related to specific psychophysiological responses [12]. Ito et al. separated the components included in FTIs using ICA and extracted the core temperature or heart rate variable components related to circadian rhythms and acute stress [13]. It is expected that the short-term blood pressure variation components related to acute stress response can be separated and extracted from FTIs using the same method. In addition, resting blood pressure can be estimated using the independent components of FTIs, excluding the short-term variable component related to acute stress. This study aimed at estimating resting blood pressure from facial thermal images. In this first trial, the short-term blood pressure variations related to acute stress on FTIs were extracted and excluded using the method employed by Ito et al. FTIs and blood pressure were measured repeatedly throughout the day to evaluate variations related to circadian rhythm in each physiological index. For each experiment, a subject was provided with a task to induce acute stress to facilitate the separation of short-term components, such as artifacts. The task given was breath holding, which induced elevated blood pressure [14]. ICA was applied to each FTI obtained to extract the independent components. Correlation analysis was performed to extract short-term components related to acute stress. Multiple regression analysis was conducted to construct a model of resting blood pressure estimation using independent components excluding short-term components.

2 Independent component analysis

ICA is an algorithm that extracts a set of independent components from a set of random variables or signals [15, 16]. In its simplest form, m scalar random variables \(x_1, \ldots x_m\) were observed. It is assumed that linear combinations of n independent components exist, denoted by \(s_1, \ldots s_n\). In this study, the observed variables \(x_j\) and component variables \(s_i\) were arranged as vectors \(\mathbf{{ X}}=\mathbf{x}_{j}(t)^{T}(j=1,2,\ldots ,m)\); \(\mathbf{x}_j(t)\) have zero means, and \(\mathbf{{ S}}=\mathbf{s}_{i}(t)^{T}(i=1,2,\ldots ,n)\). The linear relation is expressed as:

$$\begin{aligned} {\mathbf{X}} = \mathbf{{ AS}} = \left[ \varvec{a}(1), \varvec{a}(2) \ldots , \varvec{a}(t) \right] ^{\mathrm{{T}}} \left[ \varvec{s}_1, \varvec{s}_2, \ldots , \varvec{s}_{n} \right] \end{aligned}$$
(1)

where \(\mathbf{A}\), called the weighting matrix, is an unknown \(t \times n\) matrix of the full column rank. \(\mathbf{X}\) denotes the observed signal. The independent components are assumed to be mutually statistically independent and have zero means. ICA was used to extract the weighting matrix, \(\mathbf{A}\) and independent components, \(\mathbf{s}_{i}(m)\) from the observed signal alone.

3 Experiment

3.1 Experimental system

Figure 1 shows the experimental system. The physiological indices were FTI and mean arterial pressure (MAP). The experimental system comprised an infrared thermography device (A35 Series; FLIR Co.) and a non-invasive continuous blood pressure monitor (CNAP Monitor HD, CNSystems Co.). The infrared thermography device was set up approximately 1.0 m in front of the subject. The thermal images were captured at a sampling frequency of 1 Hz. The size of the thermal image was \(320 \times 256\) pixels. The temperature resolution was less than \(0.05 \,^\circ \hbox {C}\), and the infrared emissivity of the skin was set to \(\epsilon =0.98\). MAP was measured by attaching a blood pressure cuff to the left finger and upper arm with a non-invasive continuous blood pressure monitor at a 10 Hz sampling frequency.

Fig. 1
figure 1

Experimental system

3.2 Procedure and condition

The subject of this study was a healthy adult man aged 22 years. The subject was fully informed of the experimental procedure and the purpose of the study prior to participation. The study obtained official approval from the Life Science Committee of the College of Science and Engineering, Aoyama Gakuin University (Approval number:H17-M13-3). The temperature in the experimental room was set to \(21.0 \pm 0.6^ \circ \hbox {C}\). Figure 2 shows the experimental protocol. The experiments were conducted every hour from 8:00 a.m. to 11:00 p.m. daily. The same experimental protocol was conducted for 3 days to confirm repeatability. To acclimatize to the temperature of the experiment room, the subject entered 15 min before the experiment began. To control the physiological responses to eating, the subject was only allowed three meals: breakfast at 8:00 a.m., lunch at 1:00 p.m., and dinner at 7:00 p.m. sleep time was required to be at least six hours. These restrictions were placed on eating and sleeping times to prevent changes in the circadian rhythm of physiological functions. The experiment consisted of a resting state segment (Rest) and an induced acute-stress physiological response segment (Task). In the Rest segment, subject was asked to rest with eyes closed for 120 s. In the Task segment, subject was asked to hold breath with eyes closed for 60 s to induce acute stress. FTIs and MAP were measured continuously during the Rest and Task, and the last 10 s of each segment were used for analysis.

Fig. 2
figure 2

Experimental protocol

4 Indices for analysis

The analysis method comprised five procedures: (1) statistical evaluation, (2) extraction of FTIs, (3) extraction of independent components, (4) correlation analysis, and (5) multiple regression analysis. The details of each procedure are described in this section.

4.1 Statistical evaluation

Statistical evaluation was conducted to evaluate the physiological states. A two-factor analysis of variance (ANOVA) was used to measure MAP variations to assess the differences statistically between the two conditions, Rest and Task, and time lapse. A mean value of MAP for 10 s obtained from each experiment was used for the analysis.

4.2 Extraction of facial thermal images (FTIs)

FTIs were extracted from thermal images obtained in the experiment. A face detection algorithm based on a single shot multibox detector (SSD) and active appearance models (AAM) was used for FTIs extraction from thermal images [17, 18]. An \(82 \times 65\) pixel FTI was cropped from each thermal image based on the face detection algorithm, and the background or subject’s hair was removed. Figure 3 shows an example of FTI. Each FTI was normalized to subtract blackbody temperature and mosaicked to remove temperature fluctuations caused by imperceptible facial movements.

Fig. 3
figure 3

Example of a FTI

4.3 Extraction of independent components

The observed signal was created from the extracted FTIs, and independent components were extracted based on ICA. The mean FTI was acquired from the FTIs measured for 10 s at time t and expanded to a one-dimensional vector, represented as:

$$\begin{aligned} \varvec{x}(t) = \left[ x_1(t), x_2(t), \ldots ,x_{a\times b}(t)\right] \end{aligned}$$
(2)

where \(x_{a\times b}(t)\) represents the pixel \(a\times b\) value of the mean FTI at time t. \(\mathbf{{x}}(t)\) are normalized to zero means. The mean FTI vectors for t hours, which were extracted using a similar technique, were stored in a matrix:

$$\begin{aligned} {\mathbf{X}}_{\mathrm{{FTI}}} = \left[ \varvec{x}(1), \varvec{x}(2), \ldots ,\varvec{x}(t)\right] ^{\mathrm{{T}}} = \left[ \begin{array}{ccc} x_{1,1} &{} \ldots &{} x_{1,a\times b} \\ \vdots &{} \ddots &{} \vdots \\ x_{t,1} &{} \ldots &{} x_{t,a\times b} \end{array} \right] \end{aligned}$$
(3)

ICA applied to \(\mathbf{X}_{\mathrm{{FTI}}}\). The weighting matrix \(\mathbf{A}\) and independent components \(\mathbf{S}\) were obtained using Eq. (1) as follows:

$$\begin{aligned} {\mathbf{A}}= & {} \left[ \varvec{a}(1), \varvec{a}(2), \ldots , \varvec{a}(t) \right] ^{\mathrm{{T}}} = \left[ \begin{array}{ccc} a_{1,1} &{} \ldots &{} a_{1,n} \\ \vdots &{} \ddots &{} \vdots \\ a_{t,1} &{} \ldots &{} a_{t,n} \end{array} \right] \end{aligned}$$
(4)
$$\begin{aligned} {\mathbf{S}}= & {} \left[ \varvec{s}_1, \varvec{s}_2, \ldots , \varvec{s}_{n} \right] ^{\mathrm{{T}}} = \left[ \begin{array}{ccc} s_{1,1} &{} \ldots &{} s_{1,a\times b} \\ \vdots &{} \ddots &{} \vdots \\ s_{n,1} &{} \ldots &{} s_{n, a\times b} \end{array} \right] \end{aligned}$$
(5)

The independent components \(\mathbf{S}\) were represented the feature map of the face. The weighting variations \(\mathbf{A}\) indicated time series variation. The fastICA algorithm [19,20,21], which outperforms the majority of commonly used ICA algorithms in terms of convergence speed, was used. The number of independent components was determined to be 13 based on negentropy [22, 23]. The observed signal for applying ICA was all FTIs for 3 days. At time t, the FTIs for Rest and Task were included. This observed signal was defined as \(\mathbf{X}_{\mathrm{{ALL}}}\). The extracted independent components and weighting variations from \(\mathbf{X}_{\mathrm{{ALL}}}\) were defined as \(\mathbf{S}_{\mathrm{{ALL}}}\) and \(\mathbf{A}_{\mathrm{{ALL}}}\), respectively.

4.4 Correlation analysis

It is expected that \(\mathbf{S}_{\mathrm{{ALL}}}\) includes long-term variable components on Rest or Task and short-term variable components due to acute stress. Correlation analysis was performed to separate short-term variable components in \(\mathbf{S}_{\mathrm{{ALL}}}\). As shown in Table 1, FTIs were considered as observed signals, and the extracted independent components and weighting variations from each FTI were defined for correlation analysis. In Table 1, \(\mathbf{X}_{\mathrm{{Rest}}}\) and \(\mathbf{X}_{\mathrm{{Task}}}\) were observed signals created from FTIs during the Rest or Task, respectively. \(\mathbf{S}_{\mathrm{{Rest}}}\), \(\mathbf{S}_{\mathrm{{Task}}}\), \(\mathbf{A}_{\mathrm{{Rest}}}\), and \(\mathbf{A}_{\mathrm{{Task}}}\) are the independent components and weighting variations created from each observed signal. The number of independent components of \(\mathbf{S}_{\mathrm{{Rest}}}\) and \(\mathbf{S}_{\mathrm{{Task}}}\) was set to 13, similar to \(\mathbf{S}_{\mathrm{{ALL}}}\). The correlation coefficients between \(s_{n}(n=1,2,\ldots ,13)\) in \(\mathbf{S}_{\mathrm{{ALL}}}\) and 13 independent components included in \(\mathbf{S}_{\mathrm{{Rest}}}\) or \(\mathbf{S}_{\mathrm{{Task}}}\) in Table 1 were calculated. If the maximum absolute value of the 13 correlation coefficients obtained was over 0.8, \(\varvec{s}_{n}\) is related to long-term variation of the Rest or Task. By contrast, if \(\varvec{s}_{n}\) in \(\mathbf{S}_{\mathrm{{ALL}}}\) was not correlated with either \(\mathbf{S}_{\mathrm{{Rest}}}\) or \(\mathbf{S}_{\mathrm{{Task}}}\), then \(\varvec{s}_{n}\) was related to the short-term variation.

Table 1 Defined observed signals, independent components, and weighting variations

4.5 Multiple regression analysis

Multiple regression analysis was performed to construct the blood pressure estimation model from independent components, excluding the short-term component. Multiple regression analysis is represented as:

$$\begin{aligned} y = \sum _{k=1} {\beta _k x_k} + {\text {Cnst}}. \end{aligned}$$
(6)

where \(\beta _k\) and Cnst. represent the partial regression coefficients and the constant term, respectively. Only MAP at rest was used for the blood pressure estimation model because this study aimed to estimate resting blood pressure. The dependent variable, y, was the mean MAP over 10 s obtained in each experiment at Rest for 3 days. The explanatory variables, \(x_{k}\), were weighting variations \(\mathbf{A}_{\mathrm{{ALL}}}\) excluding the independent components related to short-term variation identified in the correlation analysis. The explanatory variables were optimized using a stepwise procedure based on Akaike’s information criterion (AIC) [24].

5 Results and discussion

5.1 Independent component analysis to FTIs

Figure 4 shows the measured MAP variations in a subject over 3 days. The vertical axis, horizontal axis, and error bars indicate the mean value of MAP, time for 3 days, and standard error, respectively. The solid and dashed lines indicate the MAP in the Rest and Task segments, respectively. The ANOVA shows a significant main effect for conditions (\(p<0.05\)) and time lapse (\(p<0.001\)), and there was no interaction. The significant main effect for conditions was caused by blood pressure variation of the cardiac-dominant pattern induced by the task. On the other hand, it has been reported that blood pressure fluctuates based on circadian rhythms [25]. The significant main effect of time lapse is believed be caused by the circadian rhythm.

Fig. 4
figure 4

Measured MAP variations. Error bars indicate the standard error

Fig. 5
figure 5

Images of feature quantities of independent components \(\mathbf{S}_{\mathrm{{ALL}}}\) and corresponding weighting variations \(\mathbf{A}_{\mathrm{{ALL}}}\)

Table 2 Correlation between \(\mathbf{S}_{\mathrm{{ALL}}}\) and other independent components

Figure 5 shows the result of \(\mathbf{S}_{\mathrm{{ALL}}}\) and the corresponding \(\mathbf{A}_{\mathrm{{ALL}}}\). In \(\mathbf{S}_{\mathrm{{ALL}}}\), red and blue colors signify strong features, and white colors signify weak features. In \(\mathbf{A}_{\mathrm{{ALL}}}\), the vertical and horizontal axes indicate the weighting values and the time for 3 days, respectively. Each day is divided by vertical lines.

The results of correlation analysis are shown in Table 2. If the maximum absolute value of thirteen correlation coefficients (|R|) in is more than 0.8, each cell in Table 2 is denoted by “\(\bigcirc \)” and the others by “-”. The \(\varvec{s}_{12}\) and \(\varvec{s}_{13}\) in \(\mathbf{S}_{\mathrm{{ALL}}}\) were related to \(\mathbf{S}_{\mathrm{{Task1-3}}}\) and \(\mathbf{S}_{\mathrm{{Rest1-3}}}\), respectively, as shown in Table 2. These components are expected to be long-term components related to circadian rhythms and represent rhythms that are reproducible for three days. The \(\varvec{s}_{3}\) in \(\mathbf{S}_{\mathrm{{ALL}}}\) was not correlated with the other observed signals, as shown in Table 2. The corresponding weighting variation \(\varvec{a}_{3}\) qualitatively shows a time series of cyclic variation compared to other weighting variations, as shown in Fig. 5. ICA was applied to the FTIs dataset with Rest and Task arranged alternately because it assumed that the blood pressure variations were represented on the FTIs. It is considered that \(\varvec{a}_{3}\) was affected by blood pressure variations by acute stress. Therefore, \(\varvec{s}_{3}\) corresponding to \(\varvec{a}_{3}\) is expected to be a short-term blood pressure variable component related to acute stress. It could be shown that a short-term blood pressure variable component related to acute stress could be separated and extracted from FTIs.

Table 3 Results of multiple regression analysis
Fig. 6
figure 6

Time variation of measured and estimated MAP at Rest

5.2 Construction of model for resting blood pressure estimation

The explanatory variables were \(\mathbf{A}_{\mathrm{{ALL}}}\), excluding \(\varvec{a}_{3}\), which is the short-term variation component related to acute stress in the multiple regression analysis. Table 3 shows the results of multiple regression analysis. The shaded cells indicate excluded independent components, and the hyphen indicates the variables not selected as explanatory variables. Figure 6 shows the time variation of measured and estimated MAP. The vertical axis indicates the value of MAP, and the horizontal axis indicates time. The solid and dashed lines indicate the measured and estimated MAP, respectively. The coefficient of determination was 0.253, and the root mean square error was 9.90 mmHg. \(\varvec{a}_{4}\), \(\varvec{a}_{7}\), \(\varvec{a}_{11}\), and \(\varvec{a}_{13}\) were selected as explanatory variables for the resting blood pressure estimation model. Therefore, \(\varvec{s}_{4}\), \(\varvec{s}_{7}\), \(\varvec{s}_{11}\), and \(\varvec{s}_{13}\) were independent components related to the resting blood pressure variation. Additionally, \(\varvec{s}_{13}\) corresponding to \(\varvec{a}_{13}\) was a long-term component related to circadian rhythm, as mentioned in Sect. 5.1. \(\varvec{s}_{13}\) was expected to represent the blood pressure variable components related to the main circadian rhythm. The coloration in the nose in \(\varvec{s}_{4}\), lip in \(\varvec{s}_{7}\) and \(\varvec{s}_{13}\), and cheek in \(\varvec{s}_{11}\) regions were intensified as shown in Fig. 5, and these regions were suggested to be related to resting blood pressure. In a previous study, the nose region was extracted as feature quantities related to short-term blood pressure variations based on acute stress [26]. The nasal and lip region is well known as the peripheral region reflecting the blood flow variation [27]. Therefore, it is considered that regions reflecting blood flow fluctuations are more likely to be used to estimate resting blood pressure.

6 Conclusion

The objective of this study was to estimate resting blood pressure by separating acute stress blood pressure variations using FTIs. ICA was applied to the FTIs to extract features related to acute stress blood pressure variations. From the extracted independent components, a correlation analysis was used to separate and extract the short-term variable components related to acute stress. Multiple regression analysis was performed to estimate resting blood pressure using independent components, excluding short-term variable components. As a result, resting blood pressure was estimated from FTIs with an accuracy of 9.90 mmHg root mean square error between measured and estimated MAP. The features related to resting blood pressure variation were represented in the nose, lip, and cheek regions. The contribution of this study is the separation and extraction of short-term components related to acute stress blood pressure variations, and resting blood pressure could be estimated with an accuracy of 9.90 mmHg.

This study has a limitation owing to its lack of generality, because only one subject was involved. In future studies, it is necessary to test the generality of resting blood pressure estimation using FTIs by increasing the number of subjects and comparing the obtained results. Additionally, the components of FTIs need to be evaluated while considering interday differences in blood pressure, because a difference was observed in blood pressure variations on day 1, 2, and 3.