Background

Healthcare providers worldwide are constantly challenged to achieve the ‘quadruple aim’ in care delivery: Improving population health, enhancing patient experience, reducing healthcare worker burnout and dissatisfaction, and reducing costs [1, 2]. There has been great progress in healthcare artificial intelligence (AI) and machine learning (ML) in the recent years [3,4,5,6], which hold promise.

Healthcare AI technologies has seen tremendous adoption especially in the medical imaging field [7,8,9]. AI’s performance is shown to match or exceed human experts in studies involving computed tomography [10], chest X-ray classification [11], ophthalmology [12, 13], oncology [14, 15] and dermatology [16]. Such advances demonstrate how AI can refine diagnostic accuracy and improve patient outcomes, while using healthcare resources more efficiency. However, there are still challenges and limitations in the data quality, ethical issues, regulatory frameworks, and user acceptance of AI technologies [17].

In our main study, a novice layperson with no prior sonography knowledge was trained and equipped with a point-of-care(POC) ultrasound device [18], which had real-time AI image guidance on probe repositioning to capture better image quality. We also used a United States Food and Drug Agency-approved deep learning-based software [19], which automatically analyses the acquired images and provides cardiac measurements such as the left ventricular ejection fraction (LVEF). We named this workflow the AI-novice combination. Our study showed that the AI-novice combination, compared with trained sonographer-performed and cardiologist-interpreted echocardiogram studies, was able to achieve an area under the curve (AUC) of 0.880 (95%CI 0.802, 0.958) in detecting patients with left ventricular dysfunction. Furthermore, the AI-novice combination had a median absolute deviation from the reference standard LVEF at 6.03%, which is comparable to human inter-reader variability [20].

Apart from the AI-novice combination’s accuracy, we wanted to study the patients’ attitudes and hence usage intention, given that AI taskshifting is a new model of care. There are several theories which look at technology usage intention behaviours. For our study, we chose the unified theory of acceptance and use of technology (UTAUT) and its extended version (UTAUT2) because it was the most comprehensive [21, 22]. Among the UTAUT2 framework’s constructs, we examined the relationships between performance expectancy, effort expectancy, social influence, facilitating conditions, hedonic motivation and behavioral intention. We did not include questions on price and trust in the framework as the cost and reliability of the new AI-novice combination had not been established at the point of study. Also, the questions on habit pertain more towards voluntary consumerism, hence it was not applicable to a physician-ordered investigation. Additionally, the UTAUT2 has been one of the more important and versatile theoretical models used to predict consumer technology use and its effectiveness has been proven in different cultural backgrounds [23].

Prior studies largely looked at healthcare workers’ adoption intention of AI-assisted diagnosis and treatment [24]. In contrast, our study looks at the patients’ perspective, using the UTAUT2 framework. This has not been studied comprehensively, to our knowledge. Since patients are key stakeholders in new care pathways, this study will contribute to better understanding of factors influencing patient usage intention, therefore reduce barriers and facilitate implementation of AI-enabled taskshifting.

Study hypothesis and theoretical foundation

Hypothesis 1

Performance expectancy influences the intention to accept task shifting of specialized scans to AI-novice combination.

Performance expectancy describes the perception that using the technology will benefit the patient and is therefore tied to the perception of usefulness [21]. In most studies, performance expectancy significantly influenced behavioural intention and was also often the strongest predictor [25].

Hypothesis 2

Effort expectancy influences the intention to accept task shifting of specialized scans to AI-novice combination.

Effort expectancy describes the expected ease of using the technology [21].

Hypothesis 3

Social influence affects the intention to accept task shifting of specialized scans to AI-novice combination.

Social influence is “the degree to which an individual perceives that important others believe he or she should use the new system” [21].

Hypothesis 4

Facilitating conditions influence the intention to accept task shifting of specialized scans to AI-novice combination.

Facilitating conditions refer to the perceptions “of the resources and support available to perform a behavior” [21].

Hypothesis 5

Hedonic motivation influences the intention to accept task shifting of specialized scans to AI-novice combination.

Hedonic motivation refers to gratification (fun, pleasure, and enjoyment) associated with using technology [21]. It is considered as the most important theoretical addition to the UTAUT2 as it shifted the focus from extrinsic motivation of an organization to intrinsic motivation of consumers toward usage of technologies [26].

Behavioural intention refers to the degree in which a person will perform a specified action, in this case accepting the task shifting of specialized scan to the AI-novice combination. It is the variable that responds to the above hypotheses.

Methods

Sample and data collection

In this cross-sectional study, patients aged ≥ 21 years with at least one heart failure symptom (pedal oedema, New York Heart Association (NYHA) II-III effort limitation, orthopnoea, breathlessness), were prospectively included. Consecutive patients presenting to the cardiac imaging laboratory at the National Heart Centre Singapore for physician-ordered clinical echocardiogram for investigation or follow-up for heart failure were approached. All patients underwent standard cart-based sonographer-performed, cardiologist-reported echocardiogram as per clinical practice. For the study, patients then underwent an additional experience of a novice-performed AI-supported POC echocardiogram on the same day for comparison. After the AI-novice combination echocardiogram was performed, a questionnaire was administered individually to each patient. The questionnaire comprised two portions; the first collected demographic information, while the second focused on the key constructs of performance and effort expectancy, facilitating conditions, social influence, hedonic motivation and behavioural intention. The second part of the questionnaire was adapted from the original version developed by Venkatesh et al. [21], see supplementary file 1.

Ethics approvals

Ethics approvals were obtained from the local institutional review committee for this study, and this study conformed to the ethical guidelines in the Declaration of Helsinki.

Measurement and data analysis

The items measuring performance expectancy, effort expectancy, social influence, facilitating conditions, hedonic motivation and behavioral intention were adapted from Venkatesh et al. [21] to suit our study. We modelled our questionnaire as closely as possible to the original questionnaire by Venkatesh et al. [21], excluding only questions that did not suit our context. These items were measured using 5-point Likert scales anchored by 1(strongly disagree) and 5(strongly agree). Table 1 displays the finalized items with the mean scores of each question.

Table 1 Extended unified theory of acceptance and use of technology survey questionnaire and mean scores

The analysis process was divided into two parts. As the questionnaire was adapted, the measurement quality was rigorously checked for reliability and validity. Only when the quality of the model was confirmed, the structural model was analyzed and interpreted.

Composite reliability (CR) and Cronbach’s α are used to evaluate internal consistency reliability. Both CR and Cronbach’s α values are recommended to be greater than 0.7 [27].

Convergent validity is supported if all the standardized item loadings are greater than 0.70, and if the average variance extracted (AVE) values for every construct exceed 0.5, the threshold value recommended by Fornell and Larcker [28]. Items which did not meet the criteria of > 0.70 were dropped from the final partial least squares structural equation modelling (PLS-SEM).

Discriminant validity is achieved if both AVE estimates of a given pair of constructs are greater than the square of the construct correlation [29].

PLS-SEM was used to explain causal relationships among constructs [30], as it could support both exploratory and confirmatory relationships [31,31,34,34]. This is suitable in this context as the questionnaire has been adapted from the original to suit our study. This regression analysis also allows for smaller sample sizes and permits estimation of models using ordinal scale data [35]. Additionally, prior studies of UTAUT2 with adapted questionnaires also used PLS-SEM as the preferred method [36,36,38]. PLS-SEM evaluations are comprised of an outer model (measurement model) and an inner model (structural model).

The structural model was validated using a nonparametric bootstrap procedure to test the statistical significance of path coefficients with 5,000 bootstrap samples [27].

All analyses were performed using Stata statistical software (version 14.0, Stata Corporation, College Station, Texas, USA). All statistical analyses were conducted at the significance level of 0.05, and all tests were two-tailed whenever appropriate.

Results

The sample characteristics are presented in Table 2. Of the respondents, 56% were male, the average age was 61.2 years, and 33% of respondents received tertiary or higher education.

Table 2 Baseline demographics of patients

Internal consistency reliability, convergent and discriminant reliability.

The internal consistency reliabilities for all constructs were greater than the threshold value (i.e. α ≥ 0.7; CR ≥ 0.7). However, the item loadings for Question 5 of Performance Expectancy and Question 4 of Effort Expectancy were < 0.70; hence these 2 items were excluded from the PLS-SEM. The other items had met criteria for being statistically significant (p < 0.001) and greater than 0.70 (see Table 3).

Table 3 Convergent validity and construct reliability

After excluding Question 5 of Performance Expectancy and Question 4 of Effort Expectancy, the AVE values for all constructs were greater than 0.5. Thus, convergent validity was supported. The square of the correlation coefficients was smaller than the corresponding square roots of the AVE estimates for all pairs of constructs. Therefore, discriminant validity was achieved (see Table 4).

Table 4 Correlation matrix and descriptive statistics of the constructs

The structural model

The resulting path coefficients are depicted in Fig. 1, after bootstrapping analysis.

Fig. 1
figure 1

Resulting path coefficients after bootstrapping analysis for constructs affecting behvioural intention to accept task shifting of specialized scans to AI-novice combination in the extended UTAUT2 model

The results revealed that the extended UTAUT2 model could explain 85% of the variance in behavioural intention. All the hypothesized variables showed significant relationship to behavioral intention. Facilitating conditions, hedonic motivation and performance expectancy showed the strongest relationship to behavioural intention.

Patient factors such as age (p = 0.181), education level(p = 0.218) and gender(p = 0.776) did not significantly affect behavioural intention.

Discussion

Task shifting of specialized studies is one key area where AI is helping to revolutionize healthcare. Examples include interpreting medical imaging, such as chest x-rays [39], and implementing nationwide screening programs, such as retinal imaging [40]. While the medical community focuses on the accuracy and theoretical foundations of the AI, studies have shown that despite comparable or even superior accuracy of AI, patients have found it challenging to accept AI in medical care [41, 42]. Hence in our study, apart from determining the accuracy of the AI-novice combination, we also sought to study the motivators and barriers from the patient’s perspective toward acceptance of such new models of care.

The UTAUT2 suggests that actual use of any technology is affected by one’s behavioural intention to use it. It proposes that four main constructs affect behavioural intention: usage, intention—performance expectancy, effort expectancy, social influence and facilitating conditions. It extends three more constructs (hedonic motivation, price value and habit) in its framework. The UTAUT2 provides the highest explanatory power of all standard acceptance models. Knowing these underlying important constructs will help organizations improve their technology implementation processes [21, 22].

Our study showed that all four main constructs showed a significant relationship with behavioural intention, influencing 85.8% of variance to behavioural intention. The top 3 constructs influencing behavioural intention in our study were facilitating conditions, hedonic motivation, and performance expectancy. While medical performance expectancy was the most important construct in many other studies that focused on healthcare professionals [25], facilitating conditions may exert stronger influence from the patients’ perspective [43, 44]. This does not mean that performance expectancy, which is the perception that using the technology will benefit the patient, is less important; rather, in a complex network like the healthcare system, facilitating conditions play an important and significant role in access. Hedonic motivation, which is the fun and pleasure derived from using the technology, represents an affective component of the UTAUT2 [26] and looks at the user’s intrinsic motivation [45, 46] rather than the extrinsic motivation from the four main constructs. Our study shows that hedonic motivation is a significant factor in behavioural intention. Therefore, making the process fun and pleasurable is important towards patient adoption [47]. Facilitating conditions and hedonic motivation had a strong correlation (r = 0.992, p-value < 0.001) in our study. This intuitively suggests that having the resources to improve access itself also improves overall patient experience and enjoyment, placing further importance on facilitating conditions in the successful implementation of this new pathway.

Age, gender and education levels were not significant modifiers of behavioural intention. This was congruent with prior studies on health apps, where age, gender and education levels were weakly correlated with behavioural intention [36].

Prior studies of UTAUT2 in healthcare largely looked at patients; acceptance toward digital health applications [36, 37, 48, 49] and have largely found that hedonic motivation, social influence and habit, rather than utilitarian factors predicted usage intention. As the nature of digital health applications are different from AI-assisted taskshifting and diagnosis, we felt that it was important to apply the UTAUT2 framework in this new model of care. In the area of AI-assisted diagnosis, from the healthcare workers’ perspective, Cheng et al. [50] demonstrated that performance and effort expectancy directly influenced usage intention. Our study showed that patients have a different considerations, suggesting that it is important to look at factors affecting different stakeholders to ensure successful implementation.

The sustainability of new pathways and health programs requires understanding the implementation process and factors that impact adoption. The UTAUT2 is one of the most comprehensive theoretical frameworks developed to investigate factors related to future usage intention [47]. When detailing the implementation process, understanding these factors will allow changemakers to focus more on certain aspects, such as facilitating conditions, hedonic motivation and performance expectancy. Our study is the first to look at patients’ attitudes toward task shifting using AI in healthcare, using the UTAUT2 theory. This provides important information for the implementation of such pathways in future.

Limitations

Our study has several limitations. First, our study is based in Singapore, which has high levels of access to digital technologies. Based on the Singapore Digital Society Report 2023 [51], 97% of Singaporeans own a smartphone. The study also found that 65% of Singaporeans are keen to try out new digital technologies. This limits the generalizability of our study’s findings to settings that are less advantaged technologically, but it may still be used to make future predictions in these settings. Furthermore, we did not control for patients’ digital literacy, technology acceptance and attitudes towards AI in general in our study, which may be potential confounders for our study’s findings. In our study, we had chosen to omit items with item loadings < 0.70 for convergent validity, after ensuring that each construct had at least two questions for measurement. We recognize that by omitting these items, especially borderline cases, can result in missing out important information that can be measured by the question. Also, our study’s design offered patients the experience of both the standard cardiac sonography and AI-novice workflow for comparison on the same day. While the study team attempted to distinguish the two experiences by explanation and conducting the AI-novice workflow only after the standard echocardiogram as per protocol, we acknowledge that patients may not be fully able to distinguish their experiences and perceptions for each method clearly and this may limit the accuracy of our study’s findings.

Conclusion

This study shows that all constructs proposed by the UTAUT2 model are essential for patients’ successful acceptance of AI-enabled novice-performed echocardiogram. These results are important for stakeholders and changemakers such as policymakers, governments, physicians, and insurance companies, as they design adoption strategies to ensure successful patient engagement by focusing on factors affecting the facilitating conditions, hedonic motivation and performance expectancy for AI technologies used in healthcare task shifting. Future studies can look into the practical design of adoption strategies based on these key factors and to evaluate their impact on patients’ acceptance of similar AI technologies.