Background

Artificial intelligence (AI) is defined as the development of algorithms that give machines the ability to act with human-like rationality in complex tasks, such as problem-solving and decision-making, and is poised to reshape medicine and surgery broadly [1].

Diagnostic and judgment errors are the second leading cause of preventable harm [2]. Decision-making is one of the most challenging and crucial tasks performed by trauma and emergency surgeons, where time constraints and high cognitive loads from high volumes of information demand reliance on cognitive shortcuts, leading to mistakes and preventable patient harm [3].

In recent years, decision support systems based on AI algorithms are receiving considerable attention in the literature [4,5,6]. In contrast to human cognitive capacities, AI can be used to examine high-complexity and high-volume data for predictive analytics to augment the precision of complex decision-making processes. Nonetheless, deploying medical AI systems in routine trauma and emergency surgery care presents a critical yet largely unfulfilled opportunity as the medical AI community steers the complex ethical, technical, and human-centered challenges required for safe and adequate translation [7,8,9,10]. Experienced surgeons are appropriately cautious of enthusiastic, novel solutions that are unsubstantiated by academic rigor and evidence of performance advantages in clinical settings [8]. Even when AI accurately predicts clinical outcomes hours or days in advance, its application in trauma and emergency surgery will remain marginal until the deployment proves trust in the accuracy and the lack of bias of the model, addresses target risk-sensitive decisions, and integrates with clinical workflows [11, 12].

The recently published artificial intelligence in Emergency and Trauma Surgery (ARIES) survey, endorsed by the World Society of Emergency Surgery (WSES) evaluated the knowledge, attitude, and practices in the application of AI in the emergency setting among international acute care and emergency surgeons. The investigation revealed how the implementation of AI in the emergency and trauma setting is highly anticipated but still in an early phase, with trauma and emergency surgeons enthusiastic about being involved [13].

Starting from these premises and research gaps, this article aims to assess surgeons’ understanding and knowledge about the role of new technologies like AI in clinical decision-making by employing an international survey endorsed by the WSES.

Methods

Design and setting

The exploratory study of the international trauma and emergency surgeons’ community used a population-based online questionnaire to gather demographic, knowledge, and practice-based information regarding the adoption of new technological tools to support clinical decision-making. The online questionnaire was generated in English through Google Forms [14, 15], and followed the Checklist for Reporting Results of Internet E-Surveys (CHERRIES) [16].

A steering committee within the WSES was appointed, involving a multidisciplinary panel of academics and practitioners in the fields of trauma and emergency surgery, healthcare management and policies, innovation, business and medical ethics, information technology, law, and organization science. No Institutional Review Board (IRB) approval was required. Starting from a review of the literature, a research protocol was conceived and shared by the principal investigators (LC and FDM) with the steering committee. The protocol was peer-reviewed and published [17]. The leading references to create the protocol and the survey structure were gathered from Dal Mas et al. [18], Loftus et al. [19, 20], Cobianchi et al. [21], Venkatesh et al. [22], and Bashshur, Shannon and Sapci [23]. Before the initiative's official launch, the research protocol and the online survey were reviewed by the steering committee and filled in by a sample of surgeons to avoid mistakes.

The poll was made available at the end of November 2021 and was open until the middle of August 2022. All 917 WSES members received an e-mail invitation to participate in the survey. The initiative was also shared on the society's website and Twitter account. Additionally, the members of the Team Dynamics Study Group [14, 15] were also invited via e-mail. Four e-mail reminders were sent. WSES membership was not required to answer the survey. Still, it may be assumed that the vast majority of the participants come from the 917 WSES members to whom the research initiative was publicized, yielding an approximate 70% response rate.

The invitation e-mail included comprehensive details on the initiative's goals and rationale, the anticipated time frame (about 10 min), and the option to join the Team Dynamics Study Group to carry out additional research and disseminate the results. The identities of the participants were kept secret. Additionally, the identity of the investigators and the research protocol were kept private.

Survey

The first group of questions aimed at understanding the participants’ features. The questions’ structure and list were derived from the previous Team Dynamics investigation [14, 15]. Surgeons were asked to disclose their gender, years of experience in trauma/emergency surgery, type of institution (academic versus non-academic), the country in which they work, role, eventual participation within a trauma team (institutionalized or not, and of which type), the kind of trauma leader, the educational training attended, and the eventual presence of diverse team members.

The second group of questions investigated the decision aids as reported by the surgical literature [18,19,20], by assessing the surgeons’ perception of 11 items using a 5-point Likert scale.

The third group of questions concerned the perceived challenges that surgeons need to face when making clinical decisions, assessing 13 items gathered from Loftus and colleagues [20] using a 5-point Likert scale.

The fourth section of the questionnaire aimed at assessing the surgeons’ feelings about AI by asking them if they were familiar with the term AI, their understanding of it through an open question, and the perceived importance of AI in surgery at the present time and in a five-year horizon. One last question aimed to assess the perceived benefits [21, 22] of the application of AI in surgery.

The survey’s questions related to clinical decision-making through new technological aids are reported in Appendix 1.

Statistical analysis

The statistical analysis was conducted using the software R (RStudio 2022.07.0 + 548 "Spotted Wakerobin" Release) [24].

Manual coding was also employed concerning the qualitative questions. Surgeons were required to express their understanding of AI through an open question. Results were manually coded by two researchers (LC and FDM), who rated each statement as concordant, discordant, or inconclusive, following the analysis of Woltz et al. [25] and Cobianchi et al. [14].

Results

Participants

650 surgeons responded to the questionnaire. Located on five continents, participants hailed from 71 different nations. The sample, however, was not evenly dispersed, with the majority of surgeons coming from Europe (477, or 73%), particularly Italy (251, or 39%). 465 responders (72%) were from the 10 nations with the most participants overall.

118 female surgeons (18%), 531 male surgeons (82%), and one person who would rather remain anonymous made up the sample. Surgeons' years of experience in the field ranged from 1 to 35, with a mean of 12. The majority of participants (499, or 77% of the sample) were from academic institutions, and 540 of them declared to be part of a formally-established emergency surgery team (83%). Although there were a variety of roles indicated, senior consultants made up the bulk of surgeons (233, or 36%). Department heads made up 114 (18%) of the sample.

Table 1 presents descriptive statistics about the individuals and institutions that participated in the study, whereas Table 2 reports some information about the number of respondents according to their locations.

Table 1 Descriptive statistics about surgeons and institutions participating in the investigation
Table 2 Number of respondents according to their location

Clinical decision-making facilitators

Surgeons were given a list of 11 items referred to clinical decision-making facilitators to be rated on a 5-point Likert scale according to their perceived importance. Clinicians underlined the importance of training, clinical guidelines, and multidisciplinary committees and meetings, with a mean of over 4 out of 5. Interestingly, the most modern tools (namely, risk stratification by additive scores using static variable thresholds, machine learning and artificial intelligence, regression modeling and calculations—with 3.87, 3.56, and 3.49, respectively) received the lowest scores, with machine learning and artificial intelligence recording the highest standard deviation (1.07). The following Table 3 reports the results.

Table 3 Decision aids and facilitators

The responses given to the item “Machine Learning and Artificial Intelligence” were analyzed by dividing the sample by institution, position held, and country. Interestingly, the institution (academic or non-academic) and the role did not show any significant difference. Major differences emerged when analyzing countries, with some of them (like Argentina) showing a high mean (4.5) and a low standard deviation (0.548), and others (like Switzerland) a very low mean (2.6) and a high standard deviation (0.89). Such results are reported in the following Table 4.

Table 4 Item 10 “Machine Learning and Artificial Intelligence” responses per country

Moreover, the same responses were analyzed by considering the institution, position, and country simultaneously. Although numbers become smaller, some interesting findings emerge. For instance, all senior consultants belonging to Malaysian academic institutions firmly believed in AI, with all of them giving a rate of 5/5, compared to division chiefs working in the USA, who scored the item 2.5 with a standard deviation of 1.05. Despite being digital natives, residents do not appear in the top positions. Results are depicted in the following Table 5.

Table 5 Item 10 “Machine Learning and Artificial Intelligence” responses per institution, position, and country

Challenges in clinical decision-making

Surgeons were asked to rate 13 items regarding the challenges that could arise when engaging in clinical decision-making. The items were adapted from Loftus et al. [20].

Results underline a perceived misalignment between the actual clinical scenario and the independent assessment, the lack of complete data availability, the risk of expecting specific outcomes grounded on inaccurate data, the presence of bias derived from the most recent experiences, and the possibility of making mistakes all along the clinical journey. Interestingly, surgeons rated the potential support granted by new technologies like AI as the lowest item (with a mean of 3.10 and the highest standard deviation of 1.14).

Findings are reported in the following Table 6.

Table 6 Challenges in clinical decision-making

Knowledge and understanding of AI

Surgeons were asked if they were familiar with the term AI. 451 out of 650 participants (69% of the sample) declared they were familiar with the term, while 199 (31%) admitted they were not.

The following question required surgeons to write their understanding of AI applied to surgery. Each given statement was rated by the two principal investigators (LC and FDM) as concordant, discordant, or inconclusive.

To be rated as concordant, definitions needed to somehow stress the capability of the machine to mimic human intelligence or, at least, to express the aims and potential of AI applied to surgical practice or its technological functioning. Only 112 surgeons (17% of the sample) provided a statement that fitted the criterion. 178 participants (27% of the sample) gave responses that were incomplete, showing only a partial view of the phenomenon, being so rated as inconclusive. The remaining 360 surgeons (55% of the participants) gave answers that were not fitting the concept of AI, its aims, and its potential. Most of these participants declared they had no idea about what AI could entail in surgical practice.

The following Table 7 reports some examples of answers that were rated as concordant, inconclusive, and discordant [14, 25].

Table 7 Examples and ways of rating the given answers to the question: What is your understanding of AI applied to surgery?

The statements were also analyzed according to different keys, namely, the fact that they recalled the idea of AI supporting clinical decision-making and of learning/training, and the eventual other technologies linked to AI. Most surgeons (409, 63% of the sample) mentioned the impact on clinical decision-making, and 57 (9%) recalled the potential of AI for surgical training. Among the most cited technologies that may be linked with AI, there are Big Data (mentioned by 130 participants – 20% of the sample) and new technologies in general terms (107, 16%).

Table 8 highlights such results.

Table 8 Understanding of AI applied to surgery – Main results

The potential of AI today and tomorrow

Surgeons were asked to express their opinion about the importance of AI-based applications to support clinical decision-making at the current time and on a five-year horizon, using a 5-point Likert scale.

Participants rated the importance today with a mean of 3.06 out of 5 and a standard deviation of 1.1. In the future, the sample rated the importance as 3.88 as an average with a standard deviation of 0.97.

The goals of AI

One last question referred to the perceived goals and benefits of AI-based in supporting clinical decision-making. Five items needed to be rated on a 5-point Likert scale.

Interestingly, surgeons rated all five items as equally relevant (ranging from a mean score of 3.51 to 3.78). The item with the highest score sees AI as a support tool to validate decisions that clinicians would make anyway.

The following Table 9 reports the results.

Table 9 Goals and benefits of AI applied to surgery

Discussion

The results of our survey are in line with previous studies [13], with trauma and emergency surgeons showing interest in AI but still having several doubts and concerns about its actual application and potential as a decision-making aid.

When enquired about the most effective tools to support clinical decision-making, surgeons declare to rely more on training, clinical guidelines, and the support of their multidisciplinary staff and colleagues. Interestingly, these three items represent central and “classical” elements in the trauma and emergency surgery context [14] and in the action of the leading scientific societies like the WSES, which promotes training modules for both physicians and nurses [26], clinical guidelines covering several aspects of the clinical profession [27], and studies on multidisciplinary team dynamics [14, 15]. AI and Machine Learning tools got one of the lowest rates (3.56), with the highest standard deviation (1.07). The broad standard deviation interval may represent a gap in the surgical community, with some surgeons firmly believing in the potential of such new technologies and others still having severe concerns about their practical application.

Interestingly, while no significant differences emerge when considering the institution and the position held, geographical differences occur in the sample, with some countries giving higher scores than others. It would be interesting to investigate further while such differences emerge, for instance, in terms of cultural mindset, training opportunities, availability of technology or tech partners, and presence of technological surgical leaders or ambassadors. Moreover, when analyzing the sample under different lenses simultaneously (institution, role, and country), residents do not appear to be technology enthusiasts and believers, despite their younger age and the definition of digital natives.

When considering the decision-making process dynamics, trauma and emergency surgeons underline the need for sound decision aids. Indeed, the most rated elements refer to the potential gap between reality and the surgeon's initial assessment and the lack of accurate data. Even in this case, most respondents did not believe that AI offers valuable support for decision-making. Like in the case of decision aids, the contribution of AI to decision-making got the highest standard deviation, depicting the two extreme views, with enthusiastic adopters in contrast with skeptics.

Regarding the knowledge about AI, while most surgeons declare to be familiar with the concept of AI, many exhibited only a partial understanding of AI. Interestingly, many surgeons recall big data technology, and some associate AI primarily with robotics. While robotic surgery may represent one promising application of AI in surgical science [13, 28, 29], AI in its current state offers sound decision-making support, like in the case of the POTTER algorithm as an emergency surgery risk calculator [5, 6].

The current general skepticism of trauma and emergency surgeons about AI is also confirmed by the given rate on the relevance of AI-based tools for decision-making as they appear today, with a mean of 3.06. One more time, a high standard deviation interval depicts divergent opinions and beliefs in the surgical community. Still, the future looks brighter, with a higher perceived relevance on a five-year horizon.

The same picture and the divergent trust in using AI-based decision tools emerge in the question about the goals and benefits. Interestingly, the degree of technology acceptance appears generally low, with surgeons being rather skeptical when it comes to AI ensuring better support and outcomes than traditional tools. Such a low degree of acceptance opens up new research avenues to investigate the main concerns in the practical adoption of AI or the ethical bias connected to it. In such a perspective, one primary concern may reside in the lack of technical knowledge regarding AI, which emerges from several questions, especially that on the understanding of AI. Moreover, when enquired about legal responsibility (a topic to which medical doctors are particularly sensitive), surgeons modestly rate the need to share their legal responsibility with either the manufacturer, those in charge of maintenance, or the data manager, and so the need to rethink the informed written consent when new technologies are involved.

Limitations

Although our survey got a reasonably high response rate and 650 participants (approximately 70%), the sample is not equally distributed geographically. Indeed, most participants work in Europe and, more specifically, in Italy. The specific situation of the Italian and European contexts, including the actual access to AI-based technology and the medical education related to it, may have biased some of our results. Our limitations, along with the international community's perceived interest in AI [13], may stimulate new in-depth studies and investigations.

Conclusion

In concluding our work, we return to the premises and research gaps that inspired it. AI-based applications are gaining traction in surgery as decision-making aids, and the surgical community is showing interest in them. Trauma and emergency contexts are often challenging, and surgeons perceive the need for sound decision-making tools.

Our results underline how the emergency surgical community seems to include surgeons who strongly believe in the potential contribution of AI technology and who may act as enthusiastic early adopters [19] and those who are reluctant, feeling more comfortable relying upon more classical aids like guidelines, traditional training, and multidisciplinary groups’ support. Some countries seem to have greater comprehension and trust of AI-based tools. Younger surgeons, who some presume to be keener on using new technologies, do not appear to have greater trust or understanding of AI compared with mid-career and senior surgeons.

Given the potential of AI in clinical practice [7, 20, 30] and the speed of development, the role of scientific societies like the WSES and surgical training programs are crucial to expanding knowledge regarding AI-enabled decision aids, disseminate new approaches, and encompass them in training modules and clinical guidelines, to bridge the gaps between technology enthusiasts and those who misunderstand and mistrust AI. In addition, AI-enabled decision aids in surgery should seek to gain the trust of surgeons by establishing transparency and by demonstrating performance advantages that improve patient care.