Introduction

The workload of a general surgeon is multifaceted, encompassing not only surgical procedures but also a myriad of other responsibilities [1,2,3,4,5]. High workloads and insufficient recovery time can harm surgeons and patients, impacting the quality of care, outcomes, and surgeon’s well-being [6].

Surgical outcomes, quality of complex decision-making, and academic tasks are influenced by alertness, reaction time, eye–hand coordination, and concentration. While there is increasing focus on patient safety and adverse events because of medical errors [7, 8], less attention has been paid to surgeons’ well-being [9, 10]. This started a debate on how doctors should handle post-call fatigue and elective surgery after on-call. Several suggestions have been made [11], including that surgeons have ethical obligations to sleep and be well-rested to perform safe surgery [12].

Surgeon’s workload and the opportunity to access research might be influenced by resources, patient demographics, professional development opportunities, societal expectations, and governmental support. However, the role of such agents needs to be elucidated [13]. In an era where the sustainability of healthcare systems is under scrutiny, there is a clear need to better understand these dynamics and develop effective strategies to manage surgeon workloads and professional development at a global level. The present large-scale, international study aimed at filling this gap, analyzing the interrelationships between clinical workload, academic engagement, on-call shifts, and post-on-call rest. It also assessed the potential association between workload and local factors.

Methods

The Association of Italian Surgeons in Europe (ACIE), in collaboration with Association of Italian Surgeons in North America (AISNA) and Italian Surgical Research Group (ItSURG), conducted an internet-based cross-sectional survey to detect differences, strengths, and limitations among different national health systems concerning general surgery activity.

A brief informative letter about the study was shared with 1.600 surgeons. Informed consent of individuals was obtained with Google Forms. Survey respondents were informed of the purpose of the study, and their participation remained voluntary as no incentives were offered.

This survey was developed and reported as per the Checklist for Reporting Results of Internet E-Surveys (CHERRIES) [14]. For this research, which involved a non-intrusive, anonymized web-based survey, no formal ethical approval was deemed necessary.

Questionnaire development and composition

The Steering Committee developed the questionnaire using web-based and remote discussions after identifying the components and topics to include. The questionnaire underwent a preliminary beta testing phase before its official deployment to ensure its comprehensibility, relevance, and user-friendliness. Once agreement had been reached, the questionnaire (The On-calL shIft fOr GeNEral SurgeonS LIONESS Study Form) was completed using Google Form survey software (Google, Mountain View, California, USA).

The questionnaire had six sections and 35 questions (Supplementary Material Table 1). Both closed-ended (19/35) and open questions (16/35) were used. The first four sections included general questions about the baseline characteristics of the respondents, country and continent of practice, type of sub-specialty, years of practice, practice level, and hospital organization. The other sections focused on the type of clinical practice, research activity, and on-call shifts.

Study circulation

On April 1st 2023, the questionnaire was made available online and was open for completion until May 15th 2023. The survey (https://docs.google.com/forms/d/e/1FAIpQLSerFw4p_T2ADKdwkacN-SiPLyevO0KOP1PbO_esLkJxVVZqXg/viewform?usp=sf_link) was disseminated through general surgeons and trainees within the ACIE, AISNA and ItSurg networks. Three reminder emails were sent at three-week intervals, encouraging participants who had not yet responded to complete the survey.

Participants

Participants consisted of general surgeons comprising surgical trainees and fellows (classified as trainee for the planned analyses) and consultant/attending surgeons, researchers/professors, and directors (classified as consultant/professor for the planned analyses) currently covering on-call shifts in their hospitals across Europe, Asia, Africa, Oceania, North and South America. We excluded specialists in other surgical fields, such as gynecology, vascular surgery, head and neck surgery, cardiac and thoracic surgery, neurosurgery, and urology.

Data handling and extraction

A Steering Committee member (MP) downloaded the questionnaires and shared them with the other members for data analysis and discussion. Multiple entries from the same individual were sought manually and eliminated.

Domains assessed

  • Clinical practice: number of on-call duties per month, duration of each on-call shift, number of patients managed during an on-call shift.

  • Academic engagement: number of scientific articles published per year, number of scientific articles read per month, frequency of attending scientific conferences as a faculty member per year, number of scientific conferences attended per year in a learning capacity.

  • Post-on-call rest: the provision of a day off following an on-call shift or two days off if on-call during public holidays.

Statistical analysis

Categorical variables were reported using counts and percentages, while continuous variables were reported by means and standard deviation or median and interquartile (IQR), depending on the type of distribution. Variables were stratified between the two different levels of practice (trainee vs consultant/professor) and continents, compared using 4 × 2 or 5 × 5 contingency tables, and analyzed using the Chi-square test for categorical or one-way analyses of variance (ANOVA) for continuous variables, respectively. Univariable logistic regression analysis examined the association between variables and providing a day off following an on-call shift.

All variables associated with the provision of a day off following an on-call shift at a threshold of p-values less than 0.05 in the univariable logistic regression analyses were entered in the final multivariable model. Multivariable logistic regression analyses with backward stepwise elimination tested which variables had the best ability to explain the chance of having a day off after an on-call shift.

Results were considered statistically significant if p-values were < 0.05. SPSS® version 28 (IBM, Armonk, New York, USA) was used for the statistical analysis.

Different subgroup analyses were planned to aim at exploring possible variations in practice and workload of surgeons based on the level of practice, length of the on-call shifts (> 12 h), different continents, and different Human Development Index (HDI) (retrieved from https://hdr.undp.org/data-center/human-development-index#/indicies/HDI). HDI is a composite measure including the health (life expectancy at birth), education (mean years of schooling for adults and expected years of schooling for children), and the standard of living (per capita income indicators) dimensions commonly used to measure a country’s development by the United Nations Development Programme [13]. The countries, classified into four tiers based on the HDI, were categorized as having a high HDI, very high HDI, or a non-very-high HDI to ensure a balanced comparison among the groups.

Results

Baseline characteristics of the respondents

Our survey was distributed to approximately 1.600 surgeons. We received a total of 1.046 responses (65.4%) (Table 1), of whom 40.2% were based in Italy and 11.8% in Spain. Most (78.1%) respondents were from Europe and 65.1% worked at a general surgery unit. Participating surgeons were general (34.4%), colorectal (32.2%), HPB (12.4%), and emergency surgeons (7.9%). Overall, they had a duration of surgical activity lasting 10.7 years. Responders mainly came from public teaching hospitals (75.9%) with more than 400 beds (47.6%). The global distribution of respondents by HDI categories is shown in Fig. 1.

Table 1 Baseline characteristics and study results
Fig. 1
figure 1

Global distribution of survey respondents by Human Development Index (HDI) categories. This map illustrates the geographic dispersion of participants in our survey, categorized by the HDI of their respective countries. Areas are color-coded to represent different levels of development: very high (dark green), high (light green), medium (orange), low (red), and regions with no data (grey)

In Europe, 70.5% of responders were male, compared to 96.7% in Africa (p < 0.001). Age distribution data and years of practice showed variations underlining age-related demographic differences.

Hospitals with 0–50 beds were represented by 13% of responders in Europe and 1.3% in Asia (p = 0.015). Hospitals with > 400 beds were attended by 50.4% of European, 66.7% of North American, 53.9% of Asian, 33.3% of African, and 10.2% of South American responders (p < 0.001).

Data from the continent and professional level revealed significant differences in clinical workload, academic engagement, on-call activity, and post-on-call rest (Table 2).

Table 2 Study results stratified according to professional level (Trainee vs. Consultant/Professor) and Continent

Clinical workload

The distributions of clinical and academic responsibilities across continents and across different HDI levels are reported in the Figs. 2 and 3.

Fig. 2
figure 2

Distribution of clinical and academic responsibilities across continents (Consultant/Professor Left, Trainee Right). This heatmap illustrates the comparison of clinical and academic tasks undertaken by medical trainees and consultants across five continents. Color intensity correlates with the frequency of each activity, ranging from private practice to on-call duties, with darker shades indicating higher prevalence

Fig. 3
figure 3

Distribution of clinical and academic responsibilities by Human Development Index (HDI) (Consultant/Professor Left, Trainee Right). This heatmap displays the variance in clinical and academic responsibilities among medical trainees and consultants/professors across countries with different HDI levels. Color intensity correlates with the frequency of each activity, ranging from private practice to on-call duties, with darker shades indicating higher prevalence

In Europe, 80% of responders worked in public teaching hospitals vs 45.8% in North America (p < 0.001). In North America 54.2% worked in private teaching hospitals vs 9.2% in Europe (p < 0.001); 39.9% of respondents declared that they had a private practice in addition to their ordinary activity. The survey data on private practice across continents showed South America and Africa at the top, with 69.6% and 55.0% of responders engaged in private practice, followed by Asia (39.5%) and Europe (36.8%) (p < 0.001).

Twelve-point eight percent of trainees were involved in private practice compared to 50.8% consultants/professors. This disparity was consistent across continents (11.3% vs 45.2% in Europe, 27.3% vs 77.6% in South America, 30% vs 60% in Africa).

Academic engagement

The survey data on research activity showed overall high engagement across continents, with significant disparities. In Europe 92.8% of responders were involved in research, 87.5% in North America, and 71.7% in Africa (p < 0.001).

The mean number of research articles published in indexed journals in a year was 6.1(± 8.4), and the mean number of articles read monthly was 12.1(± 14.3). The number of scientific articles published yearly varied (p = 0.004), with responders in Europe (6.6 ± 8.6) publishing the most. The number of articles published by European consultants/professors was 6.76(± 8.41) versus 9.00(± 9.45) by North American trainees.

Involvement in teaching was 100% for North American and 91.7% for African responders, compared to 82.4% for Europeans (p = 0.024).

The mean number of conferences attended as a speaker in a year was 2.8(± 4.8), and the mean number of conferences attended as a learner was 4.7(± 6). In Europe, a mean of 4.5(± 3.9) conferences per year was attended, 8(± 24) in North America, 5.9(± 7) in South America, 5.2(± 9.3) in Asia, and 3.9(± 5.5) in Africa (p = 0.013). For African trainees, the mean was 0.60(± 0.91) conferences attended as faculty, compared to 0.90(± 0.66) for South American trainees; the same occurred with consultants/professors in these areas (1.32 ± 1.48 and 2.98 ± 2.83 conferences).

On-call activity and post-on-call rest

The mean number of on-call shifts per month was 6.7(± 4.9), of which 46.5% in 76–100% cases in-house. The duration of the on-call shift was 12–24 h in 39.3% of cases, 6–12 h in 33% of cases, 18–24 h in 12.8% of cases, 25–36 h in 9.6% of cases, 37–48 h in 1.4% of cases and more than 48 h in 3.9% of cases.

European responders had a mean of 6.5(± 4.9) on-calls per month, with 49.4% requiring in-house presence in 76–100% of the cases, compared with 7.8(± 4.1, 41.7% requiring in-house presence in 76–100% of the cases) in North America, and 8.7(± 6.1, 35.0% requiring in-house presence in 76–100% of the cases) in Africa (p = 0.005). During an on-call shift, the mean number of patients assessed was 13(± 15.7), and 42.9% of responders stated they usually manage patients outside their specialty. The number of on-calls per month for trainees and consultants/professors across continents showed significant variability. In Europe, trainees had 6.38(± 5.47) on-calls, while consultants/professors had 6.65(± 4.60); in North America 8.25(± 2.38) vs 7.70(± 4.26) of consultants/professors; in Asia, 7.66(± 4.15) vs 6.0(± 3.35); in Africa 6.80(± 3.86) vs 9.18(± 6.32), and 5.27(± 1.91) vs 7.36(± 5.16) in South America. Consultants/professors, when on-call, were less likely to be required to stay in the hospital compared to trainees. This pattern was observed across various regions. Regarding on-call shift hours exceeding 48, 0.9% of trainees and 4.0% of consultants/professors in Europe had such shifts, compared with 20% of consultants/professors in Africa and 6.7% of trainees in Asia. In North America, no trainees and 15% of consultants/professors experienced > 48-h shifts.

Overall, 35.1% of respondents declared having a day off after an on-call shift, while 10.4% declared having two days off in the week in which they were on-call during a public holiday. The data on days off after on-call shifts showed significant regional differences. In Europe, 40% of respondents got a day off after an on-call shift vs 20.8% in North America, 20.3% in South America, 23.7% in Asia, and 6.7% in Africa (p < 0.001). For trainees, 40.8% reported having a day off after an on-call shift, with variations by continent. Consultants/professors had a lower overall percentage of 33.7% receiving a day off after on-call shifts. In Europe, 44.1% of trainees received a day off vs 38.3% of consultants/professors, in North America 50% vs 15%, in Africa, 10% vs 6%, in Asia 33.3% vs 21.3%, and in South America 18.2% vs 20.7%. The different professional activity profiles by continent are reported with radar charts in Fig. 4.

Fig. 4
figure 4

Professional activity profiles by continent and role. This radar chart compares the distribution of clinical and academic tasks among medical trainees and consultants/professors across four different continents: Europe, North America, South America, and Asia. Each colored shape represents the composite profile for trainees (blue) and consultants/professors (red) within each continent, across activities like teaching, research activities, private practice and clinical duties. The extent of each axis reflects the level of engagement in the corresponding activity

Analysis of predictive factors of day-off after on-call

In the univariable analysis of predictive factors for having a day off after on-call, both very high and high HDI (OR 3.131) and very high HDI (OR 2.542), hospital capacity over 400 beds (OR 3.007), working in a specialty surgery unit (OR 2.389), and doing in-house on-calls for 76–100% of the cases (OR 5.355) were associated with a day off. Working as consultants (OR 0.678), in private non-teaching hospitals (OR 0.358), and high numbers of weekly public holidays on-call per month (OR 0.848) were associated with the lack of a day off after on-call (Supplementary Material Table 2). Continents with a higher mean number of on-calls per month showed a more significant percentage of surgeons receiving a day off afterward, but this was not uniform across all groups or regions (Supplementary Material Fig. 1). There was an overall positive correlation between the duration of on-call shifts and the likelihood of receiving a subsequent day off. Longer on-call shifts tended to be associated with a higher probability of having a rest day post-on-call across various continents and roles (Supplementary Material Fig. 1). On the general adjusted multivariable analysis HDI (aOR 1.993), hospital capacity > 400 beds (aOR 2.423), working in a specialty surgery unit (aOR 2.087), and in-house on-call (aOR 5.446), predicted the likelihood of having a day off after an on-call shift (Supplementary Material Table 3). Working as consultant/professor (aOR 0.713), and having a high number of on-calls per month (aOR 0.917) were risk factors for not having a day off after on-call. When looking at independent predictors of a day-off following on-call duties in the subgroup of responders who performed extended on-call shifts > 12 h (Supplementary Material Table 4), a Very High HDI (aOR 2.148) and doing in-house on-calls in > 50% of the cases (aOR 4.297) were associated with the chance of having a day-off. Independent predictors of a day off following on-call duties are depicted in Fig. 5.

Fig. 5
figure 5

Independent predictors of a day-off following on-call duties—Multivariable Analysis. General population of responders (A); Subgroup analysis of responders who perform > 12 h on-calls (B)

Discussion

The LIONESS study provided the first comparative overview of the engagement levels of surgical trainees and consultants/professors across various professional domains on different continents. The current study showed that access to scientific research, opportunities for professional development, and the clinical workload in terms of on-calls, length of shifts, and the possibility of having 24 h free to rest afterwards differ depending on the professional level, the continent, and the HDI of the countries.

Alarmingly, 1.1% of trainees and 4.2% of consultants who took the current survey still carried out on-call shifts lasting more than 48 h. In Europe, where the law imposes stringent rules on the length of work shifts, 4% of consultants and 0.9% of trainees stated that they still carried out on-call shifts > 48 h. These percentages were even higher in South America (5.2%), North America (15%) and Africa (20%) among consultants, while in Asia, 6.7% of trainees worked shifts > 48 h. The effects of sleep deprivation on surgeons have been widely investigated, and many studies have reported its negative impact on cognition, attention, and performance [10, 15]. In consideration of the research in this field, the US Accreditation Council for Graduate Medicine Education (ACGME) Work Group on Resident Duty Hours first reform, in 2003, established a maximum 80-h work week and prohibited shift lengths from exceeding 30 consecutive hours [5]. The second reform capped shift lengths at 16 consecutive hours for interns and 28 h for other trainees [16]. The European Working Time Directive for physicians is even stricter, with an average limit of 48 h a week, 11 h of continuous rest every 24 h, a day off each week, a 20-min rest break every six hours, and a limit of eight hours worked in every 24 h for night work [11].

A relevant finding of our study was that in the general analysis, 40.8% of trainees and 33.7% of consultants/professors could not have a 24-h rest after an on-call shift, which is now considered a fundamental recovery practice. The distribution of this variable showed differences based on professional levels and different continents. As far as consultants were concerned, only 6% of those working in Africa could benefit from a 24-h rest after an on-call shift, compared to 38.3% of European consultants. For trainees, more than 40% of those working in Europe and North America were likely to have a day of rest after the on-call shift, compared to only 10% in Africa and 18.2% in South America. How fatigue affects surgeon performance remains an intriguing field of research. Interestingly, Yaghoubian et al. [17] showed that trauma surgery performed at night by residents who have worked longer than 16 h have similar favorable outcomes compared with those performed during the day. Conversely, Dinges et al. reported that continuous wakefulness of periods of more than 21 h is a major predictor of performance errors [18]. It is difficult to establish a specific correlation between the number of hours worked when proceeding with surgery and potential adverse outcomes, as these may be affected by numerous other variables not related to the length of the surgeon's working shift. Moreover, potential correlations may be limited by insufficient power to detect such associations reliably. However, the findings from previous studies suggested that for surgeons during night shift floats, protected days off work are fundamental [16, 19,20,21]. Dunn et al. [22] showed that a three-day period of recovery is insufficient for restorative sleep, while another study by Brown et al. [23] concluded that recovery of sleep duration, efficiency, and quality necessitate five full days. In the current study, the possibility of having adequate rest after an on-call shift of 24 h was shown to be associated with working in a country with a Very High HDI (aOR 1.993), working in large hospitals with > 400 beds (aOR 2.423), in a specialist surgery unit (aOR 2.087) and carrying out in-house on-call shifts (aOR 5.446). It should be noted that holding more experienced roles, such as consultants, was associated with the probability of being unable to take advantage of the day off after an on-call shift.

HDI has been used for comparing surgical outcomes, access to surgery, surgical techniques across countries with different HDI scores [24], and resource allocation understanding health disparities [25], but there is not enough evidence to directly establish a correlation between HDI and surgeon workload and on-call shifts. The LIONESS study revealed a groundbreaking association between HDI and surgeons' likelihood of having a day off following on-call duties. In regions with higher HDI, surgeons were more likely to benefit from rest periods post-on-call. This finding is significant, as it links HDI not only to economic and educational outcomes but also to working conditions in healthcare. This correlation underscores the need for policies prioritizing healthcare worker well-being, especially in lower HDI regions.

For trainees, there appeared to be a high degree of variability across continents in categories such as research and conference attendance, suggesting differences in training focus or career opportunities availability. For consultants/professors, the engagement levels across these categories were more evenly distributed.

Similarly, although research activity was high among the participants in this study in terms of involvement in research activities, teaching, number of scientific articles published and read, and number of scientific conferences attended as speakers and learners, essential differences between continents were observed. Involvement in teaching was relatively higher among trainees in Africa compared to other continents. In contrast, trainees in Europe and North America tended to participate more in research than their counterparts in South America and Africa. In particular, research activity was less accessible in Africa than in other continents, especially for surgeons in training. There was a marked difference in the number of scientific articles published yearly. This varied significantly across continents, with Europe and North America leading. The mean number of scientific articles published yearly showed a drastic reduction in South America and Africa compared to other continents. It should also be noted that, in general, surgeons with top roles published more scientific articles than surgeons in training, thus outlining a probable lack of research training in younger groups who should instead constitute the driving force of scientific research.

Strategic recommendations can be proposed to balance surgeon workloads, enhance research opportunities, and ensure adequate rest. Our analysis suggests that healthcare policies and practices must be tailored to the unique challenges and resources of each region. We advise implementing a system that allows for flexibility within a standardized framework, accommodating the specific needs of different healthcare settings and ensuring surgeons have predictable and manageable workloads. In lower HDI countries, we suggest focusing on optimizing resource allocation by introducing telemedicine consultations to reduce the unnecessary physical workload on surgeons and by utilizing mid-level practitioners, such as nurse practitioners or physician assistants, to handle non-surgical or basic surgical tasks [26]. Additionally, establishing global research collaboratives that include surgeons from high, medium, and low HDI countries is crucial to ensure diverse participation and access to research opportunities, fostering an environment of shared learning and development [27]. Dedicated funding streams for research initiatives specifically designed for lower HDI regions could promote local research development and publication opportunities. For higher HDI regions, adopting legislation that mandates minimum rest periods for surgeons, similar to regulations in place for other high-stress professions, is critical. A potential strategy could be to implement systems that allow for shift swapping and provide additional support during peak times, ensuring that no single individual is overwhelmed, which can be particularly beneficial in settings with limited staffing. Moreover, leveraging digital tools and artificial intelligence for scheduling, administrative tasks, and even preliminary diagnostics can reduce the workload on surgeons, allowing them to focus on essential surgical duties and research.

Strengths and limitations

The main strengths of the LIONESS study are the employment of a standard survey methodology, the large sample size, and the inclusion of all continents, offering a holistic view of surgeons' professional engagements.

A notable limitation of this study is the significant gender disparity among the respondents, with a predominance of male participants. This imbalance reflects a broader issue within the surgical field but also limits the applicability of our findings across genders. The perspectives and experiences of female surgeons, who may face unique challenges and work-life balance issues in various healthcare systems, are underrepresented in our analysis. This gender imbalance could potentially skew the perception of workloads, rest periods, and opportunities for academic engagement. Future research should strive for a more balanced gender representation to capture a comprehensive view of the surgical profession's dynamics. Although efforts were made to encourage global recruitment across all HDI countries, participation from Very High HDI countries was more substantial, potentially introducing a selection bias among collaborating institutions. Last, most of the respondents (65.1%) were from general surgery units, while 34.9% were from subspecialty surgery units. This distribution could influence the survey findings, as experiences and practices might differ between general surgeons and those in surgical subspecialties. The predominance of participants from teaching hospitals likely influenced the responses related to academic activities, such as the number of articles published and read. Surgeons affiliated with academic institutions often have greater exposure to and encouragement for engaging in research activities as part of their professional roles. As a result, their reported levels of research activity and engagement with scientific literature could be higher than those of their counterparts in non-teaching hospitals. In these settings, the focus may be more heavily weighted towards clinical duties, with fewer resources and incentives for research available. Therefore, the experiences and challenges of surgeons in non-teaching, community, and rural hospitals, where academic and research activities might be less emphasized or supported, could significantly diverge from our study's findings. Efforts to engage a broader and more diverse participant pool in future studies could include targeted outreach to non-teaching hospitals and the incorporation of questions specifically designed to unearth the unique challenges and workloads faced by surgeons in these environments.

Conclusions

The LIONESS study revealed critical insights into the disparities in workload, access to research, and professional opportunities for surgeons across different continents. HDI was associated with the likelihood of surgeons having a day off post-on-call, with higher HDI regions offering better work-life balance. Despite regulatory rules introduced in several national healthcare systems, many surgeons continue to work long shifts without adequate rest. Finally, the study highlighted uneven access to research and scientific meetings, further influenced by geographic and possibly economic factors. These findings call for global initiatives to promote equality in healthcare work environments and professional development opportunities.