Gastric gastrointestinal stromal tumors (gGISTs) are the most common type of submucosal tumors (SMTs) found in the stomach. Standard preoperative diagnosis methods for gGIST include white light endoscopy, endoscopic ultrasound (EUS), and CT examination [1, 2]. According to the National Comprehensive Cancer Network (NCCN) guidelines, regular endoscopic follow-up is recommended for small gGISTs without a high risk of malignant changes under EUS [3]. However, due to the malignant behavior of gGISTs, European and Japanese guidelines recommend resection for all pathologically confirmed cases [4, 5].

Advancements in endoscopic resection (ER) techniques offered a promising opportunity for treating low-risk gGISTs, with the advantages of faster postoperative recovery, shorter hospital stays, and lower cost [6,7,8]. Nevertheless, ER of gGISTs can be complex and carries the risk of severe intraoperative bleeding, tumor rupture, and conversion to laparoscopic resection. To improve the quality of gGISTs ER outcomes, evaluating the level of operative difficulty before the surgery is essential.

The difficulty scoring system (DSS) is an effective tool for objectively evaluating the degree of surgical difficulty, which has been widely used for laparoscopic liver resection, splenectomy, and resection of digestive tract malignant tumor [9,10,11]. Therefore, this study aimed to develop a DSS that can objectively evaluate the difficulty of ER of gGISTs and to assess its ability of this model to predict the difficulty of this procedure.

Material and methods

Patients

We retrospectively analyzed the data of patients who underwent ER for gGISTs at the First Affiliated Hospital of Soochow University from December 2010 to December 2022 and randomly assigned all patients to the training cohort (TC) and internal validation cohort (IVC) on a 7:3 basis. For external verification, we collected data of patients with gGISTs who received endoscopic therapy at Changshu Hospital Affiliated to Soochow University, No.1 People’s Hospital of Kunshan, and No.2 People’s Hospital of Changshu from January 2017 to December 2022. The principal inclusion criteria were: (1) postoperative pathological and immunohistochemical diagnosis as a gGIST; (2) preoperative blood routine, hemagglutination, and electrocardiogram tests were normal; (3) patients had no lymph nodes or distant metastasis. The exclusion criteria were: (1) lesions with a high risk of malignant changes under EUS; (2) patients with synchronous lesions at different sites; (3) patients with multiple lesions in the stomach; (4) patients who cannot tolerate anesthesia and surgery due to poor cardiopulmonary function; (5) the patient’s medical record was incomplete. The ethics committee approved the study protocol for clinical research at our institutes. All patients were informed about the risks and benefits of ER and signed a written informed consent for the procedure.

Endoscopic equipment and procedures

We used three ER techniques according to the lesion: endoscopic submucosal dissection (ESD), endoscopic full-thickness resection (EFTR), and submucosal tunnel endoscopic resection (STER). Details of the ER procedures were described previously [12,13,14]. Procedures were fulfilled by endoscopists with a different experience in ER of gGISTs; however, all cases were performed by senior endoscopists, each of whom had performed more than 5,000 gastroscopy and colonoscopy procedures and more than 200 EMR procedures for gastrointestinal polyps before undergoing ER of gGISTs. When an endoscopist had performed more than 50 gGISTs ER, he or she was defined as an experiened endoscopist in our study. All patients received general anesthesia and endotracheal intubation. ER was performed by a dual knife (KD-650L; Olympus, Japan), an insulated-tip knife (KD-611L; Olympus, Japan), or a combination. A single-channel endoscope (GIF-Q260J, Olympus, Japan) with a transparent cap attached to the endoscope tip was used. A High-frequency electric coagulation and electrocautery device (ERBE VIO 200D) was used for energy output. Other equipment included: metallic clips, nylon loops (LeClampTM Loop-20 and Loop-30; Leo, Changzhou, China), over-the-scope clips(OTSC), injection needles, hot biopsy forceps, and carbon dioxide insufflator.

Postoperative management

Postoperative specimens were fixed in 10% formalin solution, and immunohistochemical staining (CD117, CD34, Dog-1, etc.) was performed to confirm the diagnosis. Usually, all patients underwent nasogastric decompression after surgery and were required to fast for two days (3 days or longer for EFTR patients, according to the postoperative condition). Blood routine, CRP, and/or calcitonin examination were performed after surgery, and proton pump inhibitors, gastric mucosal protective agents, fluid replacement, and nutritional support were given to all patients. When patients have abdominal pain or muscle tension, a CT or orthostatic X-ray examination is performed to rule out a postoperative perforation and undergo antibiotic therapy or surgical treatment depending on the condition. Antibiotics were recommended for patients with either intraoperative perforation or postoperative infection.

Data collection

Each patient’s information, including age, gender, primary symptom, history of smoking or drinking, past medical history, body mass index (BMI), American Society of Anaesthesiologists (ASA) score [15], tumor size, location, shape, invasion depth, boundary, procedure duration, intraoperative and postoperative complications, type of ER technique, R0 resection rates, modified National Institutes of Health (NIH) risk criteria [16], postoperative hospitalization, and postoperative fasting days, were obtained from electronic medical records of our institutes.

Definitions

A difficult case was defined as an operative time ≥ 90 min, or the occurrence of severe intraoperative bleeding, or conversion to laparoscopic resection. The operative time was defined from the start of the submucosal injection to the completion of the closure of the defect. Severe intraoperative bleeding was defined as repeated endoscopic hemostasis with a postoperative hemoglobin drop of > 2 g/dL or a requirement for surgical assistance [17, 18]. Characteristics of tumors were judged according to the preoperative endoscopic ultrasound examination or abdominal enhanced CT. Postoperative complications mainly include delayed bleeding, delayed perforation, and postoperative infection. Delayed bleeding was defined as clinical evidence of bleeding after ER, characterized by hematemesis or melena, a decrease in hemoglobin levels greater than 2.0 g/dl within 24 h, or requiring endoscopic therapy [19]. Delayed perforation was confirmed by X-ray or computed tomography. Postoperative infection was defined as postoperative body temperature exceeding 37.5℃ and/or accompanied by increased inflammatory indicators such as blood routine, CRP, or calcitonin [20].

Statistical analysis

Categorical variables were expressed as frequencies and percentages, and the Chi-square test or Fisher exact test was used for comparison between groups. Continuous variables were expressed as median and interquartile ranges (IQR), and the Mann–Whitney U test was performed for comparison between the two groups. The Bonferroni correction method was used for the pairwise comparison of data between multiple groups. A preliminary univariate analysis was performed to select risk factors for the difficult procedure. Predictors with a P-value < 0.05 in the univariate analysis were subjected to a multivariate analysis using the logistic regression model. A P-value < 0.05 was considered statistically significant. DSS was developed based on the β value of the regression coefficient in the multivariate analysis. We defined the variable with the smallest regression coefficient as 1 point and divided the other variables by the multiple of the smallest variable to define the corresponding score. The discrimination of the DSS was assessed by calculating the area under the curve (AUC) of the receiver operating characteristic (ROC) with a 95% confidence interval (CI). We assessed the calibration of the DSS with the Hosmer–Lemeshow goodness-of-fit test. From the continuous score in the TC, we classified patients into three categories according to the level of difficulty (easy, intermediate, and difficult). The rate of difficult operations for ER of gGISTs in each group was compared in TC, IVC, and EVC. Statistical analyses were conducted using SPSS version 26 (Chicago, IL, USA).

Results

General characteristics of patients and lesions

A flow chart of the study protocol is shown in Fig. 1. The characteristics of 431 gGISTs in the TC and the IVC are shown in Table 1. The difference in patient and lesion characteristics were not significant between the TC and the IVC (P > 0.05). The data of the EVC and its comparison with the TC are shown in Supplementary Table 1. There were statistically significant differences between the two cohorts in terms of gender, tumor location, tumor boundary, and tumor size (P < 0.05), while no significant statistical difference were observed between the two groups in other aspects (P > 0.05).

Fig. 1
figure 1

Flow chart of the study. gGISTs: gastric gastrointestinal stromal tumors

Table 1 Baseline characteristics of the patients and lesions

Procedural outcomes related to ER of gGISTs

There were 49 (16.2%) and 30 (23.3%) difficult cases in the TC and IVC (Table 2). Of the 431 patients, 288 (66.8%) were performed by experienced endoscopists, and 143 (33.2%) were performed by less-experienced endoscopists. Patients treated with ESD, EFTR, and STER accounted for 49.2%, 49.2%, and 1.6%, respectively. The median operative time was 59.0 min (IQR, 45.0–78.0 min). The conversion rate was 3.2% (14/431), the severe intraoperative bleeding rate was 3.2% (14/431), the R0 resection rate was 91.4% (394/431), and the postoperative complications rate was 13.5% (58/431). The median postoperative fasting days and postoperative hospitalization days were 3.0 days and 6.0 days. There were no differences in ER procedure outcomes between the TC and the IVC (P > 0.05) (Table 3). For the 124 cases in the EVC, difficult cases were experienced in 18 (14.5%) cases (Supplementary Table 2), the median operative time was 68.0 min (IQR, 50.0–75.0 min), and the severe intraoperative bleeding rate was 3.2% (4/124). Supplementary Table 3 presents details of the procedural outcomes related to ER of gGISTs in the EVC and compares them with those of the TC. The EVC had longer operative times (median 68.0 vs. 59.0 min, P = 0.016) and lower rate of conversion (0% vs. 3.3%, P = 0.039) compared to the TC. In other aspects, there was no significant statistical difference observed between the two groups (P > 0.05).

Table 2 Distribution of cases defined as difficult procedure in training cohorts and internal validation cohort
Table 3 Procedural outcomes related to ER of gGISTs

Risk factors for the difficulty of ER of gGISTs

Univariate analysis revealed that tumor location, shape, size, invasion depth, and endoscopists’ experience were risk factors for ER of gGISTs (Table 4). Multivariate analysis showed that the independent predictors for the difficulty of ER of gGISTs in the TC were tumor size, location, invasion depth, and endoscopists’ experience (Table 5). Compared with gGISTs < 2.0 cm, gGISTs of 2.0–3.0 cm significantly increased the ER difficulty (OR 2.344, 95% CI 1.091–6.040, P = 0.031), and the strongest risk indicator was gGISTs larger than 3.0 cm (OR 14.271, 95% CI 4.880–41.735, P < 0.001). Lesions in the upper third of the stomach exhibited significantly greater difficulty than other sites (OR 5.402, 95% CI 1.054–27.702, P = 0.043). A gGIST originating from MP-ex increased the difficulty more than a superficial gGIST (OR 4.433, 95% CI 1.367–14.382, P = 0.013). The risk of difficulty for procedures performed by non-experienced endoscopists was significantly higher than that of experienced endoscopists (OR 2.344, 95% CI 1.125–4.884, P = 0.023).

Table 4 Univariate analysis of the risk factors for the difficulty of ER of gGISTs
Table 5 Multivariate analysis of the risk factors for the difficulty of ER of gGISTs and development of DSS

Development and validation of DSS for ER of gGISTs

We assigned a score for each risk factor based on the β-coefficient of the four independent risk factors in the multivariate analysis of the TC. The points of the four risk factors were as follows: tumor size ≥ 3.0 cm (3 points) or 2.0–3.0 cm (1 point); location in the upper third of the stomach (2 points); invasion depth beyond the muscularis propria (2 points); lack of experience (1 point) (Table 5). Figure 2 depicts the relationship between the score and the rate of difficult operations for ER of gGISTs. The rate of difficult operations for scores of 0, 1, 2, 3, 4, 5, 6, 7, and 8 in the TC were 2.6%, 3.7%, 7.1%, 9.0%, 25.0%, 35.7%, 81.8%, 100.0%, and 100.0%, respectively. The AUC for the DSS in the TC was 0.815 (95% CI 0.744–0.886, Fig. 3A), showing a good discrimination ability. The DSS was also well calibrated according to the Hosmer–Lemeshow goodness-of-fit test (c2 = 8.684, P = 0.276). The sensitivity and specificity were 0.714 and 802, and the positive predictive value (PPV) and negative predictive value (NPV) were 0.412 and 0.935. The difficulty for ER of gGISTs in the TC was then categorized into three groups: (1) easy (score 0–3); (2) intermediate (score 4–5), and (3)difficult (score 6–8). The rate of difficult operations for ER of gGISTs in each group was 6.5%, 29.4%, and 88.2%, respectively (Fig. 4).

Fig. 2
figure 2

The rate of difficult operations for different scores in training cohort and internal validation cohort; gGISTs: gastric gastrointestinal stromal tumors

Fig. 3
figure 3

ROC curve of difficulty scoring system; A represent for the training cohort and B represent for the internal validation cohort

Fig. 4
figure 4

The rate of difficult operations for different grade of difficult in training cohort and internal validation cohort; gGISTs: gastric gastrointestinal stromal tumors; *P < 0.05, **P < 0.01, ***P < 0.001

In the IVC, the rate of difficult operations for scores of 0, 1, 2, 3, 4, 5, 6, 7, and 8 were 7.7%, 10.0%, 5.1%, 10.3%, 44.4%, 46.7%, 75.0%, 100.0%, and 87.5%, respectively (Fig. 2). The AUC for the DSS was 0.838 (95% CI 0.743–0.933, Fig. 3B), showing a good discrimination ability. The DSS was also well calibrated according to the Hosmer–Lemeshow goodness-of-fit test (c2 = 11.728, P = 0.110). The sensitivity and specificity were 0.767 and 848, and the PPV and NPV were 0.605 and 0.923. The rate of difficult operations for ER of gGISTs in 3 groups were 7.7%, 45.8%, and 85.7%, respectively (Fig. 4).

The AUC for the DSS in the EVC was 0.864 (95% CI 0.781–0.947, Supplementary Fig. 1), showing a good discrimination ability. The DSS was also well calibrated according to the Hosmer–Lemeshow goodness-of-fit test (c2 = 1.547, P = 0.992). The sensitivity and specificity were 0.889 and 651, and the PPV and NPV were 0.302 and 0.972. The rate of difficult operations for scores of 0, 1, 2, 3, 4, 5, 6, 7, and 8 were 0%, 0%, 6.5%, 17.2%, 20.0%, 42.9%, 100.0%, 66.7%, and 100.0%, respectively (Supplementary Fig. 2). The rate of difficult operations for ER of gGISTs in 3 groups were 7.0%, 29.4%, and 85.7%, respectively (Supplementary Fig. 3).

Discussion

In the past, laparoscopic wedge gastrectomy (LWG) was the standard procedure for resecting gGISTs [21]. With the advent of ER techniques represented by ESD, ER of gGISTs has offered an opportunity to treat gGISTs at low risk [22]. Several studies have shown that ER has better short-term efficacy for gGISTs than LWG, and comparable long-term efficacy [6,7,8, 23, 24]. However, ER for gGISTs is more challenging, and the risk of intraoperative perforation is significantly higher than that for early gastric cancer. The probability of perforation in ER of gGISTs was about 33.3%, while the probability of perforation in ER of early gastric cancer was about 0.5–4.1% [25, 26]. Therefore, endoscopists need to master not only intraoperative hemostatic technology but also perforation repair skills to successfully complete the ER of gGISTs. In order to evaluate the difficulty for ER of gGISTs preoperative, Su et al. [27]constructed a Nomogram risk prediction model and showed that the tumor size, invasion depth, and the experience of the endoscopist were independent risk factors. However, a number of previous studies have shown that lesion site has an impact on the difficulty of endoscopic surgery [28,29,30,31,32]. Therefore, this study intends to further analyze preoperative risk factors that affect the difficulty of ER for gGIST through a multi-center and large sample size study and quantify the procedure difficulty by constructing a DSS.

Operation time, the incidence of serious intraoperative complications, and whether to transfer surgery are commonly used surrogate indicators of surgical difficulty [33,34,35]. Similar to the method used to define difficult procedures in Su et al.’s study [27] (an operative time ≥ 90 min, or the occurrence of severe intraoperative bleeding), the cases transferred to surgery were also defined as a difficult procedures in this study. After analyzing the patients, lesions, and endoscopists characteristics, we found that tumor size, location, invasion depth, and endoscopists' experience to be independent predictors that affect the difficulty for ER of gGIST. Lesions with size ≥ 2.0 cm, invasion depth beyond MP, located in the upper third of the stomach, and endoscopist's lack of experience ( ER of gGISTs < 50 cases) were more likely to undergo the difficult procedure. Based on these findings, we developed a preoperative DSS for ER of gGISTs and validated our findings in IVC and EVC. We found that the DSS we developed had good discrimination and calibration in both cohorts.

We found that tumor size is a key factor in predicting whether ER of gGISTs is difficult, with odds ratios of 2.567 and 14.271 for lesions with diameters of 2.0–3.0 cm and ≥ 3.0 cm. Previous studies have also shown that lesion size is an important factor affecting the difficulty of endoscopic surgery. Su et al. [27] and Jian et al. [28] found that ER was difficult when the tumor size was more than 3.0 cm. Due to the limited operating space of ER, the operating space and surgical field of view are worse in the treatment of gGISTs with large tumor sizes. Endoscopicists need to repeatedly adjust the angle of the endoscopic incision and the amount of air in the stomach cavity to completely remove the tumor. Moreover, larger tumors carry a higher risk of intraoperative bleeding. Endoscopists needs to have rich experience in endoscopic hemostasis to shorten the operation time and reduce the risk of transfer to laparoscopic resection. Therefore, ER of gGISTs with large tumor sizes should be performed by experienced endoscopists in order to reduce the occurrence of complications, shorten the operation time and improve the success rate of surgery.

Tumor location is also an indicator for gGISTs ER difficulty in our study. A previous study of 916 patients with early gastric cancer showed that ER required longer operative time when the lesion was located in the upper third of the stomach compared to those in the middle and lower thirds of the stomach [36]. Yoon et al. [31] also reported that ER of tumors in the upper third of the stomach required a longer time and had a higher risk of perforation. Wang et al. [37] found that ER of gGISTs located in the cardia was particularly difficult, with a higher probability of intraoperative bleeding (100%) and perforation (20%). This may be due to the special anatomical structure of the area, which often requires the endoscopist to retroflex the endoscope and consume more time, especially for doctors lacking surgical experience. Additionally, the abundance of blood vessels in the upper third of the stomach increases the risk of intraoperative bleeding. Sun et al. [38] compared the effectiveness of double-bend endoscopy and single-bend endoscopy in the treatment of gGISTs and found that compared with the single-bend endoscopy group, the double-bend endoscopy group had shorter operation time and a lower incidence of perforation. Due to the design of the double-curved structure, the lens can be adjusted at a larger angle so that it can be more easily close to the cardiac, the fundus of the stomach, and other special parts, so as to better expose the tumor and improve the clarity of the surgical field of vision. Therefore, for the GIST located in the upper third of the stomach, in addition to be operated by experienced endoscopists, double-curved endoscopy can be selected for conditional units.

We have found that gGISTs with invasion depth beyond the muscularis propria constitute another independent risk factor that affects the difficulty of ER, consistent with the results of Su et al. [27]. For lesions with deep infiltration and external growth, active perforation technique may be required to achieve complete resection, and in some cases, even intraperitoneal surgery may be necessary for tumor dissection, which greatly increases the difficulty of surgery. At the same time, when a large perforation occurs, the gastric cavity is not well filled, which seriously affects the surgical field of view under endoscopic operation, and the repair of the perforation requires more time. Therefore, for a lesion with invasion depth beyond the muscularis propria, it should be performed by experienced endoscopists. Meanwhile, direct laparoscopic resection can sometimes be considered depending on the size, location, and invasion depth of the lesion.

Surgical experience is also a crucial factor affecting the difficulty of surgery. According to Sun et al. [39], the learning curve of ER of gastric submucosal tumors was about 32 cases. Yoshida et al. [40] retrospectively analyzed the learning curve of 7 novice endoscopists in ER of gastric lesions, and the results showed that a steady state could be achieved after completing approximately 30 cases. Combining the findings of both studies, we believe that the number of cases needed to reach the learning curve may differ among endoscopists. In our study, we set a threshold of 50 patients to determine the experience level of the endoscopists in performing ER on gGISTs.

The surgical indications and specific operation steps for different endoscopic procedures vary, thereby affecting the difficulty of ER. In our study, the main endoscopic techniques used were ESD and EFTR. We found that the incidence of difficult cases in the EFTR group was higher than that in the ESD group (22.6% vs. 14.2%), and the difference between groups was statistically significant (P < 0.05). For tumors with external growth, EFTR is the only method for ER of gGISTs. However, for gGISTs that originate from the MP, which endoscopic procedure should be adopted and whether active perforation should be performed usually need to be judged intraoperatively. Since most gGISTs originate from the MP, endoscopic procedures cannot be defined before surgery. Therefore, in this study, endoscopic procedures were not included in the DSS. Similarly, the method of defect closure may also affect the difficulty of surgery. Smaller defects are closed with OTSC or titanium clips, while large defects require the use of nylon rope. Therefore, the closure method of the defect should be reasonably selected according to the size of the intraoperative wound surface, which could not be determined before surgery.

Our study had some limitations. First, it was a retrospective study, which may have certain selection bias and information bias. Future prospective studies with larger sample sizes can be conducted to further explore the risk factors affecting the difficulty of ER of gGISTs. Second, with the innovation of various endoscopic instruments, hemostatic materials, wound closure devices, etc., risk factors affecting the difficulty of endoscopic surgery may change, so the DSS should also keep pace with the times. Third, in this study, the risk factors of ER of gGISTs were comprehensively analyzed, but the operation steps of ESD, EFTR, and STER endoscopic techniques are different, and their respective difficulties may be different. Further research on specific endoscopic techniques may be helpful in analyzing the specific difficulties corresponding to different endoscopic techniques.

In conclusion, the DSS base on four independent risk factors (tumor size, location, invasion depth, and endoscopists' experience) was closely associated with the rate of difficult operation for ER of gGISTs. This DSS can objectively evaluate the difficulty for ER of gGISTs, which can help endoscopists select cases comparable to their own technical level and develop individualized treatment plans for cases of different difficulty.