Deep learning model utilizing clinical data alone outperforms image-based model for hernia recurrence following abdominal wall reconstruction with long-term follow up

Wilson, Hadley H.; Ma, Chiyu; Ku, Dau; Scarola, Gregory T.; Augenstein, Vedra A.; Colavita, Paul D.; Heniford, B. Todd

doi:10.1007/s00464-024-10980-y

Deep learning model utilizing clinical data alone outperforms image-based model for hernia recurrence following abdominal wall reconstruction with long-term follow up

2023 SAGES Oral
Open access
Published: 11 June 2024

Volume 38, pages 3984–3991, (2024)
Cite this article

Download PDF

You have full access to this open access article

Surgical Endoscopy Aims and scope Submit manuscript

Deep learning model utilizing clinical data alone outperforms image-based model for hernia recurrence following abdominal wall reconstruction with long-term follow up

Download PDF

Hadley H. Wilson ORCID: orcid.org/0009-0009-4603-0460¹,
Chiyu Ma²,
Dau Ku¹,
Gregory T. Scarola¹,
Vedra A. Augenstein¹,
Paul D. Colavita¹ &
…
B. Todd Heniford¹

497 Accesses
1 Altmetric
Explore all metrics

Abstract

Background

Deep learning models (DLMs) using preoperative computed tomography (CT) imaging have shown promise in predicting outcomes following abdominal wall reconstruction (AWR), including component separation, wound complications, and pulmonary failure. This study aimed to apply these methods in predicting hernia recurrence and to evaluate if incorporating additional clinical data would improve the DLM’s predictive ability.

Methods

Patients were identified from a prospectively maintained single-institution database. Those who underwent AWR with available preoperative CTs were included, and those with < 18 months of follow up were excluded. Patients were separated into a training (80%) set and a testing (20%) set. A DLM was trained on the images only, and another DLM was trained on demographics only: age, sex, BMI, diabetes, and history of tobacco use. A mixed-value DLM incorporated data from both. The DLMs were evaluated by the area under the curve (AUC) in predicting recurrence.

Results

The models evaluated data from 190 AWR patients with a 14.7% recurrence rate after an average follow up of more than 7 years (mean ± SD: 86 ± 39 months; median [Q1, Q3]: 85.4 [56.1, 113.1]). Patients had a mean age of 57.5 ± 12.3 years and were majority (65.8%) female with a BMI of 34.2 ± 7.9 kg/m². There were 28.9% with diabetes and 16.8% with a history of tobacco use. The AUCs for the imaging DLM, clinical DLM, and combined DLM were 0.500, 0.667, and 0.604, respectively.

Conclusions

The clinical-only DLM outperformed both the image-only DLM and the mixed-value DLM in predicting recurrence. While all three models were poorly predictive of recurrence, the clinical-only DLM was the most predictive. These findings may indicate that imaging characteristics are not as useful for predicting recurrence as they have been for other AWR outcomes. Further research should focus on understanding the imaging characteristics that are identified by these DLMs and expanding the demographic information incorporated in the clinical-only DLM to further enhance the predictive ability of this model.

Clinical and radiomics feature-based outcome analysis in lumbar disc herniation surgery

Article Open access 06 October 2023

Optimal computed tomography-based biomarkers for prediction of incisional hernia formation

Article 07 September 2023

The enigma of incisional hernia prediction unraveled: external validation of a prognostic model in colorectal cancer patients

Article 16 January 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Hernia recurrence has been the traditional benchmark of success for elective hernia repair. A recurrence negatively impacts quality of life postoperatively, causes patients to question whether their surgery was necessary or adequate, and leaves the need for an additional operation [1, 2]. Patients presenting with a recurrence tend to be more complex, leading to a greater chance of complications and further recurrence when an additional operation is performed [3, 4]. Therefore, a recurrence after abdominal wall reconstruction (AWR) can lead to a “vicious cycle” of complications and further recurrences [5, 6]. These complications are burdensome to the healthcare system. It has been estimated that over 500,000 ventral hernia repairs (VHRs) are now being performed in the United States annually, representing health care costs exceeding $3 billion. Of those total numbers, patients with a recurrence have been shown to account for more than 20% of those undergoing incisional hernia repair [7, 8]. Of late, research in hernia surgery has shifted to quality of life and other important metrics. But, given the frequency, cost and psychosocial impact of a failed hernia surgery, many randomized controlled trials in the field continue to focus on hernia recurrence as their primary outcome [9,10,11,12,13].

Predicting patients who are at increased risk for hernia recurrence preoperatively may help to guide management strategy and improve shared decision-making with AWR patients. For example, greater emphasis may be placed on preoptimization – usually centered around smoking cessation, glycemic control, and weight loss – in high-risk patients prior to surgery, or nonoperative management may be considered in those at especially high risk [9]. Additionally, these factors might suggest referral to a tertiary AWR surgeon and facility. A recent meta-analysis identified 22 different predictors of recurrence [14]. Synthesizing this amount of patient information and translating it into a meaningful risk calculation is a daunting task for a preoperative clinic visit. Various risk stratification tools for AWR have thus been developed in an attempt to streamline this process. Unfortunately, many of these tools have considerable limitations as they do not assess the risk of recurrence specifically, are still overly cumbersome, or lack external validity [15]. There remains a need for a predictive tool for recurrence available to AWR surgeons.

Deep learning has the ability to efficiently analyze complex patient data and generate an accurate prediction of surgical outcomes. Briefly, deep learning is a subcategory of artificial intelligence (AI), a field of computer science in which computer systems mimic human cognitive function [16]. The authors have previously reported on a deep learning model (DLM) that was able to predict the need for component separation in AWR, outperforming a panel of expert AWR surgeons in the same task [17]. DLMs have also been shown to accurately predict postoperative outcomes including surgical site infection, mesh infection, and pulmonary failure [17, 18]. Interestingly, these DLMs have been able to make these predictions based solely on the patients’ preoperative computed tomography (CT) images. Recently, Hassan et al. also reported on the use of AI to predict recurrence, complications, and 30-day readmission following AWR. Their model did not incorporate preoperative imaging, but rather a number of clinical variables, to make predictions [19]. Given the previous success of CT image-based DLMs to predict surgical outcomes after AWR, the goal of this study was to develop a model that could predict hernia recurrence. The authors further hypothesized that incorporating clinical data into the DLM would enhance the predictive ability of the image-based model.

Materials and methods

Study population and design

Institutional review board approval was obtained prior to conducting this study. Patients were identified from a prospectively maintained database at a tertiary hernia referral center. Patients were included who underwent AWR and had preoperative CT imaging of their hernia available. The CT images had to be within 1 year prior to their operation to meet the inclusion criteria, and only images containing the hernia defect were included. Patients were excluded if there were missing images or if there was significant distortion of their CT images (for example, from an orthopedic prosthesis). Other exclusion criteria included age < 18 years old, undergoing an emergent operation, or having follow up < 18 months. This follow-up cutoff was based on previous data showing that < 50% of incisional hernia recurrences are captured within 1 year of follow up, but a majority are captured after 1–2 years of follow up [8]. Preoperative and operative characteristics and postoperative recurrence data were collected. Hernia defect size was calculated as a surface area as width x length based on measurements reported in the operating surgeon’s operative note. Recurrence was determined by physical exam documented by a provider.

Preoperative CT images were deidentified and prepared using the TeraRecon software (TeraRecon, Inc., Durham, NC). Axial slices of the abdomen that contained the hernia were included. This methodology meant that some patients would have a greater number of image slices, depending on the hernia defect size. The slices were 3–5 mm in thickness and images were standardized to 150 × 150 pixels.

All operations were performed by specialty-trained AWR surgeons at a single high-volume center. AWR refers to the practice of performing hernia repairs with the goal of restoring the structure and function of the abdominal wall [20]. The practice of these surgeons in terms of preoperative optimization and operative technique is similar and has been described previously [21]. Patients who are smoking are required to quit at least 4 weeks prior to surgery, and a preoperative urine cotinine test is used to confirm adherence to this requirement. An A1c of 7.2 or less is targeted for patients with diabetes. Appropriate counseling and referral are provided to assist patients in meeting these goals. There is not a strict cutoff for body mass index (BMI), but generally patients with BMI > 35 kg/m² are counseled to lose weight before surgery is performed. Instruction in a ketogenic diet is provided, and exercise is encouraged. Once patients are optimized for surgery, AWR is usually performed with an open preperitoneal approach as was done in the vast majority of patients in this study. Patients are given preoperative antibiotic and venous thromboembolism prophylaxis, and for an open operation a midline incision is performed. The hernia contents are reduced, and lysis of adhesions is performed as necessary to remove any adhesions to the anterior abdominal wall. Whenever possible, thorough dissection of the preperitoneal space is then accomplished to allow for placement of a large mesh into this space with a wide overlap of the mesh beyond the hernia defect 5–10 cm in all directions. Generally, a midweight polypropylene mesh is used in clean and clean-contaminated cases unless the patient is at higher risk for developing or not being able to tolerate a mesh-related complication, such as transplant or immunocompromised patients [22]. In contaminated and dirty cases, we preferentially use biologic mesh as our data suggests less mesh-related complications in these settings [23]. The mesh is secured with transfascial suture fixation to the anterior abdominal wall. The peritoneum is closed prior to mesh placement, and the fascia is closed over the mesh. It is the goal to achieve fascial closure when possible, and a component separation, either a transversus abdominis release or an external oblique release, is performed when necessary. To assess for tension and the need for a component separation, Kocher clamps are placed on the anterior fascia and pulled together. If the closure is felt to be on tension, the posterior rectus sheath is first released. If there is still felt to be tension on the fascial closure and the space needed for the fascia to come together is 6 cm or less, a transversus abdominis release is performed. If there is need for a greater release, then an external oblique release is performed with an effort to spare the periumbilical perforator vessels. The midline incision is typically closed with absorbable deep dermal sutures, staples, and an incisional negative pressure dressing. Of note, there were a small subset of patients in this study who had their surgery performed laparoscopically due to surgeon preference. An intraperitoneal underlay mesh was placed in these cases. One patient had significant intraabdominal adhesions requiring a conversion to an open incision for completion of lysis of adhesions. In this specific case, an intraperitoneal underlay mesh was placed laparoscopically after the fascia was closed.

Model development

A trained computer scientist developed the initial DLM based on preoperative CT images and a binary outcome (yes/no) of recurrence following AWR. Another model was developed to train on basic patient data: age, sex, history of tobacco use, history of diabetes, and BMI. The two models were then integrated to create the mixed-value model. Our prior experience with blending imaging and objective data has led to overfitting, or overlearning, of DLMs so that they are unable to consider further variations of information [18]. Thus, there is a fine line where including too many variables may render a model impractical, so the rationale was to use limited and relatively basic clinical variables, readily available for almost any patient, to build the model. Patients were randomized into a training set (80%) and a testing set (20%). The DLMs were blinded to the test set until internal validation was performed. The DLMs used an Adam optimizer with a learning rate of 0.1 and binary cross entropy loss.

In previous image-based DLMs for AWR outcomes, every image slice containing the hernia defect has been used as an input [17, 18]. Initially, the same strategy was used here, but there was a substantial amount of noise introduced by using every image, making it impossible to construct a reliable model. In collaboration with our data scientists, we determined that not every slice would capture the representative features of a hernia. In an effort to reduce irrelevant information contained in the images and focus on the relevant hernia characteristics, another strategy was used in this study by instead using a frame averaging technique, accomplished by the following algorithm: Output = { $\frac{1}{\text{N}}*{\sum }_{n=1}^{N}pixel\left(\text{n},\text{x},\text{y}\right)$|all pixels in the image set} with x and y representing the x, y coordinates of the pixel in a given image, and n representing the image number within the set. A single averaged image for each patient was produced for the DLMs to predict recurrence.

The image-only DLM was designed as a convolutional neural network (CNN) with two convolutional layers and one linear layer that trained with the averaged images. The image data was passed through the two convolutional layers with a window size of 5 × 5. The output embedding from the convolutional layers was passed to a linear layer and a dropout layer with a probability of 0.3 to train a node within the layer. The final output was passed to a sigmoid activation function with a logit of ≥ 0.5 indicating a prediction of recurrence and < 0.5 predicting no recurrence.

The clinical-only DLM was designed as a five-layer feedforward neural network (FNN) that trained on the clinical data alone. The patient characteristics passed through two batch norm layers followed by a dropout layer with a probability of 0.2. A logit was similarly returned predicting whether a recurrence would occur.

Finally, the mixed-value DLM was designed as another FNN, incorporating the logits returned by the image-only and the clinical-only DLMs. The logits were concatenated by an interpolation network and fit into two linear layers, producing a single logit used to determine the prediction of recurrence.

Statistical analysis

Statistical analyses were performed by a trained statistician using the Python Software Foundation (Python Language Reference, version 2.7) and SAS program version 9.4 (SAS, Cary, NC, USA). Categorical variables were reported as frequencies and percentages. For categorical variables with missing data, the patients with missing data were considered a “no” for the purposes of reporting summary statistics so that the reported frequencies/percentages were only those that were confirmed to be a “yes.” Continuous variables were reported as the mean ± standard deviation. Comparisons of preoperative and operative characteristics were performed using the Kruskal–Wallis test or Student’s t-test for continuous variables and the Pearson χ² or Fisher’s exact test for categorical variables, where appropriate. A two-tailed statistical significance was set at p < 0.05 before data collection. The primary outcome was the ability of the DLMs to differentiate between patients who did or did not have a hernia recurrence over the specified follow up period by the area under the curve (AUC) of the receiver operating characteristic (ROC) curve. As previously described, an AUC ≥ 0.7 was considered the threshold to be considered a predictive model [24].

Results

There were 190 patients included in this study. Preoperative and operative characteristics are reported in Table 1. Of the patient characteristics included in the models, there were 125 (65.8%) females, and patients had an average age of 57.5 ± 12.3 years old. Patients had an average BMI of 34.2 ± 7.9 kg/m², and there were 53 (27.9%) with a history of diabetes and 35 (18.4%) with a history of tobacco use. A majority of patients (54.7%) presented with a recurrent hernia, and the average number of previous hernia surgeries was 2.1 ± 1.5. These were on average very large hernias with a defect size of 177.5 ± 183.7 cm². Mesh was placed in 93.2%, and 95.8% had an open procedure performed. The fascial defect was completely closed in 91.1% (1.1% of patients missing data), and a component separation was required in 26.3% (2.6% of patients missing data). Postoperatively, 28 (14.7%) experienced a recurrence with a median follow up of more than 7 years (85.4 [56.1, 113.1] months). For the most part, patients who had a recurrence did not have statistically significant differences in their preoperative and operative characteristics from those who did not (Table 1). The exception was that there was a statistically significant higher proportion of patients repaired laparoscopically in the group with recurrences (14.3% vs 1.9%, p = 0.005). There were no statistically significant differences in characteristics for the patients in the training set and testing sets (Table 1). Of the 28 total patients with a recurrence, there were 22 (14.5%) in the training set and 6 (15.8%) in the testing set (p = 0.838).

Table 1 Preoperative and operative characteristics

Full size table

Deep learning models for recurrence

The image-only DLM was not found to be a discriminatory model with an AUC of 0.500 (Fig. 1). The best performance for the clinical-only DLM was achieved when sex was excluded, so the other four variables – age, BMI, history of diabetes, and history of tobacco use – were included. This model had a training accuracy of 0.877 and a validation accuracy of 0.897 with an AUC of 0.667, outperforming the image-only model (Fig. 2). These models were then incorporated into the final mixed-value DLM that had a training accuracy of 0.875 and a validation accuracy of 0.897, but had a lower AUC of 0.604 (Fig. 3).

Discussion

In this study, the DLM using preoperative CT images only was not predictive of hernia recurrence after AWR. The AUC of the image-only model indicated that the model was equally successful as randomly guessing. This result differed quite a bit from the prior successes of image-based DLMs in predicting other outcomes of AWR, including component separation, surgical site infection, pulmonary failure, and mesh infection [17, 18]. The hypothesis that incorporating basic clinical data into the model would improve the predictive ability of the image-based model was correct, but the final mixed-value DLM was not found to be very predictive of recurrence either. Most interestingly, the model that performed best was the DLM that utilized clinical data alone. Although this would not be considered a discriminatory model based on traditional standards, the results were nonetheless impressive given the limited amount of clinical data on which the model was given to train. In fact, the clinical-only DLM was intended only to enhance the image-based DLM and was not expected to supersede it. These findings are valuable additions to the burgeoning field of AI in the prediction of surgical outcomes. The field is still in its infancy, and it has been stressed in the literature that these models must be rigorously analyzed prior to clinical implementation [16]. Results such as these help to guide the way forward without being overly optimistic about the application of AI in augmenting surgical decision-making.

Much of the previous work with image-based DLMs to predict AWR outcomes was constructed on the rationale that specifically defined features from preoperative CT scans could be indicative of certain outcomes. For example, hernia defect size and abdominal wall thickness, measured by CT, have previously been shown to predict the need for component separation and postoperative wound complications [25]. Taking this one step further, studies by Schlosser et al. used CT volumetric analysis to predict component separation, wound complications, and respiratory insufficiency [26, 27]. Similarly, preoperative CT measurements have been shown to be predictive of achieving fascial closure during AWR [28, 29]. Some of the main drawbacks to these methods are that obtaining these measurements can be subjective, user-dependent, and time-intensive, and they typically involve specialized software and clinical expertise to define. Therefore, they do not overcome the concern of being overly cumbersome that is common to other risk stratification tools, and may be more cumbersome. This has been the purported strength of image-based DLMs: the computer learns to extract the imaging features on its own to simplify the process for clinicians. It can be postulated that previous successful DLMs have identified features from imaging that are similar to those already identified in the literature, but this has not been demonstrated. Extracting and interpreting the complex associations built by a DLM remains an extremely challenging task and is an active area of research [30].

There are a few possible reasons that the image-only DLM performed poorly in predicting hernia recurrence. Within the line of thinking that the computer is indeed “seeing” similar CT features to those that have already been studied, such as hernia dimensions, it should be acknowledged that the relationship of these features to hernia recurrence have not been well-demonstrated. In a previous study, Ballem et al. did show that a larger defect size was a risk factor for recurrence, but this finding has not held true in other studies, and a more recent meta-analysis did not show hernia dimensions to be independently predictive of recurrence [14, 31]. Additionally, another characteristic that could intuitively be extracted by the image-based DLMs is the hernia location. In particular, European Hernia Society class M1, or subxiphoid, hernias have been shown to have lower rates of tension-free fascial closure [29, 32]. However, the meta-analysis by Parker et al. did not show hernia location to be a risk factor for recurrence either [14]. The results of the present study serve as further evidence that the associations made by image-based DLMs for AWR may be fairly straightforward, aligning with hernia characteristics that have been identified in other studies. Thus, AI may perform well to predict outcomes from preoperative imaging only when CT-defined features have already been linked to these outcomes, but in this study, it did not show the ability to identify unseen characteristics to make accurate predictions.

It is also possible there were not enough instances of recurrence on which the model could train. There were 28 patients total with a recurrence; 22 were in the training set and 6 were in the testing set. In fact, the initial image-based DLM for pulmonary failure was thought to be unsuccessful for a similar reason [17]. It had used a comparable number of 29 patients with pulmonary failure, including 23 in the training set and 6 in the testing set. Initially, this had produced an unsatisfactory AUC of 0.545. The follow-up study by Ayuso et al. improved upon this model, generating an AUC of 0.70, by several different methods [18]. First, the number of patients in the total dataset was increased from 369 to 510. There were no additional positive instances of pulmonary failure added to the dataset, only patients without this outcome. Also, the raw number of patients with pulmonary failure in the training and testing sets stayed the same. Next, the model was trained in a different way, only training on the vast majority of patients who did not have pulmonary failure and learning to identify the patients who were abnormal, a strategy known as anomaly detection. At face value, this methodology would be very helpful for improving the current image-based DLM for recurrence. The main challenge is that while pulmonary failure is an outcome studied in the short term following AWR, hernia recurrence is a long-term outcome. It has been estimated that approaching the actual recurrence rate of incisional hernia repairs requires at least 10 years of follow up, a mark that very few hernia studies have achieved [8]. A strength of this study was the relatively long-term follow up, but excluding patients with shorter-term follow up also limits the ability of collecting a larger group of negative instances for the model to train on, as was done to improve the pulmonary failure model. In other words, including many patients who had short-term follow up and were considered as negative for recurrence, but who may have gone on to develop a recurrence later on, would likely introduce further variability and confusion to the model and limit its applicability to the clinical setting. A future direction for this work could be to establish a multicenter dataset that would be able to overcome these challenges, adding enough patients with preoperative imaging and long-term follow up to build a more robust image-based DLM for recurrence.

Perhaps the most unanticipated finding from this study was the predictive ability of the clinical-only DLM. In the end, the model only used four pieces of information – age, BMI, history of diabetes, and history of tobacco use – to predict hernia recurrence and far outperformed the image-only DLM as well as the mixed-value DLM. Notably, all four of these demographics have been shown to be risk factors for recurrence on meta-analysis [14]. The surprising part is that there was no statistical difference between these variables in patients with or without a recurrence in the present study. With this sample size it is also possible that this finding may represent a type II error, but this observation could highlight the ability of DLMs to find complex associations between input variables that standard statistical methods fail to identify [16]. Another recent study by Hassan et al. also used AI, analyzing clinical data only, to predict recurrence as well as surgical site occurrences and 30-day readmissions [19]. Their models outperformed traditional multivariable logistic regression models in predicting these outcomes. Furthermore, Holihan et al. described the flaws in several predictive models they created for ventral hernia recurrence that were built with regression analysis, showing that they all performed poorly on external validation [33]. The unexpected success of the clinical-only DLM in this study further establishes the advantages of AI over conventional statistical models that may fail to identify the complex, nonlinear associations between data.

To summarize, this study reports on the comparative success of three different DLMs in predicting hernia recurrence following AWR. While all three models were poorly predictive of recurrence, the clinical-only DLM was the most predictive. The image-only DLM in this study showed no ability to discriminate between patients who would or would not develop a recurrence based on their preoperative CT imaging. A mixed-value DLM incorporating image data and clinical data also performed poorly. In the context of multiple successful image-based DLMs predicting AWR outcomes that have been published previously, these results may reflect the limitations of image-based deep learning for predicting recurrence specifically or, more generally, the difficulties and complexities of accurately studying this outcome in AWR. On the other hand, the predictive ability of the DLM using only very few clinical data was encouraging, and building on this model with additional demographic information is a worthy direction for future work.

References

Ciomperlik H, Dhanani NH, Cassata N, Mohr C, Bernardi K, Holihan JL, Lyons N, Olavarria O, Ko TC, Liang MK (2021) Patient quality of life before and after ventral hernia repair. Surgery 169:1158–1163. https://doi.org/10.1016/j.surg.2020.11.003
Article PubMed Google Scholar
Cox TC, Huntington CR, Blair LJ, Prasad T, Lincourt AE, Heniford BT, Augenstein VA (2016) Predictive modeling for chronic pain after ventral hernia repair. Am J Surg 212:501–510. https://doi.org/10.1016/j.amjsurg.2016.02.021
Article PubMed Google Scholar
Shao JM, Deerenberg EB, Elhage SA, Prasad T, Davis BR, Kercher KW, Colavita PD, Augenstein VA, Heniford BT (2021) Recurrent incisional hernia repairs at a tertiary hernia center: are outcomes really inferior to initial repairs? Surgery 169:580–585. https://doi.org/10.1016/j.surg.2020.10.009
Article PubMed Google Scholar
Heniford BT, Ross SW, Wormer BA, Walters AL, Lincourt AE, Colavita PD, Kercher KW, Augenstein VA (2020) Preperitoneal ventral hernia repair. Ann Surg 271:364–374. https://doi.org/10.1097/SLA.0000000000002966
Article PubMed Google Scholar
Flum DR, Horvath K, Koepsell T (2003) Have outcomes of incisional hernia repair improved with time? Ann Surg 237:129–135. https://doi.org/10.1097/00000658-200301000-00018
Article PubMed PubMed Central Google Scholar
Holihan JL, Alawadi Z, Martindale RG, Roth SJ, Wray CJ, Ko TC, Kao LS, Liang MK (2015) Adverse events after ventral hernia repair: the vicious cycle of complications. J Am Coll Surg 221:478–485. https://doi.org/10.1016/j.jamcollsurg.2015.04.026
Article PubMed Google Scholar
Poulose BK, Shelton J, Phillips S, Moore D, Nealon W, Penson D, Beck W, Holzman MD (2012) Epidemiology and cost of ventral hernia repair: making the case for hernia research. Hernia 16:179–183. https://doi.org/10.1007/s10029-011-0879-9
Article CAS PubMed Google Scholar
Köckerling F, Koch A, Lorenz R, Schug-Pass C, Stechemesser B, Reinpold W (2015) How long do we need to follow-up our hernia patients to find the real recurrence rate? Front Surg. https://doi.org/10.3389/fsurg.2015.00024
Article PubMed PubMed Central Google Scholar
Liang MK, Holihan JL, Itani K, Alawadi ZM, Gonzalez JRF, Askenasy EP, Ballecer C, Sen CH, Goldblatt MI, Greenberg JA, Harvin JA, Keith JN, Martindale RG, Orenstein S, Richmond B, Roth JS, Szotek P, Towfigh S, Tsuda S, Vaziri K, Berger DH (2017) Ventral hernia management. Ann Surg 265:80–89. https://doi.org/10.1097/SLA.0000000000001701
Article PubMed Google Scholar
Rosen MJ, Krpata DM, Petro CC, Carbonell A, Warren J, Poulose BK, Costanzo A, Tu C, Blatnik J, Prabhu AS (2022) Biologic vs synthetic mesh for single-stage repair of contaminated ventral hernias. JAMA Surg 157:293. https://doi.org/10.1001/jamasurg.2021.6902
Article PubMed PubMed Central Google Scholar
Tryliskyy Y, Wong CS, Demykhova I, Tyselskyi V, Kebkalo A, Poylin V, Pournaras DJ (2022) Fascial defect closure versus bridged repair in laparoscopic ventral hernia mesh repair: a systematic review and meta-analysis of randomized controlled trials. Hernia 26:1473–1481. https://doi.org/10.1007/s10029-021-02533-2
Article CAS PubMed Google Scholar
Olavarria OA, Bernardi K, Dhanani NH, Lyons NB, Harvin JA, Millas SG, Ko TC, Kao LS, Liang MK (2021) Synthetic versus biologic mesh for complex open ventral hernia repair: a pilot randomized controlled trial. Surg Infect (Larchmt) 22:496–503. https://doi.org/10.1089/sur.2020.166
Article PubMed Google Scholar
Demetrashvili Z, Pipia I, Loladze D, Metreveli T, Ekaladze E, Kenchadze G, Khutsishvili K (2017) Open retromuscular mesh repair versus onlay technique of incisional hernia: a randomized controlled trial. Int J Surg 37:65–70. https://doi.org/10.1016/j.ijsu.2016.12.008
Article PubMed Google Scholar
Parker SG, Mallett S, Quinn L, Wood CPJ, Boulton RW, Jamshaid S, Erotocritou M, Gowda S, Collier W, Plumb AAO, Windsor ACJ, Archer L, Halligan S (2021) Identifying predictors of ventral hernia recurrence: systematic review and meta-analysis. BJS Open. https://doi.org/10.1093/bjsopen/zraa071
Article PubMed PubMed Central Google Scholar
Bernardi K, Adrales GL, Hope WW, Keith J, Kuhlens H, Martindale RG, Melin AA, Orenstein SB, Roth JS, Shah SK, Tsuda S, Liang MK (2018) Abdominal wall reconstruction risk stratification tools: a systematic review of the literature. Plast Reconstr Surg 142:9S-20S. https://doi.org/10.1097/PRS.0000000000004833
Article CAS PubMed Google Scholar
Loftus TJ, Tighe PJ, Filiberto AC, Efron PA, Brakenridge SC, Mohr AM, Rashidi P, Upchurch GR, Bihorac A (2020) Artificial intelligence and surgical decision-making. JAMA Surg 155:148. https://doi.org/10.1001/jamasurg.2019.4917
Article PubMed PubMed Central Google Scholar
Elhage SA, Deerenberg EB, Ayuso SA, Murphy KJ, Shao JM, Kercher KW, Smart NJ, Fischer JP, Augenstein VA, Colavita PD, Heniford BT (2021) Development and validation of image-based deep learning models to predict surgical complexity and complications in abdominal wall reconstruction. JAMA Surg 156:933. https://doi.org/10.1001/jamasurg.2021.3012
Article PubMed PubMed Central Google Scholar
Ayuso SA, Elhage SA, Zhang Y, Aladegbami BG, Gersin KS, Fischer JP, Augenstein VA, Colavita PD, Heniford BT (2023) Predicting rare outcomes in abdominal wall reconstruction using image-based deep learning models. Surgery 173:748–755. https://doi.org/10.1016/j.surg.2022.06.048
Article PubMed Google Scholar
Hassan AM, Lu S-C, Asaad M, Liu J, Offodile AC, Sidey-Gibbons C, Butler CE (2022) Novel machine learning approach for the prediction of hernia recurrence, surgical complication, and 30-day readmission after abdominal wall reconstruction. J Am Coll Surg 234:918–927. https://doi.org/10.1097/XCS.0000000000000141
Article PubMed Google Scholar
Hope WW, Abdul W, Winters R (2023) Abdominal Wall Reconstruction. In: StatPearls. StatPearls, Treasure Island
Google Scholar
Katzen MM, Kercher KW, Sacco JM, Ku D, Scarola GT, Davis BR, Colavita PD, Augenstein VA, Heniford BT (2023) Open preperitoneal ventral hernia repair: prospective observational study of quality improvement outcomes over 18 years and 1,842 patients. Surgery 173:739–747. https://doi.org/10.1016/j.surg.2022.07.042
Article PubMed Google Scholar
Shao JM, Ayuso SA, Deerenberg EB, Elhage SA, Prasad T, Colavita PD, Augenstein VA, Heniford BT (2022) Biologic mesh is non-inferior to synthetic mesh in CDC class 1 & 2 open abdominal wall reconstruction. Am J Surg 223:375–379. https://doi.org/10.1016/j.amjsurg.2021.05.019
Article PubMed Google Scholar
Katzen M, Ayuso SA, Sacco J, Ku D, Scarola GT, Kercher KW, Colavita PD, Augenstein VA, Heniford BT (2023) Outcomes of biologic versus synthetic mesh in CDC class 3 and 4 open abdominal wall reconstruction. Surg Endosc 37:3073–3083. https://doi.org/10.1007/s00464-022-09486-2
Article PubMed Google Scholar
Carter JV, Pan J, Rai SN, Galandiuk S (2016) ROC-ing along: evaluation and interpretation of receiver operating characteristic curves. Surgery 159:1638–1645. https://doi.org/10.1016/j.surg.2015.12.029
Article PubMed Google Scholar
Blair LJ, Ross SW, Huntington CR, Watkins JD, Prasad T, Lincourt AE, Augenstein VA, Heniford BT (2015) Computed tomographic measurements predict component separation in ventral hernia repair. J Surg Res 199:420–427. https://doi.org/10.1016/j.jss.2015.06.033
Article PubMed Google Scholar
Schlosser KA, Maloney SR, Prasad T, Colavita PD, Augenstein VA, Heniford BT (2020) Three-dimensional hernia analysis: the impact of size on surgical outcomes. Surg Endosc 34:1795–1801. https://doi.org/10.1007/s00464-019-06931-7
Article PubMed Google Scholar
Schlosser KA, Maloney SR, Prasad T, Colavita PD, Augenstein VA, Heniford BT (2020) Too big to breathe: predictors of respiratory failure and insufficiency after open ventral hernia repair. Surg Endosc 34:4131–4139. https://doi.org/10.1007/s00464-019-07181-3
Article PubMed Google Scholar
Love MW, Warren JA, Davis S, Ewing JA, Hall AM, Cobb WS, Carbonell AM (2021) Computed tomography imaging in ventral hernia repair: can we predict the need for myofascial release? Hernia 25:471–477. https://doi.org/10.1007/s10029-020-02181-y
Article CAS PubMed Google Scholar
Al-Mansour MR, Wu J, Gagnon G, Knee A, Romanelli JR, Seymour NE (2021) Linear versus volumetric CT analysis in predicting tension-free fascial closure in abdominal wall reconstruction. Hernia 25:91–98. https://doi.org/10.1007/s10029-020-02349-6
Article CAS PubMed Google Scholar
Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP (2020) Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol 9:14. https://doi.org/10.1167/tvst.9.2.14
Article PubMed PubMed Central Google Scholar
Ballem N, Parikh R, Berber E, Siperstein A (2008) Laparoscopic versus open ventral hernia repairs: 5 year recurrence rates. Surg Endosc 22:1935–1940. https://doi.org/10.1007/s00464-008-9981-1
Article PubMed Google Scholar
Muysoms FE, Miserez M, Berrevoet F, Campanelli G, Champault GG, Chelala E, Dietz UA, Eker HH, El Nakadi I, Hauters P, Hidalgo Pascual M, Hoeferlin A, Klinge U, Montgomery A, Simmermacher RKJ, Simons MP, Śmietański M, Sommeling C, Tollens T, Vierendeels T, Kingsnorth A (2009) Classification of primary and incisional abdominal wall hernias. Hernia 13:407–414. https://doi.org/10.1007/s10029-009-0518-x
Article CAS PubMed PubMed Central Google Scholar
Holihan JL, Li LT, Askenasy EP, Greenberg JA, Keith JN, Martindale RG, Roth JS, Liang MK (2016) Analysis of model development strategies: predicting ventral hernia recurrence. J Surg Res 206:159–167. https://doi.org/10.1016/j.jss.2016.07.042
Article PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank Mr. Rahmatulla Tawkaliyar and Ms. Kiara Brown for their assistance with data collection for this study.

Funding

Open access funding provided by the Carolinas Consortium. This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Division of Gastrointestinal and Minimally Invasive Surgery, Department of Surgery, Carolinas Medical Center, 1025 Morehead Medical Drive Suite 300, Charlotte, NC, 28204, USA
Hadley H. Wilson, Dau Ku, Gregory T. Scarola, Vedra A. Augenstein, Paul D. Colavita & B. Todd Heniford
Department of Statistical Science, Duke University, Durham, NC, USA
Chiyu Ma

Authors

Hadley H. Wilson
View author publications
You can also search for this author in PubMed Google Scholar
Chiyu Ma
View author publications
You can also search for this author in PubMed Google Scholar
Dau Ku
View author publications
You can also search for this author in PubMed Google Scholar
Gregory T. Scarola
View author publications
You can also search for this author in PubMed Google Scholar
Vedra A. Augenstein
View author publications
You can also search for this author in PubMed Google Scholar
Paul D. Colavita
View author publications
You can also search for this author in PubMed Google Scholar
B. Todd Heniford
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to B. Todd Heniford.

Ethics declarations

Disclosures

Paul Colavita has an investigator-initiated research grant with Medtronic. Vedra Augenstein receives speaking honoraria from Medtronic, Allergan, Intuitive, Acelity, and Bard. Todd Heniford receives surgical research and education grants and speaking honoraria from Allergan and WL Gore. Hadley Wilson, Chiyu Ma, Dau Ku, and Gregory Scarola have no disclosures to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wilson, H.H., Ma, C., Ku, D. et al. Deep learning model utilizing clinical data alone outperforms image-based model for hernia recurrence following abdominal wall reconstruction with long-term follow up. Surg Endosc 38, 3984–3991 (2024). https://doi.org/10.1007/s00464-024-10980-y

Download citation

Received: 13 April 2023
Accepted: 02 June 2024
Published: 11 June 2024
Issue Date: July 2024
DOI: https://doi.org/10.1007/s00464-024-10980-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Deep learning model utilizing clinical data alone outperforms image-based model for hernia recurrence following abdominal wall reconstruction with long-term follow up