Introduction

Technological innovations in optics and robotics help to support and improve the surgeon’s eyes and hands, and yet adverse surgical events remain an unsolved problem1,2,3. According to a report on human performance errors during surgery, nearly 30% of surgical complications are caused by misrecognition (MR) during operations4. Fatigue from long shifts or prolonged surgeries reduces surgeons’ cognitive ability and performance5, and surgeons can also experience fluctuations in cognition and attention depending on their physical and mental condition. Moreover, inexperienced surgeons tend to have insufficient anatomical knowledge and technique for managing intraoperative events such as bleeding6. Further technological innovations that can support the “surgeon’s brain” may help to improve surgical outcomes.

Performing surgery is not unlike driving a vehicle, in that decisions and maneuvers must be made based on visual information. In recent years, the technologies behind autonomous driving systems that use artificial intelligence (AI), especially deep-learning algorithms, have progressed rapidly7. One benefit AI offers in that field is a reduction in traffic accidents caused by human error, achieved by predicting safe driving lanes based on the recognition of obstacles such as other vehicles, traffic lights, road signs, and pedestrians. For example, the Japanese automobile manufacturer Subaru has reported that its driver assistance system has reduced car accidents by 60%8. Applying similar technologies to surgery could be a way to support surgeons’ experience and skills, mitigating the fluctuations in cognition and attention caused by their physical and mental condition while operating9.

To continue the analogy, in gastrointestinal cancer surgery the “driving lane” is the dissection plane, referred to as the “Holy Plane” in total mesorectal excision10 and considered a common anatomical feature in colonic11,12, esophageal13,14, and gastric surgery15,16. The dissection plane is an avascular space consisting of loose connective tissue fibers (LCTFs) that appears when expanded by optimal countertraction12,15,16. Accumulating evidence has revealed that sharp dissection of LCTFs not only improves oncological outcomes but also reduces surgical complications17,18,19. In this study, we explored the use of deep learning in medical image analysis to identify this complex and difficult-to-discern anatomy within the surgical field. We aimed to develop an AI model whose LCTF predictions are highly convincing to expert surgeons and help them visualize safe dissection planes during lymphadenectomy in robot-assisted gastrectomy.

Methods

Video dataset

Videos of robot-assisted surgeries for gastric cancer performed at the Hyogo College of Medicine, Japan, from May 2018 to January 2020 were used to develop and evaluate the AI algorithm. These operations were performed using the da Vinci Xi Surgical System (Intuitive Surgical, Sunnyvale, CA) by board-certified surgeons (H.S., Y.I., Y.K., and T.K.) who were certified as Console Surgeons through da Vinci Surgical System Off-site Training. The recording system (AVCCAM AG-MDR15, Panasonic, Osaka, Japan) produced videos at a frame rate of 30 frames per second (fps). We selected videos that captured suprapancreatic lymph node dissections, because this operative step is well formalized and relatively uncomplicated, and its dissection plane is easily visualized. The 33 eligible videos were clipped and downloaded to a hard drive. The videos were then categorized according to use: 20 for training the algorithm, 3 for validation, and 10 for evaluation.
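
For illustration, a minimal sketch of how still frames might be exported from the clipped 30-fps videos is shown below; it assumes OpenCV, and the file paths, sampling interval, and helper name (`extract_frames`) are hypothetical rather than taken from the study.

```python
# Hypothetical frame-extraction sketch using OpenCV (paths and interval are illustrative).
import cv2
from pathlib import Path

def extract_frames(video_path: str, out_dir: str, every_n_frames: int = 30) -> int:
    """Save every n-th frame of a 30-fps surgical video clip as a BMP still."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    cap = cv2.VideoCapture(video_path)
    saved, index = 0, 0
    while True:
        ok, frame = cap.read()          # frame is a BGR image, e.g. 1920 x 1080
        if not ok:
            break
        if index % every_n_frames == 0:
            cv2.imwrite(str(out / f"frame_{index:06d}.bmp"), frame)
            saved += 1
        index += 1
    cap.release()
    return saved

# Example: extract roughly one still per second from a clipped training video.
# extract_frames("training_video_01.mp4", "stills/video_01", every_n_frames=30)
```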

Annotation and deep learning

Still images, including at least 10 with clearly depicted LCTF structures, were extracted from the training videos and saved in BMP format at a resolution of 1920 × 1080 pixels (aspect ratio 16:9). To create the training set, the boundaries of each LCTF were precisely annotated on each frame by two surgeons (N.K. and Y.K.) who have completed a fellowship in gastroenterological surgery and have experience performing more than 100 laparoscopic gastrectomies. The neural network model was based on the U-Net convolutional neural network architecture, which has previously shown promising results in segmentation tasks, particularly for medical images20,21,22. Figure 1a shows our deep learning algorithm, which produces more accurate segmentation maps by extracting object features in the convolution layers while restoring positional information in the deconvolution layers. Model training and inference were performed on a workstation with a Tesla V100 GPU (NVIDIA Corp., Santa Clara, CA) with 32 GB of memory. The LCTF detection threshold was set to 50%. Automated segmentation results were output at around 5 fps, with the LCTF area highlighted in turquoise.
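
Beyond Fig. 1a, the exact layer configuration is not described in the text, so the following is only a minimal PyTorch sketch of a U-Net-style encoder–decoder with skip connections, together with the 50% detection threshold and turquoise highlighting applied at inference; the channel widths, depth, and helper names are illustrative assumptions, not the study’s exact model.

```python
# Minimal U-Net-style sketch in PyTorch (layer widths/depth are illustrative).
import torch
import torch.nn as nn

def conv_block(in_ch, out_ch):
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(out_ch, out_ch, 3, padding=1), nn.ReLU(inplace=True),
    )

class MiniUNet(nn.Module):
    def __init__(self, base=32):
        super().__init__()
        self.enc1 = conv_block(3, base)
        self.enc2 = conv_block(base, base * 2)
        self.pool = nn.MaxPool2d(2)
        self.bottleneck = conv_block(base * 2, base * 4)
        self.up2 = nn.ConvTranspose2d(base * 4, base * 2, 2, stride=2)
        self.dec2 = conv_block(base * 4, base * 2)           # concatenation doubles channels
        self.up1 = nn.ConvTranspose2d(base * 2, base, 2, stride=2)
        self.dec1 = conv_block(base * 2, base)
        self.head = nn.Conv2d(base, 1, 1)                    # single LCTF class

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.pool(e1))
        b = self.bottleneck(self.pool(e2))
        d2 = self.dec2(torch.cat([self.up2(b), e2], dim=1))  # restore positional detail
        d1 = self.dec1(torch.cat([self.up1(d2), e1], dim=1))
        return torch.sigmoid(self.head(d1))                  # per-pixel LCTF probability

def overlay_turquoise(frame_rgb: torch.Tensor, prob_map: torch.Tensor, threshold=0.5):
    """Highlight pixels above the 50% threshold in turquoise (frame: 3xHxW in [0,1])."""
    mask = prob_map > threshold        # prob_map: HxW, squeezed model output
    out = frame_rgb.clone()
    out[0][mask] = 64 / 255            # R
    out[1][mask] = 224 / 255           # G
    out[2][mask] = 208 / 255           # B (turquoise)
    return out
```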

Figure 1

Deep learning algorithm and the AI model developed in this study. (a) The deep-learning architecture implementing U-Net. Conv, convolution; concat, concatenation. (b) Development and performance evaluation of the AI model. MR misrecognition.

Development of the AI model

A prototype AI model was produced in May 2019 using 630 images taken from 11 of the training videos. As Fig. 1b shows, the U-Net deep learning algorithm was developed by augmenting the surgeon-annotated training data. The prototype was then developed into the latest model through more sophisticated annotations and further data augmentation, without changing the U-Net architecture. Performance of the developed AI model was carefully verified using the 3 validation videos, which were kept separate from the training videos. The latest AI model was trained on a total of 1800 images containing more than 20,000 LCTF annotations taken from the 20 training videos.
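
The specific augmentation operations are not described, so the sketch below only illustrates a typical image-and-mask augmentation pipeline (here using the albumentations library); the chosen transforms and parameters are assumptions for illustration.

```python
# Hypothetical augmentation sketch with albumentations; the specific transforms are
# assumptions for illustration, not the augmentations reported in the study.
import albumentations as A
import cv2

augment = A.Compose([
    A.HorizontalFlip(p=0.5),                      # mirror the surgical field
    A.Rotate(limit=15, border_mode=cv2.BORDER_REFLECT, p=0.5),
    A.RandomBrightnessContrast(brightness_limit=0.2, contrast_limit=0.2, p=0.5),
])

def augment_pair(image, mask):
    """Apply identical spatial transforms to a frame and its LCTF annotation mask."""
    out = augment(image=image, mask=mask)
    return out["image"], out["mask"]
```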

Model evaluation by computation

Three engineers (E.R., P.N., and N.K.) randomly sampled 80 frames from the 10 evaluation videos; these frames underwent LCTF prediction using the latest AI model (see Fig. 1b). Two annotators (N.K. and Y.K.) manually segmented the corresponding original frames to create the ground truth. Agreement was quantitatively evaluated by measuring the pixel-wise spatial overlap between the ground truth (i.e., the area concordant with the surgeons’ manual segmentations) and the area predicted by the AI’s automated segmentation, using Recall23 and F1/Dice24,25 scores. These are the most commonly used performance metrics in machine learning for assessing sensitivity and similarity, respectively, and are calculated as

$$\mathrm{Recall}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}$$
$$\mathrm{F1/Dice}=\frac{\mathrm{TP}}{\mathrm{TP}+\frac{1}{2}(\mathrm{FP}+\mathrm{FN})}$$

where TP, FN, and FP respectively represent true positive, false negative, and false positive counts.
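
As a concrete illustration of these definitions, the pixel counts and both metrics can be computed directly from a predicted mask and its ground truth, for example as in the following NumPy sketch.

```python
# Pixel-wise Recall and F1/Dice between a predicted mask and the ground truth (NumPy sketch).
import numpy as np

def recall_and_dice(pred: np.ndarray, truth: np.ndarray, eps: float = 1e-8):
    pred, truth = pred.astype(bool), truth.astype(bool)
    tp = np.logical_and(pred, truth).sum()      # pixels predicted and annotated as LCTF
    fp = np.logical_and(pred, ~truth).sum()     # predicted but not annotated
    fn = np.logical_and(~pred, truth).sum()     # annotated but missed
    recall = tp / (tp + fn + eps)
    dice = tp / (tp + 0.5 * (fp + fn) + eps)    # identical to the F1 score
    return recall, dice
```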

Model evaluation by trained surgeons

In quantitative evaluations such as F1/Dice and Recall, it is difficult for clinicians to interpret the values and judge validity for clinical application, especially for evaluations related to visual or cognitive performance. Therefore, we created a questionnaire, with reference to previous studies26,27,28, to complement the quantitative evaluation. Model performance for 20 of the 80 test frames was also evaluated qualitatively using a two-item questionnaire completed by 20 trained gastrointestinal surgeons (Fig. 1b). Each test frame was sequentially projected onto a high-resolution screen alongside the original frame (Fig. 2), and the evaluators intuitively answered the questionnaire. The first question was: Q1. How sensitive was the AI in recognizing loose connective tissue fibers? The answers were scored for recognition on a 5-point scale in 20% increments (0 for lowest recognition [0%–19%] to 4 for highest recognition [80%–100%]). The mean score for each frame was used as the sensitivity score. The second question was: Q2. How many structures did the AI misrecognize as loose connective tissue fibers? These answers were also scored on a 5-point scale (0 for no MR areas to 4 for 4 or more MR areas). The mean score for each frame was used as the MR score.
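
For clarity, the aggregation of questionnaire responses into per-frame scores can be expressed as in the following sketch; the table layout and column names are hypothetical.

```python
# Hypothetical aggregation of questionnaire responses (column names are illustrative).
import pandas as pd

# responses: one row per evaluator per frame, with Q1 (0-4) and Q2 (0-4) answers.
responses = pd.DataFrame({
    "frame": [1, 1, 2, 2],
    "q1_recognition": [4, 3, 4, 4],
    "q2_misrecognition": [0, 1, 0, 0],
})

per_frame = responses.groupby("frame").agg(
    sensitivity_score=("q1_recognition", "mean"),   # mean Q1 score per frame
    mr_score=("q2_misrecognition", "mean"),         # mean Q2 score per frame
)
print(per_frame)
```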

Figure 2

The questionnaire for qualitative evaluation of the AI’s segmentation performance completed by expert surgeons.

Statistical analysis

The sensitivity and MR scores were plotted as a scatter diagram, and a 95% confidence ellipse was drawn. The correlation between Recall and the sensitivity scores was assessed by calculating the Pearson correlation coefficient. JMP Pro version 15 software (SAS Institute Inc., Cary, NC) was used for statistical analysis.
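
As a hedged illustration of this analysis outside JMP, the Pearson correlation coefficient and a 95% confidence interval (via the Fisher z-transform) could be computed as follows; the variable names are illustrative.

```python
# Hedged sketch of the correlation analysis in Python (the study itself used JMP Pro 15).
import numpy as np
from scipy import stats

def pearson_with_ci(x, y, alpha=0.05):
    """Pearson r with a Fisher z-transform confidence interval."""
    r, _ = stats.pearsonr(x, y)
    n = len(x)
    z = np.arctanh(r)                          # Fisher z-transform
    se = 1.0 / np.sqrt(n - 3)
    zcrit = stats.norm.ppf(1 - alpha / 2)
    lo, hi = np.tanh(z - zcrit * se), np.tanh(z + zcrit * se)
    return r, (lo, hi)

# Example: correlation between per-frame Recall and mean sensitivity scores.
# r, ci = pearson_with_ci(recall_per_frame, sensitivity_per_frame)
```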

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the Hyogo College of Medicine (approval number 3057). All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. All participants provided informed consent before surgery for the video recording of their cases and its use in the study, and all data were completely anonymized.

Results

Figure 3 shows the results of automated segmentation at different stages of deep learning, using the same frame from a validation video. In the prototype model, the AI had already learned to approximately highlight LCTFs among the vast number of image pixels representing other anatomical landmarks (e.g., arteries, lymph nodes, and fat tissue) and surgical instruments. It also discriminated LCTFs from nerves, which have similarly fine, white features. However, when magnified (Fig. 3b), the outline was still ambiguous and there were some undetected or over-detected areas. With more sophisticated annotations and data augmentation, the latest model segmented the LCTFs more sharply and naturally, and recognition failures were markedly reduced (Fig. 3c).

Figure 3

Comparison of segmentation performance at different stages of deep learning. (a) An original frame. CHA common hepatic artery, F fat tissue, LN lymph node; *, nerve. (b) Magnified view of the boxed area in (a), showing the prediction of loose connective tissue fibers (LCTFs), highlighted in turquoise, by the prototype AI model. White circle indicates an area of over-detection. (c) Prediction by the latest AI model. Arrows indicate LCTFs that could not be detected by the prototype AI model.

The Electronic Supplementary Material (Video 1) shows example outputs from the latest AI model alongside the original video. The AI accurately highlighted the LCTFs as soon as the dissection plane appeared under the surgeon’s countertraction. Note that this segmentation was applied to the operative video retrospectively, although it appears in the video as if the surgeon is dissecting while confirming the LCTFs highlighted by the AI.

The mean Recall and F1/Dice scores for the 80 test frames were 0.605 (range 0.230–0.909) and 0.525 (range 0.263–0.712), respectively, showing acceptable sensitivity and similarity between the automated and manual segmentations. Of these 80 test frames, 20 were used for the qualitative evaluation. Table 1 summarizes the performance metrics measured by computation and the qualitative scores assigned by the evaluators for each of these 20 frames. The mean Recall score (0.606, range 0.230–0.861) and mean F1/Dice score (0.549, range 0.335–0.691) were comparable to the results for the full set of 80 test frames. In the qualitative evaluation by surgeons, the mean sensitivity score was 3.52 (range 2.45–3.95), and 88.0% of evaluations gave the highest score of 4, indicating that the evaluators were generally convinced by the LCTF segmentation output from the AI. Furthermore, the mean MR score was low, at 0.14 (range 0–0.7), indicating that the evaluators identified very few MR areas.

Table 1 Performance metrics and qualitative scores in the 20 randomly sampled video frames.

We further analyzed the relation between the performance metrics and the qualitative scores. Figure 4a is a mosaic diagram showing the distribution of all scores for each question assigned by the 20 evaluators to the 20 sampled frames. The most common response (from 52.0% of evaluators) was a score of 4 for Question 1 and 0 for Question 2, followed by a score of 3 for Question 1 and 0 for Question 2 (25.3%). No evaluators scored Question 1 as 1 or lower, and none assigned a score of 3 or higher to Question 2. The scatter plot in Fig. 4b shows the relation between the sensitivity and MR scores for each frame. The sensitivity scores varied somewhat within the high range across samples, but the MR scores were consistently low, so the 95% confidence ellipse converged on the upper-left corner of the plot. Figure 4c shows the relation between the sensitivity scores and Recall. A strong correlation was found between these two sensitivity parameters, with a correlation coefficient of 0.733 (95% CI 0.430–0.887). The regression equation was Y = 2.302 + 2.001X (Y: sensitivity score; X: Recall), suggesting that the surgeon evaluators rated the AI’s predictions more favorably than the computed performance metrics would imply.

Figure 4

Relations between computed performance metrics and qualitative scores. (a) A mosaic diagram showing the distribution of all scores assigned by 20 evaluators to 20 randomly sampled frames. Blue, light blue, and gray panels respectively represent scores of 4, 3, and 2 for Question 1 (see Fig. 2). Vertical and horizontal axes respectively represent the proportion of scores assigned to Questions 1 and 2. Values in the rectangles represent the proportion of each category relative to the total. There were no scores below 1 for Question 1 and no scores above 3 for Question 2. (b) Scatter plot showing the relation between sensitivity and misrecognition (MR) scores for each frame. Blue area is the 95% confidence ellipse for the plotted points. (c) Scatter plot showing the relation between sensitivity scores and Recall. The correlation coefficient was 0.733 (95% confidence interval 0.430–0.887). Blue line represents the regression formula, Y = 2.302 + 2.001X, where Y is the sensitivity score and X is the Recall score.

Figure 5 shows two examples of AI predictions. To human judgment, the AI appears to segment the LCTFs in frame 6 almost completely, with results nearly identical to the manually segmented areas of the ground truth. Indeed, 18 of the 20 surgeons assigned Question 1 the highest score, and the sensitivity score was 3.80. Even so, the F1/Dice score was only 0.642, probably because slight deviations are overemphasized by the metric. In frame 19, there is a clear discrepancy between the AI’s segmentation results and the ground truth. Surgeon evaluations were lowest for this frame, with a sensitivity score of 2.45. The F1/Dice score was also low, at 0.493, probably due to under-detection of translucent LCTFs.

Figure 5

AI prediction results for (a) frame 6 with the highest sensitivity score and (b) frame 19 with the lowest sensitivity score. The area surrounded by the broken line is an under-detection area.

Nine of the 20 sampled frames had no areas that were judged to be misrecognitions. Each of the remaining 11 frames contained up to two misrecognitions. Specifically, the AI misrecognized features such as gauze mesh fibers (Fig. 6a), fine grooves at the tips of forceps (Fig. 6b), and minor halation on fat or blood surfaces (Fig. 6c) as LCTFs.

Figure 6

Examples where the AI misrecognized (a) gauze mesh fiber, (b) fine grooves at the tips of forceps, and (c) minor halation of fat or blood surfaces as loose connective tissue.

Discussion

In this study, we demonstrated the feasibility of using AI to automatically segment LCTFs to define safe dissection planes during lymphadenectomy in intraoperative videos of robot-assisted gastrectomy. The method’s performance was quantitatively demonstrated (by mean Recall and F1/Dice scores of 0.605 and 0.525, respectively) and was qualitatively convincing to expert surgeons. Notably, there were nearly no MRs. This study is the first to show that AI developed through deep learning can precisely identify fine surgical anatomy.

AI algorithms, particularly those for deep learning, have advanced considerably in medical image-recognition tasks such as radiography29,30,31, endoscopy32,33, and pathological diagnosis34,35, but their applications to surgery are still being investigated. Many attempts have aimed to recognize surgical instruments36 or operative workflows such as those of cholecystectomy37,38,39, colectomy40, and sleeve gastrectomy41. Madani et al. reported promising results for the identification of safe zones for dissection during laparoscopic cholecystectomy (defined as the area located within the hepatocystic triangle), with high sensitivity and F1/Dice scores of 0.69 and 0.70, respectively37. In the present study, we assigned the AI the more difficult task of recognizing LCTFs for direct visualization of safe dissection planes. The feasible results obtained may be attributable to augmentation of the more than 1800 training images, containing over 20,000 annotated objects, taken from intraoperative videos in which the surgical field was stabilized by robotic equipment. In addition to dataset size, annotation consistency may be especially important when recognizing indefinite regions of interest such as LCTFs, because the precision of the ground truth greatly affects the outcome of supervised learning. In this study, we used training data carefully labeled by surgeons with clinical experience in gastric cancer surgery. Annotation reliability is supported by the strong correlation between the Recall scores, calculated using the surgeons’ annotations as the ground truth, and the sensitivity scores assigned by the trained surgeon evaluators.

Performance metrics in machine learning rely heavily on pixel-wise deviation between the two sets and are biased by the shape of the segmented regions23. We used F1/Dice scores because they reflect both size and location agreement in object segmentation42. However, compared with human vision, the values for fine structures such as fibers are underestimated, because slight deviations increase FP and FN, which appear in the denominator of the formula25. In this study, the mean F1/Dice score of 0.525 was not necessarily higher than the values reported by Madani et al.37 for identifying the liver and gallbladder (0.86 and 0.72, respectively). However, as shown in Fig. 3 and the Supplementary Videos, it is clear that the AI accurately highlights LCTFs without apparent visual disagreement. Indeed, these subjective impressions are supported by the results shown in Fig. 4, namely that most surgeons were convinced by the AI’s prediction of LCTFs. Considering that the value was only 0.642 even in frame 6, to which 90% of the trained surgeons assigned the highest sensitivity score, we believe the F1/Dice score demonstrates acceptable performance. As computer segmentation tasks expand into the field of surgery, it will be necessary to discuss to what extent small deviations beyond human discernment are actually problematic. Additional research is therefore needed to develop better metrics.
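
A toy example makes this point concrete: if a one-pixel-wide structure is predicted with the correct shape but shifted by a single pixel, the pixel-wise overlap vanishes and the F1/Dice score collapses, even though a human observer would judge the prediction to be essentially correct (an illustrative NumPy calculation, not data from this study).

```python
# Toy illustration: a 1-pixel shift of a thin structure destroys pixel-wise overlap.
import numpy as np

truth = np.zeros((100, 100), dtype=bool)
truth[:, 50] = True                 # a one-pixel-wide vertical "fiber"

pred = np.zeros_like(truth)
pred[:, 51] = True                  # same shape, shifted right by one pixel

tp = np.logical_and(pred, truth).sum()
fp = np.logical_and(pred, ~truth).sum()
fn = np.logical_and(~pred, truth).sum()
dice = tp / (tp + 0.5 * (fp + fn))  # 0.0 despite a visually near-perfect prediction
print(dice)
```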

The LCTFs that the evaluators judged to be inadequately predicted by the AI shared the common characteristics of translucency and blurring. One cause of such under-detection errors is that the detection threshold was set to 50%; meanwhile, experts empirically know where LCTFs should appear in tissue deployed by countertraction19, so any discrepancy between what is actually seen and the output segmentation is easy for them to recognize. Interestingly, medical students assigned higher scores on the same questionnaire (data not shown). In other words, expert surgeons demand a higher level of predictive ability from AI than do people with anatomical knowledge but no surgical experience. With further learning, AI may become able to predict operative procedures and display surgical anatomy that can be identified only by highly experienced surgeons. The ability to share an image of the dissection plane with others would enhance common understanding and facilitate surgery. Furthermore, displays highlighted with a probability heat map would be even more useful for probabilistic prediction of safe or dangerous dissection planes. Video-based coaching is known to be an efficient teaching method for surgical residents43, and our results could be utilized for automated coaching early in surgical education.
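
Such a probability heat-map display could, for example, be rendered by colormapping the model’s per-pixel probabilities and blending them with the original frame; the sketch below uses OpenCV and is an illustrative assumption, not part of the developed system.

```python
# Hypothetical probability heat-map overlay (OpenCV); not part of the developed AI model.
import cv2
import numpy as np

def heatmap_overlay(frame_bgr: np.ndarray, prob_map: np.ndarray, alpha: float = 0.4):
    """Blend a per-pixel LCTF probability map (values in [0, 1]) onto a uint8 BGR frame."""
    heat = cv2.applyColorMap((prob_map * 255).astype(np.uint8), cv2.COLORMAP_JET)
    return cv2.addWeighted(frame_bgr, 1.0 - alpha, heat, alpha, 0.0)
```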

The most important application of automated anatomy segmentation is to support surgeons’ decision making. Even with technological advances in surgical optics, the outcome of an operation still ultimately depends on the surgeon’s experience and expertise6,44 and on cognition, which is affected by physical and mental condition5 during the operation; automated image segmentation technology can therefore improve the safety and outcome of surgery by supporting decision making. The six levels of autonomous driving defined by the Society of Automotive Engineers range from 0 (fully manual) to 5 (fully autonomous), with levels up to 2 classified as “driving assistance,” which includes steering correction to maintain a “driving lane”45. Recent studies on level-2 driver assistance systems suggest that such technologies reduce driving stress and accidents46. Recently, Yang et al. proposed a roadmap toward full automation of surgery47, in which level 2 is defined as task autonomy, where the robot autonomously performs specific human-designated tasks. Similar to the evolution of automated vehicles, real-time display of AI-analyzed visual data could eventually be incorporated into advanced robotic surgery platforms to help surgeons maintain a safe “dissection plane”.

While our results show promise for clinical use, there are some limitations to consider. First, our AI model has not yet been trained to accurately identify LCTFs under all possible intraoperative conditions, such as bleeding, which can blur boundaries and change colors. Overcoming this challenge is essential to our goal of developing deep-learning models that improve surgical safety by pairing surgery with AI technologies. Creating training data from videos of operations performed by highly experienced surgeons in difficult situations should improve segmentation performance. Second, we need to evaluate the method’s versatility. Generally, AI models can make inferences and predictions only within the scope of their training dataset. However, LCTFs are a common anatomical feature appearing in the optimal dissection plane in many areas of surgery10,11,12,13,14,15,16, and we preliminarily confirmed that the algorithm trained on a gastrectomy dataset also segments LCTFs in total mesorectal excision videos. Third, the mean inference frame rate of the AI model was only 4.9 fps, so the real-time prediction needed for operating room deployment has not yet been achieved. However, with improved machine learning methods this value has recently been approaching 30 fps, so we should soon be ready to bring the model to the operating room.

Conclusions

Deep-learning algorithms can be trained to predict fine, difficult-to-discern anatomical structures such as LCTFs in intraoperative videos at a level that is convincing to expert surgeons. This technology can be used to assist in real-time decision making by presenting a safe dissection plane, which in turn can reduce adverse events. Newer and more advanced algorithms for image segmentation will become increasingly available in surgical fields to provide higher performance and safety.