Abstract
Background
The literature on artificial intelligence (AI) in surgery has advanced rapidly during the past few years. However, the published studies on AI are mostly reported by computer scientists using their own jargon which is unfamiliar to surgeons.
Methods
A literature search was conducted in using PubMed following the preferred reporting items for systematic reviews and meta-analyses (PRISMA) statement. The primary outcome of this review is to provide a glossary with definitions of the commonly used AI terms in surgery to improve their understanding by surgeons.
Results
One hundred ninety-five studies were included in this review, and 38 AI terms related to surgery were retrieved. Convolutional neural networks were the most frequently culled term by the search, accounting for 74 studies on AI in surgery, followed by classification task (n = 62), artificial neural networks (n = 53), and regression (n = 49). Then, the most frequent expressions were supervised learning (reported in 24 articles), support vector machine (SVM) in 21, and logistic regression in 16. The rest of the 38 terms was seldom mentioned.
Conclusions
The proposed glossary can be used by several stakeholders. First and foremost, by residents and attending consultant surgeons, both having to understand the fundamentals of AI when reading such articles. Secondly, junior researchers at the start of their career in Surgical Data Science and thirdly experts working in the regulatory sections of companies involved in the AI Business Software as a Medical Device (SaMD) preparing documents for submission to the Food and Drug Administration (FDA) or other agencies for approval.
Similar content being viewed by others
Artificial intelligence (AI) is a branch of computer science enabling computers to build mathematical models to train and solve problems by mimicking human cognition, provided these models have been trained properly [1]. Machine learning (ML) is a subfield of AI that enables computers to make predictions based on underlying data patterns, while deep learning (DL) is a subset of ML [2] (Fig. 1).
AI has been applied successfully to medicine during the last years [3]. Surgical Data Science is the research field of AI, specifically targeted towards surgical practice to improve the quality of surgical healthcare delivery through the capture, organization, analysis, and modeling of data [4]. AI can be applied to surgery (i) to obtain an automated objective assessment of skills during training [5, 6], (ii) to automate surgery following the paradigm of self-driving cars with respect to robot-assisted surgery (RAS) [7], (iii) to enable intra-operative navigation [8] and (iv) to detect early and thereby prevent errors and improve patient outcomes [9].
As literature on AI in surgery increases, surgeons should be able to understand the published studies on AI. Although this additional AI knowledge may be perceived currently as too theoretical and not useful, and consequently overlooked as not essential for good practice, it is more than likely that in the future this attitude will change as medical devices based on AI will enter the market and be integrated into the clinical practice.
However, this is not a trivial task as the AI terms used in these studies are generated by computer scientists using their own jargon which is largely unfamiliar to surgeons. Thus, the widespread clinical use of AI in surgery faces several challenges. These include AI algorithms that are not transparent or “understood” by surgeons who for this reason regard AI systems as ‘black box’ in nature and understood incompletely [10]. Indeed, few physicians have the necessary knowledge to understand them [11]. Additionally, the data files structure are often extremely complex [12].
Thus, the primary outcome of this review is to provide definitions of the commonly used AI terms in surgery to simplify their understanding by surgeons. In this way, we want to contribute to the development of a multidisciplinary collaboration between surgeons, engineers, and computer scientists. The secondary outcome is to provide, in a supplement, a detailed list of surgical articles in which AI terminology is used.
Materials and methods
A literature search was conducted in September 2021 on PubMed following the preferred reporting items for systematic reviews and meta-analyses (PRISMA) statement [13]. This search required several steps. First, we retrieved any AI terms used in laparoscopy or RAS. We then compiled an initial database using the following search strategy:
-
1.
((((artificial intelligence) OR (machine learning)) OR (deep learning)) OR (computer vision)) OR (Natural language processing)
-
2.
((((laparoscopy) OR (robotic)) OR (minimal invasive)) OR (minimally invasive)) OR (laparoscopic)
-
3.
(surgery) OR (surgical)
-
4.
#2 AND #3
-
5.
#1 AND #4
-
6.
#5 NOT REVIEW
We then applied the following filters to the retrieved articles: Abstract, Journal Article, in the last 5 years, Humans, and English language. Thus, in total 1729 articles were retrieved and two reviewers (AM and KG) independently screened titles and abstracts of all identified publications for relevance before inclusion. The exclusion criteria were reviews, letters, non-peer reviewed articles, conference abstracts, and proceedings. A total of 195 articles were finally included in our initial database. Data from these articles were extracted and checked by two authors (AM and KG). During the second step, we compiled a list of AI terms mentioned at least once in the abstracts of our initial database. The three fundamental terms i.e., AI, ML, and DL were excluded. A final database based on the resulting 38 AI-related terms was established. Next, we then conducted a reciprocal number of queries to our initial database to establish the frequency each term was used. In the third step, we searched for the first mention of each AI term in the literature. In this step, we performed a query in our primary search using item #5 tagged as review to identify all published reviews. We applied the same filters as those used in the initial search for non-review articles and retrieved 34 review articles. The same methodology as in the first step was followed by the same authors (AM and KG) who inspected all reviews to see whether they provided explanations of any of the 38 AI terms either into text or in the online Appendix. Ten published reviews in the last five years contained at least one of the 38 AI terms. The flowchart of the searches performed, based on PRISMA statement is shown in Fig. 2.
Results
Table 1 depicts the list of the 38 AI terms and their occurrence in our initial search (second column). The final number of occurrences used in our final database is shown in the third column. Table 1 shows that convolutional neural networks (CNNs) were the most frequent, appearing in 74 studies, followed by classification in 62, artificial neural networks (ANNs) in 53, and regression in 49. The most frequent expressions were supervised learning (reported in 24 articles), support vector machine (SVM) in 21, and logistic regression in 16. The rest of the 38 terms were mentioned infrequently. The detailed list of the citations occurring in each one of the 38 terms is shown in the online Appendix.
Table 2 reports the occurrence of each one of the 38 main AI terms in the ten retrieved reviews. Except the one by Zhou et al. [14] where 21 out of the 38 (55%) terms were mentioned, only a minority of the terms occurred in the rest, ranging from 12 (32%) [15] to just one (3%) [9].
Table 3 provides a glossary with the definitions of the AI terms identified in our search results. A detailed list of surgical articles in which each AI term is included is reported in the online Appendix.
Discussion
Surgery is in its fourth generation (open surgery, endoluminal surgery, laparoscopic surgery, and RAS). Laparoscopic surgery and RAS provide huge amount of data which can be processed by AI, e.g., one minute of a high-resolution minimal access operations generate 25 times the amount of data found in a high-resolution computed tomography scan [16]. However, minimal access surgery poses a significant challenge to image analysis due to changes in illumination, unfocused frames, blood and smoke in the surgical field, and anatomical diversity [17]. In addition to data from videos, RAS generates data from robot kinematics and event data (e.g., pressing camera and/or clutching pedals) [18].
Since the number of AI terms is increasing constantly with constant expansion of the reported literature on AI, the new generation of surgeons will be required to become familiar with AI knowledge and its reported literature, since AI is expected to have a significant impact on surgery at all the stages: pre-operative, intra-operative, and post-operative.
In this report, we aimed to review and attempt to categorize the relevant terms as well as provide a glossary for surgeons. Our search revealed that CNNs is the AI term reporting the highest number (n = 74) of published studies in surgery. This is not surprising as CNNs constitute the backbone of AI frameworks for different applications of computer vision, namely classification (prediction of the correct class of objects in an image), and object detection (localization of objects in addition to classifying the correct class). CNNs also form the basis of complex DL architectures like U-Net for semantic segmentation (definition of pixel-wise borders of objects of the same class) and Mask R-CNN for instance segmentation (definition of pixel-wise borders of each object). As shown in Table 1, all these frameworks have been applied to surgery. When coupled with recurrent neural networks (RNNs), CNNs are capable not only to process spatial information to localize surgical tools, but also to analyze temporal information so that they can be used to analyze the surgical workflow, for instance for action recognition (e.g., dissection and cutting) and phase recognition (e.g., incision of splenorenal ligament). The two common tasks in a ML i.e., classification and regression, were ranked highly in terms of their occurrence. As previously, classification is part of other computer vision tasks e.g., detection and segmentation. Regression is defined as the task involved in the prediction of continuous numerical values. The simplest type of regression is linear regression in which a fitting line is used to model the data, representing the relation between one dependent and one independent variable. There are also more complex types of regression, e.g., non-linear when a curve is used to fit the data or multiple regression, when the dependent variables is related by more than one independent variable. The main ML algorithms were developed for both classification and regression e.g., support vector machine (SVM), random forests, and multilayer perceptrons (MLP) (Table 1).
In contrast there are some AI terms reporting very low number of occurrences, but which are expected to grow rapidly over the next few years. Examples include imitation learning and reinforcement learning, respectively, mentioned in three and seven studies, which may become more common in the near term in wake of the widespread use of robots in several fields, including surgery.
Our glossary provides a comprehensive list of definitions of AI terms to help different stakeholders. Firstly, residents and surgeons with the need to understand the fundamentals of AI while reading articles. Secondly, young researchers starting their career in Surgical Data Science. Thirdly, experts working in the regulatory department of companies in the business of AI Software as a Medical Device (SaMD) to prepare and submit documents for approval from Food and Drug Administration (FDA) or other agencies.
Our glossary contains not only the definitions of AI terms to develop software e.g., the models, but also those related to the hardware necessary to perform the heavy computation requested by AI, e.g., graphical processing unit (GPU). The availability of cloud services hosting large numbers of GPUs significantly lowered the economic barriers to access powerful hardware to train and test even the most complex AI models.
The access to high performance computers at a reasonable price and the possibility to record and store videos of minimal access surgeries would suggest that building and training large datasets is within the reach of most research centers. However, training of AI models for surgery is extremely labor intensive since the process of annotating images (called annotation) requires specific knowledge. It is not a simple annotation of images for “cat versus dog” classification or detection tasks, but rather a process to correctly identify the surgical tools, the anatomical parts (e.g., organs and vessels), and the clinically meaningful events. While laypersons and crowd annotators can reach the same level of surgeons for annotating surgical tools, to identify the anatomy and the quality of a dissection experts surgeons are required [19]. Additionally, the files must be anonymized to protect patients’ identity. Consequently, the size of the datasets of the published studies is typically small. An attempt to overcome this limitation is the Critical View of Safety Challenge [20] of the AI Task Force of the Society of American Gastrointestinal and Endoscopic Surgeons (SAGES), an online platform where it is possible to donate videos of laparoscopic cholecystectomy and contribute as annotators of the videos.
Conclusions
Surgical data science was recently introduced as the specific field of AI in surgery. Literature on this subject is expanding rapidly. For this reason, there is the need for surgeons to become familiar with the AI terms which were traditionally coined by computer scientists. In this review, we prepared a glossary with definitions of AI terms in surgery after reviewing the literature. This glossary will be useful not only to surgeons, but also to young researchers approaching the field, and companies developing SaMD applications.
References
Russell SJ, Norvig P, Davis E (2010) Artificial intelligence: a modern approach, 3rd edn. Prentice Hall, Upper Saddle River
Murphy KP (2012) Machine learning: a probabilistic perspective. MIT Press, Cambridge
Topol EJ (2019) High-performance medicine: the convergence of human and artificial intelligence. Nat Med 25:44–56. https://doi.org/10.1038/s41591-018-0300-7
Maier-Hein L, Eisenmann M, Sarikaya D, März K, Collins T, Malpani A, Fallert J, Feussner H, Giannarou S, Mascagni P, Nakawala H, Park A, Pugh C, Stoyanov D, Vedula SS, Cleary K, Fichtinger G, Forestier G, Gibaud B, Grantcharov T, Hashizume M, Heckmann-Nötzel D, Kenngott HG, Kikinis R, Mündermann L, Navab N, Onogur S, Roß T, Sznitman R, Taylor RH, Tizabi MD, Wagner M, Hager GD, Neumuth T, Padoy N, Collins J, Gockel I, Goedeke J, Hashimoto DA, Joyeux L, Lam K, Leff DR, Madani A, Marcus HJ, Meireles O, Seitel A, Teber D, Ückert F, Müller-Stich BP, Jannin P, Speidel S (2022) Surgical data science—from concepts toward clinical translation. Med Image Anal 76:102306. https://doi.org/10.1016/j.media.2021.102306
Wang Z, Majewicz Fey A (2018) Deep learning with convolutional neural network for objective skill evaluation in robot-assisted surgery. Int J CARS 13:1959–1970. https://doi.org/10.1007/s11548-018-1860-1
Ershad M, Rege R, Majewicz Fey A (2019) Automatic and near real-time stylistic behavior assessment in robotic surgery. Int J CARS 14:635–643. https://doi.org/10.1007/s11548-019-01920-6
Panesar S, Cagle Y, Chander D, Morey J, Fernandez-Miranda J, Kliot M (2019) Artificial intelligence and the future of surgical robotics. Ann Surg 270:223–226. https://doi.org/10.1097/SLA.0000000000003262
Kong S-H, Haouchine N, Soares R, Klymchenko A, Andreiuk B, Marques B, Shabat G, Piechaud T, Diana M, Cotin S, Marescaux J (2017) Robust augmented reality registration method for localization of solid organs’ tumors using CT-derived virtual biomechanical model and fluorescent fiducials. Surg Endosc 31:2863–2871. https://doi.org/10.1007/s00464-016-5297-8
Egert M, Steward JE, Sundaram CP (2020) Machine learning and artificial intelligence in surgical fields. Indian J Surg Oncol 11:573–577. https://doi.org/10.1007/s13193-020-01166-8
Rimmer L, Howard C, Picca L, Bashir M (2021) The automaton as a surgeon: the future of artificial intelligence in emergency and general surgery. Eur J Trauma Emerg Surg 47:757–762. https://doi.org/10.1007/s00068-020-01444-8
Burlina PM, Joshi N, Pekala M, Pacheco KD, Freund DE, Bressler NM (2017) Automated grading of age-related macular degeneration from color fundus images using deep convolutional neural networks. JAMA Ophthalmol 135:1170. https://doi.org/10.1001/jamaophthalmol.2017.3782
Cahill RA, Mac Aonghusa P, Mortensen N (2021) The age of surgical operative video big data—my bicycle or our park? Surgeon. https://doi.org/10.1016/j.surge.2021.03.006
(2020) PRISMA transparent reporting of systematic reviews and meta-analyses. http://www.prisma-statement.org. Accessed 17 June 2022
Zhou X-Y, Guo Y, Shen M, Yang G-Z (2020) Application of artificial intelligence in surgery. Front Med 14:417–430. https://doi.org/10.1007/s11684-020-0770-0
Garrow CR, Kowalewski K-F, Li L, Wagner M, Schmidt MW, Engelhardt S, Hashimoto DA, Kenngott HG, Bodenstedt S, Speidel S, Müller-Stich BP, Nickel F (2021) Machine learning for surgical phase recognition: a systematic review. Ann Surg 273:684–693. https://doi.org/10.1097/SLA.0000000000004425
Hashimoto DA, Rosman G, Rus D, Meireles OR (2018) Artificial intelligence in surgery: promises and perils. Ann Surg 268:70–76. https://doi.org/10.1097/SLA.0000000000002693
Anteby R, Horesh N, Soffer S, Zager Y, Barash Y, Amiel I, Rosin D, Gutman M, Klang E (2021) Deep learning visual analysis in laparoscopic surgery: a systematic review and diagnostic test accuracy meta-analysis. Surg Endosc 35:1521–1533. https://doi.org/10.1007/s00464-020-08168-1
Moglia A, Georgiou K, Georgiou E, Satava RM, Cuschieri A (2021) A systematic review on artificial intelligence in robot-assisted surgery. Int J Surg 95:106151. https://doi.org/10.1016/j.ijsu.2021.106151
Ward TM, Fer DM, Ban Y, Rosman G, Meireles OR, Hashimoto DA (2021) Challenges in surgical video annotation. Comput Assist Surg 26:58–68. https://doi.org/10.1080/24699322.2021.1937320
Society of American Gastrointestinal and Endoscopic Surgeons (SAGES) (2019) The critical view of safety challenge. In: The critical view of safety challenge. https://www.cvschallenge.org/. Accessed 16 Mar 2022
Chang TC, Seufert C, Eminaga O, Shkolyar E, Hu JC, Liao JC (2021) Current trends in artificial intelligence application for endourology and robotic surgery. Urol Clin North Am 48:151–160. https://doi.org/10.1016/j.ucl.2020.09.004
Chen J, Remulla D, Nguyen JH, Aastha D, Liu Y, Dasgupta P, Hung AJ (2019) Current status of artificial intelligence applications in urology and their potential to influence clinical practice: artificial intelligence in urology. BJU Int 124:567–577. https://doi.org/10.1111/bju.14852
Hashimoto DA, Ward TM, Meireles OR (2020) The role of artificial intelligence in surgery. Adv Surg 54:89–101. https://doi.org/10.1016/j.yasu.2020.05.010
Ma R, Vanstrum EB, Lee R, Chen J, Hung AJ (2020) Machine learning in the optimization of robotics in the operative field. Curr Opin Urol 30:808–816. https://doi.org/10.1097/MOU.0000000000000816
Tanzi L, Piazzolla P, Vezzetti E (2020) Intraoperative surgery room management: a deep learning perspective. Int J Med Robot 16:1–12. https://doi.org/10.1002/rcs.2136
van Amsterdam B, Clarkson MJ, Stoyanov D (2021) Gesture recognition in robotic surgery: a review. IEEE Trans Biomed Eng 68:2021–2035. https://doi.org/10.1109/TBME.2021.3054828
Stevens E, Antiga L, Viehmann T, Chintala S (2020) Deep learning with PyTorch. Manning Publications Co, Shelter Island
Müller AC, Guido S (2016) Introduction to machine learning with Python: a guide for data scientists, 1st edn. O’Reilly Media Inc, Sebastopol
Géron A (2019) Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: concepts, tools, and techniques to build intelligent systems, 2nd edn. O’Reilly Media, Inc, Beijing [China], Sebastopol
Rumelhart DE (1999) Parallel distributed processing. 1: Foundations/David E. Rumelhart, 12. print. MIT Pr, Cambridge, Mass
Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv:150203167 [cs]. https://arxiv.org/abs/1502.03167
Foster D (2019) Generative deep learning: teaching machines to paint, write, compose, and play, 1st edn. O’Reilly Media Inc, Sebastopol
Saha S (2018) A comprehensive guide to convolutional neural networks—the ELI5 way. In: Towards data science. https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53. Accessed 17 Jun 2021
Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv:12070580 [cs]. https://arxiv.org/abs/1207.0580
Modaresnezhad M, Vahdati A, Nemati H, Ardestani A, Sadri F (2019) A rule-based semantic approach for data integration, standardization and dimensionality reduction utilizing the UMLS: application to predicting bariatric surgery outcomes. Comput Biol Med 106:84–90. https://doi.org/10.1016/j.compbiomed.2019.01.019
Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder–decoder for statistical machine translation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, pp 1724–1734
Gers FA, Schmidhuber J (2000) Recurrent nets that time and count. In: Proceedings of the IEEE-INNS-enns international joint conference on neural networks. IJCNN 2000. Neural computing: new challenges and perspectives for the new millennium, vol. 3. IEEE, Como, Italy, pp 189–194
Ho J, Ermon S (2016) Generative adversarial imitation learning. arXiv:160603476 [cs]. https://arxiv.org/abs/1606.03476
Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial networks. arXiv:14062661 [cs, stat]. https://arxiv.org/abs/1406.2661
Obaidat MS, Nicopolitidis P, Zarai F (2015) Modeling and simulation of computer networks and systems: methodologies and applications. Morgan Kaufmann, Waltham
Gao Y, Vedula S, Reily C, Ahmidi N, Varadarajan B, Lin HC, Tao L, Zappella L, Bejar B, Yuh DD, Chen CCG, Vidal R, Khudanpur S, Hager GD (2014) JHU-ISI gesture and skill assessment working set (JIGSAWS): a surgical activity dataset for human motion modeling. modeling and monitoring of computer assisted interventions (M2CAI)—MICCAI workshop 3:
Zhou F, De la Torre F, Hodgins JK (2008) Aligned cluster analysis for temporal segmentation of human motion. 2008 8th IEEE international conference on automatic face & gesture recognition. IEEE, Amsterdam, pp 1–7
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39:1137–1149. https://doi.org/10.1109/TPAMI.2016.2577031
Krishnan S, Garg A, Patil S, Lea C, Hager G, Abbeel P, Goldberg K (2017) Transition state clustering: unsupervised surgical trajectory segmentation for robot learning. Int J Robot Res 36:1595–1618. https://doi.org/10.1177/0278364917743319
Aqel MOA, Marhaban MH, Saripan MI, Ismail NBt (2016) Review of visual odometry: types, approaches, challenges, and applications. Springerplus 5:1897. https://doi.org/10.1186/s40064-016-3573-7
Acknowledgements
Fondazione Banca Del Monte Di Lucca partly funded this work.
Funding
Open access funding provided by Università di Pisa within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Disclosures
Andrea Moglia, Konstantinos Georgiou, Luca Morelli, Konstantinos Toutouzas, Richard M. Satava, and Alfred Cuschieri have no conflicts of interest or financial ties to disclose.
Ethical approval
Institutional Review Board was not required since the work did not involve humans or animals.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Moglia, A., Georgiou, K., Morelli, L. et al. Breaking down the silos of artificial intelligence in surgery: glossary of terms. Surg Endosc 36, 7986–7997 (2022). https://doi.org/10.1007/s00464-022-09371-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00464-022-09371-y