Abstract
Background
Cutaneous malignant melanoma (CMM) ranks among the ten most frequent malignancies, clinicopathological staging being of key importance to predict prognosis. Artificial intelligence (AI) has been recently applied to develop prognostically reliable staging systems for CMM. This study aims to provide a useful machine learning based tool to predict the overall CMM short-term survival.
Methods
CMM records as collected at the Veneto Cancer Registry (RTV) and at the Veneto regional health service were considered. A univariate Cox regression validated the strength and direction of each independent variable with overall mortality. A range of machine learning models (Logistic Regression classifier, Support-Vector Machine, Random Forest, Gradient Boosting, and k-Nearest Neighbors) and a Deep Neural Network were then trained to predict the 3-years mortality probability. Five-fold cross-validation and Grid Search were performed to test the best data preprocessing procedures, features selection, and to optimize models hyperparameters. A final evaluation was carried out on a separate test set in terms of balanced accuracy, precision, recall and F1 score. The best model was deployed as online tool.
Results
The univariate analysis confirmed the significant prognostic value of TNM staging. Adjunctive clinicopathological variables not included in the AJCC 8th melanoma staging system, i.e., sex, tumor site, histotype, growth phase, and age, were significantly linked to overall survival. Among the models, the Neural Network and the Random Forest models featured the best prognostic performance, achieving a balanced accuracy of 91% and 88%, respectively. According to the Gini importance score, age, T and M stages, mitotic count, and ulceration appeared to be the variables with the greatest impact on survival prediction.
Conclusions
Using data from patients with CMM, we developed an AI algorithm with high staging reliability, on top of which a web tool was implemented (unipd.link/melanomaprediction). Being essentially based on routinely recorded clinicopathological variables, it can already be implemented with minimal effort and further tested in the current clinical practice, an essential phase for validating the model’s accuracy beyond the original research context.
Similar content being viewed by others
1 Introduction
Cutaneous malignant melanoma (CMM) is one of the deadliest skin cancers due to its intrinsic biological aggressiveness and relatively high probability of misdiagnosis [1, 2]. Melanoma accounted for 5.6% of all new cancer cases in the U.S. in 2021, and its incidence has been on a steady global increase over the past few decades [3,4,5]. In the U.S., the median age at diagnosis is 65 years, and the median age at death from melanoma is 71 years [3, 6]. In Italy, the estimated total number of new cases of cutaneous melanoma was 14,900 (8,100 in males, 6,700 in females) in 2020, while 169,900 people are estimated to be alive following a melanoma diagnosis. CMM is the third most common malignancy among Italians aged 50 years or less [6, 7]. These numbers raise concerns about optimizing the efficacy and the efficiency of CMM management, as well as the economic impact of this disease on healthcare systems [4, 6, 8].
Despite advances in early detection and treatment, CMM continues to be a disease with highly variable outcomes. Developments in systemic adjuvant medications for stage III and stage IV melanomas are contributing to improved outcomes even for advanced melanoma patients, but there are still gaps in our ability to correctly stage melanomas [9,10,11]. Internationally, the prognostic assessment of CMM outcome is based on the American Joint Committee on Cancer (AJCC) melanoma staging system [2, 12].
Recent theoretical applications of various artificial intelligence (AI) algorithms in oncological research have produced promising results, which might help optimize cancer care by personalizing patient treatment [13,14,15,16]. Unlike traditional computer programming, AI does not rely on a pre-determined algorithm to produce an output, but rather analyzes input data with its associated output to process a model that can be then used to infer on similar datasets [17]. In order to work properly, these types of algorithm need large amounts of data to be trained on, with the further proviso that the data must be of good quality [18]. The main advantage of AI over traditional approaches lies in its ability to analyze multiple measures in complex and large data sets, combining information, and weighing the relative impact in relation to the target outcomes, therefore offering more advanced prognostic capabilities compared to human-based staging systems [19, 20]. Given the recent increase in the development of AI-based algorithms for medical use, and the appearance in public health of good-quality, large clinical databases, the times are ready for testing the application of artificial intelligence techniques to CMM staging systems as well [13,14,15,16]. A number of significant published studies [21,22,23,24,25] demonstrated the effectiveness of bioinformatics analysis and machine learning to address this issue alternatively to prognostic nomograms [26] for melanoma patients. Nevertheless, most of these works never turned into a real application [25]. Moreover, richer data, including medical examination results, such as whole genome sequencing, medical imaging and pathological pictures, do not always result in better models. In fact, despite high claimed accuracy, models relying on digital pathology slides and deep learning techniques (e.g. Convolutional Neural Networks) may fall into overfitting due to the high variability of the histopathological images [27]. In addition, omics data are complex to handle and genome sequencing has expensive costs [23].
It is therefore necessary to find the right compromise between model flexibility degree and data type complexity in order to minimize bias and ensure high generalizability.
Based only on plain routinely collected CMM clinicopathological variables, as recorded by the regional population-based Veneto cancer registry, this study aims to explore the consistency of AI in predicting short term overall mortality in CMM patients and then to provide a useful online tool for clinical practice [6].
2 Methods
2.1 Context
The Italian National Health System is a public service grounded in the fundamental values of universality, free access, freedom of choice, pluralism in provision, and equity. On an organization level, the health system is regionally structured and primarily supported by general taxation [28].
In 2015, the Veneto Oncology Network (ROV) published a comprehensive document based on the current national and international literature, detailing the clinical procedures for the clinical management of CMM patients [29,30,31,32]. It included procedures to be followed from the patient’s initial diagnosis to end-of-life care, as well as a detailed set of indicators to monitor consistency between recommendations and real-world clinical practice [33].
2.2 Clinical data
The data for the analysis were sourced from the Veneto Cancer Registry (RTV), a high-resolution, population-based dataset covering the regional population (approximately 4.9 million residents), and the regional health service records. Cancer registration procedures were based on information collected from various sources (e.g., pathology reports, death certificates, and the health service’s administrative records) [6, 33].
All incident cases of invasive CMM registered by the RTV in 2015 (1,279 cases) and 2017 (1,368 cases) were included [6, 33]. The following variables were considered for this study: demographics (age and sex); histological subtypes of CMM (malignant not otherwise specified - NOS -, superficial spreading melanoma, nodular melanoma, lentigo maligna melanoma, acral-lentiginous melanoma, desmoplastic melanoma, and spitzoid melanoma); tumor site (lower limbs, upper limbs, head, hands and feet, and trunk); CMM growth phase (radial versus vertical); ulceration (absent versus present); Breslow thickness (≤ 0.75, 0.76–1.50, 1.51–3.99, ≥ 4.00 mm); CMM regression (absent versus present); tumor-infiltrating lymphocytes (TIL) (absent versus present); mitotic count (number of mitoses per mm2); T, N, and M 8th edition AJCC stages at diagnosis; sentinel lymph node biopsy - SLNB - (negative versus positive); SLNB maximum metastasis diameter (in mm); number of positive lymph nodes after SLNB or lymphadenectomy; and, evaluation of overall survival (OS) time truncated at 3 years from diagnosis [6, 33].
2.3 Data preparation
Some features were not available for all subjects, particularly tumor site (96 missing), Breslow thickness (152), ulceration (168), pTNM stage [T value (63), N value (59), M value (21)], TIL (354), mitotic count (392), growth (587), and regression (759). As the missing values were evenly distributed across the subjects, it was considered preferable to proceed with imputation strategies as opposed to discarding incomplete records so as not to excessively reduce the sample and lose information. Simple feature imputation and multivariable regression were adopted to fill in missing data exploiting the complete records (excluding the survival outcome) as a training set.
Ordinal encoding was used for Breslow thickness, while one-hot encoding was used for pTNM stage (T, N, M values), sex, histological subtypes, ulceration, regression, TIL, growth, SLNB positivity, and site.
Finally, the dataset was shuffled and split into train and test sets (test size = 10%, 265 records) to estimate the algorithms’ predictive performances without biases.
2.4 Statistical analysis
Descriptive statistics were obtained representing categorical variables as frequencies and proportions and summarising continuous numerical variables with means, medians, and minimum–maximum intervals. A univariate analysis was performed with Cox Regression to verify the strength and direction of each independent variable on CMM mortality. The correlation matrix was also calculated to check interdependencies and redundancies in the data.
Principal Components Analysis [34] was instead conducted to graphically inspect the grade of separability of the two classes, survived/deceased in our data (Fig. 1). PCA was additionally used as an alternative feature reduction strategy, selecting the minimum number of components such that the total explained variance ratio is greater than 80% of the original data.
2.5 Machine learning models
In this study, several machine learning (ML) models were trained to predict mortality risk expressed as a binary label, survived versus deceased within 3 years from diagnosis, in a supervised learning fashion.
A range of shallow classifiers were firstly implemented with varying degrees of complexity and interpretability: Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting (GB), and k-Nearest Neighbors (kNN).
Logistic Regression [35] (or logit model) is a statistical technique which models the probability of an event taking place, assuming the log-odds for the event is a linear combination of one or more independent variables. Support Vector Machine [36], is a robust prediction methods that maps a set of training examples, belonging to two different classes, to points in space so as to maximise the width of the gap between the two categories. New examples are then mapped into the same space and predicted to belong to a category based on which side of the gap they fall. SVMs can efficiently be generalized for non-linear classification using the so called "kernel trick". The third model, Random Forest [37], is an ensemble-based classification or regression method that builds a large number of weak decision trees. For classification tasks, the output of the RF is the class which receives the majority of votes from the trees. Similarly, Gradient Boosting is a tree based ensemble predictor which stage-wise builds weak learners to progressively reduce the prediction error of previous model. Lastly, k-Nearest Neighbors [38] algorithm prediction is based on the shortest distance between the sample points and all of the training dataset’s points (with k indicating the number of nearest neighbours considered for the membership voting). As in RF, a new sample point is labelled by the class that gained the most votes.
In this work, a simple Deep Neural Network [39] (DNN) has also been tested. The architecture, illustrated in Fig. 2a, has been designed as follows: first a standard Input layer with number of units equal to the number of features of the training data, then two dense layer with a gradually halving number of units and Rectified Linear Unit [40] activation function (ReLU), lastly a one dimensional Output layer with Sigmoid [41] activation function, since the task is binary classification. In addition, considering that DNNs are very complex and flexible models, the inner layers have been provided of L2 weight decay as regularization [42] technique to avoid a rapid overfitting. The training of the network weights was performed via the Adam optimizer [43] algorithm with the Binary Cross-entropy [44] as loss since the prediction task is a survived versus deceased classification.
Five-folds cross-validation [45] (CV) in combination with the Grid Search [46] optimization algorithm were performed on the training set to understand the best preprocessing procedures (feature selection, scaling, etc.) and to automatically select the best hyperparameters values between different possible combinations. The hyperparameters for each model are listed in Fig. 2b.
Given that the task of predicting CMM mortality risk is naturally defined as an imbalanced classification problem, the fitting and test evaluations were measured in terms of balanced accuracy, precision, recall and F1 score as defined below:
where
and P, N, TP, TN, FP, and FN, respectively, represent the number of real, correctly classified, and incorrectly classified examples with positive and negative classes. In this context, the positive instances (P) are the observed subjects deceased within 3 years from CMM diagnosis, conversely to the negative label (N) corresponds to survival.
Note that when running the Grid Search algorithm only the balanced accuracy metric was selected to be maximized.
R 4.0.4 was used to conduct statistical analyses. Data preparation and ML modeling were performed in Python 3.8.8, with an extensive use of “sklearn” (version 0.24.1) [47] and “tensorflow” (version 2.9.2) [48] libraries.
3 Results
Table 1 depicts the distribution of the main characteristics of the CMM cohorts in the Veneto Region in 2015 and 2017. The mean follow-up duration was 1,032.8 days. The overall mortality was 10.4% at 3 years after diagnosis. The univariate analysis revealed that older age, male sex, vertical growth pattern, thicker Breslow depth, presence of ulceration, absence of TILs, higher mitotic count, SLNB positivity, wider SLNB max diameter, and greater number of positive lymph nodes are all statistically associated with short-term CMM mortality.
The correlation between TNM stages and melanoma outcome was confirmed. The primary site resulted relevant when the tumor is located on the hands, feet, or head. However, upper limb or trunk localizations do not appear to have a higher hazard in comparison to lower limbs. Regarding histology, nodular and malignant (NOS) subtypes had the highest hazard ratios (HR 2.55 and 2.30, respectively; acral-lentiginous subtype as the reference category), while superficial spreading correlates with better outcomes (HR 0.40; acral-lentiginous subtype as the reference category). Desmoplastic melanoma, spitzoid melanoma, and lentigo maligna melanoma were also linked with higher survival rates, but the results were not statistically significant (p-values 0.991, 0.057, and 0.300, respectively). Similarly, the desmoplastic subtype’s HR showed an increased risk not statistically significant (p-value 0.580).
As expected, the correlation analysis revealed interdependence between T stage values and Breslow thickness, as well as between N stages and SLNB positivity (Fig. 3). For this reason, the ML models were trained and evaluated on two different variables subsets: one excluding Breslow, number of positive lymph nodes, SLNB positivity and maximum diameter, one excluding T and N stages. A third tested alternative of feature selection was provided by PCA, which naturally reduces dimensionality and remove multicollinearity from the data at the cost of a poorer results interpretability.
The prognostic performances of the implemented classifiers are reported in Table 2. Given its more complex structure and the known universal approximation ability, the DNN unsurprisingly outperformed the shallow predictors both in terms of balanced accuracy and F1 score. However, the ensemble models, GB and RF, have achieved only slightly worse results. With a balanced accuracy of 91.1%, respectively 88.0%, the DNN and the RF proved to be the best options. Figure 4 shows the two models’ Receiver Operating Characteristic (ROC) curves and relative area under the curve (AUC) values.
As RF is a tree-based model, it was also possible to extract each feature’s Gini importance score [49] and to represents the most important variables for CMM risk prediction (Fig. 5). The patient’s age, mitotic rate, T4 staging, the presence of ulceration, and metastasis appear to have the greatest influence on the classification of short-term mortality.
Finally, a web application was built on top of the best developed model. The tool is accessible for free at the following web page: unipd.link/melanomaprediction.
4 Discussion
This study developed a machine learning algorithm that effectively predicts short-term overall mortality of patients with CMM.
In recent years, machine learning has been applied extensively to improve melanoma risk stratification and prognosis prediction. Most research has focused on finding new clinical and pathological markers [6, 10]. Nevertheless, none of the new, promising, prognostic variables have yet been added to the AJCC system, which is currently the gold standard staging system [9, 50]. Stage II and III patients currently have access to different therapeutic strategies (with or without adjuvant strategies), resulting in a subset of stage II patients having worse survival rates than stage III patients [10]. A more accurate prognostic tool is needed to increase the survival of melanoma patients by preventing recurrence and providing the most appropriate follow-up regimens [6, 10].
We decided to focus on the implementation of an algorithm based on known and validated prognostic factors, with the aim of using machine learning to improve prediction capabilities and facilitate the application of this novel melanoma risk stratification tool [6]. The results of an initial univariate analysis on the available subjects’ characteristics were consistent with those of earlier scientific literature. In addition, histological features, including thicker Breslow depth, the presence of ulceration, SLNB positivity, and the absence of TILs, are widely accepted [51,52,53,54]. In contrast, other prognostic factors, including a vertical growth pattern, a higher mitotic count, a wider diameter of the metastasis in the SLNB, a greater number of positive lymph nodes, and tumor regression (which did not reach significance in our analysis), have demonstrated less relevance and, therefore, were evaluated differently in various research contexts [51,52,53,54]. Consistently with previous research, our analysis also proved that some already known prognostic factors, namely primary site location, histology subtype, and N stage, may have different relevance depending on the specific class considered in the prognosis [51,52,53, 55]. These findings suggest that a better classification of existing prognostic factors is possible. [6]
The results of training a new model through machine learning are promising [6]. Using only routinely collected information, our best algorithm, a small Deep Neural Network, was able to attain an accuracy of 91.1% and an AUC value of 93.3%. Comparatively, one previous study on the prognostic accuracy of the AJCC staging system, 8th edition, reported an AUC of 74% (on a cohort of 1,462 patients). [53]
One previous study by Arora et al. [50] tried the same approach, using the least number of routinely-used variables to produce an improved risk stratification algorithm for melanoma. They analyzed a subset of 449 patients from The Cancer Genome Atlas (TCGA) and developed an algorithm to predict life expectancy based on Breslow thickness, N staging, M staging, and ulceration status (“CMcrpred,” which is accessible via a web page and an Android app). The algorithm was validated on the same cohort of patients and was reported to perform better than the traditional AJCC staging system. Arora et al. also reported that clinicopathological features outperformed the use of molecular biomarkers in a combined model for melanoma. [10, 50]
Another interesting work [25] deployed two AI models, one predicting the probability of 5-year survival and the second for predicting overall survival as a regression task, as an online calculator with an interactive interface. The web page prompts the user to enter not only Breslow thickness, N staging, M staging, and ulceration status, but also the patient’s gender, age at diagnosis, tumour site, eventual recurrence type and history of previous malignancies. To develop this tool, the authors retrieved more than one hundred thousand adult subjects with cutaneous melanoma from the Surveillance, Epidemiology, and End Results (SEER) database, then experimented with several techniques, including Naïve Bayes, generalized linear model, logistic regression, decision tree, random forest, gradient boosted trees, support vector machine and deep learning. The latter was found to be the best in the prediction of 5-year survival, reporting an AUC up to 91.5% and accuracy up to 84.8%, similar with the present study’ findings.
In the Italian context, Comes and colleagues [24] combined deep learning and support vector machine with the aim of learning prognostic biomarkers from cutaneous melanoma whole-slide histological images to predict 1-year disease free survival in a binary classification fashion. After feeding their models with thousands of slide crops, manually selected and annotated by two expert histopathologists, the authors obtained an AUC value of 66.7% and an accuracy of 72.7% on the validation cohort of patients. A similar approach was proposed by Li et al. [23], who managed to achieve an AUC of 76.9%. The superior performances of classifiers with one-dimensional vectors, used by the present study, as input compared to the two latter studies involving instead image-based models presented, confirms how difficult it is to implement high-precision model using only imaging. In fact, despite their potential, pathology images require time spending preparations, often including manual elaboration or classification, in order to uniform the crop sizes and normalize colours. Problems of slide inconsistencies could also occur due to different stain manufacturers, staining procedures or storage [24]. In addition, AI methods injecting figures usually relies on more complex architectures, such as Convolutional Neural Networks, which have more parameters to be trained and need large sets of labelled examples. These characteristics make such models more prone to possible overfitting and more difficult to generalize and use in practice.
As reported by Triantafyllidis and Tsanas [56], the incorporation of machine learning into clinical practice requires multiple steps: (1) a retrospective validation of the algorithm on an adequately powered sample; (2) integration of the algorithm into an accessible digital tool (such as a mobile phone-based tool); (3) assessment of the tool in a real-life clinical setting; and (4) monitoring the tool’s actual implementation outside the research setting through periodic reviews of its effectiveness and safety. For these reasons, we developed a web-based application (available at unipd.link/melanomaprediction) to make the research results accessible and applicable. As already discussed, in addition to possible lack of generalizability and bias introduction, this type of technology faces the main risk of never being utilized in a real-world clinical context. These problems are commonly found in the development of machine learning and are especially relevant for algorithms constructed using data from a single center [10, 17, 57, 58]. However, we encourage physicians and institutions to cross validate our results by testing the algorithm in a real-world setting.
In the end, we conducted the research using a large cohort of patients to develop an algorithm capable of predicting melanoma mortality and providing estimates of its accuracy on a separate blinded subset of patients. Comparing our work with state-of-art models, we managed to achieve similar or better results, also showing superior performances than the AJCC staging system both in accuracy and in AUC, thus supporting the hypothesis that the use of machine learning in the field of melanoma risk stratification produces better results than traditional staging systems, as hoped. This could be a first step in bringing the power of clinical stratification to non-specialist settings, and to support physician in decision making. This result is also relevant for its implications in supporting the development of new and improved AI-based staging systems, both for melanoma and other diseases.
4.1 Limitations
It is important to note that ML has some limitations. The ML survival probability computation is not easy to understand for most users, as the methodology used by the algorithm is complex [59]. Moreover, our cohort is population-based including subjects treated in different hospitals, which could have received heterogeneous treatments, even though the Veneto region defined standardized clinical pathways shared with all different health care institutions and based on national and international guidelines. Unfortunately, no molecular biology variables were available, and thus only clinicopathological information was imputed in our model; however, we expect that implementation of molecular profiling data will further improve the accuracy of our prediction tool. Lastly, as a final note, training models on more cohort patients’ examples would have produced even more robust and accurate predictions [17, 57].
5 Conclusions
Applications based on machine learning techniques will probably reshape the future of prognosis prediction in cancer medicine. Our best model achieved satisfying prognostic performance considering routinely collected information: importantly, this algorithm appears to outperform traditional approaches relying exclusively on AJCC staging system as well as state-of-art results based on deep learning strategies and digital pathology slides. Future studies are needed to investigate whether immunohistochemical tests and molecular analyses could provide new features that improve predictivity or whether they constitute an unnecessary diagnostic delay and costs increase.
To date, few applications have been tested in the real-world environment. The tool implemented in this study shows promising results and is designed to be used with minimal effort in the current clinical setting. The real-world validation of the results achieved is a necessary step to understand the actual effectiveness of the tool and to promote this technology’s integration into everyday clinical practice.
Data availability
The data supporting this study’s findings are held by the Veneto Epidemiological Registry and were used under license for this work, but they are not available to the general public. These data are nonetheless available from Manuel Zorzi upon reasonable request and subject to authorization from the Veneto Epidemiological Registry (Veneto Regional Authority).
Code availability
All code for data cleaning, analysis, and modeling is available at https://github.com/coclab/predict_melanoma_mortality.
References
Abbas O, Miller DD, Bhawan J. Cutaneous malignant melanoma: update on diagnostic and prognostic biomarkers. Am J Dermatopathol. 2014;36(5):363–79. https://doi.org/10.1097/DAD.0b013e31828a2ec5.
Davis LE, Shalin SC, Tackett AJ. Current state of melanoma diagnosis and treatment. Cancer Biol Ther. 2019;20(11):1366–79. https://doi.org/10.1080/15384047.2019.1640032.
Melanoma of the Skin—Cancer Stat Facts. SEER. 2022. https://seer.cancer.gov/statfacts/html/melan.html. Accessed 1 Apr 2022.
Rastrelli M, Tropea S, Rossi CR, Alaibac M. Melanoma: epidemiology, risk factors, pathogenesis diagnosis and classification. In Vivo. 2014;28(6):1005–11.
Pavri SN, Clune J, Ariyan S, Narayan D. Malignant melanoma: beyond the basics. Plast Reconstr Surg. 2016;138(2):330e–40e. https://doi.org/10.1097/PRS.0000000000002367.
Cozzolino Claudia. Machine learning to predict overall short-term mortality in cutaneous melanoma. Poster presented at: Italian Melanoma Intergroup (IMI) XXVIII Congress; September 30, 2022; Florence. 2022. https://www.melanomaimi.it/images/CONGRESSO_NAZ_2022/POSTER_SESSION_2022/P103.pdf. Accessed 21 Dec 2022.
Associazione Italiana di Oncologia Medica, Gruppo di Lavoro Registri Tumori Italiani, SIAPEC-IAP, PASSI e PASSI D’ARGENTO, Osservatorio Nazionale Screening. I Numeri del Cancro in Italia 2021. 2022. https://www.aiom.it/wp-content/uploads/2021/10/2021_NumeriCancro_web.pdf. Accessed 12 Apr 2022.
Buja A, Rugge M, De Luca G, et al. Cutaneous melanoma in alpine population: incidence trends and clinicopathological profile. Curr Oncol. 2022;29(3):2165–73. https://doi.org/10.3390/curroncol29030175.
Keung EZ, Gershenwald JE. The eighth editionAmerican Joint Committee on Cancer (AJCC) melanoma staging system: implications for melanoma treatment and care. Expert Rev Anticancer Ther. 2018;18(8):775–84. https://doi.org/10.1080/14737140.2018.1489246.
Ma E, Hoegler K, Zhou A. Bioinformatic and machine learning applications in melanoma risk assessment and prognosis: a literature review. Genes. 2021;12(11):1751. https://doi.org/10.3390/genes12111751.
Gareau DS, Browning J, Correa Da Rosa J, et al. Deep learning-level melanoma detection by interpretable machine learning and imaging biomarker cues. J Biomed Opt. 2020. https://doi.org/10.1117/1.JBO.25.11.112906.
Ali Z, Yousaf N, Larkin J. Melanoma epidemiology, biology and prognosis. Eur J Cancer Suppl. 2013;11(2):81–91. https://doi.org/10.1016/j.ejcsup.2013.07.012.
Howard J. Artificial intelligence: Implications for the future of work. Am J Ind Med. 2019;62(11):917–26. https://doi.org/10.1002/ajim.23037.
Hamet P, Tremblay J. Artificial intelligence in medicine. Metabolism. 2017;69:S36–40. https://doi.org/10.1016/j.metabol.2017.01.011.
Mintz Y, Brodie R. Introduction to artificial intelligence in medicine. Minim Invasive Ther Allied Technol. 2019;28(2):73–81. https://doi.org/10.1080/13645706.2019.1575882.
Chen M, Decary M. Artificial intelligence in healthcare: an essential guide for health leaders. Healthc Manage Forum. 2020;33(1):10–8. https://doi.org/10.1177/0840470419873123.
Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP. Introduction to Machine Learning, Neural Networks, and Deep Learning. Transl Vis Sci Technol. 2020;9(2):14.
Elmore JG, Lee CI. Data quality, data sharing, and moving artificial intelligence forward. JAMA Netw Open. 2021;4(8):e2119345. https://doi.org/10.1001/jamanetworkopen.2021.19345.
Jiang T, Gradus JL, Rosellini AJ. Supervised machine learning: a brief primer. Behav Ther. 2020;51(5):675–87. https://doi.org/10.1016/j.beth.2020.05.002.
Sultan AS, Elgharib MA, Tavares T, Jessri M, Basile JR. The use of artificial intelligence, machine learning and deep learning in oncologic histopathology. J Oral Pathol Med. 2020;49(9):849–56. https://doi.org/10.1111/jop.13042.
Ju A, Tang J, Chen S, Fu Y, Luo Y. Pyroptosis-related gene signatures can robustly diagnose skin cutaneous melanoma and predict the prognosis. Front Oncol. 2021. https://doi.org/10.3389/fonc.2021.709077.
Richter AN, Khoshgoftaar TM. Efficient learning from big data for cancer risk modeling: a case study with melanoma. Comput Biol Med. 2019;110:29–39. https://doi.org/10.1016/j.compbiomed.2019.04.039.
Li A, Li X, Li W, Yu X, Qi M, Li D. Application of deep learning on the prognosis of cutaneous melanoma based on full scan pathology images. BioMed Res Int. 2022;2022:1–6. https://doi.org/10.1155/2022/4864485.
Comes MC, Fucci L, Mele F, et al. A deep learning model based on whole slide images to predict disease-free survival in cutaneous melanoma patients. Sci Rep. 2022;12(1):20366. https://doi.org/10.1038/s41598-022-24315-1.
Liu W, Zhu Y, Lin C, Liu L, Li G. An online prognostic application for melanoma based on machine learning and statistics. J Plast Reconstr Aesthet Surg. 2022;75(10):3853–8. https://doi.org/10.1016/j.bjps.2022.06.069.
Yang J, Pan Z, Zhao F, et al. A nomogram for predicting survival in patients with nodular melanoma: a population-based study. Medicine. 2019;98(24):e16059. https://doi.org/10.1097/MD.0000000000016059.
Tang H, Sun N, Shen S. Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: experiments on osteosarcoma subtypes. J Pathol Inform. 2021;12(1):30. https://doi.org/10.4103/jpi.jpi_78_20.
Ferre F, de Belvis AG, Valerio L, et al. Italy: health system review. Health Syst Transit. 2014;16(4):1–168.
Portale Sanità Regione del Veneto - PDTA Melanoma. 2020. https://salute.regione.veneto.it/web/rov/pdta-melanoma. Accessed 20 Apr 2022.
AIOM. LINEE GUIDA MELANOMA EDIZIONE 2020. https://www.aiom.it/wp-content/uploads/2020/10/2020_LG_AIOM_Melanoma.pdf. Accessed 20 Apr 2022.
National Comprehensive Cancer Network Cutaneous melanoma guidelines. NCCN. 2021. https://www.nccn.org/guidelines/guidelines-detail. Accessed 22 Mar 2022.
Overview | Melanoma: assessment and management | Guidance | NICE. 2022. https://www.nice.org.uk/guidance/ng14. Accessed 22 Mar 2022.
Buja A, Rugge M, De Luca G, et al. Clinical performance indicators for monitoring the management of cutaneous melanoma: a population-based perspective. Melanoma Res. 2022;32(5):353–9. https://doi.org/10.1097/CMR.0000000000000841.
Pearson KLIII. On lines and planes of closest fit to systems of points in space. London Edinburgh Dublin Philos Mag J Sci. 1901;2(11):559–72. https://doi.org/10.1080/14786440109462720.
Cox DR. The regression analysis of binary sequences. J Roy Stat Soc: Ser B (Methodol). 1958;20(2):215–42.
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97. https://doi.org/10.1007/BF00994018.
Ho TK. Random decision forests. In: proceedings of 3rd international conference on document analysis and recognition. Vol 1. 1995:278–282 vol.1. doi:https://doi.org/10.1109/ICDAR.1995.598994
Altman NS. An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat. 1992;46(3):175–85. https://doi.org/10.2307/2685209.
McCulloch WS, Pitts W. A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics. 1943;5(4):115–33. https://doi.org/10.1007/BF02478259.
Agarap AF. Deep Learning using Rectified Linear Units (ReLU). Published online February 7, 2019. http://arxiv.org/abs/1803.08375. Accessed 16 Dec 2022
Pratiwi H, Windarto AP, Susliansyah S, et al. Sigmoid activation function in selecting the best model of artificial neural networks. J Phys: Conf Ser. 2020;1471(1):012010. https://doi.org/10.1088/1742-6596/1471/1/012010.
Krogh A, Hertz J. A simple weight decay can improve generalization. In: advances in neural information processing systems. Vol 4. Morgan-Kaufmann; 1991. 2022. https://proceedings.neurips.cc/paper/1991/hash/8eefcfdf5990e441f0fb6f3fad709e21-Abstract.html. Accessed 16 Dec 2022.
Kingma DP, Ba J. Adam: a method for stochastic optimization. Published online January 29, 2017. 2022. http://arxiv.org/abs/1412.6980. Accessed 16 Dec 2022.
Good IJ. Rational decisions. J Roy Stat Soc: Ser B (Methodol). 1952;14(1):107–14. https://doi.org/10.1111/j.2517-6161.1952.tb00104.x.
Refaeilzadeh P, Tang L, Liu H. Cross-validation. In: Özsu MT, LIU L, editors. Encyclopedia of database systems. New York: Springer, US; 2009. p. 532–8.
LaValle SM, Branicky MS, Lindemann SR. On the relationship between classical grid search and probabilistic roadmaps. Int J Robotics Res. 2004;23(7–8):673–92. https://doi.org/10.1177/0278364904045481.
Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12(85):2825–30.
Abadi M, Barham P, Chen J, et al. TensorFlow: A System for Large-Scale Machine Learning. In: Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation. OSDI’16. USENIX Association; 2016:265–283.
Feature Importance in Random Forests. Alexis Perrier—Data Science. Published August 27, 2015. 2015. https://alexisperrier.com/datascience/2015/08/27/feature-importance-random-forests-gini-accuracy.html. Accessed 21 Dec 2022.
Arora C, Kaur D, Lathwal A, Raghava GPS. Risk prediction in cutaneous melanoma patients from their clinico-pathological features: superiority of clinical data over gene expression data. Heliyon. 2020;6(8):e04811. https://doi.org/10.1016/j.heliyon.2020.e04811.
Wu PC, Chen YC, Chen HM, Chen LW. Prognostic factors and population-based analysis of melanoma with sentinel lymph node biopsy. Sci Rep. 2021;11(1):20524. https://doi.org/10.1038/s41598-021-99950-1.
Bobos M. Histopathologic classification and prognostic factors of melanoma: a 2021 update. Ital J Dermatol Venereol. 2021. https://doi.org/10.23736/S2784-8671.21.06958-3.
Kanaki T, Stang A, Gutzmer R, et al. Impact of American Joint Committee on Cancer 8th edition classification on staging and survival of patients with melanoma. Eur J Cancer. 2019;119:18–29. https://doi.org/10.1016/j.ejca.2019.06.011.
Kycler W, Grodecka-Gazdecka S, Bręborowicz J, Filas V, Teresiak M. Prognostic factors in melanoma. Rep Pr Oncol Radiother. 2006;11(1):39–48. https://doi.org/10.1016/S1507-1367(06)71048-3.
Bartlett E, Karakousis G. Current staging and prognostic factors in melanoma. Surg Oncol Clin N Am. 2015. https://doi.org/10.1016/j.soc.2014.12.001.
Triantafyllidis AK, Tsanas A. Applications of machine learning in real-life digital health interventions: review of the literature. J Med Internet Res. 2019;21(4):e12286. https://doi.org/10.2196/12286.
Jiang Y, Yang M, Wang S, Li X, Sun Y. Emerging role of deep learning-based artificial intelligence in tumor pathology. Cancer Commun. 2020;40(4):154–66. https://doi.org/10.1002/cac2.12012.
Wong SL, Kattan MW, McMasters KM, Coit DG. A nomogram that predicts the presence of sentinel node metastasis in melanoma with better discrimination than the American Joint Committee on cancerstaging system. Ann Surg Oncol. 2005;12(4):282–8. https://doi.org/10.1245/ASO.2005.05.016.
Safran T, Viezel-Mathieu A, Corban J, Kanevsky A, Thibaudeau S, Kanevsky J. Machine learning and melanoma: the future of screening. J Am Acad Dermatol. 2018;78(3):620–1. https://doi.org/10.1016/j.jaad.2017.09.055.
Acknowledgements
Not applicable.
Dual publication
Part of this work was presented at the XXVIII Congress of the Italian Melanoma Intergroup 2022 and published online as a poster (https://www.melanomaimi.it/images/CONGRESSO_NAZ_2022/POSTER_SESSION_2022/P103.pdf. Accessed November 10, 2022). Although several sections of this article overlap with the poster, the results and figures shown here are original and new, thus not creating dual publication issues.
Funding
This research has received "Current Research" funds from the Italian Ministry of Health to cover publication costs.
Author information
Authors and Affiliations
Contributions
Conceptualization, AB; methodology, CC; software, CC; formal analysis, CC, and ABr; investigation, AB, and CC; data curation, CC, MZ, PDF and ST; visualization, CC; writing—original draft preparation, CC, AB, and AM; writing—review and editing, AB, AM, CC, MR, AV, and GD; supervision, AB, MR, LDO, CRR, and SM; project administration, AB and SM. All authors have read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethical approval and consent to participate
The study adheres to the Declaration of Helsinki and Resolution No. 9/2016 of the Italian Personal Data Protection Authority, with the latter also confirming the permissibility of processing personal data for medical, biomedical, and epidemiological research, as well as the permissibility of using data concerning the status of people’s health in aggregate form in scientific studies. To protect privacy and anonymity, the Veneto Regional Authority removes all direct identifiers and replaces them with a code number in all datasets, while retaining the ability to link data from different administrative databases. In this case, according to Resolution No. 9/2016 of the Italian Personal Data Protection Authority, written consent from patients is not required. Ethical approval for the study was obtained from the Veneto Oncological Institute’s Ethics Committee (No. 52/2016).
Consent for publication
This manuscript does not include any detail, image, or video relating to an individual person, hence informed consent for the publication was not required.
Competing interests
The authors have no competing interest to declare.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cozzolino, C., Buja, A., Rugge, M. et al. Machine learning to predict overall short-term mortality in cutaneous melanoma. Discov Onc 14, 13 (2023). https://doi.org/10.1007/s12672-023-00622-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12672-023-00622-5