Abstract
Purpose
Watch and wait strategy is a safe and effective alternative to surgery in patients with locally advanced rectal cancer (LARC) who have achieved pathological complete response (pCR) after neoadjuvant therapy (NAT); present restaging methods do not meet clinical needs. This study aimed to construct a machine learning (ML) model to predict pCR preoperatively.
Methods
LARC patients who received NAT were included to generate an extreme gradient boosting-based ML model to predict pCR. The group was divided into a training set and a tuning set at a 7:3 ratio. The SHapley Additive exPlanations value was used to quantify feature importance. The ML model was compared with a nomogram model developed using independent risk factors identified by conventional multivariate logistic regression analysis.
Results
Compared with the nomogram model, our ML model improved the area under the receiver operating characteristics from 0.72 to 0.95, sensitivity from 43 to 82.2%, and specificity from 87.1 to 91.6% in the training set, the same trend applied to the tuning set. Neoadjuvant radiotherapy, preoperative carbohydrate antigen 125 (CA125), CA199, carcinoembryonic antigen level, and depth of tumor invasion were significant in predicting pCR in both models.
Conclusion
Our ML model is a potential alternative to the existing assessment tools to conduct triage treatment for patients and provides reference for clinicians in tailoring individual treatment: the watch and wait strategy is used to avoid surgical trauma in pCR patients, and non-pCR patients receive surgical treatment to avoid missing the optimal operation time window.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Neoadjuvant therapy (NAT) followed by total mesorectal excision (TME) is the standard of care for locally advanced rectal cancer (LARC) with curative intent. NAT was shown to downsize and downstage cancer, increase the R0 resection rate, and thus improve oncological outcomes [1, 2]. Treatment response to NAT varies between individuals, from insensitive to pathologic complete response (pCR), with no visible cancer cells in the pathologic specimens, accounting for 20% of the population [3]. The prognosis of pCR patients is better than that of non-pCR patients. Nowadays, preoperative tumor restaging after NAT is generally studied by digital rectal examination, endoscopic ultrasonography, magnetic resonance imaging (MRI), and endoscopy in clinical practice. For those who attain clinical complete response (cCR, predict as pCR preoperative), an organ-sparing protocol called watch and wait (W&W) strategy has received considerable attention in recent years. This strategy refers to the option to exempt patients from major surgery and implement intensive surveillance in the first 3 years after NAT [4]. Previous studies have demonstrated the safety and effectiveness of W&W in cCR patients, yielding little oncologic risk other than the risk of surgery [4,5,6,7,8,9,10,11]. However, the gold standard of tumor response is only available in surgical specimens. Definition of cCR lacks uniform criteria because human assessment introduce subjectivity; as a result, the reported cCR rates range from 10 to 78%, followed by tumor regrowth rates ranging from 7 to 33% [9, 12,13,14]. For some examples, a prospective cohort study comprising 385 LARC patients with cCR revealed that 87 patients developed local recurrence during the near 3-year follow-up period, of whom 97% could receive curative resection, and uncontrolled pelvic disease was 1.6% [15]. Data from the International Watch & Wait Database, which has 47 participating institutions from 15 countries in Europe, revealed a 2-year accumulated local regrowth rate of 25% in cCR patients, and 97% of them were located in the bowel wall, which requires salvage surgery [9].
With the improvement of survival outcome, patients pay more attention to pursue a higher quality of life (QOL). Nonoperative management in a highly selected population was proven to be feasible as it spares permanent colostomy, sphincterotomy, bladder, and sexual dysfunction, and avoids 1–5% surgical mortality and 34–39% morbidity, thus ensuring good QOL [16,17,18,19]. Therefore, for restaging unmet clinical needs, a more precise and easy-to-use tool with high performance on pCR prediction is imperative.
Machine learning (ML), a branch of artificial intelligence and computer science, focusing on simulating the way the human brain learns using statistics and algorithms to improve accuracy, has had an outstanding performance in disease diagnosis, prognosis prediction, antitumor drug response, and treatment response assessment [20,21,22,23,24,25]. However, studies evaluating tumor response in LARC after NAT using ML algorithms are limited. In this study, we aimed to integrate some commonly used preoperative clinicopathological parameters to develop an ML classifier to predict tumor response of LARC after NAT, and compare its performance with nomogram constructed by the conventional logistic regression method, with the aim to provide reference for individualized treatment.
Methods
Ethical statement
This study was approved by the Institutional Review Board of the Sixth Affiliated Hospital, Sun Yat-sen University (No. 2021ZSLYEC-063), and complied with the Declaration of Helsinki (World Medical Association Declaration of Helsinki 2013). All the patients gave informed consent to allow their electronic medical records to be used for cancer research voluntarily during their first visit to the hospital, and data were desensitized at the beginning of statistical analyses to protect patients’ privacy.
Study population
We included patients diagnosed with rectal cancer from 2010 to 2020 at the Sixth Affiliated Hospital, Sun Yat-sen University. Patient lists were extracted from pathology report records, and only those who received NAT were selected. Neoadjuvant chemotherapy was based on oral/intravenous 5-fluorouracil or combined with oxaliplatin/irinotecan. The neoadjuvant radiotherapy regimen was a long-term radiotherapy of 50 Gy in 25 fractions, or a short-term radiotherapy regimen of 5 Gy per day for 5 consecutive days.
The eligibility criteria were as follows: (1) T3–4/N + (conventionally staged by MRI and computed tomography), (2) age 18–75 years, (3) received a full course of NAT, (4) regular restaging that adhered to the treatment protocol, (5) underwent radical surgery, and (6) tumor regression grade information available in the surgical pathology report. Patients were excluded if they had (1) multiple synchronous tumors or newly discovered tumors during treatment; (2) familial adenomatous polyposis, inflammatory bowel disease, or other diseases that predispose to colorectal cancer; and (3) local excision of the primary tumor only.
The patients were divided into a training set and a tuning set (similar to internal validation set) at a 7:3 ratio. The training set was used to build binary classifiers, which in turn was used to predict the cluster labels on the tuning set (held-out samples).
Clinicopathological selection and outcome definition
Patient, tumor, and treatment information were extracted from the electronic medical records. Patient information mainly included demographic characteristics (sex, age, height, weight, body mass index [BMI], family history, and tumor history); tumor information included radiology findings (MRI: tumor invasion depth, distance from the anal verge, tumor location, tumor size, and tumor recurrence), endoscopic findings (tumor size by endoscopy: the proportion that the tumor occupied the intestinal lumen), and peri-NAT serological tumor biomarkers (cancer antigen 125 [CA125], CA199, CA153, carcinoembryonic antigen [CEA], and alpha-fetoprotein [AFP]). T1, T2, and DWI sequences were used to comprehensively translate tumor information on MRI. Given that CEA and CA199 play important roles in colorectal cancer progression and recurrence surveillance, we postulated that the changes in these biomarkers may reflect the microscopic alterations in the tumor after NAT; hence, we included the CEA difference, ratio of CEA difference, CA199 difference, and ratio of CA199 difference into the analysis. Treatment information included neoadjuvant radiotherapy and neoadjuvant chemotherapy regimen. The oncological outcome was pCR, which was determined by tumor regression grade 0 (TRG0) without lymph node metastasis; otherwise, it was non-pCR (TRG1-3). TRG was justified in accordance with the National Comprehensive Cancer Network guidelines [26]. In this study, the area under the receiver operating characteristics (AUROC) was considered the primary index to evaluate the performance of the models, with sensitivity and specificity as secondary evaluation indexes.
Model development
ML framework
An ML ensemble algorithm based on extreme gradient boosting (XGBoost) [27] was used to predict tumor response in LARC patients. XGBoost easily captures the nonlinear relationships between features and classification labels based on the concept of “exact greedy algorithm.” Essentially, it aggregates multiple weak classifiers with limited accuracy into stronger ones by establishing a series of classification and regression trees (CARTS) iteratively with a highly adaptive approach. This was performed by iteratively fitting decision trees, with each iteration targeting the prediction residuals (also known as the loss function) of the preceding tree. The XGBoost algorithm optimizes the loss function via second-order Taylor expansion; meanwhile, a regularization term that contained leaf nodes was applied to control the model complexity to avoid over-fitting. Gains or losses after splitting of previous lead nodes determined further splitting, which was repeated until the minimum loss function and complexity of the model was less than the threshold setting. Therefore, the final predicted probability was calculated for all trees.
When training models, hyperparameters, including max_depth, subsample, and colsample_by tree, were optimized separately for each tree in the training set by random sampling. In addition, the effect of imbalanced data was corrected by oversampling and undersampling [28, 29], and the XGBoost model was trained with the generated effectively balanced dataset. To further strengthen generalizability, a five-fold cross validation was used to split the whole training set into five subsets, with four subsets as the training set and the rest as the test subset each time until each subset was tested. For the features, the average value of the five trained XGBoost models was taken to measure their importance. SHapley Additive exPlanations (SHAP) [30] value was calculated to interpret the marginal contribution of the model features and explain the output of the model.
Nomogram model building
To compare ML models with conventional linear models, univariate analyses were used to identify factors that correlated with tumor response and incorporate them into a binary stepwise multivariate logistic regression model to identify independent risk factors. Thereafter, we constructed a nomogram model that used these independent risk factors and validated them internally. Harrell’s concordance index (C-index) and AUROC were calculated to examine the predicted power of the nomogram, and a calibration curve was drawn to judge the discrepancy between predicted and actual events.
Statistical analyses
Data are presented as mean ± standard deviation for continuous variables with normal distribution, medians with interquartile ranges for continuous variables with non-normal distribution, and frequency (percentage) for categorical variables. During nomogram construction, some continuous variables were transformed into categorical variables for analysis. t-Test/Mann–Whitney test or chi-square were used to conduct univariate analysis where appropriate, and multivariate analyses were implemented using the logistic regression method. The variables significantly related to pCR in univariate analysis were included in multivariate regression analysis, and the independent risk factors obtained from multivariate regression analysis were used for nomogram modeling. All statistical analyses were conducted using SPSS software (version 26.0 for Windows, IBM Corp., Armonk, NY, USA) and R software (version 3.4.1, The R Foundation for Statistical Computing, Austria). The ML model was developed by the “xgboost” package, “shapforxgboost” package, and “imbalance” package. ROC was plotted by the “pROC” package. A two-sided p value < 0.05 was considered significant.
Results
Patient characteristics
A total of 962 consecutive LARC patients during the 2010–2020 period from the Sixth Affiliated Hospital, Sun Yat-sen University, were retrospectively included in this study (Fig. 1). The training and tuning sets included 694 and 268 patients, respectively, with a comparable pCR rate between the groups (15.4% vs 14.9%). Compared with the training set, clinical T stage evaluated qualitatively by the depth of tumor invasion was more advanced with 100% T3–4 tumors. A higher prevalence of upper-third rectal cancer, a higher level of preoperative CEA, and a tendency of larger tumor size by colonoscopy were observed, which combined in all manifestations of more invasive tumors in the tuning set. In addition, the proportion of radiotherapy was lower, and chemotherapy regimens were significantly different from the training set, which may help explain the poorer downward trend in CEA difference and ratio of CEA difference for LARC patients after receiving NAT in the tuning set (Table 1). Other clinicopathological parameters were comparable between groups.
ML model building and validation
Twenty-eight variables were used as inputs to determine their nonlinear relationships with category labels. We first processed the training set to ensure a balanced and representative dataset, then started running the ML model with default hyperparameters, and adjusted them into an optimal model step by step. Finally, we stopped training at “max_depth” of 10, “eta” of 0.2, “gamma” of 0.9, and “scale_pos_weight” of 0.1 after repeated attempts. As displayed in Fig. 2, the AUROC of the XGBoost model was 0.95, with a sensitivity of 82.2% and specificity of 91.6%, which indicated a perfect fit between the predicted results and actual events. In addition, we used the SHAP value to quantify feature importance, calculated feature importance scores (Fig. 3A) for each sample, and averaged features to obtain their final importance ranks (Fig. 3B). As shown in Fig. 3A, we were aware of the nonlinear relationship between variables and labels that conventional statistical methods could ignore. Figure 3B shows that neoadjuvant radiotherapy, preoperative CA125, tumor size by MRI, and depth of tumor invasion by MRI ranked the top four in terms of importance in our ML model, while CA199 difference, preoperative CA153, initial AFP, and ratio of CA199 difference made little contribution. Unsurprisingly, the results in the tuning set were highly consistent with the training set, with AUROC, sensitivity, and specificity of 0.73, 71.93%, and 70%, respectively. For the feature importance evaluation, both sets showed similar trends.
Nomogram construction and validation
Univariate analysis revealed that the depth of tumor invasion, tumor size on MRI, tumor size by colonoscopy, initial CEA level, preoperative CA125 level, preoperative CEA, preoperative CA199, and neoadjuvant radiotherapy were significantly associated with tumor response. Then, we incorporated these factors into a multivariate stepwise regression analysis to find preoperative CA125, preoperative CEA, preoperative CA199, and neoadjuvant radiotherapy were independent risk factors (Table 2). A nomogram model was then applied to quantitatively visualize the contribution of each variable based on the multivariate analysis (Fig. 4A). As a result, the C-index (equal to AUROC) was 0.72, indicating a moderate predictive ability (Fig. 4B). Calibration showed that the predictive curve was close to the ideal curve (Fig. 4C). Using optimal threshold criteria, this model can accurately distinguish pCR from non-pCR with 43% sensitivity and 87.1% specificity. A similar result was obtained in the validation group (AUROC, 0.69; sensitivity, 55%; specificity, 78.5%) (Fig. 4D).
Comparison between ML classifier and nomogram model
As shown in Table 3, we compared the performance of the aforementioned models. Undoubtedly, the ML classifier presented full range superiority in AUROC, sensitivity, and specificity over the nomogram model.
Discussion
The introduction of standard management in LARC (NAT + TME) has improved cancer survival and local control in recent years. However, the physical, mental, and emotional burden caused by invasive surgery has opened the discussion on the trade-off between recurrence risk and better QOL in cCR patients. Given the evidence that the conservative approach, i.e., W&W, taken by cCR patients to avoid morbidity and functional sequelae resection was not inferior to surgery in terms of prognosis, W&W was reported as a safe alternative to major surgery [6, 8]. A questionnaire survey of patients’ perspectives showed that even by emphasizing the detailed risks toward LARC patients with cCR, 83% of them were willing to consider the W&W strategy, 94% of them were willing to bear the 25% recurrence rate, and 95% accepted intensive surveillance despite its being troublesome [31]. However, there were no uniform and reproducible criteria for cCR, which led to a low concordance rate between cCR and pCR. Previous studies have shown that the true pCR rate in those who were assessed as cCR ranged from 44 to 78%, and a proportion of approximately 10% pCR was misjudged as non-cCR. In addition, the sensitivity of biopsies was 12.9%, which was slightly consistent with 30.4% of the surgical specimens [32, 33]. Moreover, current restaging methods such as digital rectal examination, endoscopy, computed tomography, MRI, or positron emission tomography–computed tomography were not capable of being surrogate methods of surgical pathological assessment to observe tumor response. Therefore, a precise and objective tool for the preoperative assessment of pCR is urgently needed. To address this, we constructed an ML classifier to predict pCR, which demonstrated high performance, and compared it with the conventional linear model to realize individualized precision therapy.
The AUROC of our ML classifier in discriminating binary tumor response of LARC patients after NAT was 0.95, with a sensitivity of 82.2% and specificity of 91.6%, and it achieved slightly lower values in the tuning set with an AUROC of 0.73, sensitivity of 71.93%, and specificity of 70%. When we analyzed the dataset, we found that demographic characteristics, depth tumor invasion, preoperative CEA, neoadjuvant radiotherapy, CEA difference, and the ratio of CEA difference were not comparable between the training and tuning sets; whereas, it ranked the top few important parameters in both the ML classifier and nomogram model. This may fully explain the moderate performance on the tuning set. From the perspective of data analysis, the performance of our model could be optimized if we reassigned the training and tuning sets or performed propensity-score matching to ensure comparability of key features; however, our current results reflect daily clinical practice where protocols are subject to variations between different centers. Therefore, this may render our model more generally applicable. The tuning set conditions could exist in clinical practice, and we could not expect data distribution to meet the required standard. This, in another way, makes our model more representative even in some extreme situations. Based on these findings, we believe that the current results of the tuning set were the worst-case scenario that may prevail in clinical practice. However, the predictive ability was still far superior to that of the conventional linear regression model (nomogram). In addition, our ML classifier cannot only identify independent associated factors like that of the nomogram, but allows the determination of the internal law of the data for better use of the additional information than the linear model.
Neoadjuvant radiotherapy, which was recommended in various guidelines as part of the standard treatment in LARC patients, ranked first and second in feature importance in the ML classifier and nomogram, respectively. The results provide strong evidence for the guidelines once again. Previous studies have investigated the role of tumor size and pretreatment T stage as predictors of tumor response in LARC patients after NAT, and our study was consistent with their findings to show that indicators measured by MRI may help predict pCR outcomes [10, 34,35,36]. The use of CEA, CA125, and CA199 alone or in combination was reported to help in making decisions for digestive tract tumors or adenocarcinomas as prognostic indicators or in monitoring therapeutic effects [37,38,39,40,41], while the use of serological indications alone is insufficient for both sensitivity and specificity. These were in line with our findings; moreover, we found that indicator changes (e.g., CEA difference) before and after treatment may reflect treatment responses within the tumor to some extent. However, little research focused on this aspect, and this insight suggested a new view possibility in the monitoring of treatment response. Further studies are needed to investigate the value of indicator changes presenting in blood, radiology, or other areas.
Many studies predict pCR in LARC patients after NAT. van der Sande et al. [42] initiated a study in which three endoscopists predicted pCR using endoscopic features. The results showed that the sensitivity, specificity, and AUROC of human recognition fluctuated between 72 and 94%, 61 and 85%, and 0.80 and 0.84, respectively. The interobserver consistency was mild to moderate. The performance of our model was better than that of van der Sande et al. and a ML model could eliminate the empirical dependence of human recognition. Besides, Bulens et al. [43] constructed multiple MRI radiomics by T1, T2, and DWI sequences, in which the highest AUROC was 0.86. Also, Jin et al. [44] proposed a multi-task deep learning model to predict pCR using MRI imaging, the AUROC reached 0.97 when they combined imaging analysis with CEA, and the model was well externally validated in multi-center data. Although their model showed outstanding performance using artificial intelligence algorithm, different magnetic field intensity hampered them from clinical practice. However, our ML model was constructed using common clinicopathological parameters, which made it easy to apply to other centers.
This study has several limitations that restrict the interpretations of our findings. First, selection bias cannot be excluded because of the retrospective nature of this study. Second, our models were internally validated using a tuning set within our institution; thus, external validation is needed. Although pooling of data permits additional analyses, interinstitutional biases and variations in treatment practices may affect data interpretation. A large single-institution experience with standardized treatment and pathologic evaluation can avoid this limitation. To the best of our knowledge, this study included the largest number of cases in related research. Huang et al. [36] built several ML classifiers to find an artificial neural network (ANN) algorithm with the best performance in predicting tumor response compared with k-nearest neighbor, support vector machine, naïve Bayes classifier, and multiple logistic regression. The performance of the ANN model was similar to that of our XGBoost classifier, but the relatively small number of patients from the same cohort may impart substantial bias and poor generalizability. Finally, some variables included in our model were highly subjective and relied on personal experience. Despite these, we are making efforts to include more variables and patients to improve the predictive performance of the XGBoost model and collecting multicenter data for external validation. Once the predictive performance of the model meets the clinical needs, we will translate it into an online risk calculator that is free to the public [45]. Clinicians need to enter the specific values of the variables, and the calculator outputs the probability of pCR.
We constructed an ML classifier with a high-volume database using easily obtainable preoperative clinicopathological parameters to accurately predict the binary tumor response of LARC patients after NAT, which outperformed the conventional linear model in predicting ability. It may serve as a robust tool for tumor response prediction and improve QOL in patients with pCR due to the omission of major surgery.
Change history
14 July 2022
A Correction to this paper has been published: https://doi.org/10.1007/s00384-022-04215-6
References
Maas M, Nelemans PJ, Valentini V, Das P, Rödel C, Kuo LJ et al (2010) Long-term outcome in patients with a pathological complete response after chemoradiation for rectal cancer: a pooled analysis of individual patient data. Lancet Oncol 11(9):835–844
Dossa F, Chesney TR, Acuna SA, Baxter NN (2017) A watch-and-wait approach for locally advanced rectal cancer after a clinical complete response following neoadjuvant chemoradiation: a systematic review and meta-analysis. The lancet Gastroenterology & hepatology 2(7):501–513
Martin ST, Heneghan HM, Winter DC (2012) Systematic review and meta-analysis of outcomes following pathological complete response to neoadjuvant chemoradiotherapy for rectal cancer. Br J Surg 99(7):918–928
Fernandez LM, São Julião GP, Figueiredo NL, Beets GL, van der Valk MJM, Bahadoer RR et al (2021) Conditional recurrence-free survival of clinical complete responders managed by watch and wait after neoadjuvant chemoradiotherapy for rectal cancer in the International Watch & Wait Database: a retrospective, international, multicentre registry study. Lancet Oncol 22(1):43–50
Park IJ, You YN, Agarwal A, Skibber JM, Rodriguez-Bigas MA, Eng C et al (2012) Neoadjuvant treatment response as an early response indicator for patients with rectal cancer. Journal of clinical oncology : official journal of the American Society of Clinical Oncology 30(15):1770–1776
Renehan AG, Malcomson L, Emsley R, Gollins S, Maw A, Myint AS et al (2016) Watch-and-wait approach versus surgical resection after chemoradiotherapy for patients with rectal cancer (the OnCoRe project): a propensity-score matched cohort analysis. Lancet Oncol 17(2):174–183
Shin YS, Yu CS, Park JH, Kim JC, Lim SB, Park IJ et al (2017) Total mesorectal excision versus local excision after favorable response to preoperative chemoradiotherapy in “early” clinical T3 rectal cancer: a propensity score analysis. Int J Radiat Oncol Biol Phys 99(1):136–144
Dattani M, Heald RJ, Goussous G, Broadhurst J, São Julião GP, Habr-Gama A et al (2018) Oncological and survival outcomes in watch and wait patients with a clinical complete response after neoadjuvant chemoradiotherapy for rectal cancer: a systematic review and pooled analysis. Ann Surg 268(6):955–967
van der Valk MJM, Hilling DE, Bastiaannet E, Meershoek-Klein Kranenbarg E, Beets GL, Figueiredo NL et al (2018) Long-term outcomes of clinical complete responders after neoadjuvant treatment for rectal cancer in the International Watch & Wait Database (IWWD): an international multicentre registry study. Lancet (London, England) 391(10139):2537–2545
Smith JJ, Strombom P, Chow OS, Roxburgh CS, Lynn P, Eaton A et al (2019) Assessment of a watch-and-wait strategy for rectal cancer in patients with a complete response after neoadjuvant therapy. JAMA Oncol 5(4):e185896
Spiegel DY, Boyer MJ, Hong JC, Williams CD, Kelley MJ, Moore H et al (2019) Long-term clinical outcomes of nonoperative management with chemoradiotherapy for locally advanced rectal cancer in the Veterans Health Administration. Int J Radiat Oncol Biol Phys 103(3):565–573
Pang K, Rao Q, Qin S, Jin L, Yao H, Zhang Z (2019) Prognosis comparison between wait and watch and surgical strategy on rectal cancer patients after treatment with neoadjuvant chemoradiotherapy: a meta-analysis. Ther Adv Gastroenterol 12:1756284819892477
Pinto JC, Pereira AD, Pimenta A, Pedro C, Fernandez G, Marques I et al (2020) Low rectal cancer treatment strategies: a cohort study assessing watch and wait. J Cancer Res Clin Oncol 146(10):2631–2638
Asoglu O, Tokmak H, Bakir B, Demir G, Ozyar E, Atalar B et al (2020) The impact of total neo-adjuvant treatment on nonoperative management in patients with locally advanced rectal cancer: the evaluation of 66 cases. European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology 46(3):402–409
van der Sande ME, Figueiredo N, Beets GL (2020) Management and outcome of local regrowths in a watch-and-wait prospective cohort for complete responses in rectal cancer. Ann Surg
Borowski DW, Bradburn DM, Mills SJ, Bharathan B, Wilson RG, Ratcliffe AA et al (2010) Volume-outcome analysis of colorectal cancer-related outcomes. Br J Surg 97(9):1416–1430
Paun BC, Cassie S, MacLean AR, Dixon E, Buie WD (2010) Postoperative complications following surgery for rectal cancer. Ann Surg 251(5):807–818
Rödel C, Liersch T, Becker H, Fietkau R, Hohenberger W, Hothorn T et al (2012) Preoperative chemoradiotherapy and postoperative chemotherapy with fluorouracil and oxaliplatin versus fluorouracil alone in locally advanced rectal cancer: initial results of the German CAO/ARO/AIO-04 randomised phase 3 trial. Lancet Oncol 13(7):679–687
Rullier E, Rouanet P, Tuech JJ, Valverde A, Lelong B, Rivoire M et al (2017) Organ preservation for rectal cancer (GRECCAR 2): a prospective, randomised, open-label, multicentre, phase 3 trial. Lancet (London, England) 390(10093):469–479
Komura D, Ishikawa S (2018) Machine learning methods for histopathological image analysis. Comput Struct Biotechnol J 16:34–42
Levine AB, Schlosser C, Grewal J, Coope R, Jones SJM, Yip S (2019) Rise of the machines: advances in deep learning for cancer diagnosis. Trends in cancer 5(3):157–169
Duan S, Cao H, Liu H, Miao L, Wang J, Zhou X et al (2020) Development of a machine learning-based multimode diagnosis system for lung cancer. Aging 12(10):9840–9854
Mudali D, Jeevanandam J, Danquah MK (2020) Probing the characteristics and biofunctional effects of disease-affected cells and drug response via machine learning applications. Crit Rev Biotechnol 40(7):951–977
Chen S, Zhang N, Jiang L, Gao F, Shao J, Wang T et al (2021) Clinical use of a machine learning histopathological image signature in diagnosis and survival prediction of clear cell renal cell carcinoma. Int J Cancer 148(3):780–790
Tao W, Concepcion AN, Vianen M, Marijnissen ACA, Lafeber F, Radstake T et al (2021) Multiomics and machine learning accurately predict clinical response to adalimumab and etanercept therapy in patients with rheumatoid arthritis. Arthritis & rheumatology (Hoboken, NJ) 73(2):212–222
Benson AB, Venook AP, Al-Hawary MM, Arain MA, Chen YJ, Ciombor KK et al (2020) NCCN guidelines insights: rectal cancer, version 6.2020. J Natl Compr Cancer Netw 18(7):806–15
Chen T, Guestrin C (2016) editors. XGBoost: a scalable tree boosting system. The 22nd ACM SIGKDD international conference
Barua S, Islam MM, Yao X, Murase K (2012) MWMOTE—majority weighted minority oversampling technique for imbalanced data set learning 26(2):405–25
Almogahed B, Kakadiaris IA (2014) NEATER: filtering of over-sampled data using non-cooperative game theory. International conference on pattern recognition
Lundberg SM, Erion GG, Lee SI (2018) Consistent individualized feature attribution for tree ensembles
Gani C, Gani N, Zschaeck S, Eberle F, Schaeffeler N, Hehr T et al (2019) Organ preservation in rectal cancer: the patients’ perspective. Front Oncol 9:318
van der Sande ME, Maas M, Melenhorst J, Breukink SO, van Leerdam ME, Beets GL (2019) Predictive value of endoscopic features for a complete response after chemoradiotherapy for rectal cancer. Ann Surg
Jayaprakasam VS, Paroder V, Gibbs P, Bajwa R, Gangai N, Sosa RE et al (2021) MRI radiomics features of mesorectal fat can predict response to neoadjuvant chemoradiation therapy and tumor recurrence in patients with locally advanced rectal cancer. Eur Radiol
Bitterman DS, Resende Salgado L, Moore HG, Sanfilippo NJ, Gu P, Hatzaras I et al (2015) Predictors of complete response and disease recurrence following chemoradiation for rectal cancer. Front Oncol 5:286
Das P, Skibber JM, Rodriguez-Bigas MA, Feig BW, Chang GJ, Wolff RA et al (2007) Predictors of tumor response and downstaging in patients who receive preoperative chemoradiation for rectal cancer. Cancer 109(9):1750–1755
Huang CM, Huang MY, Huang CW, Tsai HL, Su WC, Chang WC et al (2020) Machine learning for predicting pathological complete response in patients with locally advanced rectal cancer after neoadjuvant chemoradiotherapy. Sci Rep 10(1):12555
Park H, Kim KA, Jung JH, Rhie J, Choi SY (2020) MRI features and texture analysis for the early prediction of therapeutic response to neoadjuvant chemoradiotherapy and tumor recurrence of locally advanced rectal cancer. Eur Radiol 30(8):4201–4211
Kang BH, Song C, Kang SB, Lee KW, Lee HS, Kim JS (2020) Nomogram for predicting the pathological tumor response from pre-treatment clinical characteristics in rectal cancer. Anticancer Res 40(4):2171–2177
Hu H, Huang J, Lan P, Wang L, Huang M, Wang J et al (2018) CEA clearance pattern as a predictor of tumor response to neoadjuvant treatment in rectal cancer: a post-hoc analysis of FOWARC trial. BMC Cancer 18(1):1145
Kim HJ, Choi GS, Park JS, Park S, Kawai K, Watanabe T (2015) Clinical significance of thrombocytosis before preoperative chemoradiotherapy in rectal cancer: predicting pathologic tumor response and oncologic outcome. Ann Surg Oncol 22(2):513–519
Huang Y, Zeng J, Liu T, Lin X, Guo P, Zeng J et al (2020) Prognostic significance of elevated preoperative serum CA125 levels after curative hepatectomy for hepatocellular carcinoma. Onco Targets Ther 13:4559–4567
van der Sande ME, Maas M, Melenhorst J, Breukink SO, van Leerdam ME, Beets GL (2021) Predictive value of endoscopic features for a complete response after chemoradiotherapy for rectal cancer. Ann Surg 274(6):e541–e547
Bulens P, Couwenberg A, Intven M, Debucquoy A, Vandecaveye V, Van Cutsem E et al (2020) Predicting the tumor response to chemoradiotherapy for rectal cancer: model development and external validation using MRI radiomics. Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology 142:246–252
Jin C, Yu H, Ke J, Ding P, Yi Y, Jiang X et al (2021) Predicting treatment response from longitudinal images using multi-task deep learning. Nat Commun 12(1):1851
Liang W, Liang H, Ou L, Chen B, Chen A, Li C et al (2020) Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with COVID-19. JAMA Intern Med 180(8):1081–1089
Acknowledgements
We would like to thank Editage (www.editage.cn) for English language editing.
Funding
This work was supported by the National Key R&D Program of China (No. 2017YFC1308800); National Natural Science Foundations of China (No. 81970482, 81770557, and 82070684); Guangdong Natural Science Fund for Outstanding Youth Scholars (No. 2020B151502067), The Sixth Affiliated Hospital of Sun Yat-Sen University Clinical Research- “1010” Program (No. 1010PY(2020)-63), and National Key Clinical Discipline.
Author information
Authors and Affiliations
Contributions
Conception and design: XC, JH, and LL; analysis and interpretation of the data: XC, WW, LX, XH, PL, and JC; drafting of the article: XC, WW, JC, and JH; critical revision of the article for important intellectual content: XC, WW, JC LX, PL, JH, and LL; final approval of the article: all the authors.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: The equal contribution has been presented.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, X., Wang, W., Chen, J. et al. Predicting pathologic complete response in locally advanced rectal cancer patients after neoadjuvant therapy: a machine learning model using XGBoost. Int J Colorectal Dis 37, 1621–1634 (2022). https://doi.org/10.1007/s00384-022-04157-z
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00384-022-04157-z