Introduction

In the precision medicine era, the assessment of molecular signatures of different cancer types is becoming of paramount importance for prognostic and predictive purposes [1, 2]. A prime example is the dramatic change in the management of non-small cell lung cancer (NSCLC) over the last decade. The introduction of different targeted therapeutic options for NSCLC, based on the detection of specific molecular alterations by next-generation sequencing (NGS) technologies, has increased the number of cases to be evaluated [3, 4]. Moreover, the progressive adoption of digital pathology in the diagnostic workflow [5,6,7,8,9] and of artificial intelligence (AI) tools for the simplification of repetitive, time-consuming, and poorly reproducible pathology tasks [10, 11] is guiding our discipline towards a holistic and integrative approach [12,13,14], with promising preliminary results even in the molecular characterization of NSCLC [15, 16]. One of the routine steps that can significantly benefit from computational algorithms is the evaluation of the tumor cellular fraction (TCF), which is pivotal for adequacy assessment in molecular studies [17,18,19] and traditionally affected by significant inter-observer variability [20,21,22,23]. However, whether this computational evaluation may impact the final molecular analysis is still unknown, and its applicability to more challenging cancer specimens (e.g., cytology) has not yet been investigated. Thus, in the present study, we apply a QuPath-based tool for the assessment of TCF on NSCLC cytological and histological specimens, evaluating the impact of this digital pathology approach on the final pathologist evaluation and the corresponding molecular results.

Methods

Cases

We prospectively enrolled NSCLC cases that underwent molecular analysis at the Oncological Molecular Pathology Unit of Fondazione IRCCS San Gerardo dei Tintori, Università degli studi di Milano-Bicocca (UNIMIB) in Monza, Italy, from October 2022 to August 2023. For each case, representative glass slides from either histological samples or cell blocks (formalin-fixed, paraffin-embedded, FFPE), stained with hematoxylin and eosin (H&E), were selected by the molecular pathologist (path #1, DS) for the definition of tumor-containing regions (TCRs), which were pen-marked on the slide for subsequent tumor microdissection. At this time point, the number of viable tumor cells (< or ≥ 100) and the TCF (%) were evaluated by the same pathologist (path #1, pTCF) and added to the final molecular pathology report for adequacy purposes. In particular, a TCF ≥ 20% and ≥ 100 viable tumor cells were considered the minimum desirable prerequisites for the reliability of the subsequent molecular analysis [18, 24]. Slides were then blindly reviewed by a different pathologist (path #2, VL) and a pathology trainee (MM) for the assessment of pTCF. The obtained values were reported alongside the original assessments for comparison purposes. The study was approved by the local Ethics Committee (prot. 35859, 24/10/2022).
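As a minimal, hypothetical sketch of how this adequacy rule could be encoded (not part of the study's pipeline; the function name and example values are illustrative only):

```python
def is_adequate(tcf_percent: float, viable_tumor_cells: int) -> bool:
    """Adequacy rule described above: TCF >= 20% and >= 100 viable tumor cells."""
    return tcf_percent >= 20 and viable_tumor_cells >= 100

# Illustrative examples (hypothetical values)
print(is_adequate(35, 250))  # True: both thresholds met
print(is_adequate(15, 400))  # False: TCF below the 20% threshold
```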

Computational pathology pipeline

The retrieved slides scored by path #1 were anonymized and scanned with a NanoZoomer S60 (Hamamatsu, Shizuoka, Japan) at 20× magnification (0.4416 µm per pixel) [6, 9]. The obtained whole slide images (WSIs) were imported into QuPath v0.4.4 [25], and subsequent analyses were restricted to the pen-marked area representative of the microdissected region (ROI). For each ROI, a cell-by-cell visual count and classification into “Tumor” or “Non-neoplastic” categories (e.g., immune cells, stromal cells, and normal epithelial cells) was obtained by two expert lung pathologists (FB and FP) and considered as the reference/ground truth (GT), as previously suggested (Supplementary Fig. 1) [23]. Subsequently, to obtain the computational TCF (cTCF), the StarDist extension was used for nuclear detection (as a surrogate for cells), the main variable for the final cTCF evaluation [26], as described in the Supplementary Methods. The QuPath machine learning (ML)–based object classifier (Random Trees (RT), Supplementary Fig. 2) for assigning nuclei to the “Tumor” vs “Non-neoplastic” classes was then trained using manual annotations, starting from a balanced set of five cells per category and reaching the best performance at a maximum of 10 annotations (Supplementary Fig. 3). Additionally, an “Ignore” class was created for detections that did not correspond to nuclei (e.g., artifacts, red blood cells, mucus, anthracosis). The obtained cTCF was then compared with the human-obtained pTCF. To further investigate possible differences between cTCF and pTCF, a deep learning (DL) approach (WSInfer QuPath extension [27]) was used to automatically outline tumor foci (TF) boundaries from the surrounding stroma (“Non-neoplastic/Other,” as detailed in the Supplementary Methods). Finally, the reproducibility of the obtained cTCF was assessed on a subset of five randomly selected cases for each sample category (surgical, biopsy, cell block; n = 15), on which two additional pathologists (CE and EGR, validators #1 and #2) acted as independent annotators, to evaluate the robustness and reliability of the proposed tool when used by different pathologists.
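The pipeline itself is scripted within QuPath; as an illustration of how the cTCF is derived from the classified detections, the following minimal Python sketch computes the fraction from a hypothetical per-detection measurement table exported from QuPath (the file name and the “Classification” column label are assumptions and may differ depending on the QuPath version and export settings):

```python
import pandas as pd

# Load a per-detection table exported from QuPath after StarDist detection and
# object classification (file name and column label are illustrative assumptions).
detections = pd.read_csv("roi_detections.csv", sep="\t")

# Keep only the two informative classes; detections labelled "Ignore"
# (artifacts, red blood cells, mucus, anthracosis) are excluded from the count.
counted = detections[detections["Classification"].isin(["Tumor", "Non-neoplastic"])]

tumor_n = (counted["Classification"] == "Tumor").sum()
total_n = len(counted)

# Computational tumor cellular fraction within the pen-marked ROI
ctcf = 100 * tumor_n / total_n if total_n else float("nan")
print(f"cTCF = {ctcf:.1f}% ({tumor_n}/{total_n} nuclei classified as Tumor)")
```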

Copy number variations (CNVs) and TCF

NGS was performed on the Ion Torrent™ Genexus™ System (Thermo Fisher Scientific) with the Oncomine™ Precision Assay (OPA, Thermo Fisher Scientific), which covers 50 clinically relevant genes in NSCLC (Supplementary Methods). The computational analysis of sequencing data was performed before and after the cTCF scoring, using Ion Reporter™ Software 5.20.2.0 (Thermo Fisher Scientific), to estimate possible changes in the final molecular report. In particular, the copy number variation (CNV) parameter related to copy gains can be strongly influenced by the tumor cellularity estimate, according to the following formula employed by the software for CNV variant calling:

$$f \cdot x + (1-f) \cdot 2 = C \;\Rightarrow\; x = \frac{C - 2\,(1-f)}{f}$$

where \(x\) is the tumor copy number (CNV value), \(f\) is the tumor cellularity (ranging from 0 to 1), and \(C\) is the copy number observed in the sample for the autosome. The confidence criteria for CNV calls employed in routine practice require the fulfillment of at least five of the following six criteria: tumor cellularity ≥ 50%, minimum DNA read count of 10,000 reads, median absolute pairwise difference (MAPD) ≤ 0.5, CNV confidence at 5% ≥ 4 copies, CNV level ≥ 5 copies, and a call significance p-value ≤ 0.00001, as per the manufacturer’s instructions.
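As a minimal sketch of this relationship and of the confidence-criteria count (the function names and example values are illustrative and do not reproduce the Ion Reporter™ implementation):

```python
def tumor_copy_number(observed_cn: float, tumor_fraction: float) -> float:
    """Invert f*x + (1 - f)*2 = C to recover the tumor copy number x."""
    return (observed_cn - 2 * (1 - tumor_fraction)) / tumor_fraction

def meets_confidence_criteria(tumor_fraction, reads, mapd,
                              cnv_conf_5pct, cnv_level, p_value) -> bool:
    """At least five of the six confidence criteria listed above must be fulfilled."""
    criteria = [
        tumor_fraction >= 0.5,   # tumor cellularity >= 50%
        reads >= 10_000,         # minimum DNA read count
        mapd <= 0.5,             # median absolute pairwise difference
        cnv_conf_5pct >= 4,      # CNV confidence at 5% >= 4 copies
        cnv_level >= 5,          # CNV level >= 5 copies
        p_value <= 1e-5,         # call significance
    ]
    return sum(criteria) >= 5

# Illustrative (hypothetical) values: an observed copy number of 3.2 with a tumor
# fraction of 0.3 corresponds to x = (3.2 - 1.4) / 0.3 = 6 copies in the tumor.
print(tumor_copy_number(3.2, 0.3))  # 6.0
```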

Statistical analysis

Continuous variables were summarized as mean and standard deviation (SD), as applicable, while qualitative variables were presented as counts and frequencies. To compare means and qualitative variables, t-tests, chi-square tests, and Mann–Whitney U tests were employed, depending on the nature of the data. Significance was set at p < 0.05. To assess agreement among the different observers, between each observer and the GT, and the reliability of the cTCF obtained by different pathologists, the weighted kappa coefficient (Wk, quadratic weights) and its corresponding p-value were used on the tumor cellularity percentages. Statistical analyses were performed stratifying by sample type (resection, biopsy, and cell block) and separately considering samples deemed non-suitable for molecular analysis. Statistical computations were carried out in Python, specifically leveraging the pandas and scikit-learn libraries.
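A minimal sketch of how these tests could be assembled in Python is shown below; SciPy is assumed for the t-test, Mann–Whitney U, and chi-square tests alongside the pandas and scikit-learn libraries mentioned above, and the file name, column names, and 10% binning of TCF values for the weighted kappa are illustrative choices rather than the exact implementation used in the study:

```python
import pandas as pd
from scipy.stats import ttest_ind, mannwhitneyu, chi2_contingency
from sklearn.metrics import cohen_kappa_score

# Hypothetical per-case table: one row per case with each observer's TCF score
# and the computational cTCF (column names are illustrative assumptions).
df = pd.read_csv("tcf_scores.csv")

# Quadratically weighted kappa between two observers; TCF percentages are
# binned into 10% classes here because kappa expects categorical labels.
bins = list(range(0, 101, 10))
path1 = pd.cut(df["pTCF_path1"], bins=bins, labels=False, include_lowest=True)
path2 = pd.cut(df["pTCF_path2"], bins=bins, labels=False, include_lowest=True)
wk = cohen_kappa_score(path1, path2, weights="quadratic")

# Mean comparison between two sets of TCF scores (illustrative; a paired test
# could also be appropriate for scores given on the same cases).
t_stat, t_p = ttest_ind(df["pTCF_path1"], df["cTCF"])

# Non-parametric alternative, used when normality cannot be assumed
u_stat, u_p = mannwhitneyu(df["pTCF_path1"], df["cTCF"])

# Chi-square test on the < 20% vs >= 20% adequacy categories
table = pd.crosstab(df["pTCF_path1"] < 20, df["cTCF"] < 20)
chi2, chi_p, _, _ = chi2_contingency(table)

print(f"Wk = {wk:.2f}; t-test p = {t_p:.3f}; "
      f"Mann-Whitney p = {u_p:.3f}; chi-square p = {chi_p:.3f}")
```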

Results

Cases

A total of 121 cases were included in the study. Of these, 79 (65%) were histological samples, comprising 35 (29%) surgical and 44 (36%) small bioptic specimens, and 42 (35%) were cell blocks. Details on the pTCF, cTCF, and GT per sample category and the final mutational status after NGS analysis are reported in Table 1.

Table 1 Average pTCF assessed by the three pathologists, overall and divided per sample type, along with the number of DNA/RNA mutant cases

pTCF inter-observer variability

The inter-observer variability among path #1, path #2, and the trainee, along with the corresponding comparisons with the GT, is reported in Table 2. The overall Wk ranged from 0.46 to 0.83 (avg. 0.59), with path #2 and the trainee showing higher reproducibility compared to path #1. All three scorers were significantly distant from the GT (overall and for each sample category, p < 0.01), and none of them was closer to the true tumor cellularity than the others, as demonstrated by the overall Wk (0.26, 0.28, and 0.26 for path #1, path #2, and the trainee, respectively). A total of 11 (9%), 19 (16%), and 23 (19%) cases were assigned a pTCF < 20% by path #1, path #2, and the trainee, respectively, with only the path #1 vs trainee comparison showing a statistically significant difference (p = 0.03; path #1 vs #2, p = 0.120; path #2 vs trainee, p = 0.500). All observers agreed on assigning < 100 cells in 12 (10%) samples, as per the adequacy criteria for molecular analysis.

Table 2 Inter-observer variability among pathologist #1, pathologist #2, and the trainee, overall and divided per specimen type (surgical, small biopsy, and cell block), along with the corresponding comparisons with the GT and the significance of the pTCF differences. The reported k coefficients refer to the weighted kappa (quadratic function), while the p-values correspond to the t-test of the pTCF differences

Computational assessment of TCF

The cTCF demonstrated excellent performance when compared to the GT, with no statistically significant differences (p = 0.129, 0.502, and 0.130 for surgical samples, small biopsies, and cell blocks, respectively; Fig. 1).

Fig. 1

Comparative example of pTCF, GT, and cTCF on different sample types (surgical, small biopsy, and cell block). Main figure magnification × 10, inset × 40. *See Supplementary Fig. 1 for details on the GT development

During the training process, an average of eight annotations (range 5–10) was used for both the “Tumor” and the “Non-neoplastic” labels, with model performance reaching a plateau at ten annotations per category. The comparative analysis of the cTCF obtained by different pathologists demonstrated substantial comparability of the results, overall and for each sample type (mean Wk = 0.9, Table 3 and Supplementary Fig. 4). After the application of the cTCF model, 106 (88%), 12 (10%), and 3 (2%) cases showed a lower, higher, or equal cellularity estimate compared with the pathologists’ average, respectively (Fig. 2). Overall, the mean cTCF was significantly lower than the pTCF (30 ± 10 vs 52 ± 19, p < 0.001), as confirmed by the sub-analyses on surgical, small bioptic, and cell block samples (30 ± 9 vs 59 ± 13, p < 0.001; 30 ± 10 vs 55 ± 19, p < 0.001; 32 ± 11 vs 44 ± 20, p = 0.001, respectively). Moreover, the cTCF fell below 20% in a higher number of cases, although the difference did not reach statistical significance (p = 0.690, Table 4). In contrast to the pathologists’ assessment, the algorithm assigned < 100 viable cells to none of the cases. Despite these differences around the adequacy thresholds between pTCF and cTCF, a comparable rate of wild-type/mutant cases was noted in the groups with TCF < and ≥ 20% (0.42 vs 0.45, p = 0.810). The WSInfer DL-based assessment of tumor foci (TF) on surgical and small bioptic samples showed similarity with the pTCF (p = 0.070) and substantial differences from the cTCF (36 ± 29, p < 0.001; box plots in Fig. 3a and density plots in Fig. 3b show overlap only between the pathologist assessment and the tumor area). The automated cTCF pipeline has been packaged as a script for integration as a QuPath extension (QuPath Analysis of Nuclei from Tumor to Uniform Molecular tests, QuANTUM) to streamline the process for future routine clinical application of the computational tool (Fig. 4).

Table 3 Reproducibility analysis of the cTCF obtained from different pathologists, overall and for each sample type (surgical samples, small biopsies, and cell blocks). The reported k coefficients refer to the weighted kappa (quadratic function), while the p-values correspond to the Mann–Whitney U test of the cTCF differences
Fig. 2

In a, cases changing TCF value from the pathologist assessment (average score, left) to the computational assessment (right). In b, comparative evaluation of the TCF scores given by the three pathologists and by the computational tool (green indicates a lower and red a higher TCF value)

Table 4 Comparison of the cases with < or ≥ 20% tumor cellularity as per cTCF, avg. pTCF, and pTCF from single pathologists, with significance of the differences. *p-values refer to the comparison of cTCF vs the other evaluations (chi-squared test)
Fig. 3

Box plot (a) and density plot (b) of the tumor cellularity distribution for the computational assessment, the pathologists’ average score, and the deep learning evaluation of tumor area in surgical and small biopsy samples

Fig. 4

The computational pipeline used to assess TCF (QuANTUM). TCF, tumor cellular fraction; QuANTUM, QuPath Analysis of Nuclei from Tumor to Uniform Molecular tests

Molecular output with computational TCF

A total of 29 cases (23.9%) assessed with the pTCF had a CNV variant call as per software metrics (Supplementary Table 1). Of these, CNVs meeting the additional confidence criteria were called in only seven cases (3 EGFR, 3 KRAS, 1 MET). After re-evaluation with the QuANTUM-derived TCF (Fig. 5), a reduction in CNV calls by the software was noted (27 vs 29, −6.9%), with an increase in the final CNVs reported as per the additional confidence criteria (13 vs 7, +85.7%: 5 EGFR, 5 KRAS, 2 MET, 2 PIK3CA, 1 ERBB2), with 2 cases harboring multiple CNVs (1 case with EGFR and MET, 1 case with EGFR and PIK3CA). Thus, 24% (7/29) of the final CNV calls were discrepant compared with the initial analysis.
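As an illustration of why a downward revision of the tumor fraction can raise the copy number attributed to the tumor (the numbers below are hypothetical and chosen only to show the mechanism, not taken from the study data), consider the formula reported in the Methods: for the same observed copy number \(C = 3.2\), lowering \(f\) from 0.9 to 0.3 gives

$$x_{f=0.9}=\frac{3.2-2\,(1-0.9)}{0.9}\approx 3.3,\qquad x_{f=0.3}=\frac{3.2-2\,(1-0.3)}{0.3}=6.0$$

so the same observed signal crosses the “CNV level ≥ 5 copies” confidence criterion only under the lower tumor fraction, consistent with the increase in confirmed CNV calls observed after the QuANTUM re-evaluation.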

Fig. 5

A case (no. 27 in Supplementary Table 1) undergoing a change in CNV calling after re-assessment of the TCF with QuANTUM. Based on the first evaluation by the molecular pathologist, the TCF was estimated at 90%, with a consequent call of an EGFR CNV. After re-evaluation based on the cTCF obtained with QuANTUM, a MET CNV was called in addition to the EGFR one, with potential implications for the subsequent management of the case

Discussion

The recent massive introduction of predictive molecular tests in the diagnostic assessment of NSCLC has stressed the need for standardization of the preanalytical phase [28]. In this setting, one of the steps involved in tumor tissue procurement for NGS analysis is the (macro)dissection of the sample, generally performed after re-evaluation of the case by the molecular pathologist, who estimates the TCF to maximize the proportion of tumor-derived nucleic acids [29]. Previous experiences, mainly focused on single-gene testing approaches (e.g., polymerase chain reaction, PCR), led to the definition of a minimum requirement of 100 cells [18] and 20% TCF [24] as the sample adequacy criteria to obtain reliable molecular results. However, although it is a relatively straightforward task for pathologists, TCF assessment still suffers from limitations, mainly due to the lack of clear guidelines and training, the heterogeneity of neoplastic cell presentation, morphology, and size, and the presence of non-neoplastic cells within tumor regions, all of which confound visual qualitative assessment [21, 22, 30]. These factors significantly hamper inter-observer reproducibility, with up to one-third of cases deviating by ± 20% from the real TCF value in some studies [20], and about 38% of samples being discarded after the pathologists’ TCF estimation because too few tumor cells were available for downstream molecular testing. Our study confirms this suboptimal inter-observer reproducibility among three different scorers (overall Wk 0.46–0.83, avg. 0.59), with the proportion of “suboptimal” samples ranging from 9 to 19% (either for TCF < 20% or < 100 cells). Moreover, differences were noted among the single-observer evaluations, attributable to discrepancies in training background and commitment to TCF assessment (molecular vs general pathologist vs trainee), with all three equally distant from the GT, further stressing the need for a more robust and reliable evaluation of TCF. The recent introduction of digital pathology tools is significantly reshaping our practice through the progressive adoption of computer-aided diagnostics (CADs), which have already demonstrated their benefits in the assessment of predictive immunohistochemistry and fluorescence in situ hybridization [31, 32], pancreatic neuroendocrine tumor grading [33], and prostate cancer detection and Gleason grading [34]. In this setting, CAD-based TCF assessment, combined with the experience of molecular pathologists, has already shown promising results in terms of reproducibility on lung adenocarcinoma [35], and the combination of CAD and human assessment even seems to improve the evaluation of challenging cases [23]. However, these preliminary studies relied either on commercially available platforms that may not be widely accessible to general pathologists [35] or on computational tools assessing cells within/outside the tumor area, which are potentially biased in highly immunogenic, lymphocyte-enriched tumors [23], and neither estimated the practical repercussions on the molecular results. To address the need for a democratic and user-friendly QuPath-based CAD for TCF estimation, here we demonstrated reliable identification of tumor cells in different NSCLC sample types (surgical specimens, small biopsies, and cell blocks), showing an overall lower tumor cellularity estimate compared with the average pathologists’ assessment (30 ± 10 vs 52 ± 19, p < 0.001).
This was particularly evident in surgical samples, potentially explainable by the greater influence of a peri-/intratumoral background enriched in stromal and immune cells, which can act as a confounding factor for the human eye and thus affect the tumor cellularity estimation. This phenomenon has already been recognized and ascribed to the different sizes of tumor cells (larger) vs surrounding non-neoplastic cells, potentially leading to an overestimation of TCF by pathologists [23]. Moreover, the application of this tool yielded a higher number of cases with TCF < 20% (14% vs 7% for the avg. pTCF), but none with < 100 viable cells, and demonstrated no significant difference in terms of mutant cases below and above the 20% threshold (0.42 vs 0.45, p = 0.81). These findings suggest the need to revise the currently used TCF cutoffs for material adequacy, especially considering the high sensitivity and low limit-of-detection (LOD) values of the most recently introduced molecular platforms (e.g., NGS) [36], which allow the identification of low-frequency genetic variants even in scanty cellular material. To investigate the reasons for the overestimation of tumor cellularity by pathologists, we explored whether the human eye may be biased by the area occupied by the tumor at low magnification rather than by the actual tumor cellularity. In this direction, the application of a freely available DL-based tool for the estimation of tumor areas (WSInfer) on surgical samples demonstrated comparability with the pathologist-derived TCF (p = 0.07) and substantial differences from the cTCF (36 ± 29, p < 0.001). This is in line with known limitations affecting the human interpretation of images, falling under the umbrella of “illusion of size” effects, in which the perceived size of an object is influenced by the context in which it is displayed [37]. This experience demonstrated a potential impact of the cTCF on CNV calls, leading to an overall decrease in the alterations detected by the variant calling software (27 vs 29, −6.9%) but an improved detection of “true” CNVs in the final molecular report (13 vs 7, +85.7%). In particular, while not yet having a strong clinical impact (CNVs are still not included among the 2023 ESCAT I-II variants for NSCLC), these new CNV calls could make patients candidates for clinical trials [38,39,40,41] or unveil MET amplification as the cause of drug resistance in patients treated for a common EGFR mutation [42].

Conclusion

This study confirms the reliability of the computational assessment of tumor cellularity for the NGS evaluation of NSCLC specimens, applicable to both histological and cytological samples, in the setting of a second-level Molecular Pathology department. The availability of user-friendly algorithms (i.e., QuANTUM) can increase the molecular variant detection rates of our laboratories. The proposed QuANTUM computational tool can be easily adopted through the QuPath extension available at https://github.com/Gizmopath/QuANTUM.