FormalPara Key Points

Data reporting on the epidemiology of EGFR exon 20 insertions in NSCLC are lacking.

A pragmatic literature review identified prevalence of EGFR exon 20 insertions.

Meta-analyses precisely quantified exact EGFR exon 20 insertion prevalence.

Results can be used to assess numbers of patients eligible for emerging therapies.

1 Introduction

Lung cancer is the leading cause of cancer-related deaths in men and the second cause in women, accounting for 18% of all cancer-related deaths worldwide [1, 2]. In 2016, lung cancer accounted for approximately 20% of all cancer-related deaths and 5% of all deaths in Europe [3]. Non-small-cell lung cancer (NSCLC) accounts for approximately 85% of all lung cancers and comprises three main histological subtypes. Adenocarcinoma is the most common, accounting for ~ 60% of all NSCLC cases (~ 50% of all lung cancer), followed by squamous cell carcinoma (~ 25% of all NSCLC cases) [4].

NSCLC has several driver mutations, the majority of which occur in adenocarcinomas [5]. In adenocarcinoma, predominant alterations include mutations in Kirsten rat sarcoma virus (KRAS), epidermal growth factor receptor (EGFR) and anaplastic lymphoma kinase (ALK) genes, alongside ROS1, MEK1, HER2 and RET genes. Other driver genes include FGFR1, PDGFRA, PTEN, MET, STK11, BRAF, PIK3CA, DDR2, AKT1 and NRF2 [5,6,7,8]. These mutations play important roles in the oncogenesis of NSCLC and are associated with numerous oncogenic effects, such as activating signalling proteins, leading to increased cell proliferation, metastasis and decreased apoptosis [6, 9].

EGFR is a transmembrane glycoprotein with an intracellular tyrosine kinase domain that contributes to the regulation of cell proliferation [9, 10]. EGFR mutations have been reported to represent approximately 14% of all cases of NSCLC in European countries [11]. Mutations in the EGFR gene can be categorised into four main types, based on the exon in which they occur. Common EGFR mutations are exon 19 deletions and exon 21 L858R point mutation, constituting approximately 85% of EGFR mutations. The remaining 10–15% are made up of uncommon EGFR mutations, including exon 18 mutations and exon 20 insertion mutations, along with rarer exon 19 and 21 mutations, such as L861Q on exon 21 [12,13,14].

Exon 20 insertion mutations were among the earliest EGFR mutations to be identified and can be characterised as in-frame insertions or duplications between amino acid positions 762 and 774 of the EGFR protein (Fig. 1) [14, 15]. However, despite being one of the earliest identified mutations, data reporting on the epidemiology of exon 20 insertions are currently lacking [14, 16].

Fig. 1
figure 1

EGFR exon 20 insertion mutations in NSCLC [14]. Prevalence estimates originate from: aStella et al. J Cancer Res Clin Oncol 2013; 139:1327–1335; bLocatelli-Sanchez et al. Lung 2013;191:491–499; cKerner et al. PLoS One 2013;8:e70346. Exon 19 deletions and L858R exon 21 mutations are considered common EGFR mutations. Figure source: Vyse S. et al Signal Transduct Target Ther. 2019;4:1–10. Adapted based on evidence identified by the pragmatic literature review. EGFR epidermal growth factor receptor, NSCLC non-small-cell lung cancer

Uncertainty in epidemiological estimates is compounded by variations in testing methodologies for EGFR mutations, which have varying degrees of sensitivity. For example, direct sequencing by polymerase chain reaction (PCR), which was the historical standard for EGFR mutation testing, has low sensitivity. It therefore only detects mutations when sufficient levels of mutant DNA are present [17]. Furthermore, PCR can only detect predetermined target sequences, meaning it has a low discovery power to detect novel genes and to quantify rare variants [18]. A number of alternative methods of mutation testing have been developed, such as next-generation sequencing (NGS), high-performance liquid chromatography (HPLC) and high-resolution melting analysis (HRMA). These methods have improved sensitivity and can detect more mutations using a relatively small amount of DNA [17]. They may also detect abnormalities not readily detected by previous PCR-based techniques, with a higher discovery power to detect all variants without prior sequence knowledge [18, 19].

Patients with activating EGFR mutations are typically treated with tyrosine kinase inhibitors (TKIs), which have positive results for NSCLC with common EGFR mutations [20]. However, therapy options for patients with rare EGFR mutations have historically been lacking. Exon 20 insertion mutations are associated with de novo resistance to TKIs, and many TKIs, such as osimertinib and poziotinib, have been shown to have limited efficacy against exon 20 insertion mutations [21, 22]. The limited efficacy of potential treatments has contributed to the poorer prognosis reported for these patients compared with patients with other EGFR mutations [23]. A 2020 study reported a 75% increased risk of death with exon 20 insertion mutations compared with common EGFR mutations, as well as 8% of patients achieving 5-year survival compared with 19% of patients with common EGFR mutations [24].

However, recently developed investigational therapies such as amivantamab, a monoclonal antibody, and mobocertinib, a small-molecule TKI, are expected to have specific activity against exon 20 insertion mutations. In 2021, amivantamab was granted approval by the US Food and Drug Administration (FDA) and European Medicines Agency (EMA) [25, 26]; mobocertinib was granted approval by the FDA based on positive clinical trial data [27]. There is a need for accurate estimates of the number of patients that would benefit from such emerging therapies.

A systematic literature review (SLR) and meta-analysis (with searches run in 2013) conducted by Zhang 2016 reported on the frequency of EGFR mutations, including exon 20 insertion mutations, in NSCLC. It found a pooled frequency of 1.7% (1.4, 2.0) for exon 20 insertion mutations in any stage NSCLC in Europe [11]. However, this estimate may now be outdated. More recently, other sources have reported a wide range in estimates, for example an SLR conducted by Burnett 2021 reported a frequency of exon 20 insertion mutations in Europe which ranged from 0.3 to 1.3% in all NSCLC cases, and 4–12% in EGFR-positive NSCLC [23]. As a meta-analysis was not performed, the true prevalence cannot be ascertained.

Lung cancer accounts for approximately 5% of all deaths in Europe, ranging from 4% in Sweden to 7% in the Netherlands. The objective of this pragmatic literature review (PLR) was to identify current estimates of the incidence and prevalence of EGFR exon 20 insertion mutations in NSCLC in nine key European countries (Belgium, France, Germany, Italy, the Netherlands, Spain, Sweden, Switzerland and the UK). These countries were selected as current European prevalence estimates are outdated, lack granularity and are wide ranging [11, 23].

The identified ranges for prevalence estimates from the PLR were wide. As such, the true prevalence of EGFR-positive and exon 20 insertion positive NSCLC was unclear. Therefore, a subsequent meta-analysis was conducted with the objective of establishing more accurate estimates of the epidemiology of EGFR-positive NSCLC in these countries, including exon 20 insertions.

2 Materials and Methods

The PLR was performed in accordance with a pre-specified protocol and reported in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The review protocol was not registered.

2.1 Identification of Studies

The SLR conducted by Zhang 2016 collected data from 456 studies worldwide, published up to June 2013 [11]. In order to avoid the duplication of work and to capture only the most recent evidence using modern testing technologies, the searches for this PLR were limited to 2013 or later. The Zhang 2016 SLR was judged to be of high quality according to the AMSTAR 2 critical appraisal tool, and therefore a reliable source of reference data [28].

A comprehensive approach was taken to evidence identification. Electronic database searches were conducted in MEDLINE and Embase simultaneously via Ovid on 5 May 2020. The complete search strategy is presented in the Online Supplementary Material (OSM) Table S1. Additionally, conference proceedings from the last three years (2018 to 2020) were searched for the American Association for Cancer Research (AACR), the American Society of Clinical Oncology (ASCO), the European Society for Medical Oncology (ESMO), the International Association for the Study of Lung Cancer (IASLC) European Conference on Lung Cancer and ESMO Lung Cancer Annual Congress. Bibliographies of published SLRs and/or (network) meta-analyses identified during the searches were also reviewed to identify additional relevant publications.

2.2 Selection of Studies

Each abstract and full text was assessed for inclusion by a single reviewer using pre-defined eligibility criteria, based on the population(s), intervention(s), comparator(s) and outcome(s) (PICO) framework (OSM Table S2). Studies were included if they reported the prevalence or incidence of mutations in the EGFR gene in patients in the following European countries: Belgium, France, Germany, Italy, the Netherlands, Spain, Sweden, Switzerland, UK. To avoid selection bias, only observational studies were included. A second independent reviewer checked all included and 10% of the excluded articles.

2.3 Data Extraction and Reporting of Results

Key information from each included study, including study characteristics, patient characteristics and epidemiological outcomes, was extracted into a pre-specified data extraction grid by a single individual. A second individual independently verified 10% of the extracted information (with a particular focus on the quantitative data) to ensure its accuracy. There was a low discrepancy rate in the verified data (1.06%); therefore, there were low concerns about the quality of the unverified extractions.

Where not reported in the publication, simple calculations were performed to generate prevalence estimates, if possible based on the available data. Results for the same outcome across different studies, for example EGFR mutation frequency, were used to inform a range from lowest reported value to highest reported value. The more common T790M mutation was not included within estimates of exon 20 insertion prevalence.

2.4 Quality Assessment

The quality of all included studies was assessed using the Joanna Briggs Institute (JBI) Critical Appraisal tools Checklist for Prevalence Studies [29]. The quality assessment was completed by one individual and 10% of the information was verified by a second independent reviewer.

2.5 Meta-analysis

Meta-analyses were conducted to synthesise the prevalence estimates identified by the PLR. Specifically, meta-analyses were performed for the prevalence of overall EGFR mutations in any stage NSCLC, EGFR mutations in advanced/metastatic NSCLC, EGFR mutations in patient subgroups (men and women, smokers and non-smokers, adenocarcinoma and non-adenocarcinoma) and the frequency of exon 20 insertion mutations in NSCLC and in EGFR-positive NSCLC. The prevalence reported by each study was analysed using logistic regression with a logit transformation. Prevalence reported by individual studies was used as the input for the models, along with their calculated exact Copper-Pearson confidence intervals (CIs). The I2 statistic was used to assess whether fixed-effect or random-effect models were appropriate. A maximum likelihood estimator was used to estimate the between-study variation, τ2, and all meta-analyses were run using statistical software, R, [30] with the function metaprop from the meta package [31].

3 Results

3.1 Study Selection

A total of 1230 records were retrieved by the electronic database searches. After de-duplication of results, 1217 unique records were reviewed (Fig. 2). After title and abstract review, 163 records were selected for full-text review. Of these, 90 were found to fulfil the eligibility criteria for inclusion in the review. Supplementary searches of congresses and SLR bibliographies yielded three additional records that fulfilled the eligibility criteria. In total, 93 publications reporting 80 unique studies were included in the review.

Fig. 2
figure 2

PRISMA diagram. Due to the large volume of evidence identified throughout the review, a prioritisation strategy was implemented in order to synthesise evidence and draw conclusions of the highest possible utility. The deprioritised evidence included: studies with specific patient populations, such as patients with specific conditions (e.g. HIV, Li-Fraumeni syndrome) or patients selected based on having specific mutations (e.g. METex14, ALK translocations, MAP2K1, HER2, BRAF). ALK anaplastic lymphoma kinase, BRAF v-raf murine sarcoma viral oncogene homolog B1, EGFR epidermal growth factor receptor, HER2 human epidermal growth factor 2, HIV human immunodeficiency virus, MAP2K1 mitogen-activated protein kinase 1, METex14 mesenchymal-epithelial transition gene exon 14, PRISMA Preferred Reporting Items for Systematic Reviews and Meta-Analyses

Forty studies reported the prevalence of overall EGFR mutations in any stage NSCLC and 30 studies reported the prevalence of overall EGFR mutations in advanced/metastatic NSCLC. Nine studies reported on exon 20 insertion mutations in any-stage NSCLC and one study reported on exon 20 insertion mutations specifically in advanced/metastatic NSCLC (Table 1).

Table 1 Overview of studies reporting the prevalence of exon 20 insertion mutations

3.2 Study Characteristics

All included studies reported on the prevalence of mutations. No evidence was identified for incidence. Across all studies, data were collected between 2008 and 2017. The nine studies that reported the frequency of exon 20 insertions in any-stage NSCLC were located in France (n = 4) [32,33,34,35], Italy (n = 1) [36], UK (n = 2) [37, 38], the Netherlands (n = 1) [39] and Sweden (n = 1) [40]. The one study reporting the frequency of exon 20 insertions in advanced or metastatic NSCLC was conducted in Spain (Table 1) [41].

Study population size ranged widely, from 134 participants in a study performed in Italy to 10,117 participants in the French ERMETIC-IFCT study [32, 36]. The nine studies reporting on exon 20 insertions in any-stage NSCLC included a pooled total of 32,236 participants. The one study that reported on exon 20 insertion mutations specifically in advanced/metastatic NSCLC included 184 participants [41]. Although the majority of studies included patients with all stages of NSCLC, it was noted that a high proportion of these patients were stage III or IV NSCLC (ranging from 28 to 99%) (Table 1).

Direct sequencing by PCR was used to identify and measure EGFR mutations in the majority of studies. However, most studies also reported using more than one method, such as fragment analysis and NGS (Table 1). PCR methods included PCR-high resolution melt (HRM) and allele-specific PCR, such as the Therascreen EGFR Rotor-Gene Q (RGQ) PCR kits.

Of the ten included studies, eight were reporting results in TKI-naïve patients. Of the two studies that included patients who had been exposed to prior therapies, the ERMETIC-IFCT study publication specified these therapies included erlotinib and gefitinib [32, 35].

3.3 Study Quality

Overall, the risk of bias of the ten included studies reporting exon 20 insertion mutations was deemed to be low-to-moderate, based on the questions in the adapted JBI checklist [29]. A summary of the quality for each question domain and each study is presented in OSM Table S3.

Of the included studies, 7/10 had a primary aim of identifying mutation prevalence or frequency [32,33,34, 38,39,40,41]. The remaining three studies did not have the primary aim of measuring mutation frequency [35,36,37], rather data were collected as part of routine clinical work.

The majority (9/10) of studies performed well in the domains of sufficient coverage of the identified sample and valid methods for testing mutational status in that most studies included all patients for whom mutational status was available and standard methods of testing including direct sequencing or fragment analysis were typically used [32,33,34,35,36,37,38, 40, 41].

3.4 Estimated Prevalence of Overall EGFR Mutations and Exon 20 Insertion Mutations from the Pragmatic Literature Review

The frequency of overall EGFR mutations in any stage NSCLC ranged from 4.0–28.9%, and 5.0–37.5% in advanced/metastatic NSCLC (Table 2; Fig. 3). The overall frequency of exon 20 insertion mutations in any stage NSCLC ranged from 0.3–2.2% amongst a general NSCLC population and 2.5–23.1% within EGFR-positive NSCLC, based on nine studies (Table 2; Fig. 3).

Table 2 Frequency of EGFR mutations including exon 20 insertions in any stage and advanced/metastatic NSCLC
Fig. 3
figure 3

Estimated frequency from the meta-analysis and PLR of exon 20 insertion mutations in any stage and advanced/metastatic NSCLC. aDoes not include T790M. The figures present a summary of the pooled results from the random effects meta-analyses overlayed with the range of results from the PLR. Full forest plots for the individual meta-analyses are presented in the supplementary materials. The outcome for the exon 20 insertion mutation frequency in advanced/metastatic NSCLC is based on findings of the one study. CI confidence interval, EGFR epidermal growth factor receptor, exon 20i exon 20 insertion mutations, NSCLC non-small-cell lung cancer, PLR pragmatic literature review, RE random effects

The lowest reported frequency of exon 20 insertions within an overall NSCLC population (0.4%) was in a study conducted in the UK [38]. The lowest value within an EGFR-positive NSCLC population (2.5%) was reported in a study conducted in France [38]. The highest reported frequency of exon 20 insertions within both an overall NSCLC population and an EGFR-positive NSCLC population was in a study conducted in Italy (2.2% and 23.1%, respectively) [36]. This Italian study had the smallest sample size of all studies reporting on exon 20 insertions, at 134 patients. It reported rates of three separate mutations (N771-P772 insV, del N771 insG and del S768I-V769 insI), which were summed to give a total value under the assumption that patients would not have multiple exon 20 insertion mutations.

The two largest studies (ERMETIC-IFCT in France, n = 10,117 and Evans 2019 in the UK, n = 17,046), both reported similar frequencies of exon 20 insertions [32, 38]. ERMETIC-IFCT found 0.4% and Evans 2019 found 0.4% of all patients with NSCLC had exon 20 insertions, and exon 20 insertions constituted 3.9% and 3.6% of EGFR-positive NSCLC, respectively [32, 38]. Evans 2019 used allele-specific PCR to measure EGFR mutations, and in ERMETIC-IFCT several different methods including direct sequencing, fragment analysis, pyrosequencing and PCR-HRM were used across different study centres.

The GGCP 048-10 study, which was the only study to report on the frequency of exon 20 insertions in advanced/metastatic NSCLC, found a frequency of one exon 20 insertion amongst 184 patients with NSCLC, or among 25 patients with EGFR-positive NSCLC (0.5% in NSCLC; 4.0% in EGFR-positive NSCLC) [41].

Seven studies reported on "Insertions" in exon 20 [32,33,34, 37, 38, 40, 41], while three studies reported on specific insertions [35, 36, 39]. These included N771_P772>V (0.1% in overall NSCLC; 0.8% in EGFR-positive NSCLC) [35], P772_H7773>HV (1.5% in overall adenocarcinoma; 15.6% in EGFR-positive adenocarcinoma) [36], V769_D770>GVV (0.75% in overall adenocarcinoma; 7.7% in EGFR-positive adenocarcinoma) [36], D770_N771>SVD (0.3% in overall NSCLC; 2.6% in EGFR-positive NSCLC) [39], D770G>Y (0.6% in overall NSCLC; 5.3% in EGFR-positive NSCLC) [39], del N771>G (0.1% in overall NSCLC; 0.8% in EGFR-positive NSCLC) [35] and del S768I_V769>I (0.1% in overall NSCLC; 0.8% in EGFR-positive NSCLC) [35] (Fig. 1).

For exon 20 insertions in any stage NSCLC, the earliest year of data collection was 2001 and the latest year was 2015. For advanced/metastatic NSCLC, the GGCP 048-10 study collected data in 2011 (Fig. 4).

Fig. 4
figure 4

Frequency of exon 20 insertion mutations in any stage NSCLC and advanced/metastatic NSCLC per year and method of data collection. ‘Multiple’ = direct sequencing, fragment analysis, pyrosequencing and HRM. Bubble size represents the number of patients with NSCLC in each study. Numbers in square brackets represent the article citation. EGFR epidermal growth factor receptor, HRM high-resolution melt, NGS next-generation sequencing, NSCLC non-small-cell lung cancer

The most common method for measuring mutation status was direct sequencing using PCR or a combination of methods. Only one study, conducted in the UK (Moore 2018), used NGS to measure exon 20 insertion mutation (Fig. 4) [37]. This study had the most recent data collection (starting in 2015) and reported a prevalence estimate for exon 20 insertions of 1.6% in overall NSCLC and 9.0% in an EGFR-positive population. These values were higher than the estimates from the majority of studies, including the two largest studies (Evans 2019 and ERMETIC-IFCT) (Fig. 4) [32, 38].

Complex mutations or co-mutations (the occurrence of multiple different mutations in a single tumour) involving exon 20 insertion mutations were reported in five of the ten included studies [33, 35, 38,39,40]. Prevalence of complex or co-mutations involving exon 20 insertion mutations ranged from 0.8% for exon 19/exon 20 mutations and exon 20/exon 21 mutations to 56.9% for exon 18/exon 20 mutations in an any stage EGFR-positive NSCLC population.

Of the seven studies that had a primary aim of identifying mutation prevalence or frequency, six reported exon 20 insertion mutations in any-stage NSCLC. Prevalence in these studies ranged from 0.4% to 1.7% in a NSCLC population, and 3.6–15.0% in an EGFR-positive population. These ranges were narrower than the estimated range that included all study designs.

3.5 Estimated Prevalence of overall EGFR Mutations and Exon 20 Insertion Mutations from the Meta-analysis

A high-level of between-study heterogeneity was identified based on the I2 statistic. As such, random effect models were specified instead of fixed effect models. That is, study-specific random intercepts were specified to account for heterogeneity in the estimates of prevalence across studies. Of the 30 studies included in the PLR reporting EGFR mutations in advanced/metastatic NSCLC, two publications were excluded from the meta-analysis because they did not report sample size, therefore no measure of uncertainty around the proportion could be generated. For all other analyses, the same number of publications as identified in the PLR were included in the meta-analysis.

The overall analysed prevalence for EGFR mutations was 12.5% (95% CI: 11.0, 14.1) (PLR range: 4.0–28.9%) in any stage NSCLC and 14.8% (12.8, 17.1) (PLR range: 5.0–37.5%) specifically in advanced/metastatic NSCLC (Fig. 3; OSM Fig. S1). The overall analysed prevalence of exon 20 insertions in any-stage NSCLC was 0.7% (0.4, 1.1) (PLR range: 0.3–2.2%), and 6.1% (4.0, 9.4) (PLR range: 2.5–23.1%) in EGFR-positive NSCLC (Fig. 3; OSM Fig. S1). The CIs generated from the meta-analysis are considerably narrower than the ranges across studies identified in the PLR, and therefore provide a more precise single point estimate than the studies identified in the PLR.

Only one study reported the frequency of exon 20 insertions specifically in advanced/metastatic NSCLC, therefore a meta-analysis of these results was not conducted. This study showed a prevalence of 0.5% (calculated 95% CI: 0, 3.0) in overall NSCLC and 4.0 (calculated 95% CI: 0.1, 20.4) in EGFR-positive NSCLC (Fig. 3; OSM Fig. S1) [41].

None of the studies reporting on exon 20 insertion mutations presented results separately for specific patient subgroups, for example stratified by sex, smoking status or NSCLC histology. However, these data were reported separately in studies investigating overall EGFR mutation frequency, with higher EGFR frequency in adenocarcinoma (12.7%) versus non-adenocarcinoma (1.8%), women (20.1%) versus men (7.2%) and non-smokers (43.1%) versus smokers (4.8%) with NSCLC (OSM Fig. S2).

4 Discussion

This PLR and meta-analysis identified substantial moderate- to high-quality evidence on the frequency of EGFR mutations in NSCLC. The overall prevalence of EGFR mutations identified by the PLR ranged from 4.0 to 28.9% in any stage NSCLC, and 5.0–37.5% in advanced/metastatic NSCLC. The single-point estimates and 95% CIs obtained by the meta-analyses for EGFR mutations were 12.5% (11.0, 14.1) in any-stage NSCLC and 14.8% (12.8, 17.1) in advanced/metastatic NSCLC. These values align well with previously reported estimates in the literature [11], increasing confidence that the exon 20 insertion results identified in this PLR and meta-analysis are robust.

The prevalence of exon 20 insertion mutations identified by this PLR in any-stage NSCLC ranged from 0.3 to 2.2% in overall NSCLC and 2.5–23.1% in EGFR-positive NSCLC. In advanced/metastatic NSCLC, the prevalence of exon 20 insertion mutations was 0.5% in overall NSCLC and 4.0% in EGFR-positive NSCLC. The single-point estimates and 95% CIs obtained by the meta-analyses for exon 20 insertions in any stage NSCLC were 0.7% (0.4, 1.1) for overall NSCLC and 6.1% (4.0, 9.4) in EGFR-positive NSCLC. In advanced/metastatic NSCLC, prevalence was 0.5% (0, 3.0) in overall NSCLC and 4.0% (0.1, 20.4) in EGFR-positive NSCLC. The meta-analysis results provide more exact exon 20 insertion mutation prevalence data, adding value to the broad prevalence ranges identified by the PLR.

The majority of studies reporting on exon 20 insertion mutations were conducted in France (4/10), and these studies contained the largest sample sizes [32,33,34,35]. The one study that used NGS reported a higher prevalence than the majority of studies that used other testing methodologies,[37] including the two largest studies, indicating that NGS may be a more sensitive test to identify these mutations. The majority of studies used direct sequencing, either alone or in combination with other methods of mutation detection. For these studies, prevalence rates ranged from 2.5% to 23.1%, suggesting no clear consistency in results between studies using the same method of mutation detection. Additionally, no clear trend was identified between method of mutation assessment or year of data collection and EGFR mutation frequency in any stage or advanced/metastatic NSCLC, meaning that conclusions about their contribution to heterogeneity in results cannot be drawn.

The SLR and meta-analysis conducted by Zhang 2016 (the reference source for this PLR) and another recently-conducted SLR, Burnett 2021, can be used to contextualise this review's findings [11, 23]. Zhang 2016 included 456 studies (62 from Europe). The EGFR mutation frequency in Europe was 14.1% (95% CI 12.7, 15.5%). This was consistent with the results of this meta-analysis for EGFR mutation frequency (12.5% [11.0, 14.1]). Zhang 2016 did not report prevalence values for specific countries but did report frequency for stage III and stage IV NSCLC at 33.8% (29.8, 7.8) and 37.5% (33.2, 41.7), respectively. These are considerably higher than the estimated prevalence for advanced/metastatic disease from the current meta-analysis (14.8% [12.8, 17.1]). Additionally, the frequency of exon 20 insertion mutations in any stage NSCLC was reported at 1.7% (1.4, 2.0) by Zhang 2016. This is also higher than the estimated prevalence from this meta-analysis (0.7% [0.4, 1.1] in any-stage NSCLC and 0.5% [0, 3.0] in advanced/metastatic NSCLC). This difference may be explained by the fact that Zhang 2016 included locations outside of Europe (including Asia and North and South America) for the exon 20 insertion estimate. For context, EGFR mutation prevalence in any-stage NSCLC was substantially higher in Asia and America (38.4% [36.5, 40.3] and 24.4% [22.1, 26.8]) [11]. Additionally, the race of patient populations may contribute to the difference. In all of the studies included in this PLR that reported ethnicity, ≥90% patients were White. In the Zhang 2016 SLR, rates of EGFR mutation were significantly higher for subgroups of Asian patients (38.8%) compared with Caucasian (17.4%) or African-American (17.2%) patients [11]. This explanation is supported by other studies that have specifically investigated exon 20 insertions in other regions, with frequencies as high as 4.0% for Asia Pacific, 2.6% for the USA and 2.1% for Latin America [23]. This re-emphasizes the need for prevalence estimates specifically for European regions as epidemiological conclusions cannot be generalised across geographical regions.

Burnett 2021 included 78 studies reporting on the frequency of exon 20 insertion mutations (13 from Europe), which in Europe ranged from 0.3 to 1.3% of all NSCLC cases, and 4–12% in EGFR-positive NSCLC [23]. This aligns with the range identified in this PLR (0.3–2.2% amongst a general NSCLC population and 2.5–23.1% within an EGFR-positive NSCLC population), particularly when excluding the small Italian study that reported values at the upper limit of the range or studies that did not have a primary aim of identifying mutation prevalence or frequency. Burnett 2021 did not conduct a meta-analysis, therefore a more precise prevalence estimate was not generated and comparisons cannot be made with the results from the meta-analyses performed in this work [23].

A key strength of this review includes the identification and synthesis of recent, up-to-date evidence for all EGFR mutations, with a particular focus on exon 20 insertions. Furthermore, the meta-analysis provides the added value of synthesising the available evidence from the scientific literature to provide accurate point estimates of prevalence. Although only estimates for exon 20 insertion mutations are reported here, this PLR additionally identified data on other mutation types, including exon 18 mutations, which could contribute to valuable future research. The review was targeted to nine European countries that were of particular interest as previously reported estimates of prevalence in these countries were largely outdated and wide-ranging. Results from this PLR and meta-analysis provide current insights and add to the epidemiological understanding of exon 20 insertion mutations in these countries.

However, there are some limitations to this work. Regarding the review methodology, a single-reviewer approach was used during the study selection and data extraction. As such, it is possible that not all relevant articles were captured, and errors may have been introduced during data extraction. However, it is likely that the articles with high relevance to the review question will have been identified and a second individual independently verified 10% of the extracted information, where there was a low discrepancy rate in the verified data (1.06%). As a result, there are low concerns about missing studies and the quality of the unverified extractions.

There were also limitations in the evidence base. Firstly, only prevalence data were identified, with no reported evidence for the incidence of exon 20 insertion mutations, and data specifically in advanced/metastatic populations of NSCLC were only reported by one study (with other studies including varied proportions of patients with stage I–IV disease). There is therefore higher uncertainty in the results and conclusions for this key population. Furthermore, heterogeneity between different studies, including study design and patient baseline characteristics may mask true differences between different markets or populations. Data reporting on the epidemiology of exon 20 insertion mutations were not reported for subgroups, such as sex, histological subtype and smoking status, so no conclusions can be drawn on how these factors may influence mutation frequency. Additionally, the occurrence of co-mutations or complex mutations and variation in testing methodologies may confound the measured frequency of exon 20 insertion mutations and cause their occurrence to be underestimated. More sensitive mutation testing methods may be required to accurately detect and distinguish between uncommon EGFR mutations to allow for more accurate prevalence estimates.

Lastly, with the PLR’s searches conducted in May 2020, more recently published evidence has not been accounted for. However, a targeted PubMed search in January 2022 identified two key studies reporting exon 20 insertions in European populations since May 2020. One was a large real-world cohort study from France that included 9,435 patients with non-squamous NSCLC [42]. The other included 505 patients from a Spanish registry [43]. Both studies reported exon 20 insertion prevalence within the range of values identified by this PLR, at 3.9% and 4.7% of all EGFR mutations, respectively [42, 43].

The prevalence estimates identified in this review and meta-analysis will be useful in calculations to estimate the upper and lower bounds of numbers of patients in European populations who would be eligible to receive emerging therapies that have specific activity against exon 20 insertion mutations, such as amivantamab or mobocertinib. Useful future work would include larger prospective cohort studies with longer follow-up periods to increase the likelihood of measuring precise results and validating the estimates of EGFR mutation prevalence; provide incidence data; and better clinically characterise the prognosis of patients with exon 20 insertion mutations. Regular updates to European analyses, along with an exploration of data in a wider range of countries, including real-world data from North America and Asia, would also be beneficial, particularly to capture key differences between regions.