Introduction

The global incidence of cancer is rapidly increasing, specifically among older populations [1]. Older patients, however, are strongly underrepresented in clinical trials in all fields of medicine [2]. This is the case for large cancer trials, which are important for the establishment of international guidelines, as well [3,4,5]. Barriers for trial inclusion can be raised by the system, by care-providers, but also by patients themselves [6].

Besides the evident difficulty of including older patients in clinical studies, retaining older patients in clinical studies may be difficult as well, and lead to higher non-response and study drop-out [7, 8]. This may be referred to as ‘attrition’. Especially with the growing use of patient-reported outcome measures (PROM’s), the risk of non-response is lurking, and this may be even more the case in the older and frail population [9]. PROM’s, however, such as questionnaires for quality of life (QoL), are increasingly being recognized as important outcome measures, besides recurrence or survival alone. Specifically for older patients this may be the case, as they may prioritize outcomes such as QoL over length of life, for example [10].

Yet, the occurrence of non-response and study drop-out for older and frail patients relative to their younger and fit counterparts is important to know. Systematic loss of patients from specific study groups may lead to attrition bias [11]. Consequences of this may be under- or overestimating outcomes, misinterpretation of the results and poor generalizability.

The age of patients with head and neck cancer averages around 65 and the burden of geriatric deficits and therewith frailty is large in this population, compared to patients with other solid malignancies [12]. The risk of introducing bias into studies may therefore be high. In our previous studies we encountered that frail patients were more difficult to include because of their poor response to baseline questionnaires [13]. Therefore, the goal of the current study was to investigate whether frail patients exhibit more non-response than non-frail patients to follow-up questionnaires and whether specific items of a routinely performed geriatric assessment (GA) are associated with non-response.

Materials and methods

Study design

This study covers a retrospective analysis of prospectively collected data from the longitudinal observational Oncological Life Study (OncoLifeS), a large hospital-based oncological data-biobank at the University Medical Center Groningen (UMCG), Groningen, The Netherlands [14]. OncoLifeS is approved by the Institutional Review board of the UMCG, this study was approved by the scientific committee of OncoLifeS. In OncoLifeS patients are included after providing written informed consent. Between October 2014 and May 2015, all patients referred with (suspicion of) primary or recurrent cancer in the head and neck area (mucosal, salivary gland and cutaneous) were consecutively included. Patients were seen at the outpatient clinic of the departments of Otorhinolaryngology, Head and Neck Surgery, Oral and Maxillofacial Surgery and Radiation Oncology. Patients underwent a GA, including frailty screening, at baseline, before treatment. Patients were excluded from the analysis when initially palliative or non-standard treatment was conducted or when patients did not return the baseline questionnaires. Also, data of patients were excluded when recurrence or death occurred during follow-up, from that moment onward. Patients were followed up during 2 years after treatment using QoL questionnaires (see Follow-up).

Patient, tumour and treatment characteristics

Patient, tumour and treatment characteristics were withdrawn from the OncoLifeS data-biobank. Disease was staged according to the seventh edition of the Union for International Cancer Control’s TNM Classification [15].

Baseline assessments

Before treatment patients underwent GA, including a frailty screening and assessment of the somatic, functional, psychological and socio-environmental domains. Somatic assessments included scoring of the 27-item Adult Comorbidity Evaluation (ACE-27), polypharmacy (5 or more medications) and the Malnutrition Universal Screening Tool (MUST) [18,19,20]. Functional assessments were Activities of Daily Living (ADL), Instrumental Activities of Daily Living (IADL) and the Timed Up & Go (TUG) with a cut-off at 13.5s [21,22,23,24]. The Mini Mental State Examination (MMSE) and the 15-item Geriatric Depression Scale (GDS-15) were used for the psychological assessments [25,26,27]. Marital status, living situation and educational level assessed for the socio-environmental domain and were registered as part of a standardized questionnaire. Frailty screening consisted of the Groningen Frailty Indicator (GFI) and Geriatric 8 (G8) questionnaires [16, 17].

Follow-up

Patients were followed up using the European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire Core 30 (EORTC QLQ-C30) and Head and Neck 35 (EORTC-QLQ-HN35) at 3, 6, 12 and 24 months after treatment. Follow-up was conducted by sending and returning questionnaires by mail (dept. of Otorhinolaryngology, Head and Neck Surgery, dept. of Oral and Maxillofacial Surgery) or by filling out questionnaires at the outpatient clinic (dept. of Radiation Oncology). This difference between methods was incorporated as a variable in the dataset.

Outcome

Non-response was defined as both complete QoL questionnaires missing in the dataset. This was recalculated to a binary outcome (yes/no) for each of the follow-up moments (3, 6, 12 and 24 months) after treatment initiation, regardless of the previous outcomes, and until recurrence or death occurred.

Statistical analysis

All statistical procedures were performed with SPSS Statistics 28 (IBM). Descriptive statistics are presented as n (%) unless specified otherwise. Generalized Linear Mixed Models (GLMM) were used to calculate odds ratios of the association between frailty and non-response for any data point in the follow-up. As an advantage, this allows for using all data points before exclusion due to death or recurrence and thus reducing risk of bias. Patients with upcoming (but not yet diagnosed) recurrence or death may have worse response; therefore, ‘subsequent recurrence’ and ‘subsequent death’ tested as variables as well. For all models, non-response was the target variable in a binary logistic fashion. For fixed effects an intercept and the predictor variable were included. For random effects an intercept was included and covariance type was set to unstructured. At first, GLMMs were carried out for patient characteristics, both univariate and in a multivariable model. Second, frailty screening instruments and GA items were evaluated in a GLMM, both in an unadjusted model and then in a model adjusted for all relevant patient characteristics.

Results

During the study period, 369 patients with mucosal, salivary gland and cutaneous malignancies in the head and neck area were included in OncoLifeS. After exclusion of patients receiving palliative or non-standard treatment and patients not responding to baseline questionnaires, 289 patients remained in the study for analysis (Fig. 1). The mean age was 68.4 years and 68.5% were male. 54.5% of patients had advanced stage disease. Recurrence, death and response to follow-up questionnaires are shown in Fig. 2. From all patient and study characteristics, age [OR 3.21 (1.80–5.72)], time [per year OR 1.47 (1.10–1.97)] and subsequent death [OR 2.84 (1.62–4.99)] were significantly associated with non-response to follow-up questionnaires, in univariate GLLMs (Table 1). All remained significant in the multivariable model (Table 1).

Fig. 1
figure 1

Flowchart of study inclusion. * = experimental or unknown treatment. n = number

Fig. 2
figure 2

Response and non-response to questionnaires, recurrence and death among patients

Table 1 Patient characteristics and generalized linear mixed models for non-response

Regarding GA items, restrictions in ADL [OR 4.46 (2.04–9.78)], IADL [OR 4.33 (2.27–8.24)], impaired mobility on the TUG [OR 3.95 (1.85–8.45)], signs of cognitive decline on the MMSE [OR 4.85 (2.28–10.35)], assisted living or living in a nursing home [OR 5.54 (2.63–11.67)] were significantly associated with non-response to questionnaires in univariate GLLMs (Table 2). This remained the case after adjusting for patient and study characteristics that showed significance in the univariate model, such as age, time and subsequent death (Table 2).

Table 2 Geriatric assessment, frailty screening and generalized linear mixed models for non-response

Frailty screening by both G8 and GFI, was significantly associated with non-response [OR respectively 2.64 (1.51–4.59) and 2.52 (1.44–4.44)], even after adjusting for the abovementioned factors (Table 2).

Discussion

In this longitudinal observational study, we investigated whether frail patients exhibit more non-response to follow-up questionnaires than non-frail patients and whether specific items of a routinely performed GA are associated with this. Main findings were that frailty screening tools were associated with worse response to follow-up questionnaires. Besides, impaired ADL and IADL, restricted mobility, cognitive decline and dependent living situation were specifically associated with poorer response to follow-up questionnaires. These associations were independent of other significant factors, such as age, duration of the study and subsequent death during the study. To our knowledge, this is the first study demonstrating the association between geriatric factors and response to PROMs in patients with HNC. These results are important for the interpretation of all studies dealing with PROMS, because of the increasing proportion of older and frail patients.

In our study, higher age was significantly associated with non-response during follow-up. This is in line with some earlier studies [8, 28,29,30]; however, other studies found no significant differences [9, 31,32,33]. Comparison is difficult, given the different cancer types (and therewith age groups) and study methodologies which may explain the divergent outcomes.

A recent study, however, did investigate study retention and attrition in a longitudinal study of HNC patients in the Netherlands, collecting PROMs, fieldwork data and biobank materials up to 2 years [34]. In this study, age was not associated with attrition, unlike other factors such as higher tumour stage, poorer physical performance and worse comorbidity score. The latter, comorbidity, was in line with other studies [28, 30, 32]; however, not with our study which identified no significant differences in response between patients with none to mild and moderate to severe comorbidities. A reason for this may be the fact that other studies often assign patients with recurrent disease and even deceased patients to the attrition or non-response group as well. In this way, there is a risk of predicting death or recurrence rather than non-response due to other (geriatric) factors. In our study, we have excluded patients with recurrence or death from the analyses, from the moment that recurrence or death occurred. This gives superior understanding underlying non-response mechanisms.

Other items of GA or frailty screening with respect to non-response, drop-out or attrition have rarely been investigated and not at all in the unique population of HNC. In other studies, the most valuable data available is originating from the PROMs themselves that patients were asked to fill out, but then used at baseline as a predictor for drop-out. Among some different studies in other cohorts, poor functional status, symptom burden, depressive symptoms, cognitive failure, psychosocial symptoms, lower socioeconomic status, low educational level, and poor baseline QoL were associated with attrition [9, 29, 31,32,33]. It must be noted that study methodology differed greatly between studies, and none of the studies specifically aimed HNC. Besides, one may question the ability of a QoL questionnaire subscale to diagnose, e.g. ‘cognitive failure’, often based on just a few questions, compared to specifically developed screening tools such as MMSE in the case of cognition. In our current study, where we have employed well-known and frequently used instruments for GA (and not subscales of the PROMs), we have seen consistent associations of restricted ADL and IADL, poor mobility, cognitive decline and dependent living situation with non-response.

Frailty screening tools, such as the G8 and GFI, were significantly associated with increased non-response as well, which was also expected given the share of functional, cognitive and psychosocial items in the screening tools. This is in line with another study, in which frailty was significantly associated with drop-out from a cohort study [35].

Attrition is common in longitudinal studies, especially with the use of PROMs. When data are missing (completely) at random, this usually does not lead to bias. However, when attrition rates are distinct for different study groups, e.g. in this case when comparing frail to non-frail patients, this may introduce attrition bias [36]. Data may be not missing at random anymore, as for instance frail patients systematically respond worse to the questionnaires and may have different outcomes as well. In such studies, such as in studies evaluating QoL outcomes between frail and non-frail patients [37, 38], the observed differences may be an underestimation of the real difference. Although ideally this should be prevented ahead of time by creating a strategy to take care of frail patients at risk for dropping-out (e.g. alternative study visits, using patients peer support, supportive telephone contacts) [39], it is important to know how to handle and interpret these data. According to experts, mixed models remain the best choice for the analysis of repeated measures and longitudinal data [36].

Strengths of this study include the prospective inclusion of patients, the large range of validated screening instruments, the ability to adjust for relevant covariates such as subsequent death or recurrence and study characteristics, and the maximum use of data points using mixed models and therewith limiting bias as much as possible. Limitations of this study may be the different collection methods of PROMs between departments (which was adjusted for), the absence of information why patients dropped out, the relatively small and heterogeneous study cohort, which included both mucosal as cutaneous malignancies. Besides, by excluding patient not responding to baseline questionnaires, some form of bias may already be present from the beginning.

Conclusion

Frailty, measured by deficiencies on GA, such as impaired ADL and IADL, restricted mobility, cognitive decline and dependent living situation, or by frailty screening instruments (G8 and GFI), is significantly associated with worse response to follow-up PROMs. This is of importance when handling and interpreting data on older or frail HNC patients, as with the resulting attrition bias the observed effects may be an underestimation of the real differences. Not only researchers but also clinicians need to be aware of this potential bias during the interpretation of studies dealing with PROMs, as the frailest patients are less likely to be included and more likely to be lost from follow-up.