Introduction

“Evidence based medicine (EBM) is the conscientious, explicit, judicious and reasonable use of modern, best evidence in making decisions about the care of individual patients” [1]. This ensures that the best research is integrated with clinical practice to provide optimum patient care. Simply put, EBM requires an evidence base.

The Australian Spine Registry published an annual report in 2019, identifying that the age group of highest prevalence requiring spinal surgery was 70–79 years old [2]. This is unsurprising as ageing is a natural and inevitable process which is time-related and leads to the deterioration of necessary physiological functions for survival [3]. Specifically in the spine field, degenerative changes due to age alone were found in the spines of over 80% of patients over a period of 10 years, this was independent of other factors such as gender, smoking, and body mass index [4]. In addition to the increased comorbidities associated with age, a significant body of literature has demonstrated that older adults have worse post-operative complications and poorer outcomes following spinal interventions [5,6,7,8,9]. Therefore, as convenient as it would be, trial data from younger patients must not be applied to older patients as the external validity of these trials would be compromised.

Although ageism has been studied in other areas of medicine, within the spine field, there has only been one systematic review to our knowledge. Paeck et al. [10] conducted a systematic review with meta-analysis specifically looking at the exclusion of older adults from lower back pain randomised controlled trials (RCTs). Approximately 41% of RCTs from 1992 to 2010 excluded people over the age of 65-years, and the mean age of participants was 44.3 years old. This study was, however, conducted in 2014, and the authors acknowledged that one of the limitations was that they did not categorise exclusion of participants into age-groups, but rather, just divided studies into whether or not they had an upper age limit i.e., explicit exclusion.

Other studies included qualitative questionnaire-based studies, seeking opinions of health professionals, where majority agreed that older adults were often excluded from clinical trials [11, 12]. Ludmir et al. [13] also studied the age disparities in oncology trials by comparing the median ages of participants in the trials, and the median age of the actual target population. Although they did not look specifically at the exclusion of the ageing population, they noted that in all trials, the median age of participants in clinical trials were on average 6.49-years younger than the actual population age. There was also a significantly greater difference in median age in industry sponsored trials.

Moreover, in a systematic review conducted by Bourgeois et al. [14], it was concluded that elderly people were excluded from RCTs for drugs for ischaemic heart diseases. The type of funding of RCTs were also extracted in the process. RCTs with upper age cut-offs were more likely to be funded by non-industry sources.

The main gaps identified included the comparison of trials with an explicit upper age cut-off to trials that did not have an age cut-off. By doing this, explicit exclusion has been identified, however this does not translate to the applicability of evidence. Applicability of research goes beyond the explicitly stated upper age cut-offs. For example, a study with an upper age cut-off of 100-years-old might have participants with a median age of 40-years-old, with it’s 75th percentile age being 60-years-old. This means that 25% of less of the participants are 60-years-old and above, making it highly unapplicable for anyone over 60-years-old. Most studies also did not categorise exclusion of older adults by age groups, making it difficult to comprehend the extent of ageism.

Hence, we sought to understand the extent of ageism—the exclusion of a demographic based solely on their age—within the spinal research field by calculating how applicable the current evidence base was for each age group, bearing in mind that the patients with the age group of highest prevalence receiving spinal intervention were 70–79-years-old [2]. Our hypothesised reasons for age-based exclusion included the presence of upper age cut-offs when recruiting participants for trials, as well as the effect of industry sponsorship on trial design.

Methods

Framework

This research took the form of a systematic review [15]. Hence, Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines [16] were followed as applicable.

Informed consent was not obtained because no individual patient data was collected and we performed a meta-analysis only of publicly available data.

Eligibility criteria

Our eligibility criteria included any RCTs published from the 1/1/2016 to the 31/12/2020, studies published only in the English language, and published in the 7 spine journals most commonly used by spinal research professionals [17]—European Spine Journal, Spine, The Spine Journal, Global Spine Journal, Journal of Neurosurgery, Journal of Spinal Disorders Techniques, and Asian Spine Journal. Full texts were screened and studies containing any paediatric patients (under 18 years-old), studies that were not RCTs, and RCTs on animals and cadavers were excluded.

Search strategy

The search was performed in PubMed on the 23/03/2021 with the search strategy [“randomized controlled trial” (Publication Type) AND (“2016”[PDAT]:“2020”[PDAT])] AND (((((“Spine”[Journal] OR “European spine journal: official publication of the European Spine Society, the European Spinal Deformity Society, and the European Section of the Cervical Spine Research So0ciety”[Journal]) OR “The spine journal: official journal of the North American Spine Society”[Journal]) OR “Journal of neurosurgery. Spine”[Journal]) OR “Asian spine journal”[Journal]) OR “Global spine journal”[Journal]).

Data extraction

Dual extraction of the data was carried out by two members of the research team. Data identified included any explicit age cut-offs or nominal upper age, mean age, standard deviation of the mean age, median age, interquartile range, maximum age recruited, sponsorship type of research, and a broad classification of the intervention type. The mean age of the entire research population was extracted, however, if it was not provided, the mean age of the intervention group was chosen. The type of sponsorship was divided into three categories—industry sponsored, publicly sponsored, and no sponsorship. Classification of intervention types were divided into implants, surgery, interventional radiology, pharmacology, psychology and cognitive therapy, physiotherapy, bracing, and other. Any discrepancies between the dual extraction were resolved by adjudication by a third reviewer not involved in the extraction process.

Data transformation

The number of applicable trials were calculated for each 5-year age category from 40 to 100 years old. Trials were considered applicable if 25% or more of patients in the trial were of the patient’s age or older. This was determined by the using any given 75th percentile ages. If this was not provided, a 75th percentile age was extrapolated from the implied normal distribution using the mean age and standard deviation. This was calculated by the formula:

$$75{\text{th}}\;{\text{percentile}}\; = \;{\text{Mean}}\; + \;{\text{Standard}}\;{\text{Deviation}}\; \cdot \;0.674$$

For secondary outcomes, the mean ages of patients in clinical trials with explicit upper age cut-offs were compared with those that did not have an upper age cut-off. The mean ages of patients in clinical trials with industry sponsorship were compared with public sponsorship as well as no sponsorship.

Data analysis and presentation

Data was analysed using SPSS Statistics and Microsoft Excel and will be presented primarily with estimation statistics and confidence interval testing. Data will be presented on heat maps, Gardner–Altman plots, and Cumming plots. Null-hypothesis significance testing—Chi-square and t-tests—will also be used.

Results

In total, 228 records were identified with the search strategy on PubMed. Full text articles were screened and assessed for eligibility, in which 42 were excluded due to the eligibility criteria. 186 studies were included with a total of 26,238 participants. Of the remaining 186 studies, 75th percentiles were obtained or extrapolated from 146 studies, which were used as our primary outcomes. Of the 186 studies, 173 studies had means which were used to for the secondary outcomes. 94 out of the 186 studies, approximately 50%, had explicit age cut-offs.

7 out of 186 trials explicitly stated that they excluded participants based solely on an arbitrary upper age limit (Fig. 1).

Fig. 1
figure 1

PRISMA flowchart

Primary outcomes

Primary outcomes are displayed in the form of a heatmap in Figs. 2 and 3, where the values in the figures are expressed as a percentage. Figure 2 demonstrates the number of studies applicable with the denominator excluding studies without a 75th percentile, while Fig. 3 demonstrates the number of studies applicable with the denominator including all studies, including those without a 75th percentile. The highest percentage, 100%, is shaded green and the lowest percentage, 0%, is shaded red.

Fig. 2
figure 2

Heatmap of the percentage of trials applicable for each age category from 40 to 100 years old with the denominator excluding trials without a 75th percentile

Fig. 3
figure 3

Heatmap of the percentage of trials applicable for each age category from 40 to 100 years old with the denominator including all trials

Our findings demonstrate that older adults are at a higher risk of being excluded from RCTs. The overall trend observed was that as participant age increased, the number of relevant clinical trials decreased (Figs. 2, 3). This trend was evident in all intervention types. Overall, approximately 55% of studies were applicable to a 55-year-old individual, 6% of trials apply to a 75-year-old, and no trials were applicable to an 85-year-old individual. The implant intervention had the highest rate of exclusion, with no trials applicable to a 75-year-old individual.

Figure 3 exhibits a higher rate of exclusion in each 5-year age category. The denominator of the values in Fig. 3 included all trials, including those that we were not able to extrapolate a 75th percentile from. Approximately 44% of studies were applicable to a 55-year-old individual, 5% of trials were applicable to a 75-year-old individual, and no trials were applicable to anyone aged 85 and older.

Secondary outcomes

Explicit age cut-offs

84 out of the 173 studies (48%) with means had explicit age cut-offs. The mean age of patients in trials with cut-off ages and no cut-off ages were compared. Clinical trials with explicit age cut-offs were more likely to have a younger mean age. The data is represented in a Gardner–Altman estimation plot (Fig. 4). Participants in clinical trials with a cut-off age had a mean age of 46.8 years old, while participants in clinical trials without a cut-off age had a mean age of 55.9 years old. The mean difference was 9.1 years old with a 95% confidence interval of 6.2 and 11.8. The P value of the t-test was 0.0, the difference was statistically significant.

Fig. 4
figure 4

Gardner–Altman estimation plot displaying the mean difference between trials with cut-off ages and trials with no cut-off ages. Each trial is depicted as a dot on the left axes, and the mean difference is depicted as a dot on the floating axes on the right as a bootstrap sampling distribution. The 95% confidence interval is depicted by the ends of the vertical error bar

Sponsorship

Out of the 173 trials with mean ages, 55 had industry sponsorship (31.8%), 63 public sponsorship (36.4%), and 55 had no sponsorship (31.8%). The mean age of participants in clinical trials with public sponsorship and no sponsorship were compared against industry sponsored clinical trials. The type of sponsorship did not affect the mean ages of the clinical trials. The data is displayed in a Cumming estimation plot (Fig. 5). The mean ages for industry sponsored, publicly sponsored, and unsponsored clinical trials were 50.8, 50.7, and 53.0 years respectively. The mean difference between industry and publicly sponsored clinical trials was 0.1 with a 95% confidence interval of − 4.1 and 3.63. The mean difference between industry sponsored and unsponsored trials was 2.14 with a 95% confidence interval of − 2.1 and 6.24. The P values of the tests were 0.96 and 0.31 respectively. The difference was not statistically significant.

Fig. 5
figure 5

Cumming estimation plot comparing publicly sponsored and industry sponsored clinical trials against unsponsored clinical trials. The upper axes demonstrate the raw data, and the lower axes demonstrate the mean differences depicted as a dot and plotted as bootstrap sampling distributions. The 95% confidence interval is depicted by the ends of the vertical error bar

Fisher’s exact test was conducted on SPSS. The P value was 0.116—the type of sponsorship did not affect the likelihood of an explicit cut-off age in clinical trials (Table 1).

Table 1 Contingency table used for Fisher’s exact test

Discussion

Our primary results identified the lack of applicability of most spinal clinical trials for the ageing population. There was a decrease in applicability of spinal research with an increased age. The 75th percentile was used as a threshold for determining applicability, as it allows flexibility for normally and not normally distributed data, given that non-normal data is usually presented with an interquartile range. Figure 3 was created in addition to Fig. 2 as it expresses the true percentage of research applicable to an individual if they require treatment. Because the denominator for Fig. 3 also includes where trials cannot be applied because inadequate information is given to determine the spread of participant ages, it gives a stronger indication of evidence applicability as published.

Estimation statistics was primarily used to illustrate our secondary outcomes as overreliance on null-hypothesis significance testing—an accept or reject dichotomy—can divert attention of readers from the quantification of effect [18]. As hypothesised, clinical trials with explicitly stated cut-off ages had a significantly younger group of participants. While explicit age cut-offs do influence the participant ages, this only captures and explains a small part of exclusion of older adults, as less than half of the trials have explicit age cut-offs.

Industry sponsored trials in spine research have been shown to have an odds ratio of 3.3 times higher positive outcomes [19]. One of the potential reasons for this includes bias in study design, including the selection criteria of participants. Hence, sponsorship types were extracted and the mean ages in each sponsorship type was compared. Contrary to our hypotheses, the type of sponsorship did not affect the mean ages of participants in RCTs.

Upon examination of the results, we conclude that exclusion goes beyond explicit age cut-offs and sponsorship. Exclusion exists in three different layers—explicit exclusion, implicit exclusion, and systemic exclusion. Explicit exclusion of older adults is most noticeable, with a clearly stated age exclusion criteria. Implicit exclusion is more insidious—it is when there is no explicit exclusion, however, participants in the clinical trials do not match the research question. Essentially, very few older patients are actually included, even when they are technically not excluded. This was very obviously demonstrated by Ludmir et al. [13] when median ages of research participants were compared to median ages of patients in his study. Explicit and implicit exclusion applies to individual trials. A third layer of exclusion was identified in this study—systemic exclusion. This is demonstrated in Fig. 6, where the research question might match the research population, however, there is a mismatch with the actual population. Research as a whole is targeted towards diseases impacting younger people. In essence, there are discrepancies in where research efforts are going. Systemic exclusion goes beyond the level of individual trials and arises at a supra-trial level. Our results from this study indicate that systemic exclusion exists in the spine field.

Fig. 6
figure 6

Illustration of systemic exclusion demonstrating a normal distribution of the ages of participants in a research population, research question, and burden of disease

Reasons for the exclusion of the elderly range from logistical factors to patient and family related factors. These include difficulty in accessing hospitals and transportation, the requirement of technology in many newer trials, and compliance with the suggested intervention [20, 21]. Furthermore, age-related hearing loss is not uncommon in the older population, which can create a barrier in communication [22]. Older adults are also at a higher risk of cognitive decline [23] which can affect their capacity when obtaining informed consent.

Older adults are more prone to multimorbidity, “the simultaneous occurrence of two or more diseases that may or may not share a casual link” [24]. Doctors may exclude participants due to concerns of unexpected side effects and intolerance of the treatment.

Supra-trial reasons for exclusion include resistance to change, an absence of models to follow, and a lack of adequate resources [25].

As describe previously, the ageing population is increasing and there has been a shift in epidemiology. Especially within the spine field, the marginalisation of older people in trials must be eliminated as many spinal conditions are age-related. Interventions used on younger adults may have differing efficacies for older adults. Although rates of complications are not expected to be identical, this information is critical for patient decision making. Chang et al. [26] conducted a systematic review with 7 million participants in 2020 and found that worse health outcomes were found in 95.5% of studies that excluded the elderly. Strong evidence was also found between ageism and elderly mistreatment, including physical abuse, financial abuse, sexual abuse, psychological/emotional abuse, and neglect [27].

Aside from the ethical concerns that arise from that, poor health outcomes lead to inevitable consequences—individuals aged 60 years or older account of 23% of the global burden of disease [28], and age-related diseases account of 51.3% of global burden among adults in 2017 [29]. In the United States of America, data from the Medicare Current Beneficiary Survey shows that medical expenses double between 70 and 90-year-olds, and that 52% of medical expenses stems from the top 10% of spenders [30]. In China, a country with an increasing ageing population, the medical expense of the ageing population per capita was almost three times more than those under 65-years of age [31].

While certain elderly individuals will need to be excluded due to certain comorbidities for health and safety reasons, this should not hinder research in the ageing population. Exclusion due to age alone for convenience is unethical. Researchers should never exclude participants solely based on an arbitrary age limit. Inclusion is essential.

To prevent ageism, it is unacceptable to just lift the upper age limit. Rather, targeted and specific research should be conducted for the elderly. On an individual trial basis, geriatricians should have an increased input in interventions as they are in a strong position to reduce unethical discrimination [32]. Researchers should also eliminate any bias as this can cause implicit exclusion. Ethics committees should request for justification for any explicit age cut-offs when looking at the methods of a study. Although these are small interventions, any small-scale intervention has been shown to yield a positive result [33].

However, as identified in this study, exclusion occurs on a supra-trial level. Table 2 identifies the roles of different bodies in preventing exclusion. While sponsorship did not have a significant impact, more funding should be provided towards trials for the ageing population as this encourages and provides incentives for research towards this group. More models of care and research protocols specific to the older population should be created and publicised to facilitate future research. Finally, we identify that more meta research should be undertaken to examine the impact of trial age populations across individual interventions. Ultimately, the end goal is to achieve optimal patient care.

Table 2 Roles of different bodies to prevent exclusion