Introduction

Patients with advanced cancer have an illness trajectory that typically begins with a period of relatively preserved functional status followed by a period of marked deterioration and increased symptoms during the last months of life [13]. This period of functional decline is associated with frequent hospitalization and increased costs of care. Analysis of Medicare data for patients who died with poor-prognosis cancers indicates that 65 % were hospitalized and 25 % had intensive care unit stays within 30 days of their death [4].

Early identification of patients who are approaching the end of life can improve patient outcomes by improving clinical decision-making. Earlier referral to hospice and palliative care can result in improvements in quality of life and even length of life [57]. Aggressive anticancer treatment in the terminal phase increases costs of care, may reduce quality of life, and may fail to yield any improvement in survival [8]. Between 20 and 50 % of patients with advanced cancer receive chemotherapy within 30 days of death [9]. Patients with systemic chemotherapy in the last months of life have a substantially increased risk of dying in an intensive care unit [10]. Chemotherapy in the last weeks of life is also associated with reduced likelihood of receiving hospice care [11].

Patient self-reported health status measures are useful for estimating survival in cancer patients and can provide unique information of importance for monitoring patient health status trajectories [1215]. While numerous patient-reported outcome measures have been developed over the years, recent national efforts have produced standardized assessment measures. Beginning in 2004, the National Institutes of Health sponsored the development of the Patient-Reported Outcomes Measurement Information System (PROMIS) to provide a uniform methodology for quantifying physical, mental, and social health across patient populations [16, 17]. The PROMIS project measures provide reliable and valid assessment of patient self-reported health status for use in cancer patient care delivery and clinical research [18, 19]. This analysis examines the relationship between survival and PROMIS domain measures among patients with advanced cancer at a large health system and provides evidence regarding their potential utility in identifying those with declining symptoms in need of referral to palliative care or more aggressive symptom management.

Methods

Computer-based assessment of patient-reported outcomes (PROs) was implemented for patients with advanced loco-regional or metastatic cancer (stage 4 solid tumor or other advanced cancer identified as incurable by the referring oncologist or surgeon) who were referred to this program at the University of Virginia Health System Cancer Center. Palliative care services include an outpatient clinic located in the cancer center, inpatient consultation, a palliative care unit, and home hospice components. Patients referred to this clinic are typically followed concurrently with medical oncology, radiation oncology, and surgical oncology for symptom management and end of life care planning. The palliative care clinic staff includes RNs, MDs and NPs, with available social worker and chaplaincy services.

Assessment was initiated for all patients beginning in October 2013, following a pilot testing phase completed in February 2013 [20]. The set of PRO measures collected for each patient was selected based on national recommendations and physician and patient stakeholder review [21]. The assessment includes five PROMIS health domains: anxiety, depression, fatigue, pain interference, and physical function. Additional PROs were developed for the project to address the following cancer-specific symptoms: bowel function, nausea, vomiting, anorexia, dyspnea, and peripheral neuropathy.

Patient data available from the health system Electronic Health Record (EHR) and from health system administrative records was collected to identify patient demographics, cancer diagnoses, and survival outcomes for all patients with assessments. Cancer diagnoses were identified using ICD-9-CM diagnosis codes reported in hospital and outpatient care encounter summaries for events occurring up to 2 years prior to the date of the first assessment. Individual ICD-9-CM codes were grouped into categories of site-specific cancer diagnoses using the criteria of the Agency for Healthcare Research and Quality Clinical Classification System [22].

The collection of PROs was implemented using software integrated within the patient’s electronic health record. The assessment software was accessed by patients through an online browser using a tablet computer. The assessment protocol included instructions on the assessment process and on the use of the tablet computer, which was provided by the clinic staff to patients at each visit. Direct assistance with the assessment process was provided by clinical staff to any patient who requested assistance. Patients completed an assessment at each routine follow-up appointment, before their clinical encounter. The availability of direct assistance with the assessment was included in the protocol based on the results of the pilot study, which demonstrated that while most patients were able to complete the assessment without assistance, some patients required direct assistance with the use of the tablet computer, with the assessment software, or with the assessment questions [20].

Completed assessments were posted immediately to the patients’ EHR records. Displays of both current and longitudinal results were available during the patient encounter for use in therapeutic decision-making, assessment of response to treatment, symptom management, and end of life care planning. This analysis includes results collected for the five PROMIS health domains during the 18-month period from October 2013 through March 2015.

Each PROMIS domain was scored using the version 1.0 short form item series, with 5 ordinal response options (1 to 5) for each question in the domain [23]. Mean scores for the total responses indicate whether patients have more or less of the domain being measured. For example, higher mean scores indicate worse anxiety, depression, fatigue, and pain interference, but better physical function. Mean scores were calculated for each domain, for each patient assessment. Mean scores were also calibrated to the weighted distribution of mean scores from a large representative sample of the US general population, using the “T-score” algorithm [24]. Population-calibrated scores are referent to the surveyed population mean of 50 and standard deviation of 10, such that a patient with a T-score of 40 in the study population has a score that is one standard deviation below the US general population mean. Frequency distributions for PROMIS domain score values were calculated at the mean score scale and T-score scale.

Patients in the study population were referred at different points in time in the progression of their disease. Given this heterogeneity in stage of disease progression among patients, there was no shared reference point for defining the start point of trajectory patterns measured prospectively in the total study population. Rather, trajectory patterns were assessed retrospectively for deceased patients only, using the patient’s date of death as a shared reference point for the end of the trajectory.

Multi-level mixed-effects models were used to estimate the significance and direction of PROMIS domain score trajectories over time. Differences among patients in the number of assessments and in the spacing between assessments prior to death are accommodated by the hierarchical structure of these models [25]. The shape of the score trajectories was estimated using linear, quadratic, and spline functions. Linear models estimate only the linear association between scores and months prior to death. Quadratic models have components for estimating both the linear and non-linear association. Spline functions are piecewise polynomials with components that combine into curved functions fit to the pattern of scores. Spline functions yield maximum precision for estimation of non-linear relationships [26]. The statistical significance of each functional relationship was assessed using the fixed effect F-test statistic p value calculated for each model parameter. The extent of non-linearity in the relationship between PROMIS scores and months prior to death was measured using the Akaike information criterion (AIC) statistic to calculate the probability of information loss from excluding non-linear relationships in the models [27].

Cox regression analysis was used to assess the statistical significance of differences in the probability of survival by months post assessment associated with PROMIS domain scores for all patients in the study population. The statistical significance of the model score coefficients was assessed using the Wald chi-square test statistic. Survival probabilities at the time of assessment were estimated using models with time-dependent covariates conditional on the interaction between the PROMIS domain score and the number of months elapsed since the beginning of follow-up [2830]. A logarithmic function was used to represent the non-linear shape of change in scores over time [31]. Probabilities of survival at 6 and 12 months post assessment were estimated with reference to the 25th, 50th, and 75th percentiles of the domain score distributions. The statistical significance of the difference in survival by quartiles of the PROMIS domain scores was calculated using the stratified log-rank test statistic.

Data management, statistical programming, and graphics programming was conducted using SAS 9.4 and R statistical software, version 2.13 (R foundation for Statistical Computing, Vienna, Austria). The statistical significance of parameter effect estimates and of test statistics was assessed at the a priori threshold value of p < 0.01.

Results

All patients presenting in the clinic who met the selection criteria were included in the study, and all selected patients were able to complete the assessments. A total of 472 patients completed assessments during the 18-month study period, and 203 of these patients (43 %) died during the study period. Demographic and diagnostic characteristics of the study population are listed in Table 1. The mean age was 58.9 years, and 56.6 % of patients were female. Cancer of the bronchus and lung (17.4 %), breast (14.6 %), head and neck (11.9 %), ovary (9.1 %), colon (8.5 %), and other cancers for gastro-intestinal sites and peritoneum (8.9 %) were the most frequently occurring cancer sites reported for the study population.

Table 1 Study population demographics and cancer diagnosis groups

A total of 1992 assessments were completed during the study period, with repeated assessments available for most patients: >1 (73.3 %), >2 (55.1 %), >3 (41.5 %). The mean number of assessments per patient was 4.2, with a mean of 3.8 months between assessments. Figure 1 illustrates the patterns of consecutive scores reported for three example patients who died during the study period.

Fig. 1
figure 1

A plot of example assessment score values reported for three patients who died during the study period, for each PROMIS domain. Dates of assessment are indexed by months prior to death along the horizontal access, and mean domain score values reported for each assessment are represented on the vertical axis. Consecutive scores for example patients are linked by reference lines, for each of the three example patients identified in the plot legend

Table 2 lists the distributional characteristics of the PROMIS mean domain scores and the T-score scaled mean domain scores in the study population. The T-score scaled results demonstrate that patients in the study population had overall score means that were higher than the US general population medians for anxiety (56.8), depression (51.9), fatigue (56.0), and pain interference (58.1) and had lower mean scores for physical function (44.4).

Table 2 PROMIS score distributions and statistical model results

Table 2 also displays the multi-level mixed-effects model analysis results for functional trends in scores during the months prior to death. Statistically significant linear relationships were demonstrated for fatigue, pain interference, and physical function scores. Linear trends in anxiety and depression scores were not statistically significant. Statistically significant non-linear trends were demonstrated by the quadratic model for fatigue, pain interference, and physical function scores. Quadratic model terms were not significant for trends in anxiety scores (p = 0.4207, p = 0.5055) and were near significance for trends in depression scores (p = 0.0599, p = 0.0545) over the months preceding death. The spline models demonstrate that non-linear function components were significant or nearly significant for each of the five PROMIS score domains. Figure 2 illustrates the shapes of the estimated spline model trajectories for each PROMIS score domain among the deceased patients. Non-linear functional transformations using the spline models significantly improved the fit of trajectories estimated for the depression, fatigue, pain interference, and physical function scores. The probability of information loss associated with excluding non-linear terms was 1.0 for each of these four PROMIS domains.

Fig. 2
figure 2

A plot of the spline function trajectory of estimated score values reported for deceased patients, for each PROMIS domain. All mean score values are plotted, with dates of assessment indexed by months prior to death along the horizontal access, and mean domain score values represented on the vertical axis, for each PROMIS domain. The spline function trajectory of estimated score values is depicted by bars plotted for each assessment date, superimposed on the scatterplot of all score values

Figure 3 presents plots of the exact functional relationships over time for scores at the 25th, 50th, and 75th percentile reference values of each of the five PROMIS domain score distributions. Reference values for the mean score and T-score values corresponding to the 25th, 50th, and 75th percentiles of each PROMIS domain score distribution are listed in Table 2. Results for the stratified log-rank test statistic demonstrate that the differences in survival functions by quartiles of the anxiety and depression scores were not statistically significant. However, highly statistically significant differences in survival were demonstrated for quartiles of scores for fatigue, pain interference, and physical function (p < 0.01).

Fig. 3
figure 3

A plot of the survival functions estimated for all patients in the study population, with reference to the mean scores at the 25th, 50th, and 75th percentiles of the overall domain distributions, for each PROMIS domain. Months post assessment are indexed on the horizontal access. Estimated probabilities of survival are indexed on the vertical axis

The predictive relationship of PROMIS domain scores to post assessment survival was evaluated for all patient assessments. Table 2 lists results obtained from the Cox regression analysis. Each of the PROMIS domain scores had highly statistically significant relationships to patient survival, except the PROMIS domain scores for anxiety (p = 0.8692). Large absolute differences in survival at 6 and 12 months post assessment occur between patients with domain scores at the 25th and 75th percentiles for fatigue, pain interference, and physical function. Patients who reported high levels of physical function (scores at 75th percentile) had a 0.99 probability of 6-month survival and a 0.89 probability of 12-month survival. In comparison, patients with low levels of physical function (25th percentile) had a 0.89 probability of 6-month survival (−0.10 point difference) and a 0.72 probability of 12-month survival (−0.17 point difference). Table 2 lists the absolute differences in 6- and 12-month survival for patients at the 25th and 75th percentiles of each of the five PROMIS domain scores.

Discussion

This research demonstrates that PROMIS scores for fatigue, pain interference, and physical function provide reliable indications of reduced 6- and 12-month survival in patients with advanced cancer. In our study population, patients with scores at the high or low ends of the scales demonstrate clinically meaningful absolute differences in the probability of survival at 6 and 12 months after assessment.

The PROMIS assessment tools provide a standardized and readily available method for scoring fatigue, pain interference, and physical function. Regular computer-based patient assessment provides longitudinal information useful for informing clinical decisions during patient visits, while ensuring that patient data is fully protected within the EHR. When PROs are available in routine EHR reports, patients with severe symptoms or trend patterns of concern can be identified early and referred to appropriate services prior to symptom escalation to the point of requiring inpatient management.

Patient-reported outcomes are increasingly recognized as relevant to oncology. Patient-reported outcomes provide a unique patient perspective on treatment benefit and risk that goes beyond clinician-reported adverse events [32]. Patients and clinicians often differ in their assessment of the severity and impact of cancer and treatment-related symptoms [33]. As a result, PROs can provide crucial information for guiding patient care. The Center for Medicare and Medicaid Innovation is developing new payment and delivery models for oncology care that utilize patient-reported outcomes assessment and recommends the use of the PROMIS short forms for this purpose [34].

In the era of increasingly shared care and shortened patient visits, computer-based assessment of PROs can enhance the continuity of care provided. Clinicians often must make decisions using information from a snapshot in time, and without the patient’s longitudinal history being available and readily accessible at the point of care, it can be difficult to identify patients with meaningful changes in these status measures. Longitudinal PROs in the EHR are highly useful for identifying patients with reduced probabilities of survival, and for identifying patients with declining symptoms in need of referral to palliative care or more aggressive symptom management. We hypothesize that end of life health interventions that target reducing fatigue and pain interference and increasing physical function are the ones most likely to improve quality of life and prolong survival.