Background

Insight into the change from pre- to post-injury health status of trauma patients is important in order to derive population estimates of the impact of injuries on health-related quality of life (HRQL). The difficulty in measuring the impact of injuries is that the patient’s HRQL after sustaining an injury may be influenced by factors other than the injury [1]. For instance, pre-existing comorbidity may contaminate our estimates of the injury-related disability, since HRQL scores might incorporate the impact of one or more comorbid diseases instead of solely reflecting the impact of the injury. To overcome attribution bias (i.e., attributing post-injury HRQL scores solely to the injury when it may have been caused by other factors), information on pre-injury HRQL is vital to make valid estimates of the change from pre- to post-injury HRQL due to the injury under study. However, prospectively collected information on the pre-injury HRQL of injury patients is difficult to obtain.

This has led researchers to use alternative methods to assess the contrast between pre-injury and post-injury HRQL, such as use of patient recall or retrospective baseline scores (in other words, pre-injury HRQL that is assessed after sustaining the injury). However, retrospective baseline scores of pre-injury health status are potentially subject to bias [2, 3]. Patients may remember their pre-injury HRQL as better or worse than it actually was (recall bias) [2]. Moreover, patients’ perception on HRQL may change after the injury, due to a change in internal standards or values (response shift) [4]. This change in perception of HRQL after the injury may also affect the retrospectively assessed pre-injury HRQL.

Other methods are the application of general population norms (i.e., using normative values from the general population as a reference point for the health status before the injury), or the use of a matched non-injured comparison group as a baseline to assess the reduction in health due to the injury. The application of population norms or a matched non-injured comparison group may lead to an inaccurate estimate of the change in health status, as injured people may differ from the general non-injured population [5, 6]. Research indicated that injured people have a higher prevalence of comorbidity, hospitalization, and health service utilization prior to their injury in comparison to non-injured people [5]. This suggests that pre-injury health status is worse compared to population norms and conflicts with the reported better pre-injury health status compared to the general population [68]. On the other hand, the injured population might be healthier and more likely to participate in activities, exposing them to a higher risk of injuries [6].

The current systematic review identifies the methods that are used to assess pre-injury health status of trauma patients and to estimate the change from pre- to post-injury HRQL due to an injury. Moreover, bias that may occur from these methods is examined, by comparing the reported pre-injury HRQL scores with population norms by calculating age- and gender-specific norm scores based on the demographics of the included study samples.

The objectives of this study are:

  1. 1)

    To identify the methods which are used to measure pre-injury HRQL;

  2. 2)

    To compare the reported pre-injury HRQL scores with calculated general age- and gender-adjusted norms;

  3. 3)

    To address the pre-injury HRQL scores per HRQL instrument and injury type;

  4. 4)

    To examine the change between pre- and post-injury HRQL in injury patients; and

  5. 5)

    To formulate recommendations for future studies on (pre-injury) HRQL.

Methods

Relevant studies were identified through systematic literature searches in the databases EMBASE, MEDLINE (via Ovid SP), Cochrane Central, PubMed, Web of Science, SCOPUS, PsycINFO, CINAHL, Lilacs, Scielo, ScienceDirect, and ProQuest. Grey literature was examined via Google Scholar. Search strategies were developed in consultation with a search expert, and included a combination of subheadings and text words (Appendix). Reference lists and citation indices of the included papers and relevant reviews were inspected to identify additional relevant citations.

Study selection

We included studies that assessed the pre-injury HRQL of injury patients, published in English in peer-reviewed journals until July 6, 2015. We included studies on general injury populations, as well as injury-specific studies (e.g., traumatic brain injury or hip fractures). There was no restriction in the methods of patient selection used in the studies (e.g., samples drawn from the Emergency Department (ED), hospital, or outpatient programs). HRQL was conceptualized as an individual’s perception of how an illness and its treatment affect physical, mental, and social aspects of his/her life [9]. Studies that assessed only some domains of HRQL (e.g., functional status, activities of daily living, mobility, mental health) were excluded. We included studies that assessed the HRQL of patients before the injury, whether assessed before the injury or retrospectively. Studies that solely used population norms, as a substitute of pre-injury HRQL, were excluded. For studies using data from the same study sample, one study was chosen as the reference study by giving priority to the study that focused on reporting pre-injury HRQL summary scores or utility scores (e.g., instead of percentage of problems per HRQL domain).

Data extraction and methodological quality

The first review author (AS) screened all titles and abstracts and deleted obviously irrelevant papers. Two independent review authors (AS and SP) screened the remaining citations on title and abstract and those obtained in full text. Results from both reviewers were compared by a third review author (JH) and any disagreement was be resolved by discussion between the three authors.

We extracted information on the participants (age and gender), injury (type, severity, and mechanism), the assessment of pre-injury HRQL (instrument, procedure, and timing), and recovery of injury patients (change between pre- and post-injury HRQL).

The methodological quality of the studies was evaluated with four elements of the STROBE checklist [10] which were most relevant to the quality of reported pre-injury HRQL by injury type: setting, participants, data sources/measurement, and study size. In addition, risk of bias was assessed using items from the Research Triangle Institute item bank for observational studies on attrition bias (“Impact missing data adequately assessed”) and reporting bias (“No important primary outcomes missing”) [11].

Statistical analysis

Pre-injury HRQL scores from the study samples were compared with norm scores derived from the general population. To provide population norms for all studies, we used norms by age and sex groups of the EQ-5D (UK population) [12], SF-36 [13], and SF-12 [14] (both US population) to calculate age- and gender-adjusted norms based on the demographics in the study samples.

Heterogeneity between pre-injury HRQL scores was assessed with the Q-statistic and I2-statistic, using a random-effects model in a Microsoft Excel spreadsheet [15]. The Q-statistic is a Chi2-test for heterogeneity, which assesses whether observed differences in results are compatible with chance alone. A significant Q (low p-value) indicates heterogeneity among the HRQL scores and a variation that is beyond chance [16]. The I2-statistic describes the percentage of variation across studies that is due to heterogeneity rather than chance, with an I2 value of 25% or lower is associated with low heterogeneity, 50% indicating substantial heterogeneity, and 75% or higher indicating high heterogeneity [17]. In case of substantial or high heterogeneity, pooled results should not be calculated, or at the very least, should be interpreted with caution.

Results

Literature search

The extensive search strategy identified 2,286 unique titles of potentially relevant articles (Fig. 1). Screening of the titles and abstracts resulted in a selection of 383 articles that appeared to meet all selection criteria. After screening and selection of the full text papers, we retrieved 31 studies described in 41 publications. The main reasons for exclusion were not measuring pre-injury health status, not reporting on injuries or only reporting part of the outcomes on HRQL.

Fig. 1
figure 1

Study selection

Study characteristics

Of the 31 studies included in our systematic review, most were conducted in the US (n = 8) [1825], Australia (n = 5) [6, 2629], and Canada (n = 5) [3034] (Table 1). Eight studies measured the pre-injury HRQL of patients with a hip fracture [24, 30, 3439], followed in frequency by extremity injury (n = 6) [19, 23, 32, 37, 40, 41], general injury (n = 5) [8, 29, 31, 42, 43] and traumatic brain injury (TBI, n = 4) [22, 25, 27, 44]. Sample sizes of the studies varied widely, ranging between 34 [33] and 2,842 [8] participants, with most studies having sample sizes between 100 and 600 (n = 17). The majority of the participants were males (>50% men in 20 out of the 31 studies). The nine studies that included more women than men [18, 23, 30, 34, 36, 38, 39, 45, 46] often focused on hip fractures (n = 5) [30, 34, 36, 38, 39], or reported on the outcomes after a motor vehicle crash of a longitudinal (annual) survey (n = 2) [18, 46]. The mean age of the participants in the included studies ranged between 10 [19] and 87 [30], with an average of 30 to 54 in half (n = 16) of the studies and 75+ in six of the 31 included studies. Four studies measured the pre-injury HRQL for children and adolescents [19, 22, 25, 31], of which the author names are indicated in bold in Table 1.

Table 1 Study characteristics

Methodological quality

Over half (n = 19) of the 31 articles included in our review reported on attrition. Most studies faced several problems in the participation of eligible patients, as patients refused to participate (n = 15), could not be contacted (n = 6), did not complete the HRQL assessment (n = 6), had died (n = 5), or were not able to respond to the questionnaires (e.g., due to the consequences of the trauma, n = 3). Overall, response rates ranged from 60 to 98% in 17 of the 22 studies that reported on response rates.

Limited variation existed in the selection of samples between the studies. Most patients were recruited during or after a treatment in a (pediatric) hospital (n = 21), while others were selected from a specialized burn center (n = 2) [20, 28], sports center (n = 1) [23], or nursing home facility (n = 1) [30].

In four out of the 31 studies, the measurement of pre-injury HRQL was one of the primary aims [8, 18, 46], while in all other studies pre-injury HRQL scores were used to assess the change in HRQL after the injury or to validate HRQL instruments.

Methods to measure pre-injury HRQL

The 36-item Short-Form (SF-36, n = 14) [21, 24, 2629, 3234, 40, 41, 4547] was the most frequently used instrument to assess the pre-injury HRQL of injury patients, followed by the EuroQol-5 Dimension Questionnaire (EQ-5D, n = 9) [8, 30, 31, 3539, 42], and the SF-12 (n = 4) [6, 18, 23, 43] (Table 1). The remaining studies used the Pediatric Quality of Life Inventory (PedsQL, n = 3) [19, 22, 25], or a combination of the EQ-5D and SF-36 (n = 1) [44]. The majority of the studies assessed the participants’ pre-injury HRQL by using a questionnaire (n = 16) [18, 20, 2224, 27, 28, 3133, 35, 37, 41, 4446] or a face-to-face interview (n = 11) [8, 21, 25, 29, 30, 34, 36, 3840]. At follow-up, most studies used questionnaires (n = 17) [18, 23, 24, 27, 28, 3133, 35, 37, 3941, 4447] followed by the use of telephone interviews (n = 10) [19, 21, 22, 25, 26, 30, 34, 36, 42, 43].

All but two studies in this review retrospectively assessed the pre-injury HRQL of patients, by asking them to recall their HRQL before the injury occurred. Only two studies provided prospectively collected pre-injury health status of participants [18, 46] (articles in bold and italics in Table 1): the Medical Expenditure Panel Survey (MEPS) [18] and the Seguimiento Universidad de Navarra (SUN) [46] cohort. These studies used data from longitudinal cohort studies in which participants who were initially non-injured were followed for several years, by means of questionnaires comprising the SF-36 [46] or SF-12 [18]. In addition, only one of the included studies measured the recalled pre-injury health status of trauma patients and not their post-injury HRQL [6], while all other studies measured both pre- and post-injury HRQL.

Pre-injury scores were often reported as assessed “soon after” injury or admission (n = 12), in-hospital or “soon after” discharge (n = 5) [21, 29, 34, 36], at inclusion or initial contact/visit (n = 5) [23, 25, 32, 37, 45], within six months after ED discharge (n = 2) [8, 43], or years after injury (n = 2) [42, 44]. The focus of the questionnaire and/or interviews (i.e., a specified period prior to the injury) was often not specifically defined (e.g., “before the injury,” n = 15) or not reported (n = 2) [28, 38]. The studies that specified the period of their pre-injury assessment used a day before injury (n = 1) [31], “just” before injury (n = 1) [30], a week before injury (n = 4) [25, 39, 45], the previous week (n = 2) [23, 29], or the month or four weeks before injury (n = 3) [20, 24, 26].

Most studies (n = 16) made a comparison of pre-injury HRQL between injury patients or with controls (e.g., TBI vs. no TBI) [18, 19, 25, 27, 37, 44, 46], between subgroups (e.g., by age, gender, ethnicity) [6, 22, 29, 36, 39], or between survival or recovery status (e.g., survived vs. dead, recovered vs. not recovered) [8, 24, 30, 39]. In addition, twelve studies compared the participants’ pre-injury health status with general population norms [8, 20, 21, 23, 28, 29, 33, 39, 40, 43, 45].

Comparison of pre-injury HRQL between injury patients and with population norms

Within-study comparisons between retrospectively collected pre-injury HRQL and general population norms indicated that self-reported pre-injury HRQL scores were consistently higher than population norm scores (n = 3) [8, 29, 48]. Three studies found that scores were higher in either the physical domains (n = 3) [20, 21, 28] or mental domains (n = 1) [20] or in certain age or sex groups (n = 1) [6]. Five studies found no differences between the recalled pre-injury HRQL and population norms (n = 5) [33, 39, 40, 43, 45] (Table 2).

Table 2 Pre- and post-injury HRQL

The self-reported pre-injury HRQL scores also exceeded the calculated age- and gender-adjusted population norm scores on the EQ-5D [8, 3739, 42, 44] (Fig. 2), as well as the physical and mental domains of the SF-36 and SF-12 (Fig. 3). Exceptions were injury types of higher severity, including elderly hip fracture patients (aged 80+ years) [30, 35, 36], or patients with a motor vehicle injury [18, 46], vertebral fracture [33], or TBI [27].

Fig. 2
figure 2

Pre-injury EQ-5D scores by injury type and in comparison to population norm scores. 1Adjusted by the age and sex distribution in the study population, based on the weighted health state index by age and sex [12]. 2Final post-injury measurement: at discharge [35], 1 year post-injury [8, 30, 3639], 2 years post-injury, or 2–7 years post-injury [42]

Fig. 3
figure 3

Pre-injury SF-36 and SF-12 scores by injury type and in comparison to population norm scores. Studies in bold and italics prospectively measured pre-injury HRQL. 1 Adjusted by the age and sex distribution in the study population, based on the weighted health state index by age and sex [13, 14]. 2 Final post-injury measurement: at 3 [27], 6 [43] or maximal 9 months post-injury [18], 1 year post-injury [26, 28, 29, 32, 33, 48], 2 years post-injury [44], or 48 years post-injury [46]. Heterogeneity: PCS Chi2 = 12.48, df = 11, (p = 0.33), I2 = 12%; MCS Chi2 = 11.88, df = 11, (p = 0.37), I2 = 7%. MVC: injury due to motor vehicle crash; Ortho: orthopedic injury; TBI: traumatic brain injury

Within-study comparisons of pre-injury HRQL between injury patients or with controls showed that patients who were injured due to a motor vehicle injury or who sustained a TBI had significantly lower mental health at baseline [18, 27, 44, 46] and lower scores across all HRQL domains [46] compared to those without a motor-vehicle injury or TBI (Table 2). Higher pre-injury HRQL was found in those who survived than those who eventually died during follow-up (significant differences found on the SF-36 PF, RP and GH [24], no significant differences found between EQ-5D scores [30]) and in those recovered than those not recovered at follow-up (not significant) [8].

Pre-injury HRQL scores per HRQL instrument and injury type

There was a large variation in the presentation of the pre-injury HRQL of patients (Table 2). Most studies reported the total scale scores on the EQ-5D (n = 10) [8, 30, 31, 3539, 42, 44] or PedsQL (n = 3) [19, 22, 25]. The studies that used the SF-36 or SF-12 often presented the physical (PCS) and mental component scores (MCS) (n = 10) [18, 2729, 32, 33, 43, 44, 48], while some studies provided an oversight of all domain scores without summary scores [20, 21, 24, 34, 41].

Pre-injury HRQL scores varied between patients with a hip fracture, ranging from 0.56 in an operatively managed sample of primarily 80 + −year-old females [36] to 0.80 in a hospitalized sample of women aged 45+ [37]. Highest pre-injury EQ-5D scores were seen in study populations who experience a TBI [44], major trauma [42], unintentional injury [8], or wrist or vertebral fracture [37] (mean EQ-5D 0.94, SD 0.04) while lowest pre-injury EQ-5D scores were reported in hip fracture populations [30, 3539] (mean EQ-5D 0.71, SD 0.10); two-sample t(9) = 5.01, 95% confidence interval (CI) [0.13–0.34], p = 0.001. Overall, pre-injury EQ-5D scores decreased with age, from 0.99 in populations with a mean age of 40 years (SD 21) [44] to 0.56 in those aged 80+ years [30, 35, 36].

Patients with a vertebral injury reported lowest pre-injury PCS (SF-36, PCS 49) [33], while those with orthopedic injury reported highest pre-injury PCS scores (SF-36, PCS 59) [26]. Lowest pre-injury MCS on both the SF-36 (MCS 47) [46] and SF-12 (MCS 49) [18] was reported in the two studies that prospectively assessed the pre-injury HRQL of participants before the occurrence of a motor-vehicle injury. Overall, rather similar pre-injury HRQL scores were reported in all studies, showing low heterogeneity (PCS: I2 = 12%, MCS: I2 = 7%), with generally better pre-injury PCS than MCS (mean 54.6 vs 52.9).

Change between pre- and post-injury HRQL

Most studies used a longitudinal design (n = 23) with multiple follow-up measurements over time (n = 18), often measuring post-injury HRQL at three months, six months and/or 12 months. All studies showed a decrease in post-injury HRQL compared to their pre-injury levels of HRQL (Table 2). Looking at the EQ-5D, only one out of the 12 studies showed full recovery to pre-injury HRQL at one year after the injury [38], while the other studies still reported reduced levels of HRQL post-injury. Looking at the SF-36 and SF-12, injuries showed to have the highest impact on the physical component of HRQL (reduction in PCS with 15 to 30 points from pre-injury to first post-injury assessment) compared to the mental component of HRQL (reduction in MCS with 5 to 9 points) [23, 27, 29, 32]. At the final follow-up measurement, both prospective studies showed almost full recovery to pre-injury HRQL levels on the PCS and full recovery on the MCS [18, 46], while only one retrospective study showed such recovery on the PCS [48] or MCS [28].

Discussion

This systematic review summarized the methods that were used to assess pre-injury health status and to estimate the change from pre- to post-injury HRQL. All but two of the 31 studies in our review used retrospective assessment (recall) to assess pre-injury HRQL. The studies most often applied the SF-36, followed by the EQ-5D or SF-12, by means of questionnaires or face-to-face interviews. Recalled pre-injury HRQL scores consistently exceeded general population norms, except in a limited number of studies on injury types of higher severity (e.g., traumatic brain injury and hip fractures). All studies reported reduced post-injury HRQL compared to pre-injury HRQL. Both prospective studies reported that patients had recovered to their pre-injury levels of physical and mental health, while in all but one retrospective study patients had not returned to their reported pre-injury levels of HRQL, even years after the injury.

Prospective assessment is the preferred method to determine pre-injury HRQL as it is not subject to bias that may occur due to experiencing an injury. In our review, only two out of the 31 studies used prospective assessment of pre-injury HRQL. These studies used longitudinal data from the Medical Expenditure Panel Survey (MEPS) among the US general population [18] and the Seguimiento Universidad de Navarra (SUN) cohort comprising university graduates in Navarra, Spain [46]. Both prospective studies reported lowest pre-injury mental health on the SF-36 (MCS 47) [46] as well as SF-12 (MCS 49) [18] of all studies in our review, which otherwise all used retrospective assessment. These prospective studies indicate that the retrospective assessment and population norm approach are highly likely to be biased.

Our review shows that the retrospectively assessed pre-injury HRQL systematically differed from the age- and gender-adjusted norms we calculated based on population data on the EQ-5D, SF-36, and SF-12. Despite the use of different HRQL instruments, recalled pre-injury HRQL scores in our review consistently exceeded these adjusted population norms. An exception to this were samples including patients with a hip fracture [30, 35, 36, 39], motor vehicle injury [18, 46], vertebral fracture [33] or TBI [27], that reported poorer pre-injury HRQL than our calculated adjusted norms. These injury patients are likely to be less healthy than their counterparts [18, 27, 44, 46], in terms of socioeconomic status [18], comorbidity [18, 49], or frailty and older age [12, 49, 50].

The difference between retrospectively assessed pre-injury HRQL and population norm scores might be caused by several reasons.

Recall bias may have influenced the outcomes of the retrospective assessment, as patients may have remembered their pre-injury HRQL differently than it actually was [2, 51, 52]. Patients may, for example, have overestimated their health status before the injury, resulting in higher recalled pre-injury HRQL than seen in the general population.

Response shift might have occurred, as patients’ perception of HRQL may have changed due to the injury and a change in health [4]. After having had experience with poor HRQL, patients may have inflated the rating of their health status before the injury [53].

Nevertheless, some researchers argue for the use of retrospective assessment of pre-injury HRQL, as this method applies one internal standard of HRQL values (reference point) in the assessment of both pre-injury HRQL and post-injury HRQL [4, 53]. According to them, such a reference point is essential for the interpretation of the change from pre- to post-injury HRQL, since patients may have changed their judgement of HRQL due to new insights since the injury (e.g., although a patient has a serious injury, he/she has seen others who are far worse off), or patients have become used to their new health state. However, both recall bias and response shift might result in an overestimation of the pre-injury HRQL by patients. This is underpinned by our finding that, even years after the injury, in all but one retrospective study patients had not returned to their reported levels of pre-injury PCS and MCS, while recovery to pre-injury HRQL levels was seen in both prospective studies.

Moreover, selection bias may have threatened the validity of the findings from the studies included in our review, as the study populations were often not randomly selected from the injury population for which the findings are reported [54]. For example, studies had excluded patients with pre-existing morbidities (e.g., physical illness, cognitive impairment), as it was anticipated that these patients would be difficult to follow up. Exclusion of patients with impairments before the injury may have increased the overall pre-injury HRQL scores of these study samples, as healthier participants were recruited.

In contrast, attrition bias may have decreased the overall pre-injury HRQL scores measured in the studies, as a higher proportion of the non-participants were less educated [26], cognitively impaired [38], victim of intentional injury [6], shorter hospitalized [21] and had lower injury severity [28, 29, 44], less pain [34], better mental health [34]. These factors are all expected to be associated with better HRQL and incorporation of these patients may have resulted in higher pre-injury HRQL scores. Additionally, pre-injury HRQL levels may have increased after loss of follow up, resulting in higher pre-injury HRQL in the final study sample with complete response compared to the eligible study sample [32].

Finally, retrospectively assessed pre-injury HRQL scores may differ from the population norms as injury populations may differ from the general population. The findings of the retrospective assessments (recall) in our review suggest that injured populations are generally healthier than the general population. Previous studies reported that, as injured populations might be healthier, they are more likely to participate in activities, exposing them to a higher risk of injuries [6]. However, the comparisons of injury patients with matched controls in our review showed injury patients to be less healthy than their counterparts, as they reported significantly lower pre-injury mental health than controls [18, 27, 44, 46] and lower scores across all HRQL domains [46]. Previous research showed that injury patients had a higher occurrence of comorbidity, higher admission rates to the hospital, higher health service utilization, and a lower socioeconomic status prior to their injury in comparison to uninjured people [5, 18]. It is argued that the general population has not been exposed to a similar injury experience as the injury population, which emphasizes the use of retrospective assessments over the application of general population norms to estimate the impact of injury on HRQL [7].

Strengths and limitations

Our review included studies on the pre-injury HRQL from children, adolescents, and adult patients, with various injury types, using a range of HRQL instruments. Moreover, this review compared the reported pre-injury HRQL scores with general population norms, calculated for each study based on the reported mean age and gender distribution of the study sample, to identify bias that may occur from the different methods to assess pre-injury HRQL.

There are limitations to this review that need to be addressed. First, there was no restriction in the methods of patient selection used in the studies. Therefore, the studies in this review included samples retrieved from a variety of injury settings (e.g., hospital or outpatient programs). Their conclusion may not be applicable to injury patients from other injury settings. However, most studies selected their patients during or after treatment in a (pediatric) hospital or specialized treatment center, which may enhance the generalizability of their results to patient populations with similar case mix.

Second, the review included studies with patient samples from a broad range of injury types and injury severity levels, which may have complicated the comparability of the results between studies. Nonetheless, this way we were able to provide a full oversight of the pre-injury health status of injury patients and the differences in pre-injury HRQL between injury types.

In addition, there are limitations to the studies included in our review. First, more than half of the included studies had difficulties in recruiting research participants, as patients often could not be contacted, had died, refused to participate, or did/could not complete questionnaires. The studies often reported limited generalizability of their results due to differences between the eligible patients and study participants, loss to follow-up, their limited number of subjects, and recruitment of participants from a single center.

In some studies pre-injury HRQL was assessed after a long period of time since the injury, for example several months up to years after the injury [8, 42, 44]. This longer time frame may have increased the recalled pre-injury HRQL scores [31], as these studies also reported the highest pre-injury HRQL scores on the EQ-5D (0.94–0.99) compared to the studies that used shorter time frames. However, these three studies assessed the HRQL of a relatively young injury population. Moreover, no differences were found between the time frame and pre-injury HRQL in studies that used the SF-36 or SF-12.

Finally, unfortunately not all studies reported the HRQL scores in the text or tables (e.g., only in graphs). After contacting the authors, in three publications HRQL scores had to be manually obtained from the graphs presented in the article [36, 40, 45]. This may have resulted in some small differences in the levels of pre- and/or post-injury HRQL.

Recommendations for future research

Our review clearly showed that recalled pre-injury HRQL systematically exceeded population norms. These differences in pre-injury HRQL may generate different estimates of the change in HRQL from pre- to post-injury due to an injury.

Researchers should use prospectively derived pre-injury HRQL scores wherever possible to estimate the impact of injury on HRQL. If it is not feasible to prospectively assess the pre-injury health status of trauma patients, researchers should be aware of the bias that may arise when pre-injury HRQL is assessed retrospectively or when population norms are applied. Overall, more research is needed to examine the effect of recall bias and response shift on the reported levels of pre-injury HRQL among trauma patients, in which different methods to assess pre-injury HRQL are compared and within-study comparisons between reported pre-injury HRQL and population norms are made.

The results of our review imply that there are a number of methodological advances regarding pre-injury HRQL interpretation left. Researchers should be aware of the different purposes the information on pre-injury HRQL of patients may have. For instance, pre-injury HRQL may be seen as a baseline health status to which patients are expected to return after the injury. On the other hand, pre-injury HRQL may be used to measure total loss in health, or may be used to offer insight into inter-patient differences in recovery after an injury.

In general, when assessing pre-injury HRQL, researchers should carefully consider and specify the timing of the assessment of pre-injury HRQL and the period of the pre-injury assessment. The time period shows to be one of the essential factors influencing patient recall, as recall bias is generally worse when asking for a recall over longer periods [55]. A short time frame within the injury and retrospective assessment of pre-injury HRQL may increase recall and may increase the correlation between pre- and post-injury measures [31]. This implies that pre-injury HRQL should be assessed as soon as possible after the injury, preferably within the first week after the injury [56]. Whether or not the measurement of pre-injury HRQL is the primary purpose of studies, publications on the measurement of HRQL should include information on the applied methods to measure HRQL.

Levels of pre-injury HRQL also may have been influenced by the use of telephone interviews. In our review, the highest or one of the highest pre-injury HRQL on the EQ-5D [42], SF-36 (PCS and MCS) [26], or SF-12 [43] were reported by studies that had conducted telephone interviews to assess the pre-injury levels of HRQL. Previous research indicated that telephone-administered questionnaires provide higher HRQL scores than self-administered questionnaires [5759]. Preferably, the same method should be used for the assessment of both pre-injury and post-injury HRQL throughout the study, at all post-injury HRQL measurements and among all individuals.

Researchers should choose a validated HRQL instrument that has shown good performance in the type of injury under study, and that is sensitive to changes in HRQL and differentiate well between health states. In order to assess the change from pre- to post-injury HRQL, the same HRQL instrument should be applied throughout the study. Preferably, a HRQL instrument should be chosen for which national age- and gender-adjusted population norms are available. In order to enable comparison of the impact of injuries on HRQL between studies, injury types and other diseases, it is recommended to report the pre- and post-injury HRQL scores for specific age and sex groups, which correspond to the age and sex distribution of the norm groups for the applied instrument.

Finally, to examine the change in HRQL due to the injury, a longitudinal design is recommended with multiple follow-up measurements over time (e.g., at 1–3 months, 3–6 months, and 6–24 months post-injury) [56].

Conclusions

So far, primarily retrospective research has been conducted to assess pre-injury HRQL. This research shows consistently higher pre-injury HRQL scores than population norms and a recovery that lags behind that of prospective assessments, implying a systematic overestimation of the change in HRQL from pre- to post-injury due to an injury. More prospective research is necessary to examine the effect of recall bias and response shift. Researchers should be aware of the bias that may arise when pre-injury HRQL is assessed retrospectively or when population norms are applied, and should use prospectively derived HRQL scores wherever possible to estimate the impact of injury on HRQL.