Introduction

Functional Capacity Evaluations (FCEs) are sets of functional tests to measure the ability to perform work-related activities. FCEs are used in occupational rehabilitation, return to work determinations, disability determinations and medico-legal matters. Hart et al. [1] have suggested guidelines and demands for clinical use of FCE with regards to safety, reliability, validity, practicality and utility. Validity and reliability of FCE have been addressed, [2] but safety has scarcely been addressed so far. Five studies reported about the safety of FCE [37]. One study suggested that safety can be monitored in three main areas: physiological (heart rate and blood pressure), biomechanical (muscle fatigue or weakness) and psychophysical (pain or fear of re-injury) [3]. The outcome of this study was that the FCE could be administered safely when these safety issues are carefully taken into account and if the recommended guidelines, as provided by the US National Institute for Occupational Safety and Health (NIOSH) [8] are applied to minimize the risk of further or other injury [3]. The California Functional Capacity Protocol (Cal FCP) was found to be a safe protocol because no new injuries or exacerbations of current impairments were reported in a study sample of 64 patients suffering from soft tissue musculoskeletal injuries [4]. A study, in which the development of the EPIC Lift Capacity (ELC) test was described, reported this test to be safe based on 687 tests in healthy and disabled subjects because no incidents were reported. Several healthy subjects however reported next-day symptoms which were identified as soreness [5]. Others concluded that FCE appeared to be safe in patients with Chronic Low Back Pain (CLBP) because no injury reports had been filed and the pain decreased to pre FCE levels [6]. The study lacked normative data to compare the results to. It was suggested that further research should focus on operational definitions for injury, in order to test the safety of FCE. In a study in which FCE lifting results were compared with the NIOSH recommended safe weight lifting (RWL), it was observed that the RWL of the NIOSH and the FCE lifting results differed substantially [7]. It is unclear which outcome could be used as a RWL of safe lifting. Additional research into safety and operational definitions of injury are needed to test these contrary findings.

Safety in FCE is a key issue because kinesiophysical FCEs demand a patients’ maximum physical effort [9]. A normal physical response during maximum physical effort includes an increase in heart rate, systolic blood pressure, body temperature, sweat secretion and breath frequency [10]. Besides this, delayed onset muscles soreness (DOMS) after intensive and uncommon exercising is a normal reaction of the musculoskeletal system [11]. It is currently not known how healthy workers respond to an FCE and therefore unknown what should be considered to be a normal pain response. Consequently, it is difficult to clinically interpret the pain increase in patients following an FCE. If the pain response in healthy subjects is known, a comparison can be made between the pain response of healthy subjects and the pain response of patients.

The objectives of this study were to quantify the location, intensity, duration and nature of the pain response in healthy subjects following a 12-item FCE. The second goal of this study was to explore differences and similarities between the data from this study and data from patient reports used in a previous study [6]. In the current study, an FCE was considered safe when the FCE does not lead to injuries and when the pain response is considered to be normal. According to the physiology literature, a pain response was considered normal when symptoms increased within the first 24 h following FCE, peaked between 24 and 72 h and subsided and disappeared within 5–7 days after the FCE [11]. Any response not following this definition was interpreted as an abnormal response.

Methods

Subjects

A total of 232 healthy adults working in a broad range of jobs voluntarily participated in this study. Subjects were recruited via local press and personal networks. Subjects were included after providing informed consent and signing a statement of good health, when meeting the criteria of the Physical Activity Readiness-Questionnaire (PAR-Q) [12, 13], and when they were between 20 and 60 years of age and worked at least 20 h per week for the last year. Excluded were subjects who had been absent from work for more than 2 weeks because of dysfunction of the musculoskeletal system during the year prior to FCE, or subjects whose blood pressure in rest exceeded 159 mmHg (systolic) or 100 mmHg (diastolic) [14].

Procedures

Prior to the FCE, subjects filled in a questionnaire to obtain demographic information, a Pain Response Questionnaire and the PAR-Q. Subjects performed a 2 h, 12-item FCE (see Table 1). After an introduction to general FCE procedures, subjects were briefly instructed how to perform each individual test. Each test was first demonstrated by the evaluator. Subjects were allowed to start the next test when the heart rate (HR) was below 70% of the age related maximum HR (220-age). Subjects received instructions on how to use the Borg CR-10 scale which was used for measurement of perceived exertion after each test [15, 16]. Subjects were individually evaluated by 1 of 15 physiotherapy students who had completed a 2-day FCE-training. This study was approved by the Medical Ethical Committee of the University Medical Center Groningen, the Netherlands.

Table 1 Content of the FCE

Endpoints

Tests could be terminated for three endpoints [3]: cardiac, biomechanical and subject endpoints. Cardiac endpoint was reached when HR was above 85% of age related maximum. HR was measured with a heart rate monitor. Biomechanical endpoints were: loss of solid standing basis during lifting tasks or loss of control of the load. Biomechanical endpoints were determined by the evaluators. Subject endpoints were reached when subjects stopped the test. Subjects were instructed to stop at any point if they wished to do so.

Pain Response Questionnaire

A Pain Response Questionnaire (PRQ; Appendix 1) was developed to obtain information about the pain response prior to FCE and 7 days following FCE, medication use following the FCE and to control for unusual or heavy physical activity in days following the FCE. The intensity of the pain response was measured by an 11-point numeric rating scale (NRS) for pain ranging from ‘0’ (no pain) to ‘10’ (worst pain imaginable) for 17 body parts separately [20]. Subjects were asked whether their pain score was reducible to muscle soreness, of a different origin, a combination of these, or whether the origin of pain was considered unknown. The PRQ was filled in by the subjects just prior to the FCE, directly following the FCE and was taken home to fill in for at least on three consecutive days after the FCE to a maximum of 7 days. Subjects were asked to fill in the PRQ on the days following the FCE after 13.00 h. The PRQ could be returned in a reply-paid envelope if pain was no longer reported or when the maximum of 7 days had been reached. The PRQ was pilot tested in 14 healthy subjects [21] and was found suitable for testing. None of the 14 subjects reported a response longer than 7 days in this pilot study.

Data Analyses

Descriptive statistics were used to describe the subject characteristics and pain response. Box plots were made because the data of pain intensity was not normally distributed. To be able to compare data of healthy subjects with data of patients with chronic low back pain from a previous study [6], independent samples Mann–Whitney and t-tests were performed. Results were considered statistically significant when P < 0.05. A Bonferroni correction was applied to reduce type 1 error (P = α/n = 0.006 for significance).

Results

A total of 35 subjects of the original group of 232 volunteers were excluded from analyses because they did not meet the inclusion criteria (n = 20), did not return the PRQ (n = 8) or because they resigned to participate after all (n = 7). Included were 197 subjects (102 males and 95 females), whose data were used for analyses. Subject characteristics are presented in Table 2.

Table 2 Subject characteristics (n = 197; 102 men, 95 women)

Pain Response

A total of 162 subjects (82%) reported a pain response following the FCE. 53 subjects reported pain prior to the FCE and 57 subjects reported pain directly after the FCE. Descriptive statistics of the origin and intensity of the pain of the main body parts are presented in Table 3. Because most subjects reported their maximum pain intensity on the first day following the FCE, Table 3 reflects the status 24 h following the FCE. Most often, a pain response was reported in the upper legs (51%), the lower back (38%) and the upper arm (37%) and shoulders (36%). The location of the pain response of the 17 main body regions is presented in Fig. 1. The intensity of the pain over time from pre FCE level to the next 7 days following the FCE is presented in Fig. 2. Figure 2 demonstrates that the highest pain response occurred within 24 h following the FCE with a median pain intensity of 3 on a 0–10 NRS. The pain had returned to pre-FCE levels 3 days post FCE. A total of 35 subjects (18%) reported a pain intensity over 2.5 × the interquartile range (IQR) above the median (Fig. 2) which was not expected in a population with a normal distribution. Additional analyses were performed to explore whether different individual health related characteristics could be identified to explain a high pain response. Results are presented in Table 4. A total of 35 subjects reported a value over 2.5 x IQR above the median on pain intensity. This group scores lower on vitality, general health perception and social functioning. A pain response lasting longer than 7 days was reported by 2 subjects (1%). One subject reported low back pain which lasted for 22 days. After a medical consultation of an independent physician by telephone it was concluded that these symptoms were non-specific of origin. After 22 days the person involved had fully recovered. The other subject was diagnosed with a ‘trigger-finger’. These complaints lasted for 21 days until full recovery. All other values over 2.5 × IQR above the median had recovered to pre-FCE level within 7 days following FCE. Four subjects reported to have used non-steroidal non-prescriptive pain medication following the FCE. One subject used a homeopathic ointment. All of these subjects had stopped their medication use 3 days following FCE. One subject has used a massage as a means of coping with post FCE pain response.

Table 3 Description of intensity, location and origin of pain for those subjects who reported pain 24 h following FCE (n = 162)
Fig. 1
figure 1

Location of the pain response expressed in percentage of subjects

Fig. 2
figure 2

Boxplots of the median concerning the reported pain response per day. Bold horizontal lines: medians. Gray boxes: interquartile range (+ and −25% of median). Vertical tick marks: 1.5 × interquartile range. Stars: value over 2.5 × IQR above the median. Numbers: number of subjects

Table 4 Descriptives of RAND-36 and t-tests of differences on the RAND-36 between subgroups 1 and 2

Comparison with Patients with CLBP

Table 5 lists the descriptives of former research in patients with CLBP [6] being compared to the data of healthy subjects in the current study. To enable comparison of both datasets on pain intensity, scores from the PRQ were transformed to a 3-point scale (pain decrease, no difference, pain increase). No significant differences in pain increase were found between healthy subjects and patients with CLBP (Table 5). Independent t-tests show that differences in the duration of the pain response between patients with CLBP and healthy subjects were not significant (Table 5).

Table 5 Independent t-test and Mann–Whitney test of the differences of pain intensity and duration between patients with CLBP (n = 54) [6] and healthy subjects (n = 197)

Discussion

The results of this study show that the pain response in healthy subjects followed a pattern which resembles normal exercise physiology [10]. The group of subjects in this study can be generalized to the Dutch population because personal characteristics such as gender, length, weight and health perception resemble the Dutch population (Tables 2 and 4). The three domains Pain, Physical Functioning and Role limitation (physical) of the RAND-36 however differ clinically from the Dutch population (Table 4). This may be due to the inclusion criteria of this study; only healthy working subjects between 20 and 60 years of age were included, who had no dysfunction of the musculoskeletal system at the time of FCE. There were 53 subjects who reported pain prior to the FCE while stating that they were without complaints when they signed informed consent. In total there were 35 subjects who reported a value over 2.5 × IQR above the median on pain intensity partly because they reported a pain response prior to the FCE. Post hoc analyses of personal characteristics reveal that 2 sub-groups may have been included in this study, namely a group who rated their pain over 2.5 × IQR above the median, and a group who rated their pain under 2.5 × IQR above the median. Table 4 illustrates that the group who rated their pain over 2.5 × IQR above the median scores lower on vitality, general health perception and social functioning but not on pain and physical functioning. When we take into consideration the point-prevalence of musculoskeletal pain in the Dutch population (53.9%) than, of this population, 70% reports no limitation in daily life resulting from musculoskeletal pain [23]. This means there is a large number of persons in the Dutch population who do experience pain but are not restricted in daily life activities. The data of the current study may possibly have included a part of this group. Thus, with regards to pain status and self reported ADL functioning, the subjects of this study appear similar to the open Dutch population. Consequently, the general pattern and diversity of the pain response as seen in this study should be considered to be a normal response rather than an indication of injury. There were two subjects (1%) in this study who reported an abnormal reaction. However, it remains unknown whether this may be expected in a normal population after intensive exercise. The 1-week incidence of neck and back pain, calculated by The Dutch National Institute for Public Health and the Environment (RIVM), of the Dutch population is 0.15% for men and 0.18% for women [24]. Data concerning the incidence of musculoskeletal complaints after intensive exercise such as an FCE is unavailable but is presumed to be higher.

The second goal of this study was to investigate whether the pain response of healthy subjects is different from the pain response of patients with CLBP. Earlier research of the pain response of patients with CLBP [6] reported a pain increase in 76% of the subjects, which lasted for a mean of 3.4 days after the FCE. In our study we found a symptom increase in 82% of all subjects and a normalization of pain to pre-FCE levels on the third day following the FCE (Fig. 2). Differences between our study and the patient study [6] were non significant for the pain intensity increase as well as for duration of the pain increase (Table 5). However some differences in study design must be taken into account before drawing such a conclusion. The first concerns the content of the FCE. Our study included an additional treadmill ergometer test which may have led to a difference in pain response. The second difference concerns the patient characteristics. There was a difference in male/female distribution in both studies. Post hoc analyses show a significant gender difference of 1.3 on pain intensity measured by the PRQ on the first day following the FCE. Female subjects are slightly “statistically significant” reporting more pain. Other possible characteristics of subjects may have led to differences between the studies. This, however, is beyond the scope of this paper and will be reported separately. Overall it can be said that the pain response of healthy subjects in this study appear similar to patient data [6]. Additionally, former research to the Cal FPC and the ELC test showed similar results and demonstrated safety of these protocols [4, 5]. These protocols however differed substantially from the protocol used in this study. The FPC and the ELC protocols are based on psychophysical principles in which a patient determines his/her own acceptable maximum effort. The protocols used in this study are based on kinesiophysical principles in which the evaluator also determines maximal safe effort. Former research into differences between both the psychophysical and kinesiophysical approaches has found differences between outcomes [25]. It appears, however, that when concerning safety, both the psychophysical and the kinesiophysical evaluations can be administered safely.

A weakness of this study was that no correction was made for pain responses after different exercises besides the FCE or for pain prior to the FCE. This has not been done because causality is not always clear. For example, if a subject reports a pain intensity of 2 on the PRQ on the first day following the FCE and has an intensive exercise afterwards, he or she might report a pain intensity of 5 the day following. In this example it is unclear whether the response is causal to the FCE or to any activity besides the FCE. On the other hand, all responses were reported in this study and the real pain intensity following the FCE could, therefore, be an overestimation. A total of 53 subjects reported pain before the FCE and 57 subjects reported pain directly after the FCE. It is remarkable that on an average, subjects reported a higher level of pain before undergoing the FCE than 4 days following the FCE. A reason for this might be that subjects are focused on the severity of pain they feel before the FCE and/or that they are just reporting about the pain they think is causal to the FCE on the days following the FCE.

The NIOSH guidelines are safety guidelines for recommendation of safe lifting. Former research to the concurrent validity between the WorkWell FCE lifting task and the NIOSH Recommend Weight Lift showed significant differences between both safe lifting recommendations in patients with CLBP [7]. Patients lifted on average 15 kg more on the WorkWell FCE lifting task than on the NIOSH RWL. It was however unknown whether the lifting tasks in the FCE could be administered safely, because data is lacking with regards to normative data of healthy subjects and it was unknown whether the pain responses in patients with CLBP were considered abnormal or normal. The normative data in the current study is therefore additional to the question whether the FCE can be administered safely. This study indicates that an FCE can be administered safely if all safety procedures are followed.

In conclusion, the pain response of most healthy subjects (99%) was interpreted as a normal physiological reaction of the musculoskeletal system after intensive exercise such as an FCE. 1% of the subjects reported an abnormal reaction. For a clinician the data from this study is of concern because it means that a pain response can be expected and considered normal after an FCE. Healthy workers and patients should be informed that a pain response can be expected and that this should in the vast majority of the cases be interpreted as a normal reaction of the musculoskeletal system after intensive exercise.