Skip to main content

A model to differentiate WAD patients and people with abnormal pain behaviour based on biomechanical and self-reported tests


The prevalence of malingering among individuals presenting whiplash-related symptoms is significant and leads to a huge economic loss due to fraudulent injury claims. Various strategies have been proposed to detect malingering and symptoms exaggeration. However, most of them have been not consistently validated and tested to determine their accuracy in detecting feigned whiplash. This study merges two different approaches to detect whiplash malingering (the mechanical approach and the qualitative analysis of the symptomatology) to obtain a malingering detection model based on a wider range of indices, both biomechanical and self-reported. A sample of 46 malingerers and 59 genuine clinical patients was tested using a kinematic test and a self-report questionnaire asking about the presence of rare and impossible symptoms. The collected measures were used to train and validate a linear discriminant analysis (LDA) classification model. Results showed that malingerers were discriminated from genuine clinical patients based on a greater proportion of rare symptoms vs. possible self-reported symptoms and slower but more repeatable neck motions in the biomechanical test. The fivefold cross-validation of the LDA model yielded an area under the curve (AUC) of 0.84, with a sensitivity of 77.8% and a specificity of 84.7%.


Whiplash-related injuries are estimated to account for approximately 80% of all traffic injuries [1], representing a critical health, social, and economic issue [2]. For instance, whiplash is in Germany is the most common consequence of road traffic accidents, with approximately 20,000 cases yearly and costing insurance companies more than 500 million euro [3]. Although the numbers vary significantly across countries, whiplash-related injuries in Europe were estimated to cost annually up to 10 billion euros [4], with an increment in recent years [1].

Whiplash is characterized by a high variability of symptoms, commonly referred to as whiplash associated disorders (WAD) [5]. They may encompass diffuse neck pain, neck stiffness, back pain and back stiffness, headaches, fatigue, vision disorders, and dizziness. Patients may also report anxiety, depressive symptoms, difficulties in concentration, and memory deficit [6]. Although it is recommended to conduct an in-depth evaluation, collecting a range of circumstantial, clinical, and instrumental data [7], the diagnosis of whiplash is still largely based on self-reported symptoms [8], as the current medical diagnostic techniques are unable to accurately detect soft tissue injuries, which are predominant in minor WAD [9]. This makes whiplash a clinical condition hard to diagnose and, at the same time, easy to simulate. Moreover, the lack of objective evidence of symptoms, together with the prospect of obtaining compensation, may encourage policyholders to feign or exaggerate their symptomatology [1]. In support of this, Cassidy and colleagues (2000) showed that the elimination of financial compensations for pain and suffering was associated with a drop in the number of insurance claims, as well as with a faster recovery [10]. Similarly, in countries where compensation for late whiplash-related injuries is not formally provided (e.g., Lithuania, Greece), patients rarely develop chronic symptoms [3, 10].

Overall, the literature indicates that the prevalence of malingering among individuals presenting late whiplash-related symptoms is significant [11], especially in litigation cases, where a proportion of up to 60% is reached [12]. Therefore, the economic loss linked to fraudulent injury claims is huge, making the detection of exaggerated or feigned whiplash-related symptoms a priority. Consequently, valid and accurate tools that allow practitioners in a medicolegal context to identify malingered WAD are needed.

WAD malingering detection

One traditional approach to detecting malingering is the qualitative analysis of the symptomatology, applying clinical and epidemiological rules to forensic practice. For example, the discrepancy method consists of qualitatively analysing the reported symptoms considering their incidence in the clinical population affected by the claimed disorder. In short, the plausibility of the reported symptoms profile is evaluated by comparing it with the typical clinical profile [13]. It has been shown that malingerers tend to report a larger number of symptoms compared with the clinical population (indiscriminate symptom endorsement), including rare and impossible symptoms, that is, symptoms that are infrequent or unlikely to be seen among genuine clinical patients [14, 15]. Moreover, malingerers are more prone to amplify the severity of the disorder, describing their symptoms as “extreme” or “unbearable”. In fact, there is a common misconception that reporting more symptoms or overreporting their severity increases the probability of being identified as affected by a genuine syndrome. Moreover, as malingerers do not have in mind a clear representation of the pattern of symptoms typically associated with a specific disease, they can show a symptom, or a pattern of symptoms, even if it is not plausible for the disease they are trying to feign [16, 17].

This evidence has contributed to building tools for the evaluation of malingering, especially in the psychiatric field. For instance, the Structured Inventory of Malingered Symptomatology (SIMS), a self-report questionnaire based on asking about rare and impossible symptoms, was conceived to detect malingering of psychiatric disorders and cognitive impairments [18]. Concerning the simulation of whiplash, tools that check the presence of non-organic signs, namely behavioural signs that are not compatible with the organic injury, have been proposed. Sartori et al. developed the Whiplash Syndrome Questionnaire [19], a self-report measure that includes eight scenarios, each with ten daily life actions (e.g., driving in traffic for 40 min) that responders are asked to rank according to the ease with which they can be performed. The rationale is that only patients with an authentic WAD can recognize easy versus non-easy daily life actions to perform. In a small validation sample, the questionnaire was shown to correctly identify 94% of the simulators and 84% of the exaggerators. Sobel et al. [20] proposed a tool to identify abnormal illness behaviours, which consists of the clinical observation of eight non-organic cervical signs (superficial and nonanatomic tenderness; head/shoulder/trunk rotation; range of motion; sensory loss and motor loss; overreaction). The presence of two or more signs indicates a suspect of simulation. However, the accuracy, sensitivity, and specificity of the use of non-organic signs for WAD malingering detection are not known [21]. Several authors criticized this approach, arguing that these signs are not necessarily indicative of malingering, as they may be a response affected by fear from injury and development of chronic incapacity [22, 23]. On the other hand, some authors support the use of non-organic signs from the Sobel test in clinical practice as a starting point of simulation suspicion from a physical point of view and within a holistic approach to the patient [24, 25].

Other methodologies proposed in the literature to detect malingered WAD are differential spinal blocks implementation, thermographic amytal evaluation, pentothal administration, isometric strength testing [26], posturography technique [27], and mechanical testing [22]. In particular, in the mechanical approach, the kinematic parameters of cervical and neck mobility are used as cues to detect whiplash malingering [28, 29]. The rational is that the evaluation of movements performed multiple times and/or under different circumstances helps reveal inconsistencies between repeated performances or abnormal and improbable patterns of impairment. Similarly, the Fly test records head movements while participants are following a fly, and computes three kinematic parameters (amplitude accuracy, time on target, and jerk index) that differentiate patients with genuine WAD from fakers with an accuracy of 71.8–81.5% [30] identifying abnormal movement patterns in terms of amplitude, time and jerk index.

Finally, completely different strategies derive from the studies of the cognitive mechanisms of deception [31]. The cognitive-based lie detection techniques rely on the evidence that lying is more cognitively demanding than truth telling [32], and this greater cognitive effort is reflected in the time to respond to a stimulus (e.g., a question about whiplash symptoms). Among these, the autobiographical Implicit Association Test (aIAT), which detects liars focusing on response times (RTs) during a classification task, appears particularly promising [33]. Notably, in a preliminary study, the aIAT was successfully applied to detect the malingering of whiplash-related injuries, showing an accuracy of approximately 90% [34]. Other encouraging malingering detection techniques include mouse dynamics [35] and keystroke analysis [36]. Nevertheless, the literature on these techniques is still in its infancy, and further studies are needed to apply and validate them for WAD malingering detection.

Aim of the study

Practitioners in the medicolegal context are still looking for solid criteria to detect WAD malingering and symptoms exaggeration. As reported above, various strategies have been proposed in previous literature. However, most of them have not been consistently validated and tested to determine their accuracy in detecting feigned whiplash. The aim of the present study was to merge two different approaches from among those most commonly used by forensic practitioners to detect WAD malingering – the mechanical approach and the qualitative analysis of the symptomatology – to obtain a malingering detection model based on a wider range of indices, both biomechanical and self-reported. To this end, we tested malingerers and genuine clinical patients using a kinematic test used in the assessment of WAD-related pain [37], and a self-report questionnaire, which was built ad hoc for this study based on rare and impossible whiplash symptoms. Then, the collected measures were used to train and validate classification models, investigating the accuracy, sensitivity, and specificity of the two approaches together in detecting WAD malingerers.



A cohort of subjects was measured between November 2018 and February 2020, in Italy (Unit of Rehabilitation, University-General Hospital of Padova, UNIPD) and Spain (Hospital MAZ). All participants gave informed consent to participate in the study and process the data recorded in the tests. The methodology was approved by the Ethics Committee of UNIPD.

The cohort consisted initially of two groups of people: one group of patients with cervical pain due to a traffic accident with whiplash, classified on the I-to-III scale of the Quebec classification [5], and people who had recovered from previous episode with similar characteristics, without current signs of neck or other musculoskeletal pain. Two examiners assessed the subjects belonging to the patient group, looking for cervical nonorganic signs [20], and classified them into a “control” group of patients (C) and “suspects” of abnormal or exaggerated pain behaviour (S). Since such classification based on experience and nonorganic signs may not be directly related to the patient’s intention, and might be affected by personal bias, a third group of participants was constituted by “fakers”, recovered patients who were asked by the researchers to reproduce the symptomatology of their previous painful episode (F), which could be compared to the S group and previous studies with similar participant profiles [28, 29].


All subjects completed a Neck Pain Symptoms Questionnaire (NPSQ). The NPSQ was conceived ad hoc for this study, according to the previous literature about rare and impossible symptoms strategy [18]. It consisted of 65 “true/false” questions asking for possible (n = 21; e.g., I often suffer of muscle stiffness), rare (n = 14; e.g., I lost the sensibility in both upper limbs), and impossible (n = 30; e.g.; Sometimes I hear a constant sound in my ears) whiplash-related symptoms. The questions were identified by investigators of the Instituto de Biomecánica (IBV) based on the most solid clinical literature on WAD, and then translated into Italian and Spanish (an English version of the questionnaire is reported in Annex A Table 5). The NPSQ was administered through an online form.

Neck motion was analysed as described by De Rosario et al. [37], measuring range of motion (ROM), maximum angular velocity (MAV), phase-area ratio (PAR), and harmonicity (HARM) of flexion–extension (FE), rotation (R), and lateral flexion (LF) movements. Spanish patients were measured using the NedCervical/IBV system, whereas the Italian cohort was measured using the WAAS/IBV system. The systems differed in the instrumentation (optical sensors in NedCervical/IBV vs. inertial sensors in WAAS/IBV), but they implemented the same measurement protocol, and a suitable placement of sensors and postural calibration were considered to ensure that the discrepancies between instruments remained below 3 degrees for ROM, 2% of the range of MAV, and less than 1% of PAR and HARM [37].

Statistical analysis

The agreement of the examiners’ judgements was tested using Fleiss’ kappa [38], and the characteristics of the subjects (sex and age) were compared across sites and groups to verify that the sample was balanced (Chi-squared test of homogeneity for sex, and ANOVA for age). Then, the distributions of the NPSQ scores and the normalized biomechanical parameters were compared across the three patient groups, using an ANOVA and post-hoc comparisons, to investigate what variables might be the most promising discriminators between patients who are not suspect of malingering pain at all (C) and the rest (S or F), or between “suspects” (S) and healthy people deliberately feigning pain (F). Pearson’s correlation coefficients were also calculated between the measured variables to spot potential multicollinearities that should be avoided in the subsequent steps [39].

A minimal set of weakly correlated and potentially discriminant variables was chosen according to those results to conduct a linear discriminant analysis (LDA). The study sample was split into five subsets, which were used to test the goodness of the model to distinguish binarily between non-suspect patients (C) and the other groups, in a fivefold cross-validation.

All analyses were conducted using the R package for statistical computing [40,41,42,43,44].


Description of the participant sample

One hundred and five people participated in the study, classified into three groups, as shown in Table 1. Of the patients, 26% were assigned to the “suspect” group (S), and a similar number of “fakers” (F) were recruited. The opinions of the two judges who classified the patients showed a good level of agreement (Fleiss’ κ = 0.928). Participants’ ages ranged between 22 and 85 years (average 39.2, std. dev 13.5), and there was a majority of female participants (70%), but there were no significant differences in either sex (p > 0.58) or age (p > 0.37) between sites or groups.

Table 1 Distribution of the participants

Selection of NPSQ parameters

Controls reported fewer symptoms of all types (possible, rare, and impossible) than the other two groups did, but that difference was not significant for the impossible symptoms (Table 2), so it was left out of the model. Furthermore, the total NPSQ score was discarded, because it was strongly correlated with possible, rare, and impossible single scores (ρ > 0.77). The NPSQ “possible” and “rare” scores were retained as potentially discriminatory variables, with a moderate correlation between them (ρ = 0.54).

Table 2 Average (and std. dev) score for each subset of symptoms of the NPSQ in the three groups (C, F, and S), and results of the post-hoc comparisons of differences between groups

Selection of biomechanical parameters

All biomechanical parameters were significantly different between controls and the other two groups, which were hardly distinguishable from each other (Table 3), but it was necessary to choose only a minimal set of those parameters, and pick only one movement (FE, R, or LF) for each parameter, because all parameters were strongly correlated between movements (ρ between 0.67 and 0.91). Due to the smaller differences observed in HARM compared with the other parameters, and in all parameters comparing LF and the other movements, they were left out of the model. Furthermore, ROM and MAV were strongly correlated in all movements (ρ > 0.70), so the choice was narrowed down between a pair of parameters (either ROM or MAV plus PAR) for FE or R. The set that minimized the correlations between variables was MAV and PAR in R (ρ = -0.12).

Table 3 Statistics of the post-hoc comparisons of differences between groups for the normalized biomechanical parameters: F(1,99) (p-value)

Linear discriminant analysis

Considering the previous analysis, the LDA model was fitted using the “possible” and “rare” NPSQ scores, plus MAV(R) and PAR(R) as predictors. The coefficients of the linear discriminant functions (LD1 and LD2, cf. Table 4) indicate that the LD1 increased as more possible symptoms but fewer rare symptoms were reported on the NPSQ, and when PAR increased or MAV decreased. The same relationships were obtained for LD2, but with different proportions: possible symptoms and MAV were more relevant for LD1 and less for LD2 than were rare symptoms and PAR.

Table 4 Coefficients of the discriminant functions (LD1, LD2)

Figure 1 shows the distributions of cases in the space of the discriminant functions. The two groups of patients (C and S) were separated, mainly in the direction of LD1. On the other hand there was a large overlap between both groups with F, due to its large dispersion, specially in the axis of LD2.

Fig. 1
figure 1

Distribution of the three groups of participants in the space of the discriminant functions: C represented as empty circles, S as crosses, and F as triangles. The ellipses and their centres represent the confidence regions around the group means, within a distance of ± 1 standard deviation from the group means. (Mahalanobis distances, which have a different scale in each axis considering the variance structure of the data.)

The fivefold cross-validation of the LDA model as a binary classifier between the groups of patients C and S yielded an average area under the curve equal to 0.84 (Fig. 2), which can be considered “excellent discrimination” [39]. At the standard cut-off point (\(\mathrm{Pr}\left(C\right)=0.5\)), sensitivity was 77.8% (people in the S group successfully classified as “suspects”), and specificity was 84.7% (people in the C group successfully classified as “non-suspects”). On the other hand, the model could not label the subjects of the F group better than a random classifier could.

Fig. 2
figure 2

Average ROC curve and standard errors for the fivefold cross-validation

Discussion and conclusion

This study looked for signs in self-reported measurements and biomechanical tests that could be used to differentiate ordinary patients suffering WAD and people with abnormal pain behaviour, who might be feigning or exaggerating their symptoms. Our study sample incorporated a group of suspected malingerers among real patients, which were expected to represent better the kind of malingerers that physicians encounter in daily practice, as well as a group of purposeful fakers, less realistic but whose intent could not be misjudged.

The statistical analysis showed that those signs were a larger number of symptoms reported on the NPSQ — particularly a greater proportion of “rare” symptoms vs. “possible” symptoms — and slower but more repeatable neck motions in the biomechanical test (smaller MAV and greater PAR).

The discriminant model built with those variables presented a large overlap between actual patients who were “suspect” of some response bias and former, recovered patients who were asked to fake the symptoms, although the quantity of rare symptoms reported and the repeatability of neck motion in the tests tended to be greater in the “suspect patients”. Such a model had a good discriminant ability when used for a binary classification between the two different types of patients. At the standard cut-off point, the model showed higher specificity than sensitivity to detect possible simulators (i.e. it is a “conservative” model that failed on the safe side from the patient’s perspective).

These results are comparable with those obtained by Baydal-Bertomeu et al. [28] with a similar biomechanical test, although in other, less challenging conditions. The present study was conducted at two sites, with a greater variety of patients, including not only people voluntarily “faking” their behaviour, but also patients who were suspected of performing abnormal or exaggerated pain behaviors; to strengthen the analysis, the test also included movements in different directions, the motion parameters were normalized, and self-reported questionnaires were added. The influence of MAV and PAR in the model was similar in both studies, as well as the sensitivity/specificity balance, although their values were about 10% smaller in the present study.

Concerning the NPSQ, this is one of the first self-reported measures specifically developed to detect whiplash malingering. Indeed, the only instrument already present in the literature is the Whiplash Syndrome Questionnaire [19], for which an accuracy of 90% in detecting WAD malingerers is reported. Although this tool performs better than the questionnaire we presented in this study (NPSQ), it was cross-validated in a sample of 40 exclusively Italian participants.

Finally, regarding the Sobel test, no studies in the literature report metrics about its accuracy in detecting malingering, according to the fact that this test was conceived to detect non-organic signs and not to classify malingerers.

The overlap between the groups of “suspects” and actual “fakers” in the model parameters indicates that the signs that raised the suspicion of the examiners partly coincided with the patterns exhibited by people who were asked to feign. But contrary to our initial hypothesis, the group of “suspects” was more clearly differentiated from the “normal” patients than from the “fakers”. Because the assignment of patients to the “suspect” group was the result of subjective assessment and agreement by two examiners, there might have been a selection bias that narrowed down the profile of that group, such that only those who exhibited unusual traits of exaggeration were singled out as suspects. If that was the case, the values of sensitivity and specificity of the model would be overestimated and underestimated, respectively. Alternatively, it might be that the unrealistic condition of the group of fakers made that group unusually heterogeneous.

Empirical research on the phenomena of sub-maximal effort, insincerity of effort, or even the simulation of pain implicitly entails the difficulty of selecting patients suspected of this behaviour [45]. More generally, this is an experimental issue common to all research in the field of deception detection [36]. The nature of the problem prevents an objective and reliable investigation. This has led some researchers to adopt a strategy whereby insincerity has been modelled by simulation in normal subjects. In our case, we chose to use the Sobel method, although we are fully aware that non-organic signs from the Sobel test do not have to be directly related to the patient’s intention to simulate pain in order to obtain a secondary benefit. Many groups of researchers have attempted to clarify the concept of simulation and its relationship to secondary gain, which could facilitate the selection of groups in this research field [46,47,48,49]. However, this information does not seem to be widely agreed upon among clinicians, due, in part, to unresolved theoretical issues.

Another aspect of the study that might have an impact on the results was the selection of the classification tool. LDA was chosen because its simple mathematical model facilitated the interpretation of its coefficients and understand how the results of the tests influenced the classification, and also allowed working with moderately sized samples, and the comparison with previous literature. Other machine learning classifiers, as Support Vector Machines, Neural Networs or K-nearest Neighbour (K-NN), might provide better fits, if trained with properly sized samples. This study was limited by the unbalanced sizes of the groups, especially the smaller size of the S group, which limited the confidence of the reported results (a difference of one person in the classification would mean a gain or loss of 5% in sensitivity). This is a consequence of the difficulty of spotting suspects of simulation in a realistic setting.

Data Availability

Data are available upon reasonable request to the authors.

Code availability

Data analysis code is available upon reasonable request to the authors.


  1. Cassidy JD, Leth-Petersen S, Rotger GP (2018) What happens when compensation for whiplash claims is made more generous? J Risk Insur 85:635–662

    Article  Google Scholar 

  2. Previtera, AM (2004) Il colpo di frusta cervicale. Diagnosi, biomeccanica e trattamento. Grafica MA. RO, Copiano (PV)

  3. Noll-Hussong M (2017) Whiplash Syndrome Reloaded: Digital Echoes of Whiplash Syndrome in the European Internet Search Engine Context. JMIR Public Heal Surveill 3:e15.

    Article  Google Scholar 

  4. Janitzek T (2007) Reining in Whiplash: Better Protection for Europe’s Car Occupants. Corporate Authors: European Transport Safety Council

  5. Spitzer WO, Skovron ML, Salmi LR, Cassidy JD, Duranceau J, Suissa S, Zeiss E (1995) Scientific monograph of the Quebec Task Force on Whiplash-Associated Disorders: redefining “whiplash” and its management., Spine (Phila. Pa. 1976). 20 1S-73S.

  6. Binder A (2007) The diagnosis and treatment of nonspecific neck pain and whiplash. Eura Medicophys 43:79

    CAS  PubMed  Google Scholar 

  7. Ferrara SD, Ananian V, Baccino E, Banczerowski P, Bordignon D, Boscolo-Berto R, Domenici R, Quevedo JG, Graw M, Hell W (2016) Whiplash-Associated Disorders. Int J Legal Med 130:13–22

    CAS  Article  Google Scholar 

  8. Hartling L, Brison RJ, Ardern C, Pickett W (1976) Prognostic value of the Quebec classification of whiplash-associated disorders, Spine (Phila. Pa 26(2001):36–41

    Google Scholar 

  9. Broos J, Meijer R (2016) Simulation Method for Whiplash Injury Prediction Using an Active Human Model, in: Proc. IRCOBI Conf.

  10. Cassidy JD, Carroll LJ, Côté P, Lemstra M, Berglund A, Nygren Å (2000) Effect of Eliminating Compensation for Pain and Suffering on the Outcome of Insurance Claims for Whiplash Injury. N Engl J Med 342:1179–1186.

    CAS  Article  PubMed  Google Scholar 

  11. Pearce JMS (1999) A critical appraisal of the chronic whiplash syndrome. J Neurol Neurosurg Psychiatry 66(3):273–276.

  12. Schmand B, Lindeboom J, Schagen S, Heijt R, Koene T, Hamburger HL (1998) Cognitive complaints in patients after whiplash injury: the impact of malingering. J Neurol Neurosurg Psychiatry 64:339–343

    CAS  Article  Google Scholar 

  13. Larrabee GJ (1990) Cautions in the use of neuropsychological evaluation in legal settings. Neuropsychology 4:239–247.

    Article  Google Scholar 

  14. Cornell DG, Hawk GL (1989) Clinical presentation of malingerers diagnosed by experienced forensic psychologists. Law Hum Behav 13:375–383.

    Article  Google Scholar 

  15. Berry DTR, Baer RA, Harris MJ (1991) Detection of malingering on the MMPI: A meta-analysis. Clin Psychol Rev 11:585–598.

    Article  Google Scholar 

  16. Rogers R, Correa AA (2008) Determinations of Malingering: Evolution from Case-Based Methods to Detection Strategies, Psychiatry. Psychol Law 15:213–223.

    Article  Google Scholar 

  17. Rogers R, Shuman DW (2005) Malingering and deception in criminal evaluations, Fundam. In: Rogers R, Shuman DW (eds) Fundamentals of forensic practice: mental health and criminal law, pp 21–55.

  18. Smith GP, Burger GK (1997) Detection of malingering: validation of the Structured Inventory of Malingered Symptomatology (SIMS). J Am Acad Psychiatry Law Online 25:183–189

    CAS  Google Scholar 

  19. Sartori G, Forti S, Birbaumer N, Flor H (2003) A brief and unobtrusive instrument to detect simulation and exaggeration in patients with whiplash syndrome. Neurosci Lett 342:53–56.

  20. Sobel JB, Sollenberger P, Robinson R, Polatin PB, Gatchel RJ (2000) Cervical nonorganic signs: a new clinical tool to assess abnormal illness behavior in neck pain patients: a pilot study. Arch Phys Med Rehabil 81:170–175

    CAS  Article  Google Scholar 

  21. Murphy DR, Hurwitz EL (2011) Application of a diagnosis-based clinical decision guide in patients with neck pain. Chiropr Man Therap 19:19.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Mendelson G, Mendelson D (2004) Malingering pain in the medicolegal context. Clin J Pain 20:423–432

    Article  Google Scholar 

  23. Fishbain DA, Cole B, Cutler RB, Lewis J, Rosomoff HL, Rosomoff RS (2003) A structured evidence-based review on the meaning of nonorganic physical signs: Waddell signs. Pain Med 4:141–181

    Article  Google Scholar 

  24. Capilla Ramírez P, González Ordi H (2009) A Protocol for detection of malingered pain symptomatology in clinical practice: case studies, TRAUMA-SPAIN. 20:255–263

  25. Capilla Ramírez P, González Ordi H (2012) Simulaciónenpatología dolorosa crónica del raquis cervical (cervicalgia/esguince cervical). Rev. Española Med. Leg. 38:76–84.

    Article  Google Scholar 

  26. Vernon H, Tran S, Soave D, Moreton J (2010) Simulated malingering in the testing of cervical muscle isometric strength. J Back Musculoskelet Rehabil 23:117–127

    Article  Google Scholar 

  27. Endo K, Suzuki H, Yamamoto K (1976) Consciously postural sway and cervical vertigo after whiplash injury, Spine (Phila. Pa 33(2008):E539–E542

    Google Scholar 

  28. Baydal-Bertomeu JM, Page ÁF, Belda-Lois JM, Garrido-Jaén D, Prat JM (2011) Neck motion patterns in whiplash-associated disorders: Quantifying variability and spontaneity of movement. Clin Biomech 26:29–34.

    Article  Google Scholar 

  29. Baydal-Bertomeu J-M, García-Mas M-A, Poveda R, Belda J-M, Garrido-Jaén D, Vivas M-J, Vera P, López J (2007) Determination of simulation patterns of cervical pain from kinematical parameters of movement. Challenges Assist Technol AAATE 07(20):429–433

    Google Scholar 

  30. Oddsdottir GL, Kristjansson E, Gislason MK (2015) Sincerity of effort versus feigned movement control of the cervical spine in patients with whiplash-associated disorders and asymptomatic persons: a case–control study. Physiother Theory Pract 31:403–409

    PubMed  Google Scholar 

  31. G. Sartori, A. Zangrossi, G. Orrù, M. Monaro 2017 Detection of Malingering in Psychic Damage Ascertainment, in: P5 Med. Justice, Springer, Cham, pp. 330–341.

  32. Vrij A, Fisher R, Mann S, Leal S (2008) A cognitive load approach to lie detection. J Investig Psychol Offender Profiling 5:39–43

    Article  Google Scholar 

  33. S Agosta G Sartori 2013 The autobiographical IAT: a review Front Psychol 4.

  34. Sartori G, Agosta S, Gnoato F (2007) High accuracy detection of malingered whiplash syndrome. International Whiplash Trauma Congress, Miami, FL

  35. S Zago E Piacquadio M Monaro G Orrù E Sampaolo T Difonzo A Toncini E Heinzl 2019The Detection of Malingered Amnesia: An Approach Involving Multiple Strategies in a Mock CrimeFront Psychiatry 10.

  36. G. Sartori, A. Zangrossi, M. Monaro, Deception Detection With Behavioral Methods, in: J.P. Rosenfeld (Ed.), Detect. Concealed Inf. Decept., Elsevier, 2018: pp. 215–241.

  37. De Rosario H, Vivas MJ, Sinovas MI, Page Á (2018) Relationship between neck motion and self-reported pain in patients with whiplash associated disorders during the acute phase. Musculoskelet Sci Pract 38:23–29.

    Article  PubMed  Google Scholar 

  38. Landis JR, Koch GG (1977) The Measurement of Observer Agreement for Categorical Data. Biometrics 33:159.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  39. Hosmer DW, Lemeshow S (2000) Applied Logistic Regression, John Wiley & Sons Inc, Hoboken, NJ. USA.

    Article  Google Scholar 

  40. H. De Rosario, phia: Post-Hoc Interaction Analysis, (2013). Accessed 17 Sept 2020

  41. R Core Team (2020) R: a language and environment for statistical computing. Accessed 17 Sept 2020

  42. Sing T, Sander O, Beerenwinkel N, Lengauer T (2005) ROCR: visualizing classifier performance in R. Bioinformatics 21:3940–3941.

    CAS  Article  PubMed  Google Scholar 

  43. W.N. Venables, B.D. Ripley, Modern Applied Statistics with S, Fourth, Springer, New York, 2002.

  44. M. Gamer, J. Lemon, I. Fellows, P. Singh, Various Coefficients of Interrater Reliability and Agreement, (2019). Accessed 17 Sept 2020

  45. M Monaro A Toncini S Ferracuti G Tessari MG Vaccaro P Fazio De G Pigato T Meneghel C Scarpazza G Sartori 2018The Detection of Malingering: A New Tool to Identify Made-Up Depression Front Psychiatry 9.

  46. Ferrari R, Kwan O (2001) The no-fault flavor of disability syndromes. Med Hypotheses 56:77–84.

    CAS  Article  PubMed  Google Scholar 

  47. Fishbain DA (1994) Secondary gain concept. APS J 3:264–273.

    Article  Google Scholar 

  48. Fishbain DA, Cutler R, Rosomoff HL, Rosomoff RS (1999) Chronic Pain Disability Exaggeration/Malingering and Submaximal Effort Research. Clin J Pain 15:244–274.

    CAS  Article  PubMed  Google Scholar 

  49. Gatchel RJ, Adams L, Polatin PB, Kishino ND (2002) Secondary loss and pain-associated disability: theoretical overview and treatment implications. J Occup Rehabil 12:99–110.

    Article  PubMed  Google Scholar 

Download references


Open access funding provided by Università degli Studi di Padova within the CRUI-CARE Agreement. This work was supported by funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No 777090.

Author information




HDR, JMB-B, MM, GS: Conceptualization; HDR, JMB-B, MM, FC, MM-C, MB-L, SM: Data curation; HDR, JMB-B: Formal analysis; HDR, JMB-B, MM: Investigation; HDR, JMB-B, MM: Methodology; MM: Project administration; GS: Supervision; HDR, JMB-B, MM: Writing—review & editing.

Corresponding author

Correspondence to Merylin Monaro.

Ethics declarations

Ethics approval

The methodology was approved by the Ethics Committee of UNIPD.

Consent to participate

All participants gave informed consent to participate in the study.

Consent for publication

All participants gave consent for publication of the aggregate data.

Conflicts of interest

The authors declare that there are no conflicts of interest that could have influenced the outcome of this work.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Annex A

Annex A

Table 5 Neck Pain Symptoms Questionnaire (NPSQ)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Monaro, M., De Rosario, H., Baydal-Bertomeu, J.M. et al. A model to differentiate WAD patients and people with abnormal pain behaviour based on biomechanical and self-reported tests. Int J Legal Med 135, 1637–1646 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI:


  • WAD
  • Whiplash
  • Malingering detection
  • Whiplash kinematic test
  • Whiplash self-report questionnaire