Introduction

Drivers who experience reduced alertness or other adverse effects of central nervous system (CNS) drugs often show impaired driving. Being aware of adverse CNS drug effects and recognizing that driving is impaired are essential for making appropriate countermeasures. If individuals are aware of adverse drug effects, they may choose not to drive. Alternatively, if they acknowledge that their driving is impaired, they can stop driving before an accident occurs or can drive shorter periods of time, or avoid taking the drug on future occasions when they plan to drive. Unfortunately, research has shown that drivers can poorly judge their own driving performance.

Drivers often overestimate their skills and abilities and do not recognize impairment. For example, it has been shown that 70–90% of drivers feel their driving is better and less risky than that of the average driver (Svenson 1978, 1981; McCormick et al. 1986; Groeger and Brown 1989; Sivak et al. 1989). Men and women do not differ in overestimating their own driving ability (McCormick et al. 1986; Groeger and Brown 1989). Subjective ratings of driving ability were shown to differ between countries and cultures and may depend on factors such as driving experience, age, personality, and crash experience (Matthews and Moran 1986; Sivak et al. 1989; Holland 1993; Lajunen and Summala 1995; Lajunen et al. 1998). Although there is a bias towards a more positive view of drivers’ own skills and abilities, this does not necessarily mean that they are unable to adequately assess changes in their driving performance, for example those induced by CNS drugs and alcohol. Research that tested drivers who were unfit for driving due to medical conditions such as stroke showed that patients who fail a driving test often have limited awareness of their disability and corresponding negative consequences with regard to driving (Patomella et al. 2008). Of 152 elderly drivers tested in a driving simulator, 65% rated their performance better than the average driver (Freund et al. 2005). However, about half of them showed unsafe driving performance on the simulator test.

Previous research that compared subjective driving quality and actual driving performance suffers from methodological issues. The measurement of actual driving performance was typically accomplished by means of psychometric tests in the laboratory, driving simulators, or closed-road driving tests. It can be questioned to what extent these tests represent and predict actual driving and the degree to which subjects think this relates to real driving. The primary reason for this is that in these tests there is no risk to the driver, which will cause participants to behave differently than in actual traffic. Other studies did use on-road assessments, but performance was rated by the experimenter or a driving instructor, not the driver. Finally, measurement tools are not standardized across studies which make comparing the results difficult.

This paper examines the relationship between subjective assessments and actual driving impairment, objectively measured by the on-the-road driving test. The on-the-road driving test in normal traffic (Verster and Roth 2011) is the gold standard method to examine the effects of CNS drugs on driving. The 100-km standardized test was developed in the 1980s (O’Hanlon et al. 1982; O’Hanlon 1984) and has been applied in over 75 clinical trials. Participants are instructed to drive with a steady lane position and a constant speed. The Standard Deviation of Lateral Position (SDLP), i.e., the “weaving” of the car, is the primary parameter of the test. The test has proven sensitivity to measure dose-dependent impairment of a variety of CNS drugs, and comparative research with alcohol, THC, and benzodiazepines showed that dose-related changes in SDLP correlate high with the risk of becoming involved in traffic accidents (Owens and Ramaekers 2009).

As a standard procedure in clinical trials applying the on-the-road driving test, subjective assessments are conducted to determine levels of alertness before driving, and after driving perceived driving quality, and mental effort needed to perform the drive. Based on the literature discussed above, it is hypothesized that drivers are unable to accurately estimate (changes in) their driving performance.

Methods

Data from three studies were used to compose the current data set (Verster et al. 2002a, b, 2003, 2006). The Medical Ethics Committee of the University Medical Center Utrecht approved the study protocols, and subjects were treated according to ICH guidelines for Good Clinical Practice and the Declaration of Helsinki and its amendments.

Participants

A total of 96 healthy volunteers completed the three studies. In each study, male and female participants were equally represented. Mean (SD) age was 23.9 (2.2) years old. Written informed consent was obtained before their participation. Subjects were medically screened, used no concomitant medication other than paracetamol and oral contraceptives, and they had no unstable medical disorders, history of psychiatric disease, alcohol or drug abuse. Before the start and at the end of the studies, blood chemistry and hematology and urinalysis were determined, and a 12-lead ECG was recorded. All assessments were within normal limits. To confirm drug status, at all visits subjects were tested on the presence of alcohol and drugs of abuse (amphetamines, barbiturates, cannabinoids, benzodiazepines, cocaine, and opioids). In addition, female subjects underwent β-HCG pregnancy tests. None of the subjects were positive on any of these drugs. Subjects possessed a valid driver’s license, drove more than 5,000 km/year during each of the past 3 years, and were trained on the driving tests. A thorough discussion on the inclusion and exclusion criteria of participants and description of the study designs can be found in the individual reports (Verster et al. 2002a, b, 2003, 2006).

Treatments

A variety of treatments were tested in the three studies, including alcohol, hypnotics, anxiolytics, analgesics, and H1-antihistamines. The specific drugs and their dosages are summarized in Table 1.

Table 1 Administered drugs in the three driving studies

The on-the-road driving test

Standardized 100-km driving tests (Verster and Roth, 2011) were performed on a primary highway during normal traffic, between the cities of Utrecht and Arnhem. A camera, mounted on the roof of the test vehicle, measured the vehicle’s lateral position relative to the road delineation. Participants were instructed to drive with a steady lateral position within the right traffic lane while maintaining a constant speed of 95 km/h (60 mph). The amount of “weaving” of the car, measured by the standard deviation of the lateral position (SDLP, centimeters), was the primary outcome parameter. The standard deviation of speed (kilometers per hour) was a secondary parameter. Duration of the driving test was approximately 75 min.

Patients were allowed to deviate from the instructions to overtake a slower-moving vehicle in the same traffic lane. The vehicle’s speed and lateral position were continuously recorded. A licensed driving instructor who had access to dual controls sat in the right front seat, guarding the subject’s safety. Tests could be terminated if the driving instructor or the subject felt it was unsafe to continue. Before disclosure of treatment blinding, the data were edited off-line to remove data that were disturbed by extraneous events (e.g., overtaking maneuvers and traffic jam), and SDLP values were calculated.

Subjective assessments

Before the driving test, subjective alertness was measured using a 21-point visual analog scale. After the driving test, subject estimates of their driving quality and mental effort to perform it were measured. Participants indicated the perceived quality of their driving performance on a visual analog scale, which ranged from 0 (“I drove exceptionally poorly”) to 20 (“I drove exceptionally well”) around a midpoint of “I drove normally.” The level of mental effort they had to exert to perform the task was indicated on a 15-cm interval scale, ranging from “absolutely no effort” to “extreme effort” (Zijlstra and Van Doorn 1985; Meijman et al. 1986).

Statistical analysis

For each treatment, difference scores from placebo were calculated for each dependent variable. The data of the three studies were combined into a single dataset to allow comparisons between changes in the primary parameter of the driving test (SDLP) and difference scores for the subjective assessments (driving quality, mental effort taken after driving, and alertness taken before driving). Pearson’s r correlations were computed; significance was established if p < 0.05 (two-tailed). The analyses were done for all data combined, as well as for the individual drug treatments.

Results

Data from N = 604 driving tests (N = 96 participants) and corresponding subjective assessments were available for the statistical analysis. For the difference scores from placebo, a total of N = 431 comparisons could be made. Table 2 summarizes differences from placebo and their statistical significance. The relationship between changes in SDLP and subjective assessments is shown in Figs. 1, 2, and 3.

Table 2 Differences from placebo (centimeters) for SDLP, perceived driving quality, mental effort to perform the test, and alertness before driving
Fig. 1
figure 1

Relationship between changes from placebo of SDLP and subjective driving ability

Fig. 2
figure 2

Relationship between changes from placebo of SDLP and mental effort to perform the driving test

Fig. 3
figure 3

Relationship between changes from placebo of SDLP and alertness before the driving test

The overall analyses revealed significant correlations for perceived driving quality (r = −0.498, p < 0.0001), mental effort to perform the test (r = 0.408, p < 0.0001), and alertness taken before driving (r = − 0.115, p < 0.017). Thus, the predictive validity (R 2) of perceived driving quality (24.8%) and mental effort (16.6%) was moderate to low, but importantly, predictive validity of alertness before driving (1.3%) was virtually absent. Thus, the only measure taken before the actual drive, and therefore the only measure of predicting rather than evaluating performance, showed the lowest predictive validity. Overall results and those for individual drugs are summarized in Table 3.

Table 3 Correlation between SDLP differences from placebo (centimeters) and perceived driving quality, mental effort to perform the test, and alertness before driving

It is evident from Table 3 (data from individual drug treatment analyses) that the correlation between SDLP differences from placebo and subjective assessments is highly variable between studies with no clear relation with drugs, doses, or times of drug administration relative to time of driving. It is important to recognize that these are the data that are gathered for individual research projects.

Discussion

The analyses show that subjective assessments, especially predictive ones, do not robustly relate to actual driving performance either in terms of judgments about alertness before the drive or ratings of performance after the drive. This is especially true when the data are analyzed study by study. The present findings have important clinical and public health implications as it is typically the patient’s responsibility to make a decision whether or not to initiate a drive and whether to continue driving after they have experienced how they drive under the medication. In fact, the direction to patients to experience the drug several times before they undertake operating machinery presupposes that patients will accurately assess their performance under the influence of the medication and thereby make informed decisions.

Whenever participants drive a vehicle, they make a subjective estimate of how they are feeling at that time and location in comparison to their normal driving state. Therefore, the rationale for the statistical analyses was to use difference scores from placebo. This approach was chosen because it provides information if drivers can adequately assess changes from their normal driving performance. Where there are significant differences from placebo for SDLP (see Table 3), these are associated with increases in mental effort (except for alcohol) and decreases in perceived driving quality. Therefore, participants were retrospectively consistently aware of impairment. These findings were much less consistent for differences in SDLP and alertness assessed before driving. For example, while both drugs significantly impair driving, alertness before the test was significantly decreased for zolpidem, but increased for diphenhydramine (day 4). The analyses on difference scores from placebo revealed statistically significant correlations for perceived driving quality, mental effort to perform the test, and alertness before driving, but the percentage of predictive validity towards changes in SDLP was low. Correlations between absolute values of SDLP and subjective assessments of driving ability are not very informative. The primary reason for this is that SDLP values differ greatly between participants (Verster and Roth 2011). Hence, one subject may regard an SDLP of 15 cm as normal, whereas another participant regards an SDLP value of 25 cm as normal. As placebo (absolute) values between participants are so variable, they cannot be used to provide a quantitative comparison.

These data show that participants are not always able to accurately judge changes in their driving performance after administration of CNS drugs. The data also show that mental effort to conduct the test is a poor indicator of the driving impairment they experienced. Of greater concern is that the level of alertness before driving does not predict driving performance. Although the correlation was significant (at least in part due to the large number of subjects as well as the number of different drug treatments), the predictive validity was only 1.3%. The poor correlation between perceived level of alertness and actual performance is caused by the fact that the awareness of reduced alertness differs between CNS drugs (See Table 3). For example, whereas driving was severely impaired after both administration of zolpidem (20 mg) and alprazolam (1 mg), alertness correlated significantly with SDLP only after zolpidem, but not after alprazolam. Also, perceived driving quality did not correlate significantly with driving impairment caused by alprazolam. In contrast, after administration of zolpidem even at a high dose (20 mg), drivers were fully aware of their performance impairment. These findings illustrate the importance of obtaining these subjective assessments to compliment the objective measurements of driving performance. It provides useful information whether drivers are aware of driving impairment, or if participants have to invest significantly more effort to prevent driving impairment.

A limitation of this study is that we did not directly ask participants before the start of the driving test to what extent they anticipated their driving would be impaired. Nevertheless, it can be assumed that many people make this judgment based on the level of alertness they experience. Of interest for future research is to conduct assessments of subjective driving quality during the driving test. The objective and subjective measurements would then be truly concurrent assessments. Making these assessments several times during the driving test would also enable showing whether the subjective assessments are in line with the objective performance decrement generally seen during the driving test (Verster and Roth 2011).

The results of the current analyses are consistent with the literature on sleep deprivation that also shows a lack of a relation between perceived performance impairment and actual performance impairment (Herscovitch and Broughton 1981; Van Dongen and Belenky 2009). This has important clinically relevant implications. First, people should not rely solely on their perceived alertness when deciding whether or not it is safe to drive a car after using CNS drugs. The poor correlation between subjective and objective driving ability shows that the general advice that “patients should listen to their body, and not drive if they feel their driving is impaired” should not be relied on because patients may not be aware of their driving impairment. Physicians should therefore urge patients to follow-up labeling instructions regarding drug effects on driving. In addition, patients should be informed that they may not be aware of impaired driving, or do not experience reduced alertness while at the same time their performance may be significantly impaired. Physicians should take a more active role in educating patients about the possible risks of drug use in combination with daily activities such as driving. This proactive approach by physicians is important because many patients do not read nor follow-up instructions of prescription drug labeling (Veldhuijzen et al. 2006). Although several CNS drugs can impair driving, often alternative pharmacological treatments are available that do not affect driving ability or more typically have a lesser effect on driving. When a patient needs to drive a car regularly, physicians should incorporate the data on the effects of the drug on driving performance into their overall selection of the most appropriate medication for this patient.