Delirium is a neuropsychological syndrome characterized by a sudden disturbance in attention and awareness. This syndrome typically presents with acute cognitive and/or perceptive alterations that are not best explained by a pre-existing or evolving neurodegenerative disorder.1 Delirium occurs in up to 50% of older patients who require acute hospital care in the context of major illness or following major surgery.2,3 Despite being a strong predictor of adverse events4 and cognitive decline5 resulting in longer hospital stays,6 loss of autonomy,7 and increased mortality,8 delirium remains under-recognized.9 The routine use of validated instruments is key to the early identification of delirious patients.10 Since its validation in 1990,11 the Confusion Assessment Method (CAM) has become the most widely used diagnostic instrument in clinical practice; however, it is time consuming and requires specialized training.12 The 3-Minute Diagnostic Assessment for Delirium using the Confusion Assessment Method (3D-CAM) was designed and validated in 2014 to facilitate routine delirium diagnosis by bedside caregivers, irrespective of their level of expertise in the field.13 The instrument is divided in two sections: ten questions administered to the patient directly, followed by ten to 12 questions to the caregiver. The original English version of the 3D-CAM instrument showed excellent psychometric properties (sensitivity, 95%; specificity, 94%) in a cohort of 201 older patients with and without dementia. Minimally trained clinical staff needed only three minutes on average to complete the assessment. A brief and easy to use questionnaire, 3D-CAM is well suited to postoperative or medical high-risk patients.14

Since 2014, the 3D-CAM has been translated and adapted to German, Italian, Danish, Spanish, Portuguese, Malay, Chinese, Japanese, Polish, Thai, and Turkish. To this day, it remains unavailable in French. We translated the 3D-CAM into French and optimized its adaptation (face validity, ease of use, identification of factors associated with lack of clarity) to older high-risk patients and caregivers in perioperative and medical intermediate care units.

Methods

After obtaining authorization to use the 3D-CAM instrument for research purposes from the instrument developer (Dr. Edward R. Marcantonio, Beth Israel Deaconess Medical Center, Boston, MA, USA), the local Research Ethics Board (Cantonal Research Ethics Commission, Geneva, Switzerland) approved the protocol on 11 July 2018 (Swissethics ID 2018-00211).

Basic principles

We used principles of good practice for the translation and cultural adaptation process for patient-reported outcomes measures issued by the International Society for Patient Outcomes Research (ISPOR)15 to produce a culturally adapted French version of the 3D-CAM instrument.

Translation process

We adopted a multistep multidisciplinary approach involving a centralized review process16 (Fig. 1, upper half). To achieve semantic and conceptual equivalence of the translated instrument, four bilingual clinicians used to working with older inpatients conducted forward and backward translations. Translators worked independently of each other. They were instructed to favour conceptual rather than literal translations, strive for simplicity and clarity, and use language for a broad audience. All translations were assessed and reconciliated by a panel of experts composed of a researcher experienced in instrument development, two acute care physicians, and an acute care nurse.

Fig. 1
figure 1

3-Minute Diagnostic Assessment for Delirium using the Confusion Assessment Method (3D-CAM) French translation and cultural adaptation process. Both the translation (upper half) and adaptation (lower half) processes relied on qualitative (expert panel reconciliation, face validity assessment by patients, ease of use assessment by caregivers) and quantitative (Sperber score, clarity score) criteria

The instrument developer formally reviewed the backward translation using the Sperber rating scale (1 is best agreement; 7 is worst agreement).17 Using this scale, any item with a semantic and/or conceptual score ≥ 3/7 was identified as inadequate, revised by the panel of experts, then resubmitted to the instrument developer until all items scored < 3/7.

Cultural adaptation process and setting

Once translation was completed, we prospectively assessed the face validity, ease of use, and clarity of wording of the experimental French 3D-CAM in two intermediate care units (Fig. 1, lower half). We also aimed to identify demographic, cultural, or medical factors that may affect clarity.18 All cultural adaptation assessments were conducted in the accredited, anesthesia-led peri-interventional (perioperative) or medical intermediate care (high dependency) units at Geneva University Hospital (Geneva, Switzerland) between 9 January 2019 and 10 October 2019.

Patient selection for cultural adaptation

We approached patients aged 65 yr and older who were hospitalized in peri-interventional or medical intermediate care units for at least 24 hr. We excluded patients who (1) could not communicate in French, (2) could not give informed consent, (3) were visually impaired, or (4) were diagnosed with a terminal condition. Since delirium poses significant ethical challenges for the informed consent process,18 we excluded patients identified as delirious at the time of assessment. Nevertheless, we did not exclude those with cognitive deficits able to give direct or surrogate written consent. All eligible patients were informed that their help was needed to ascertain whether the questionnaire is understandable and acceptable. They were encouraged to make comments during the assessment.

Selection and role of caregivers during cultural adaptation

For questions 11A through to 22 (observations by caregivers), we approached a different intermediate care physician or nurse for each patient enrolled in the study. The data set thus contained as many caregivers as patients. We collected information on their professional experience (years of practice) and first language, and assessed ease of use by asking whether they thought the instrument was adequate in an acute care setting.

Face validity of the instrument

To assess the extent to which the instrument appears to measure what it claims to measure based on face value, we asked patients what they thought the questionnaire was designed to test, then grouped their answers by keywords.

Clarity of the instrument

We asked all participants (patients and caregivers) to grade the wording and phrasing of each item on a ten-point Likert scale (0: the question is not clear at all; 10: the question is very clear). Any time participants did not find a question very clear (score < 10 points), we inquired about problematic words or expressions that may need adjustment. The first round of 20 patients was designed to identify inadequate questions as any item with a mean clarity score < 9/10. Each following round of ten patients was designed to test successive revised versions of the questionnaire until all items scored ≥ 9/10 for clarity. We anticipated the process of cultural adaptation would require at least one round of revision.

Patients’ health status and education level

We assessed physical health, mental health, and level of education to identify factors that may affect clarity. We used an American Society of Anesthesiologists Physical Status classification (ASA) of > 3/5 to identify patients with a severe systemic disease that is a threat to life (unstable condition).19 We used a Clinical Frailty Scale (CFS) of ≥ 5/9 to identify frail patients.20 We used a Mini-Cog score of < 3/5 to identify patients more likely to have dementia.21 We used a Two-Question Depression Screening test (2QDS) score of ≥ 1/2 to identify patients more likely to have a depressive disorder.22 We used the 2011 International Standard Classification of Education (ISCED) scale to distinguish patients with a low level (< 4/8) or a high level (≥ 4/8) of education.23 Additionally, we asked all participants (patients and caregivers) which language they felt more comfortable with to distinguish native from non-native French speakers.

Role of the instrument developer

The final translated and culturally adapted French version of the 3D-CAM instrument was submitted to the original instrument developer for approval.

Statistical analyses

Compared with psychometric validation studies that require large sample sizes, 30 to 50 participants are typically necessary to translate and adapt instruments to a different culture.24,25,26 We intended to include at least 30 patients and 30 caregivers to maximize the power to detect potential problems during pretest experiments, as previously recommended.27 We used mean with standard deviation (SD) or median with interquartile range [IQR] to summarize continuous variables. We used count with percentage to describe categorical variables. We used Fisher’s exact test to study whether severe systemic disease (ASA > 3/5), frailty (CFS ≥ 5/9), dementia (Mini-Cog < 3/5), depression (2QDS ≥ 1/2), or low educational level (ISCED < 4/8) are associated with lack of clarity (mean clarity score < 9/10). We also examined whether non-native French speakers struggled with the questionnaire, leading to negative association with clarity scores. Unadjusted P values < 0.05 were considered statistically meaningful. We used Microsoft Excel (Microsoft Corporation, Redmond, WA, USA) and R 3.6.2 (R Core Team 2020, R Foundation for Statistical Computing, Vienna, Austria) to store and analyze all data.

Results

We enrolled a total of 30 patients with a mean (SD) age of 76.6 (7.4) yr and 30 caregivers (2/3 were nurses) for the adaptation process (Table 1).

Table 1 Patients and caregivers’ characteristics during the cultural adaptation phase

Translation process

The expert panel made several incremental adjustments to the forward and backward translations before preliminary work was submitted to the instrument developer. Four items were judged inadequate (Sperber score ≥ 3/7) by the original instrument developer. They were revised until all reached a score < 3/7 for both language and interpretation (Table 2).

Table 2 Quantitative assessment of translation versions

Face validity of the instrument

At the end of the questionnaire, 21/23 (91%) patients thought the scale was designed to assess “memory”, “thoughts”, or “reasoning”. Two patients thought the scale was designed for other, noncognitive purposes. Seven patients opted out of summarizing what the instrument is designed to assess, mostly because of fatigue.

Ease of use of the instrument

All caregivers (100%) thought the instrument was adequate in an acute care setting.

Clarity of the instrument

After 20 questionnaires were completed (first round), five items (Q4, Q8, Q10, Q11B, and Q14) were judged inadequate (clarity score < 9/10). Additionally, three items (Q5, Q9, and Q12) included terms or expressions that were judged ambiguous. Using comments made by participants, we tested a revised version of the questionnaire in a separate group of ten patients and ten caregivers (second round). All questionnaire items reached clarity scores ≥ 9/10 (Table 3).

Table 3 Clarity scores during the cultural adaptation phase

Identification of patient factors that may affect clarity

We focused on questions 4, 8, and 10, which several patients found less clear in the first round. Neither medical nor sociodemographic characteristics were associated with clarity judgement. Nevertheless, 91% of demented patients found question 8 unclear (score < 9/10), compared with 50% of nondemented patients (P = 0.04) (Fig. 2).

Fig. 2
figure 2

Impact of language skills, health status, and education level on the proportion of patients reporting lack of clarity (y axis, higher value = lower clarity) for questions 4, 8, and 10 (x axis). Those three questions were selected because they had mean clarity scores < 9/10 during the first round of cultural adaptation. For each question, we compared patients with vs without (A) an unstable physical status (defined as an American Society of Anesthesiologists Physical Status classification > 3/5), (B) frailty (defined as a Clinical Frailty Score ≥ 5/9), (C) dementia (defined as a Mini-Cog score < 3/5), (D) depression (defined as Two-Question Depression Screen score ≥ 1/2), (E) a lower level of education (defined as an International Standard Classification of Education level < 4/8), and (F) French as their first language

Approval by the Instrument Developer and access to the instrument

The final French version of the 3D-CAM instrument was approved by the original instrument developer. To access the final French version and all updated 3D-CAM material freely and without registration, go to https://americandeliriumsociety.org/cam-and-help-tools/. For further information, please contact info@americandeliriumsociety.org.

Discussion

We produced a French version of the 3D-CAM instrument that is culturally adapted to older patients and professionals in peri-interventional and medical settings. We complied with ISPOR good practice standards15 and used both qualitative and quantitative methods to identify and address several issues during the translation and cultural adaptation phases. We found that technical and conceptual words are harder to translate. We showed that the French 3D-CAM is easy to use and has good face validity. We also showed that cognitively impaired patients hospitalized in an acute care setting are more likely to struggle answering questions about changes over time.

With approximately 270 million speakers, French was the seventh most spoken language worldwide in 2020.28 Once validated, implementation of the French 3D-CAM will facilitate the identification of delirious patients hospitalized in French-speaking areas.29,30 Our results are particularly relevant in acute monitored care settings, such as intermediate care units, where more than one in five patients develop delirium on average across medical and surgical specialties.31,32 Compared with the standard CAM, which is available in French, the 3D-CAM is much shorter to administer at the bedside. All 30 caregivers involved in our study found the 3D-CAM to be adapted to their setting. Compared with the CAM-ICU or the ICDSC, the 3D-CAM was designed to assess noncritically ill patients33 and is therefore adapted to a broader population of patients. Contrary to screening tools such as the Nu-DESC or the 4AT that typically require further testing when positive, the 3D-CAM is a diagnostic instrument that identifies delirious patients who should receive adequate care promptly.34

We used robust, ISPOR-compliant methods to achieve crosscultural adaptation, which is an essential step towards fully validating any questionnaire in a foreign language.26 Unfortunately, the quality and methodological approaches of crosscultural studies have been shown to vary greatly, and often depart from recommendations.35 Although poor cultural adaptation work leads to poor validation studies and biased psychometric results,36 translating and culturally adapting research instruments is often treated as an unimportant step of study protocols.17

All four questions that required additional translation work also proved more challenging to adapt culturally. Translators and several healthcare professionals struggled with the terminology used to describe levels of consciousness (questions 11B and 12) or quality of communication (question 14). Words used to describe disorders of consciousness fail to capture the complexity of the underlying neurobiology.37 Perhaps unsurprisingly, they are often confusing, if not controversial.38 Qualitative assessment of communication can be affected by multiple cultural and medical factors.39 We found that replacing technical terms such as “stuporous”, “hypervigilance”, “verbose”, and “tangential” with more common, though less precise, wording such as “sleepy”, “state of heightened alertness”, “speaks a lot”, or “lacks focus” achieved both translational and cultural goals. In addition to improving clarity, shorter and simpler questions are generally perceived to be better as they tend to reduce the number of missing answers.40 Nevertheless, oversimplifications carry the risk of inducing measurement errors.41 Such issues highlight the importance of appropriate training prior to administration of the questionnaire, as well as the availability of a clear training manual in French.42

We found that cognitively impaired patients (identified using the Mini-Cog test) were more likely to find question 8 unclear. Declines in cognitive function interfere with the question-answer process.43 Question 8 asks patients to identify a change over time (“during the past day”). Although cognitively impaired patients often present with compromised ability to project themselves in time44, Marcantonio et al. did not encounter similar clarity issues in their original validation study.13 Overall, they found that the 3D-CAM performs well in patients both with and without dementia (sensitivity, 96% vs 93%; specificity, 86% vs 96%). Contrary to their study that included older patients in general medicine or geriatric medicine services, we studied high risk patients admitted to intermediate care units, where symptoms of dementia may be aggravated.45 More specifically, noise and frequent interventions disrupt circadian rhythms and exacerbate otherwise minor timekeeping issues.46 Our finding further reinforces the importance of multicomponent strategies to promote “chronofitness” in acute care settings.47

Our work has two significant limitations. First, although we used robust methods to translate the 3D-CAM instrument, all French speakers around the world do not share the same cultural background. The meaning of words can slightly fluctuate from one French speaking region to another, and local semantic or conceptual variations may affect the psychometric properties of the questionnaire.48 The French version presented here, being produced in a single Swiss hospital with surgical and medical patients, may not be fully adapted to all 29 countries where French is listed as an official language. We encourage clinicians working in different French-speaking areas to verify the clarity of the instrument locally. Second, we produced a culturally adapted, French version of the 3D-CAM instrument but did not aim to validate it by investigating its psychometric equivalence for delirium diagnosis with the original English instrument. Our work now calls for further investigations to confirm the validity and reliability of the French 3D-CAM. Such work will require a larger sample size and the inclusion of patients with and without delirium.

Conclusion

We used robust qualitative and quantitative ISPOR-compliant methods to produce a French version of the 3D-CAM instrument that is adapted to the acute monitored care setting. Prior to clinical implementation in French-speaking areas, the French instrument must be culturally validated by showing its psychometric equivalence for delirium identification with the original English instrument.