Background

Although Arabic is one of the most widely used languages worldwide, little is known regarding the standardized neuropsychological tests available in Arabic [1]. One of the most widely used assessment packages in dementia is the Consortium to Establish a Registry for Alzheimer’s Disease (CERAD) which was established in 1986. The major standardized instruments developed by CERAD are now used by many Alzheimer’s disease research centers in the USA and abroad, by physicians in clinical practice, and in population-based surveys [2, 3]. They have been translated, in whole or in part, into Arabic, Bulgarian, Chinese, Dutch, Finnish, French, German, Hebrew, Hindi (for a rural, illiterate population), Italian, Japanese, Korean, Norwegian, Polish, Portuguese, Russian, Spanish, and Swedish and into English sign language [4]. The CERAD neuropsychological battery and assessment package has been translated into Arabic, adapted and validated for the Egyptian language and culture [5]. For the purpose of the current study, we used the Arabic versions of the subset of tests from the CERAD battery that were included in this study: the animal category test of semantic verbal fluency (VF) [6], the 15-item short version of the Boston Naming Test (BNT) [7], the Mini-Mental State Examination (MMSE) [8], and the clock drawing test (CDT) [9]. All have been translated into Arabic language and have been validated as a part of the Arabic version [5]. The animal category test of semantic VF is a measure of impairment in verbal production and a test of semantic memory and language. It is rapidly administered, readily available, and easy to administer and interpret. The individual is asked to name all the animals that he/she can name within 60 s. This task can be understood by patients with cognitive impairment and is sensitive to Alzheimer’s disease [10]. The BNT is a test of language ability in which participants are asked to name objects depicted by line drawings [7]. The short BNT contains five high-frequency, five medium-frequency, and five low-frequency items. Each participant is presented with 15 line drawings in an accompanying flipbook. The correct names for the objects are listed on the Response Sheet of CERAD Form J2. The third test included was the CDT. It was originally used to assess visuoconstructive abilities. It has been proposed as a quick screening test for cognitive dysfunction [11]. The strength and weakness of the CDT lies in the number of cognitive, motor, and perceptual functions that must be used simultaneously to successfully complete the task, which requires visual memory, visuospatial abilities, orientation, conceptualization of time, executive function, auditory comprehension, motor programming, numerical knowledge, semantic instruction, inhibition of distracting stimuli, concentration, and frustration tolerance [12]. Serial CDTs can be used to follow a patient with progressive dementia or to observe recovery from delirium [13]. The MMSE is in some ways the best known and most widely used measure of cognition in clinical practice worldwide [14], and it represents a benchmark against which all newer tools can be measured [15]. It is an easily administered, simple, and widely available test that assesses orientation, memory, attention and calculation, language, and visual construction. The MMSE takes approximately 10 min to administer, and its total score is 30. It has been translated and adapted for the culture of elderly Arabic-speaking Egyptians as a part of the Arabic translation and adaptation of the CERAD battery [5].

Aim

The aim is to determine the diagnostic performance and the best cutoff value for the clock drawing test (CDT), animal category test of semantic verbal fluency (VF), and the short form of the Boston Naming Test (BNT) in identifying patients with mild and major neurocognitive disorders among community-dwelling educated Egyptian older adults using Arabic versions of those tests.

Methods

Participants

Community-dwelling educated male and female older adults aged 60 years or older. Education is defined as 6 years or more of formal education. The setting, sample size, and selection criteria are as follows: all patients attending the outpatient geriatric clinic at Ain Shams University Hospital, Cairo, Egypt, in the period from January 1, 2016, till December 31, 2016, excluding, at the start of the study, those patients who or their next of kin refused to participate in the study. (Successive patients attending the outpatient geriatric clinic at Ain Shams University Hospital during a 12-month study period from January to December 2016. Patients were excluded if they or their next of kin declined to participate.) We estimated the minimum sample size required, based on the prevalence of cognitive disorders and both sensitivity and specificity of the screening and diagnostic values of tests studied. The power is set to be at least 80% and the p value and is set to be less than 0.05. The calculated sample size is found to be 110 with minimal number of 35 in each studied group.

A two-step protocol was followed.

Step 1: Participants were divided into three groups according to DSM-V diagnostic criteria for neurocognitive disorders. The assessment of cognitive function included Mini-Mental State Examination (MMSE), Clinical Dementia Rating Scale (CDR), patient’s current functional status regarding basic and instrumental activities of daily living by the Blessed Dementia Scale (BDS) [16], and patient detailed history and examination based on protocol five of the CERAD assessment packet. Those with MMSE score of less than 24 were identified as having cognitive impairment [5, 17]: CDR score, 0 = no dementia; 0. 5 = uncertain or deferred diagnosis; 1 = mild dementia; 2 = moderate dementia; 3 = severe dementia; 4 = profound dementia; 5 = terminal dementia.

Blessed Dementia Scale: The original Dementia Scale of Blessed et al. has been modified by minor wording changes and by the deletion of questions related to personality, interests, and drives. The informant is asked to rate, on a 3- or 4-point scale as indicated, the subject’s cognitive ability during the preceding 6 months, as compared with premorbid ability in performing practical tasks of everyday living. Fully preserved capacity is represented by a total score of 0, whereas maximum impairment is reflected by a total score of 17. Less than four is cognitively unimpaired, 4–9 is mild impairment, and more than nine is moderate to severe impairment.

Step 2: The application of the Arabic versions of EL OKL et al. 2001 VF test, the BNT, and the CD by a clinical psychologist blinded to the initial assessment results.

Statistical method

The collected data was revised, coded, tabulated, and entered into a PC using the Statistical Package for the Social Sciences (IBM SPSS 20.0). The analyses performed were determined by the type of data obtained for each parameter. The sensitivity and specificity of each test are calculated, and the receiver operating characteristic (ROC) curves were drawn. The area under the curve (AUC) is a measure of diagnostic accuracy. Interpretation of areas under the curve (AUCs) is as follows: 0.90–1 = excellent, 0.80–0.90 = good, 0.70–0.80 = fair, 0.60–0 .70 = poor, 0.50–0.60 = fail. The cutoff values were determined as the values with the highest balanced sensitivity and specificity.

Results

The study sample is 122 community-dwelling educated older adults who attended the outpatient geriatric department clinic at the Ain Shams University Hospitals. 32.79% of the samples (40 subjects) have mild neurocognitive disorder while 38.52% (47 subjects) have major neurocognitive disorder. As regards the mean age ± SD, it is 70.1 ± 6.73 for the group with mild neurocognitive disorder (NCD) and 71.1 ± 6.24 for the group with major neurocognitive disorder. As regards gender, males are 62 (50.8%) of the sample, 45.2% of major NCD, and 24.2% of mild NCD.

Figure 1 and Table 1 show the statistical findings for the mild neurocognitive disorder group. AUC for CD, VF, and BN are 0.807, 0.77, and 0.753 respectively while the cutoff values for CD, VF, and BN are 2, 12, and 14 respectively.

Fig. 1
figure 1

ROC for VF, BN, and CD tests in the mild neurocognitive disorder group

Table 1 Diagnostic performance and cutoff values for CD, VF, and BN in the mild neurocognitive disorder group

Figure 2 and Table 2 show the statistical findings for the major neurocognitive disorder group.

Fig. 2
figure 2

ROC for VF, BN, and CD tests in the major neurocognitive disorder group

Table 2 Diagnostic performance and cutoff values for CD, VF, and BN in major neurocognitive disorder group

AUC for CD, VF, and BN are 0.884, 0.877, and 0.839 respectively while the cutoff values for CD, VF, and BN are 2, 9, and 13 respectively. So, the three tests are good tests for major NCD.

Discussion

With the projected increase in the elderly population and the expected rise in the prevalence of dementia, early case identification is necessary for planning and delivering clinical services. The effectiveness of dementia screening depends on the availability of suitable screening tools with good sensitivity and specificity to confidently distinguish normal cognitive function from minor and major NCD. There is a demand for rapidly administered, sensitive, and reliable Arabic versions of cognitive assessments particularly those which are designed for identifying older adults with minor and major neurocognitive impairment. To meet this need, we used in the current study the Arabic version of CDT, VF animal category, and BNT as a part of the Arabic version of CERAD neuropsychological battery. The CERAD battery was translated into the Arabic language and validated in the Egyptian elderly. We chose rapidly administered and sensitive tools to find out the best cutoff for patients with minor and major neurocognitive disorders in educated Egyptian older adults. Our results present the best cutoff values that discriminate between cognitively healthy individuals and those with minor and major NCD. Our study included 122 community-dwelling educated older adults matched for age and gender, attending outpatient clinics at one of the largest tertiary hospitals in Cairo, Egypt. 71.3% of participants (number = 87) have either mild or major NCD. This could be explained by the fact that the participants were attending the outpatient clinics of the tertiary hospital with different comorbidities. This was not a community-based study, although the participants were community-dwelling and not living in residential care or nursing homes. Our study population represents the community-dwelling educated elderly adults who attended the outpatient general geriatric clinic with different medical illnesses. To our knowledge according to available resources to us, there are no available previous studies with the same aim and tools in Egypt to compare our results with, but there are studies for VF, CDT, and BN performance and cutoff points for other purposes. In the current study results, AUC for CD, VF, and BN for participants with minor NCD are 0.807, 0.77, and 0.753 respectively. It means that CD is a good test while VF and BN are fair tests for detection of minor NCD. For participants with major NCD, AUC for CD, VF, and BN are 0.884, 0.877, and 0.839 respectively. It means that the three tests are good tests for the detection of major NCD. In addition, the cutoff values for CD, VF, and BN are 2, 12, and 14 respectively for mild NCD and 2, 9, and 13 respectively for major NCD. According to Wiechmann et al. [18] study, ROC analysis indicated that the CDT was able to distinguish between normal elders and those with a dementia diagnosis. The cutoff score for differentiating patients with Alzheimer’s disease from normal participants was ≤ 3. The cutoff score for differentiating those with vascular disease from normal participants was ≤ 3. Mirandez [19] found that the cutoff score for the animal category test of VF was 14, and the AUC = 0.794. In comparison, Brucki et al. [20] found that the cutoff points were 9 for people with less than 8 years of education and had a mean of 13 for those with higher educational levels. With the abovementioned ROC and best cutoff values in our study, it is clear that our results are comparable to those studies, and those tests can be used to screen for cognitive impairment in educated Egyptian older adults.

Conclusion

The study presents the best cutoff for a group of neuropsychological tests which will help to advance research in the area of cognitive impairment in Egypt. Verbal fluency, clock drawing, and Boston naming showed reasonable diagnostic performance in educated Egyptian elderly and should be considered separately or in combination for the assessment of cognitive function.

Recommendations

Further research on a larger number and on a more diverse group of the population is warranted.