Introduction

At any point of time, a significant minority of children and adolescents all over the world suffer from serious emotional, cognitive, and/or behavioral problems threatening their sound development, mental wellbeing, and hence the future of humanity at large [1, 2]. Throughout the history of psychological sciences, great efforts have been made to formulate clinical models that can successfully define and measure such clinical problems and differentiate them from normal variations in development [3]. Parallel to the “top-down” diagnostic approaches, which start from experts’ pre-defined constructs, a “bottom-up” approach emerged to explore normal versus abnormal psychological phenomena using empirical data comprising ratings of large samples of individuals in multiple societies. An important example of this approach is the Achenbach System of Empirically Based Assessment (ASEBA) [4].

Starting more than 50 years ago, the ASEBA team developed a family of instruments that assess different aspects of human behavior, as observed by multiple raters, spanning ages 1.5 to 90 + years [5]. Ratings of the ASEBA problem items were repeatedly analyzed via both exploratory and confirmatory factor analyses to derive scales for scoring syndromes of problems that co-occur in each age group [6]. The ASEBA forms are also scored on DSM-oriented scales comprising problems judged by international experts to correspond to DSM-IV and subsequently to DSM-5 diagnostic categories. The ASEBA scales and their correlates have been supported by data from dozens of societies across the globe [5, 6].

Three ASEBA instruments are designed to assess school-age children and adolescents. The Child Behavior Checklist for Ages 6 to 18 (CBCL/6–18) is completed by parents and other caregivers. The Teacher’s Report Form (TRF) is completed for 6–18-year-olds by teachers and other school staff. The Youth Self-Report (YSR) [4] is completed by 11–18-year-olds to describe themselves. Translations are available in more than 110 languages. Confirmatory factor analyses have supported the ASEBA’s eight syndromes in dozens of societies. Syndromes designated as Anxious/Depressed, Withdrawn/Depressed, and Somatic Complaints are scored on the broad-spectrum Internalizing scale. Syndromes designated as Rule-breaking Behavior and Aggressive Behavior are scored on the broad-spectrum Externalizing scale. Syndromes designated as Social Problems, Thought Problems, and Attention Problems are not scored on either of the broad-spectrum scales, but are included with all other problems in the Total Problem score [6]. Analyses of CBCL/6–18 forms completed by the parents of 72,493 children living in 45 societies across the world revealed that only 10% of the variance in problem scores was explained by cultural and societal differences. About 90% of the variance was explained by individual differences in parents’ ratings of their children. That is to say that most of the differences in these children’s behavior could be explained by individual variations rather than by cultural or societal differences [6, 7]. According to their deviation from the “omnicultural mean” (the mean of the mean Total Problem score in all societies), Achenbach classified different societies into three groups, Multicultural Norm Group 1 that included societies scoring more than one standard deviation (SD) below the omnicultural mean, Multicultural Norm Group 3 including societies scoring more than one standard deviation (SD) above the omnicultural mean, and Multicultural Norm Group 2 which included societies scoring from 1 SD below to 1 SD above the omnicultural mean [6]. The Total Problem scores overlapped between all societies so that no society was categorically different from any other society [6]. The modest correlations between ratings by different informants indicated that no one informant’s ratings sufficed for comprehensive evaluations of children [5].

With a large, mostly youthful [8], population, the Arabic world needs standardized instruments to evaluate the behavior of younger generations. Arabic, a semitic language within the afro-asiatic family originating in the Arabian Peninsula, is spoken by more than 350 million people around the world. However, Arabic language has different forms and dialects that include traditional Arabic (the language of the Quran), Modern Standard Arabic (MSA), used in schools, official papers and news bulletins, plus spoken dialects used for everyday oral communication varying greatly from one area to another [9]. The mutual intelligibility between spoken dialects is usually related to the distance between the areas where these dialects are spoken, with some dialects being mutually unintelligible [10]. The 22 Arabic-speaking states are roughly divided into 3 major segments: North African western Arabic world, Eastern Mediterranean, and gulf states. Egypt is the most populous country in the Arab world with a population exceeding 100 million, more than 40% less than 19 years old. Its central location and strong cultural relations to almost every Arab country both make Egypt a representative country for the Arab world [11]. As spoken dialects are region-specific and not usually used in written language, findings with Modern Standard Arabic translations of ASEBA instruments in Egypt may be especially informative for the Arabic world.

Several Arabic translations of ASEBA forms have been used in the Arab world. Two unpublished theses utilized Arabic translations of the 1991 versions of the CBCL/1.5–5 and CBCL/6–18 in community studies but did not establish psychometric properties in Egyptian society [12, 13]. Most subsequent studies used these Arabic translations for evaluation of clinical samples without testing its psychometric properties in community normative samples [14,15,16,17,18,19,20,21,22,23,24,25]. Two studies attempted to establish the psychometric properties of ASEBA forms in Arabic speaking countries, Algeria and Tunisia, using the 1991 versions of the CBCL/6–18 and YSR. However, these two studies used the French, not the Arabic, translations of the instruments [26, 27]. Another study attempted to evaluate the psychometric properties of the CBCL/6–18 in a large representative sample of youth in Qatar, a relatively small gulf state, and supported the factor structure of Achenbach’s eight-syndrome scales of the CBCL/6–18 [28]. However, because this study did not include other forms, such as the TRF and YSR, it could not test cross-informant correlations. An unpublished thesis tested psychometric properties of the CBCL/6–18 and TRF in a community sample of 1440 6–12-year-olds in the Egyptian city of Alexandria. The authors reported strong internal consistency and acceptable cross informant agreement but did not compare the results with previously reported multicultural norms for the ASEBA instruments [29]. To our knowledge, ours is the first test of the CBCL/6–18, TRF, and YSR in a community sample of Arab speaking families. We tested the psychometric proprieties and cross- informant correlations of the translated instruments and compared their scores with scores for previously established Multicultural Norm Groups [30]

Subjects and methods

First, the extant translations of the three ASEBA forms to Modern Standard Arabic were obtained from the ASEBA team. The translations were all thoroughly revised and modified by three bilingual professionals, a child psychiatrist, a psychologist, and a specialist of rhetoric and linguistics. Modified translations and back translations were sent back to the ASEBA team for checking. After final translated forms were agreed upon, the required licenses were obtained from the Research Center for Children, Youth, & Families at the University of Vermont, USA. Our study was approved by the Ethical Research Committee of the Faculty of Medicine, Tanta University under the approval code of 33,171/06/19.

The study was performed in Tanta University Hospital, a large public tertiary hospital serving a wide catchment area in the middle of the Nile delta. This area includes the Kafr El-Sheikh, Gharbia, and Monofia governates, with a population of about 13 million people, approximately 75% of whom live in rural areas [11]. We visited the general pediatric clinic in Tanta University Hospital on a regular basis between February 2020 and March 2022 and attempted to recruit all 6–18-year-olds coming to the clinic for routine check-ups and vaccinations for COVID-19 with no specific behavioral or psychiatric complaints, enrolled in regular school education, and accompanied by parents who had completed at least the 6 years of education needed to comprehend written Modern Standards Arabic. Parents signed statements of informed consent, while their children provided verbal assent. Parents were assured that information would be confidential and anonymous. We checked every form for completeness. Cases with missing data were excluded. Among 1998 distributed forms, 1333 were completed, including 595 CBCL/6–18 forms, 409 YSR forms, and 329 TRF forms. Some 66.7% of the targeted informants completed the forms.

Data analysis

All forms were scored using the Assessment Data Manager (ADM 9.1) software, Burlington, Vermont, USA. Confirmatory factor analyses (CFA) were performed using MPlus software to check if our data fit the eight-syndrome structure of the forms found in other societies [4, 7, 31]. As the observed variables are ordinal, Weighted Least Square Mean and Variance Adjusted (WLMSV) CFAs were conducted. The minimum sample to detect an effect was 177, which was exceeded for all three forms. To confirm the structure of the previously reported eight-factor model of the ASEBA instruments, the Jeffreys Amazing Statistics Program (JASP) software was utilized [32]. First, the reliability of the tools was evaluated using the McDonald Omega [33] coefficient. Values equal to or greater than 0.65 were considered satisfactory. Second, to confirm the construct validity, a CFA was conducted. Finally, the goodness-of-fit was evaluated using the Comparative Fit Index (CFI), root mean square error of approximation (RMSEA), and Tucker-Lewis Index (TLI).

Results

The mean age was 12.7 ± 3.1 years for the CBCL/6–18 sample (n = 595) and 11.9 ± 3.4 years for the TRF sample (n = 329). Youth who completed YSR forms for themselves (n = 409) ranged in age from 11 to 18 years, with a mean age of 14.0 ± 2.1 years. Females comprised 47.4% of the CBCL/6–18 sample, 49.8% of the TRF sample, and 47.9% of the YSR sample.

The McDonald Omega coefficients of the eight syndromes ranged from 0.65 (withdrawn/depressed) to 0.89 (aggressive behavior) for the CBCL/6–18; from 0.66 (thought problems) to 0.92 (aggressive behavior) for the TRF; and from 0.71 (withdrawn/depressed) to 0.87 for both Somatic Complaints and Aggressive Behavior for the YSR. RMSEA values for the CBCL/6–18, YSR, and TRF were 0.047, 0.040, and 0.044, respectively, indicating good fits for the three forms in the Egyptian sample. Good fit was also indicated by CFIs of 0.96 for the CBCL/6–18, 0.97 for the TRF, and 0.98 for the YSR. Finally, TLI values were 0.95, 0.96, and 0.98 for the CBCL/6–18, TRF, and YSR, respectively. The overall goodness-of-fit was thus good for all three forms. Most of the individual items showed good correlations with their factors (factor loadings of 0.50 and higher). Fewer items had moderate correlations (factor loadings between 0.30 and 0.50) [34] while only three items (32, 52, and 56 g) showed factor loadings less than 0.30 (For complete list of McDonald Omega coefficients and factor loadings, see Supplementary Tables S13). Factors were all significantly correlated to each other in all three instruments (Supplementary Tables S46).

The mean Total Problem scores were 28.7 ± 25.9 for the CBCL/6–18, 29.8 ± 30.3 for the TRF, and 36.7 ± 29.5 for the YSR. Relative to ASEBA Multicultural Norm Groups [5,6,7, 31], the mean score for our CBCL/6–18 sample was at the high end of Multicultural Norm Group 2 (medium scores). The mean score for our TRF sample was at the low end of Multicultural Norm Group 3 (high scores), while the mean score for our YSR sample was at the low end of Multicultural Norm Group 2 (medium scores). The means and standard deviations of the individual scale scores for the three instruments are listed in Tables 1, 2, and 3. Cross-informant correlations were computed for forms completed for the same individuals. Moderate correlations were found between Internalizing, Externalizing, and Total Problem scores from the CBCL, TRF, and YSR (Table 4).

Table 1 CBCL empirically based scores for Egyptian normative sample (n = 595)
Table 2 TRF empirically based scores for Egyptian normative sample (n = 329)
Table 3 YSR empirically based scale scores for Egyptian normative sample (n = 409)
Table 4 Trans-informant correlations between the three ASEBA forms

Discussion

To our knowledge, this is the first test of the psychometric properties of the Modern Standard Arabic translations of the CBCL/6–18, TRF, and YSR. We found good psychometric properties, including replication of the previously reported [28, 31] eight syndromes with good fit of the model for all three instruments. When compared to the previously reported multicultural norms [5, 6, 31], our Egyptian Total Problem scores were at the high end of the ASEBA Group 2 (medium scores) for the CBCL/6–18, the low end of Group 2 for the YSR, and at the low end of Group 3 (high scores) for the TRF.

The problem scores for all three forms were in the medium to high range for studied societies, an unsurprising finding given the central location of Egypt in the Middle East that made it a historical hub for many interacting Eastern and Western cultures, which have traded, colonized, and mixed with the Egyptian population. The frequent long-term mixing between different ethnic groups characteristic for regions like Egypt may lead to moderation of genetic susceptibility to different emotional and behavioral problems leading to the noted pattern of medium scores. The mean Egyptian CBCL/6–18 scale scores approximated the corresponding mean scale scores in Multicultural Group 2 except for the Anxious/Depressed, Withdrawn/Depressed, and Internalizing problem scores of the school-age boys, which were comparable to the scores of Multicultural Group 3. A previous study by our team [35] found depressive disorders to be the second most common diagnoses after ADHD in a large clinical sample of Egyptian children. Overcrowded population together with strained economic conditions and culturally accepted physical punishment might contribute to elevated levels of depressive problems in Egyptian children [25]

The modest cross-informant correlations reported in previous studies [36, 37] support the use of forms completed by multiple informants to obtain a comprehensive understanding of the child. Our correlation of 0.54 between CBCL and YSR Total Problem scores was larger than the mean of 0.41 reported by Rescorla et al. [38] in a sample totaling 27,861 adolescents from 25 societies. Rescorla et al. reported correlations ≥ 0.50 in samples from Tunisia, Algeria, Germany, Lithuania, Korea, and Denmark. Our correlation of 0.52 between CBCL and TRF Total Problem scores was also above the mean of 0.26 reported by Rescorla et al. [39] in samples totaling 27,962 children from 21 societies, but a correlation as high as 0.49 was reported for the Danish sample. The large percentage of Egyptian families that use private home tutoring instead of regular schools [40], with continuous communication between family and home tutors and being at home rather than in classrooms, might explain the higher correlation between parent and teacher reports than found in many other societies. Despite the relatively high cross-informant correlations in our sample.

Our study was limited by a relatively small sample size, recruited from one center. This makes it difficult to generalize our findings to the entire Egyptian population. Second, the socioeconomic status of the families was not assessed. Third, only parents who completed 6 years of school and youth who attend school participated in the survey. Illiterate parents and youth who dropped out of school were excluded as they cannot read Modern Standard Arabic. The omission of school dropouts might not affect the representativeness of the youth sample, as the drop-out rate from Egyptian middle schools is currently only 0.87% [11]. However, the representativeness of the parent sample is limited, as 24.9% of Egyptian adults were illiterate in 2019 [11]. Finally, the sample was partially recruited during COVID-19 lockdown, which although limited in Egypt, might still be a psychosocial stressor that could have altered the psychological status of both youth and their families.

Conclusions

The use of standardized psychometric tools in assessment of youth is still a rare phenomenon in the clinical practice of child and adolescents in Egypt and other Arab countries. By providing Egyptian data for ASEBA forms, our study may help practitioners to screen, assess, and follow up Egyptian and other Arab children and adolescents with emotional and behavioral problems.