Assessing Reactive and Proactive Aggression in Detained Adolescents Outside of a Research Context

The Reactive Proactive Aggression Questionnaire (RPQ) is a self-report tool for assessing reactive aggression (RA) and proactive aggression (PA). This study contributes to the literature by testing the psychometric properties of the RPQ across detained boys from various ethnicities whilst using data that were gathered during clinical assessments. The factorial, convergent, and criterion validity, and the internal consistency of the RPQ scores received strong support in the total sample and across four ethnicity groups. Also, three groups of boys were identified, with the group including boys with high levels of both RA and PA including the most severe boys in terms of anger, delinquency, alcohol/drug use, and psychopathic traits, and having the highest prevalence rate of conduct disorder and substance use disorder. Together, these findings suggest that the RPQ may hold promise for assessing RA and PA in detained boys, even when confidentiality and anonymity of the information is not guaranteed.


Introduction
Aggression is an umbrella term that captures different types of aggression that differ in underlying mechanisms, forms, functions, and prognosis [1]. One distinction that received a lot of attention relates to the differentiation between proactive and reactive aggression. Reactive aggression is a hostile, impulsive, and unplanned reaction to perceived frustration or threat [2]. Reactive aggression is often accompanied by anger and rage, autonomic arousal and loss of impulse control, and, therefore, has been described as a 'hot' form of aggression [3,4]. Proactive aggression is displayed in the absence of provocation or anger [5], with the goal to take possession of things, or to dominate or intimidate others [6]. Because no or only little autonomic arousal is involved, proactive aggression has been described as a 'cold' form of aggression [3]. Although reactive aggression and proactive aggression are highly correlated [7], previous research in adolescents has shown that both types of aggression differ in the direction and/or strength of relations to variables of interest. With respect to internalizing problems, reactive aggression is positively related to depression [8], suicide risk [9], and anxiety [10], whereas proactive aggression is not. With respect to externalizing problems, proactive aggression is positively correlated to conduct problems [8], bullying [11] and substance abuse [7], whereas reactive aggression is not, or less strongly, related to these features. In addition, reactive aggression, but not proactive aggression, is positively related to impulsivity [12]. With respect to offending, proactive aggression, but not reactive aggression, is independently and positively related to offending in general [12], and violent offending in specific [7], though it must be noted that studies have also shown a positive relationship between reactive aggression and offending, or did not reveal any relationship between both types of aggression and offending [13]. With regard to psychopathic traits, studies have shown that these traits were merely or most consistently related to proactive aggression [14,15].
The Reactive Proactive Aggression Questionnaire (RPQ; 12) is a promising self-report questionnaire designed to assess reactive and proactive aggression in children and adolescents (see ''Methods'' for more details). Factor analyses supported the two-factor structure of the RPQ in different settings [11,14], in different countries/cultures [16,17], and in boys and girls [16]. Overall, the internal consistencies of the RPQ total score and the reactive aggression (RA) and proactive aggression (PA) scale scores are acceptable to very good [14,18]. Finally, the convergent and criterion validity of RPQ scores are supported by evidence that the total score, and the RA and PA scales manifested the expected relationships with variables of interest [8,12,14]. As parents or teachers of detained adolescents are very often not available, unable or unwilling to provide information [19,20], the availability of a reliable and valid self-report tool is very much welcomed to assess reactive and proactive aggression among these criminal justice-involved adolescents. However, in clinical practice, detained adolescents may be reluctant to provide information that is unknown to their parents and clinicians (e.g., aggression, drug use), and can be used against them (e.g., in court). Therefore, it is highly relevant to test if RPQ scores are still as reliable and valid when detained youths complete the RPQ as part of a clinical protocol, thus, outside of a research context where anonymity and confidentiality of the information is guaranteed.

This Study: Aims and Hypotheses
Worldwide, youths who are culturally different from the culture of the host nation are overrepresented in youth detention centers [21]. However, we are aware of no study that has examined the psychometric properties of the RPQ across detained youths from various ethnicities. As such, the first aim of the present study was to fill this void by examining the factorial, convergent and criterion validity of RPQ scores. It was hypothesized that a good model fit for the two-factor structure and good internal consistencies of the RPQ scores would be revealed, not only in the total sample but also in the different ethnicity groups. In support of the convergent validity of RPQ scores, it was hypothesized that positive relationships between the RPQ scores and other indices of aggression would be revealed. Specifically, it was expected that the RA score would be strongly related to anger and irritability. In contrast, and because proactive aggression is often displayed in the absence of anger, it was expected that the relationship between the PA score and anger would be poor at best. In support of the criterion validity of RPQ scores, it was expected that the RA score would be most strongly and consistently positively related to depressive feelings, anxiety, and suicide risk, and that the PA score would be most strongly and consistently positively related to substance use, callous-unemotional traits, and offending.
Demonstrating different correlates for reactive and proactive aggression is important but not sufficient to support the usefulness of the dichotomy in clinical practice, in which clinicians deal with persons rather than variables. The second aim of this study was to test whether meaningful groups of detained youths could be identified that differ in their levels of reactive and proactive aggression. Prior work showed that individuals with proactive aggression typically show significant levels of reactive aggression as well [22]. In addition, several studies also identified a group of individuals with high levels of reactive aggression only and a group with low levels of proactive and reactive aggression (for an overview see [23]). Though not impossible, prior studies suggested that it is difficult to identify individuals that merely display high levels of proactive aggression [5,[23][24][25]. Interestingly, there is evidence that youths with high levels of reactive and proactive aggression differ from youths with reactive aggression only; with the former group showing the highest levels of aggression, impulsivity, bullying, anger dysregulation, and callous-unemotional traits [18,22,24]. Of note, these two groups were not different with respect to social competence, depressive-anxious feelings, expectation for rewards, and thrill-seeking behavior [18,22,24]. On the basis of prior research, it was hypothesized that detained adolescents with high scores on reactive and proactive aggression, and detained adolescents with high scores on reactive aggression only would be identified and that both groups would have more internalizing and externalizing problems, alcohol and drug use problems, and higher prevalence rates of conduct disorder and substance use disorder than youths with low scores of reactive and proactive aggression. In line with a severity model [22], youths with high scores on reactive and proactive aggression were also expected to have more problems and higher prevalence rates of conduct disorder and substance use disorder than youths with high scores on reactive aggression only.

Participants
The current study used data involving male adolescents from two large youth detention centers in the Netherlands.
These data were gathered as part of the standardized mental health screening and assessment the two youth detention centers provide to each youth entering the institution. In the first phase of implementing standardized mental health screening and assessment (May 2008-July 2009), the Massachusetts Youth Screening Instrument-Second Version [26] and Strengths and Difficulties Questionnaire [27] were implemented. In the second phase (July 2009-July 2011), an extensive screening and comprehensive psychiatric assessment was implemented by means of self-report questionnaires (e.g., RPQ and self-reported offending), and structured psychiatric diagnostic interviews (see ''Measures''). In the third phase (from July 2011), the presence of oppositional defiant disorder, conduct disorder, and substance use disorders other than alcohol and marijuana, was no longer assessed, and some self-report questionnaires were no longer administered to the youths (e.g., selfreported offending). For the purpose of the current study, data for 807 detained male adolescents who completed the RPQ, the Massachusetts Youth Screening Instrument-Second Version (MAYSI-2), and the Strengths and Difficulties Questionnaire (SDQ) were made available to the author. For a sub-sample of these 807 youths, data from other screening and assessment tools were available and were also used in the current study. Because of the developments described above (see the third phase) and because some disorders could not be assessed in some youths (e.g., release before the clinical screening and assessment protocol was complete), the number of youths that were used when examining the relationship between RPQ scores and variables of interest assessed by other instruments than the MAYSI-2 and SDQ will be lower than 807 (see ''Measures''). The mean age of our sample (N = 807) was 16.71 years (SD = 1.30). With respect to ethnicity, 20.7 % were of Dutch ethnicity, 26.9 % were of Moroccan ethnicity, 19.8 % were of Dutch Antillean or Surinamese ethnicity, and 31.1 % were from another ethnicity (e.g., Turkish). Data regarding ethnicity were missing for 1.5 % of the sample. In addition, 95 % of the boys were detained while awaiting final trial (pretrial), whereas the remaining 5 % were detained following conviction. The number of days between entrance into the facility and completing the RPQ ranged from 0 to 66 days and was on average 5.66 (SD = 4.54).

Reactive Proactive Aggression Questionnaire (RPQ)
The RPQ [12,14] includes 23 items that were based on previous teacher-rating measures of reactive aggression and proactive aggression, and on conceptual and theoretical relevance. These items were developed to reflect physical or verbal aggression and the motivation and situational context for the aggression. Eleven items focus on reactive aggression (e.g., Reacted angrily when provoked by others, Gotten angry when frustrated), and twelve items focus on proactive aggression (e.g., Had fights with others to show who was on top; Taken things from other students). The items must be answered as never, sometimes or often.

Massachusetts Youth Screening Instrument-Second Version (MAYSI-2)
The MAYSI-2 [26,28] is a 52-item screening tool (yes or no responses) in which youths report the presence or absence of symptoms or behaviors related to several areas of emotional, behavioral, and psychological disturbances experienced ''within the past few months.'' Factor analyses indicated that the items produce scores on six clinical scales: alcohol-drug use, angry-irritable, depressed-anxious, somatic complaints, suicide ideation, and thought disturbance (for boys only) and one non-clinical scale (traumatic experiences) that screens for exposure to potentially traumatic events. There is no MAYSI-2 total score as the developers did not intend to develop scales that contribute to a broader construct, such as internalizing problems [26]. For the purpose of this study, four scales of the MAYSI-2 were used: alcohol/drug use (eight items; a in the present study = .83); angry-irritable (nine items; a = .77); depressed-anxious (nine items; a = .66); and suicide ideation (five items; a = .73). The Dutch MAYSI-2 has been shown to have promising psychometric properties [29].

Strengths and Difficulties Questionnaire (SDQ)
The self-report version of the SDQ [27,30] is a screening instrument for psychosocial functioning for children and adolescents. The SDQ has four difficulty subscales (hyperactivity, conduct problems, peer problems, emotional symptoms) and one strength subscale (prosocial behavior). Each of these five SDQ subscales consists of five items that need to be answered as being not true, somewhat true, and certainly true. In this study only the prosocial behavior subscale (five items; e.g., I often volunteer to help others; a = .62) was used. 1 A high score on this scale is indicative of high levels of prosocial behavior. 1 The MAYSI-2 was specifically designed for use in juvenile justice involved youths. Therefore, if constructs that are relevant (e.g., depressive feelings) to test the convergent and criterion validity of the RPQ were also measured through questionnaires other than the MAYSI-2, the MAYSI-2 was given preference. Youth Self-Report (YSR) The YSR [31] includes 118 'problem' items that youths answer as being not true, sometimes true, or very true for themselves. The responses to the problem items contribute to eight narrow-band scales that identify problem areas (withdrawn/depressed, somatic complaints, anxious/depressed, social problems, thought problems, attention problems, rule-breaking behavior and aggressive behavior).
For the purpose of this study, only the aggression behavior (17 items; a = .84) and social problems (11 items; a = .63) subscales were used. Data were available for 443 youths.

Diagnostic-Interview Schedule for Children-Fourth Version (DISC-IV)
The DISC-IV is a structured diagnostic interview that covers many psychiatric diagnoses in DSM-IV and ICD-10, and can be administered by trained non-clinicians [32,33]. For the purpose of this study, the past year prevalence of conduct disorder (CD) and substance use disorder (SUD) was assessed. SUD refers to the presence of alcohol use disorder, marijuana use disorder, and/or other SUD (e.g., cocaine, amphetamines). In addition, a continuous variable was created, reflecting the number of aggressive DSM-IV CD symptoms that were reported (range 0-7). Finally, participants with CD who reported at least one aggressive CD symptom were also referred to as having aggressive CD. Data regarding CD and SUD were available for 424 and 537 youths, respectively.

Youth Psychopathic Traits Inventory (YPI)
The YPI [34] is a self-report questionnaire with 50 items that are organized into 10 subscales (with five items each) and three dimensions, including an interpersonal (a = .89), affective or callous-unemotional (a = .77), and behavioral/lifestyle dimensions (a = .86). Each item in the YPI is scored on a four-point Likert scale, ranging from ''Does not apply at all'' to ''Applies very well.'' Data were available for 757 youths.

Self-Reported Offending
The Research and Documentation Center Monitor [35] was used to determine whether or not participants ever committed an offense. Based on previous studies [36], five continuous offense categories were created. First, seven items referring to violence were classified as ''violent offenses'' (e.g., using violence to steal from someone, trying to have sex while the other refuses). Second, 11 items referring to income-related non-violent delinquent behaviors were classified as ''property offenses'' (e.g., burglary, shoplifting, theft in school). Third, five items referring to deliberately damaging property were classified as ''vandalism'' (e.g., damaging a car). Fourth, three items referring to dealing or selling drugs were classified as ''drugrelated offenses'' (e.g., selling hard drugs). Fifth, five items referring to threatening and insulting were classified as ''threats and insults'' (e.g., threatening someone to make him or her scared, insulting someone because the other person is homosexual). Data were available for 430 (violent offenses) up to 438 (drug-related offenses) youths.

Ethnicity
Based on the Dutch standard classification of ethnicity, participants were categorized as ''Moroccan'' or ''Antillean/Surinamese'' when the adolescent and/or at least one parent had been born in Morocco, or the Dutch Antilles or Surinam, respectively. When both parents were of non-Dutch ethnicities, we used the country of birth of the mother to determine the ethnicity of the child. Participants were classified as Dutch when both parents and the child were born in The Netherlands. All other participants (e.g., from Turkish, Afghan, Italian, and Polish origin) were assigned to the mixed ethnicity group.

Procedure
The MAYSI-2 and SDQ were administered on a standalone computer in the presence of non-clinical personnel from the youth detention centers (YDCs) to all youths within a few days after detention entry. Master students and test assistants with a Master's degree trained by clinically experienced researchers performed the comprehensive assessments. Youths were aware that the mental health screening and assessment were part of the YDCs' clinical protocol, and that the outcomes from mental health screening and assessment were available to YDCs personnel. Through standardized information provided by the YDCs upon start of detention, youths and their parents/care-takers were informed that the mental health screening and assessment outcomes would be used for scientific research, unless they declined (cf. passive informed consent). They were also informed that, if they did not decline, their information would be transferred anonymously to the researchers, so that it would be impossible to trace the information back to them. Given that routine mental health screening and assessment was part of clinical care, the relevant boards of the YDCs waived the requirement to obtain active informed consent from youths, and for youths \18 years of age, to obtain active informed consent from their parent(s)/caretaker(s) as well. The Medical Ethical Review Board of the Leiden University Medical Center certified that the study met the Dutch law of behavioral research because all data were derived as part of the clinical assessment.

Data-Analyses
To test the two-factor model of the RPQ, confirmatory factor analyses (CFA) were performed using Mplus 6.1 [37]. Because the items of the RPQ are scored on an ordinal scale, the robust weighted least squares estimator was used. Model fit was assessed using v 2 , root mean square error of approximation (RMSEA), and the comparative fit index (CFI). RMSEA scores \.05 indicated a good fit, whereas scores between .05 and .08 indicated an acceptable fit. A CFI score C.95 indicated an excellent fit, and a CFI score C.90 indicated a good fit [38]. With respect to v 2 , a good fit is indicated when v 2 /df B 2, whereas v 2 /df B 3 is indicative of an acceptable fit [39]. To evaluate the internal consistency of the RPQ scores, Cronbach's alphas (a) were calculated. Reliability coefficients were interpreted as follows: \.60 = insufficient; .60-.69 = marginal; .70-.79 = acceptable; .80-.89 = good, and .90 or higher = excellent [40]. Differences between ethnicity subgroups were examined using one-way analyses of variance for continuous variables (e.g., RPQ scores) and Chi square tests for categorical variables (e.g., CD) using Bonferroni correction for multiple comparisons. In case assumptions for one-way analyses of variance were violated differences between ethnic subgroups were examined with a series of Mann-Whitney tests (U). To test the convergent and criterion validity of the RPQ total score and the RA and PA scores, bivariate linear regression analyses (in case of continuous dependent variables) and bivariate logistic regression analyses (in case of categorical dependent variables) were performed. To test the unique association between the RA and PA scores on the one hand and variables of interest on the other hand, these regression analyses were repeated while simultaneously including the RA and PA scores as a predictor. Of note, the RPQ total, the RA, and PA scores were not significantly related to age in the total sample or in the four ethnicity groups, so age was not included in the regression analyses. K-means cluster analyses were performed to test whether meaningful clusters of youths could be identified that differ in standardized RA and PA scores. Between cluster-comparisons were performed in the same way as described for the between ethnicity groups comparison. Because of the large number of significance tests conducted, an alpha of p \ .01 was used as an indicator for statistical significance. All analyses were performed using SPSS 20.0 unless otherwise specified.

Factorial Validity and Internal Consistency
Model fit indices of the two-factor model for the total sample were in the adequate range, with the exception of the v 2 /df ratio, which was 4.17, and thus indicative of an unacceptable fit (Table 1). For the four ethnicity groups, the v 2 /df ratio and RMSEA were below (i.e., Moroccan and mixed) or just above (i.e., Dutch and Antillean/Surinamese) the recommended cut-off values, and thus indicative of a good model fit. Also, in all but one of the ethnicity groups, the CFI was greater than the .95 CFI cut-off value, and thus also indicative of a good model fit. For Antillean/ Surinamese youths, the CFI suggested the model fit to be acceptable (.93). The relative v 2 difference test indicated a significantly better fit for the two-factor model over the   Table 1 also shows good-to-excellent internal consistency of the RPQ total score, and the RA and PA scores. In Antillean/Surinamese youths, a for the PA scale was just below the cut-off value for good reliability, but was still acceptable. Finally, the correlation between the RA and PA scores was .66 (total sample), .65 (Dutch), .61 (Moroccan), .64 (Antillean/Surinamese), and .62 (mixed ethnicity). All correlations were significant at p \ .001. Table 2 shows the mean RPQ scores for the total sample and the four ethnicity groups. Moroccan boys had significantly lower RPQ total and RA and PA scores than all three other ethnicity groups, and boys of mixed ethnicity had significantly lower scores than Dutch and Antillean/ Surinamese youths. The magnitude of the differences between Moroccan and Dutch boys was moderate (Cohen's d: total = .71; RA = .66; PA = .61), whereas the magnitude of the differences between Moroccan and Antillean/Surinamese boys ranged from moderate-to-almost large (Cohen's d: total = .79; RA = .75; PA = .66). D's for significant group differences presented in Table 2 but not described here were (far) below .50 (available upon request).

Convergent Validity
In the total sample, the RPQ total score was significantly positively related to aggressive behavior, angry-irritability and number of aggressive CD symptoms (Table 3). At the zero-order level, both RPQ scale scores were positively related to these outcomes as well. After controlling for the PA score, the RA score remained significantly related to angry-irritable features and aggressive behavior, but was no longer significantly related to the number of aggressive CD symptoms. After controlling for the RA score, the PA score remained significantly related to aggressive behavior, aggressive CD symptoms and angry-irritability (Table 3).
In each of the four ethnicity groups, 2 results were generally similar to the results reported for the total sample, with two exceptions, though. First, after controlling for the RA score, the PA score was not significantly related to aggressive CD symptoms Antillean/Surinamese and mixed ethnicity boys. Second, after controlling for the RA score, the PA score was not significantly related to the angryirritability in boys from Dutch, Moroccan and mixed ethnicity (Table 3).

Criterion Validity
In the total sample, all RPQ scores (zero-order level) were negatively related to prosocial behavior and positively related to all other outcomes presented in Table 4. After controlling for the PA score, the RA score was positively related to symptoms of depression/anxiety, suicide ideations, social problems, alcohol and drug use, SUD, psychopathic traits and threats/insults, but unrelated to prosocial behavior, (aggressive) CD, violent offenses, theft, vandalism and drug offenses. After controlling for the RA score, the PA score was not significantly related to symptoms of depression/anxiety, suicide ideations, social problems, negatively related to prosocial behavior, and positively related to all the other outcomes.
When repeating the analyses in each of the four ethnicity groups (Table 5), the pattern of relationships between the RPQ scores (zero-order level) and variables of interest were substantially the same as reported for the total sample. Table 5 also shows that after controlling for the other score the pattern of relationships between the RA or PA scores and variables of interest were mainly the same as reported for the total sample, with a few notable exceptions being described next. After controlling for PA, RA was not related to feelings of depression and anxiety in boys from mixed ethnicity, not related to suicide ideations in boys from Dutch and mixed ethnicity; not related to social problems in Dutch and Antillean/Surinamese boys; and not related to alcohol/drug use, substance use disorder, the interpersonal and affective psychopathy dimensions, and threats/insults in Dutch, Moroccan, and Antillean/ Surinamese boys. After controlling for RA, PA was not related to prosocial behavior in Dutch and mixed ethnicity boys, and was not related to SUD, aggressive CD, and threats/insult in boys from mixed ethnicity.

Deriving Clusters
K-means cluster analyses were performed to determine whether three clusters of youths could be meaningfully identified in the total sample. The results showed a cluster with boys scoring [ SD below the mean on RA and PA (labeled as low aggression), a cluster with boys scoring [1 SD above the mean on RA and PA (labeled as combined aggression), and a cluster of boys scoring [ SD above the mean for RA and close to the mean for PA (labeled as reactive aggression only). Next, it was tested whether a four-cluster solution would identify a group of youths with high scores on PA but with low scores on RA. The same three clusters as described above were revealed as well as a fourth cluster of youths with a high score on PA but with a score on RA that was as high as the reactive aggression only cluster. As such, the three-cluster solution was used in all further analyses. Additional analyses showed that this three-cluster solution could be replicated in the four ethnicity groups, that the number of youths in each cluster was relatively similar across the four ethnicity groups, and that in each ethnicity group most youths were assigned to the low aggression cluster, followed by the reactive aggression only and combined aggression clusters (Table 6).

Comparing Clusters
The three clusters were not significantly different regarding the number of youths from various ethnicity, except that the low aggression cluster included more Moroccan boys (15 %) than the combined aggression cluster (13 %), but fewer than the reactive aggression only cluster (19 %). The low aggression cluster also included fewer Antillean/ Surinamese boys (16 %) than the combined aggression cluster (28 %). Table 7 shows that for all variables of interest, except for peer problems, boys in the combined aggression and the reactive aggression only cluster had significantly higher scores and prevalence rates (but less prosocial behavior) than the boys in the low aggression cluster. Only boys in the reactive aggression only cluster had more peer problems than boys in the low aggression cluster. Table 7 also shows that for all variables, except depressed/anxious feelings, suicide ideation, prosocial behavior, and peer problems, boys in the combined aggression cluster had significantly higher scores and prevalence rates than boys in the reactive aggression only cluster.

Discussion
The current study was designed to test the psychometric properties of the RPQ and its usefulness to identify meaningful subgroups of detained youths. Notwithstanding that various studies already examined reactive and proactive aggression in detained youths, this study substantially contributed to the literature by examining associations with variables of interest across ethnic groups whilst using data that were gathered outside of a research context. Overall, our findings provided strong support for the factorial validity, convergent and criterion validity and the internal consistency of RPQ scores in the total sample and in each ethnicity group. In addition, three meaningful clusters of youths could be identified, with the combined RA and PA cluster including the most severe boys in terms of aggression, anger, delinquency, alcohol and drug use, psychopathic traits, and prevalence rates for CD and SUD.
Results from CFA supported the two-factor structure over the one-factor structure of the RPQ, and showed that a significant distinction can be made between reactive aggression and proactive aggression in detained adolescent males. Specifically, all model fit indices were indicative of an acceptable or good model fit in the total sample and in youths from various ethnicities, except for the v 2 /df ratio for the total sample. However, with increasing sample size and a constant number of degrees of freedom, the v 2 value increases, and the v 2 /df ratio, therefore, may suggest to reject a plausible model [38]. Because the v 2 /df ratio was below the cut-off value in three subgroups and because all the other fit indices supported the two-factor model of the RPQ, this model can be considered to be acceptable in the total sample as well.
Our findings also provide support for the convergent validity of the RPQ scores. At the zero-order level, RPQ total, RA and PA scores, were positively related to other indices of aggressive behavior and features of anger-irritability. Also, after controlling for the PA score, only the RA score remained significantly related to anger and irritability, but was no longer related to aggressive CD symptoms. These results support the view that reactive, but not proactive aggression, is often accompanied with anger and a loss of impulse control [4,5]. Although some of the aggressive CD symptoms can occur as an uncontrolled response to frustration or anger (e.g., forcing someone into sexual activity, initiating fights, using a weapon that can cause serious physical harm), the RA score was never significantly related to aggressive CD symptoms after  controlling for the PA score. Yet, after controlling for the RA score, the PA score remained significantly related to aggressive CD symptoms (Table 3) in the total sample and Dutch and Moroccan boys. This suggests that the aggression displayed by detained youths with a CD diagnosis is likely to be premeditated and planned, a notion that is supported by the finding that only the PA score was positively related to aggressive conduct disorder (Tables 4,  5).
The results also supported the criterion validity of the RPQ score in detained male youths. As hypothesized, only the RA score was positively related to depressive feelings, anxiety, and suicide ideation after controlling for the other RPQ scale score. Although there were no clear expectations about the relationship between the RPQ and social problems, the RA score was positively associated with this outcome in the total sample and some ethnicity groups. Overall, our findings are in accordance with recent work, including studies that scrutinized relations with suicide risk, social problems and peer rejection [41,42], and support the claim that reactive aggression is an indicator of overall poor psychosocial adjustment [13]. However, the results do not support the suggestion that reactive aggression is primarily related to low prosocial behavior, and that proactive aggression has little or no association with prosocial behavior independent of reactive aggression [13]. In contrast, the present study showed that only the PA score was significantly negatively related to prosocial behavior after controlling for the RA score. Given that few studies addressed the relationship between self-reported reactive and proactive aggression and prosocial behavior, future studies are warranted. The finding that a higher PA score was associated with a lower level of prosocial behavior, nevertheless, corresponds with the finding that only PA was positively related to self-reported offenses (Tables 4,  5).
After controlling for the other RPQ scale score, only the PA score was positively related to alcohol and drugs use and a SUD in boys from Dutch, Moroccan and Antillean/ Surinamese ethnicity. Yet, in boys of mixed ethnicity the RA score also was positively related to both outcomes (after controlling for the RA score), which may explain why this relationship was also reported for the total sample. Overall, this study supports the suggestion that proactive aggression is more likely to be related to alcohol and substance use than reactive aggression [43], and replicates prior RPQ work with students that showed that only the PA score was positively related to substance use [7]. Also, in agreement with this latter study [7] was the finding that only the PA score was positively related to violence and theft, whereas the RA score was not. This study added to the rather limited literature on the relationship between reactive and proactive aggression and delinquency by showing that the RPQ's PA score, but not its RA score, was positively related to all other types of offenses as well (except for threats and insults in youths from a the mixed ethnicity group). This suggests that youths with high levels of proactive aggression are not only amongst the most violent offenders, but are also likely to be versatile offenders. Finally, and after controlling for the other RPQ scale score, only the PA score was significantly positively related to the affective (and interpersonal) psychopathy dimension, a finding that supports the view that proactive aggression is a cold form of aggression [44].
Although most often small in magnitude, significant RPQ mean score differences between the four ethnicity groups were revealed, with Moroccan boys reporting the lowest RA and PA scores. In each ethnic group, boys had higher RA scores than PA scores a finding that dovetails with prior RPQ work that compared youths from various other ethnic groups [16,24]. Of note, Moroccan boys had the lowest scores on aggression, but also on most of the other variables used in this study, including mental health problems (e.g., depressed/anxious, angry-irritable, alcohol/drug use), and self-reported offenses, a finding that also converges with prior work in boys being detained in the Netherlands [21,45]. Importantly, the validity of the RPQ scores was well supported in all four ethnicity groups, a finding that bears substantial clinical relevance as detained youths are most often not from the major ethnicity group of the country where they live and are being detained. Thus, to the extent that clinicians have an interest in assessing reactive and proactive aggression through standardized tools, they need to have confidence that these tools provide a reliable and valid assessment of reactive and proactive aggression regardless of the boys' ethnicity. The present study provides preliminary support that the RPQ fulfills this requirement. Person-centered analyses identified three groups of detained youths that previously had been labeled as low aggression, combined aggression, and reactive aggression only [23]. Importantly, these groups differed from each other in a meaningful and clinically relevant way. Specifically, boys in the combined aggressive cluster had more social problems, displayed more aggression, and anger, more often met criteria for CD and SUD, reported higher levels of psychopathic traits, and committed more violent and other types of offenses than boys in the other two clusters and effect sizes (ES) indicated that most of these differences were large. Of note, boys in the reactive aggression only cluster were also more disturbed than boys in the low aggression cluster. These results clearly suggest that the combined aggression and reactive aggression only clusters differ in severity of risk factors rather than in the type of risk factor [22]. Of note, our finding contrasts the notion that CU traits are a unique risk factor for combined aggressive youths [22,23,46]. In the present study, both the combined aggression and the reactive aggression only clusters had much higher levels of CU traits than boys from the low aggression cluster (ES of .80 or higher), whilst the difference in the level of CU traits between both the combined aggression and reactive aggression only clusters was small in magnitude (ES B .30). Future studies are needed, especially because it has been speculated that juveniles with high levels of reactive aggression are low in CU traits and are distressed by the effect of their behavior [22,46]. Finally, although the combined aggression and reactive aggression only clusters were significantly different in terms of depressive and anxious feelings, and suicide ideations (see also [18]), boys in both clusters displayed higher levels of these problems than boys in the low aggression cluster. Taken together, reactive aggression is likely to be a robust risk factor for internalizing turmoil [13], whether or not co-occurring with proactive aggression.
As always, the results of this study must be interpreted in the context of several limitations. First, the cross-sectional study design does not allow making conclusions about the temporal relationships between reactive and proactive aggression and criterion variables of interest. Second, due to the sole reliance on self-reported information, it cannot be excluded that strong associations between variables of interest and differences between the clusters are inflated due to shared method variance. Third, the number of youths within each of the ethnic groups was too Table 6 Mean reactive aggression (RA) and proactive aggression (PA) scores within three cluster in the total sample and in ethnic groups  restricted to determine whether the differences and similarities between the three clusters could have been replicated in each of these groups. Fourth, no experimental design was used to determine whether RPQ scores differed between youths who had or had not been assured that the information they provided would be handled confidentially and anonymously. Fifth, only detained males were included in this study, implicating that this is the only population that an inference can be drawn upon.

Summary
This study suggests that the RPQ provides a reliable and valid assessment of reactive and proactive aggression in severely antisocial, criminal-justice involved adolescents from different ethnicities, even in a context in which confidentiality and anonymity of the information are not guaranteed. Both the variable-oriented and person-oriented analyses supported the notion that there are meaningful differences between reactive and proactive aggression, with reactive aggression being most robustly related to internalizing problems and disorders, and proactive aggression being most robustly related to externalizing problems and disorders, self-reported offending and psychopathic-traits. These findings altogether support prior suggestions that reactively and proactively aggressive youths have different treatment needs [47]. childhood reactive and proactive aggression. Biol Psychol 84(3):488 Table 7 Means (SD) for each cluster and differences between three clusters regarding variables of interest