Introduction

Evidence-based targeting aims to find the most efficient and cost-effective allocation of police resources. A process of prioritising resources—or triaging—can be more transparently justified based on clear evidence than on subjective preferences. Numerous evidence-based targeting strategies have emerged in recent years, establishing one clear and consistent theme: not all crimes or criminals are equally harmful (Sherman et al. 2016). The evidence points to a clear “Pareto curve” phenomenon: not only are most criminal events attributable to a small group of people, places or times (Sherman 2007); so, too, is most crime harm concentrated in a small percentage of places (Weinborn et al. 2017), victims (Dudfield et al. 2017) and victim-offenders (Sandall et al 2018).

In this article, we present what may be the first published evidence on the disproportionate concentration of harm amongst a unit of analysis previously uncharted for harm: co-offenders. Co-offending is a widespread characteristic of criminal events (Reiss Jr. 1988), yet policing scholars have generally ignored its potential for targeting. For example, group size alone was found to predict the harm caused by organised crime groups and gang members in the West Midlands, UK (Denley and Ariel 2019). Similarly, based on data on co-offending networks compiled from intelligence records in London, it is evident that co-offending is a strong correlate of future offending (Linton 2016). Co-offenders were also found to produce substantially higher levels of crime counts compared with ‘solo’ offenders in Sacramento, California (e.g. Englefield and Ariel 2017). Despite this growing body of evidence, we suspect that most police departments do not (yet) strategically target co-offenders. Rather, they often seem to guide their activities on the basis of tactical opportunities and ad hoc operations (Cornish and Clarke 2002, pp. 55–57).

To address the gap in the literature about the harm composition generated by co-offending networks, we have analysed police charging records in Denmark. Currently, the Danish Police force identifies offenders who will be subjected to prioritised targeting based on a variety of factors, including the number of prior offences, but especially crimes indicating their role in a criminal network (gangs, organised crime groups, etc.). These offenders are ‘on-boarded’ for police interventions based purely on professional judgement tools, including the Canadian SLEIPNIR method (Strang 2009), rather than on the basis of an actuarial, systematic approach to triaging. Despite the accumulated evidence on poor predictive accuracy of professional judgement (see, for example MacBeth and Ariel 2017; Sutherland and Angel 2019), more evidence is clearly necessary in each domain of police targeting.

Research Questions

Our study therefore aims to explore a potential solution to this issue by applying the concept of the ‘power few’ to a database of co-charged offenders in Denmark. In this regard, we are primarily interested in two issues related to co-offending patterns:

  1. 1.

    To what extent are volume of crime and crime harm concentrated amongst populations of co-offender relationships?

  2. 2.

    What does social network analysis (SNA) tell us about the distribution of harm within populations of co-offenders?

Co-offending

The term ‘co-offending’ (Reiss Jr. 1988) is used to identify different types of relationships between offenders in groups of two or more, including gangs and organised crime networks (Carrington 2002). We should emphasise that gang-related and organised crime represents a small proportion of all crimes that are committed by co-offenders (Englefield and Ariel 2017). Co-offending represents a wide range of ‘criminal enterprises’, and not just well-organised groups. It includes any crimes in which people offend in cooperation with others (McGloin et al. 2008). This means that co-offending groups vary in group size, longevity of the group and the extent to which they are specialists or diverse in their criminal portfolio (Lantz and Hutchison 2015; Sorensen 2009; Warr 1996). The term can also cover the execution of both simple and complex crimes, including those for which special skills are required (Weerman 2003).

There are different findings on the prevalence of co-offending in society between studies using police-records and studies using self-reported offending. Warr (1996) looked at self-reported offences drawn from the National Survey of Youth; he found that 73% of self-reported delinquent acts involved co-offending. In contrast, police records studied by Carrington (2002) and Van Mastrigt and Farrington (2011) found that no more than 15% of offences involved multiple offenders. Any study that relies on official statistics alone necessarily misses the majority of crime incidents because much of the co-offending crime phenomenon is ‘unknown’ to the police. These findings remain useful, however, as the most systematic and reliable current data on co-offending networks for police targeting decisions.

Given the wide variability in the types of co-offending relationships that can exist, we also see evidence that co-offending is not equally distributed across crime types (McCarthy & Hagan, 2001). Studies examining the distribution of co-offending across offence types have found that it is more prevalent in burglary and robbery, but violent offences are more likely to be solo-offences (Carrington 2002; Carrington and van Mastrigt 2013; Reiss and Farrington 1991; van Mastrigt and Farrington 2009). Juveniles are more likely to take part in co-offending behaviour than adults (McCord and Conway 2002).

The ‘Power Few’ of the Targeting of the Most Harmful Offenders

The concept of the ‘power few’ (Sherman 2007) takes its inspiration from the Pareto curve, a non-linear j-curve in which a small percentage of units in any given population accounts for any given characteristic of that particular population (Eck et al. 2007; Sherman 2007; Zipf 1949). Concentrations of crime in time, space and populations have been extensively documented (see Ariel and Partridge 2017; Weisburd and Mazerolle 2000; Weisburd and Eck 2004). These studies collectively show that a disproportionate amount of crime is committed/experienced by a relatively small number of units (Sherman et al., 1989; Sherman 2007). Limiting the number of potential units greatly increases our ability to then predict and potentially prevent future risk these power few units cause to society (see Zeng et al. 2017).

However, the overwhelming majority of studies that have investigated disproportionate crime concentrations have paid insufficient attention to the fact that not all crimes cause equal damage to victims (Sherman et al. 2016). Recently, analyses have emerged that consider these ‘power few’ concentrations through the lens of harm, rather than purely based on crime frequency or volume. As pointed out by Norton et al. (2018), a crime hotspot with 30 robberies in a given year is not the same as one with 30 thefts: the severity of these behaviours is not the same. In domestic violence studies, a very small subset of offenders has been found to cause the most harm to victims (Barnham, Barnes, & Sherman, 2017; Bland and Ariel 2015). Dudfield et al. (2017) have shown that fewer than 5% of victims suffer 85% of the harm, with sex offences and robbery contributing almost two-thirds of total harm (63%). Similarly, a small subset of organised crime groups (OCGs) and gangs produce the most harm where these groups operate (Denley and Ariel 2019). Again, in place-based studies, it has been shown that harm is more heavily concentrated than crime counts (e.g. Weinborn et al. 2017). Collectively, then, targeting these power few harmful units of analysis in a strategic way seems more useful than targeting concentrations of crime count volume without regard to harm.

There are different methods of defining or measuring harm. One method that has gained traction is the use of a crime harm index based on sentencing guidelines. The first systematic approach to use these metrics was the Cambridge Crime Harm Index (Sherman et al. 2016)—and soon, several conceptual and literal replications emerged since (e.g. Mitchell 2017; Ratcliffe 2015). Each crime event is multiplied by the number of prison days a first-time offender would be sentenced to in court. The totals—or means per certain unit of measurement—can then be compared across places, individuals or groups of offenders. When the list of units is then rank-ordered based on frequency, the power few can then be identified, according to any parameter the researcher decides to use (e.g. two standard deviations from the mean, the top 5%, the top 100).

In this study, we use a variation of the Cambridge Crime Harm Index developed in Denmark by Andersen and Mueller-Johnson (2018). This Danish Crime Harm Index (DCHI) uses data from the Danish Director of Public Prosecutions Office—which specify the (recommended) sentence that the prosecutor should ask for in court—while prosecuting a first-time offender in a case with no mitigating or aggravating circumstances. Like its Cambridge counterpart, the DCHI excludes police-discovered offences such as drug crimes, which are more commonly an indicator of police proactivity than harm. Andersen and Mueller-Johnson’s (2018) analysis showed that theft and related offences—which represented 51% of all offences in 2016, in terms of volume—only accounted for 16% of inflicted harm. On the other hand, robbery only represented 1% of all offences in terms of volume, but 10% of harm.

Co-offending and Social Network Analysis

Social network analysis (SNA) is ‘a way of thinking about social systems that focuses our attention on the relationships amongst the entities that make up the system, which we call actors and nodes’ (Borgatti et al. 2013: p. 1). SNA examines the ties (relationships) between two (or more) nodes (actors) (Robins, 2015; Wasserman and Faust 1994). These ties between individuals tell us something about the people with whom an individual associates, and the focus or purpose of these ties (McGloin and Kirk 2010; Morselli et al. 2006).

There have been several studies that explore the utility of SNA in understanding co-offending populations. Research by McGloin and Kirk (2010) and Warr (2002) reveals that the number of co-offenders an offender is surrounded by is a powerful indicator of criminal behaviour; the more co-offenders to whom they are linked, the more ‘centrality’ they are said to have. Offenders generally co-offend with people they know, and actively seek out suitable candidates within their groups of associates (Warr 1996). Such relationships may be short-lived (Charette and Papachristos 2017). The most criminally active individuals, however, are also those with the highest level of centrality in a network (Haynie 2001; Lantz and Hutchison 2015).

Englefield and Ariel (2017) used SNA to explore the interconnectivity of groups of co-offenders, showing that some crime categories are more likely to see offenders engage in the recruitment of younger and less-experienced offenders—narcotics distribution and auto theft, for example. To this extent, SNA—as a conceptual as well as a technical approach—can be applied in the search for the most harmful power few offending networks in Denmark.

Data and Methods

Data and Settings

Denmark has one national police force that is divided into 12 territorial police districts. All registered offences in Denmark are recorded nationally in the Danish Police case management system, called POLSAS. For this study, we extracted from POLSAS all criminal charges under the Danish Criminal Code, the Illegal Substances Act and the Weapons Act in the 11 years between 2007 and 2017. The original dataset consisted of 1,554,943 records, which we reduced to 437,717 by excluding charges for all offences in which only one person was charged with the offence, thus excluding charges that did not involve co-offending. The total number of charges involving co-offending compared with the total number of charges generated for solo- and co-offences is illustrated in Fig. 1. The total number of charges (solo- and co-offending) increased over the period studied, whereas the charges involving co-offending remained relatively consistent throughout the same period.

Fig. 1
figure 1

Total number of charges involving both solo- and co-offending, 2007–2017

Analytic Process

Identifying the Harmful Power Few Co-Offenders

We built on the DCHI (Andersen and Mueller-Johnson 2018) and developed a fit-for-purpose harm index, referred to as the Danish Offender Targeting Tool (DOTT). The DOTT uses the weightings established by the DCHI, but also computed new values for crime categories explicitly excluded from the DCHI model, particularly around more serious crime categories—drug offences and organised crime. We acknowledge the need to separate between police-generated and public/victim-generated crime data. Because the measurement of the former is driven by enforcement policies, it does not constitute a measure of the rate of these crime categories (e.g. drug-dealing) in the population. Nonetheless, much of the targeting of potentially harmful offenders and co-offenders incorporates serious offences associated with individual offenders, such as weapon violations, possession of large quantities of drugs or enslaving people in unreported cases of human trafficking. Therefore, we incorporated 19 additional police-discovered categories and created 12 new groupings based on offence type, to the existing 46 crime categories included in the original DCHI. Values from the resulting DOTT were then assigned to each charge case in the database.

In order to detect the ‘power few’, offenders who were charged in groups of two or more were identified based on the charging data. These co-offenders were then rank-ordered twice—once based on the total number of charges in their criminal records, and then again with the associated DOTT score they had accumulated for their crimes. The accumulated crime-volume and crime-harm scores across the co-offending population were then calculated to identify the power few population that generated 50% of the total number of harm incidents involving co-offenders. Our rationale here was based on the UK Home Office study of 2001, which identified around 10% of the offender population that generated 50% of the total number of offences (Home Office 2001). Other studies may, of course, consider other thresholds, depending on their research questions.

Social Network Analysis

Due to computational limitations, SNA was conducted only on the co-offending network of the harmful power few co-offenders, rather than the entire population of co-offenders. We first isolated the power few crime-harm cohort and their co-offenders into a separate dataset. We then converted the charges data from a ‘two-mode’ network to a ‘one-mode’ network. The conversion was done using a Python-script specifically created for this dataset. This conversion was necessary in order to apply the SNA algorithms correctly. If an SNA algorithm was applied to a two-mode network, it would have given the now-converted node a centrality score, thereby possibly identifying the wrong nodes as central actors on the network. As illustrated in Fig. 2, if two offenders commit the same offence, in a two-mode network, it would look as follows: offender 1 commits offence 3 together with offender 2. However, in a one-mode network, the same offence is a link: offender 1 is directly linked to offender 2.

Fig. 2
figure 2

Illustrated comparison of a two-mode network and one-mode network

The centrality measure known as ‘betweenness centrality’ was applied to identify the nodes with the shortest path to all other nodes in the network (Borgatti et al. 2013; Freeman 1977; Newman 2005; Tayebi et al. 2011). The higher the measured value, the more central the actor is in the network. Betweenness centrality is often used when attempting to identify gatekeepers in a network (Borgatti et al. 2013)—the individuals who link networks together through their offending patterns.

Following this step, we used i2 Analyst Notebook 9.1 to create cartographic network charts visualising the co-offending networks embedded in the dataset. The size of each node illustrates how well the node—or co-offender—is connected to the rest of the network: the larger the node, the better connected the node is. Figure 3 shows a section of the main co-offending network. As can be seen, node #1751 is larger than most of the other nodes, indicating that node #1751 has the highest betweenness centrality score of the nodes seen in Fig. 3.

Fig. 3
figure 3

Illustration of SNA with Denmark charge data: node size represents betweenness measure

Findings

‘Power Few’ Co-Offenders in Denmark Based on Crime Counts (Charges)

We found that by counting the number of offences each co-offender in the dataset was charged with, a power few co-offending population of 8,832 offenders could be identified. This 7.42% of the co-offending population accounted for 49.99% of the 437,717 charges involving co-offending in Denmark in 2007–2017, according to charges data (Fig. 4).

Fig. 4
figure 4

The ‘power few’ co-offenders for count of crime

The gender composition across the total co-offending population, compared with the power few in terms of crime volume, showed a higher percentage (91%) of males amongst the power few in terms of crime volume compared with the total co-offending population. Amongst the total co-offending population, 78% were identified as male and 22% as female, as shown in Fig. 5.

Fig. 5
figure 5

Gender composition of co-offending count ‘power few’ compared with all co-offender population

Of the 8,832 co-offenders identified as the ‘power few’ in terms of crime volume, 7.3% (n = 648) were known to be members of an identified Organized Crime Group (OCG). Of these individuals, 206 were identified as members of street gangs, and 195 were members of one of the seven ‘biker clubs’ present in Denmark. Another 225 were members of OCGs not classified as street gangs or biker clubs. The remaining 22 were registered as associated to OCGs but not formally on the OCG members’ registrar.

The power few were, on average, active for 3.78 years out of the 11 years studied in this dataset, almost identical with to the non-power few offending population’s average of 3.68 years of activity. However, we found significant differences within the power few when gender was taken into account: male members of the power few were active for 3.87 years, compared with 2.86 years for the females.

There were also no differences in the types of crime committed by the power few and non-power few co-offenders. The percentage of charges involving burglary (15.30% vs. 19.88%) and theft (29.76% vs. 35.02%) was somewhat higher for the power few co-offenders, but the proportion of charges involving violence/threats, homicide and sexual offences were all lower for the power few co-offending networks. This finding demonstrates that merely counting the number of offences, without taking the crime type into consideration, has limited value in the identification of the most harmful power few co-offending offenders.

Finally, we found that 7.42% of the co-offending population generated 14.0% (n = 61,325) of all the charges in Denmark in the period analysed—including charges with single offenders. As the volume of co-offending power few only consists of 8,832 offenders out of a total offender population of 357,215 (2.5%) over the 11 years examined, we are able to use this analysis to generate suitable targets for future policing, as discussed below.

‘Power Few’ Co-Offenders in Denmark Based on Crime Harm Index (DOTT)

Based on the DOTT, the crime harm power few consists of 4,296 co-offenders out of the 119,069 co-offenders in the dataset. As illustrated in Fig. 6, this power few population equates to 3.61% of the entire co-offending population and generates 50% of the crime harm—comprising only half of the 7.42% of the co-offenders that produced the top 50% of crime counts.

Fig. 6
figure 6

The ‘power few’ co-offenders for crime harm

The power few of harmful co-offenders is comprised of 4,013 males and 283 females. Of these, 13.2% (n = 569) were recorded as members of an OCG; and, of that figure, 185 were identified as members of street gangs, 177 were members of one of the seven biker clubs in Denmark and 190 were members of OCGs not classified as street gangs or biker clubs. The remaining 17 were registered as associated to OCGs, with a larger proportion of the harm power few overall being found to be members of OCGs (see Fig. 7).

Fig. 7
figure 7

Distribution of ‘power few’ co-offenders amongst OCGs

Of the 11 years studied in the dataset (2007–2017), the crime harm power few were active for an average of 3.68 years, with the majority of that population (79%) active for between 1 and 5 years. This distribution of activities across the years studied was similar to the volume of the power few, and this was equally the case when the duration of activity was analysed by gender.

Figure 8 illustrates how the distribution of crime type changes when focusing on crime harm (measured by DOTT) instead of crime counts. While fraud/embezzlement accounted for more than 44% of all the charges, it generated just under 18% of crime harm. Crime types such as theft (15.6%) and burglary (14.4%) accounted for large percentages of the total number of charges generated by the power few in terms of crime counts. However, theft represented only 1.3% and burglary only 3.68% of the crime harm generated by the power few co-offenders. Homicide and robbery accounted for 2.5% and 6.7%, respectively, of the charges generated, but homicide accounted for 26.7% and robbery 38.8% of the crime harm generated.

Fig. 8
figure 8

Distribution of DOTT harm by crime type amongst the harm-generating ‘power few’ co-offenders

Overall, the harm-generating power few co-offender cohort made up 1.2% of all offenders in the database (including non-co-offenders), yet accounted for 23.6% of all charges. This group did not only co-offend with other offenders in the power few population but also with a large number of other co-offending partners. These co-offending partners, who made up 13.2% (n = 15,748) of the co-offending population, contributed another 14.1% of the total crime harm generated by the co-offending population for a total contribution (charged and connected) of 64.1% of the total crime harm produced by the co-offending population.

Social Network Analysis and Central Actors in the Co-Offending Networks

The majority of the harm-generating power few co-offenders were connected in one main network that included many far less-harmful offenders. That network consisted of 82.4% of the power few population and 94.9% of all the co-offending population, across the 11 years studied. The remaining 757 co-offenders in the power few population and their 807 co-offending partners were found in isolated clusters and dyads. These isolated co-offenders had 75% more charges for fraud/embezzlement, 33% more homicide charges and 55% more sexual offence charges than those connected to the main network. The connected co-offenders were more commonly engaged in arson, violence/threats and weapon offences.

We then applied the betweenness centrality measure to this network in order to further examine the benefit of focusing on the power few. We identified the nodes showing the shortest path to all other nodes in the network, and rank-ordered them by that measure. Table 1 presents the data on the top ten co-offenders, or the most interconnected power few. This subgroup of the network (2%) is linked to 196 other power few co-offenders (44%) and 241 co-offenders not within the power few. Figure 3 illustrates the connections of the top ten to other co-offenders. Our analysis showed that if the top ten were removed from the network, an additional 122 co-offenders would be isolated.

Table 1 Top 10 co-offenders in the main network based on betweenness centrality (ten dyads)

Discussion and Conclusions

Identifying the most harmful potential offenders for investigative and preventive targeting is routinely done without evidence-based tracking, let alone evidence-based selection. The approach we employ in this article may appear difficult and expensive, but that conclusion is not supported by the facts. What is expensive is investing substantial police resources in a medium-harm group of co-offenders, while very high-harm co-offenders are not targeted. What is difficult is to decide which offenders to target in the absence of evidence-based policing analyses.

As our tools develop, so does our ability to identify certain individuals in the offender population—the targeting of whom can lead to significant reductions in criminal behaviour (e.g. Barnes et al. 2017). Equally, the body of evidence on different units of targets—individuals, hotspots, facilities, etc.—is growing as well (see Ariel 2018; Ariel et al. 2015).

In this paper, we attempted to show two additional layers to this line of research. First, we offer an alternative perspective to the ‘one-at-a-time’ approach and focus instead on co-offending networks; second, we echo the call for greater attention to measuring harm rather than counts of crime when determining the suitability of targets. So how can these findings help improve the ways in which the police identify offenders using an evidence-based approach?

‘Get Them in Packs’

Targeting groups of two or more co-offenders, rather than individual offenders one at a time, would limit the total number of suitable targets in which the police invests. This compressed approach to targeting is not only efficient. It may also be more effective.

A growing body of research on peer influence and network communications is leading to a greater understanding of how sanction threat messages cascade through groups (Papachristos et al. 2015). The relevance of targeting is clear: people talk, and the police can leverage their talking. Assume a hypothetical police department identified actors within social networks of co-offenders who are responsible for transmitting messages to the group members. If the police targeted these key actors in the network by either a preventative or a pursuit exercise, it can be assumed that they would communicate their interaction with the police to other members of the network.

By targeting key actors, the effect exerted on those individuals may cascade into more of their co-offending parties than by contacting less ‘central’ co-offenders. Even though the police have not been directly involved with these co-offending parties, they may leverage them for sending effective messages, in a strategy of focussed deterrence (Braga and Weisburd 2012). By piggybacking on the social links offenders have with others, any police force can increase its crime-prevention utility by interacting with co-offenders with the view of preventing the criminal behaviour of others. So far, however, very little has been done through prospective and controlled studies (but see Ariel et al. 2019).

Given the ubiquity of the Pareto curve in criminal enterprises, it should be no surprise that we have identified, within the already over-represented population of co-offenders, a subgroup of co-offenders that produce disproportionally more crime than others. However, the results are still telling; these so-called ‘power few’ represent less than 8% of all co-offenders, but they produce half of all the charged crimes in Demark. Identifying these relationships—particularly the persisting co-offending relationships (Denley and Ariel 2019)—could lead to both more efficiency and effectiveness in utilisation of police resources. By rank-ordering the co-offending units—either crudely by frequency and volume only, or conditional of other features that may explain more crime, such as age, betweenness, specialisation or persistency—the police can then strategically target the ‘heavier’ producers of crimes in society, and leverage their communications with associated offenders.

In this paper, we have demonstrated that SNA can be used to meaningfully narrow the co-offender population even further, by identifying key individuals whose co-offending interactions with other offenders classify them as central players in the co-offending network with which they are affiliated. The algorithms employed for these tasks are built into widely available analytical software and can be applied quickly and cheaply by police analysts. Such techniques offer much promise for the future of targeting analyses and provide many police departments with a ready-made opportunity to try big-data policies—especially when it comes to targeting criminal networks and the gatekeepers within these communities.

Harm, Severity and Seriousness, But Not Counts

Our second contribution is to expand the evidence on what a ‘power few’ in offending can mean, beyond frequency and counts of crimes. Unlike the pivotal studies by Morselli (2009), Morselli et al. (2006) and others who implemented count-based models to identify suitable targets in criminal networks, our study joins Sherman et al. (2016) to see a more weighted approach to identifying suitable targets based on the harm that they cause. Whereas previous studies necessarily focussed on the number of offences offenders commit, we focus on the severity harm.

In this study, we have found tangible evidence of a harm-generating power few that produce a substantial portion of the crime harm in Denmark. We have found that 3.6% of the co-offending population accounts for 50% the total harm, caused by that population, based on the DOTT harm index we applied.

These findings join other recent studies on the concentration of crime harm across different units of measurement. Collectively, the studies reviewed earlier suggest at least two conclusions. Firstly, crime harm is substantially more concentrated than crime counts; our findings show that this pattern is true for co-offending networks as well. When considering the ‘power few of the power few’ of harm, then the target list is shorter than any list of offenders who are rank-ordered by the frequency of crimes that they have committed. If the most harmful co-offenders can be identified, they may well be predicted. While description is never the same as prediction, it is a helpful first step. If just 1.2% of the overall offending population in Denmark contributed 24% of overall harm over 11 years, that pattern is a good place to begin predictive studies of who will cause the most harm next year. When we consider the list of offenders in Table 1, it becomes obvious that ‘removing’ just these 20 offenders could potentially isolate 27% of their co-offenders from the main network—suggesting a potential displaced value from selecting such individuals for priority targeting.

Finally, we are encouraged by the utility of SNA in the analysis of crime harm. As demonstrated in this article, it provides additional opportunities not just for tactical law enforcement decisions. It should also help criminologists and other scientists who are interested in expanding the scope of analysis on the ways in which co-offenders interact. Our focus on one large network in Denmark suggests that there is potential for dismantling the criminal behaviour of many actors at once. That potential comes from focusing solely on offenders who have committed crimes with an unusually high number of other co-offenders. Such an approach—testing the effect of cascading messages within groups—is a promising avenue for research, as well as for evidence-based policing.