1 Introduction

Issues of Corporate Social Responsibility (CSR) and sustainability have become increasingly important for both academic research and business practice (Chen and Chen-Hsun 2017; Hult 2011). As society becomes more and more concerned with environmental and social issues, the public increasingly expects companies to behave in environmentally and socially responsible ways. Business communities have responded to these expectations (Sethi et al. 2017). Business-school accrediting bodies have begun to add ethics and sustainability to their accreditation standards (AACSB International 2017; IACBE 2017), and many companies have established sustainability-officer positions. The perceived urgency and importance of CSR at times have escalated to a race between organizations to launch initiatives, whether or not benefits of such actions materialize (Wirl 2014).

The subject of CSR has drawn considerable interest for research. Most of CSR research studies its effect on company performance and the factors moderating and mediating that effect (see for example: Chen et al. 2013; Chih et al. 2008; Hong and Andersen 2011; Heltzer 2011; Jo and Harjoto 2011; Jo and Na 2012; Luo and Bhattacharya 2006; McGuire et al. 2012; Moura-Leite and Padgett 2014; Mulyadi and Anwar 2012; Nelling and Webb 2009; Park et al. 2014; Peters and Mullen 2009; Sun 2012; Sun and Stuebs 2013). Researchers also have attended to related subjects, such as how CSR affects and is affected by business and society, the role of administration, and the role of government and law makers (Carroll 1999; Cochran 2007; Bosch-Badia et al. 2013; Li et al. 2017). Altogether, the common opinion is that CSR is no longer a collateral activity of the companies, but rather part of the core corporate strategy aimed at streamlining and improving imperfections of narrowly defined market goals (Porter and Kramer 2011; Bosch-Badia et al. 2013). Failure to engage or adequately engage in CSR may have negative consequences, potentially resulting in bad publicity, lowered reputation, or diminished value of a brand (Peloza and Shang 2011).

Researchers have described a number of multidimensional systems for engaging in CSR (see, for example, Schreck 2011). One of the most popular multidimensional systems for engaging in CSR is what is sometimes referred to as the “4D List” (Albinger and Freeman 2000; Peloza and Shang 2011):

  • Social: directed at the local community and society at large;

  • Labor: directed at own employees;

  • Environment: directed at the habitat and natural environment; and

  • Governance: directed at ensuring transparent and just corporate governance.

These dimensions are considered the most important and general, covering almost all possible CSR activities. Measurement scales for these and some other multidimensional systems mostly have been unified, standardized, and maintained by the Kinder, Lydenberg, and Domini (KLD) database, currently handled, in a modified form, by the MSCI. This database serves as a popular and prominent source for research projects exploring empirical evidence of CSR activities along the dimensions in the 4D List. Typically, published research concerns the interrelation between individual CSR dimensions and company activities in various areas, such as strategy, control, or performance (see, for example, Martinez 2014; Block and Wagner 2014; Krüger 2015; Michelon et al. 2013; Sen and Bhattacharya 2001). To date, no publication has analyzed the distribution of corporate efforts across dimensions of CSR as a whole. Identified patterns could provide companies or other bodies with a context and reference point in the analysis, planning, and assessment of CSR activities.

There can be different structural patterns of CSR activity along its dimensions. Whereas some companies may focus on just one dimension, such as the environment (“going green”) or labor (staff development), others might prefer to split their efforts by contributing equally to two or more dimensions. One would expect that a large company’s CSR policies in this regard would be more or less consistent.

This paper analyzes the largest international companies in the MSCI database to determine whether any consistent multivariate profiles of CSR activities exist. If such a profile or a set of profiles do exist, a related question then would be, what changes in profile types occur over time? The method of K-means clustering, arguably the most popular clustering method because of its computational and intuitive simplicity (Hartigan and Wong 1979, Hennig et al. 2015, Lord et al. 2017, Mirkin 2019), will enable us to identify patterns across profiles.

The remainder of this article has the following organizational structure. Section 2 outlines the most popular view of CSR according to its roots in stakeholder theory, discusses the multidimensional nature of CSR, and outlines research questions. Section 3 describes data and methods. We use the square-error criterion of the popular K-means clustering as a benchmark and change it for an equivalent (complementary) clustering criterion. The complementary criterion requires finding big anomalous clusters. This gives a substantiation to a method that extracts anomalous clusters one-by-one and then leaves only those largest of them. This strategy mitigates a common drawback of K-means: the need for a user-defined number of clusters and the initial location of them—a real issue for an uninitiated user, especially in the global analysis of CSR activities. Section 4 describes and analyzes thus identified clusters and discusses methods, findings, and corresponding possible future developments in CSR activities. Section 5 concludes the paper with a brief account of the results and directions for future work.

2 Corporate social responsibility and its patterns

2.1 Defining CSR

The usual understanding of corporate social responsibility (CSR) is typified by McWilliams and Siegel definition: as a company’s “actions that appear to further some social good, beyond the interests of the firm and that which is required by law” (McWilliams et al. 2011). Despite the popular view epitomized by Milton Friedman’s famous pronouncement that “the social responsibility of business is to increase its profits” (Friedman 1970), society, business, and academia have moved in the direction of a wider perspective, whereupon businesses do have additional responsibilities to society (Carroll 1979; Porter and Kramer 2011).

Although scholars somewhat differ in their definitions of CSR and “sustainability” (e.g., Chan and Cheung 2015), a general consensus exists that “sustainability” is essentially CSR plus efforts to remain profitable (for example, Hult 2011). Considering John Elkington’s description of the triple bottom line of sustainability as “people, planet, and profit” (Elkington 1998, p. 73), CSR can be assigned to the double bottom line of “people and planet.”

2.2 Stakeholders

The understanding of what exactly a corporation is responsible for and to whom it is responsible has been maturing from ad hoc, hodge-podge early views to more systematic approaches (Fassin 2009). Central to the operationalization of CSR is the idea of stakeholders as groups that have interest in the way a company does business, in addition to its outcomes (Freeman 1984). Traditionally, CSR literature treats any responsibility to stakeholders other than the shareholders (or, oftentimes, customers) as a social one. However, some researchers argue for making a distinction between stakeholders and social issues (Clarkson 1995). Categorized according to their role vis-à-vis the corporation (as employees, customers, etc.), different stakeholder groups have different concerns. These “functional” groupings of stakeholders are not uniform or homogeneous in their interests or concerns, sometimes to the point of conflict (Betts and Taran 2011).

CSR is then understood as a multidimensional construct (Schreck 2011; Weber and Gladstone 2014) that includes a variety of actions and principles directed at satisfying society-related concerns of non-shareholder stakeholders. These concerns include helping the environment, community (local or global), society, and people (including employees), while pursuing just and ethical governance.

2.3 Measuring CSR

Practitioners and scholars have made considerable efforts to measure CSR (Jones 2017), ranging from a CSR ranking based on numeric values for each possible “good practice” of a firm (as identified by some authoritative body) with an accompanying summative score (Chen et al. 2013), to counting the number of company policies for each identified area of concern (Welford 2005). Consulting organizations have emerged with guidelines on measuring and reporting of CSR-related practices and their outcomes, usually referred to as CSP (Corporate Social Performance). For example, London Benchmarking Group (LBG) provides guidelines regarding charitable community contributions, including their impact on business and society (London Benchmarking Group 2015).

Practitioners and scholars evaluate and rank organizations based on different areas of concern for different groups of stakeholders (see Clarkson 1995), resulting in several data sets, such as the KLD Socrates database, which has been superseded by the MSCI ESG database (Jo and Na 2012; MSCI 2011).

Kinder, Lydenberg, and Domini rated companies based on a number of strengths and concerns (Lougee and Wallace 2008; Mattingly and Berman 2006; See Exhibit 1).

As the data service changed hands and evolved, some variables and methods of measurement have changed as well; however, the general spirit of KLD ratings has been somewhat retained in the MSCI historical ratings data (See Exhibit 2, cf MSCI 2011). Nevertheless, the database did undergo further changes in the newer MSCI rankings (MSCI, 2011). We note that scorings in the MSCI are treated as continuous-valued variables, so that the operation of score averaging is not out of scope for the MSCI data.

The final IVA score is determined as a weighted sum of the four major factors from 4D list. For example, FedEx faced huge labor disputes and issues with the carbon emissions of its fleet, which led to a CSR rating of CCC (the lowest grade). FedEx improved its rating by significantly changing its carbon emissions. In the same industry, Deutsche Post AG received excellent scores on all four CSR factors: environmental efforts, labor relations, stakeholder relations, and governance. Deutsche Post AG earned a rating of AAA (the highest grade).

2.4 CSR profile

Let us refer to the set of grades of a corporation across the four dimensions in the 4D list above as its CSR profile. Different patterns emerge for company CSR profiles. These patterns may vary on a continuum between an “even” pattern and a “focused” one. We consider that a company exercises an even CSR profile if its performance along each of the four dimensions is about the same. A company could strive to be responsible on all CSR fronts and “be a good citizen.” Alternatively, a company that pays no attention to CSR at all has an even pattern as well. A company that concentrates its CSR efforts on just one dimension has a focused CSR pattern. Discerning such patterns from the aggregate data is all but impossible in most cases.

Even more complicated would be trying to predict possible directions of changing CSR profiles from the aggregate data over time. An important factor to consider here is the fact that practitioners have been strongly advised to be strategic in their CSR by choosing directions that would most matter to their reputations. According to Peloza and Shang (2011),

“The opportunity to differentiate through various CSR activities means that managers should not simply look to outspend their competition on CSR, or assume that greater levels of CSR investment will lead to improved consumer perceptions of value…. Brands that will be most successful are those that use CSR activities to provide incremental consumer value matched to product category salience of those values” (Peloza and Shang 2011 p. 130).

One would expect that after the initial push in the 1990s to “be good in general,” businesses would have become more strategic with their CSR choices. Although that may be true, “being strategic” means different things to different companies and makes prediction difficult from aggregate data. Specifically, each of the following scenarios—(SA), (SB), (SC), (SD), and (SE)—is compatible with the strategic-behavior approach.

(SA):

Companies might strive to become better “global citizens” along every dimension. Then, one would see increase in total ratings as well as increase in the number of even profiles

(SB):

Corporations may keep losing interest in social responsibility, so that CSR ratings go down and even profiles with low average ratings proliferate

(SC):

Businesses could pursue policies oriented at just one or two CSR dimensions and display focused profiles

(SD):

Businesses might start with even profiles, but narrow their efforts to only those dimensions that have the greatest impact on their stakeholders. This would show an increase in focused profiles in the data by the end of the period

(SE):

Businesses could start with focused profiles, but expand their efforts to other dimensions. This would show an increase in unfocused profiles

2.5 Research questions

It stands to reason that over time, companies would change their CSR profiles. There are several possible scenarios of such change: they could become more committed to CSR in general; they can shift away from the even profiles and more toward the focused profiles; they can shift away from focused profiles to even profiles; they can shift among focused or unfocused profiles. Here are our research questions:

Research Question 1:

What are the patterns in CSR activities?

Research Question 2:

What are the dynamics of patterns of CSR activities?

3 K-means clustering: classic and complementary criteria

3.1 K-means: an introduction

The task is to discern patterns prevailing in 2007 and in 2012, and compare and contrast these two time points. The most appropriate multidimensional statistics technique for this type of analysis is clustering, because it is specifically oriented towards finding different patterns in a data set. A popular clustering method, K-means partitioning, seems especially suitable for our goals (for more information about K-means partitioning, see Hartigan and Wong 1979, Hennig et al. 2015, Mirkin 2019). It partitions a data set represented by multidimensional points corresponding to observations (companies over the four CSR dimensions, in our case) into non-overlapping clusters. Each cluster is a bunch of points around its center, a pattern computed as a mean across objects in the cluster (calculated separately for each variable). Usually, the user has to guess the right number of clusters and specify some hypothetical cluster representatives, the “seeds.”

The number, K, and the initial location of “seeds” constitute the input to K-means algorithm. The algorithm outputs a partition of the set of objects in subsets, clusters, and cluster centers, points in the multivariate space that correspond to within-cluster averages. The K-means algorithm runs a sequence of iterations, each consisting of two steps: a cluster update and center update.

This technique has two big advantages. First, it locally minimizes a natural criterion, the sum of squared Euclidean distances between the objects and their cluster centers. Second, it computationally makes a typology. It is intuitive and computationally convenient. However, the method has limitations, as well. It requires the user to pinpoint initial cluster seeds or, if the user cannot, generates them randomly, thus leading to possibly inadequate results. The number of clusters may be difficult for the user to specify, as well. In the literature, scholars have proposed ideas for how to automate this process (see, for example, Mirkin 2019; Rodriguez and Laio 2014).

This paper uses a complementary criterion for K-means. The complementary criterion is mathematically equivalent to the original K-means criterion, but it provides a very different rationale for the clustering process. According to this complementary criterion, the goal is to find big anomalous clusters. Although finding a globally optimal solution is as computationally intensive as minimizing the original K-means criterion, the complementary criterion leads to a simple heuristic for building “anomalous” clusters one-by-one, thus making the choice of the number of clusters much easier. In this way, the complementary criterion serves as a substantiation of the so-called anomalous-cluster initialization heuristic (Chiang and Mirkin 2010; de Amorim et al. 2016).

3.2 K-means square-error and complementary criteria

Given K, the problem is to find such a partition S = {S1, S2, …, SK} and cluster centroids ck = (ck1, ck2, …, ckV), k = 1, 2, …, K, that minimize the square-error criterion:

$$D(S,c) = \sum\limits_{k = 1}^{K} {\sum\limits_{{i \in S_{k} }} {\sum\limits_{v \in V} {(y_{iv} \, - c_{kv} )^{2} = \sum\limits_{k = 1}^{K} {\sum\limits_{{i \in S_{k} }} {d(y_{i} ,c_{k} } )} } } } ,$$
(1)

where d(yi, ck) is the squared Euclidean distance between data point yi and cluster center ck.

The K-means algorithm follows the so-called alternating minimization scheme for criterion (1). Starting with some set of K centers c, it finds an optimal partition S, minimizing D(S, c) at the given c, and then finds c′ minimizing D(S, c) at just found S. The procedure is repeated until convergence—that is, until c′ coincides with c. In practice, the method converges fast to a local minimum, which is dependent on the choice of initial c. The issues related to choice of K and initial c are well known and subject to ongoing debate (for a sample of literature, see de Amorim and Hennig 2015; Mirkin 2019; Mur et al. 2016, and references therein).

Let us consider T(Y) = \(\varvec{ }\mathop \sum \limits_{{\varvec{i} = 1}}^{\varvec{N}} \mathop \sum \limits_{v \in V} y_{iv}^{2}\), referred to as the data scatter, and

$$F\left( {S, \, c} \right)\, = \, \mathop \sum \limits_{k = 1}^{K} \left| {S_{k} } \right|\mathop \sum \limits_{v \in V} c_{kv}^{2} = \mathop \sum \limits_{k = 1}^{K} \left| {S_{k} } \right|\left\langle {c_{k} , c_{k} } \right\rangle ,$$
(2)

where \(\left\langle {c_{k} , c_{k} } \right\rangle\) is the inner product of ck by itself, the squared Euclidean distance from ck to 0. These are related to K-means criterion in (1) via equation:

$$T\left( Y \right)\, = \,F\left( {S,c} \right)\, + \,D(S,c).$$
(3)

Equation (3) implies that the complementary criterion in (2) is to be maximized to minimize D(S, c).

Provided that the origin preliminarily is shifted into the point of “norm”, i.e., the gravity center, the meaning of the complementary criterion is as follows: find as numerous and as anomalous clusters as possible, to maximize F(S, c). In contrast to the square-error criterion D(S, c), which does not depend on the location of the space origin, 0, the criterion F(S, c) pertains to the origin, as its items \(\left\langle {c_{k} , c_{k} } \right\rangle\) heavily depend on that, which is used in the one-by-one greedy optimization approach below. To sharpen the structure of a data set, we counterpose it to a point of “norm”, the grand mean of the data set. Therefore, when using the complementary criterion, we do not skip a data preprocessing option, the subtraction of the point of “norm” from all data points.

3.3 Maximizing the complementary criterion: one-by-one approach

An option for finding big and anomalous clusters would be to begin by building anomalous clusters independently, so that each cluster S and its center c maximize the contribution:

$$f\left( {S,c} \right)\, = \,\left| S \right|\,\left\langle {c,c} \right\rangle .$$
(4)

An exact solution to this non-polynomial problem cannot be found easily. Therefore, we consider a locally optimal solution. Assume, for the start, an initial cluster to be a singleton, so that |S| = 1. To maximize (4) then, one has to put it into the point that is furthest away from the origin, 0. This, unlike the conventional K-means, gives us a reasonable initialization for the clustering process. To move further, we attend to the same alternating minimization scheme that is utilized in the conventional K-means algorithm. Given cluster S, its center c is computed as the average:

$$c\, = \,c\left( S \right)\, = \,\frac{{\mathop \sum \nolimits_{i \in S} y_{i} }}{\left| S \right|},$$

where yi is a row of the data matrix corresponding to observation iI. Given c, an optimal update of cluster S should be computed according to the following rule CUR:

Cluster update rule (CUR):

Given a cluster S, remove i S from S if f(S, c) > 2|S| 〈c, yi〉 − 〈yi, yi〉 and

add i S to S if f(S, c) < 2|S| 〈c, yi〉 + 〈yi, yi〉.

A proof, that thus updated S indeed maximizes criterion (4) at a given c, is in Appendix to the paper. This update rule gives rise to the following algorithm for building an anomalous cluster.

Algorithm EXTAN (EXTracting an ANomalous cluster)

Input: A data matrix.

Output: List of observations S and its center c.

  1. 1.

    Initialization: Find an observation maximally distant from 0 and make it the initial center, c, of the anomalous cluster being built.

  2. 2.

    Anomalous cluster update: Given c, update S according to CUR rule above.

  3. 3.

    Anomalous center update: Given S, update the center as the within-S mean c′.

  4. 4.

    Test: If c’c, assign c = c’ and go back to step 2. Otherwise, move on to Step 5.

  5. 5.

    Output: Output the list S and its center c.

EXTAN works according to the alternate minimization principle. One could use an incremental approach by making only one point to move at a time: adding to or removing from S that one object i in the CUR rule which gives the maximum increase in the value of criterion f(S, c), and halting the process when no move can increase the criterion.

Of course, the result of EXTAN depends on the location of 0, as already mentioned.

Using EXTAN as a subroutine, we can propose the following one-by-one algorithm for greedily maximizing the complementary criterion F(S, c) in (2).

Algorithm BANCO (Big Anomalous Clusters One-by-one)

Input: A data matrix and a user-defined integer t—the minimum cluster size (and, possibly, the point of norm, g).

Output: A partition of the set of observations S in K clusters (K is determined by t) and cluster centers c1, c2, …, cK.

  1. 1.

    Data preprocessing: Centering Take the input point g if provided or, if not, compute the grand mean, the vector of average values of the features, and take it as g. Subtract g from all the data matrix rows. Optionally, normalize data features. Set counter of clusters k = 1. Define Ik the set of all the observations.

  2. 2.

    Iterative EXTAN: At a given k, apply EXTAN to the data matrix over the set of observations Ik to output cluster Sk and its center ck. Define Ik+1 = Ik − Sk. If Ik+1 ≠ ∅, set k = k + 1 and start step 2 again. Otherwise, move to the next step.

  3. 3.

    Large cluster’s centers’ selection: Consider all the sets Sk obtained, and determine those satisfying |Sk| > t. Define K the number of these clusters Sk (k = 1, 2, …, K).

  4. 4.

    Output the K anomalous clusters and their centers.

The BANCO’s output serves as the input to a run of K-means.

According to the algorithm BANCO, the number of clusters K is determined by another user-defined quantitative parameter, t, the minimum number of observations in a cluster. Both K and t relate to the level of granularity of an aggregate representation of the data set structure by clusters. However, the same value of K usually corresponds to an interval of t values. Therefore, specifying a t value imposes on the user somewhat less pressure: first, the minimum cluster size requires less data knowledge, and, second, the resulting K is not that sensitive to errors over the value of t. Sometimes, the choice of t can be operationalized as follows. Take all the anomalous clusters built by BANCO, and look at the sequence of their cardinalities sorted in the descending order. Take that t which is defined by a large drop in the values if there is a drop. Also, BANCO can be easily adapted to the case when K is known to the user: just take the K largest clusters at the Step 3.

3.4 Previous research on the issue of the right number of clusters

The issue of finding the right number of clusters is attracting considerable research efforts (see reviews in Mirkin 2019 and Chiang and Mirkin 2010 and for some later attempts, de Amorim and Hennig 2015, Lord et al. 2017, Zhou et al. 2017, and references therein). Generally speaking, all of the efforts can be divided in three groups:

  1. a.

    those based on exploring the data set before clustering,

  2. b.

    those based on values of some indexes while clustering, and

  3. c.

    those based on exploration of the set of partitions found at multiple runs of a clustering algorithm.

Approaches falling in the group (c) are most popular. Some of them—such as Gap and Jump statistics, Hartigan and Calinski–Harabasz indexes, and consensus-based indexes—have been tested by Chiang and Mirkin (2010) in their extensive computational experiments. In these experiments, the Hartigan index appeared superior to the others in terms of recovery of the number of clusters K, although not in terms of recovery of clusters themselves.

A genuine method in group (a) is a precursor to the proposed algorithm EXTAN, an anomalous clustering method proposed by Mirkin (1990) as an extension of the Principal Component Analysis to binary factor scores. Given an N × V data matrix Y, the first principal component consisting of N × 1 vector of factor scores z and V × 1 loading vector c minimizes the square error D = ||Y − zcT||2 where ||·||2 is the sum of squared differences between the corresponding N × V matrices. As is well known, the optimal product zcT is equal to μvwT, where μ is the maximum singular value of Y, and v and w are its corresponding normed singular vectors. The method of principal clusters takes the same square error D = ||Y − zcT||2 criterion, but constraints the solution to binary 0/1 vectors z only, so that the binary z corresponds to cluster S = {i: zi = 1} and the optimal c, to S within-cluster mean vector. Geometrically, this is equivalent to finding cluster S and V-dimensional vector c minimizing criterion (Mirkin 1990):

$$A(S,c) = \sum\limits_{i \in S} {d(y_{i} , {\text{c}}) + \sum\limits_{i \notin S} {d(y_{i} , 0)} } ,$$
(5)

Criterion A(S, c) in Eq. (5) is similar to K-means criterion in Eq. (1), except that the number of clusters here is 2 and one of them has a non-varying center, 0. The principal cluster analysis method, also referred to as the anomalous-cluster method in Chiang and Mirkin (2010), is alternating minimization method, in some aspects similar to EXTAN above, except that the goal of maximizing the cluster sizes, so much prominent in EXTAN, is absent from the former method.

To illustrate the difference between the two methods, consider, for example, the set of eight two-dimensional observations A–H in Table 1. Assume that the “norm” here is specified as the origin, point 0 = (0, 0), and no normalization is required. Then, repeated applications of EXTAN bring two non-trivial clusters S1 = {A, B, C} and S2 = {D, E, F}, leaving G and H singletons. In contrast, anomalous-cluster method outputs only one non-trivial cluster, S1 = {A, B, C} while leaving the other five points singletons. Obviously, no threshold t can reconcile these results.

Table 1 Illustrative example of eight objects

4 Empirical analysis

4.1 Ratings database: description of MSCI

Having superseded the KLD data, MSCI historical data provide the opportunity to observe patterns in CSR over time. The variables available are in Exhibit 2. Ratings are in a letter-grade format from AAA, AA, A, etc. to CCC, converted into a seven-point scale, with AAA being a 7 and CCC a 1. The universe of companies included in this data has kept expanding over time, so that by March, 2007, the number of companies grew to provide grounds for a global comparison. As many as 1850 companies have ratings in both March, 2012 and March, 2007 data. These companies form the set of observations used in this study.

A number of companies rated in 2012, but not in 2007 (904 altogether), are not included in the analysis, since they cannot be used for comparison across time.

We included into our analysis all of the major components in 4D List. These variables form the Social and Eco ratings: Strategic Governance Factor v17, Human Capital Factor v18, Environment Factor v19, and Stakeholder Capital Factor v20 (see the Exhibit 2 for components of the factors). Table 2 provides descriptive statistics for the variables. Altogether, the data set used in our analysis includes the above variables along the scores for each of the major dimensions for the 1850 companies that have been in both 2007 and 2012 lists.

Table 2 Basic statistics

4.2 Means and correlations at aggregate data

We first try to look at the data structure using the means of the grades over dimensions under consideration and correlation coefficients between the dimensions. The feature means at the entire data set are, as usual, referred to as grand means to distinguish them from the means at clusters.

When looking at the levels of the values over the entire data set (see Table 2), one can see that all of the mean levels are close to 5.0, whereas the levels of variation are about 30–35%. This pattern does not change much from 2007 to 2012, which we attribute to the existence of various pattern types that are hidden behind the average values. Therefore, grand means alone provide no insights that are useful for our analysis.

Looking at the pattern of correlations between the CSR components (Table 3), we can see a great difference between the time points. In 2007, correlations between all of the four dimensions are much greater than those in 2012. The level of correlations in 2007 is at the level of 0.65–0.70. That means, on average, if a company earned a high score on one of the CSR dimensions, then it was very likely to score high over the other dimensions as well. Similarly, the companies scoring low over one dimension were likely to score low over the other dimensions.

Table 3 Correlation coefficients between the variables

Table 3 shows that the starting point of the CSR activity process is not something that happened a long time ago. The starting period, however, early CSR discussions started, did cover year 2007 and probably later. However, we can safely claim that a new phase already started by 2012 because of the significant changes in the pattern of correlations in 2012 (see bottom of Table 3). In 2012, the correlations are not high anymore, but rather small, around 0.15–0.25. The shift to low correlation illustrates dramatic changes in the patterns of CSR activities: there is little agreement between the dimensions now.

There is no point to claim that the prevailing pattern in 2012 is a focused one, because small correlations can hide contradictory processes in different subsamples. However, one cannot deny that the even patterns of CSR activity are not prevailing anymore in 2012. At a first glance, these two sentences may seem to contradict each other. Yet, they do not contradict each other, because there can be plenty possibilities between the even and focused pattern types. Say, while pattern a = (5, 5, 5, 5) is even and b = (7, 2, 2, 2) is focused, patterns c = (1, 7, 3, 7) and d = (9, 9, 2, 2) are neither even nor focused.

Probably, these changes have occurred because of a great global recession 2008/9, which affected the economic behavior significantly. The recession required companies to restructure their CSR activities as innate parts of the businesses, rather than as purely external activities.

Developing better understanding of the structure of CSR activity patterns requires our finding and analyzing CSR similarity clusters of corporations.

4.3 The structure of clusters: comparative analysis of CSR activities from 2007 to 2012

We applied method BANCO to both data sets, 2007 and 2012. As described later in Sect. 5.2, the method produced six clusters at each of the datasets. Characteristics of the found clusters are shown in Tables 4 and 5.

Table 4 Clusters 2007
Table 5 Clusters 2012

Let us first take a look at the patterns exhibited in the 2007 clusters (Table 4). In Table 4, there are six clusters represented by their centers expressed in the natural scales of MSCI scores. Additionally, the centers are presented in their relation to the grand mean as the relative difference (m − Gm)/Gm, per cent, where m is the within-cluster central value and Gm is the grand mean of the same feature. The column on the left shows the cluster sizes.

One can see from the Relation to Grand Mean on the right of Table 4, the clusters from 2007 indeed manifest more or less uniformly even patterns. In particular, clusters 1 and 2 (totaling to about 600 companies) perform much better than the grand mean values, by about 45% and 22% on average, respectively. In contrast, clusters 5 and 6 (totaling to 579 companies) exhibit profiles that are lower than the grand mean values. Clusters 5 and 6 are uniformly underperforming by about 15–30% and 40–50%, respectively. Middle clusters 3 and 4 are less uniform: cluster 3 slightly outperforms the averages in all but the Environment; this pattern is reversed in cluster 4.

In general, the 2007 clusters manifest a more or less a simplistic structure of CSR activities: corporations form a continuum of CSR activity effort ranging from approximately 2–3 to 7–8 on the KLD grading scale; at any point on this continuum, the total company effort is more or less uniformly even across the four CSR dimensions. The number of companies that either overperform on all of the dimensions or underperform on all of the dimensions is 1184, about two-thirds of the total number of companies in the sample.

The 2012 clusters (see Table 5, which is formatted similarly to Table 4) show a rather different picture. The difference can be seen even in the distribution of companies over the clusters. That was rather uniform in 2007, with cluster sizes varying between 219 and 307. The size differences are much sharper in 2012: they range from 170 to 492. Moreover, the spread over the grading scale is smaller by a grade on each side of the range, from 3–4 to 6–7 (versus 2–3 to 7–8 in 2007) on all but the Stakeholder Capital scales. The thoroughly overperforming and underperforming clusters still are present, but they cover a much smaller part of the set. Specifically, only cluster 1 (258 companies), on the plus side, and cluster 6 (287 companies), on the minus side, fall within this category. This represents a sharp decline of the balanced effort: from about 1250 companies in 2007 to 545 companies in 2012.

Also, the levels of deviation from the grand mean values are smaller in 2012. They reach about − 27% in cluster 1 and − 33% in cluster 6 on average, whereas these differences in 2007 are approximately 46% and − 48% on average, respectively.

Other clusters do not manifest even patterns at all. In particular, clusters 2 and 3 in Table 5 pertain to single-focus profiles. Cluster 2 exhibits a high-level activity over Human Capital, which is here 44% greater than the grand mean, while the other three components are closer to the average level, varying from − 8.5 to 13.6%. Similarly, Cluster 3 exhibits a high level of activity over Environment, 40% greater than the grand mean. Cluster 4 can be also counted as a single-focus group, this time at the Stakeholder capital as the focus dimension. Moreover, cluster 4 significantly underperforms over another dimension, Environment. Cluster 5 also may be considered as exhibiting a single-focus CSR pattern, assuming that it includes businesses that have less equity to devote to CSR activities. It underperforms on most dimensions, most notably by almost 60% on the Stakeholder capital front, but is 22% above average on Strategic governance.

Therefore, clusters 2012 illustrates a great change in patterns of CSR activities during the period 2007–2012. In the beginning of the period, the prevailing pattern for companies was to uniformly split their efforts, whichever they were, to each of the components of CSR activities. About 600 of them were overperforming and 600 underperforming in terms of the MSCI grades on each dimension. By 2012, this majority shrunk to only clusters 1 and 6 staunchly maintaining the even pattern on either overperforming or underperforming ends. The total number of companies in these two clusters, 545, accounts for just about 30% of the company set. The others have noticeably switched to single-focused profiles of CSR activities. Each of the clusters 2–5 overperforms noticeably on one dimension and has varying performance over other dimensions, at least some of which they underperform on. The change of CSR activities can be seen on the level of individual companies, too: most of them have changed their CSR pattern from 2007 to 2012.

4.4 Discussion of trends in CSR evolution over the four dimensions

Overall, the found clusters provide the following answers to our Research Questions: in 2007, prevailing pattern was to uniformly outperform or underperform on all four dimensions. By 2012, while a little less than 1/3 of the companies still exhibited the even pattern, the prevailing pattern changed to single focus. Also, it appears; the early period of CSR developments did not occur in the remote past, but rather was still running as recently as 2007. One can also see that the cluster structure of the CSR activities in the set of 1850 of the largest global companies has considerably changed between 2007 and 2012—probably yet another consequence of the Great Recession in 2008/9. We see a dramatic turn of the largest companies from a more or less uniform pattern of CSR efforts (or lack of such) in 2007 towards single-focus patterns in 2012, which goes in line with the scenario (SD) out of four scenarios outlined above. This dramatic turn shows a tendency that is likely to continue in future developments. Probably, the process of building more focused profiles is going to grow further, provided that the business environment framework in the world does not change much.

Most likely, the clusters of “staunchly uniform” overperformer and underperformer will remain, although at further reduced sizes. The clusters with contrast single-focus patterns will be further enhanced. Probably, while CSR activities mature and further consolidate as parts of the business process, the single-focus groups could expand to embrace two or even more dimensions. Therefore, a greater number of CSR clusters should be expected, with those displaying a double-focus activity, featuring more prominently in the future data.

5 Discussion

We discuss our findings in relation to two perspectives: (a) the data and (b) the method.

5.1 Issues related to the data

This paper uses the MSCI database as the main data source.

The MSCI ratings are not perfect even beyond the obvious limitation of any ranking system in so far as they are rooted in their creators’ sets of assumptions and simplifications. For example, they do not account for “whether a company’s political activities support or undermine environmental regulation” (Schendler and Toffel 2011). Also, many relevant companies appear to have been excluded from the database (Adam and Shavit 2008). The choice of lower level input variables of a greater granularity, which MSCI aggregates into the four indexes, and the particular method of aggregation may need further peer-review validation.

Additionally, the data aggregate CSR performance across the subsidiaries and strategic business units, which might obscure the picture considerably. More precisely, MCSI provides only aggregate measures showing only the resulting balance between positive and negative aspects of a company’s social performance and concealing the raw scores. Other flaws in MSCI methodologies are also a limiting factor. The database is rather expensive and not available to some researchers or organizations, which limits the scope of scientific and practical discourse that can rely on it.

For our purposes, what is important is that the 10-grade scoring system utilized in MSCI is robust and well serves various application purposes, which implies that the scores, as well as conventional statistics—the means and the like, derived from them can be used for reasoning about phenomena which they relate to.

5.2 Issues related to the method

Here, we discuss the specifics of our method related to (a) the choice of the number of clusters and (b) to the cluster anomality concept. Also, we are going to support our conclusions using a more conventional form of K-means clustering.

  1. (a)

    The number of clusters and granularity

There have been a lot of research efforts devoted to the issue of the “right” number of clusters (see Mirkin 2019, Mur et al. 2016, Zhou et al. 2017 and references therein). No approach proposed so far can be considered universally adequate. Our view is that clustering transforms data from one level of granularity to a coarser one. The level of granularity should be specified either externally or based on some preliminary data analysis. The number of clusters K is a characteristic of granularity that goes well with our intuition. Unfortunately, our intuition is not so good for choosing the number of clusters. When we cannot determine that the K-clusterings under consideration are optimal, we cannot decide which K is better. The K-means clustering criterion, as is, may provide no guidance on this, because the criterion’s value monotonely decreases when K grows. In this aspect, the other characteristic of granularity with which the EXTAN algorithm operates—the minimum number of objects in a cluster—suggests a more convenient tool for choosing K. One might be able to take a look at the distribution of anomalous-cluster cardinalities to make a reasonable choice.

Specifically, let us take a look at Table 6 presenting EXTAN results including the cardinalities of ten largest anomalous clusters. As one can see, for both data sets, there are six relatively large anomalous clusters, with a significant drop in the cardinality of the next anomalous cluster: from more than a hundred to 56 or less. This leads us to accept the number 6 as the number of clusters to build. We apply K-means clustering starting from the centers of the six largest anomalous clusters.

Table 6 Parameters of the results from the EXTAN algorithm to the data 2007 and 2012
  1. (b)

    The cluster anomality concept

The concept of cluster anomality is not postulated here, but rather derived from the K-means square-error criterion (1). To give intuition to the concept of anomalous cluster, consider a uniformly distributed one-feature data set whose range is between a and b reals. If a need in dividing the set in two clusters emerges, it would be reasonable to divide the interval [a, b] in two halves separated by the mid-point (a + b)/2. By contrast, the EXTAN algorithm would first find an anomalous cluster around a, constituting a chunk between a and half-distance between a and the grand mean, which should be near the mid-point (a + b)/2—that is, the interval [a, (3a + b)/4]. The next anomalous cluster would be on the right end, constituting an interval about [(a + 3b)/4,b]. These are the largest anomalous clusters. A run of bisecting K-means starting from these cluster centers would converge into the original two clusters separated by the mid-point. Therefore, even in such a difficult case as the uniform distribution, the concept of anomalous cluster as the starting point of K-means is consistent with our intuition. The anomalous-cluster approach was experimentally validated in Chiang and Mirkin (2010) and de Amorim et al. (2016).

We compared our results with those found via a more conventional method. Specifically, we applied a conventional version of K-means implemented in the popular computational environment Matlab (Matlab 2018). This version uses the so-called K ++means method by Arthur and Vassilivitskii (2007). The program still needs a user-defined number of clusters K. It generates a random set of K seeds using the principle that the next seed must be far away from the previous ones. This introduces a structure in the process of randomly generating initial centers, but leaves the selection random, thus each time different. In the Matlab version, initial centers are generated five times, and from the resulting five partitions, the program generates the minimum value of criterion (1).

We ran the Matlab operation K-means at K = 6 on our data, standardized as usual by subtraction of the grand mean from each feature and division over the feature range.

We obtained 6-cluster solutions for both standardized data sets: 2007 and 2012. Below is a contingency table for two partitions, one with clusters-2007 described above, and the other found by the Matlab command K-means on the same dataset. The former corresponds to rows, the latter to columns (Table 7):

Table 7 Contingency table of cluster membership in our results vs. K-means

Essentially, most objects are stable under two clusterings. Hypothetically, if 243 out of 246 objects of our cluster 1 go to a same cluster in the Matlab’s partition, 250 out of 258 objects of cluster 2 remain stable, etc. Altogether, 1587 objects, which comprise 85.8% of the total 1850, are stable. A popular measure of agreement exists between partitions: Cohen’s kappa (see Fleiss et al. 1969). Largely, the Cohen’s kappa is a measure of stability. In our case, kappa = 0.8299, which is an “almost perfect match”, according to Landis and Koch (1977). When considering the average profiles within Matlab clusters, we found very similar values; so our interpretation remains valid here.

It should be mentioned, though, that the results of Matlab’s K-means are rather random. Therefore, we generated a hundred 6-cluster partitions with that program; the average kappa for them is 0.7977 with a standard deviation of 0.1565 which is about 20% of the average.

Regarding the data 2012, we obtained a Matlab K-means partition whose contingency table with our clustering is in Table 8 giving an even better match, with kappa = 0.9135.

Table 8 Contingency table of cluster membership in our results vs. K-means

Further computations, with a hundred runs of Matlab kmeans, lead to somewhat more modest, but still rather high, average kappa equal to 0.7858 with the standard deviation of 0.1290.

These results support our conclusions.

6 Conclusion

In this paper, we assume that the four dimensions—Environmental, Social/Stakeholder, Labor, and Governance—in the 4D list appropriately represent the structure of the CSR process, albeit in a somewhat aggregate form. To analyze the status and tendencies of the CSR activities among the most advanced global companies, we use a set of companies that covers two separate moments in time. To find out structures of prevailing patterns, we develop a version of K-means method, creating more flexibility in the number of clusters than the conventional K-means has. To this end, we consider a clustering criterion emerging in the context of data-scatter decomposition in two items: the square error and complementary criterion. This complementary criterion explicitly states the goal of clustering as that of finding big anomalous clusters. A method EXTAN for one-by-one finding the clusters to maximize separate items in the complementary criterion is developed to more reasonably determine the right number of clusters. The method is applied as a precursor to the conventional K-means run over the MSCI data sets.

The cluster structures found for 2007 and 2012 data sets lead us to conclusions that can be stated, in brief, as follows.

An unexpected phenomenon is observed. As recently as of 2007, the overall CSR process on the level of a single corporation was as yet at an early stage of the CSR efforts, more or less uniformly distributed over all the dimensions in the 4D List, both on high and low levels. We can see a process of change from the predominantly uniform patterns of CSR activities in 2007 to the predominantly single-focus patterns of CSR activities in 2012. This process is probably influenced by the Great Recession of 2008/9. This leads us to hypothesize that this process probably will continue to cover not only the Stakeholder Capital dimension in full, as is currently the case, but also in the directions of focusing at other dimensions, as well as to double-focus and triple-focus efforts in the near future. These increased efforts should lead to a more complex cluster structure of the CSR activities in the future.

Limitations of this study lie with the limited scope of the MSCI database and empirical nature of cluster analysis.

Directions for further research involve, on one hand, increasing the granularity of the inputs and, on the other hand, investigating interrelation between managerial attitudes, financial and reputational outcomes, and CSR processes within the clusters to formulate prescriptions for both the managers and the policy-makers.

The results from this study have implications for marketing practitioners. Companies striving to do well along each of the four CSR dimensions belong to an elite group of the best performing companies. At companies with aims to enter this elite milieu, marketing practitioners would be wise to raise their stakeholders’ awareness of that fact. In companies with a single-focus CSR profile, marketing practitioners would benefit from tracking and following the communication efforts of other companies with a similar profile, both inside and outside the industry. This, by no means, implies that the companies’ CSR efforts should be somehow just an insincere “marketing ploy”. Rather, these are suggestions for framing the dialog with the consumers with the view that business ethics is undergoing a deep transformation to include the social responsibility as its immanent and important part.

For scholars and analysts studying the impact of various CSR activities, this paper may lay the groundwork for further empirical investigation of CSR as a multidimensional process. In particular, CSR profiles may provide important variables mediating and moderating the financial and reputational impact of CSR. Additionally, this paper describes a useful and practical modification of the K-means method that provides a different angle to efforts in finding the right number of clusters and that may serve as an example of how not to determine the number of clusters ad hoc, but rather to derive them from the data.

Policy-makers should note that these results suggest that companies may increasingly concentrate on only some CSR dimensions at the expense of others.