1 Introduction

Efficiency analysis has been a core issue of marketing since the discipline’s origins [e.g., 2, 35, 36]. In general, efficiency analysis assesses the relative productivity of comparable decision-making units (DMUs), e.g., competing firms in a market, stores of a retailer, geographic branches of a sales organization, etc. According to such analyses, a 100 % efficient DMU is one that produces the most output(s) given the same input(s) as other DMUs or, equivalently, uses the least input(s) to produce the same level of output(s) as the other DMUs being compared. To understand how to better convert scarce marketing resources into improved performance outcomes, marketing scholars have conducted both internal and external efficiency benchmarking analyses in a variety of contexts, e.g., assessing relative efficiencies of stores in a retail chain [17], sales districts of a sales organization [20], and large banks in the banking industry [25].

However, to date, efficiency analyses in marketing have ignored the fact that, in many contexts, the DMUs in question are composed of “sub-DMUs,” e.g., organizational departments, that are linked in that some or all of the outputs of one department’s productive activity may serve as inputs for another’s activity. For example, customer leads generated by the marketing department of an organization are critical inputs for the sales department’s activity of converting prospects into customers, while orders booked by sales are critical inputs for production, and so on. Moreover, in many instances, organizational departments are interlinked in that some or all of the outputs of one department serve as inputs for the other and vice versa. For example, revenues from the customer orders successfully booked by sales may drive marketing efforts that yield leads utilized by sales to produce further orders. Intuitively, the overall efficiency of a DMU is an aggregation of the efficiency of each sub-DMU or department after accounting for the portion of its inputs (outputs) supplied (utilized) by other sub-DMUs with which it interfaces. Marketing scholars argue that organizational performance is impacted by coordination (or lack thereof) at various marketing-related interfaces such as the marketing–sales interface [e.g., 22, 33], the marketing–production interface [e.g., 12, 13, 19], and the marketing–R&D–sales interface [e.g., 11, 14, 18]. However, so far, no study has benchmarked the overall marketing efficiency of DMUs while accounting for the respective efficiencies of the interfacing sub-DMUs or departments of which they are composed. Such an analysis promises to provide deeper insight into the specific sub-DMU interfaces that need greater attention in order to boost overall DMU productivity. Our research in this paper aims to fill this gap in the extant organization-level marketing efficiency analysis literature.

More formally, we identify three criteria that must be met by a method for assessing relative efficiencies of a set of DMUs that accounts for sub-DMU level efficiencies:

  1. Produce efficiency scores at the sub-DMU level: When the relative efficiencies of a set of DMUs are expressed in terms of sub-DMU level relative efficiency scores, the sub-DMU(s) that contribute more or less to overall efficiency can be identified. Such an analysis is likely to be of greater diagnostic value than the typical aggregate analysis for managers considering where and how to intervene to boost the overall productivity of a DMU.

  2. Model the interconnectedness among sub-DMUs when producing efficiency scores: Modeling the interconnectedness among sub-DMUs helps produce efficiency scores that explicitly adjust for the portion of a sub-DMU’s inputs (outputs) supplied (utilized) by other sub-DMUs with which it interfaces, and corrects for any upward/downward bias that might arise when the interfaces between sub-DMUs are ignored.

  3. Provide statistical inference about the impact of sub-DMU efficiency: When sub-DMUs within a DMU are interconnected, an increase in each sub-DMU’s efficiency should contribute not only to its own outputs but also to the outputs of all other sub-DMUs with which it interfaces. It is therefore meaningful to provide metrics that document the impact of an increase in sub-DMU efficiency on sub-DMU and overall DMU output. Since numerous random factors (e.g., luck, weather) could alter the impact of each sub-DMU’s efficiency on overall DMU efficiency, it is useful to provide an uncertainty range around the impact metric. Thus, we should control for the role of random error and need a means to estimate (rather than simply calculate) and statistically test (rather than assume) the impact of an increase in sub-DMU efficiency on each DMU output.

A review of the literature on efficiency analysis techniques, e.g., data envelopment analysis (DEA) [8] and stochastic frontier analysis (SFA) [1], reveals that no available method meets all three criteria noted above. More specifically, the DEA approach, developed in operations research [8], is a nonparametric way to relate output(s) to input(s) without assuming any restrictive form for the production function [e.g., 44, 45]. Network DEA (NDEA), developed in [15], conceptualizes a DMU as a set of smaller interconnected sub-DMUs (departments) and calculates efficiency scores for the DMU and each sub-DMU. However, neither DEA nor NDEA provides statistical inference on the impact of efficiency. In contrast, SFA, developed in economics [1], parametrically estimates the relationship between an output and various inputs, providing formal statistical inference about the impact of an increase in efficiency on outputs [42]. However, SFA does not model the interconnectedness among sub-DMUs.

In this paper, our goal is to develop and demonstrate an approach to benchmarking DMUs that meets all three criteria simultaneously. This new two-stage approach not only employs advanced forms of both DEA and SFA that preserve their basic advantages (discussed in the next section) but also permits efficiency analysis of DMUs with interconnected sub-DMUs. Specifically, the approach combines NDEA with multivariate sliced inverse regression (MSIR) [24]. While NDEA allows us to effectively characterize the interfaces among the various sub-DMUs, MSIR provides nonparametric statistical inference about the impact of an increase in sub-DMU efficiency on each DMU-level output of interest.

We demonstrate the benefits of the proposed approach in a practical context currently of great interest to marketers and media economists. Specifically, our empirical application involves external benchmarking of print newspaper firms, where output–input links exist between key sub-DMUs (the newsroom or editorial, distribution, and advertising space sales departments). Facing tough economic times in the USA today, newspaper firms are struggling to improve their efficiency and profitability [6]. They would benefit from a benchmarking methodology that identifies the firms that are most efficient and reveals the relative efficiencies of individual sub-DMUs, taking into account their direct and indirect impacts on the outputs of interest to the firms. For example, the advertising sales output of the sales department is of value in itself and also serves as an input impacting a newspaper’s circulation, the main output of the newsroom department.

Using syndicated data from over 300 newspapers, we first obtain the efficiencies of the newsroom, distribution, and sales force sub-DMUs for each newspaper and then assess the impact of these efficiencies on the performance of the newspapers. We also replicate the analyses using another sample of over 200 newspapers. We find that the proposed method outperforms the current aggregate approach and offers substantive guidance to the newspaper industry. For example, the newsroom department is the most efficient of the three, yet an increase in newsroom efficiency does not statistically increase newspaper outputs, whereas increases in sales force and distribution efficiencies increase both subscriptions and ad-space revenues.

The rest of the paper is organized as follows. Section 2 reviews the available procedures for efficiency analysis and their limitations with respect to modeling sub-DMUs and their interfaces. Section 3 develops the proposed method, Section 4 presents the empirical analysis, and Section 5 concludes by summarizing the key takeaways.

2 Review of Existing Efficiency Analysis Techniques

In this section, we describe the commonly used approaches for efficiency analysis and highlight their limitations when analyzing the efficiency of interconnected sub-DMUs in conjunction with the overall efficiency of a DMU. This review motivates the approach, developed in Section 3, that meets all three desired criteria.

2.1 Data Envelopment Analysis

Consider the comparison of \( n \) DMUs that each use \( m \) inputs (vector \( \boldsymbol{x}_0 \)) to produce \( s \) outputs (vector \( \boldsymbol{y}_0 \)). DEA uses the data \( (\boldsymbol{x}_0, \boldsymbol{y}_0) \) to calculate the scalar efficiency of a DMU as a weighted ratio of the DMU’s outputs to the DMU’s inputs. DEA calculates the DMU’s weighting scheme and scalar efficiency simultaneously [8]. Specifically, DEA sets up a linear program that optimally determines the weights enabling each DMU to maximize its weighted ratio of outputs to inputs (efficiency), subject to the constraint that, under those weights, no DMU’s efficiency exceeds unity. Thus, DEA calculates every DMU’s weights at the point where that DMU appears most efficient. Appendix 1 presents the details of calculating the efficiency scores. A DMU is 100 % efficient if its score is unity.
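The exact linear program appears in Appendix 1 and is not reproduced here. Purely to illustrate the mechanics, the sketch below solves the standard CCR multiplier form for a single DMU with an off-the-shelf linear-programming routine; the function name, toy data, and solver choice are our own assumptions rather than part of the appendix formulation.

```python
import numpy as np
from scipy.optimize import linprog

def ccr_efficiency(X, Y, k):
    """CCR multiplier form for DMU k.  X: (n, m) inputs, Y: (n, s) outputs.
    Returns the efficiency score; 1.0 means DMU k lies on the empirical frontier."""
    n, m = X.shape
    s = Y.shape[1]
    # Decision vector z = [u (s output weights), v (m input weights)], all >= 0.
    c = np.concatenate([-Y[k], np.zeros(m)])            # maximise u'y_k
    A_eq = np.concatenate([np.zeros(s), X[k]])[None]    # normalisation: v'x_k = 1
    A_ub = np.hstack([Y, -X])                           # u'y_j - v'x_j <= 0 for every DMU j
    res = linprog(c, A_ub=A_ub, b_ub=np.zeros(n), A_eq=A_eq, b_eq=[1.0],
                  bounds=[(0, None)] * (s + m), method="highs")
    return -res.fun

# Toy example: 5 DMUs, 2 inputs, 1 output
rng = np.random.default_rng(0)
X = rng.uniform(1, 10, size=(5, 2))
Y = rng.uniform(1, 10, size=(5, 1))
print([round(ccr_efficiency(X, Y, k), 3) for k in range(5)])
```

Because each DMU is evaluated under its own most favorable weights, a score of unity means that no admissible weighting of the observed data can make that DMU look inefficient.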

Column 2 in Table 1 below evaluates the use of DEA for efficiency analysis of DMUs with interconnected sub-DMUs. To apply DEA to our setting, we could simply combine all the outputs that come out of a DMU and all the inputs that go into it, regardless of which sub-DMU they belong to, and then obtain the DMU’s efficiency score with respect to the conversion of all inputs to outputs. However, this would treat a DMU as a “black box,” and the sub-DMUs and their interfaces would not be modeled. Conversely, one could perform as many DEA analyses as there are sub-DMUs by treating each sub-DMU as an entire DMU that uses inputs to produce outputs. While this would overcome the challenge of providing sub-DMU level scores, it would ignore the interfaces shared by the sub-DMUs. Finally, since DEA is purely deterministic, it cannot provide statistical inference about the impact of efficiency [26]. Therefore, it is not a comprehensive tool for our purpose (see Table 1).

Table 1 Comparison of efficiency analysis techniques

2.2 Stochastic Frontier Analysis

SFA explicitly accounts for the role of random errors in computing efficiency (e.g., [1]). Consider the comparison of \( n \) DMUs using \( m \) inputs in vector \( \boldsymbol{x}_0 \) to produce the scalar output \( y_0 \). SFA assumes that a DMU uses a production technology \( f(\boldsymbol{x}_0, \boldsymbol{\beta}) \) to produce \( y_0 \), where \( \boldsymbol{\beta} \) represents the slope coefficient vector corresponding to \( \boldsymbol{x}_0 \).

SFA challenges the assumption that DMUs actually produce as much as their technology function would imply. Specifically, while each of the n DMUs may try to produce output according to its technology, it may fall short by an amount \( v_k \) (k = 1, …, n) due to inefficiency. Efficiency in SFA is the ratio of the observed output produced by a DMU to the maximum feasible output producible by the DMU. Additionally, SFA incorporates a random error term (u) in our knowledge of a DMU’s production function to account for the myriad random factors not explicitly included in the model. In sum, we represent a DMU’s output as:

$$ y_0 = f\left(\boldsymbol{x}_0, \boldsymbol{\beta}\right) \exp\left(\varepsilon_{\mathrm{SFA}}\right) $$
(1)

where \( \varepsilon_{\mathrm{SFA}} \) is the difference between the normal error term (u) and the one-sided inefficiency term (v > 0). Thus,

$$ y_0 = f\left(\boldsymbol{x}_0, \boldsymbol{\beta}\right) \exp\left(u - v\right) $$
(2)

Column 3 in Table 1 evaluates the use of SFA for efficiency analysis of DMUs with interconnected sub-DMUs. Similar to DEA, one could again perform as many SFA analyses as there are sub-DMUs by treating each sub-DMU as an entire DMU that obtains inputs to produce outputs. This would provide sub-DMU efficiency scores but ignore the interfaces shared by the sub-DMUs. On the positive side, maximum likelihood estimation provides estimates of and inference on β as well as DMU-specific estimates of inefficiency (\( v_k \)). However, SFA requires pre-specification of a functional form f(⋅). In light of these limitations, SFA also falls short of meeting the first two of our three desired criteria.
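To make the estimation step concrete, the following sketch fits Eq. 2 by maximum likelihood under the common normal/half-normal error assumptions and a Cobb-Douglas \( f(\cdot) \). It uses simulated data and our own variable names; it is a minimal illustration of the composed-error idea, not the specification estimated later in the paper.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def neg_loglik(params, lnx, lny):
    """Normal/half-normal SFA: ln y = b0 + lnx @ b + u - v, with v >= 0 (inefficiency)."""
    m = lnx.shape[1]
    b0, b = params[0], params[1:1 + m]
    sigma, lam = np.exp(params[1 + m]), np.exp(params[2 + m])  # sigma^2 = s_u^2 + s_v^2, lam = s_v / s_u
    eps = lny - b0 - lnx @ b                                   # composed error u - v
    ll = (np.log(2.0 / sigma) + norm.logpdf(eps / sigma)
          + norm.logcdf(-eps * lam / sigma))
    return -ll.sum()

# Simulated Cobb-Douglas data with one-sided inefficiency
rng = np.random.default_rng(1)
n, m = 200, 2
lnx = rng.normal(size=(n, m))
u = rng.normal(scale=0.1, size=n)                  # symmetric noise
v = np.abs(rng.normal(scale=0.3, size=n))          # one-sided inefficiency
lny = 1.0 + lnx @ np.array([0.6, 0.3]) + u - v

fit = minimize(neg_loglik, np.zeros(m + 3), args=(lnx, lny), method="BFGS")
print(np.round(fit.x[:m + 1], 2))                  # intercept and slope estimates
```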

2.3 NDEA

NDEA belongs to the class of DEA models in that it also uses mathematical programming to calculate the weights and efficiency of a DMU. The basic innovation of NDEA, developed in [15], is that it breaks a DMU into a set of smaller interconnected sub-DMUs (departments) and calculates the overall efficiency score of the DMU as well as each sub-DMU’s efficiency score. While doing so, it also recognizes that sub-DMUs may be structured so that one sub-DMU’s output may serve as another’s input. Performing benchmarking through NDEA involves specifying the interfaces within a DMU and then solving a linear program similar to that of standard DEA. Appendix 2 provides the details for calculating the weights and efficiency scores via mathematical programming (see [15]).

Column 4 in Table 1 evaluates the use of NDEA for efficiency analysis of DMUs with interconnected sub-DMUs. Here, we would identify the inputs, outputs, and the interfaces across sub-DMUs. By obtaining sub-DMU and DMU scores while accounting for the interconnectedness due to the sub-DMU interfaces, NDEA addresses the first two issues in Table 1. However, NDEA is purely deterministic and does not provide statistical inference about the impact of increasing a sub-DMU’s efficiency on the DMU’s outputs. Therefore, NDEA also falls short of meeting all three criteria.

2.4 Two-Stage (DEA + Regression) Approach

Arnold et al. [3] introduce a two-stage efficiency approach to combine the nonparametric advantages of DEA and the inference capabilities of SFA. They propose the joint use of DEA and ordinary least squares (OLS), i.e., they include the results from a DEA analysis in the linear regression of outputs on inputs.

Specifically, for a set of n DMUs, they perform a standard DEA (stage 1) to obtain the efficiency score of each DMU. Their second stage consists of two parts in the case of a DMU producing multiple outputs. In part 1, they condense the multiple outputs into one composite output via canonical correlation of outputs on inputs. Specifically, if a DMU produces s outputs \( y_j \) (j = 1, …, s), then they define the composite output CY by

$$ \mathrm{CY} = \sum_{j=1}^{s} \gamma_j \ln y_j, $$
(3)

where \( \gamma_j \) represents the canonical coefficients obtained between the outputs and inputs (see, e.g., [40]). In part 2, they regress the composite output CY on the multiple inputs and the efficiency score of the DMU. Denoting the efficiency score by ρ, their approach assumes the Cobb-Douglas (double log) functional form as follows:

$$ \ln \mathrm{CY} = \tau_0 + \tilde{\boldsymbol{x}}_0^{\prime} \boldsymbol{\tau}_1 + \tau_e \rho + \varepsilon_{\mathrm{CY}}, $$
(4)

where \( \tilde{\boldsymbol{x}}_0 \) is the m × 1 vector of log inputs, \( \ln \boldsymbol{x}_0 \); \( \boldsymbol{\tau}_1 \) is the m × 1 slope coefficient vector corresponding to \( \tilde{\boldsymbol{x}}_0 \); and \( \tau_e \) captures the impact of efficiency on the composite output CY.
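As a rough sketch of this second stage (Eqs. 3 and 4), the code below builds a composite output from the first canonical direction between the log outputs and log inputs and then regresses it on the log inputs and an efficiency score. The efficiency scores here are random placeholders standing in for the stage-1 DEA results, and all names and data are our own assumptions.

```python
import numpy as np
import statsmodels.api as sm
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(2)
n, m, s = 150, 3, 2
lnX = rng.normal(size=(n, m))                                   # log inputs
lnY = lnX @ rng.uniform(0.2, 0.8, size=(m, s)) + rng.normal(scale=0.2, size=(n, s))
rho = rng.uniform(0.4, 1.0, size=n)                             # placeholder stage-1 efficiency scores

# Part 1: composite output from the first canonical direction (cf. Eq. 3)
cca = CCA(n_components=1).fit(lnX, lnY)
_, CY = cca.transform(lnX, lnY)                                 # canonical score of the (centred) log outputs
CY = CY.ravel()

# Part 2: regress the composite on the log inputs and efficiency (cf. Eq. 4)
Z = sm.add_constant(np.column_stack([lnX, rho]))
ols = sm.OLS(CY, Z).fit()
print(ols.summary().tables[1])                                  # tau_1 slopes and tau_e with t tests
```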

Column 5 in Table 1 evaluates the use of the two-stage (DEA + regression) technique for efficiency analysis of DMUs with interconnected sub-DMUs. The first stage involves standard DEA and hence cannot take into account the interfaces among the various sub-DMUs. The two-stage approach does allow for statistical inference, since it calculates ρ from a nonparametric DEA and carries out inference about \( \tau_e \) via OLS. In a simulation study, Bardhan et al. [5] show that this two-stage method performs better at retrieving the true parameters of known (simulated) production functions than DEA and SFA taken individually. However, because the Arnold et al. [3] method uses standard DEA, sub-DMU efficiency scores and their interconnectedness are not modeled.

3 Method Development

As the previous section shows, extant methods do not simultaneously satisfy all the three methodological criteria needed for efficiency analysis of DMUs with interconnected sub-DMUs (see Table 1). Therefore, in this section, we propose a novel dimension reduction technique introduced in the statistics literature (multivariate sliced inverse regression) which, when combined with NDEA, provides the solution that we are seeking. Similar to Arnold et al. [3], the proposed method employs a two-stage approach that combines the usefulness of DEA-based techniques and statistical techniques but at each stage employs an augmented procedure that preserves their respective advantages throughout the analysis.

3.1 Modeling the Interconnectedness Among Sub-DMUs

The first stage of the Arnold et al. method to compute efficiency ignores the interconnectedness among sub-DMUs within a DMU. To embed these interfaces, we use NDEA to calculate efficiency scores for both the DMU and sub-DMUs. That is, we allow for some sub-DMUs’ output to serve as other sub-DMUs’ input.

3.2 Incorporating the Nonparametric Structure

To avoid pre-specifying a functional form such as Cobb-Douglas in the second stage of Arnold et al., we apply link-free regression via sliced inverse regression (SIR) [10]. The general representation of a SIR model is as follows:

$$ Y = g\left(\boldsymbol{X}\alpha_1, \boldsymbol{X}\alpha_2, \cdots, \boldsymbol{X}\alpha_r, \varepsilon_{\mathrm{SIR}}\right), $$
(5)

where Y is the n × 1 vector of observations on a univariate output, g(⋅) is an unknown link function, X is an n × m matrix of m inputs, \( \alpha_j \) (j = 1, …, r) are m × 1 coefficient vectors (r < m), and \( \varepsilon_{\mathrm{SIR}} \) is an error term that need not be normally distributed (it belongs to a broad class of elliptically contoured distributions). Although a generalized eigenvalue decomposition yields all r vectors \( \alpha_j \) (see [23]), for our purpose, we extract only the principal eigenvector to estimate the model \( Y = g(\boldsymbol{X}\alpha_1, \varepsilon_{\mathrm{SIR}}) \) without pre-specifying the link function g(⋅). Appendix 3 provides the details of estimation and inference for the vector \( \alpha_1 \).
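Appendix 3 contains the estimator we actually use; the self-contained sketch below illustrates the basic SIR recipe that underlies it: standardize X, slice the observations on the response, average the standardized inputs within slices, and take the leading eigenvector of the resulting matrix. The slicing scheme and the toy single-index data-generating process are our own assumptions.

```python
import numpy as np

def sir_first_direction(X, y, n_slices=10):
    """Sliced inverse regression: return the leading effective-dimension-reduction direction."""
    n, m = X.shape
    Xc = X - X.mean(axis=0)
    evals, evecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T        # whitening transform
    Z = Xc @ inv_sqrt                                          # standardised inputs
    order = np.argsort(y)                                      # equal-frequency slices of the response
    M = np.zeros((m, m))
    for idx in np.array_split(order, n_slices):
        mz = Z[idx].mean(axis=0)                               # slice mean of the standardised inputs
        M += (len(idx) / n) * np.outer(mz, mz)
    w = np.linalg.eigh(M)[1][:, -1]                            # leading eigenvector
    alpha = inv_sqrt @ w                                       # map back to the original X scale
    return alpha / np.linalg.norm(alpha)

# Toy check: y depends on X only through a single linear index, via a nonlinear link
rng = np.random.default_rng(3)
X = rng.normal(size=(500, 4))
true = np.array([0.8, 0.6, 0.0, 0.0])
y = np.exp(X @ true) + rng.normal(scale=0.1, size=500)
print(np.round(sir_first_direction(X, y), 2))   # approximately +/- the true direction
```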

To accommodate multiple outputs, we apply MSIR, where \( \boldsymbol{Y} \) is an n × s matrix of n observations on s outputs. We create a composite dependent variable \( \boldsymbol{Y}\theta_1 \), where \( \theta_1 \) is the s × 1 vector of weights in the model:

$$ \boldsymbol{Y}{\theta}_1=g\left(\boldsymbol{X}{\alpha}_1,{\varepsilon}_{\mathrm{MSIR}}\right). $$
(6)

Appendix 3 describes how to estimate (θ 1, α 1) via the inverse regression theory (see [9, 23]) without pre-specifying the link function g(⋅). We denote the resulting composite vectors as \( {\mathrm{COMP}}_Y=\boldsymbol{Y}{\widehat{\theta}}_1 \) and \( {\mathrm{COMP}}_X=\boldsymbol{X}{\widehat{\alpha}}_1 \).

MSIR generalizes the second stage of Arnold et al. as follows. First, we estimate COMP_X and COMP_Y without pre-specifying the functional form g(⋅); the canonical correlation in part 1 of the Arnold et al. approach is a special case of MSIR in which the link function g(⋅) is assumed to be linear, whereas MSIR imposes no such restriction. Second, once we extract COMP_X and COMP_Y, we can either create a scatter plot to visually assess the shape of the link function g(⋅) (e.g., [16]) or choose an appropriate function, Cobb-Douglas or other, post hoc. In contrast, in part 2 of the Arnold et al. approach, the link function g(⋅) is pre-specified to be of the log–log type; MSIR instead discovers the shape nonparametrically, which can then be used to post-specify the relevant parametric link function.
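A minimal illustration of this visual check, with simulated composites standing in for the MSIR estimates: plot COMP_Y against COMP_X and overlay a rough smoother to reveal curvature before committing to a parametric link.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(4)
COMP_X = rng.uniform(0.5, 3.0, size=200)                        # placeholder composites
COMP_Y = COMP_X ** 0.6 + rng.normal(scale=0.05, size=200)

order = np.argsort(COMP_X)
smooth = np.poly1d(np.polyfit(COMP_X, COMP_Y, 3))               # rough cubic smoother
plt.scatter(COMP_X, COMP_Y, s=10, alpha=0.5, label="firms")
plt.plot(COMP_X[order], smooth(COMP_X[order]), label="smoother")
plt.xlabel("COMP_X"); plt.ylabel("COMP_Y"); plt.legend(); plt.show()
```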

3.3 Algorithm for the Proposed Method

  Stage 1: Sub-DMU Efficiency Computation

    • Identify the sub-DMUs, inputs, outputs, and linked inputs of each sub-DMU of the DMU. Compute the efficiency scores of each sub-DMU. Appendix 2 provides the computational details.

  Stage 2: Nonparametric Estimation and Inference

    • Part 1: Perform an MSIR estimation where Y includes all the outputs produced by the DMU and X includes all the inputs used by the DMU together with the efficiency scores of each sub-DMU. Extract the composites, COMP_Y and COMP_X, without pre-specifying the link function g(⋅). Appendix 3 provides the estimation and inference details.

    • Part 2: Plot the extracted composites, COMP_Y versus COMP_X, to determine an appropriate functional form COMP_Y = g(COMP_X). Test whether a sub-DMU’s efficiency significantly increases the outputs.

We next apply this algorithm to US print newspaper firms. We first describe the data set and the interfaces among the sub-DMUs within a newspaper firm and then discuss the empirical results and substantive insights.

4 Empirical Application

We focus on the newspaper industry for our empirical application for the following reasons. First, the newspaper industry is characterized by the presence of a few large companies that have each acquired several hundred independently operating DMUs [30]. For example, the leading media giant Gannett Incorporated owns 82 daily newspapers that function as independent DMUs. Efficiency analysis could provide timely and accurate guidance to DMUs performing below the mark. Second, recent years have seen a wave of debatable consolidation and deconsolidation decisions by media firms. For example, Clear Channel Communications Inc. grew to dominate the US radio market by acquiring DMUs from nearly 70 companies. Such acquisitions drew criticism because analysts did not consider all of the acquired DMUs to be strategic assets. Efficiency analysis helps in acquisition situations when consolidating firms seek to replace or discard poorly performing DMUs [34]. Third, daily newspapers sorely need guidance on the efficient allocation of resources, given that they are going through tough economic times due to dwindling circulation (e.g., [6]). After several interviews with newspaper owners and managers of syndicated data collection agencies, we find that no formal efficiency analysis has been performed to benefit the industry at large. Moreover, no academic benchmarking application exists to gauge the productivity of media firms. Hence, we acquired data from a syndicated press association to perform efficiency analysis, which we describe next.

4.1 Data

The Inland Press Association (IPA) provided the data. The IPA venture began as a service to small and medium newspapers (with daily circulation of less than 85,000) that would otherwise lack reference to industry norms. Since 1916, the IPA has kept annual records on the financial aspects of hundreds of US print newspapers. The IPA database includes data on the costs incurred by various departments of a newspaper (number of employees, expenses on equipment), the revenues obtained, and the profits generated. For our analysis, we obtained data from 310 newspapers in 1999. We also perform a replication analysis with data on 225 newspapers from 2002. To protect newspaper confidentiality, the IPA does not reveal information about newspapers’ identities and locations; hence, we treat the 1999 and 2002 datasets as separate cross-sectional samples.

Each newspaper is a stand-alone DMU comprised of three main sub-DMUs: the newsroom for creating news content, the distribution department for circulation, and the sales force for selling advertising space. The inputs utilized by each of these sub-DMUs are employees and equipment. Their outputs are measured by the pages of news content produced (in inches), the number of subscriptions sold, the amount of ad space sold (in inches), and the amount of local, national, and classified advertising revenue.

Table 2 presents the descriptive statistics of the outputs and inputs for 1999 and 2002. The newsroom is the most highly staffed sub-DMU with 40 % of all the employees, whereas the distribution sub-DMU contributes the most to equipment expenses.

Table 2 Descriptive statistics

4.2 Sub-DMU Efficiency Computation

4.2.1 Interfaces of the Newspaper Organizational Structure

Newsroom

The role of the employees in the newsroom is to produce and market a product of high quality and integrity that positively impacts circulation [27]. Specifically, the employees and equipment in the newsroom (inputs) combine to generate news content and subscribers (outputs). Local newspapers’ own staff writes most of their news; external wire services contribute much less. As shown in past research (e.g., [38]), the type (retail, national, classified) and amount of ad space also impact newsroom outputs. Therefore, we treat outputs of the sales force department as linked inputs to the newsroom department.

Distribution

Its responsibility is to deliver the newspaper to subscribers effectively. A newspaper’s delivery systems are strategic assets [28]. A newspaper’s distribution elasticity is large (0.23) and significant [27], suggesting that a 10 % increase in distribution intensity results in a 2.3 % increase in sales. Because delivering a print newspaper on time and in good condition retains subscribers’ interest in the service, the main output of the distribution department is also the number of subscribers.

Sales Force

Its role is to sell ad space to advertisers. The total space sold and the revenues from retail, national, and classified ads are the outputs of sales force efforts [43]. Over 85 % of ads in newspapers promote a product or service; the remaining ads promote events such as public meetings and personal messages [29]. The outputs produced by the newsroom (pages of content, subscriptions) and the distribution department (subscriptions) influence advertisers’ decisions to buy ad space in a newspaper [37]. Therefore, we treat the outputs from the newsroom and distribution departments as linked inputs to the sales force [32].

Figure 1 summarizes the newspaper organizational structure. The gray boxes show the newsroom, distribution, and sales force sub-DMUs, and block arrows point from the input boxes towards these departments. We depict subscriptions and pages of content (newsroom outputs) to the right of the newsroom department box, with block arrows pointing from the newsroom towards the outputs. The distribution department is also responsible for producing subscriptions, and hence a block arrow points from the distribution department towards subscriptions. We show ad space and ad revenue (sales force outputs) to the right of the sales force department box, with block arrows pointing from the sales force department towards the outputs.

Fig. 1

Interconnectedness of the newspaper organizational structure

Turning to the linked inputs, since ad revenue and ad space (sales force outputs) serve as inputs to the newsroom, we draw a dashed arrow pointing from ad space and ad revenue to the newsroom department. Similarly, since the sales force uses subscriptions and content pages as linked inputs from the newsroom, we draw a dashed arrow pointing from subscriptions and pages of content to the sales force department. This description shows us how the newsroom, distribution, and sales force sub-DMUs are intricately linked.
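For readers who wish to encode this structure for a stage-1 (NDEA-style) computation, a hypothetical representation of Fig. 1 might look as follows. The field and variable names are illustrative, not the IPA’s, and the actual linear program is given in Appendix 2.

```python
# Hypothetical encoding of the sub-DMU network in Fig. 1; names are illustrative only.
newspaper_network = {
    "newsroom": {
        "inputs": ["newsroom_employees", "newsroom_equipment"],
        "outputs": ["pages_of_content", "subscriptions"],
        "linked_inputs_from": {"sales_force": ["ad_space", "retail_ad_revenue",
                                               "national_ad_revenue", "classified_ad_revenue"]},
    },
    "distribution": {
        "inputs": ["distribution_employees", "distribution_equipment"],
        "outputs": ["subscriptions"],
        "linked_inputs_from": {},
    },
    "sales_force": {
        "inputs": ["sales_employees", "sales_equipment"],
        "outputs": ["ad_space", "retail_ad_revenue",
                    "national_ad_revenue", "classified_ad_revenue"],
        "linked_inputs_from": {"newsroom": ["pages_of_content", "subscriptions"],
                               "distribution": ["subscriptions"]},
    },
}

# Sanity check: every linked input must be an output of the sub-DMU that supplies it.
for dept, spec in newspaper_network.items():
    for src, variables in spec["linked_inputs_from"].items():
        missing = set(variables) - set(newspaper_network[src]["outputs"])
        assert not missing, f"{dept}: {missing} not produced by {src}"
```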

4.2.2 Empirical Results

As in Tone and Tsutsui [41], we apply linear programming to obtain the efficiency scores and compare them to those from the traditional DEA model. Table 3 provides the mean efficiency scores of the newspapers in 1999 and 2002. The standard DEA-based mean overall efficiency is 0.77 for 1999 and 0.86 for 2002. In contrast, the proposed method, which captures the interconnectedness, estimates the mean overall efficiency as 0.53 for 1999 and 0.62 for 2002. The smaller scores reflect the more conservative nature of the proposed method: because it recognizes more linked variables, it penalizes DMUs for inefficiencies that the aggregate analysis overlooks.

Table 3 DMU and sub-DMU efficiency

Furthermore, the proposed method provides the sub-DMU efficiencies as well as the overall DMU efficiency for each firm. This information is unavailable from the standard DEA analysis. In Table 3, we see that the newsroom’s efficiency is higher than the efficiencies of the distribution and advertising sales departments in both 1999 and 2002. This finding is interesting because newsrooms were the major focus of cutbacks during 2002–2010 [39], even though they appear to operate at high efficiency relative to the other departments.

Table 4 further reveals the benefits of the proposed method relative to the standard DEA. Specifically, the proposed method identifies and discriminates a few outstanding performers from the rest. Table 4 shows that, in 1999 and 2002, respectively, the standard DEA performs poorly because it classifies 197 firms (about 63 %) and 163 firms (about 72 %) as being 100 % efficient. Such a prevalence of high-performing firms is hard to reconcile with the industry downturn that followed soon thereafter [27]. On the other hand, the proposed method identifies only 30 (22) firms in 1999 (2002) as fully efficient, which amounts to roughly 10 % or fewer outstanding firms in the sample. This ability of the proposed method to identify a selective set of outstanding firms empowers managers to learn from the finest performers rather than from average performers.

Table 4 DMUs classified as 100 % efficient

Finally, we expect DMUs with higher efficiency scores to earn higher profits. The correlations between the efficiency scores based on the proposed method and profits are 0.34 and 0.32 in 1999 and 2002, respectively, and statistically significant. In contrast, the correlations between the efficiency scores based on the standard DEA and profits are much lower (0.19 in 1999, 0.20 in 2002).

To summarize, the proposed method captures the interconnectedness of sub-DMUs and outperforms the standard DEA. It provides discriminating as well as disaggregated (sub-DMU level) information. It assists less efficient firms by identifying the finest firms to benchmark and emulate. It offers diagnostic information with respect to efficiency improvements at the departmental (sub-DMU) level. We next demonstrate the benefits of MSIR in the second stage.

4.3 Nonparametric Estimation and Inference

Stage 2 of the proposed method discovers how the inputs in each department, together with the departmental efficiencies, impact the outputs produced by the firm. In part 1 of stage 2, we extract the composite variables COMP_Y and COMP_X using MSIR. Six outputs comprise the dependent variables: pages of content, pages of ad space, subscriptions, and retail, local, and classified ad revenues. The independent variables comprise, for each of the three sub-DMUs, three inputs (full-time employees, part-time employees, and investment in equipment) plus the sub-DMU’s efficiency score. Therefore, the vector of independent variables has (3 × 3) + 3 = 12 elements. Table 5 shows the MSIR estimates for 1999 and 2002. It reports the estimates of the \( \alpha_1 \) vector, which reduces the dimensionality of the 12 independent variables, and the estimates of the \( \theta_1 \) vector, which reduces the dimensionality of the 6 outputs.

Table 5 MSIR estimation results

In part 2 of stage 2, we discover the relationship between COMP_Y and COMP_X without imposing a functional form a priori. Rather, we let the data inform us of the most appropriate shape. Because we log-transform the output and input variables, we would expect a linear relationship between COMP_Y and COMP_X if the Cobb-Douglas assumptions held. To formally ascertain the presence of nonlinearity, we estimate a series of nonlinear models given by COMP_Y = π(COMP_X)^δ/δ, where δ ranges from 0 to 1 in increments of 0.1. When δ = 1, the relationship between COMP_Y and COMP_X is linear, thereby nesting the Arnold et al. [3] Cobb-Douglas form. We compute the corrected Akaike information criterion (AICC) for each of the models; the best model is the one that attains the smallest AICC [21].
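The grid search just described is straightforward to reproduce. The sketch below fits COMP_Y = π(COMP_X)^δ/δ by least squares for each δ on the grid, treats δ = 0 as the log limit (in the Box–Cox spirit), and selects the δ with the smallest AICC. The composites are simulated placeholders, and counting two parameters (π and the error variance) per model is our own bookkeeping assumption.

```python
import numpy as np

def aicc(rss, n, k):
    """Gaussian AICc, up to an additive constant shared by all candidate models."""
    aic = n * np.log(rss / n) + 2 * k
    return aic + 2 * k * (k + 1) / (n - k - 1)

def power_term(x, delta):
    # x**delta / delta, with delta = 0 read as the log limit (Box-Cox convention)
    return np.log(x) if delta == 0 else x ** delta / delta

rng = np.random.default_rng(5)
COMP_X = rng.uniform(0.5, 3.0, size=310)                        # placeholder composites
COMP_Y = 1.4 * power_term(COMP_X, 0.6) + rng.normal(scale=0.05, size=310)

results = {}
for delta in np.round(np.arange(0.0, 1.01, 0.1), 1):
    z = power_term(COMP_X, delta)
    pi_hat = (z @ COMP_Y) / (z @ z)                             # least squares through the origin
    rss = np.sum((COMP_Y - pi_hat * z) ** 2)
    results[delta] = aicc(rss, len(COMP_Y), k=2)

best = min(results, key=results.get)
print({d: round(v, 1) for d, v in results.items()}, "-> best delta:", best)
```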

Table 6 presents the results for model selection across the 11 alternative models in 1999 and 2002. Focusing on the results for 1999, the model with δ = 0 obtains an AICC value of 1,188.24, while the model with δ = 0.6 attains the minimum AICC score of 1,099.60. The difference in the information criterion exceeds the critical threshold of 2 points for rejection [7]. Hence, we reject the model with δ = 0 in favor of the model with δ = 0.6. Also, the model with δ = 1 (the Arnold et al. model) obtains an AICC value of 1,590.47, which is more than 2 points greater than that of the model with δ = 0.6. Hence, we reject the Arnold et al. model (δ = 1) in favor of the model with δ = 0.6.

Table 6 Model comparisons

Focusing on the results for 2002, the model with δ = 0 obtains an AICC value of 671.46, the model with δ = 1 obtains an AICC value of 1,167.67, and the model with δ = 0.5 attains the minimum AICC score of 669.29. Both the difference in information criteria between the models with δ = 0 and δ = 0.5 and the difference between the models with δ = 1 and δ = 0.5 exceed 2. Hence, we again reject the models with δ = 0 and δ = 1 in favor of the model with δ = 0.5. Thus, our proposed approach allows flexibility in the choice of nonlinearity. Table 7 shows the estimates of π for 1999 and 2002.

Table 7 Estimate from regression of COMP_Y on COMP_X

We calculate the percentage increase in a newspaper output (e.g., ad space sold) in response to a 1 % increase in a department’s efficiency by combining, via the chain rule, information from Table 5 (which links the individual outputs to COMP_Y and the individual inputs to COMP_X) and Table 7 (which links COMP_Y to COMP_X). Table 8 shows the results for 1999 (the 2002 results are similar).

Table 8 Impact of efficiency on outputs (year 1999)

Table 8 indicates that increases in sales force efficiency significantly increase all outputs of the newspaper, with subscriptions, pages of content produced, and classified ads benefitting the most. To boost sales force efficiency, newspaper firms should continue to invest in sales force training and coaching [43]. Table 8 also shows that increases in distribution efficiency generate more outputs; specifically, they enhance subscriptions and sales of classified ads. Indeed, when readers receive newspapers on time, their utility from staying subscribed increases, which, in turn, boosts advertisers’ inclination to buy ad space. At current levels, an incremental increase in newsroom efficiency does not statistically impact any of the six outputs that the newspaper produces (i.e., the estimated impact of a 1 % increase in newsroom efficiency on each output is not statistically different from 0). This finding occurs because, first, the operating levels of newsroom efficiency are 20–30 % higher than the efficiencies of the distribution and sales force departments, and diminishing returns set in as efficiency increases. Second, newspapers tend to be understaffed in the newsroom, and the average newsroom in our database is already performing as efficiently as it can, given its limited staff.

5 Conclusion

Efficiency analysis is an important tool for managing multi-unit business operations, since it provides accurate and timely productivity benchmarks for sub-DMUs within firms and for DMUs at large. However, the literature so far has largely ignored the efficiency analysis of DMUs with interconnected sub-DMUs. It is important to address this gap in order to identify the sub-DMUs and interfaces that are relatively inefficient and need more management attention.

Towards this end, we first outline three requirements in the efficiency analysis of DMUs with interconnected sub-DMUs: a need to produce sub-DMU efficiency scores, to account for the interconnected input–output relationships between the sub-DMUs, and to provide statistical inference on the impact of efficiency on overall DMU outputs. A detailed review of four existing techniques (DEA, SFA, NDEA, and a two-stage DEA/OLS method) shows that each method lacks a comprehensive treatment of the problem.

Hence, we propose a new two-stage method that meets all three criteria. We generalize the existing two-stage method, which uses DEA in the first stage, by embedding the sub-DMU structure within a DMU. Consequently, we obtain sub-DMU level efficiencies and account for the interconnectedness among sub-DMUs. To provide statistical inference, we generalize the existing two-stage method by performing nonparametric estimation in the second stage of the efficiency analysis. Specifically, we propose multivariate sliced inverse regression (MSIR), which provides estimation and inference capability without requiring users to pre-specify any link function between the multiple outputs and multiple inputs. MSIR gives researchers the flexibility to discover the relationship between DMUs’ outputs and inputs rather than imposing an a priori relationship.

Our empirical application illustrates how the proposed method outperforms DEA as an efficiency analysis tool. The proposed method is applicable to a wide variety of efficiency analysis situations; it empowers managers to obtain sub-DMU level insights, reliably measures the impact of efficiency gains on outputs, and avoids pre-specifying the functional form between the outputs and inputs.

We close by offering three takeaways for the newspaper industry. First, newspaper departments are organized to perform tasks that mainly serve one consumer group (e.g., readers), yet the reality is that a media firm produces a joint product serving both end-user groups (readers and advertisers). In this business model, one department’s outputs enhance the efficiency of other departments, and any benchmarking analysis must appreciate this key feature. Our methodology embraces this two-sided feature while providing departmental and overall efficiency scores. Therefore, our approach is well suited to benchmarking newspapers and similar media platforms and to the process-improvement efforts that follow. Second, we provide the cash-strapped and bankruptcy-ridden industry with a technique for deciding which newspapers and newspaper departments are most efficient. Third, further increases in newsroom efficiency do not statistically increase the outputs of newspapers, while increases in sales force and distribution efficiencies do. This finding occurs because newsroom efficiency is 20–30 % higher than the efficiencies of the distribution and sales force departments, and diminishing returns set in as efficiency increases. We hope practitioners and researchers find the proposed method and results useful.