1 Introduction

In this paper, we apply the \({\text {wH}}^{{\text {My}}}\) value (weighted hierarchy value based on the Myerson value) introduced by Casajus et al. (2009) and Hiller (2014) to the problem of optimal size of the firm. Our approach is similar to the work done by van den Brink and Ruys (2008) using a value of cooperative game theory that takes hierarchies in firms into account. As in van den Brink and Ruys (2008), we analyze the influence of production technology on the size of the firm.

To analyze firms within the framework of cooperative game theory, it is necessary to model hierarchies within this framework. One approach was presented by Kalai and Samet (1987). They used ordered partitions and weights to model hierarchies. Winter (1989) used a sequence of bargaining components in the sense of Owen (1977) to model hierarchies. Both approaches model levels in the sense of Lazear and Rosen (1981), Carmichael (1983) and Prendergast (1993). Another approach involves antimatroid games (Algaba et al. 2003, 2004a, b). Unfortunately, these approaches are unable to model a manager-subordinate structure. For hierarchies in firms, this structure is a main characteristic (Radner 1992; Meagher 2001).

Another approach in cooperative game theory that is used to model hierarchies applies an undirected graph/network on the set of players (Myerson 1977; Borm et al. 1992; Herings et al. 2008). However, the problem with this approach is that all players are symmetrical in the graph; there are no superiors and subordinates. In van den Brink (2008) a directed graph/permission structure on the set of players is used to analyze the effects of hierarchies on players’ payoffs or to the wages of employees respectively. The basic idea of permission values is that a player/employee needs approval from all predecessors (Gilles et al. 1992; van den Brink and Gilles 1996). In van den Brink and Ruys (2008) the approach is used to determine the organizational structure of the firm by considering production technology.

In van den Brink (2008) single dominance relationships, i.e., the influence of a predecessor on direct successors, are equally strong. It is, however, a plausible assumption that the relations among the players may have different levels of strength (e.g., different leadership styles could be modelled by weights). Using this idea to enhance the modelling of hierarchies, Casajus et al. (2009) introduced the \({\text {wH}}^{{\text {Sh}}}\) value (a weighted hierarchy value based on the Shapley value). In this approach a weighted directed graph is used to model hierarchies. Two elements form the basis of the \({\text {wH}}^{{\text {Sh}}}\) value. First, in order to create the output, all players work symmetrically together. In this step, the output is distributed to the players according to the Shapley value (Shapley 1953). In a second step, the weighted hierarchy reallocates a certain fraction of these payoffs. The weighted hierarchy has only allocational effects.Footnote 1 In Hiller (2014), the \({\text {wH}}^{{\text {Sh}}}\) value has been generalized for games with a cooperation structure/graph.Footnote 2 This approach takes account of the coordinating role in the distribution of output; the players do not work symmetrically together (first step). The player who coordinates the remaining players will be rewarded if there is a cooperative win. Again, in the second step of calculating the payoffs, the weighted hierarchy reallocates a certain fraction of the payoffs that players obtain in the first step.

This paper applies the \({\text {wH}}^{{\text {My}}}\) value to analyze the question of optimal size of the firm. Our analysis is analogous to van den Brink and Ruys (2008). One result is that an increase of production technology increases the size of the firm. With respect to the theory of the firm, our approach could be a technical contribution. Other technical approaches with interesting insights for an optimal hierarchy are by Williamson (1967), Calvo and Wellisz (1978; 1979), Rosen (1982), Keren and Levhari (1979; 1989), Meagher and van Zandt (1998) and Meagher et al. (2003) for example.

The remainder of our article is organized as follows. In Sect. 2, we present some preliminaries. Section 3 provides results on the size of the firm. Section 4 concludes and outlines further research.

2 Preliminaries

A transferable utility (TU) game is a pair \(\left( N,v\right)\) where \(N=\{1,2,\ldots ,n\}\) is the non-empty and finite set of players. The number of players in N is denoted by n or \(\left| N\right| .\) The coalitional function v assigns every subset \(K\subseteq N\) a certain worth \(v\left( K\right)\) reflecting the economic abilities of K,  i.e., \(v:2^{N}\rightarrow {\mathbb {R}}\) and \(v\left( \emptyset \right) =0.\) A TU game (Nv) is called symmetric if a function \(f:N\rightarrow {\mathbb {R}}\) exists such that \(v(K)=f(|K|)\) for all nonempty sets \(K\subseteq N.\) A symmetric game is monotone if \(v\left( T\right) <v\left( S\right)\), for all \(T\subset S, T,S\subseteq N.\)

A value is an operator \(\phi\) that assigns (unique) payoff vectors to all games \(\left( N,v\right)\) (i.e., uniquely determines a payoff for every player in every TU game). One important value is the Shapley value. In order to calculate the players’ payoffs, rank orders \(\rho\) on N are used. They are written as \((\rho _{1},\dots ,\rho _{n})\) where \(\rho _{1}\) is the first player in the order, \(\rho _{2}\) the second player, etc. The set of these orders is denoted by \(RO\left( N\right)\); n! rank orders exist. The set of players before i in rank order \(\rho\) and player i is called \(K_{i}(\rho ).\) The Shapley payoff of i is the average of the marginal contributions taken over all rank orders (Shapley 1953):

$$\begin{aligned} {\text {Sh}}_{i}\left( N,v\right) =\frac{1}{n!}\sum _{\rho \in RO\left( N\right) }v\left( K_{i}\left( \rho \right) \right) -v\left( K_{i}\left( \rho \right) \backslash \left\{ i\right\} \right) . \end{aligned}$$
(1)

Other value-like solution concepts of cooperative game theory are presented by Banzhaf (1965), Schmeidler (1969), Holler (1982) and Tijs (1987).

As in Gilles et al. (1992), van den Brink and Gilles (1996), van den Brink (1997), van den Brink (2008), van den Brink and Ruys (2008), Casajus et al. (2009) and van den Brink and Dietz (2014) the permission structure is a mapping \(S:N\rightarrow 2^{N}.\) To each player are assigned the players who are direct successors of i. S can be interpreted as a directed graph (Bollobás 2002). \(S\left( i\right)\) identifies the direct successors of \(i\)with \(i\notin S\left( i\right)\). \(\left| S\left( i\right) \right|\) could be interpreted as span of control of player i. The players in \(S^{-1}\left( i\right) =\left\{ j\in N:i\in S\left( j\right) \right\}\) are the direct predecessors of i; \(S^{-1}\left( K\right) = {\textstyle \bigcup \nolimits _{i\in K}} S^{-1}\left( i\right)\).

For the hierarchy of the firm, we assume a tree structure as usual in the literature (Radner 1992; Meagher 2001). These structures satisfy two conditions:

  • there is one player \(i_{0}\in N\), such that \(S^{-1}\left( i_{0}\right) =\emptyset\) and \({\hat{S}}\left( i_{0}\right) =N\backslash \left\{ i_{0}\right\}\) and

  • for every player \(i\in N\backslash \left\{ i_{0}\right\}\) we have \(\left| S^{-1}\left( i\right) \right| =1\).

In a tree structure, a path T in N from i to j is a sequence of players \(T\left( i,j\right) =\left\langle r_{0},r_{1},\dots ,r_{k-1},r_{k}\right\rangle\) with \(i=r_{0}\), \(j=r_{k}\) and \(r_{l+1}\in S\left( r_{l}\right)\) for all \(l=0,\dots ,k-1\). A path can be interpreted as a chain of commands/chain of reporting between i and j, whereby i is a predecessor of j. The set of successors of i is \({\hat{S}}\left( i\right) :=\left\{ j\in N\backslash \left\{ i\right\} :\text {there is a path from }i\text { to }j\right\}\). Analogously, we denote the set of i’s predecessors by \({\hat{S}}^{-1}\left( i\right) :=\left\{ j\in N\backslash \left\{ i\right\} :\text {there is a path from\ }j\text { to\ }i\right\}\).

Besides the hierarchy S, weighted relationships between the players are taken into account. The vector \(w:N\rightarrow {\mathbb {R}}\) assigns every player i a weight \(w_{i}, 0\le w_{i}\le 1.\) For \(i_{0}\) we have \(w_{i_{0}}=0.\) If a vector maps all players the same weight \({\bar{w}}\), except \(i_{0},\) i.e., \(w_{i}=w_{j}={\bar{w}}\) for all \(i,j\in N\backslash \left\{ i_{0}\right\}\), we denote the vector by \({\bar{w}}\).

Hierarchy S characterizes a level structure or partition \({\mathfrak {L}} =\left( L_{0},\ldots ,L_{M}\right)\) of N with

  • \(L_{0}=\left\{ i_{0}\right\}\) and

  • \(L_{k}=\left\{ \left. i\in N\backslash \overset{k-1}{\underset{u=0}{ {\textstyle \bigcup } }}L_{u}\right| S^{-1}\left( i\right) \subset L_{k-1}\right\} , 1\le k\le M, L_{M}\ne \emptyset\) and \(L_{M+1}=\emptyset .\)

The definition of the level structure characterizes a level by the distance to \(i_{0}\) at the top of the firm, i.e., this can be called a top-down-hierarchy.Footnote 3\(L_{M}\) is the lowest level in the firm. We call players at the lowest level workers.

A weighted hierarchical game is a tuple \(\left( N,v,S,w\right)\). A value for these games is an operator \(\varphi .\) The \({\text {wH}}^{{\text {Sh}}}\) value is one of these values. All players j,  with \(j\in {\hat{S}}^{-1}\left( g\right)\), respectively, and all players in the path \(T\left( i_{0},g\right)\), get a fraction of g’s Shapley payoff. For \(i\in N\) we calculate the fraction of \(g^{\prime }\)s payoff by (Casajus et al. 2009):

$$\begin{aligned} f_{i}\left( S,w,g\right) =\left\{ \begin{array} [c]{ll} \left[ 1-w_{i}\right] \underset{_{\begin{array}{c} l\in {\hat{S}}\left( i\right) ,\ \\ l\in T\left( i_{0},g\right) \end{array}}}{ {\textstyle \prod } }w_{l}, &{} i\in T\left( i_{0},g\right) ,\\ 0, &{} \text {else.} \end{array} \right. \end{aligned}$$
(2)

From this, we have for \({\text {wH}}^{{\text {Sh}}}\) payoff of player \(i\in N\) (Casajus et al. 2009):

$$\begin{aligned} {\text {wH}}_{i}^{{\text {Sh}}}\left( N,v,S,w\right) =\sum _{j=1}^{n}f_{i}\left( S,w,j\right) \cdot {\text {Sh}}_{j}\left( N,v\right) . \end{aligned}$$
(3)

For a literature review regarding values on games with hierarchies, see as an example van den Brink (2017).

In order to honor players who coordinate other players, the \({\text {wH}}\) value based on the Myerson value (Myerson 1977) has been introduced and axiomatized (Hiller 2014). Some further preliminaries are necessary to introduce this value. First, a graph L on the set of players is considered. The set of possible pairwise links between players is called \(L^{N}=\left\{ \left\{ i,j\right\} :i,j\in N,\ i\ne j\right\}\), whereat \(\left\{ i,j\right\}\) and \(\left\{ j,i\right\}\), respectively, (or ij and ji) is the direct link between players i and j. A cooperation structure CO on N is a graph \(\left( N,L\right)\) with \(L\subseteq L^{N}.\) A CO game is characterized by (NvL). From hierarchy S we construct \(L_{S}\) in the following way: \(L_{S}=\left\{ ij:i\in S\left( j\right) \right\}\) for all \(i,j\in N\); hence, the CO game \((N,v,L_{S})\) results. The graph \(L_{S.}\) partitions N into components \(C_{1},\dots ,C_{k}.\) This partition is denoted by \(N\backslash L_{S}.\) Each player is in one component; \(C_{i}\cap C_{j}=\emptyset , i\ne j, N=\bigcup C_{j}. N\backslash L_{S}\left( i\right)\) denotes the component of i. Two players i and j with \(N\backslash L_{S}\left( i\right) =N\backslash L_{S}\left( j\right)\) are connected. The restricted coalitional function \(\left. v\right| _{L_{S}}\) is given by:

$$\begin{aligned} \left. v\right| _{L_{S}}\left( K\right) :=\sum \limits _{C\in K\backslash L_{S}}v\left( C\right) \text { }\forall \text { }K\subseteq N. \end{aligned}$$
(4)

The worth of a coalition K corresponds to the sum of the worths of its components. In the case \(\left| K\backslash L_{S}\right| =1\) we have \(v\left( K\right) =\left. v\right| _{L_{S}}\left( K\right)\). A CO value is an operator \(\psi\) that assigns (unique) payoff vectors to all CO games. The most popular CO value is the Myerson value (Myerson 1977). According to this value, player \(i^{\prime }\)s payoff is calculated by:

$$\begin{aligned} {\text {My}}_{i}\left( N,v,L_{S}\right) ={\text {Sh}}_{i}\left( N,\left. v\right| _{L_{S}}\right) . \end{aligned}$$
(5)

Further values for CO games are the position value (Borm et al. 1992), the average tree solution (Herings et al. 2008) and the center value (Navarro 2020). For a literature survey on CO games, see Slikker and van den Nouweland (2001) and Gilles (2010). With these preliminaries, \({\text {wH}}_{i} ^{{\text {My}}}\) is calculated by (Hiller 2014):

$$\begin{aligned} {\text {wH}}_{i}^{{\text {My}}}\left( N,v,S,w\right) =\sum _{j=1}^{n}f_{i}\left( S,w,j\right) \cdot {\text {My}} _{j}\left( N,v,L_{S}\right) \end{aligned}$$
(6)

To exemplify the calculation, we introduce an example:

Example 1

For our example, we assume \(N=\left\{ 1,2,3\right\}\) and

$$\begin{aligned} v\left( K\right) =\left\{ \begin{array} [c]{rl} 2, &{} \left| K\right| =1\\ 10, &{} \left| K\right| =2\\ 15, &{} \left| K\right| =3\\ 0, &{} \text {else.} \end{array} \right. \end{aligned}$$
(7)

In addition, we have \(S\left( 2\right) =S\left( 3\right) =\varnothing , S\left( 1\right) =\left\{ 2,3\right\}\), and \(w=\left( 0,\frac{1}{3},\frac{1}{4}\right)\). For \(\left. v\right| _{L_{S}}\), we have:

$$\begin{aligned} \left. v\right| _{L_{S}}\left( K\right) =\left\{ \begin{array} [c]{ll} 2, &{} \left| K\right| =1\\ 4, &{} K=\left\{ 2,3\right\} \\ 10, &{} K=\left\{ 1,2\right\} ,K=\left\{ 1,3\right\} \\ 15, &{} \left| K\right| =3\\ 0, &{} \text {else.} \end{array} \right. \end{aligned}$$
(8)

From these assumptions, we obtain \({\text {My}}_{2}\left( N,v,L_{S}\right) ={\text {My}}_{3}\left( N,v,L_{S}\right) =4\) and \({\text {My}}_{1}\left( N,v,L_{S}\right) =7\). Since player 1 coordinates the other players, player 1 gets a higher payoff. For the \({\text {wH}}^{{\text {My}}}\) payoffs, we have:

$$\begin{aligned} {\text {wH}}_{1}^{{\text {My}}}\left( N,v,S,w\right) =9\frac{1}{3},{\text {wH}}_{2}^{{\text {My}}}\left( N,v,S,w\right) =2\frac{2}{3},{\text {wH}}_{3} ^{{\text {My}}}\left( N,v,S,w\right) =3. \end{aligned}$$

Building on our definition of a weighted hierarchical game \(\left( N,v,S,w\right)\) and the calculation of \({\text {wH}}_{i} ^{{\text {My}}}\left( N,v,S,w\right)\) we can sketch the roles of various players in the firm. In our analysis in Sect. 3, only the workers at \(L_{M}\) are productive with respect to v. Managers in the levels between \(L_{M}\) and \(i_{0}\) generate an additional worth, if cooperation between workers is superadditive. Analogous to van den Brink and Ruys (2008), we interpret \(i_{0}\) as an owner of the firm, i.e., the payoff of \(i_{0}\) is the profit of the firm. Hence, the structure and size of the firm that maximizes \(i_{0}\)’s payoff is profit maximizing (see Sect. 3). The weights \(w_{i}\) can be interpreted as the level of redistribution/exploitation in the firm. High weights result in a large part of the worth being redistributed to \(i_{0}.\) If the worth is not only interpreted in monetary terms, this could also be mean that superiors like to bask in the success of their employees—and thus as part of the corporate culture. Outside our model, this can lead to a low incentive for innovation or performance when there is hidden information. Like the span of control \(\left| S\left( i\right) \right|\), the weights \(w_{i}\) as well as the coalition function v are given exogenous.

3 Results

In this section, we present some new insights regarding the structure and size of the firm based on the \({\text {wH}}^{{\text {My}}}\) value as a scheme that allocates produced worth to the players. For the structure and production technology of the firm, we assume (again analogous to van den Brink and Ruys (2008)):

  • \(w_{i}=w, 0<w<1,\) for all \(i\in N\backslash \left\{ i_{0}\right\}\),

  • \(\left| S\left( i\right) \right| =s\ge 1\) for all \(i\in N\backslash L_{M},\) and

  • \(v:\left\{ 1,\cdots ,\left| L_{M}\right| \right\} \rightarrow {\mathbb {R}}^{+}.\)

For our firm, we have a constant weight \({\bar{w}}\) for all employees without \(i_{0}\) at the top of the firm. The span of control s is equal in the whole firm. So in an m-level firm, the number of employees (with \(i_{0})\) equals \(n= {\textstyle \sum \nolimits _{k=0}^{m}} s^{k}=\frac{s^{m+1}-1}{s-1}.\) The number of workers at the lowest level is given by \(\left| L_{M}\right| =s^{m}.\) Only these workers influence the coalitional function \(v\left( K\right)\). In addition, the workers are identical with respect to \(v\left( K\right)\). In van den Brink and Ruys (2008), the coordination costs per level are a percentage \(\alpha , 0<\alpha <1.\) They have the effect that adding a level (increasing the firm size) may thus benefit \(i_{0}\) by increasing the scale of production, at the cost of an increase in coordination costs (Williamson 1967). In our model, w can be interpreted as level costs, since the employees between the workers and \(i_{0}\) acquire a part of the worth produced by workers.

Example 2

A firm with \(N=\left\{ 1,2,3\right\} , S\left( 2\right) =S\left( 3\right) =\varnothing , S\left( 1\right) =\left\{ 2,3\right\} ,\) and \(w=\frac{1}{3} \forall ~i\in N\backslash \left\{ 1\right\}\) meets the requirements. The span of control is 2.

For first insights, we assume a linear relation between the number of workers and the worth produced; i.e., there is a production function p based on the set of workers \(L_{M}\) with \(p:L_{M}\rightarrow {\mathbb {R}},p(K)=c\cdot \left| K\right| ,\) for all nonempty sets \(K\subseteq L_{M}\) with \(c>0.\) From this we deduce the coalitional function v with:

$$\begin{aligned} v\left( K\right) =\left\{ \begin{array} [c]{ll} c\cdot \left| L_{M}\cap K\right| , &{} L_{M}\cap K\ne \emptyset \\ 0, &{} else. \end{array} \right. \end{aligned}$$
(9)

The worth produced by the firm is \(v\left( N\right) =c\cdot s^{m}.\) Applying the \({\text {wH}}^{{\text {My}}}\) value, we have in a first step for the Myerson payoffs:

$$\begin{aligned} {\text {My}}_{i}\left( N,v,L_{S}\right) =c,\text { }i\in L_{M}\text { and }{\text {My}}_{j}\left( N,v,L_{S}\right) =0,\text { }j\in N\backslash L_{M}. \end{aligned}$$
(10)

Based on these payoffs, the following \({\text {wH}} ^{{\text {My}}}\) payoffs occur:

$$\begin{aligned} {\text {wH}}_{i}^{{\text {My}}}\left( N,v,S,w\right)=\left( 1-w_{i}\right) \cdot c,\\\text { }i\in L_{M} {\text {wH}}_{i_{0}}^{{\text {My}}}\left( N,v,S,w\right)=s^{m}\cdot w^{m}\cdot c. \end{aligned}$$
(11)

Since the production function p is linear in \(\left| L_{M}\right|\), the coordination by \(i_{0}\) does not generate marginal contributions. From this, we deduce our first result regarding the size of the firm:Footnote 4

Theorem 3

In a firm with a linear production function p and an allocation of the worth using the \({\text {wH}}^{{\text {My}}}\) value, there is no optimal size from the point of view of \(i_{0}\).

The proof is given in “Appendix 2”. Hence, there is no optimal size of the firm. If \(s\cdot w>1, i_{0}\) could increase the payoff by adding an additional level to the firm. Hence, without more detailed modelling, the constellation of sw and c provides only an initial indication of whether the firm exists or not. A short example illustrates this:

Example 4

The firm of Example 2 is the starting point and called firm A. In addition, the production function for the workers is given by \(p_{A} (K)=2\cdot \left| K\right| , K\subseteq L_{M}\). For the payoffs, we have \({\text {My}}_{1}\left( N_{A},v_{A},L_{S_{A}}\right) =0, {\text {My}}_{2}\left( N_{A},v_{A},L_{S_{A}}\right) ={\text {My}}_{3}\left( N_{A},v_{A},L_{S_{A}}\right) =2\) and \({\text {wH}}_{i}^{{\text {My}}}\left( N_{A},v_{A},S_{A},w_{A}\right) =\frac{4}{3}\ \forall i\in N\). With respect to Theorem 3, we have case \(s\cdot w<1.\) Adding a new level gives a firm B with \(N_{B}=\left\{ 1,\ldots ,7\right\} , S_{B}\left( j\right) =\varnothing \)with \(j\in \left\{ 4,5,6,7\right\} , S_{B}\left( 2\right) =\left\{ 4,5\right\} , S_{B}\left( 3\right) =\left\{ 6,7\right\}\), \(S_{B}\left( 1\right) =\left\{ 2,3\right\}\) and \(w_{B}=\frac{1}{3}~\forall ~i\in N\backslash \left\{ 1\right\}\); see Fig. 1. The Myerson payoffs for the players in the last level are unchanged. For \({\text {wH}}^{{\text {My}}}\) payoffs, we obtain \({\text {wH}}_{i}^{{\text {My}}}\left( N_{B},v_{B} ,S_{B},w_{B}\right) =\frac{4}{3} \forall i\in L_{M}, {\text {wH}} _{j}^{{\text {My}}}\left( N_{B},v_{B},S_{B},w_{B}\right) =\frac{8}{9} \forall j\in L_{M-1}\) and \({\text {wH}}_{i_{0} }^{{\text {My}}}\left( N_{B},v_{B},S_{B},w_{B}\right) =\frac{8}{9};\) the payoff for \(i_{0}\) has been reduced.

Fig. 1
figure 1

Example: Firm B

In the next step, we assume a production function on the set of workers \(L_{M}\) with \(p(K)=\left| K\right| ^{\gamma },\) \(K\subseteq L_{M}\) with \(\gamma >0.\) If \(\gamma >1\), coordinating the workers generates an additional worth. The exponent \(\gamma\) could be interpreted as technology of the firm. A higher \(\gamma\) means higher productivity by the workers. Applying the \({\text {wH}}^{{\text {My}}}\) value results in an optimization process. In a general analysis we get:

Theorem 5

In a firm with a production function p with \(0<\gamma \le 1\) and an allocation of the worth using the \({\text {wH}} ^{{\text {My}}}\) value there is an optimal size of the firm, i.e., there is a \({\widehat{m}}\) with \({\text {wH}}_{i_{0} }^{{\text {My}}}\left( {\widehat{N}},v,S,w\right) \ge {\text {wH}} _{i_{0}}^{{\text {My}}}\left( {\overline{N}},v,S,w\right)\) with \(\left| {\widehat{N}}\right| = {\textstyle \sum \nolimits _{k=0}^{{\widehat{m}}}} s^{k}\) and \(\left| {\overline{N}}\right| = {\textstyle \sum \nolimits _{k=0}^{{\overline{m}}+1}} s^{k}\).

The proof is given in “Appendix 2”. It is also possible in firms with \(\gamma >1\) that an optimization process may occur. To illustrate this, we present an example with \(p(K)=\left| K\right| ^{\frac{5}{4}}\) and \(s=2.\) Table 1 shows the results for \(i_{0}, j\in L_{M}\) and \(l\in N\backslash \left\{ \left\{ i_{0}\right\} \cup L_{M}\right\}\). In the case of \(w=0.1,\) it is not worth it for \(i_{0}\) to insert an additional level of employees in the firm with an aim to raise the number of workers.

In addition, we deduce from Eqs. 6 and 11:

Corollary 6

In a firm with a production function p with \(0<\gamma\) an increase in

  • span of control s

  • weight w

  • productivity of workers \(\gamma\)

increases the (optimal) size of the firm.

Hence, the higher the productivity of the workers, the larger a firm will be. The same holds for the span of control and the allocation weight.

Analogous to van den Brink and Ruys (2008), finally we briefly consider a reservation wage for workers in \(L_{M}\).Footnote 5 This wage is denoted by \(r_{i}\) for \(i\in L_{M}\) with \(r_{i} \in {\mathbb {R}}^{+}\). For workers in \(L_{M}\) this wage is the lowest wage rate at which they are willing to work at the firm. Hence, \({\text {wH}}_{i}^{{\text {My}}}\left( N,v,S,w\right) \ge r_{i}\) is necessary. For firms that are exposed with an optimization process as noted in Theorem 5 or Table 1, an increase of reservation wages/minimum wages could reduce the optimal number of levels. This occurs if the new reservation wage is above the worker’s wage \({\text {wH}} _{i}^{{\text {My}}}\left( N,v,S,w\right)\). In order for the worker to continue to operate at the firm, the weight w must be lowered. This reduces the optimal number of levels \({\widehat{m}}.\) Hence, the payoff for \(i_{0}\) is bounded above by the reservation wage. This result is in line with van den Brink and Ruys (2008).

Table 1 Productivity and firm size

4 Conclusion

In this note, we apply the \({\text {wH}}^{{\text {My}}}\) value to determine the optimal size of the firm from the point of view of the owner \(i_{0}\) of the firm. In modelling with a linear production function \(p(K)=c\cdot \left| K\right|\), an increase of level costs/weights w and an increase in the span of control s,  increases the probability that the firm exists. For production function \(p(K)=\left| K\right| ^{\gamma }\) the higher productivity of workers at the lowest level of the firm leads to an increase in optimal firm size. In addition, an increase of w and an increase of s are combined with a growing optimal size of the firm. This intuitive result shows that the \({\text {wH}} ^{{\text {My}}}\) value is an appropriate method to model the allocation of the produced worth in the firm.

With respect to other technical approaches mentioned in the Introduction, in Williamson (1967), two factors limit the size of the firm/span of control. On the one hand, the quality of the information that the manager receives at the top position of the firm decreases, since each additional hierarchical level has a negative impact on this quality. In addition, the quantity of information to be processed by the top manager increases with the size of the company, so this is also a limiting factor. This modeling was enhanced by Calvo and Wellisz (1978; 1979). Their analysis assumes asymmetric information regarding the effort of employees at lower levels/hidden action of these employees. The limitation of firm size is caused by loss of control and the high costs of supervision. In our model, asymmetric information is not considered. In addition, Calvo and Wellisz (1978; 1979) show that reservation wages/minimum wages could increase employment since higher wages reduce workers’ incentive to shirk and therefore reduces the required supervision and additional employees can be hired and observed. In our model, an increase in the minimum wage cannot lead to an increase in the size of the firm. In Rosen (1982), the hierarchy and size of the firm is an outcome of assignments of employees to hierarchical positions. More able employees are allocated to top positions and firms with persons of superior talent in top positions are larger. In our analysis, workers in the last level are symmetric and so we do not capture this question (see “Appendix 1” for results from Casajus et al. (2009)). In Keren and Levhar (1979; 1989) the span of control for every firm level is a result of minimizing labor costs on the one hand and costs caused by delays in decision making on the other. One result is that the span of control is higher at lower hierarchy levels. In our model, the span of control is exogenous and the optimization remains open for further research. In Meagher and van Zandt (1998) and Meagher et al. (2003), information processing is the production activity of firms. Hierarchies/firms with more than one employee occurs in order to accelerate/parallelize information processing. The wages of employees and the cost of delay in information processing determine the optimal size of the firm. With respect to our model, the coalition function can be interpreted as the speed of information processing. The managers of the firm coordinate this processing. Adding more managers/hierarchy levels increases the number of information processing workers at the lowest hierarchy level—but delays increase due to additional coordination.

For a linear case, our results are in line with van den Brink and Ruys (2008). In their article, the owner chooses the deepest organization if a certain amount for w (in their article \(\alpha )\) is exceeded. For a Cobb-Douglas production function, their approach gives: if there exists an optimal number of levels, the number of levels is one. We think that our approach is more intuitive at this point.

On a more abstract level on the theory of the firm, our technical contribution—in particular w—may be used to model the efficiency of the management of the firm in the context of “transaction cost” theories (Coase 1937; Williamson 1975, 1985). The property rights approach of the theory of the firm (Grossman and Hart 1986; Hart and Moore 1990) has strong similarity with concepts of cooperative game theory. Hence, the integration of hierarchical structures of cooperative game theory seems to be possible. For the third line of the theory of the firm—the contracting approach—or the principal-agent approach, respectively—(Alchian and Demsetz 1972; Jensen and Meckling 1976) the player’s payoffs in the cooperative game could be used as possible outcomes in a principal-agent game.

In our modeling, \(w_{i}, s\) and v are exogenous. This can also be enhanced in future research. For example, it is conceivable that \(i_{0}\) has a choice between investing in production technology (and hence v) or investing in organizational innovation (an increase in s).

In addition, our approach and the other values by cooperative game theory mentioned in the introduction could be used to analyze the question of to whom the profits of a firm belong—capital, labor or the entrepreneur. In our model of the firm’s hierarchy, \(i_{0}\) is the owner (capital, entrepreneur) of the firm.Footnote 6 To model multiple shareholders of the firm instead of only one player \(i_{0},\) weighted voting games could be used (Shapley and Shubik 1954). Level \(L_{0}\) could be a component in the sense of Owen (1977) (or Winter 1989 respectively), with the shareholders as members of this component.Footnote 7 A further enhancement could be a modification of the production function of the firm with aspects of an apex game (von Neumann and Morgenstern 1944). An overview of the existing literature on apex games is provided by Montero (2002). To model social development between capital providers, entrepreneurs and workers over time, dynamic/evolutionary cooperative game theory (see Newton (2018) for an overview and Casajus et al. (2020) for some new insights) could be applied.

Additionally, our approach provides at least two starting points for future theoretical research. First, the coalition structure approach of cooperative game theory (Aumann and Drèze 1974) can be used to model the possibility of \(i_{0}\) to split up the firm into several companies. Hence, this provides an opportunity to analyze not only the number of hierarchical levels, but also the number of firms. In addition, other CO values can be used as a basis for the \({\text {wH}}\) value to analyze the robustness of our insights.

Another line for future research could be amalgamating the hierarchical approach of cooperative game theory as used in our article and the research done by Newton et al. (2019). In their article, employees of a team have ties to employees of other teams; and as such, there is an additional communication network in the firm besides the disciplinary hierarchy in our model.