Complex Networks
Following the terminology of [2], a network’s graph G(V, E) is a set of vertices V and a set of edges E that connect elements of V. Mathematically, the edges can be represented by an adjacency matrix A. For a binary, undirected graph, that only encodes if there is any connection between two vertices i and j or not, the adjacency matrix is
$$\begin{aligned} A_{i,j} = {\left\{ \begin{array}{ll} 1 &{} \text {if }i\text { and }j\text { are connected} \\ 0 &{} \, \text {else.} \\ \end{array}\right. } \end{aligned}$$
(1)
In case of a directed graph, A is no longer symmetric and additionally encodes the direction of the edge. For a directed weighted network, the adjacency matrix also contains the strength of the connection from i to j. Thus, the adjacency matrix generalises to
$$\begin{aligned} A_{i,j} = {\left\{ \begin{array}{ll} w_{i,j}&{} \text {if there is a connection from }i\text { to }j\text { with weight }w_{i,j} \\ 0 &{} \, \text {else.} \\ \end{array}\right. } \end{aligned}$$
(2)
Naturally, there can also be undirected weighted graphs with a symmetric A. This article uses several different representations of the football transfer network because most methods that assess a graph’s structure are only restricted to a certain type of graph. An overview of the different network types and their properties of interest discussed later in this section is given in Table 1.
Table 1 Overview of different network representations and their properties Constructing a Network of Football Transfers
For this analysis, the clubs are the vertices and the transfers between them the edges. Only permanent transfers with a nonzero fee are considered as a valid edge between two clubs and disregard e.g. loans, because of two reasons: First, the transferred money between two clubs is used as the weight of an edge between two clubs and therefore, transfers without a fee can be disregarded. And second, including non-permanent transfers can cause the same transfer being accounted for in the transfer networks of multiple years (e.g. Kylian Mbappé was transferred from Monaco to Paris in 2017 on a loan which was transformed into a permanent transfer in 2018). All transfer fees from club i to j are aggregated as a sum, but notably, there can be transfer streams from i to j and j to i simultaneously. For each year, the summer and winter transfer windows are combined because the winter window usually includes only few transfers and therefore may rather be seen as a "correction" of the summer transfer window than as an individual transfer market (cf. [24] for the difference in total transfer fees during summer and winter as an indicator of how small the winter market is compared to the summer market).
Key Properties of Complex Networks
The following paragraphs explain some of the key properties of complex networks which will be estimated with the football transfer data, namely the degree distribution p(k), the disparity \(Y_2(i)\), the clustering coefficient C and the small-world coefficient \(\omega \). An overview about which property can be found in which kind of network type is given in Table 1.
A complex network can be characterised by its degree distribution p(k) which measures the likelihood that a vertex has k edges attached to it. For directed networks, this distribution can be split up into in-degree and out-degree distributions. Generally, a power law tail \(p(k)\sim k^{-\gamma }\) with \(2\le \gamma \le 3\) is associated with a scale-free network [8]. Such a behaviour can be observed if there is a preferential attachment rule during the network’s growth phase [27]. This is to be expected because the transfer network emerges over the course of the entire transfer period (growth) and clubs that have lost a player may seek to replace them by buying another player, therefore having a higher market activity (preferential attachment). Especially relegated or promoted clubs have a necessity to exchange many of their players in order to have a team that is within their current financial restrictions and therefore are expected to have a high number of connections in each direction.
For weighted undirected networks, the disparity \(Y_2\) can be used to measure the heterogeneity of the network. For each vertex i, the strength \(s_i\) of the vertex is defined as the sum of all weights \(w_{i,j}\) attached to that vertex [28]. The disparity for each vertex i is then defined as
$$\begin{aligned} Y_2(i) = \sum _{j \in \nu (i)} \left( \frac{w_{i,j}}{s_i} \right) ^2, \end{aligned}$$
(3)
where \(\nu (i)\) is the neighbourhood of vertex i and contains the \(k_i\) vertices connected to i. If all attached edges carry similar weight for a non-trivial neighbourhood with \(k_i>1\), then \(Y_2(i) \sim k_i^{-1}\) and if few weights dominate then \(1\ge Y_2>>1/k_i\).
The clustering coefficient has been introduced in [29] and can be interpreted as the probability that two vertices connected to a third vertex are also connected to each other. This quantity has been generalised for weighted and directed complex networks [30]. For binary directed networks, the clustering coefficient of each vertex i is
$$\begin{aligned} C_i = \frac{(A+A^T)^3_{ii}}{2(d_i^tot (d_i^tot -1)-2d_i^\leftrightarrow )} \end{aligned}$$
(4)
with adjacency matrix A, its transpose \(A^T\), \(d_i^tot \) the total number of connections of the \(i^{th}\) vertex and \(d_i^\leftrightarrow \) the number of vertices \((j_i)_i\) with a connection to i in both directions (\(i\rightarrow j\) and \(j\rightarrow i\)). Here, the power \(M^\alpha \) of a matrix M means element-wise \((M_{ij}^\alpha )_{ij}\). For weighted directed networks, this expression is generalised to
$$\begin{aligned} C_i = \frac{\left( A^{1/3}+ (A^T)^{1/3} \right) ^3_{ii}}{2(d_i^tot (d_i^tot -1)-2d_i^\leftrightarrow )}. \end{aligned}$$
(5)
Note that for binary networks, \(A^{1/3} = A\). In both cases, the overall clustering coefficient for the entire network is the mean of the vertices’ cluster coefficients \(C = \sum _{i=1}^N C_i/N\).
Many real-world social systems show a small-world property, i.e. from any vertex i, any other vertex j can be reached via few edges [29]. Small-world networks are therefore characterised by high clustering coefficients and a low mean path length between two vertices. The quantity \(\omega \) defined in [31] measures this property by calculating a trade-off between clustering and shortest path lengths for the analysed network and comparing these quantities to those of idealised networks: Consider a binary undirected network and let C and \(C_l\) be the clustering coefficients for the given network and for a lattice network, while L and \(L_r\) are the mean shortest distance between any two nodes of the given network and of a random network, then \(\omega \) is defined as
$$\begin{aligned} \omega = L_r/L - C/C_l. \end{aligned}$$
(6)
\(\omega \) is restricted to \([-1,1]\) and places the network on a continuum from regular lattices (\(\omega \approx -1\)) to random graphs (\(\omega \approx 1\)) and is more robust than the similar small-world measure \(\sigma \) from [29] given as
$$\begin{aligned} \sigma = \frac{C}{C_r} / \frac{L}{L_r} \end{aligned}$$
(7)
with \(C_r\) being the clustering coefficient of a random graph. The computation of the clustering coefficients and of the small-world \(\omega \) is done via the package NetworkX in python [32] with the functions networkx.algorithms.cluster.clustering and networkx.algorithms.smallworld.omega.