Abstract
The relationship between income and growth rates has been an elementary problem of research on economic convergence. In the present paper, we study growthincome paths in a new perspective. We assess the similarities of transitional growth trajectories with the use of novel conceptbased model. Further, we group economies on the basis of the assessed similarities and we evaluate withingroup growthincome relationships. The obtained results point to distinct patterns of development, which help to understand the puzzles of absolute convergence and divergence. Among others, we find evidence of a humpedshaped path of longrun transitional growth. Simultaneously, we identify countries which got stuck in poverty and in the middleincome trap.
1 Introduction
The crosscountry variegation of growth experience has been one of the pivotal topics in economics for decades. A special place in this research agenda have computational models supporting the analysis of the paths of longrun growth of countries at different stages of development. The persistent interest on this debate stems from the direct link between convergence hypothesis and decision making in regards to economic policy. As indicated by Johnson and Papageorgiou (2020), a confirmation of a global convergence gives support to policy interventions on a smallscale which could speedup transition of the poor countries toward a common steadystate. On the other hand, an existence of diverging clubs with different steadystates, calls for largescale policy interventions, which could contradict unfavorable initial conditions, and could enable poor economies to escape from the development traps.
In the present paper, we aim to contribute to the research on computational models applied to the analysis of convergence phenomena in economies. In particular, we attempt to discover distinct growth trajectories and to find clusters of economies which follow similar development paths. For this purpose, we propose a novel, datadriven approach to mine patterns of transitional growth with the use of conceptbased models.
Concepts in information sciences are abstract entities that represent information chunks. They can be instantiated with the use of various information representation formalisms such as real intervals or sets. A collection of concepts makes a conceptbased model. Depending on the assumed information representation scheme, we can obtain models with different properties.
In this research, we focus on fuzzy setbased concepts, thus, we work with fuzzy conceptbased models. The choice of fuzzy sets was motivated by their flexibility and versatility. They allow expressing knowledge in the terms of belongingness measured on a scale from 0 to 1. One instance may belong to multiple fuzzy sets to different degrees. Thus, we can express the diverse properties of the given data. What is more, we can capture uncertainty in the description of phenomena. Stressed properties make fuzzy modeling especially valuable when we wish to represent the innerworkings of complex systems, such as economic phenomena.
The modeling framework introduced in this paper provides comprehensive means for transforming raw multidimensional data concerning multiple problem instances into a joint fuzzy conceptbased model, which is then used as a base for similarity evaluation of various problem instances. The proposed framework is focused on providing usercentred phenomena representation. With that in mind, it is of the utmost importance that the model is intuitive and easy to visualize. Therefore, in the application discussed in this paper, we relate to the problem of absolute convergence conceptualised as a relationship between income and growth rate. A focus on multivariate data with two variables enables to directly represented the set of available information using twodimensional plots.
Let us reiterate the key novelty elements introduced in this paper:

We propose a novel procedure to assess similarity of multivariate timeseries on the ground of a conceptbased model.

We apply the proposed framework to identify similarities in the growthincome paths. That, in turn, let us to cluster economies on the basis of their development trajectories. The ability to detect distinct group of countries allows to pinpoint patterns of transitional growth specific to a given group.

The obtained results inform the researchers and policy makers which countries have followed similar development paths, and which countries—given their development trajectories—has been stuck in development traps.
The remainder of the paper is structured as follows. Section 2 addresses a brief review of the relevant literature. Section 3 presents the proposed method. In Sect. 4, we attempt to identify the main patterns of the incomegrowth relationship. The paper ends with concluding remarks in Sect. 5.
2 Literature review
One of the most important hypotheses developed within growth economics states that the initial conditions have no consequences on the longrun per capita income. Such a hypothesis is built upon the neoclassical growth models, pioneered by the work of Solow (1956) and Swan (1956), which predict countries to converge to a single global steadystate, due to declining marginal product of capital. The converging process is described with transitional dynamics (Barro and Salai Martin 2004, p. 37), and thus a growth rate that deviate from the steadystate level is referred to as transitional growth (Gundlach and Paldam 2020). In practice, as countries differ in the timing of reaching a steadystate, we ought to observe that economies with lower output per capita (further below the common steadystate) should grow faster than others, which would eventually lead to a global convergence of incomes.
In opposition to neoclassical growth models lies the reasoning based on models allowing for increasing marginal returns, which can produce multiple basins of attraction of the growth processes. Introducing feedback loops from output to the production factors in the growth models helped to theorize the existence of multiple steadystates, into which economies are sorted based on their initial conditions (Azariadis and Drazen 1990; Mankiw et al. 1992; Galor and Weil 2000; Galor 2011). Around these two views evolved a vast literature. Given the number of significant contributions to this debate (see Battisti et al. 2020; Bergeaud et al. 2020; Comin and Mestieri 2018 for the latest), we aim not to comprehensively review the whole discussion, we rather send a reader to the recent survey of the literature of Johnson and Papageorgiou (2020). Nevertheless, we review selected recent works, which have the closest connection with our paper.
As we demonstrate in the next section, our approach leads to the identification of groups of countries with similar growthincome trajectories. Up to date, researchers have extensively used many techniques aimed at splitting countries into distinct groups to study growth patterns, the behaviour of income distribution, and the formation of convergence clubs.
One stream of research in this domain is focused on the identification of countries that are similar in terms of their initial income or other proxies of initial conditions. Early contributions were based on exogenously defined groups (Baumol 1986). More recently, researchers attempt to form clusters endogenously, i.e. directly from the data. For instance, Tan (2009) uses the regression tree algorithm to identify the threshold level of predefined variables which groups countries into distinct growthregimes. More recently, Fiaschi et al. (2019) detect different regimes based on an algorithm designed to compare Akaike information criterion (AIC) from a set of semiparametric Generalized Additive Models. Both, the set of determinants and the thresholds levels which split the sample in the most informative way are derived from the AIC comparisons. The approach of Tan (2009), Fiaschi et al. (2019) has its advantages, as it enables comparing development paths of economies with different initial conditions. Nevertheless, we see merit in an alternative: i.e. in clustering based on the growth experiences during the whole time horizon. Such a procedure eventually gives space to the investigation of the underlying forces of a given class membership.
Another application of the procedures of endogenous partitions is the detection of convergence clubs. Phillips and Sul (2007, 2009) propose a datadriven clustering algorithm that tests for convergence within a certain group. As the main focus of the algorithm lies in the relationship between idiosyncratic transitions and a common growth component of income, the resulting clusters tend to contain countries at very different levels of development, which are expected to converge in the future. Results with similar interpretations are obtained by Beylunioglu et al. (2020). The authors apply maximum and maximal clique algorithms to identify clubs based on unit root tests of pairwise income differences. The methods of Phillips and Sul (2007) and Beylunioglu et al. (2020) differ from ours in the sense of the economic meaning of the clusters. While the convergence clubs contain countries with a variety of transition paths and at a different level of development, we aim to seek clusters of countries that have experienced similar growth rates at a given stage of their development. Therefore, our focus lies in the similarity of transitional growth, rather than crosscountry convergence. The research which is the closest to our paper is the analysis of growth dynamics by Brida et al. (2011). They propose to cluster economies based on the membership to one of four regimes—the regime of low income/low growth, low income/high growth, high income/high growth, and high income/low growth. The thresholds of low/high income and growth are set as the global averages of the corresponding variables. The membership to a given regime is changing over time, so the authors represent these memberships for a given country as a vector and propose a distance measure to assess their similarity. Our main idea is akin, i.e. we also represent the dynamics of development in a twodimensional space of income and growth rates, and we propose to cluster the countries based on a distance measure capturing similarity of the growth paths. However, we propose two important extensions. First of all, we see the evaluation of membership to four regimes as unsatisfactory. Such a procedure effectively collapses the information contained in the value of output and growth rate to be above or below its averages. In the aftermath, a country with output percapita close to the subsistence levels and with no growth is described as similar to a country with income and growth rates just below the world’s averages. We see such simplification to be too extensive. Thus, we propose a method that allows assessing membership to any number of concepts representing the distribution of the data. As we will show in the next sections, this step is crucial to get reliable measures of (dis)similarity for economies that are not on the opposite ends of world income (growth) distributions. Furthermore, since concepts are abstract, we use fuzzy set memberships to describe the evolution of GDP per capita.
Finally, we shall mention the recent work in the area of assessing nonlinearity in the transitional growth models. Most notably, Gundlach and Paldam (2020) with the use of kernel regression present empirical regularities in favour of a humpedshaped longrun development path. The authors justify such a shape of the growthincome relationship with the predictions of Lucas (2009) twosector model. The model predicts that poor countries’ takeoff is delayed due to their low capability to absorb technology. As countries move along the development path their initially low growth rates accelerate thanks to positive externalities of human capital. The growthmiracles are thus observed at lowtomiddle income countries that simultaneously benefit from the advantage of backwardness and which developed sufficient stock of human capital. At the later stage of development, the growth rates fall again, as economies approach “modern” steadystate. Importantly, the model can predict a parabolic incomegrowth relationship of a different shape, depending on model parameters that reflect countries’ internal characteristics. In sum, the development path is thought of as a “greattransition” between traditional and modern steadystate, with the highest rates of growth observed during the transition stage, i.e. around the middle of the income distribution. As Gundlach and Paldam (2020) note, the humpedshaped transition path is expected for any variable generated by a sigmoid function. We build upon these findings, and we evaluate the shape of the development paths in subsamples of similar economies.
3 The method
3.1 Preliminary notions
Empirical evidence on economic growth is available in the form of multivariate time series. That is, given is a sequence of M time series
where \(z_{i_j}\) is the value of the ith variable time series at the jth moment in time. N is the length of the time series we analyse.
Let us introduce labelling of the data which allows us to differentiate between multivariate data sets of various countries. For example, Eq (2) presents multivariate data set concerning a certain country L.
Available data, as in Eqn. (1), forms a cloud of points in an Mdimensional space, where dimensions correspond to variables. Each point in this space can be denoted as \(\mathbf{z} _i = [z_{1_i}, z_{2_i}, \ldots , z_{M_i}]\). \(\mathbf{z} _i\) is an Melement vector corresponding to the moment in time i. Data points in this space are labeled with their respective country labels.
In more general terms, labels allow distinguishing between samples coming from different classes of phenomena. Since this paper is devoted to analyzing patterns of transitional growth, one class corresponds to one country. Thus, the narration refers to the term country not to obfuscate the narration. In the wider context, the approach described in this paper can be altered to cluster and classify multivariate data that is an essential machine learning task (Ren et al. 2016). Furthermore, in the framework introduced in this paper time flow is not represented in the model itself. Thus, the method can be applied to data that is not temporal.
To enhance the temporal data representation ability of the model, we can introduce variables that encode time flow. The most straightforward solution is to introduce firstorder differences of a given temporal variable. The series of firstorder differences for the ith variable is computed as the differences \(z_{i_j}  z_{i_{j1}}\).
3.2 Extraction of concepts
The first phase of the procedure aims at extracting meaningful concepts to represent the underlying data set concerning multiple countries. That is, data as in Eq. (2) concerning all countries that we wish to analyze is at first concatenated.
The role of concepts is to (i) aggregate and (ii) generalize knowledge present in the empirical evidence based on which the concepts are created. The most straightforward procedure that can be applied to extract concepts is centroidbased clustering (Askari et al. 2017; Yang and Nataliani 2017). Not only does it partition the data into subsets, but also it produces centroids that can be treated as “average” representatives of each cluster. To align the choice of algorithms to the methodology put forward in this paper, we employ the wellknown fuzzy cmeans algorithm delivered by Dunn (1973) and Bezdek (1981). It is a fuzzy variant of the kmeans clustering algorithm.
In the fuzzy cmeans algorithm, empirical evidence is associated with each centroid using a membership value. Membership values, in contrast to standard clustering algorithms, are not crisp, but fuzzy, which means that one data point may belong to more than one cluster. The sum of all membership values for a single data point adds up to 1.
The fuzzy cmeans algorithm creates a partition matrix \(\mathbf{U} = [\mu _{ij}], \mu _{ij} \in [0,1], j=1, \ldots , C, i=1, \ldots , P\), where C is the number of clusters, P is the number of data points in the data set. An element \(\mu _{ij}\) represents the degree of membership of the ith data sample i to the jth cluster. The clustering procedure is regulated by a fuzzification coefficient \(m \in {\mathbb {R}}\), such that \(1.0< m < \infty \). The higher the value of m, the more fuzzified the solution. m close to 1 makes the fuzzy cmeans behave like the binary kmeans clustering. Hathaway and Bezdek (2001) recommend setting \(m = 2.0\).
Fuzzy partitioning is carried out by an iterative procedure that aims at minimization of the following objective function:
\(\mathbf{z} _i\) is an ith tuple of the clustered data (ith observation). Each datum \(\mathbf{z} _i\) is located in the Mdimensional space, thus, technically speaking \(\mathbf{z} _i\) is an Melement vector. \(\mathbf{v} _j\) is an Melement vector with coordinates of the jth cluster (coordinates of the jth centroid). m is the aforementioned fuzzification coefficient. \(\mu _{ij}\) is the degree of membership of \(\mathbf{z} _i\) to \(\mathbf{v} _j\). \(\Vert *\Vert \) is any norm expressing similarity of two vectors.
The iterative procedure adjusts membership values \(\mu _{ij}\) from the matrix \(\mathbf{U} \) and the cluster centres \(\mathbf{v} _j\) by:
and
The procedure stops when the greatest change in any value in the matrix \(\mathbf{U} \) from kth to \((k+1)\)th iteration was smaller than a given \(\varepsilon \) or when a predefined number of algorithm’s iterations was exceeded (Zhang et al. 2016).
After applying the fuzzy cmeans, we obtain concepts that are used to represent the given data set. The key parameter is the number of concepts to be extracted that has to be given before the procedure is launched. The more concepts we have, the more finegrained phenomena description we obtain. The downside of having many concepts is that the more concepts we have, the less intuitive the interpretation of the model. Furthermore, a datadriven concept extraction procedure which is advocated in this paper has the property that the more concepts we produce, the more likely it is to obtain concepts representing outlying observations. On one hand, it may seem like an advantage, to be able to pinpoint outliers. On the other hand, the model will become highly biased towards the data that was used to train it. Thus, there is a need for a sensible balance between model generality and specificity (Kim et al. 2018).
3.3 A modification of the concept extraction procedure
The proposed modeling framework will be applied to represent economic phenomena. A frequently seen property in such data is that in different regions (subspaces) of the assumed Mdimensional space variance of observations differs and the clusters are far from being spherical. An example of such a situation is plotted in Fig. 1. The plot concerns a twodimensional space. The first considered variable is GDP per capita (constant 2000 U.S. dollars). The second variable is the net primary income in current U.S. dollars. The data was downloaded from the World Bank API and it concerns the years 1970–2019. Plotted data points concern five large economies in the European Union: France (FR), Germany (DE), Italy (IT), Spain (ES), and the United Kingdom (UK). The interval between the smallest and the largest value found in observations in each dimension was split into three even parts and we plotted horizontal and vertical lines to designate nine regions.
Let us at the moment steer clear from the interpretation of the plot, since it was given only to illustrate certain challenges in data processing that reappears in many data sets in economics.
With a bare eye, we notice that the density of data points varies in designated regions. Furthermore, it is hard to describe the shape of the formed clouds of data points. From the qualitative point of view, we are aware that interesting patterns may be represented with clusters of unequal variance and odd shapes. Yet, the fuzzy cmeans algorithm is most suitable to detect spherical clusters. This stems from the innerworking of this method which measures distances of data points to each centroid. Since the relation of distance is symmetric, the method is particularly wellsuited when the data forms spherical clusters.
Thus, instead of running the fuzzy cmeans algorithm directly in Mdimensional data, we propose to employ it separately for each variable in the given multivariate data set. We perform fuzzy cmeans to extract \(u_1\) centroids for the first variable, \(u_2\) centroids for the second variable, and so on. Next, we apply the Cartesian product to the obtained centroids to generate Mdimensional centroids.
A comparison of the results achieved with the direct and the Cartesian productbased procedure for the aforementioned data set is illustrated in Fig. 2.
Cartesian productbased procedure forces the concepts to spread out more in the given space. In contrast, the direct method tends to position centroids within the densest regions. Since the data scatter is not spherical, more than one centroid will likely be used to represent one crescentshaped cluster. The described situation can be observed in the topright subregion in the lefthandside plot in Fig. 2.
The downside of the Cartesian productbased method is that it may produce superfluous concepts. This case can be observed in the topleft subregion in the righthandside plot in Fig. 2. However, we can easily envision introducing a measure for concept evaluation that could be used to evaluate and eliminate superfluous concepts. At the same time, our previous investigations into conceptbased models suggested that superfluous concepts do not deteriorate prediction accuracy, but rather cause issues in model interpretation (Homenda et al. 2014). In the cited paper, the authors propose several strategies to evaluate concepts.
3.4 Linking concepts with countries and comparing countries
The introduced conceptbased data representation allows transforming the data previously present in the Mdimensional space into a Cdimensional space. In the new space, each dimension corresponds to cluster membership, thus, we have C dimensions. The data in this space is normalized to the [0, 1] because the membership function values are in [0, 1] (see Eq. 4). The multivariate data set concerning a certain country L is now in the form as in Eq. (6).
\(\mu _{i_j}\) is the membership value of the jth data point to the ith cluster.
In the next step, we aim at aggregating knowledge concerning each country. We propose to represent each country with a Celement vector of averaged membership values computed as follows:
The introduction of the averaging makes data representation much more succinct than it was before. We can interpret a jth entry in the vector from the Eq. (6) as an average membership obtained for one country to a certain concept. We compute this vector for each country separately. Since the values are normalized (in [0, 1]), we can directly compare vectors corresponding to different countries.
The subsequent step of the proposed modelling framework aims at evaluating the similarity of countries. We seek a mapping operating on a collection of vectors in the form as in Eq. (6). We propose to use the Jaccard index, which is given as:
To adapt Eq. (8) to fuzzy sets, we assume that \( * \) denotes the cardinality of the fuzzy set computed via summation, \(\cap \) and \(\cup \) can be replaced with a tnorm and an snorm respectively (Hwang et al. 2018; Ramli and Mohamad 2009). We propose to use minimum and maximum as a tnorm and an snorm respectively. In particular, for a given two countries L and K, the computations of the nominator in Eq. (8) look as follows: \(\sum _{i=1}^{C} \min (l_i,k_i)\). Computations in the denominator rely on \(\sum _{i=1}^{C} \max (l_i,k_i)\). In both cases, \(l_i\) and \(k_i\) are ith entries in a vector as in Eq. (6) obtained beforehand for the countries L and K respectively.
3.5 Similaritybased clustering of countries
As a result of the application of the procedure leading from the extraction of concepts to the similarity evaluation for any pair of countries liked to the extracted concepts, we can construct a similarity matrix for any given set of countries. This matrix can be, in turn, used to split the set of countries into disjoint subsets—clusters. We propose to apply hierarchical clustering, because of its intuitive model representation form (Murtagh and Contreras 2017; Xu et al. 2016).
Hierarchical clustering creates a treebased model, where leaves contain single data samples (here countries) and the root holds the whole set of countries we wish to partition. The procedure can start either on top or on the bottom. Depending on the assumed strategy, the procedure is called either divisive or agglomerative (Bouguettaya et al. 2015; Zhang et al. 2013). In the research presented in this paper, we employ the latter one, where the procedure starts with tree leaves containing single observations and merges nodes until a root with all observations is reached. An important choice regarding the treebuilding process is the method of similarity/difference evaluation for a given candidate pair of intermediate nodes to be merged. In this paper, we utilized the popular Ward’s clustering criterion (Murtagh and Legendre 2014) to decide which pair gets merged. The decision about merging certain intermediate clusters is made based on a similarity matrix. Here, we use a similarity matrix computed using the steps addressed in the previous sections.
A hierarchical clustering outcome can be presented in the form of a dendrogram. Based on a dendrogram a human expert can select the proper number of clusters. Subtrees in the formed structure correspond to the clusters. After obtaining clusters, further processing can be employed. In particular, we may look at and evaluate certain parameters of observations that were grouped and interpret the overall outcome from the point of view of the applications area.
4 Data and results
As our goal was to identify patterns of longrun transitional growth, our primary source of information on GDP was the Maddison database (Bolt et al. 2018), which contains historical reconstructions of incomes for a relatively big number of countries. In our first sample, we used annual observations from 1870 to 2016, for all countries with at least 100 observations. We excluded extreme values from the sample, i.e. we cut off 1st and 99th percentile of our variables. Such a procedure gave 5506 observations on 41 countries. Furthermore, we proceeded with an analysis of a broader sample of 144 countries with at least 50 observations in the period 1950–2016, ending with 9228 observations. As described by Bolt et al. (2018), variable cgdppc gives a better estimation of income for crosscountry comparisons, while variable rgdpnapc is more suitable to study variation in growth rates. Therefore, we used the logarithm of the former variable as a measure of income, and we used the differences of logarithms of the latter one to calculate growth rates. We applied our conceptbased model to evaluate the similarity of transitional growth paths in the following manner: we set the number of concepts to 25 and 49 for smaller and larger samples respectively. Then, we applied the Cartesian product to centroids extracted with fuzzy cmeans. Further, we matched observations for a given country with each of the concepts and we use arithmetic averaging to aggregate membership values for a given country. As a next step, we construct a similarity matrix, based on the Jaccard index. Finally, we employed hierarchical clustering on the similarity matrix. As we are interested in obtaining relatively compact clusters, we chose to use Ward’s method.
4.1 Longrun growth patterns (1870–2016)
At first, we turn our attention to the long time horizon and we start with the sample for the years 1870–2016. Clusters obtained for this sample are presented in Fig. 3. Having used a hierarchical clustering algorithm, we can observe the groupings at different levels of aggregation. In Fig. 3, we observe three distinct clusters, which can be divided into smaller, and more similar groups as depicted with different colours representing six clusters.
Starting from the lefthand side of Fig. 3, we note that cluster No. 1 consists of somehow divergent countries, both in terms of geography and the current level of development. A common characteristic of these countries is a late takeoff of their incomes. As a result of being initially poor, these countries form a group of contenders to catchup with the global leaders. Cluster No. 2 is the first candidate for further splits. A decision to cut a whole sample into six parts divides cluster No. 1 into the following groups:

1a: India, Indonesia, Sri Lanka, Venezuela, and Brazil;

1b: Italy, Japan, Finland, Portugal

1c: Poland, Mexico, Panama, Bolivia, Peru, Philippines, Colombia,cuador.
Next, we note that cluster No. 2 (in the middle of Fig. 3) consists of advanced economies, which at the first sight form a compact group. Splitting cluster No. 2 into two groups divides it into the group of European and nonEuropean countries with an exception of France. Looking at even smaller clusters, we grasp a reaffirmation of domain knowledge—the highest similarities are found between Germany and Austria, Norway and Sweden, Denmark and Belgium, etc. We consider this fact as a confirmation of the adequacy of the proposed algorithm.
Finally, cluster No. 3 groups countries, of which development paths similarity is nonobvious at a first glance. Yet, we notice a common characteristic of these countries, which is a relatively turbulent history of their economic systems such as long periods of centrallyplanned communist economies followed by rapid and sharp transformation (Former USSR, Romania, Taiwan), as well as long periods of political unrest spanning throughout the twentieth century (Chile, Greece, Uruguay, Argentina, South Africa). In consequence, cluster No. 3 is characterised with the highest standard deviation of the growth rates as documented in Table 1. Romania and Taiwan fit cluster No. 3 to the smallest extent and thus they form cluster No. 3a. The remaining countries are gathered up in cluster No. 3b.
Having identified groups of countries with similar transitional growth paths, we turn to the analysis of the relationship between growth rates and income levels. To do so, we fit a generalised additive model of a growth rate (with cubic spline smoothing function) for each cluster, as presented in Figs. 4 and 5. In line with Gundlach and Paldam (2020) we observe a humpedshaped relationship between income and growth rates within clusters Nos. 1 and 2 (in the latter case we leave aside a vague shape of the function for the lowest level of income, with very wide confidence intervals). Figure 4 visualises some differences between the estimated path of transitional growth between catchingup countries (cluster 1) and economic leaders (cluster 2). In particular, the fitted growth curve of the former is shifted leftwards, as compared to the curve of the latter one. The acceleration of the growth rates of advanced economies occurred at a relatively high level of their development. Such differences can be interpreted as resulting from the global technological shock which allowed the “grand transition” and which paved the way to a rapid development to countries at different levels of income. Turning to cluster 3, we note a constant growthincome path, with growth rates just below the global median. In this case, as we mentioned above, we observe rather volatile growth, which additionally turned out to be—on average—sustained at the same rate at different levels of incomes.
As a high level of aggregation may disguise the variegation of incomegrowth paths, we move towards the analysis of smaller clusters. Transitional growth paths in clusters 1a and 1b show important dissimilarities. Although in both cases, a humpshaped pattern prevails, countries in 1a seem to converge to a lower level steadystate. A drop in their growth rates occurred much faster than in the case of countries in cluster 1b (which can serve as an example of economies that caughtup with the global leaders successfully). As a result, Venezuela, Brazil, India, Sri Lanka, and Indonesia, although experienced relatively high growth rates, tend to slow down too early to converge with Japan, Italy, Finland, and Portugal. In cluster 1c, in turn, we observe only the increasing function of the growth rate, which can be a part of a humpedshaped function with a relatively late slowdown. Given the wide confidence intervals, the investigated relationship seems to be vague in this group. Finally, the average growth rates in cluster 3b remained at constantly low levels, showing no sign of a humpedshaped curve.
Table 2 presents a further description of the growthincome paths. It reports the outcomes of growth regressions for each cluster. In particular, we regress the growth rates in year t on income in year \(t1\) with the use of pooled OLS (Eq. 9):
where \(y_{i,t1}\) is the income of country i in year t1, and \(\epsilon _{i,t}\) stands for residuals at a countrytime level. The results confirm, that a humped shaped relationship is dominant in cluster 1a, 1b, and 2. A positive, linear relationship is found in cluster 1c. Coefficients of \(y_{i,t1}\) for cluster 3a and 3b are not significantly different from zero.
In sum, we find evidence of a humpedshaped longterm growth, as in Gundlach and Paldam (2020) for a big part of the sample. However, the growth patterns differ substantially across clusters. The discrepancies in these patterns add to our understanding of longterm convergence processes. Given the general humplike shape of the growthincome relationship, we can expect the poor countries to temporally diverge from the middleincome ones. Moreover, countries experiencing a transition from traditional to modern steadystate seem to be on the convergence path with advanced economies. Importantly, such convergence would occur only if a slowdown corresponding with an approaching modern steadystate happened relatively late. In other words, seemingly converging economies may indeed converge to other equilibrium than developed countries. On top of it, we identified a group of countries that tend to follow a different development path. Finally, we notice that the sample for years 1870–2016 contains only countries that are relatively well developed. These economies could have followed different growth paths than underdeveloped ones. Such a sampling issue turns our attention to the analysis of growth patterns in a shorter time dimension but among a broader set of countries.
4.2 Growth patterns in 1950–2016
An application of the conceptbased model and hierarchical clustering algorithm to the GDP series spanning from 1950 to 2016, led us to distinguish clusters as illustrated in Fig. 6. For further analysis, we decided to split the sample into four and eight groups (see Table 5 in the appendix for the list of countries).
We repeat the analysis presented in the previous steps, i.e., we first turn to the groups at a higher level of aggregation. Next, we show how disaggregation into smaller and more compact groups reveals dissimilarities in the transitional growth paths. Looking at the fitted growth curves in Fig. 7, we note that the observed curves of the growth rates reflect the much shorter history of development, as the sample starts in 1950. Thus, the inverted Ushaped curve, dominant in 1870–2016 sample, is no longer visible. In the case of cluster 1, 2, and 3, the relationship between income and growth rates is negative, i.e. the higher the level of income the lower the growth rate. The differences across these clusters are nonnegligible and depict the high growth rates of the richest countries in the world (gatheredup in cluster 1), and a low dynamics of development in cluster 3, which groups countries exhibiting a growth slowdown at relatively early stages of development. The most striking discrepancy in the growth experiences is revealed in cluster 4, which bring together the poorest economies, with the lowest growth rates.
The disaggregation of the sample into eight clusters uncovers another important dissimilarity, as it splits countries at a comparable level of development accordingly with their growth dynamics. In particular, cluster 2b has higher growth rates than cluster 2a. Growth rates differ drastically between cluster 3a and cluster 3b. Cluster 4a consists of initially poor countries that show signs of a takeoff. In contrast, economies in 4b are characterised by a consistently low level of growth. Finally, countries in 4c, despite having a similar level of output as in cluster 4a, tend to stagnate as their income grows (Table 3).
We see the above findings as having important economic meaning. First of all, we can identify distinct growth paths, which relate directly to hypotheses present in the current literature. For example, in cluster 3b we find a signal of the mediumincome trap (Eichengreen et al. 2013), while cluster 3a points to the examples of countries that were able to avoid this trap. Concurrently, in clusters 4b and 4c we identify countries that can be classified as trapped in poverty (Bowles et al. 2016). Secondly, the findings presented in Fig. 8, relate to the unconditional convergence hypothesis. Although our methods do not allow to directly test this hypothesis, the disparities across clusters, such as between the groups 2a and 2b, 3a and 3b, 4a and 4b or 4c, indicate that countries are heading towards different steadystates, and tend to be a part of distinct basins of attractions of their growth processes. Moreover, the withincluster negative relationship between income and growth rate (f.e. in cluster 1, 2a, 3b, 4c) suggest the convergence within these clusters. However, the negative slope of the growth function may be an artifact of longitudinal, withincountries patterns of development, rather than crosscountry convergence. We provide further insight into this issue with the withinbetween model (Bell et al. 2019), which we propose to modify as in Eq. (10):
where \(y_{i,t1}\) is the income of country i in year \(t1\), \(y_{i,1950}\) is the initial income of a country i, \(\epsilon _{i,t}\) stands for residuals at countrytime level, and \(\mu _{i}\) represents random effect attached to intercept. Note, that model 10 is a special case of multilevel models, where observations are clustered within countries. We propose to separate the variable y into its value in 1950 (initial income) and its value centred around initial income (income diff). Such a procedure allows us to simultaneously assess the crosscountry and longitudinal relationships between income and growth rates. We interpret the former one as consequences on growth stemming from cross country differences in the initial conditions. The latter reflects the withincountry relations between growth and output. The results are reported in Table 4.
In general, cross country and longitudinal relationships within clusters are akin, i.e. the coefficients of initial income and income diff have the same signs. The coefficient of the initial income variable indicates a crosscountry convergence within each cluster, except 4a and 4b. Interestingly, the estimations on the whole sample indicate no crosscountry convergence (given the positive coefficient of initial income). Such results stand in line with the overall picture emerging from Fig. 8.
5 Conclusion
In the present paper, we have proposed the modeling framework to evaluate the similarity of multidimensional timeseries data. The backbone of the proposed method consists of a set of abstract concepts, which are linked to the data with fuzzy membership values. These, in turn, serve as a tool to calculate pairwise similarity matrix.
Subsequently, we have applied the conceptbased model to identify main patterns of transitional GDP growth. The proposed procedure has revealed a variegation of development paths. In particular, for a number of economies we have found evidence of a humpedshaped pattern of the longrun growth, and we have demonstrated how these patterns differ across groups of countries. Furthermore, analysing growth rates in 1950–2016, we have identified groups of countries that follow distinct transitional growth paths despite being at a relatively similar stage of development. In sum, the obtained results help to pinpoint main transition processes, which facilitates explaining crosscountry inequalities of income.
The key novelty of the paper, i.e. the conceptbased model, led us to identify groups of countries with similar growth trajectories. In our view, such a procedure enables a balance between oversimplification related to the analysis of pooled data, and overspecificity of the individual country timeseries explorations. Given the variety of the development histories, and a need of formulation of some generalisations, such balance seems to be of the uttermost importance. Moreover, the proposed method opens a room for the discussion on the determinants of the membership to a given cluster. Following the domain literature (e.g. Fiaschi et al. 2019; Tan 2009), we envision that the potential drivers are the initial conditions, which constitute ’deep’ determinants of economic growth and which are not captured by initial income variable. Thus, determinants such as culture, geography, institutions and social capital may serve as a potential explanation of the diversified path of transitional growth. Nevertheless, we also expect these variables to be correlated with the initial income, and to be a weak explanation of the different patterns of growth of countries at a similar level of development. Thus, we see a potential in the qualitative research on the policy measures introduced in these economies. A promising research constitute also an qualitative investigation of growth shocks, and associated structural breaks within individual countries. Last but not least, one could also use tools typical to growth econometrics to seek for withinclusters relationships between growth and its proximate determinants. Efforts to explain class membership is the direction of followup studies, which we aim to undertake in the future research.
Data availability
Not applicable.
References
Askari, S., Montazerin, N., Fazel Zarandi, M.: Generalized possibilistic fuzzy cmeans with novel cluster validity indices for clustering noisy data. Appl. Soft Comput. 53, 262–283 (2017)
Azariadis, C., Drazen, A.: Threshold externalities in economic development. Q. J. Econ. 105(2), 501 (1990)
Barro, R..J., Salai Martin, X.: Economic Growth. MIT Press, Cambridge (2004)
Battisti, M., di Vaio, G., Zeira, J.: Convergence and divergence: a new approach, new data, and new results. Macroecon. Dyn. 1–34 (2020)
Baumol, W.J.: Productivity growth, convergence, and welfare: what the longrun data show. Am. Econ. Rev. 76, 1072–1085 (1986)
Bell, A., Fairbrother, M., Jones, K.: Fixed and random effects models: making an informed choice. Qual. Quant. 53(2), 1051–1074 (2019)
Bergeaud, A., Cette, G., Lecat, R.: Convergence of GDP per capita in advanced countries over the twentieth century. Empir. Econ. 59(5), 2509–2526 (2020)
Beylunioglu, F.C., Yazgan, M.E., Stengos, T.: Detecting convergence clubs. Macroecon. Dyn. 24(3), 629–669 (2020)
Bezdek, J.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, New York (1981)
Bolt, J., Inklaar, R., de Jong, H., van Zanden, J.L.: Rebasing Maddison: new income comparisons and the shape of longrun economic development. In: Maddison Project Working, paper 10(10) (2018)
Bouguettaya, A., Yu, Q., Liu, X., Zhou, X., Song, A.: Efficient agglomerative hierarchical clustering. Expert Syst. Appl. 42(5), 2785–2797 (2015)
Bowles, S., Durlauf, S.N., Hoff, K.: Poverty Traps. Princeton University Press, Princeton (2016)
Brida, J.G., London, S., Punzo, L., Risso, W.A.: An alternative view of the convergence issue of growth empirics. Growth Chang. 42(3), 320–350 (2011)
Comin, D., Mestieri, M.: If technology has arrived everywhere, why has income diverged? Am. Econ. J. Macroecon. 10(3), 137–178 (2018)
Dunn, J.: A fuzzy relative of the isodata process and its use in detecting compact wellseparated clusters. Int. J. Cybern. Syst. 3, 32–57 (1973)
Eichengreen, B., Park, D., Shin, K.: Growth Slowdowns Redux: New Evidence on the Middleincome Trap. Technical Report (2013)
Fiaschi, D., Lavezzi, A.M., Parenti, A.: Deep and proximate determinants of the world income distribution. Rev. Income Wealth 66, 677–710 (2019)
Galor, O.: Unified Growth Theory. Princeton University Press, Princeton (2011)
Galor, O., Weil, D.N.: Population, technology, and growth: from Malthusian stagnation to the demographic transition and beyond. Am. Econ. Rev. 90(4), 806–828 (2000)
Gundlach, E., Paldam, M.: A humpshaped transitional growth path as a general pattern in longrun development. Econ Syst 44, 100809 (2020)
Hathaway, R., Bezdek, J.: Fuzzy cmeans clustering of incomplete data. IEEE Trans. Syst. Man Cybern. 31(5), 735–744 (2001)
Homenda, W., Jastrzębska, A., Pedrycz, W.: Time series modeling with fuzzy cognitive maps: simplification strategies. In: Saeed, K., Snášel, V. (eds.) Computer Information Systems and Industrial Management, pp. 409–420. Springer, Berlin (2014)
Hwang, C.M., Yang, M.S., Hung, W.L.: New similarity measures of intuitionistic fuzzy sets based on the Jaccard index with its application to clustering. Int. J. Intell. Syst. 33(8), 1672–1688 (2018)
Johnson, P., Papageorgiou, C.: What remains of crosscountry convergence? J. Econ. Lit. 58(1), 129–175 (2020)
Kim, E., Oh, S., Pedrycz, W.: Design of reinforced interval type2 fuzzy cmeansbased fuzzy classifier. IEEE Trans. Fuzzy Syst. 26(5), 3054–3068 (2018)
Lucas, R.E.: Trade and the diffusion of the industrial revolution. Am. Econ. J. Macroecon. 1(1), 1–25 (2009)
Mankiw, N.G., Romer, D., Weil, D.N.: A contribution to the empirics of economic growth. Q. J. Econ. 107(2), 407–437 (1992)
Murtagh, F., Contreras, P.: Algorithms for hierarchical clustering: an overview, II. WIREs Data Min. Knowl. Discov. 7(6), e1219 (2017)
Murtagh, F., Legendre, P.: Ward’s hierarchical agglomerative clustering method: which algorithms implement ward’s criterion? J. Classif. 31(3), 274–295 (2014)
Phillips, P.C.B., Sul, D.: Transition modeling and econometric convergence tests. Econometrica 75(6), 1771–1855 (2007)
Phillips, P.C.B., Sul, D.: Economic transition and growth. J. Appl. Economet. 24(7), 1153–1185 (2009)
Ramli, N., Mohamad, D.: On the Jaccard index similarity measure in ranking fuzzy numbers. Matematika 25(2), 157–165 (2009)
Ren, Y., Zhang, L., Suganthan, P.N.: Ensemble classification and regressionrecent developments, applications and future directions [review article]. IEEE Comput. Intell. Mag. 11(1), 41–53 (2016)
Solow, R.M.: A contribution to the theory of economic growth. Q. J. Econ. 70(1), 65 (1956)
Swan, T.W.: Economic growth and capital accumulation. Econ. Record 32(2), 334–361 (1956)
Tan, C.M.: No one true path: uncovering the interplay between geography, institutions, and fractionalization in economic development. J. Appl. Economet. 25(7), 1100–1127 (2009)
Xu, J., Wang, G., Deng, W.: Denpehc: Density peak based efficient hierarchical clustering. Inf. Sci. 373, 200–218 (2016)
Yang, M.S., Nataliani, Y.: Robustlearning fuzzy cmeans clustering algorithm with unknown number of clusters. Pattern Recogn. 71, 45–59 (2017)
Zhang, L., Lu, W., Liu, X., Pedrycz, W., Zhong, C.: Fuzzy cmeans clustering of incomplete data based on probabilistic information granules of missing values. Knowl. Based Syst. 99, 51–70 (2016)
Zhang, W., Zhao, D., Wang, X.: Agglomerative clustering via maximum incremental path integral. Pattern Recogn. 46(11), 3056–3065 (2013)
Funding
This research was supported by the National Science Centre, Grant No. 2019/35/D/HS4/01594, decision no. DEC2019/35/D/HS4/01594.
Author information
Authors and Affiliations
Contributions
AJ developed the proposed conceptbased model, and wrote the R code with the implementation of the model. AJ helped to draft the manuscript. JB carried out the application of the proposed method to the analysis of transitional growth, and drafted the manuscript. Both Authors participated in the design of the study, read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Code availability
Code available upon request.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bartak, J., Jastrzębska, A. Mining patterns of transitional growth using multivariate conceptbased models. Qual Quant 56, 4395–4419 (2022). https://doi.org/10.1007/s11135022013188
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11135022013188
Keywords
 Fuzzy sets
 Conceptbased models
 Economic growth
 Convergence