1 Introduction

In the last twenty years, private equity (PE) has acquired particular importance in the global financial system and already exceeds 1.1 trillion dollars in assets under management. PE investment involves the purchase of shares in companies that are not listed on the stock market. It is distinguished by its often long-term approach to enhance the performance and value of the company, seeking significant benefits when carrying out the divestment (Cumming et al., 2023). The main investment strategies in PE include the acquisition of companies, investment in growing companies, and restructuring companies with financial problems (Gredil et al., 2023; Gupta & Nieuwerburgh, 2021). Over the years since the founding of the venture capital industry, PE transactions have emerged as an increasingly crucial mechanism for carrying out rapid and significant restructurings in organizations globally (Cumming et al., 2007; Wright & Bruining, 2008). On the other hand, knowing the PE return drivers has greatly interested researchers and academics (Brown et al., 2021a, 2021b). Understanding the factors that drive profitability enables PE investors and managers to detect opportunities that maximize the returns on their investments and improve risk assessment. This ability is essential given the often illiquid and long-term nature inherent to these investments. Furthermore, deep knowledge of the profitability drivers enables PE funds to optimize their investment strategies, focusing on sectors or companies with a higher potential for growth and profitability (Cumming et al., 2023; Harris et al., 2023). In this regard, some studies have developed statistical models to determine PE return drivers. Still, the explanatory capacity of these models has certain limitations related to their precision levels and exclusive focus on groups of countries located in Europe and the EE.UU. (Dai, 2022). For example, Diller and Kaserer (2009), Aigner et al. (2008), and Achleitner et al. (2010) studied the European PE, and their accuracy levels were at most 64.10%. On the other hand, Manigart et al. (2002) and Jegadeesh et al. (2015) used PE samples from Europe and EE.UU. Their accuracy was between 42.60 and 60.10%. Finally, Franzoni et al. (2012) addressed samples corresponding to EE.UU., Europe and Asia, but their precision regression did not exceed 12.50%. Consequently, the current literature demands new analysis models for PE return drivers that provide a better fit in scenarios worldwide (Caporale et al., 2024; Dai, 2022; Easton et al., 2020).

To cover the aforementioned research gap, this study aims to develop models capable of identifying the PE returns with high accuracy. We use computational methods that provide good precision and a sample with PE funds worldwide. The computational methodology has previously been used successfully in studies related to PE. Calafiore et al. (2020) apply Multilayer Perceptron for PE that carries out public share offerings. Using computational techniques, Sugathan and Baid (2013) also analyze the various factors affecting PE investment decisions. The models built here can be used in all regions, achieving accuracy above 86%. These models are built from a sample of 1,606 PE funds with a geographical focus on the world´ five regions. Our study includes a much larger sample in number and regions than those usually used in previous studies. For example, Manigart et al. (2002) used a sample of 200 PE funds in four European countries and the EE.UU. Diller and Kaserer (2009) selected 791 European PE funds. Jegadeesh et al. (2015) used a sample of 26 PE funds from Europe, EE.UU. and Australia. Therefore, we make at least two additional contributions to the literature. First, we improve the precision of identifying PE return drivers concerning that obtained in previous studies using innovative methodologies. Second, our study has addressed the return of PE globally and is therefore not restricted to Europe or EE.UU.

This study is structured as follows. Section 2 reviews the empirical research literature on PE return drivers. Section 3 establishes the methodology used. Section 4 details the data and variables used in the study. Finally, Sect. 5 analyzes the results obtained. The article concludes by presenting the conclusions of the study, its implications, and future research.

2 Background

Previous studies on PE have addressed various issues, including the return, investment criteria, valuation of PE funds, and the dynamics of collaboration with partners (Caporale et al., 2024). For its part, the research focused on PE return drivers has been carried out from theoretical and empirical perspectives. From a theoretical perspective, different hypotheses about the functioning of private capital markets impact PE return. The first refers to the specialization effect. By specializing, PE managers are more likely to work on a significant number of similar transactions. In addition, thanks to this, the manager creates a more extensive network of contacts also specialized in a single niche, improving profitability (Manigart et al., 2002). This implies that greater specialization by industry implies higher returns (Ewens et al., 2013). The second hypothesis is linked to the speed effect on investment. According to Phalippou and Gottschalg (2009), funds with a slower investment rate obtain worse returns. Thirdly, the hypothesis on economies of scale and perimeter postulates that when PE management teams´ ability to create value and expertise have to be distributed among several investments, their returns begin to decrease. Therefore, a lesser specialization of the PE manager among several industries can have the same negative effect (Lopez-de-Silanes et al., 2015). The persistence of return hypothesis maintains that PE managers who perform better than the industry are more likely to continue with that positive spread in their subsequent managed funds (Kaplan & Schoar, 2005). The hypothesis on macroeconomic conditions also presupposes an impact on PE return (Kwabi et al., 2022). PE return is characterized by being procyclical and improves with GDP growth, and deteriorates with increases in interest rates. Likewise, it´s directly correlated with the evolution of listed markets (Phalippou & Gottschalg, 2009). Finally, according to the Resource-Based Theory, the increase in value in PE is explained by the accumulation of different internal resources, synergies and degrees of specialization. Thus, the greater the number of managed companies and the smaller they are, the less profitability they usually require. A possible reason that reinforces this hypothesis is that, in these cases, the numerous investments represent a hedge for the manager regarding the variation in returns (Manigart et al., 2002).

From an empirical perspective, the study of PE return drivers has gained importance from the seminal work of Gompers (2000), who coined money-chasing transactions, a concept that establishes the flow of funds in the PE industry as the most crucial factor driving the valuation. Subsequently, various studies have used statistical methods such as regression to identify PE return drivers. For example, Manigart et al. (2002) studied PE returns in Europe and the EE.UU from 1994 to 1997. They concluded that fund location, degree of participation, and shorter holding periods would produce higher returns. With a sample of 200 funds, their regression results achieved an accuracy of 42.6%. For their part, Diller and Kaserer (2009) confirmed the findings of Gompers (2000). Using a regression approach on a sample of 791 PE funds from 1980 to 2003, they obtained a precision of 47% and attributed a significant portion of the variation in PE returns to general fund inflows, partner skills, and independent income. Furthermore, they found that PE returns are unrelated to stock markets and negatively related to economic growth rates.

Kaplan and Schoar (2005) find evidence of fund performance persistence in their PE study. Fund managers who have been successful in the industry are more likely to repeat the success in the next fund. Aigner et al. (2008) also found persistence in PE performance and showed that past performance determines future return. In addition, other factors that drive returns include fund experience, stock market movement, economic trends, year of seniority, stage of funding, and the number of portfolio companies. They obtained an accuracy of 48.2% in their regression models with a sample of 358 funds. Jegadeesh et al. (2015) used regression and samples of 26 PE funds from Europe, EE.UU., and Australia from 1994 to 2008, achieving a precision of 65.65%. Franzoni et al. (2012), with global data from 1975 to 2006, obtained an accuracy of 18.9%. Both studies concluded that abnormal PE returns are due to risk factors stemming from a lack of liquidity. Scarpati and Ng (2013) pointed out that this abnormal profitability is due to traditional financial theory, rational factors that are not risk factors, irrational factors, behavioural factors, and internal cost factors.

Achleitner et al. (2010) investigated the drivers of PE value creation in European leveraged buyouts. They attributed one-third of returns to leverage and two-thirds to trading and market effects. Their accuracy levels did not exceed 64.10%. For Gohil and Vyas (2016), PE returns are affected by skill and market factors, and petite are affected by structural elements. These factors include the size of the investment, industry, sponsor, type of exit and stage of investment. The study by Steger (2017) indicates that macroeconomic conditions influence PE yields. It finds that weak economic growth, low bond yields, and low stock market valuations during the period in which investments are made favour returns.

Korteweg (2019) recently reviewed empirical methods to assess the risk and PE return. His findings indicate that profitability estimates vary substantially by method, period time, and data source. For their part, Roggi et al. (2019) investigated the relationship between performance and characteristics in EE.UU. Using linear and polynomial regressions, they detected a concave relationship between fund size and performance. Morri et al. (2021) investigated the performance of unlisted European real estate PE between 2001 and 2014. Their results show the importance of size and duration in the performance of funds, emphasizing that the effects of the independent variables on performance do not change significantly in different business cycles. Finally, Brown et al. (2021a, 2021b) have provided evidence of continued PE outperformance of public companies.

None of the studies discussed above examine the PE return using computational methods and for a wide range of geographic areas. Furthermore, the results presented by these studies are far from reaching a high level of precision in estimating of PE return drivers. On the contrary, the present article adopts such computational methods to obtain a global view of PE returns and to establish with high precision which drivers are important.

3 Methods

3.1 Research Design

This section discusses selecting the most critical drivers in the PE return. For this purpose, we use two PE return proxies as dependent variables: Internal Rate of Return and Cash Multiple (Gompers et al., 2016). An illustration of this procedure appears in Fig. 1. First, we prepare the database eliminating outliers and cases with missing values. Second, to check the robustness of the PE return drivers model, we use a set of computational classifiers that have provided excellent accuracy results in previous empirical studies on financial investments (Patel et al., 2015; Subasi & Cankurt, 2019; Ullah et al., 2020). These classifiers are Multilayer Perceptron, Random Forest, Näive Bayes, Vector Support Machine, and C4.5 Algorithm, which have been finally combined by the Voting Algorithm using a majority approach. Third, we use Filter and Wrapper methods for feature selection, namely Info Gain Attribute Eval, Cfs Subset Eval, Correlation Attribute Eval, Gain Ratio Attribute Eval, ReliefF Attribute Eval, Symmetrical Uncert Attribute Eval, and Classifier Attribute Eval.

Fig. 1
figure 1

Research design

3.2 Classifiers Theory

Multilayer Perceptron (MLP) is one of the widespread examples of feedforward neural networks. In MLP, the initial processing elements are unidirectionally biased. In these networks, information evolution occurs as a function of communications between three types of overlapping layers: input, hidden, and output layers. The networks between these layers are associated with weighting values, performing two functions in each MLP node, which are called summation and activation functions (Ojha et al., 2017). For this, the weights W are adjusted with the information from the sample set, considering that both the architecture and the network´s connections are known, and the objective is to obtain those weights that minimize the learning error. Given, then, a set of pairs of learning patterns {(x1, y1), (x2, y2)… (xp, yp)} and an error function ε (W, X, Y), the training process implies the search for the set of weights that minimizes the learning error E(W) (Shang & Benjamin, 1996), according to the expression (1).

$$ min_{w} E\left( W \right) = min_{w} \mathop \sum \limits_{i = 1}^{p} E\left( W \right) $$
(1)

For its part, Random Forest (RF) is a data mining tool to solve problems related to classification and regression. Growing a set of trees and deciding the class type by voting has significantly improved classification accuracy (Breiman, 2001). To that end, random vectors are built to grow these sets. Each tree is generated from one of the random vectors. Classification problems are solved by analyzing the output of trees. A majority of the class votes determine the RF prediction. The training algorithm can be summarized as follows: A sample of the training data is drawn for each tree in the set. By growing the tree Tb over Z, the available features as candidates for splitting at the respective node are randomly selected. Finally, the grown tree Tb is added to the set. During inference, each tree makes a prediction \(\hat{c}_{b} \left( x \right)\) for the class label of the new observation x. The final prediction of the random forest \(\hat{c}_{RF} \left( x \right)\) is then the majority vote of the trees, as expressed in (2).

$$ \hat{c}_{RF} \left( x \right) = majority\, vote \left\{ {\hat{c}_{b} \left( x \right)} \right\}. $$
(2)

On the other hand, the Naive Bayes (NB) classifier is considered one of the probabilistic classifiers. This classifier assumes that probability distributions govern the attributes to be classified and that the optimal decision can be made considering counts of these probabilities together with the observed data (Joachims, 1998; Lewis & Ringuette, 1994). To predict the probability, it uses the concept of Bayes’ theorem, which is helpful because it provides a way to compute the posterior probability, P(C|X), from P(C), P(X|C), and P(X). Bayes’ theorem states Eq. (3).

$$ P\left( {C|X} \right) = \frac{P(X|C)P\left( C \right)}{{P\left( X \right)}} $$
(3)

where P(C|X) indicates the posterior probability that hypothesis C is true given that event X has occurred.

Also, Support Vector Machines (SVM) have been shown to achieve good generalization performance over several classification problems. In geometric terms, SVM is seen as the attempt to find a hyperplane that separates the positive examples from the negative ones by the widest possible margin (Xu et al., 2009). The final decision is to find the maximum margin hyperplane. Assume that xi Є Rd, i = 1, 2,…, N forms a set of input vectors with class labels yi Є {+ 1; − 1}, i = 1, 2,…, N. SVM can map the input vectors xi Є Rd in a high-dimensional feature space ɵ(xi) Є H. A kernel function K (xi, xj) performs the mapping ɵ (.). The resulting decision boundary is defined as (4).

$$ f\left( x \right) = {\text{sgn}} \left( {\mathop \sum \limits_{1}^{N} y_{i} \alpha_{i } .K \left( {x, x_{i} } \right) + b} \right) $$
(4)

Another of the computational classifiers used in this study is C4.5 Algorithm (C4.5), designed as an extension of ID3 Algorithm described by Quinlan (1986). The latter is part of the classifiers known as decision trees, which are represented by trees where their internal nodes are labelled as attributes, and the outgoing branches of each node represent tests for the attribute values. The leaves of the tree identify the categories. C4.5 is formed in the following order: (a) select the attributes as roots, (b) create a branch for each value, and (c) repeat the process for each branch until all instances of the branches have the same class. The highest gain is used to select attributes, such as the root, according to the Eq. (5).

$$ Gain \left( {S,A} \right) = Entropy\left( S \right) - \mathop \sum \limits_{i = 1}^{n} \frac{{\left| {S_{i} } \right|}}{\left| S \right|} = Entropy\left( S \right) $$
(5)

where S is the set of cases, A is the attributes, n is the partition number of attribute A, and Si is the number of patients in the i-th partition. For its part, the value of Entropy is expressed according to Eq. (6).

$$ Entropy\left( S \right) = \mathop \sum \limits_{i = 1}^{n} - p_{i} *log_{2} *p_{i} $$
(6)

where n is the number of partitions of S, and pi is the proportion of S.

Finally, the Voting Algorithm matched each basic-level classifier using a voting approach. The most straightforward voting approach is majority voting, in which the base-level classifier votes for its predictions. The instance is ranked in the class that gets the most votes. The plurality voting method is modified for the situation where basic classifiers estimate class probability distributions (Dietterich, 1997). This algorithm calculates the weights based on the distance between the module outputs (7).

$$ W_{i} = \frac{1}{{1 + \mathop \prod \nolimits_{{\begin{array}{*{20}c} {i = 1, j = 1, } \\ {i \ne j} \\ \end{array} }}^{N} \frac{{d^{2} \left( {x_{i} , x_{j} } \right)}}{{a^{2} }}}} $$
(7)

where d (xi,xj) is the distance between the output values of modules i and j, and a is a scaling factor. After assigning the weights, the output of the voter is calculated according to (8).

$$ x_{0} = \mathop \sum \limits_{i = 1}^{N} \left( {\frac{{w_{i} }}{S}} \right) \cdot x_{i} $$
(8)

where S is the sum of all the weights.

Additionally, this study uses the statistical Logit classifier to compare its precision to computational classifiers. The Logit model specification may be represented according to Eq. (9).

$$ y_{i}^{*} = x_{i} + \varepsilon_{i} $$
(9)

with

$$ y_{i} = \left\{ \begin{gathered} 1 \, if{\text{ y}}_{i}^{*} > 0.5 \hfill \\ 0 \, if{\text{ y}}_{i}^{*} \le 0.5 \, \hfill \\ \end{gathered} \right. $$

Then

$$ P[y_{i} = 1] = P[x_{i} \beta + \varepsilon_{i} > 0.5] = F(x_{i} \beta ) $$
(10)
$$ P[y_{i} = 0] = 1 - F(x_{i} \beta ) $$
(11)

Frequently, models involving discrete dependent variables are presented as index function models. In this context, discrete choice is interpreted as a reflection of an underlying regression. (Alaminos et al., 2016). The model was calculated using the backward stepwise method, in which variables are eliminated based on the probability of the plausibility statistic. As such, from Eq. (9), we obtain (12).

$$ P(y_{i} = 1) = \frac{{e^{\beta ^{\prime}x} }}{{1 + e^{\beta ^{\prime}x} }} = \frac{1}{{1 + e^{ - (\beta ^{\prime}x)} }} $$
(12)

Thus, the ratio between the two probabilities (known as the Odds ratio) is established as follows in Eq. (13).

$$ Odds = \frac{{P(y_{i} = 1)}}{{1 - P(y_{i} = 1)}} = \frac{{1/[1 + e^{ - (\beta ^{\prime}x)} ]}}{{1/[1 + e^{(\beta ^{\prime}x)} ]}} = \frac{{1 + e^{(\beta ^{\prime}x)} }}{{1 + e^{ - (\beta ^{\prime}x)} }} = e^{(\beta ^{\prime}x)} $$
(13)

The estimated coefficients (β) represent measurements of the changes in the odds ratio. The Odds ratio may be interpreted as the number of times the phenomenon is more likely to occur than it is not (Hair et al., 1999). By applying the logarithms in (13), we obtain (14), a linear expression of the model under consideration.

$$ y_{i}^{*} = \ln \frac{{P(y_{i} = 1)}}{{1 - P(y_{i} = 1)}} = \ln e^{\beta ^{\prime}x} = \beta ^{\prime}x $$
(14)

3.3 Feature Selection Methods

Feature selection is the process of selecting the most important and relevant features from a data set to improve the prediction performance of predictors, provide faster and more cost-effective predictors, and provide a better understanding of the process. The feature selection methods used in this study were of two classes: Filter and Wrappers. Wrapper methods perform better than filter methods because the feature selection process is optimized for a given classifier. However, they are usually too expensive to use if the number of features is significant because they must evaluate each set of elements considered with the given classifier. On the other hand, Filter methods are much faster than wrapper methods. They are independent of any learning method since they focus on the data characteristics, making them very useful for large data sets. In this study, a set of Filter methods has been applied (Info Gain Attribute Eval, Cfs Subset Eval, Correlation Attribute Eval, Gain Ratio Attribute Eval, ReliefF Attribute Eval, Symmetrical Uncert Attribute Eval) and Wrapper (Classifier Attribute Eval), that cater to most feature selection techniques (Omuya et al., 2021).

Info Gain Attribute Eval (IGA) evaluates the worth of an attribute by measuring the information gained to the class. It provides a way to use entropy to calculate how a change in the dataset impacts the distribution of types. Information Gain (\(GAIN \left( {X{|}Y} \right){ }\) is expressed through the Eq. (15).

$$ GAIN \left( {X{|}Y} \right) = H\left( X \right) {-} H(X|Y) $$
(15)

where H(X) describe the entropy of a discrete random variable X, and H(X|Y) is the conditional entropy of the class-given attribute. The higher the value of mutual information between types and attributes, the higher the relevance between types and attributes (Ashraf et al., 2010).

Cfs Subset Eval (CFS) identifies a subset of the attributes highly correlated with the class without being strongly correlated. By default, it searches through the space of possible attribute subsets for “the best” using a specified search method. It does this by evaluating a subset of attributes by calculating each attribute´s correlations (Pearson) against the class and the correlations between attributes (Chandrashekar, 2014; Megha, 2013). The equation used to filter out the irrelevant, redundant feature that leads to the class´s poor prediction is defined as (16).

$$ F_{s} = \frac{{N*r_{a} }}{{N + N\left( {N - 1} \right)r_{n} }} $$
(16)

For its part, Correlation Attribute Eval (COA) evaluates the attributes of the target class. Consider nominal attributes based on value, each value acts as an indicator. In addition, using the Pearson correlation coefficient measures the correlation between each of the attributes and the attribute of the target class (Gnanambal et al., 2018).

Gain Ratio Attribute Eval (GAA) evaluates the worth of an attribute by measuring the gain ratio to the class. It was proposed to reduce a bias towards multi-valued attributes by considering the number and size of branches when choosing an attribute (Quinlan, 1986). The Gain Ratio (\(GainR\)) is given through the Eq. (17).

$$ GainR\left( {Class , Attribute} \right) = \frac{{H\left( {Class} \right) - H(Class | Attribute)}}{{H\left( {Attribute} \right)}} $$
(17)

ReliefF Attribute Eval (RFA) evaluates the worth of an attribute by repeatedly sampling an instance and considering the value of the given attribute for the nearest instance of the same and different class. It consists of assigning a weight to each attribute and selecting the attributes whose weight exceeds a predetermined threshold (Chandrashekar, 2014). If X and Y are taken as two instances, then the differences in feature values between these two instances are defined by Eq. (18).

$$ diff\left( {x_{k} , \, y_{k} } \right) = \left( {x_{k} - \, y_{k} } \right){/}nu_{k} $$
(18)

where nuk is a normalization unit to normalize the values of diff into the interval [0,1].

Symmetrical Uncert Attribute Eval (SUA) overcomes the IGA bias towards features with more values by normalizing its value to the range [0,1] (Hall & Smith, 1998). Symmetric uncertainty is expressed through the Eq. (19).

$$ SU\left( {X, Y} \right) = \frac{2 * GAIN(X|Y)}{{H\left( X \right) + H\left( Y \right)}} $$
(19)

where \(GAIN\left( {X{|}Y} \right){ }\) describe the mutual information, and H(X) represents the entropy of a discrete random variable X. A value 1 of SU(X,Y) indicates stronger dependence/correlation between X and Y. In contrast, value 0 indicates the independence/no correlation between X and Y.

Finally, Classifier Attribute Eval (CLA) is a wrapper algorithm that uses an individual classifier as a function to evaluate the subsets and cross-validation to estimate the accuracy of the classifier (Diao et al., 2014; Kohavi & John, 1997).

4 Sample and Variables

This study aims to determine which variables explain PE return at a global level. For this purpose, we have chosen a sample composed of PE funds corresponding to five world regions between 1980 and 2020. This sample comes from the Preqin Dataset and is made up of 1606 PE funds. Of the total sample, 1.72% are from Asia, 0.16% are from Australia, 24.94% are from Europe, 72.41% are from America, and 0.77% are from Africa. In addition, among the strategies, Buyout (34.19%), Fund of funds (18.94%), Venture (17.66%) and Early Stage (12.17%) stand out. Likewise, the PE funds in the sample cover 11 industries, including Diversified (58.95%), Information Technology (19.86%), and Healthcare (7.94%) (Table 1).

Table 1 Sample characteristics

On the other hand, the present study uses a set of variables selected from the previous literature (Aigner et al., 2008; Dai, 2022). As dependent variables, two main measures are used to assess the returns generated by a PE in a portfolio company exit: Internal Rate of Return (IRR) and Cash Multiple. The IRR is the most common means of assessing the performance of PE and represents the discount rate that makes the cash return multiple equal to the initial investment for a given period, according to (20).

$$ IRR = \left( {\frac{CM}{{\left[ {1 + IRR} \right]^{t} }}} \right) - 1 = 0 $$
(20)

where CM is the cash multiple of the transaction and t is the holding period.

Cash Multiple is the ratio of cash received from income divided by cash invested by PE investors and calculated according to (21).

$$ Cash \, Multiple = \frac{Cash\, received\, from\, income}{{Cash\, invested\, by\, PE\, investors}} $$
(21)

Regarding the independent variables, those related to the ability factors of the fund managers to execute a certain agreement have been considered. Specifically, they refer to the experience of the managers (Assets manager) (Gohil & Vyas, 2016), the size of the fund (Fund size and Log fund size) (Aigner et al., 2008; Farooq et al., 2021; Gohil & Vyas, 2016), and the location of the administrators (Region fund manager) (Caporale et al., 2024). Also, variables refer to market factors and related to the country´s financial market where the PE is located. Seven factors have been included in the study as market drivers: three-month average interest rates in Germany and the EE.UU. over the life of a fund (Germany interest and EE.UU. interest) (Steger, 2017), nominal GDP growth in Germany and EE.UU. (Germany GDP and EE.UU. GPD) (Aigner et al., 2008), the performance of the MSCI World Performance Index as a proxy for developing the stock market (MSCI return) (Aigner et al., 2008; Steger, 2017) and the amount of money committed with PE worldwide (Log world PE and World PE) (Aigner et al., 2008). Finally, in third place, the variables that include the variables related to the PE structure: diversification between regions, industrial sectors and stages of investment, measured by the Herfindahl–Hirschman index (Diversification) (Ick, 2005), the geographical area in which the fund has made its investment (Geography focus) (Caporale et al., 2024), the specific PE industry (Industry focus), the fund categories according to whether they are venture capital and private capital (Type of fund), and the duration of the fund in years (Lifetime) (Aigner et al., 2008). Table 2 shows the description of the variables used in the research.

Table 2 Econometric variables

5 Empirical Results

This section presents PE return drivers analysis results for the two proposed independent variables (IRR and Cash Multiple).

5.1 Descriptive Analysis

The objective of the descriptive analysis is to examine the variables to be used and to know their main statistical parameters. A summary of the quantitative variables used in the research is shown in Table 3. All the variables present a moderate dispersion to the mean values, except for Assets manager and Fund size, due to the broad spectrum of PE in the sample.

Table 3 Descriptive statistics

5.2 Classification Fit

Tables 4 and 5 report the classification results obtained in analyzing PE return drivers for IRR and Cash Multiple. Considering all goodness-of-fit criteria (Global accuracy, Precision, Recall, F-Measure, and MCC), the classifier that has provided the highest accuracy for IRR is C4.5, with 86.13% (Table 4). For Cash Multiple, the highest accuracy has been obtained with RF (81.81%) (Table 5). For its part, Fig. 2 illustrates the ROC curve of all the classifiers, indicating that, in all cases, the classification fit has been acceptable.

Table 4 Classification results for IRR
Table 5 Classification results for Cash Multiple
Fig. 2
figure 2

ROC curve analysis

5.3 Feature Selection

Tables 6 and 7 show the variables selected by each feature selection method for IRR and Cash Multiple variables, respectively. For a better understanding of the ranked variables obtained, three groups have been established based on the importance of the variables. Thus, group G1 (green colour) is identified with the highest-ranked variables. Group G2 (pink colour) with those of a moderate ranking. And finally, the group G3 (grey colour), with the lowest rated. Next, the frequencies of each variable were counted according to the feature selection method used, obtaining a summary of the importance of the independent variables (Table 8). The results indicate that the most important variables for IRR are, in the first place (G1), Lifetime, World PE, Log world PE, MSCI return, EE.UU. GPD, Germany interest, EE.UU. interest, and Industry focus. In second place (G2), Germany GDP, Diversification, and Assets manager. The least important ones (G3) are Fund size, Log fund size, and Geography focus. On the other hand, and for Cash Multiple, it is confirmed that the most important variables (G1) are Lifetime, World PE, Log world PE, MSCI return, Germany GDP, EE.UU. GPD, Germany interest, and EE.UU. interest. In second place (G2), Geography focus, Region fund manager, and Industry focus. And in third place (G3), Type of fund, Fund size, Log fund size, and Assets manager.

Table 6 Results feature selection for IRR.
Table 7 Results feature selection for Cash multiple.
Table 8 Variables frequency in the feature selection

Globally, considering the results obtained for IRR and Cash Multiple, the variables of the most significant importance for the analysis of PE return have turned out to be the set made up of Lifetime, World PE, Log world PE, MSCI return, EE.UU. GDP, Germany interest, and EE.UU. interest (Fig. 3).

Fig. 3
figure 3

Frequency of the most important variables

5.4 Discussion

The results of the present study show that the computational methods exceed the accuracy of previous studies on PE return using statistical methods. For example, Diller and Kaserer (2009), Aigner et al. (2008), and Achleitner et al. (2010) did not exceed 64.10% accuracy with European PE samples, and Manigart et al. (2002) and Jegadeesh et al. (2015), with samples from Europe and EE.UU., were only between 42.60 and 60.10%. Likewise, Franzoni et al. (2012) study, with samples from EE.UU., Europe, and Asia, only achieved a regression fit of 12.50%. However, our models on PE return have provided accuracy of 86.13% and 81.81% for IRR and Cash Multiple models, respectively. Specifically, C4.5 and RF have been the classifiers with the best classification fit. This greater precision of computational classifiers that traditional statistical classifiers have also been verified with data from our sample. The LOGIT classifier applied in the present study has provided the lowest level of precision. Therefore, computational classifiers offer greater accuracy in modelling the PE return factors with global samples from various world regions. The non-linear relationship of the variables that explain PE return is possibly better measured with computational methods because they do not require previous assumptions about the relationship between variables (Núñez de Castro & Von Zuben, 1998).

Our results provide very robust global models to identify PE return drivers. This represents a significant advance in the generalization of PE return models. However, significant differences in economic environment, regulation, business culture, and other factors between the five regions studied could provide different results when applying such global models with data from a single region (Morri et al., 2021). Perhaps the use of a worldwide sample is more directed towards a broad and complete vision of the trends and patterns that may be present worldwide, that is, when the objective is to obtain a more holistic and diverse understanding of the problem studied.

Regarding the most critical variables in the analysis, our results suggest that market factors are highly significant, specifically those related to interest rates (Germany interest and EE.UU. interest), with GDP growth (EE.UU. GPD), and market development (MSCI return, World PE, and Log world PE). These results are similar to those obtained by Phalippou and Gottschalg (2009) and Steger (2017) since they indicated that macroeconomic conditions affect PE return and that it is characterized by being pro-cyclical. However, our results differ from those obtained by Morri et al. (2021) and Diller and Kaserer (2009), who found that PE returns are unrelated to macroeconomic factors. Perhaps using more extensive databases in the present study has provided new information on the variables that affect PE return.

On the other hand, in our study, the variable that refers to the years of PE duration (Lifetime) has turned out to be highly significant, and this is in line with what was obtained by Aigner et al. (2008), as they found that duration was a variable directly related to PE return. In general, our results partially confirm the previous findings of Gohil and Vyas (2016) by verifying that PE returns are mainly affected by market factors. However, although Gohil and Vyas (2016), Roggi et al. (2019) and Morri et al. (2021) also pointed out the importance of variables related to the ability of PE managers, in our study, this point has not been relevant. Such is the case of variables such as Assets manager, Fund size, Log fund size, and Region fund manager.

Finally, our results confirm what Gohil and Vyas (2016) proposed by indicating that some variables related to the PE structure have not shown significance in explaining PE return either. These structural variables that have shown little importance in the analysis refer to the distribution of PE between regions and industrial sectors (Diversification, Geography focus, and Industry focus).

6 Conclusions and Implications

In the present study, a comparison of computational methodologies has been carried out. Consequently, new models have been developed that determine the PE return drivers at a global level with high precision. Therefore, the models built here can be used in all countries, achieving accuracy above 86%. These models have been built from a sample of 1,606 PE funds with a geographical focus on the world´s five regions. For this, different computational classifiers and feature selection methods have been applied, which have provided robustness in determining the PE return drivers. Specifically, the objective has been to improve the accuracy of the models developed in previous studies using different methodologies and increasing the sample size to all world regions. The results obtained in this study are significantly higher than those obtained in previous literature, with a precision range of 81.81–86.13% for selecting PE return drivers. These results suggest that market factors are of great importance in the selection of PE return drivers, especially the interest rate in Germany and EE.UU., GDP growth in the EE.UU., the performance of the MSCI World Performance Index, and the amount of money committed with PE around the world. In addition, the variable related to the PE duration has also been significant, including one of the drivers of PE structural characteristics. In short, the variables selected in this study constitute a unique set of PE return drivers to estimate the PE performance globally with high precision.

The conclusions obtained present important theoretical and practical implications. Unlike previous research, this study has expanded knowledge about PE return drivers beyond the experience with funds focused on Europe and EE.UU, offering a global analysis model. Our study suggests a unique set of significant explanatory variables to forecast PE return. This research also demonstrates that the C4.5 and RF classifiers are the most accurate for PE return analysis, thus contributing to existing knowledge in financial forecasting. These results can be used as a reference to establish better PE management decisión-making. In addition, it has also been possible to verify that the PE returns are linked to macroeconomic and market factors. PE benefits from a growing economy over the life of the fund. Therefore, investors should commit when the markets are weak and the economy begins to recover. Finally, contrary to what might be expected, we have yet to find any significant advantage in specialization for a PE manager. Industrial or regional specialization effects are only somewhat relevant in the constructed models. One explanation for this phenomenon could be that PE managers are often specialized within their organization. The managing entity of the fund is not a single person but a management team that, together, creates a well-established organization and good performance.

This study has limitations that suggest future research lines. Our models on PE return drivers have considered both specific variables of the PE funds and others referring to the market context. However, variables related to the countries' environmental policies have yet to be taken into account. Future research in this field could examine whether these new variables impact PE performance, as it seems particularly important considering that governments' capital investments related to sustainable technologies are a vital issue. Secondly, the present study has used a set of robust computational classifiers to measure the models´ precision. Still, other computational methods have not been considered. This is the case, for example, of quantum computing methods, which recently also show an interesting trend in finance. Therefore, future studies could evaluate PE return models with these new computational techniques. Finally, only three variables on the characteristics of the fund manager have been considered in the present study. Further research could shed more light on the skill factors that promote the return of PE.