1 Introduction

Fifteen years after its publication, this is perhaps a good time to reflect on the influence of our paper “The Log of Gravity” (Santos Silva and Tenreyro 2006).Footnote 1 In that paper we challenged the long-established practice of estimating constant-elasticity models in their log-linearized form, and proposed as an alternative the use of an estimator that conveniently coincides with the Poisson pseudo maximum likelihood (PPML) estimator of Gourieroux et al. (1984). Building on early contributions of Goldberger (1968), Papke and Wooldridge (1996), and Manning and Mullahy (2001), in Santos Silva and Tenreyro (2006) we presented a clear explanation of why the estimation of log-linearized models could lead to misleading results, provided an unequivocal recommendation for the use of the PPML estimator, and clearly illustrated the advantages of this estimator. In our view, the simple message of the paper and the clarity and relevance of the examples we provided, were the key factors for its popularity.Footnote 2

In this paper, we consider the reasons for the impact of “The Log of Gravity” and summarize some of the developments that contributed to its enduring relevance. In doing this, we provide a brief review of the state-of-the-art in the estimation of the gravity equation for trade, which may be useful to the less experienced researchers. Many of the methods and developments we discuss are also relevant for the estimation of constant-elasticity (multiplicative) models for other kinds of data, and we also refer to some of these applications.

The remainder of the paper is organized as follows. Section 2 briefly presents the problem with the traditional least squares estimator of gravity equations and Section 3 discusses several aspects of the PPML estimator. Section 4 discusses specification tests for gravity models and Section 5 reviews the simulations and the results of the empirical application we presented in Santos Silva and Tenreyro (2006). In Section 6 we provide examples of the use of the PPML estimator in different fields and, finally, Section 7 contains some brief concluding remarks.

2 The problem

Following Goldberger (1991, p. 5), in Santos Silva and Tenreyro (2006) we interpret non-stochastic economic models such as the gravity equation as the conditional expectation of the variable of interest.Footnote 3 That is, if economic theory suggests that the non-negative variable y and the vector of explanatory variables x are linked by a constant-elasticity model of the form

$$ y=\exp \left( x\beta \right) , $$
(1)

the function \(\exp \left (x\beta \right ) \) is interpreted as the conditional expectation of y given x, denoted \(E\left [ y|x\right ] \), where the vector of (semi) elasticities β is the object of interest. An example of a model of this kind is the gravity equation for trade which, in its simplest form, can be written as

$$ \begin{array}{@{}rcl@{}} T &=&\beta_{0}Y^{\beta_{1}}D^{\beta_{2}} \end{array} $$
(2)
$$ \begin{array}{@{}rcl@{}} &=&\exp \left[ \ln \left( \beta_{0}\right) +\beta_{1}\ln \left( Y\right) +\beta_{2}\ln \left( D\right) \right] , \end{array} $$
(3)

where T denotes the trade flow from an origin to a destination, Y is a measure of the size of the trading partners, D represents the distance between the partners, and β0, β1 and β2 are unknown parameters.

All econometrics textbooks that we are aware of suggest that the parameters in models such as Eq. 1 can be estimated by the least squares regression of \(\ln \left (y\right ) \) on x. However, this approach may be inappropriate for two reasons. An obvious problem, and our initial motivation to consider alternative estimators, is that this approach is infeasible if y is zero for some observations. The more serious problem is that, due to Jensen’s inequality, the least squares regression of \(\ln \left (y\right ) \) on x is generally an inconsistent estimator for the parameters of \(E\left [ y|x\right ] =\exp \left (x\beta \right ) \).Footnote 4

The key insight to understand why the regression in logs is not generally valid is that, although we can go from Eq. 1 to its logarithmic form, and vice-versa, the same is not true for their stochastic counterparts. Indeed, because economic models do not hold exactly, estimation has to be performed using stochastic versions of the equations suggested by economic theory, and that is where Jensen’s inequality becomes important.

The stochastic counterpart of Eq. 1 can be written as

$$ y=\exp \left( x\beta \right) +\varepsilon =\exp \left( x\beta \right) \eta \text{,} $$
(4)

where ε is an additive error term such that \(E\left [ \varepsilon |x\right ] =0\), and \(\eta =1+\left . \varepsilon \right / \exp \left (x\beta \right ) \) is a multiplicative error term with \(E\left [ \eta |x \right ] =1\).Footnote 5 Ignoring for the moment that y can be equal to 0, the model can be made linear in the parameters by taking logarithms of both sides of the equation, leading to

$$ \ln \left( y\right) =x\beta +\ln \left( \eta \right) \text{.} $$
(5)

In Eq. 5, the least squares estimator is consistent for β if \(\ln \left (\eta \right ) \) is uncorrelated with x, but since \(\eta =1+\left . \varepsilon \right / \exp \left (x\beta \right ) \), that condition will be met only under very restrictive conditions on the distributions of ε, and therefore the least squares estimator of the regression defined by Eq. 5 is generally inconsistent for β.Footnote 6 In the next section we consider alternative approaches to estimate β and explain why PPML should be preferred.

3 The PPML estimator

At first sight, the natural approach to estimate \(E\left [ y|x\right ] =\exp \left (x\beta \right ) \) without transforming the model would be to use non-linear least squares, as done by Frankel and Wei (1993). The problem with this approach is that, as we noted in Santos Silva and Tenreyro (2006), it is based on moment conditions of the form

$$ E\left[ \exp \left( x\beta \right) \left( y-\exp \left( x\beta \right) \right) x\right] =0, $$

which give more weight to the observations with larger variance, and therefore can be inefficient to the point of being useless in empirical applications. This problem has been documented in several simulation studies; see, e.g., Manning and Mullahy (2001) and Santos Silva and Tenreyro (2006, 2011a).

The alternative we proposed in Santos Silva and Tenreyro (2006) is to base the estimator on moment conditions of the form

$$ E\left[ \left( y-\exp \left( x\beta \right) \right) x\right] =0, $$
(6)

which give the same weight to all observations. As will be discussed below, besides being intuitively appealing, this estimator has several other properties that make it particularly attractive in this context.

One of the advantages of the estimator based on Eq. 6 is that it coincides with the Poisson regression estimator and therefore most statistical softwares have commands that make its use very simple. Of course, because in trade-data applications y certainly does not follow a Poisson distribution, this is a pseudo maximum likelihood estimator (see Gourieroux et al. 1984), and a suitably robust estimator of the standard errors should be used.

3.1 Why not use other estimators for count data

The fact that in Santos Silva and Tenreyro (2006) we recommended that gravity equations should be estimated using a method designed for count data generated some misunderstandings.

In count data models, researchers are often interested in estimating the conditional probability of some event, such as \(\Pr \left (y=k|x\right ) \), where k is some non-negative integer. To obtain a consistent estimator of this probability we need to correctly specify the conditional distribution of y, and the Poisson distribution is often seen as too restrictive for this purpose. Therefore, alternative methods based on different distributions have been proposed to estimate count data models, and many of these approaches are more flexible than the basic Poisson regression. This has led some authors to advocate that these estimators would also out-perform the PPML estimator when the objective is to estimate gravity equations. As we explain below, this is wrong.

The first thing to note is that when estimating a gravity equation we want to have an estimator of \(E\left [ y|x\right ] =\exp \left (x\beta \right ) \) that is valid under very mild assumptions, and we do not need to estimate quantities such as \(\Pr \left (y=k|x\right ) \). Therefore, estimators of β whose consistency depends on incidental distributional assumptions are not as attractive as the PPML estimator, whose consistency depends only on the validity of the assumption that \(E\left [ y|x\right ] =\exp \left (x\beta \right ) \); i.e., that the gravity equation is correctly specified. Footnote 7 Therefore, estimating gravity equations using, for example, the estimator for zero-inflated count data models introduced by Mullahy (1986) is not attractive in this context because the validity of the estimator would depend on very strong assumptions about the distribution of the data.

Another aspect to note is that, in the context of count data models, most of the alternatives to Poisson regression allow for the so-called overdispersion (see, e.g., Cameron and Trivedi, 2013). However, overdispersion is not defined when the dependent variable does not have a natural scale. Indeed, when the dependent variable can be measured in different units, the relation between the conditional mean and the conditional variance will depend on the scale of the data. This implies that estimates obtained using models that allow for overdispersion are sensitive to the scale of the dependent variable and to the units in which it is measured, and therefore are arbitrary. This problem was noted by Bosquet and Boulhol (2014) for the case of the negative binomial estimator, but it affects all estimators that try to accommodate overdispersion, such as the zero-inflated models whose use has been recommended by some authors.

3.2 PPML, fixed effects, and the incidental parameter problem

Since the seminal work of Anderson and van Wincoop (2003), it has become standard to estimate gravity equations accounting for multilateral resistance by including a dummy for each origin and a dummy for each destination, the so-called origin and destination fixed effects (see also Hummels, 1999). In this case, the number of parameters to estimate depends on the number of countries included in the sample, and therefore we need to account for the incidental parameter problem because, in general, it is not possible to obtain consistent estimators for models in which the number of parameters depends on the sample size (see, e.g., Lancaster, 2000).

It is well known that PPML does not suffer from the incidental parameter problem in the traditional panel data case where a single fixed effect is included; see Wooldridge (1999). Because that result does not cover models with two sets of fixed effects, some authors have claimed that PPML suffers from the incidental parameter problem when the model includes origin and destination fixed effects. That claim is, however, incorrect. Indeed, Fernández-Val and Weidner (2016, p. 301) have shown that PPML is immune to the incidental parameter problem in models with two sets of fixed effects, as long as the sizes of the two sets of fixed effects grow at the same rate and the regressors are strictly exogenous or predetermined.

Although PPML is consistent in the two-way gravity model, the usual estimator of the covariance matrix accounting for clustering is invalid due to the incidental parameter problem (see, e.g., Egger and Staub (2015), and Jochmans (2017)). Weidner and Zylkin (2020) provide a solution to this problem.Footnote 8

Following the suggestion of Baier and Bergstrand (2007), researchers sometimes use panel data to estimate three-way gravity models that include origin-time and destination-time fixed effects, as well as pair-fixed effects. The consistency of the PPML estimator in this context does not follow from the results of Fernández-Val and Weidner (2016), but Weidner and Zylkin (2020) have recently shown that PPML is still consistent in this context. Remarkably, Weidner and Zylkin (2020) also show that PPML is the only member of a family of pseudo maximum likelihood estimators that has this property. Weidner and Zylkin (2020) also show that standard significance tests and confidence intervals are invalid in three-way gravity models because the asymptotic bias and the asymptotic standard deviation of the estimator vanish at the same rate. Weidner and Zylkin (2020) propose solutions for this problem and we refer the interested reader to their paper for more details.

Baier and Bergstrand (2007) motivation to introduce a third set of fixed effects is to control for the possible endogeneity of free trade agreements. An alternative way to address this issue would be to use instrumental variable methods, but it is difficult to find convincing instruments that can be used in this context. Additionally, the estimation of gravity models with endogenous regressors is challenging because the instrumental variables counterparts of the PPML estimator (Mullahy 1997; Windmeijer and Santos Silva 1997) suffer from the incidental parameter problem and therefore cannot be used to estimate models that include fixed effects. However, Jochmans’s (2017) estimator can be used in this context because it partials-out the origin and destination fixed effects, and therefore does not suffer from the incidental parameter problem. Jochmans and Verardi (2019) present a Stata command that implements Jochmans’s (2017) instrumental variables estimator for the case of gravity equations with two-way fixed effects estimated with cross-sectional data.

3.3 Computational aspects

One of the advantages of the PPML estimator is that its objective function is globally concave and therefore it has at most one maximum; the problem is that there are cases where the pseudo loglikelihood function does not have a maximum, and therefore the estimates do not exist. Heuristically, this problem is caused by the presence of regressors that perfectly predict some of the observations for which the dependent variable is zero, implying that the maximum likelihood estimator of their coefficients goes to (minus) infinity.

It is well known that the presence of perfect predictors can lead to the non-existence of the maximum likelihood estimates for binary choice models such as the logit (see, e.g., Albert and Anderson, 1984), but it is much less known that such problem also affects the PPML and other estimators such as the Tobit.

In Santos Silva and Tenreyro (2010), we described the issue and provided a simple method to detect and solve this problem. Subsequently, in Santos Silva and Tenreyro (2011b) we described other numerical issues that can lead to convergence problems and introduced the ppml Stata command, which implements the methods discussed in Santos Silva and Tenreyro (2010).

More recently, Correia et al. (2019) revisited the problem and, building on much earlier contributions by Verbeek (1989, 1992) and Wedderburn (1976), presented a refined version of the algorithm to detect the non-existence of the PPML estimates. This method, and the associated solution to the problem of non-existence, are implemented in their ppmlhdfe Stata command (Correia et al. 2020). In practice, both ppml and ppmlhdfe effectively deal with the non-existence problem and therefore nowadays this is not a serious issue in empirical applications.

An interesting result in Correia et al.’s (2019) paper is that Poisson regression is rather special in that the solution to the non-existence of the estimates is simpler in that case than in related estimators such as the gamma and inverse Gaussian pseudo maximum likelihood estimators. This, therefore, is another reason to prefer PPML to other generalized linear models for non-negative data.

The non-existence of the PPML estimates is particularly likely to occur in models with a large number of dummy variables, such as models with origin and destination fixed effects. Although the non-existence in itself is not problematic, estimation of these models is challenging due to the sheer number of parameters that have to be estimated. Correia et al. (2020) address this issue in their ppmlhdfe Stata command. Combining earlier results by Guimarães and Portugal (2010) with the Frisch-Waugh-Lovell theorem, Correia et al. (2020) develop an algorithm that greatly simplifies the estimation by PPML of models with multiple sets of fixed effects.Footnote 9

3.4 PPML and structural gravity

The gravity equation provides a reliable way to describe trade flows and to evaluate the partial equilibrium effects of trade policies. To go beyond the partial equilibrium analysis, which ignores the effect of trade policies on third-party counties, we need structural gravity models that take into account the general equilibrium effects of trade policies.

Anderson and van Wincoop (2003) introduced a structural gravity model that permits the general equilibrium analysis of trade policies by considering their effects through multilateral resistance channels. Anderson and van Wincoop (2003) estimate their structural gravity model using a non-linear method, but notice that an alternative is simply to include origin and destination fixed effects in a standard gravity equation, as done by Hummels (1999). However, in general, there is no guarantee that the estimated fixed effects are consistent with the definition of the multilateral resistance indexes and with the equilibrium conditions that they must verify. Remarkably, Fally (2015) has demonstrated that under reasonable assumptions the estimated fixed effects automatically satisfy these conditions when the gravity equation is estimated by PPML, and therefore the multilateral resistance indexes can be recovered from the estimated fixed effects. Moreover, Fally (2015) also shows that PPML is the only pseudo maximum likelihood estimator with this property.

Building on Fally’s (2015) results, Anderson et al. (2018) propose a method to compute general equilibrium effects of trade policies based on a structural gravity model and on the properties of the PPML estimator; see also Yotov et al. (2016).

4 Specification tests

In general, constant-elasticity models can be estimated consistently using any of the pseudo maximum likelihood estimators introduced by Gourieroux et al. (1984). Because all these estimators are consistent under the same mild set of conditions, researchers may use specification tests to choose the best estimator in this family; i.e., the more efficient pseudo maximum likelihood estimator. Manning and Mullahy (2001) suggested that the traditional Park (1966) test could be used for this purpose, but in Santos Silva and Tenreyro (2006) we noted that the test is generally invalid in this context and proposed alternative approaches. However, as we explain below, both the Park (1966) test suggested by Manning and Mullahy (2001) and the tests we suggested in Santos Silva and Tenreyro (2006) are of little use when estimating gravity equations.

As noted in the previous section, the PPML estimator is the only pseudo maximum likelihood estimator for gravity equations that is valid under very mild assumptions, that is valid in models with high-dimensional fixed effects, that is not adversely affected by the possible non-existence of the estimates, and whose results are compatible with structural gravity models. Therefore, there is not really much choice when it comes to selecting a pseudo maximum likelihood estimator for a gravity equation, and the PPML is the only credible option. In other words, PPML is efficient in the class of pseudo maximum likelihood estimators that are valid in models with fixed effects and are compatible with structural gravity models. Therefore, tests to check the relation between the conditional mean and the conditional variance, such as those proposed in Manning and Mullahy (2001) and Santos Silva and Tenreyro (2006), are redundant when the purpose is to estimate gravity equations, and they serve no purpose in this context.Footnote 10

In Santos Silva and Tenreyro (2006) we also used a version of Ramsey’s (1969) RESET test to check the specification of the models. Although often misinterpreted as a test for omitted variables, the RESET is a very useful general misspecification test and it can be useful to check the specification of gravity equations (not to choose the estimation method). One thing to keep in mind when performing a RESET-type test in models with fixed effects is that some of the fixed effects may be estimated with a very small number of observations and therefore their estimates will be very noisy. In this case, the fitted values of the linear index whose powers are used in the test should not include the estimates of the fixed effects.

The standard formulation of the gravity equation has been extremely successful in practice and has solid theoretical underpinnings (see, e.g., Anderson and van Wincoop (2003), and the references therein). This standard formulation is a single-index model in which zero and positive observations of trade are treated in the same way. However, authors such as Helpman et al. (2008) have suggested double-index trade models that separate the extensive-margin decision to export from the intensive-margin decision of how much to export.Footnote 11 In other areas (e.g., health economics) it is also often the case that researchers have to choose between single- and double-index models, and therefore it is interesting to have a method to choose between these competing specifications for models for non-negative data.

Because the standard gravity equation and most single-index models can be estimated by PPML, which does not require the correct specification of the likelihood function, the choice between single- and double-index models for trade cannot be based on information criteria because these are likelihood based.Footnote 12 Likewise, the vast majority of tests for non-nested hypotheses also cannot be used for this purpose because they are also likelihood based. However, in Santos Silva et al. (2015) we developed a simple test that can be used for this purpose. The test has not been widely used and that probably reflects the fact that most researchers are comfortable with the traditional gravity equation and do not consider double-index alternatives.

5 Simulations and application

In Santos Silva and Tenreyro (2006) we provided overwhelming simulation evidence that the traditional approach of estimating gravity equations using the least squares regression of \(\ln \left (y\right ) \) on x could lead to very misleading results, and that PPML is generally very well behaved, even when it is not the optimal estimator. However, the dependent variable in the main simulation design considered by Santos Silva and Tenreyro (2006) is strictly positive. The fact that the dependent variable did not include zeros led several researchers to question the validity of our results, and to unfounded claims that PPML performed poorly in situations where the dependent variable has many zeros.

The reason why we used a strictly positive dependent variable in our main simulations is simple: at the time we did not know how to generate non-negative data with zeros and with an exponential conditional expectation.Footnote 13 We solved this problem in Santos Silva and Tenreyro (2011a) by introducing an attractive data generating process in which the dependent variable can have an arbitrarily-high proportion of zeros and has an exponential expectation.Footnote 14 The simulation results presented in Santos Silva and Tenreyro (2011a) confirmed that the performance of PPML is very strong even in the presence of a very high percentage of zeros and, together with the theoretical properties of the PPML estimator established by Gourieroux et al. (1984), should be enough to convince even the more skeptic that the fact that the dependent variable can have a high proportion of zeros does not affect the performance of the PPML estimator.Footnote 15

The empirical illustration we presented in Santos Silva and Tenreyro (2006) confirmed that the results obtained with the PPML estimator were substantively different from those obtained with the traditional method and with other methods that are difficult to justify in this context, such as estimators based on the Tobit and estimators based on adding an arbitrary constant to the dependent variable before taking logs.Footnote 16

The application also provided an unexpected result that was later confirmed by many other authors: the PPML estimates change very little if the estimation is performed excluding the observations for which the dependent variable is zero. With the benefit of hindsight, we were able to explain why dropping the zeros has little impact on the PPML estimates: observations where the conditional mean is close to zero have low variance and therefore the residuals are close to zero for observations for which the value of trade is small or zero. This implies that observations for which the dependent variable is equal to zero have a very small contribution to the value of the pseudo loglikelihood function, and therefore contribute little to the estimation results. Therefore, what was our initial motivation for using PPML turned out not to be particularly important, but the problems caused by disregarding the implications of Jensen’s inequality were more serious than we anticipated.Footnote 17

6 The PPML estimator in other contexts

The suggestion that the well-established practice of estimating elasticities using log-linear regressions could lead to misleading results was initially met with skepticism;Footnote 18 even the referees noted that they “were unconvinced by the practical importance of the issue.” However, the importance of the problem has gradually been recognized and PPML is now widely used for the estimation of gravity equations for trade.

However, as we noted in Santos Silva and Tenreyro (2006), the PPML estimator can be used in a broad range of economic applications where the equations under study are traditionally estimated in their log-linearized form, and PPML is now also gaining acceptance in many other areas. Although it is not possible to provide here a comprehensive review of all the applications that have used the PPML estimator, in this section we refer some interesting examples of its use to estimate gravity equations and other models.

Gravity equations are frequently used in the study of migration flows (see, e.g., Beine et al., 2016). In these studies, the dependent variable is often (but not always) a count, and therefore the use of Poisson regression in this context is even more natural. Indeed, the use of this method was suggested by Flowerdew and Aitkin (1982), but at that time the attractive properties of Poisson regression were not yet known, and therefore this work did not have much impact. More recent work (e.g., Beine and Parsons, 2015) use PPML to estimate models that include a number of fixed effects and where the dependent variable is not a count, very much like in the trade literature.

The study of foreign direct investment (FDI) also relies heavily on the gravity equation, and PPML is now often used in this context (see, e.g., Head and Ries (2008), for an early example). Here, however, there is a possible complication: net FDI flows can be negative. The fact that some observations are negative does not imply that the gravity equation is inadequate and that the PPML estimator should not be applied. Indeed, all that is needed for the validity of the PPML estimator in this context is that the conditional expectation of the net flows is given by the gravity equation, and therefore is always non-negative. If that is the case, the PPML estimator continues to be appropriate even if some net FDI flows are negative.Footnote 19

Going beyond the estimation of gravity equations, and reflecting the influence of the pioneering work of Manning and Mullahy (2001), we find many examples of PPML estimation in health economics. For example, Kaiser et al. (2014) use PPML to evaluate the impact of a reform on the retail price of drugs, and Powell and Seabury (2018) use PPML to estimate models for medical expenditures. Models for other kinds of expenditures have also been estimated by PPML. For example, Fisher (2016) uses PPML to estimate models for household expenditures, and Jeong and Siegel (2018) use PPML to estimate models for briberies paid by businesses.

Another early use of the PPML estimator outside of the trade literature relates to the estimation of wage equations, an area we explicitly mentioned in Santos Silva and Tenreyro (2006). Blackburn (2007) estimates wage equations in levels using several pseudo maximum likelihood estimators, including PPML. More recently, Petersen (2017) and Powell and Seabury (2018) also estimate equations for earnings by PPML.

The Cobb–Douglas production function is one of the best-known constant-elasticity models and therefore it is not surprisingly that one of the first uses of PPML outside of the trade literature involved the estimation of production functions. Building on Santos Silva and Tenreyro’s (2006), who explicitly mentioned that this is a context in which PPML could be useful, Sun et al. (2011) advocated the estimation of production functions in levels and used PPML in their application. More recently, Dias and Marques (2021) showed that estimates of productivity dynamics based on firm-level data depend on whether logs or levels are used, and argue in favour of using data in levels when the analysis is based on weighted measures of productivity.

More generally, PPML has been employed to estimate models for durations (Abboud et al. 2016 and Call et al. 2018), investment in R&D (Cowan et al. 2015, and Guceri and Liu 2015), debt (Oksanen et al., 2015 and Lee and Mori 2021), losses and returns (Levieuge et al. (2021), and Paniagua et al. (2018)), value of mergers and acquisitions (Todtenhaupt et al. 2020), values of illicit drug sales (Nurmi et al. 2017), wind power capacity (Goetzke and Rave 2016), and to estimate models evaluating the effects of wild fires (Eskelson et al. 2016 and Peterson et al. 2019).

Finally, we note that PPML is also becoming important is the study of intergenerational income mobility. Mitnik and Grusky (2020) make a strong case for the use of PPML in the estimation of models of intergenerational mobility and show that its use makes a material difference; Helsø (2021) also uses PPML in this context.

7 Concluding remarks

The PPML estimator is extraordinarily well suited for the estimation of gravity equations. That was the point we made in Santos Silva and Tenreyro (2006) and, thanks to the follow-up work done by us and many others, that result is today even clearer and widely accepted. Indeed, in the vast majority of cases, there is no reason at all to consider alternative estimators for gravity equations because no other estimator shares all the attractive features of PPML that we discussed in Section 3.

Some years ago, the use of the PPML estimator could be challenging because of computational issues. Indeed, some authors even state that they do not report PPML estimates because of the computational challenges they faced. However, the introduction of the ppmlhdfe Stata command by Correia et al. (2020) made it very easy to estimate even complex gravity equations using very large panels. This command represents the state-of-the-art and essentially removed the final obstacles to the generalized used of the PPML estimator.

We often see papers that present results of the estimation of gravity equations using a potpourri of methods, and some authors go as far as recommend that practice. We do not see what can be gained by complementing the PPML estimates of gravity equations with those obtained by methods that are almost certainly invalid, and suggest that it is better to spend research time making sure that the model is correctly specified and can be used to answer the question of interest.

We conclude with a small anecdote that the readers may find interesting. “The Log of Gravity” started when one of the authors serendipitously emailed the other asking for a copy of a ten-year old working paper (Santos Silva 1991); we did not know each other when we started to work on our paper, our collaboration was entirely done by fax and by email (which was challenging because of the different time zones), and we only met when the paper had already been accepted for publication. No matter how much we plan and how hard we work, luck will always play a big part in our lives and careers, and we have been more fortunate than most.