1 Introduction

In a vast part of the geopolitical context, there is an increasing demand of production systems that reduce the environmental impact and there exist many public finance instruments, decided by national and international authorities, pushing the so-called transition to a green economy, such as the financial recovery and resilience plan at the European level for the post-pandemic restart phase.

It is clear that readjusting older generation productions to the new pollution regulations can lead in the short term to profitability reductions for the companies that implement them, even though it is possible to assume increases in profitability over medium-long time horizons. In a recent paper (Darvas and Wolff 2021) by Zsolt Darvas and Guntram Wolff presented at ECOFIN 2021, the authors highlight the importance of an international tax pact, which supports the transition to a green economy.

Analogously a European Commission publication of June 2021 (Incentives to boost the circular economy—a guide for public authorities) argues that the optimal incentives to increase the weight of the green economy are linked to strategies related to the tax system, which together with direct funding are the most common policy instruments to drive sustainable development.

In Kozhevina (2015) and Aleksejeva (2016), the role of the transformation of industrial production is underlined as a crucial factor for the achievement of a really whole green economy.

Therefore, a dynamical model addressing the problem of the transition to a greener economy, that is to an economy with a lower level of pollution than the initial one, can investigate the efficiency of public incentive policies as far as they can act on the assumed gap between profitability of a classic and a lower environmental impact production system. The use of incentive policies (and/or penalties for those who do not proceed in this direction) for the transition to a greener economy, considering aspects related to competition, which can make it not convenient to convert one’s production system if competitors do not, must be such as to achieve virtuous balances of the production system, that is high profitability associated with low levels of pollution.

The work is organized as follows. The second section proposes a review of the literature, recalling the different approaches to the economic-financial evaluation of green production strategies compared to non-green ones. The third section is dedicated to the presentation of our model. In the concluding section, we propose comments and hints at possible future research.

2 Literature review

In the literature, there exist multiple quantitative approaches to the topic of green economy analysis compared to classic profitability maximization approach.

An existing field of literature has faced the problem of achieving the objectives of the successive protocols for driving a transition to less polluting production systems (Kyoto 1997, Paris 2015, Sharm el-Sheikh 2022) from the point of view of game theory, using both cooperative and non-cooperative games (see for example Wood 2011; Ciano et al. 2021).

One approach concerns the possibility of investing in financial instruments supporting the green economy, which can be reconducted to the theory of portfolio, as it offers opportunities for diversification with respect to classic financial instruments. In Cheema-Fox et al. (2021), the authors show that “decarbonizing” a portfolio of US and European-listed equities generates better returns compared to not-decarbonized portfolios in the period 2009–2018.

In Habel and Scholz (2020), the yield differential in terms of share price between companies with and without ESG purposes is analyzed, showing an over performance of no-ESG-driven companies.

In Hang et al. (2019), the crucial importance of time horizon for evaluating the profitability of companies investing in a green direction is developed, highlighting how in the short term the “green conversion” is not convenient, while it can become so over longer time horizons.

Another approach is the classic econometric one, which analyzes the effect of different parameters of multivariate models, influencing the measurement of the level of pollution for production systems with different propensity to environmental protection. For example, in Gawrycka and Szymczak (2021) the impact of the green transition on the Gross Domestic Product of several national economies is studied.

Optimal control models have been also considered, with control variables related to technologies of production systems and green economy public incentive policies: see, for example, Tan et al. (2021) and the explanatory flowchart of the production cycle, object of these authors’ investigation.

Moreover, green economy can be considered as playing a role in mitigating environmental damages, especially the impact of climate change on economic system and human society. The magnitude of the climate change problem is emphasized by several analyses and forecasts, and specific plans involving financial support have been discussed.

Finally, in recent years, many scholars have studied the relationship between environmental regulation and enterprise technological innovation using evolutionary game models, which involve mainly economic incentives and fiscal strategies (see Suyong et al. 2019; Zhang and Li 2018; Liang and Weijun 2020).

3 A model of investment return and pollution

This section is devoted to describing our dynamical model and how its equilibria can depend on different public incentives policies for green economy.

We consider two variables:

  • x is the current total return of a given industrial sector, measured by a normalized numerical scale, with minimum value 0;

  • p is the current total pollution produced by a given industrial sector, measured by a normalized numerical scale, with minimum value 0.

Hence, we set \(p=0\) to indicate the minimal polluting contribution attainable by that sector.

We will consider a continuous time dynamics. On one hand, that allows to transform any time horizon T into infinity, through the change of the time variable \(t^{\prime }=\frac{t}{T-t}\) (see Hartman 1982). On the other hand, we can assume the adjustments of x and p to be instantaneous, which can be justified by a high frequency data pattern of observations.

The dynamical system is defined in the non-negative quadrant \(\overline{Q} =\left\{ x,p\ge 0\right\} \), while we pose \(Q=\left\{ x,p>0\right\} \). We assume x to increase when x and p stay below certain thresholds, whereas p increases if x and p are sufficiently high (in particular, when x is high, we can think of a flux of investments in the sector which, with the given technology, will make its contribution to pollution grow). However, we assume that, when p is possibly high but x is very low, the investments in that sector will fall, thus leading to a lower pollution.

Translating the above considerations into a formal dynamical system, we require, first of all, \(x\ge 0\) and \(p\ge 0\). Then, the condition that x increases when both x and p stay under a certain threshold is formalized by requiring that a suitable linear combination of x and p is lower than a quantity h.

Next, the evolution of the pollution p is considered. Here we assume that p increases when a combination of x and px is higher than a quantity k: in fact, as we observed, one can argue that when x is very low the investment in the sector falls, leading, therefore, to a decrease in the pollution produced. Finally, we propose an economic interpretation of all the coefficients involved. Hence, the system we choose as a description of the evolution of x and p is the following:

$$\begin{aligned} \left\{ \begin{array}{l} \overset{\cdot }{x}=x\left( -ax-bp+h\right) \\ \overset{\cdot }{p}=p\left( cx+dpx-k\right) \end{array} \right. \end{aligned}$$
(1)

where the parameters \(a,b,c,d,h,k>0\) depend both on technical (industrial and financial) and political (e.g., fiscal) factors. In fact, we can interpret the system parameters as follows.

For the first equation, e.g., return dynamics, we observe that:

  • a denotes the rate of contribution to return decreasing, in the productive sector, of the current level of return: the higher is a, the more competition contributes to return decreasing.

  • b can be considered the unitary penalty for the pollution generated by the sector: the higher is the pollution, the more the return decreases.

  • h is the upper limit (when \(x,p\rightarrow 0\)) of the return increasing rate.

    For the second equation, pollution dynamics, we observe that:

  • c measures the contribution to pollution increasing due to the profitability of the sector: the higher is the return, the more numerous will be the firms operating in the sector, making the pollution increase.

  • d measures the contribution to pollution increasing due to the combined effect of the current levels of return and pollution (the effect of such a combination is small if either x or p is sufficiently small).

  • k is the upper limit (when \(x,p\rightarrow 0\)) of the pollution decreasing rate.

As we have said, the technology is assumed not to change in any considered period of time (i.e., in \(\left( 0,+\infty \right) \) in a continuous model).

The first observation concerns the equilibria (or fixed points) of the system, which are of two types:

  • boundary equilibria, lying on \(\left\{ x=0\right\} \cup \left\{ p=0\right\} \)

  • internal equilibria, lying in the positive quadrant \(Q=\left\{ x,p>0\right\} \).

The former are two: \(E_{0}=\left( 0,0\right) \) and \(E_{1}=\left( \frac{h}{a},0\right) \). It is easily checked that \(E_{0}\) is a saddle, while \(E_{1}\) can be either a saddle or a sink (i.e., an attractor). On the other hand, the internal equilibria can be one, zero or two. In fact, by suitable choices of the parameters, several dynamical scenarios can occur.

3.1 Scenario 1

Let us consider the condition \(\frac{k}{c}<\frac{h}{a}\), where

  • \(\frac{k}{c}\) can be interpreted as a ratio between technical improvements for pollution reduction and the return appetite due to the current level of return, which implies more pollution;

  • \(\frac{h}{a}\) can be interpreted as a ratio between cost rationalization for increasing returns and economic competition in the sector, which implies return reduction.

So, the condition can be interpreted in this way: effects on returns are “more reactive” than ones on pollution.

Theorem 1

If \(\frac{k}{c}<\frac{h}{a}\), system 1 exhibits exactly one internal equilibrium, \(E_{2}=\left( x_{2},p_{2}\right) \), \(x_{2}<x_{1},0<p_{2}<\frac{h}{b}\). Precisely, \(E_{1}\) is a saddle, while \(E_{2}\) is an attractor (sink) if \(dp_{2}<a\), or is a repellor (source) surrounded by an attracting periodic orbit (limit cycle) if \(dp_{2}>a\). In any case, all the internal orbits (that is, those starting in the positive quadrant Q) converge, as \(t\rightarrow +\infty \), to the internal attractor (equilibrium or limit cycle).Footnote 1

Proof

See “Appendix A”.

Figure 1a provides a graphical representation of the isoclines of the system for Scenario 1, while the next figures illustrate the convergence to \(E_{2}\) or to a limit cycle around it.

In particular, it follows from the proof of Theorem 1 that a globally attracting limit cycle appears when, fixed abcd, \(h\in (\frac{b }{d}a,\frac{b}{d}(a+c))\) and \(k<(\frac{h}{a}-\frac{b}{d}(a+c))\).

Fig. 1
figure 1

a: Scenario 1 isoclines b: Convergence to the poverty trap from starting point (h/a, 0.1) c: Convergence to the poverty trap from starting point (h/a, 0.1) d Convergence to a limit cycle around the poverty trap

We consider the following values of the parameters

\(a=0.092, b=0.39, h=0.08, c=0.09, d=0.4, k=0.49\)

where \(\frac{k}{c}=0.05444<0.8695=\frac{h}{a}\)

Then we can take an orbit from any point of the half-line \((\frac{h}{a},\cdot )\) with \(\cdot >0\), for instance \((\frac{h}{a},0.1)\) and \((\frac{h}{a},0.8)\), generating, respectively, the orbits of Fig. 1b, c, converging to the attractor \(E_{2}\)

Vice-versa, if we vary the parameter h, so as to remain in Scenario 1 but with the condition \(dp_{2}>a\), e.g., setting

\(a=0.092, b=0.39, h=0.2, c=0.09, d=0.4, k=0.49\),

an orbit starting from any point of the half-line \((\frac{h}{a},\cdot )\) with \(\cdot >0\), for instance from \((\frac{h}{a},0.1)\), converges to a limit cycle around \(E_{2}\), as shown by Fig. 1d.\(\square \)

3.2 Scenario 2

Assume \(\frac{k}{c}>\frac{h}{a}\) holds, which is the opposite condition with respect to Scenario 1, meaning that effects on returns are “less reactive” than ones on pollution.

Assume, moreover, that the curves (isoclines)

$$\begin{aligned} \gamma _{1},~p=\frac{a}{b}\left( \frac{h}{a}-x\right) \hbox { and } \gamma _{2},~p=\frac{k-cx}{dx} \end{aligned}$$

do not intersect in Q. Then the following theorem holds.

Theorem 2

Under the above assumptions, no internal equilibrium exists. Then \(E_{1}\) is an attractor and all the orbits starting in Q converge to \(E_{1}\).

Proof

By writing the Jacobian matrix \(J_{1}\) at \(E_{1} \), \(E_{1}\) is checked to be an attractor. Moreover, by Poincaré–Bendixson theorem (Hartman 1982), \(E_{1}\) is the only attractor in \( \overline{Q}\). Then, following the vector field \(\left( \overset{\cdot }{x}, \overset{\cdot }{p}\right) \), it is easily seen that all the orbits of Q converge to \(E_{1}\). \(\square \)

Figure 2a provides a graphical representation of the isoclines of the system for Scenario 2, while Fig. 2b, c shows the convergence of the internal orbits to the virtuous equilibrium \(E_{1}\) respectively, from (h/a, 0.1) and (h/a, 0,8).

Fig. 2
figure 2

a: Scenario 2 isoclines b: Convergence to the virtuous equilibrium from starting point (h/a, 0.1) c: Convergence to the virtuous equilibrium from starting point (h/a, 0.8)

We consider the following values of the parameters, generating Scenario 2,

\(a=0.092, b=0.39, h=0.08, c=0.09, d=0.4, k=0.079\)

where \(\frac{k}{c}=0,8777>0,8695=\frac{h}{a}\).

We can consider orbits from any point of the half-line \((\frac{h}{a},\cdot )\) with \(\cdot >0\), for instance from \((\frac{h}{a},0.1)\) and \((\frac{h}{a},0.8)\), which are represented by orbits in Fig. 2b, c, converging to the attractor \(E_{1}\)

3.3 Scenario 3

Assume \(\frac{k}{c}>\frac{h}{a}\) and two internal equilibria exist, which requires the further conditions \(dh>bc\) and \(\left( dh+bc\right) ^{2}>4abdk\). Let us denote such equilibria as \(E_{2}=\left( x_{2},p_{2}\right) \) and \( E_{3}=\left( x_{3},p_{3}\right) \), \(x_{2}<x_{3},\) \(p_{2}>p_{3}\). Then, the following theorem holds.

Theorem 3

Under the above assumptions, \(E_{1}\) is a sink (attractor), \(E_{3}\) is a saddle and \(E_{2}\) is a sink if \(dp_{2}<a\) (higher competition, expressed by the value of a), or is a source (repellor) surrounded by an attracting limit cycle if \(dp_{2}>a\) (lower competition). In any case, there exist two basins of attraction, which are separated by the stable manifold of \(E_{3}\) (that is, by the union of the two orbits converging to \(E_{3}\)).

Proof

See “Appendix B”.

In fact, it can be seen that the above separatrix has the shape of an unbounded loop (the closure of the loop is at the point \(\left( x,p\right) =\left( +\infty ,0\right) \)). Hence, orbits starting inside the loop converge to the internal attractor (equilibrium or limit cycle), while orbits starting in Q outside the loop converge to \(E_{1}\).

In fact R, the region bounded by the loop, can be described as a “poverty trap” (see Antoci et al. 2011), since orbits lying in R converge to an attractor performing worse than \(E_{1}\), both in terms of pollution level and profitability.

From an economic point of view, we are interested in the strip bounded by the half-lines \({x=0}\) and \({x=\frac{h}{a}}\), which represents the maximum attainable return. Hence, in order to describe the attracting basins of Scenario 3, we should find the segment \(\left( {p}_{\min }{,p}_{\max }\right) \) on \(x=\frac{h}{a}\), from whose points depart the orbits lying in the poverty trap R.

In fact, it is easily observed that \({p}_{\min }=0\) implies \({p}_{\max }=+\infty \), while, when the isocline \(\overset{\cdot }{{p}}=0\) is tangent to \({ax+bp=h }\), so that \(\left( {dh+bc}\right) ^{2}{=4abdk}\), and hence \({p}_{{\min }}={p} _{{\max }}\).

Actually, when \(p_{\min }\) increases, \(p_{\max }\) decreases. This justifies the fact that we seek an approximation of \(p_{\min }\), which also allows to find an upper limit, starting from \({p=0}\), for orbits from \({x=}\frac{h}{a} \) tending to the optimal equilibrium \({E}_{1}\).

In fact, in “Appendix C” a lower and an upper approximation of \(p_{\min }\) are calculated.

Figure 3a provides a graphical representation of the isoclines of the system for Scenario 3, while Fig. 3b illustrates the shape of the poverty trap, bounded by the orbits starting from \(\left( \frac{h}{a},p_{\min }\right) \) and \(\left( \frac{h}{a},p_{\max }\right) \).

The generic orbits in Q are divided in two classes: those lying in the poverty trap and those converging to the virtuous equilibrium \(E_{1}\). \(\square \)

Fig. 3
figure 3

a: Scenario 3 isoclines b: Poverty trap attracting basin c: Convergence to the poverty trap from starting point (h/a, 0.0127) d: Convergence to the virtuous equilibrium from starting point (h/a, 0.8)

We consider the following set of values of the parameters generating Scenario 3

\(a=0.092, b=0.39, h=0.05, c=0.09, d=0.8, k=0.049\)

In fact, \(\frac{k}{c}=0.05444<0.5434=\frac{h}{a}\), \( dh=0.04>0.0351=bc\) and \(\left( dh+bc\right) ^{2}=0.00564>0.00562=4abdk\).

If the orbit starts from \((\frac{h}{a}, 0.0127)\), it converges to the attractor \(E_{2}\) as shown by Fig. 3c.

and this implies that the point \((\frac{h}{a},0.0127)\) lies in \(( \frac{h}{a},{p}_{\min }),(\frac{h}{a},{p}_{\max })\), that is within the poverty trap represented in Fig. 3b. Moreover, in this case \(dp_{2}<a\) and there is no limit cycle.

Vice-versa, the orbit starting from \((\frac{h}{a},0.8)\) converges to the attractor \(E_{1}\) (see Fig. 3d).

which implies \((\frac{h}{a},0.8)\) lying outside the poverty trap represented in Fig. 3b.

3.4 Comments

Several comments can be made, in particular concerning the equilibrium \( E_{2}=\left( x_{2},p_{2}\right) \), when it exists.

We recall that, when the quantity \(dp_{2}\) crosses the value a, a change of stability occurs: \(E_{2}\) is an attractor as long as \(dp_{2}<a\), whereas when \(dp_{2}>a\) the attractiveness migrates from \(E_{2}\) to a limit cycle surrounding the equilibrium.

So, it is interesting to investigate the role played by a, which can measure the effect of competition. In particular, assuming Scenario 3, since, when a increases, \(p_{2}\rightarrow p_{3}\), our model implies that the higher is the level of competition, in a given industrial sector, the easier is its transition toward a greener production and even, in the long-term, toward a steady profitability.

A second comment concerns the policy to be implemented in order to reduce the poverty trap. In fact, it follows from formula (2) that \(p_{\min }\) is so much higher, and thus the poverty trap is so more narrow, as the absolute value of the integral in the formula 3 (see “Appendix C”) is smaller. Then, increasing b—the unitary penalty for pollution—contributes to this goal.

Finally, the Scenario 3 shows an apparent paradox. In fact, with the same initial return, \(\frac{h}{a}\), an orbit with a higher initial pollution can escape the poverty trap, while another one, with a lower initial pollution, cannot, as shown by Fig. 3c, d.

However, such an apparent paradox can be mathematically explained, with an important consequence from an application point of view. In fact, starting from a higher level of pollution, the penalty, given by \(-bp\), produces a higher reduction of the investment profitability, so that, when the return approaches the value zero, that sector can be induced to change policy, reducing its pollution impact and therefore, in the long run, improving also its profitability. Vice-versa, when the initial point belongs to the poverty trap, that can be interpreted as the case of a productive sector which, being less penalized by its current pollution, can settle for a long-run lower profitability.

In fact, the analysis of such apparent paradox emphasizes even more the crucial role of an incentive-punishment policy (which is expressed, in particular, by the value of the parameter b).

4 Conclusions

The aim of the present work is to introduce a dynamical model that describes the problem of the transition toward a production system with lower emissions, through a system of penalties and incentives, on the basis of the current level of emissions.

The dynamics is described in continuous time, thus obtaining asymptotic results which, through a discrete re-reading of the dynamics, can be transferred to any time horizon. Although it was not possible to give full confirmation of the setting of the model parameters, not having information databases for some of them, the numerical simulations allow to extrapolate some important considerations in terms of sensitivity analysis of the results, which confirm what can be observed through the analysis of the theoretical dynamical model.

One of the lines of research that we intend to carry on will be related to improve even the theoretical model, relying on parameters derived from observable data.