1 A point of view for social simulation modeling

1.1 A replicator dynamics as a method of configuring evolutionary economic process and practical decision-making on business

Decision-making, whether by organizations or individuals, is configured from diverse evaluation axes. The behavior of organizations and individuals arises from a bundle of such decision-making. The model in this paper is a micro-agent-based model, whether the agent is an organization or an individual, which is configured using one replicator dynamics for each decision-making behavior. Each replicator dynamics is a unit making one decision. This paper describes a method of social simulation model primarily focusing on an inter-organizational decision-making.

On the other hand, there could be controversial issues against the use of replicator dynamics as the agent-based model that it is too simple and preliminary to depict industrial organizations and macroeconomic system. First, replicator dynamics mainly investigates selection issues while the mutations are not addressed at all. That is to say, the replicator dynamics describes selection only, no drift and no mutation. It is true, but a factor of mutation can be embedded in a model by other ways.

From a macroscopic point of view, we may certainly observe a phenomenon of mutation in such an evolutionary process as a society or economic system. However, a phenomenon of mutation in society or economic system cannot simply attribute to genetic miscopying or randomness in biology but can be derived from complex “path-dependency” on previous activities. For example, let us consider the problem of economic growth with technological change, which is a typical evolutionary process.

Dosi and Orsenigo (1988) argue that innovation is generally dominated by unintended results or uncertainty brought by the decision-making of an alien object with diverse technical capabilities, expectations, and beliefs. In other words, the existence of “events (world status)” to be aimed at by innovation is not fixed, but often must be recognized as a “non-stationary” existence that cannot be separated from the action of an activity object and is endogenously given.

Dosi (1990) says that the problems that need to be solved before implementing innovation activities are not only the renewal of methodologies or knowledge, which is required for the implication but also the dependency on problem settings in implementing the innovation itself. These problem settings depend on the technical capability and knowledge stored by an activity object. In this sense, innovation activity itself distinctively becomes “path-dependent” on previous activities and takes on a non-stationary characteristic.

In the sense mentioned above, it is virtually impossible to predict a phenomenon of mutation, that is, what kind of new technology can find practical application in the future or which technology becomes commercially practical in what area. Accordingly, it is impossible to determine the most appropriate allocation of R&D investments by specifying individual areas.

Although it is difficult to specify an investment area, it is possible to address this problem as an economic resource allocation issue under the presumption that mutations should occur. That is, we can consider how to contribute more to the sustainable growth of companies or the entire national economic system as described below by setting the ratio between investments in extending existing (well known) technology and investments in changing to totally new (unknown) technology. As a result, we can manage an economic balance between selection and mutation in the context of sustainable growth.

This paper attempts to configure a method for modeling social and economical system associated with the evolutionary process as a resource allocation problem using a replicator dynamics. In other words, it models the evolutionary economic process in which both an occurrence and diffusion of a new phenomenon (mutation and selection) take place continuously and are implemented in a system where two alternative phases A and B are alternately selected over time. We suppose that phase A has occurred when its diffusion rate becomes infimum over time.

Second, Cantner et al. (2016) point out that the pure model of replicator dynamics, though providing important insights in the evolution of markets, has not found much of empirical support such as the so-called “adverse selection”. They developed an extended replicator dynamics model to analyze the market share changes of firms vertically integrated into value chains and succeeded to show several propositions. For example, they concluded in proposition 2 as follows:

“The average fitness in upstream layers of a value chain can increase rather than decrease over time if upstream firms that are less fit than the average of the market they are operating in are linked to highly fit partners in downstream layers of the value chain. This dynamics violates the predictions of the standard replicator dynamics model and can be considered a regressive development of market selection driven by value chain relations.”

But adverse selection phenomena in the market, as well as natural selection, can also be commonly occurred in a replicator dynamics model based on a coordination game with very simple settings dependent on initial values, counterpart agent’s preferences, transaction relations and so on. Therefore, in this paper, we adopt replicator dynamics model because of producing and depicting diverse phenomena or state transitions and of its fitness for practical decision-making on business, as described below.

1.2 A way of thinking for practical decision-making on business

Generally, decision-making in economics model is supposed to be configured under rational expectations, whether bounded rationality or perfect foresight. Accordingly, a general equilibrium state is supposed to exist where transactions are all balanced. On the contrary, practical decision-making on business, for example, is executed based on PDCA (Plan, Do, Check and Action) cycle. That is,

  1. 1.

    Collection, compilation and analysis of information on macroeconomy, industry (counterparties and competitors) and internal accounts.

  2. 2.

    Planning based on the above evidence.

  3. 3.

    Selection of plans based on feasibility and profitability.

  4. 4.

    Execution of the optimum plan at the concerned time.

  5. 5.

    Comparison between the forecast and actual achievement of the plan.

  6. 6.

    Modification of the plan.

The above process on PDCA cycle can be based on rational activity where the most profitable and therefore optimum plan is selected of the possible ones at the concerned time. But it is almost difficult theoretically and computationally to configure the process approaching the general equilibrium based on the practical PDCA cycle by each concerned agent.

On the contrary, we can propose the activity hypothesis that the relation between each practical decision making on business and the whole system (market mechanism and macroeconomy) can be configured through the actions where each decision-making agent on PDCA cycle adapts to the business environment as a whole system. A replicator dynamics, which the model covered in this paper is based on, can be consistent with the adaptive decision-making.

1.3 An outline for configuration procedure of replicator dynamics

In this section, we demonstrate the configuration procedure where the action based on a series of PDCA cycle is constructed using replicator dynamics as in Table 1.

Table 1 Summary of configuration procedure on PDCA cycle using replicator dynamics

The first stage configures a social organization such as a corporate organization or inter-corporate relations for a virtual system composed of (1) agent, (2) field and (3) interaction.

In the second stage, we define strategies that agents select as demonstrated above. Both strategies “P1, P2,…, Pn” and “Q1, Q2,…, Qm”, which each agent select, respectively, can be planned based on the research results at the concerned time. Newly setting ups and abolitions of strategies can be approached whereby selection rates of strategies are varied from zero to a positive value or positive to zero, respectively, only if all the possible strategies are supposed in advance.

The third stage configures payoffs, respectively, based on each strategy of the party concerned and counterparties as demonstrated above. A state transition can be embedded into the payoff configuration over time.

In the fourth stage, we can summarize the above three-stage procedures using the following payoff matrix (see Table 2).

Table 2 Payoff matrix

In the fifth stage, we can define replicator dynamics according to the above payoff matrix. Generally, replicator dynamics assumes a well-mixed population with a finite number of strategies and posits that the per capita growth rate of each strategy is proportional to its fitness. The state transitions of each strategy can be configured as population dynamics where each agent selects the concerned strategies over time. The selection ratios of each strategy in the whole system are equivalent to the population ratios of the agents that adopt the concerned strategy.

1.4 An outline for configuration procedure of social simulation model

The model covered in this paper is configured from a combination of multiple replicator dynamics, whereby a state transition of each decision is made using the respective replicator dynamics. Through the model in this paper, it is possible in principle to uniformly describe all decision-making from the decision-making of individuals, which consists of a bundle of diverse decision-making, to the organizational decision-making, such as corporate organizations and the national economy consisting of many agents.

This paper will configure a general-purpose simulation model for collective decision-making such as social or organizational one through the following three-stage procedure.

Stage 1 Replicator dynamics as the basis for decision-making.

Stage 2 Indirect control among replicator dynamics through the cross-reference of payoffs: interests in them.

Stage 3 Control between the systems configured using multiple replicator dynamics.

In indirect control model of Stage 2, each agent behaves based on its own decision-making, and the relationship among agents is mutually autonomous. In terms of social system, the model is configured as a laissez-faire market system so to speak.

It is, however, also possible to embed control functions based on some kind of value standards between the systems configured using multiple replicator dynamics. These are the corporate planning departments in corporate organizations and correspond to the policymaking departments of government, etc., in the macroeconomy. This Stage 3 model is the general-purpose model that is the objective of this paper. This paper presents this three-stage model configuration method in the following sections and case studies in the Appendices based on Sakaki (2004a).

2 Basic structure of decision-making: replicator dynamics

Generally, replicator dynamics describes the relationship, whereby n types of agent “j” directly affect each other while making decisions out of mj types of alternative, respectively. A state transition, called the ratio of adoption of mj types of alternative, is derived from replicator dynamics. This selection ratio is an interpretation called the mixed strategy of the players in game theory but corresponds to the state of organizational and social selection in replicator dynamics, which is the dynamics of the agent group. Especially in the case of inter-system control explained later, direct policy interpretation is possible about the state of system selection.

In the replicator dynamics that form the basis for the model in this paper, the number of types is limited to two types of agent and two types of alternative, without losing generality in relation to the model configuration. In other words, the definition is based on

  1. 1.

    the interdependence between two different agents X and Y; and

  2. 2.

    two alternatives (strategies) A and B that each agent could adopt.

By deriving a collective choice from individual decision-making through this replicator dynamics and by creating a complex of micro replicator dynamics, a social system can be formed. In other words, the general-purpose simulation model configured in this paper will be built as an integrated system for social and collective decision-making, based on replicator dynamics consisting of these basic constituent elements, via the following stages:

  1. 1.

    indirect control through the cross-reference of payoffs between agents;

  2. 2.

    the configuration of multiple systems using multiple replicator dynamics; and

  3. 3.

    control between each of the above systems.

This section will introduce the basic configuration of replicator dynamics based on the above problem statement. Let us start using Ut to represent the payoff of agent X, which is one of the groups of parties, at time t and using Vt to represent the payoff of the other group Y. Furthermore, we will define the payoff that each agent will obtain if it adopts alternatives A and B according to the alternatives A and B of the counterparty. Let us make the left side of the superscript express the selection outcome of X and the right side express the selection outcome of Y. For example, the payoff that agent X will obtain if it chooses A is defined as follows according to the alternatives A and B of counterparty Y.

$$ U_{t}^{AA} ,\;U_{t}^{AB} $$

Alternatively, the payoff that party Y will obtain if it chooses A can be defined as follows according to the alternatives A and B of counterparty X.

$$ V_{t}^{AA} ,\,V_{t}^{BA} $$

The matrix in Table 3 expresses the payoffs that can be defined from all the interrelationships between the above two groups.

Table 3 Payoff matrix of agents X and Y

Next, based on the payoff matrix in Table 3, let us try to configure replicator dynamics. Now, let us assume that X and Y are each configured as groups consisting of agents that could adopt the two alternatives A and B. We then express the agents that adopt alternative A at a certain time t as a ratio to the total populations of X and Y, respectively, as xt and yt. Since there are two types of behavior the agents could choose, the population ratio of agents that adopt B can be expressed as 1 − xt and 1 − yt, respectively.

If we then use E(U A t ) and E(U B t ), respectively, to represent the expected payoff that agents belonging to Group X will obtain if they adopt alternative A or B, we are able to express them, respectively, as an average payoff in proportion to its population ratio of agents belonging to the counterparty Group Y that adopt each alternative.

$$ E(U_{t}^{A} ) = y_{t} U_{t}^{AA} + (1 - y_{t} )U_{t}^{AB} $$
(1)
$$ E(U_{t}^{B} ) = y_{t} U_{t}^{BA} + (1 - y_{t} )U_{t}^{BB} $$
(2)

The total payoff that agents belonging to Group X can expect on average through alternatives A and B at this time: E(Ut) can be expressed by averaging the expected payoffs: E(U A t ) and E(U B t ) using population ratios xt and 1 − xt, which are the respective selection ratios of Group X itself.

$$ \begin{aligned} E(U_{t} ) & = x_{t} E(U_{t}^{A} ) + (1 - x_{t} )E(U_{t}^{B} ) \\ & = x_{t} \left\{ {y_{t} U_{t}^{AA} + (1 - y_{t} )U_{t}^{AB} } \right\} + (1 - x_{t} )\left\{ {y_{t} U_{t}^{BA} + (1 - y_{t} )U_{t}^{BB} } \right\} \\ & = x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} + (1 - x_{t} )y_{t} U_{t}^{BA} + (1 - x_{t} )(1 - y_{t} )U_{t}^{BB} \\ \end{aligned} $$
(3)

Generally, replicator dynamics assumes a well-mixed population with a finite number of strategies as mentioned above and posits that the per capita growth rate of each strategy is proportional to its fitness. Thus, using the above result, the population dynamics of the group of agents belonging to Group X that adopt alternative A can be defined as shown below (Weibull 1995). In other words, the state transition of the population ratio of the group adopting alternative A is defined based on and proportional to the ratio of expected payoff in the case of adoption of alternative A to total average payoff.

$$ \frac{{x_{t + 1} }}{{x_{t} }} = \frac{{E(U_{t}^{A} )}}{{E(U_{t} )}} = \frac{{y_{t} U_{t}^{AA} + (1 - y_{t} )U_{t}^{AB} }}{{x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} + (1 - x_{t} )y_{t} U_{t}^{BA} + (1 - x_{t} )(1 - y_{t} )U_{t}^{BB} }} $$
(4)
$$ x_{t + 1} = \frac{{x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} }}{{x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} + (1 - x_{t} )y_{t} U_{t}^{BA} + (1 - x_{t} )(1 - y_{t} )U_{t}^{BB} }} $$
(4′)

Similarly, expected payoff to be obtained by agents belonging to Group Y if they adopt alternative A or B: E(V A t ) and E(V B t ), the total average payoff: E(Vt), and the population dynamics of the group that adopts alternative A can each be expressed as follows:

$$ E(V_{t}^{A} ) = x_{t} V_{t}^{AA} + (1 - x_{t} )V_{t}^{BA} $$
(5)
$$ E(V_{t}^{B} ) = x_{t} V_{t}^{AB} + (1 - x_{t} )V_{t}^{BB} $$
(6)
$$ \begin{aligned} E(V_{t} ) & = y_{t} E(V_{t}^{A} ) + (1 - y_{t} )E(V_{t}^{B} ) \\ & = y_{t} \left\{ {x_{t} V_{t}^{AA} + (1 - x_{t} )V_{t}^{BA} } \right\} + (1 - y_{t} )\left\{ {x_{t} V_{t}^{AB} + (1 - x_{t} )V_{t}^{BB} } \right\} \\ & = y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} + (1 - y_{t} )x_{t} V_{t}^{AB} + (1 - y_{t} )(1 - x_{t} )V_{t}^{BB} \\ \end{aligned} $$
(7)
$$ \frac{{y_{t + 1} }}{{y_{t} }} = \frac{{E(V_{t}^{A} )}}{{E(V_{t} )}} = \frac{{x_{t} V_{t}^{AA} + (1 - x_{t} )V_{t}^{BA} }}{{y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} + (1 - y_{t} )x_{t} V_{t}^{AB} + (1 - y_{t} )(1 - x_{t} )V_{t}^{BB} }} $$
(8)
$$ y_{t + 1} = \frac{{y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} }}{{y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} + (1 - y_{t} )x_{t} V_{t}^{AB} + (1 - y_{t} )(1 - x_{t} )V_{t}^{BB} }} $$
(8′)

If the interdependence of transactions between the groups is defined as shown in Table 3 and the payoffs shown in the matrix are defined concretely through the above process, the ratio of behavior selection adopted over time by each group can be found as population dynamics.

Replicator dynamics as a unit of micro decision-making is based on adaptive behavior theory in relation to the behavior selection of the direct counterparty, as seen above. For example, a typical example of decision-making by companies is management plans, such as plans for the next fiscal year, medium-term plans and long-term plans covering capital investment. All management plans involve drawing up an initial plan based on forecasts and estimates, reviewing the plan as the plan progresses in light of the movement in competitors, the external macroeconomic environment, and other factors, and reformulating a revised plan, etc. In addition, in the case of day-to-day decision-making related to profit and loss during the period, managers always compare previous budgets and plan with actual results and revise the plan where necessary. In this way, adaptive decision-making behavior takes place based on a PDCA cycle at every level from day-to-day decision-making to long-term management plans. The principles of decision-making through replicator dynamics are consistent with decision-making in practice.

3 Indirect control among replicator dynamics through cross-reference of payoffs

According to Deguchi (2000), rather than simply describing the interaction that depends directly on the population ratio of each agent, which is the definitional equation of replicator dynamics, it is now possible to configure the relationship, whereby agents indirectly affect each other by cross-referencing information concerning the payoff structure. Through this system configuration called indirect control, it became possible to describe the relationship among multiple replicator dynamics as well, and the application domain expanded dramatically.Footnote 1 This is because even if agents are not directly affected through the decision-making of other agents, generally they are affected by each other through competition to acquire payoffs.

Let us try looking at the next system consisting of replicator dynamics. These are Eqs. (4′) and (8′) for the replicator dynamics of Groups X and Y described in Sect. 2 (reproduced), and Eq. (9) for new Group Z. Group Z is configured such that decision-making is made within a group consisting of the same type of agents unlike the decision-making between different agents such as Groups X and Y. As before, there are two types of decision-making alternatives A and B. In addition, since Group Z is a homogeneous group, the payoffs in the case of differing alternatives between agents are symmetric, and WAB = WBA.

$$ x_{t + 1} = \frac{{x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} }}{{x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} + (1 - x_{t} )y_{t} U_{t}^{BA} + (1 - x_{t} )(1 - y_{t} )U_{t}^{BB} }} $$
(4′)
$$ y_{t + 1} = \frac{{y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} }}{{y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} + (1 - y_{t} )x_{t} V_{t}^{AB} + (1 - y_{t} )(1 - x_{t} )V_{t}^{BB} }} $$
(8′)
$$ z_{t + 1} = \frac{{z_{t}^{2} W_{t}^{AA} + z_{t} (1 - z_{t} )W_{t}^{AB} }}{{z_{t}^{2} W_{t}^{AA} + z_{t} (1 - z_{t} )W_{t}^{AB} + z_{t} (1 - z_{t} )W_{t}^{AB} + (1 - z_{t} )^{2} W_{t}^{BB} }} $$
(9)

Equations (4′) and (8′) for the replicator dynamics of Groups X and Y mutually depend directly on the population ratios of the agents that belong to X and Y. Let us note, however, that Eq. (9) for Group Z directly depends on the population ratio of agents that belong to Group Z only. In other words, Eqs. (4′), (8′) and (9) do not mutually depend directly on the population ratios of each other and the definition of replicator dynamics is not valid between these three equations. Equations (4′), (8′) and (9) are independent replicator dynamics, respectively.

However, with the indirect control expounded by Deguchi (2000), it became possible to configure a system having relationships through common variables within the payoff configuration of Eqs. (4′), (8′) and (9), and enabling practical interpretation. For example, if a relationship whereby each payoff depended on each of the population ratios:

$$ U_{t} \left( {x_{t} ,y_{t} ,z_{t} } \right),\;V_{t} \left( {x_{t} ,y_{t} ,z_{t} } \right),\;W_{t} \left( {x_{t} ,y_{t} ,z_{t} } \right), $$

could be defined, interdependence between Eqs. (4′), (8′) and (9) would arise through their respective population ratios and this would indirectly affect the state transition of each population ratio.

4 Inter-system control

The model of indirect control in Sect. 3 configured a relationship between systems, whereby multiple replicator dynamics indirectly affect each other by cross-referencing information concerning the payoff structure. Any of agents then behave based on their own decision-making, and their relationship is configured as a mutually autonomous laissez-faire market system so to speak.

It is, however, also possible to embed direct control functions based on some kind of value standards among the systems configured using multiple replicator dynamics. These are the business planning departments in corporate organizations and correspond to the policymaking departments such as government in the national economy. With significant progress in information network technologies, the gathering, compiling, analyzing and distribution functions of the information necessary for such control have improved dramatically, and the possibility has emerged that the mechanisms, which achieve control or inducement among systems, can be realistically organized.

And so to configure the mechanisms of inter-system control, let us consider multiple virtual systems. Let the virtual systems be replicator dynamics configured as social systems, whereby the configurations of agents are the same but elements such as the payoff configuration and/or interdependence relations are different. We then define the aggregate value per capita to be generated using a configuration equation that is common to these multiple virtual systems. Using this as a standard measure, we control the existence ratio of each system through “the third replicator dynamics”, which selects the multiple virtual systems. The inter-system control is a system design model combining the merits of each virtual social system according to these existence ratios based on such an idea. This paper will consider control of the system in which the two virtual systems are two opposite extreme alternatives.

As seen in Sect. 2, generally, the replicator dynamics of a group consisting of two different types of an agent can be expressed using the following equations (Reproduced). Each of the equations is configured from the population ratios at the beginning of the period of the two types of groups X and Y: xt and yt and each of their payoffs. And since the payoffs themselves also depend on the population ratios of the two types of the group at the beginning of the period, this means that the replicator dynamics of each group can consequently be expressed using functions of the population ratios of the two types of the group at the beginning of the period.

$$ \frac{{x_{t + 1} }}{{x_{t} }} = \frac{{E(U_{t}^{A} )}}{{E(U_{t} )}} = \frac{{y_{t} U_{t}^{AA} + (1 - y_{t} )U_{t}^{AB} }}{{x_{t} y_{t} U_{t}^{AA} + x_{t} (1 - y_{t} )U_{t}^{AB} + (1 - x_{t} )y_{t} U_{t}^{BA} + (1 - x_{t} )(1 - y_{t} )U_{t}^{BB} }} $$
(4)
$$ \frac{{y_{t + 1} }}{{y_{t} }} = \frac{{E(V_{t}^{A} )}}{{E(V_{t} )}} = \frac{{x_{t} V_{t}^{AA} + (1 - x_{t} )V_{t}^{BA} }}{{y_{t} x_{t} V_{t}^{AA} + y_{t} (1 - x_{t} )V_{t}^{BA} + (1 - y_{t} )x_{t} V_{t}^{AB} + (1 - y_{t} )(1 - x_{t} )V_{t}^{BB} }} $$
(8)

So let us try rewriting Eqs. (4) and (8) for the replicator dynamics of groups X and Y consisting of two different types of agents as functions of the population ratios at the beginning of the period of the two types of groups: xt and yt. And we will distinguish the population ratios between the two virtual social systems consisting of the same group configuration, in other words, each system using the superscript 0 and 1, respectively. The replicator dynamics (10) and (11) are for the virtual social system “0”, and (12) and (13) are for the virtual social system “1”. The functional types of f, F, g, and G will be different between systems “0” and “1”, respectively, due to the configuration of payoffs, etc,

$$ \left\{ {\begin{array}{*{20}c} \displaystyle{\frac{{x_{t + 1}^{0} }}{{x_{t}^{0} }} = \frac{{E(U_{t}^{A} )}}{{E(U_{t} )}} \equiv \frac{{f^{0} \left( {x_{t}^{0} ,y_{t}^{0} } \right)}}{{F^{0} \left( {x_{t}^{0} ,y_{t}^{0} } \right)}}\quad (10)} \\ \displaystyle{\frac{{y_{t + 1}^{0} }}{{y_{t}^{0} }} = \frac{{E(V_{t}^{A} )}}{{E(V_{t} )}} \equiv \frac{{g^{0} \left( {x_{t}^{0} ,y_{t}^{0} } \right)}}{{G^{0} \left( {x_{t}^{0} ,y_{t}^{0} } \right)}}\quad (11)} \\ \end{array} } \right. $$
$$ \left\{ {\begin{array}{*{20}c} \displaystyle{\frac{{x_{t + 1}^{1} }}{{x_{t}^{1} }} = \frac{{E(U_{t}^{A} )}}{{E(U_{t} )}} \equiv \frac{{f^{1} \left( {x_{t}^{1} ,y_{t}^{1} } \right)}}{{F^{1} \left( {x_{t}^{1} ,y_{t}^{1} } \right)}}\quad (12)} \\ \displaystyle{\frac{{y_{t + 1}^{1} }}{{y_{t}^{1} }} = \frac{{E(V_{t}^{A} )}}{{E(V_{t} )}} \equiv \frac{{g^{1} \left( {x_{t}^{1} ,y_{t}^{1} } \right)}}{{G^{1} \left( {x_{t}^{1} ,y_{t}^{1} } \right)}}\quad (13)} \\ \end{array} } \right. $$

Next, let us define the aggregate value per capita produced in society as a whole. This value is configured as a measure that is common to virtual social systems “0” and “1”. The configuration procedure (calculation formula) for the aggregate value per capita is the same, but due to differences of the population ratios in the configured virtual social systems, each value at the relevant point in time will be different.

$$ h\left( {x_{t}^{0} ,y_{t}^{0} } \right) $$
(14)
$$ h\left( {x_{t}^{1} ,y_{t}^{1} } \right) $$
(15)

With this aggregate value per capita as the evaluation standard, and using the virtual social systems “0” and “1” as two opposite extreme alternatives, let us consider the sector that determines the selection ratios: the system control sector. The system control sector is configured of groups consisting of the same type of agents, which select the two virtual social systems. Then, with aggregate value per capita as the evaluation standard, if we use virtual social systems “0” and “1” to represent each of the alternatives and use \( \rho_{t}^{0} \) to represent the population ratio of agents that select the former system and \( \rho_{t}^{1} \) to represent the population ratio of agents that select the latter, we can define the following replicator dynamics:

$$ \rho_{t + 1}^{0} = \frac{{(\rho_{t}^{0} )^{2} h\left( {x_{t}^{0} ,y_{t}^{0} } \right) + \rho_{t}^{0} \rho_{t}^{1} h\left( {x_{t - 1}^{0} ,y_{t - 1}^{0} } \right)}}{{(\rho_{t}^{0} )^{2} h\left( {x_{t}^{0} ,y_{t}^{0} } \right) + \rho_{t}^{0} \rho_{t}^{1} h\left( {x_{t - 1}^{0} ,y_{t - 1}^{0} } \right) + \rho_{t}^{1} \rho_{t}^{0} h\left( {x_{t - 1}^{1} ,y_{t - 1}^{1} } \right) + (\rho_{t}^{1} )^{2} h\left( {x_{t}^{1} ,y_{t}^{1} } \right)}} $$
(16)
$$ \rho_{t + 1}^{1} = \frac{{(\rho_{t}^{1} )^{2} h\left( {x_{t}^{1} ,y_{t}^{1} } \right) + \rho_{t}^{1} \rho_{t}^{0} h\left( {x_{t - 1}^{1} ,y_{t - 1}^{1} } \right)}}{{(\rho_{t}^{1} )^{2} h\left( {x_{t}^{1} ,y_{t}^{1} } \right) + \rho_{t}^{1} \rho_{t}^{0} h\left( {x_{t - 1}^{1} ,y_{t - 1}^{1} } \right) + \rho_{t}^{0} \rho_{t}^{1} h\left( {x_{t - 1}^{0} ,y_{t - 1}^{0} } \right) + (\rho_{t}^{0} )^{2} h\left( {x_{t}^{0} ,y_{t}^{0} } \right)}} $$
(17)
$$ \rho_{t}^{0} + \rho_{t}^{1} = 1 $$
(18)

In the system control sector, as in the research and development sector in Appendix 2, for agents adopting each alternative, the existence of agents adopting the same alternative affects average payoff. If the groups of agents belonging to the system control sector make the same judgment on the virtual social system to be selected, the aggregate value per capita of this system in the “current period” is used as the evaluation standard. Because they can calculate “the aggregate value for this term” as they can share the same information on the current term relating to the selection ratio of knowledge stocks chosen in the groups X and Y.

However, if they make different judgments on the virtual social system to be selected, the aggregate value per capita in the “previous period” is used as the evaluation standard. Because in this situation, only the already known “aggregate value for the previous term” can be calculated as they cannot share information on the current term.Footnote 2

The aggregate value per capita in each of these periods is produced based on the population ratios in each period, respectively. In addition, since there are two types of virtual social system as alternatives, the sum total of the population ratios of the groups in the system control sector that choose both is 1 as shown in Eq. (18).

Then, if the selection ratios of the two virtual social systems are determined by Eqs. (16) and (17), we can define a state transition in decision-making in the real society. This real society can be induced to be controlled into the more desirable state based on the aggregate value per capita standard. The numerator, which expresses the average payoff if both agents X and Y adopt one of the alternatives, and the denominator, which expresses the average payoff through both alternatives, are, in both equations, average values calculated through the two virtual social system selection ratios, and satisfy the definition configuration of replicator dynamics.

$$ \frac{{x_{t + 1} }}{{x_{t} }} = \frac{{\rho_{t}^{0} f^{0} \left( {x_{t} ,y_{t} } \right) + \rho_{t}^{1} f^{1} \left( {x_{t} ,y_{t} } \right)}}{{\rho_{t}^{0} F^{0} \left( {x_{t} ,y_{t} } \right) + \rho_{t}^{1} F^{1} \left( {x_{t} ,y_{t} } \right)}} $$
(19)
$$ \frac{{y_{t + 1} }}{{y_{t} }} = \frac{{\rho_{t}^{0} g^{0} \left( {x_{t} ,y_{t} } \right) + \rho_{t}^{1} g^{1} \left( {x_{t} ,y_{t} } \right)}}{{\rho_{t}^{0} G^{0} \left( {x_{t} ,y_{t} } \right) + \rho_{t}^{1} G^{1} \left( {x_{t} ,y_{t} } \right)}} $$
(20)

To sum up the above, the system of the general-purpose simulation model using inter-system control can be configured as a complex form of replicator dynamics (A), (B) and (C). And it guides real society using the distribution ratios obtained from simulation of the virtual systems: \( \rho_{t}^{0} \) and \( \rho_{t}^{1} \) as policy variables.

  1. (A)

    Two virtual social system sectors: (10), (11) and (12), (13).

  2. (B)

    System control sector: (16), (17).

  3. (C)

    Real society sector: (19), (20).

5 Concluding remarks

This paper presented a method for the configuration of a social simulation model intended for general-purpose application as “inter-system control model” to analyze real societies engaged in complex and diverse decision-making. The author so far developed three types of simulation model applying the inter-system control concept to economic problems. These are “Management Model for Technological Change and Sustainable Growth (Sakaki 2004a)”, “Model for Evaluation between Open Access and Reinforcing Private Ownership (Sakaki 2009)”, and “Model for Income Distribution and Economic Growth (Sakaki 2017)”.