Complex & Intelligent Systems

, Volume 3, Issue 2, pp 121–132

An evolutionary hybrid method to predict pistachio price

  • Azim Heydari
  • Farshid Keynia
  • Nasser Shahsavari-Pour
  • Reza Sedaghat
Open Access
Original Article

DOI: 10.1007/s40747-017-0038-8

Cite this article as:
Heydari, A., Keynia, F., Shahsavari-Pour, N. et al. Complex Intell. Syst. (2017) 3: 121. doi:10.1007/s40747-017-0038-8
  • 163 Downloads

Abstract

Pistachio produce is of great importance in the world and considered as a valuable agricultural produce. This product is important for the economy of the producing countries as well as other importer countries. In this regard, countries which are capable of getting control over this product’s market would obtain considerable benefits; one way to do so is predicting market trend or the price of this product and the affecting factors. In this survey, initially, the affecting parameters on pistachio price are identified. Then, using a novel combined intelligent method, the price of this product is predicted. In addition, using analysis of variance method, the results of the proposed method are compared to other intelligent combined methods, such as Feed Forward-Particle Swarm Optimization, Elman- Imperialist Competitive Algorithm, Feed Forward-Imperialist Competitive Algorithm, Elman-Genetic Algorithm, and Feed Forward-Genetic Algorithm. The results of the mentioned comparison indicate that the proposed method owns an excellent performance.

Keywords

Pistachio Artificial neural networks Meta-heuristic algorithms Analysis of Variance 

Introduction

Nowadays, managers’ success in various decision-making conditions and the identification of the future events for better decision-making requires a methodology to predict these events and obtain a better estimate. The economic decision-making to achieve the maximum benefit is regarded as one of management success determinants. Since prediction is the process to forecast almost near future [1], therefore, using methods which result in less error prediction has a broad application in many fields. In the recent decade, global market of agricultural produce has experienced significant fluctuations; as in some cases, considerable amount instability was observed.

The aforementioned issue may be intensified in some countries due to the currency instability which is dramatic in case of US dollar. These fluctuations can be extremely harmful for the developing countries’ stable benefits—resulted from export, and consequently, may lead to some problems in paying off their debts and growth [2]. Pistachio produce has a great importance and high value in agricultural and economic sectors of the producing countries. In recent years, those countries, capable to dominate the pistachio market, using great mass production, have gained considerable benefit.

There are several methods to model and predict time series; the traditional–statistical methods which introduce linear predictions for future values of the variables, such as: Moving Average, Weighted Average, ARIMA, Regression Method, Markov Chain, Neural Networks, and etc. Despite some advantages, linear methods face some limitations like incapability to work with non-linear relations. To compensate this shortcoming, several non-linear methods have been suggested. In addition, in recent years, artificial neural networks have been employed to predict time series. One of the most important advantages of neural networks is their flexibility in prediction of different types of non-linear models. These networks are inspired by human neural network and brain; therefore, they have better efficiency compared to other mentioned methods in different problems. Some of the studies conducted on economic dimensions of pistachio are: Moshiri and Cemeron reviewed advantages of neural networks over econometric models and compared Back-Propagation Artificial Neural Network (BPN) model with statistic and econometric models [3]. Their results showed that BPN model is capable of excellent prediction. Olson and Mossman compared error back-propagation neural network method with Logit model and Ordinary Least Square methods [4]. The results show that the neural network model is more accurate and possesses lower error. In a study, Chen claimed that currency fluctuations of China’s agricultural export have had a positive effect on Japan [5]. Abolagba et al. evaluated the export of agricultural products [6]. In their study, the effective factors on agricultural produce export with Ordinary Least Square regression method (OLS) was estimated, regarding variables of product amount, export amount, and currency rate, production price, and the amount of domestic and national consumption. In a study, Zheng et al. investigated the role of US in the global production and pistachio business as well as affecting factors on the US’s demand of pistachio export [7]. Results of this study indicated that, using advanced technologies and higher food standards, US pistachio production was able to increase its rank in the global markets. In a case study by Yazdani, risk management tools of pistachio farmers were discussed. Sirjan city is one of the main pistachio producers in Kerman which itself is a very significant pistachio producer province in Iran. The author mentioned and analyzed 144 dangers that farmers face while producing pistachio. The results indicate that: use poisons before damage (3.55), use risk management tools before damage (3.52), do not using new products (3.21), agricultural insurance (2.85) has the highest preference on risk management tools. In addition, stepwise regression results show these three factors: family members, Education, and age of farmers approximately 67% effect on risk management tools [8]. Tohidi et al. represented the hybrid artificial neural network-panel data method to predict the price of pistachio, raisin, and date exports. The results show that the novel artificial hybrid neural network-panel data method has a better performance in the prediction of the price of Iran’s pistachio, raisin, and date exports than that of the regression method. Thus, it is proposed that exporters, policy makers, and researchers use this method to predict economic variables [9].

Based on the previous conducted studies about pistachio, there has been no study on pistachio price prediction. In this study, two methods are used to determine the effective parameters on pistachio price. In addition, a new hybrid method (El-PSO) is used to predict the price of pistachio. The results of the suggested method were compared with other combined methods, such as El-GA, El-ICA, FF-PSO, FF-GA, and FF-ICA. It was pointed out the proposed method has an excellent performance and a relatively negligible error.

This investigation is organized as follows: “Data and methods” explains the methods to determine correlation coefficient. Intelligent methods are illustrated in the next section. The proposed method for prime cost is explained further. “Numerical results” and “Conclusion” present the study’s results of the proposed intelligent combined methods, their comparison, and a general conclusion of this investigation, respectively.

Data and methods

Nowadays, due to economic crisis, fluctuation in currency rate, and inflation rate, it is difficult to predict the produce’s price; in addition, the identification of the influential parameters has its own complexity. The employed parameters in this study are: Domestic Pistachio Market Price (DPMP), Pistachio Global Price (PGP), Iran’s Pistachio Export Quantity (IPEQ), Iran’s Pistachio Production Quantity (IPPQ), Pistachio Global Production Quantity (PGPQ), Domestic Inflation Rate (DIR), Pistachio Production Cost Index (PPCI), and currency rate [US Dollar (UD), Canadian Dollar (CD), England Pound (EP), Emirate Dirham (ED)]. These data were collected in a period of 15 years (1996–2010). Consider Fig. 1.
Fig. 1

Parameters affecting pistachio price

Table 1

Effect of the parameters on pistachio price, based on two proposed methods

Methods

IPEQ

IPPQ

PGPQ

PGP

DIR

PPCI

UD

CD

EP

ED

Pearson correlation

0.057

0.332

0.819

0.872

0.964

0.745

0.689

0.918

0.815

0.745

Mutual information (MI)

0.003

0.013

0.199

0.55

0.606

0.434

0.606

0.606

0.606

0.606

Pearson correlation

Correlation method is employed as a statistic tool to measure the state of the relation between variables. Several correlation methods exist to determine the effects of variables on each other. One of the most common of all is Pearson correlation coefficient, because when there is a linear relation between two variables, Pearson correlation coefficient shows a high sensitivity [10, 11, 12]. Pearson correlation coefficient measures the relationship between two variables by dividing the covariance of the two variables by their standard deviation [13]. The correlation coefficient \(\rho _{X,Y} \) between two random variables X and Y, with expected value of \(\mu _X ,\mu _Y \), and standard deviation \(\sigma _X \sigma _Y \) is calculated as:
$$\begin{aligned} \rho _{X,Y}= & {} \text {Corr}\left( {X,Y} \right) =\frac{\text {Cov }\left( {X,Y} \right) }{\sigma _X \sigma _Y }\nonumber \\= & {} \frac{E \left[ {\left( {X-\mu _X } \right) \left( {X-\mu _Y } \right) } \right] }{\sigma _X \sigma _Y } \end{aligned}$$
(1)
where E is the expected value, Cov is the covariance, \(\sigma \) is the standard deviation, and Corr stands for Pearson correlation coefficient.

Mutual information

Based on the following definitions and equations, the effect of two variables on each other can be specified.

Definition 1

MI between two the discrete random variables of X and Y regarding the distribution of p( xy ) is simultaneously and collectively obtained as follows [14]:
$$\begin{aligned} I( {X;Y})=\sum _{x,y} {p( {x,y} )} \;\log \frac{p( {x,y} )}{p( x )p( y )}. \end{aligned}$$
(2)
In addition, it should be considered that an appropriate MI is in arguments as follows [15]:
$$\begin{aligned} I( {X;Y} )=I( {Y;X} ). \end{aligned}$$
(3)

Definition 2

MI between two continuous random variables of X and Y is obtained by p.d.f, f( xy ) as follows [15]:
$$\begin{aligned} I( {X;Y} )=\int \int {f( {x,y} )\;\log \frac{f( {x,y} )}{f( x )f( y )}} \text {d}_x \text {d}_y. \end{aligned}$$
(4)

The coefficient of the selected parameters

All parameters, already introduced to predict pistachio price, are evaluated and their effectiveness is studied, using two suggested methods. Table 1 shows the influence of the introduced parameters on pistachio price.

As it can be observed in Table 1, the obtained coefficients are in the interval of [0 1] and these coefficients illustrate that larger coefficients (closer to 1) are more influential. All the parameters, except the quantity of internal production and export quantity, have large coefficients. Figures 2, 3, and 4 illustrate the trend of export quantity, pistachio global price, and pistachio the price in Iran, respectively.
Fig. 2

Iran’s pistachio export quantity data, in a time period of 2006–2010

Fig. 3

Pistachio global price data, in a time period of 2006–2010

Fig. 4

Domestic pistachio market price data, in a time period of 2006–2010

Fig. 5

General schematic of Elman Neural network, along with specific inputs and outputs

Intelligent methods

Elman neural network

In general, Artificial Neural Networks (ANNs) are briefly mathematical techniques planned and designed to accomplish different types of tasks. Nowadays, neural networks can be configured in a variety of arrangements to perform a range of tasks including pattern recognition, data mining, classification, forecasting, and process modeling. ANNs are made up of attributes which lead to perfect solutions in applications where learning a linear or non-linear mapping is needed [16]. Elman network is considered as one of the most common and efficient recurrent neural networks. This network is normally a two-layer network, in which the first layer is the hidden layer or input layer and the second layer is the output layer. This network performs like a perceptron network; however, in the Elman network, the output of the hidden layer is applied to the input of the hidden layer, via feedback loops. Hence, the output value in each moment will depend on the previous efficient values. Figure 5 depicts a schematic of Elman neural network, along with its inputs and outputs of the problem.

Feed-forward neural network

Feed-forward neural network, inspired by biologic classification algorithm, consists of different neurons and layers to process and organize data. The existence of several layers of neurons with a non-linear transfer function makes the network capable to learn linear and non-linear relation between inputs and outputs. Neurons, which are the elements of the signal processing, are connected to each other by synaptic feed-forward cross layer joints. The input–output connection may be performed by non-linear mapping functions [17]. Appropriate determination and selection of weights and biases is very important in network performance. Weight coefficient expresses the level of the given importance to the parameter. In multi-layer neural networks, the output of a layer is the input of the succeeding one. The equation is as follows [18]. Figure 6 shows the feed-forward neural network, along with inputs and outputs of the problem.
Fig. 6

Feed-forward neural networks, with specific inputs and outputs

Genetic algorithm

Genetic algorithm is an optimization method, inspired by living nature, which can be introduced as a numerical, direct, and random search method. This algorithm is based on population and repetition. Its principle concepts are derived from genetic science [19]. In addition, to apply the concept of genetic evolution to an optimization problem, in the real world; two points should be considered: (1) encoding the potential solutions; (2) defining the fitness function (cost function) [20]. The general structure of genetic algorithm employed in this study is defined in a step-by-step process [21].

Step 1: Defining the practical solution of the problem as a genetic problem

Step 2: Generating an initial population \(P( 0 )=x_1^0 ,\ldots , x_N^0.~{\text {Set}}~ t=0.\)

Step 3: the calculation of the mean of fitness \(\bar{{f}}( t )=\sum _i^N {{f( {x_i } )}/N} \). Allocating fitness value to each person \({f( {x_i })}/{\bar{{f}}( t )}\).

Step 4: selection operator; in this study, tournament selection operator is employed for selection of parents for crossover.

Step 5: running the crossover operator with a defined probability for each pair.

Step 6: applying mutation operator with a defined probability for each child.

Step 7: forming a new population \(p ( {t+1} )\) using surviving mechanism.

Step 8: setting \(t=t+1\) and returning to the third step.

Imperialist competitive algorithm

Imperialist Competitive Algorithm (ICA) is a novel political–social strategy for optimize different problems [22]. This algorithm, as an evolutionary optimization strategy, possesses an excellent performance in convergence speed and absolute optimum point [22, 23, 24, 25, 26, 27]. ICA algorithm is simulated based on political-social imperialism process and imperialist competition [28]. In different optimization methods, different arrays have been used to name problem solutions. For example, in PSO, GA, and ICA algorithms, this array is named as particle, chromosome, and country, respectively. Different optimization problems consist of different dimensions. For an \(N_{\text {var}} \) dimension optimization problem, a country has \(1\times N_{\text {var}} \) arrays; the array is defined as below:
$$\begin{aligned} \text {Country}=[ {P_1 , P_2 ,\ldots ,P_{N_{\text {var}}} } ] \end{aligned}$$
(5)
where P shows the dimensions of the problem and each dimension of the problem represents a specific characteristic of the country. Figure 7 depicts some of these characteristics [26].
Fig. 7

Country including a combination of some socio-political features

Fig. 8

Imperialistic competition

Each country’s value is evaluated by a cost function. This function, based on the type of the problem, is different and its general form is as:
$$\begin{aligned} \text {Cost}=f( \text {country} )=( {P_1 , P_2 ,\ldots ,P_{N_{\text {var}}} }). \end{aligned}$$
(6)
This algorithm starts its work by generating an initial population from countries with \(N_{\text {POP}} \) size. The countries are divided to two categories: imperialist and colony. A number equal to \(N_{\text {imp}} \) and \(N_{\text {col}} \) is selected from the powerful countries and countries as the empire and colony, respectively. Each colony belongs to an empire. To form the initial empires, colonies are attracted to the empire, based on empire’s power. In other words, the initial number of colonies is directly proportional to the power of the empire. The normalized cost of an imperialist to divide the colonies among the appropriate imperialists is as follows [26]:
$$\begin{aligned} C_n =c_n -\max _i \left( {c_i } \right) \end{aligned}$$
(7)
where \(c_n \) is the cost of nth imperialist and \(C_n \) is the normalized cost. Regarding the normalized cost of all imperialists, the normal power of each imperialist is defined as follows:
$$\begin{aligned} P_n =\left| {\frac{C_n }{\sum _{i=1}^{N_{\text {imp}} } {C_i } }} \right| . \end{aligned}$$
(8)
First, the initial colonies are divided based on the power of the empire. Afterward, the initial number of colonies of the nth empire is calculated as follows:
$$\begin{aligned} NC_n =\text {round}\left\{ {P_n \cdot N_{\text {col}} } \right\} \end{aligned}$$
(9)
where \(NC_n \) is the initial number of colonies of the nth imperialist and \(N_{\text {col}} \) is the number of all colonies. To divide the colonies between imperialists, a quantity equal to \(NC_n \) from the colonies was randomly selected and allocated to imperialists. The power of an empire equals to the power of the colonial country plus a percentage of total power of its colonies. In this way, the total cost of an empire is as:
$$\begin{aligned} T.C._n= & {} \text {Cost}( {\text {imperialist}_n } )\nonumber \\&+\;\xi \text {mean}\left\{ {\text {Cost }( {\text {colonies}\;\text {of}\;\text {empire}_n } )} \right\} . \end{aligned}$$
(10)
where \(T.C._n \) is the total cost of the nth empire and \(\xi \) is a positive number, normally considered between 0 and 1 and close to 0. A smaller value of \(\xi \) leads to an approximate balance between the total cost of an empire and the cost of its central government, while, if \(\xi \) increases, it will increase the influence of the quantity of the colonies’ cost of an empire on the determination of its total cost. Typically, \(\xi =0.05\) leads to desirable responses in most of the applications [26].

Each empire which is unable to improve its power and loses its competition power will be eliminated through imperialist completions; this elimination is performed gradually. In other words, as time goes on, weak empires lose their colonies and more powerful empires occupy these colonies and enhance their power. In order to model this, it is assumed that the eliminating empire is the weakest existing one. Hence, in the algorithm’s repetition process, one or several weakest colonies are selected, and then—to seize these colonies—a competition is generated within all empires. The mentioned colonies are not necessarily seized by most powerful empire; although the more powerful empires are a higher chance to seize. Figure 8 illustrates the schematic of this section of the algorithm [26].

In this figure, empire 1 is considered as the weakest empire and one of its colonies is exposed to imperialistic competition, and empires 2 to N compete to seize it. To model the competition among empires to seize the colony, primarily, the probability of each empire to seize—which is proportional to the power of that empire—is calculated, regarding the total cost of the empire. First, through the total cost of the empire, its total normalized cost is determined:
$$\begin{aligned} N.T.C._n =\mathop {\max }\limits _i \{ {T.C._i }. \}-T.C._n. \end{aligned}$$
(11)
In this equation, \(T.C._n \) is the total cost of the nth empire and \(N.T.C._n \) is the total normalized cost of the empire. Each empire which has less \(T.C._n \), the higher its \(N.T.C._n \) will be. As a matter of fact, \(T.C._n \) is equal to the total cost of an empire, while NTCn is its total power. The empire with the least cost is the one with the highest power. Having the total normalized cost, the probability of the ownership, by each empire, of the colony—exposed to the competition—is calculated as:
$$\begin{aligned} P_{p_n } =\left| {\frac{N.T.C._n }{\sum _{i=1}^{N_{\text {imp}} } {N.T.C._i } }} \right| . \end{aligned}$$
(12)
Regarding the probability of each empire to seize, it is required to have a mechanism like roulette wheel in genetic algorithm to assign the colony—exposed to the competition—to one empires, based on their appropriate probability. Having probability of each empire to seize, to divide the colony to empires randomly, but with the dependent probability to the seizing probability of each empire, P vector is formed based on the above probability values as follows [26]:
$$\begin{aligned} P=\left[ {P_{p_1 } , P_{p_2 } , P_{p_3 } , \ldots P_{p_{N_{\text {imp}} } } } \right] \end{aligned}$$
(13)
where P vector is of \(1\times N_{\text {imp}} \) size and consists of probability values of empires’ seizing. Consequently, random vector R is generated with the same size of P vector. Elements of this vector are random numbers with uniform distribution in [0, 1] interval:
$$\begin{aligned} R=\left[ {r_1 , r_2 , r_3 , \ldots ,r_{N_{\text {imp}} } } \right] \quad r_1 , r_2 , r_3 , \ldots ,r_{N_{\text {imp}} } \;{\sim }\,\, \mathrm{U}\left( {\mathrm{0,1}} \right) \end{aligned}$$
(14)
Therefore, D vector is generated as:
$$\begin{aligned} \begin{array}{l} D=P-R=\left[ {D_1 , D_2 , D_3 , \ldots ,D_{N_{\text {imp}} } } \right] \\ \;\;\; =\left[ {p_{p_1 } -r_1 , p_{p_2 } -r_2 , p_{p_3 } -r_3 ,\ldots ,p_{p_{N_{\text {imp}} } } -r_{N_{\text {imp}} } } \right] . \\ \end{array} \end{aligned}$$
(15)
Possessing vector D, the colonies are given to an empire that its index is greater in D vector, as compared to other empires; the empire which has the highest probability to seize, whose index—with higher probability—in vector D, will have the greatest quantity [26].

Particle swarm optimization

Particle swarm optimization algorithm is a social search algorithm which is modeled based on social behavior of bird species. First, this algorithm was employed to identify patterns of simultaneous flight of birds and rapid path change and optimum re-formation of the group. In this algorithm, each solution of the problem is modeled as a particle that has a value and fitness value. In this algorithm, vector method is used to search for the optimum solution. This algorithm has been used in many study fields, such as problem optimization, economic problems, neural network training, and optimization of production systems [29]. PSO is designed to search for the best global solution, using a swarm of particles, and is updated through each stage [30]. Each particle represents a potential solution in search space, which sets its location and velocity regarding the its own best experience or local best and the so-far-obtained best experience or local best of all particles (global best) [31]. Consider an N dimensional problem with i particles and t generations. \(X_{i,N} (t)\) and \(V_{i,N} ( t )\) are location and velocity of the ith particle, respectively. The velocity of the ith particle for the \(( {t+1} )\) generation is calculated through the following equation [32, 33]:
$$\begin{aligned} V_{i,N} ( {t+1} )= & {} \omega ( {t+1} )V_{i,N} ( t )+c_1 r_1 ( {X_{\text {pbest},N} -X_{i,d} ( t )} )\nonumber \\&+\,c_2 r_2 ( {X_{\text {gbest},N} -X_{i,d} ( t )} ) \end{aligned}$$
(16)
$$\begin{aligned} X_{i,N} ( {t+1})=X_{i,N} ( t )+V_{i,N} ( {t+1} ) \end{aligned}$$
(17)
where \(1\le i\le m\), \(1\le t\le k\); and \(c_1 \) and \(c_2 \) have the values \(c_1 =c_2 =2\). \(r_1 \) and \(r_2 \) are the two independent random numbers that follow the uniform functions in [\(-1,\) 1] interval. \(\omega \) is the inertia coefficient that controls the effect of the previous speed on the current speed:
$$\begin{aligned} \omega ( t )=\omega ( 1 )-( {\omega ( 1 )-\omega ( k )} )t/k \end{aligned}$$
(18)
where \(\omega ( 1 )\) is the initial inertia weight and \(\omega ( k )\) is equal to the inertia weight which evolved swarm in the last generation of k and t is equal to maximum repetition and initial repetition of the maximum generation, respectively. PSO, as a type of simple and effective random searching algorithm, may lead to better results than the gradient descent method, penalty function method, and genetic algorithm, when solving some non-linear optimization problems [34, 35].

Proposed method

Within the last two decades, optimization has spread its application in different fields, such as industrial engineering, electrical engineering, computer engineering, telecommunication, and transportation. Nowadays, the application of intelligent methods such as artificial neural networks and meta-heuristic algorithms has an important position in different study fields. In this survey, Genetic, Imperialist Competitive, and Particle Swarm (GA, ICA, and PSO) algorithms have been used as optimizers of the neural network parameters. Since the performance of these algorithms is based on population, the combination of each algorithm with the Elman neural network and feed-forward network is almost similar. However, based on their nature, they show different performance. The purpose of combining these algorithms with the neural network is to optimize weights and biases of the neural network. As stated in the previous sections, these algorithms are of a population based nature, and the problem has to be defined, so that the algorithms can optimize it as a swarm of the population. In problem-defining stage, chromosome (GA), particle (PSO), or country (ICA) should be in a way to optimize the weights and biases, which mean the number of each chromosome’s gens, and should be equal to the number of weights and biases. Training the neural network, the number of weights and biases depends on the number of layers and neurons. To make a particle, the following procedure is performed:

$$\begin{aligned} N=\underbrace{I \cdot n_{l_i } +\sum _{i=1}^{l_i -1} {( {n_{l_i } \cdot n_{l_{i+1} } } )} }_{\hbox {Weight }\,= \,{ W}}+\underbrace{\sum _{i=1}^{l_i } {n_{l_i } } }_{\hbox {Bias }\,=\,{ b}} \end{aligned}$$
(19)
where N is the number of neural network parameters or in other words, the dimension of the optimization problem. \(n_{l_i } \) and I are the number of neurons in the ith layer and input of the network, respectively. When the solution chromosome is specified, a cost function should be defined for the optimization problem to calculate the optimal value of parameters (W, b). The employed cost function is represented in the following section.

Cost function

MMSE cost function is used in this section to optimize parameters of the neural network. Here, each solution of the problem (chromosome, particle, or country) has a cost; the cost function calculates these costs for solutions. Furthermore, based on the nature of the cost function, it should be minimized using optimization algorithms: to find a solution with the least possible cost. In this investigation, three optimization algorithms of PSO, ICA, and GA methods are used to minimize the cost function. MMSE is the Minimum Mean Squares Error of (MSE). MSE and MMSE are:
$$\begin{aligned} \text {MSE}=\frac{1}{m}\sum _{i=1}^m {( {X_{\text {ACTi}} -X_{\text {FORi}} } )} ^{2} \end{aligned}$$
(20)
$$\begin{aligned} \text {MMSE}=\left( {\frac{1}{m}\sum _{i=1}^m {\left( {X_{\text {ACTi}} -X_{\text {FORi}} } \right) } ^{2}} \right) \end{aligned}$$
(21)
where, \(X_{\text {FOR}} \) is the predicted value, \(X_{\text {ACT}} \) is the actual value, and n is the number of data.

Training and testing of the neural network

After defining the neural network parameters as a problem and optimizing the problem with aforementioned algorithms, the neural network should be trained and tested. In this stage, the optimal value of the parameters should be provided to the neural network and the network employs a part of input data (75% in this study) for training. Optimized parameters by the optimization algorithms such as PSO, ICA, and GA as well as transfer functions (here, tangent sigmoid transfer function is used), relation, and behavior of the data are obtained, and then, the trained network is evaluated by different criteria. In the current study, to evaluate the performance of the neural network training, two criteria of Mean Squared Errors (MSE) and Root Mean Square of Errors (RMSE) are used.
$$\begin{aligned} \text {RMSE}=\sqrt{\frac{1}{m}\sum _{i=1}^m {\left( {X_{\text {ACTi}} -X_{\text {FORi}} } \right) } ^{2}}. \end{aligned}$$
(22)
If the training is performed well, the neural network testing phase would be initiated. The network is tested by data which had been preserved for this phase. Afterwards, the network is evaluated by different error criteria. To evaluate testing phase, two error criteria of Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) are used:
$$\begin{aligned} \text {MAE}=\frac{1}{m}\sum _{i=1}^m {\left| {X_{\text {ACTi}} -X_{\text {FORi}} } \right| } \end{aligned}$$
(23)
$$\begin{aligned} \text {MAPE}=\frac{100}{m}\sum _{i=1}^m {\left| {\frac{X_{\text {ACTi}} -X_{\text {FORi}} }{X_{\text {ACTi}} }} \right| } \end{aligned}$$
(24)
where \(X_{\text {ACT}} \) and \(X_{\text {FOR}} \) are actual value and predicted value, respectively. Figure 9 illustrates flowchart of combination of the proposed method.
Fig. 9

Flowchart of the proposed method (El-PSO)

Input data

In the current investigation, monthly data in a period of 15 years (1996–2010) were used as inputs of the neural network. These data are for Islamic Republic of Iran collected from different organizations, such as Iran’s Pistachio Research Institute (IPRI), customhouse, and the Central Bank of I.R.I. After the determination of input and output variables of the network, the data should be normalized prior to each procedure of neural network training in the bracket of [−1 1]. The purpose of normalizing the data is that all the training data should have a uniform statistic distribution. In this study, linear normalization is used to convert data.
$$\begin{aligned} X_{\text {norm}} =\frac{( {X-\text {Min}( X )} )}{( {\text {Max}( X )-\text {Min}( X )} )}\times 2-1 \end{aligned}$$
(25)
where \(X_{\text {norm}} \) is the normalized value, X is the non-normal value, and \(\text {Min}( X )\) and \(\text {Max}( X )\) is minimum and maximum of the non-normal, respectively.

Numerical results

Case study

Pistachio product is of great importance in either producing countries or other countries and is considered as the green gold. Two major pistachio producing countries are Iran and the US. In this study, the price in Iran of pistachio data is used to predict the price of this produce. In addition to Iran’s domestic market prices data, the data of inflation rate, export value, and production amount, and etc.—which have been mentioned in previous sections—were used as well.

Results of the proposed method and other combined methods

As stated in the previous sections, combined method of El-PSO was used to predict Iran’s domestic market pistachio prices. Table 2 illustrates the best results of this method and other combined methods. Figure 10 shows the actual and the predicted data by the El-PSO method.
Table 2

Comparison of the best performance of the El-PSO method and other combined methods, using four measurement criteria

Methods

MSE

RMSE

MAE

MAPE (%)

FF-PSO

0.003

0.0548

0.0545

0.0545

El-ICA

0.0046

0.059

0.059

11.736

FF-ICA

0.0065

0.059

0.1117

11.78

El-GA

0.0189

0.1374

0.106

9.353

FF-GA

0.0078

0.0883

0.1027

7.0336

Proposed method (El-PSO)

0.0015

0.0387

0.0315

4.215

Fig. 10

The comparison of actual data and predicted data of the El-PSO method

Moreover, 60 data were selected (for 5 years) and the performance was once again evaluated. Table 3 depicts the performance result of this part of data, which was obtained by El-PSO and other methods.
Table 3

Comparison of performance of the proposed method and other methods, using 60 data (equal to 5 years)

Methods

MSE

RMSE

MAE

MAPE (%)

FF-PSO

0.0049

0.07

0.0636

4.042

El-ICA

0.0764

0.2764

0.247

6.4709

FF-ICA

0.697

0.8349

0.217

9.093

El-GA

0.0137

0.117

0.064

4.216

FF-GA

0.0103

0.1015

0.0769

5.0651

Proposed method (El-PSO)

0.0029

0.0539

0.0526

3.169

According to the results of the table, El-PSO method has an excellent performance as compared to other methods, which proves that the efficiency of this method to predict pistachio price is, indeed, a complicated task.

Results analysis, using analysis of variance method

Since several novel combined methods are used to predict pistachio price in this study, one of the best techniques to evaluate these methods’ performance is analysis of variance method (ANOVA). Using this technique, more than a society (method) can be evaluated. In this section, the results of MAE error criterion, which are obtained by all the methods, are compared by ANOVA. Figures 11 and 12 are the output of this method, which show mean and distribution, respectively.
Fig. 11

Mean of the proposed method and other methods, using MAE criterion

Fig. 12

Distribution of the proposed method and other methods, using MAE criterion

Table 4

Statistical characteristic of the proposed method and other combined methods to predict pistachio price

Methods

Mean

Median

SE mean

Standard deviation

FF-PSO

0.0621

0.06

0.0084

0.0206

El-ICA

0.159

0.1653

0.0339

0.0831

FF-ICA

0.161

0.1673

0.0337

0.0825

El-GA

0.079

0.0932

0.0152

0.0372

FF-GA

0.225

0.096

0.129

0.315

Proposed method (El-PSO)

0.0473

0.0499

0.0499

0.0079

Table 4 represents statistical characteristics obtained by ANOVA method. Using these results, it can be concluded that the suggested method of El-PSO has an excellent performance and limited distribution. It should be mentioned that Minitab 16 software is employed to implement ANOVA.

Conclusion

According to complexity of pistachio price in Iran and the world, it is very difficult to propose a model capable to predict pistachio price, based on relevant factors and results with a lower error. Suggesting an efficient method can be helpful to the economy of producing countries, as well. In the current survey, El-PSO method is proposed to predict pistachio price. This method owns an excellent performance and so efficient that can be used in other problems, also.

Copyright information

© The Author(s) 2017

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Authors and Affiliations

  • Azim Heydari
    • 1
  • Farshid Keynia
    • 2
  • Nasser Shahsavari-Pour
    • 3
  • Reza Sedaghat
    • 4
  1. 1.Young Researchers and Elite Club, Kerman Branch, Islamic Azad UniversityKermanIran
  2. 2.Energy Research Institute, Institute of Science and High Technology and Environmental ScienceGraduate University of Advanced TechnologyKermanIran
  3. 3.Department of Industrial ManagementVali-e-Asr UniversityRafsanjanIran
  4. 4.Agricultural EconomicsIran’s Pistachio Research InstituteRafsanjanIran

Personalised recommendations