1 Introduction

Agriculture is one of the most important sectors along with oil, automotive, and the pharmaceutical industries, etc., in the economy of any country, and plays an important role in its political and economic independence (Mousavi et al., 2015). The existence of many natural gifts and special climatic conditions have turned Asia countries into a four-season land, which provides the necessary basis for focusing this sector on the country's economy (Mogale et al., 2020). One of the most obvious challenges in the agricultural sector in the country is the lack of awareness of farmers about the balanced cultivation of agricultural products on demand. The disruption of this balance, on the one hand, causes the abundance of a product, a significant reduction in its price in one year and harms farmers. On the other hand, the reduction of other products increases the price and dissatisfaction among the people (Bottani et al., 2019; Keshavarz-Ghorbani & Pasandideh, 2021).

Iran is one of the world’s largest fruit producers and provides the finest conditions for citrus fruit production (Cheraghalipour et al., 2019). Hence, the citrus supply chain network (CSCN) is one of the main parts of the agriculture-food supply chain in Iran. The definition of the CSCN illustrates the activities of production to distribution of agricultural and horticultural products from farms. One of the fundamental factors in the CSCN is to increase the quality and safety of foods and other variables relevant to weather conditions (Mousavi et al., 2015; Sharma et al., 2020). Supply chains are primarily responsible for the production or assembly of products and their distribution. In the production and distribution stages, there is always a need to check the inventory of products. The reason for this is to satisfy customer demand and prevent shortages of products. Previous studies have often focused on one aspect of production, distribution, and inventory, but integrating these three issues brings the issue closer to the real world. The integrated production and distribution problem is very complicated due to its many variables and constraints. Therefore, many techniques were applied to find the best solution for this problem within a reasonable time.

Today's world is facing a series of environmental challenges, many of which have led to environmental crises, inflicting irreparable damage that has often been impossible to prevent from spreading and, even if possible, there are many costs involved. The CO2 emissions and greenhouse gases pose a major threat to the environment. An agri-food supply chain is a network of food-related business enterprises through which food products move from production through consumption (Kamble et al., 2020; Soto-Silva et al., 2016) and in the process contribute to CO2 emissions.

Agriculture crops refer to the part of agricultural products in which the time from planting to flowering (end of life of the crop) is less than one year. In fact, their life cycle lasts less than a year, while agricultural products have a long life cycle and, after flowering, the plant does not end its life and can be harvested in successive periods without replanting. Another issue that needs to be addressed is irrigated and rainfed cultivation (Sellitto et al., 2018). In this regard, irrigated cultivation is a type of agricultural production in which farmers provide the water needed by the plant promptly, while, in rainfed cultivation, farmers are hoping to get the required water from rainfall during the growing, cultivating, and harvesting stages.

To tackle the current challenges in the citrus fruit supply chain, the Ministry of Agriculture Jihad (MAJ) in Iran is moving toward the citrus fruit supply chain network in terms of holding, transportation, and storage. In this network, citrus fruit is transported in the box using the truck as well as specially designed Nissans and stored in specially designed citrus fruit storage. Proper planning and coordination among all the entities of the citrus fruit supply chain network are essential to reduce transportation, inventory holding, and waste costs (Sgarbossa & Russo, 2017). Likewise, the determination of each type of capacitated vehicle used for transporting between various entities is also a crucial aspect of the citrus fruit supply chain problem (Rong et al., 2011). Sufficient availability of capacitated vehicles helps in the quick transfer of citrus fruit from producer to consumer cities (Roghanian & Cheraghalipour, 2019). In the existing literature, there is currently no comprehensive planning to produce citrus fruit (see for example Rong et al., 2011; Sgarbossa & Russo, 2017; Roghanian & Cheraghalipour, 2019; Mogale et al., 2020).

In this paper, therefore, a new bi-objective, multi-echelon, and multi-period agriculture food supply chain network (AFSCN) model for a production–distribution-inventory problem considering CO2 emissions and time windows is developed. The main reason for using time windows is to prevent corruption of products. Therefore, in the proposed supply chain, the production process is performed in farms and inventory control is performed at fridges and farms levels. Distribution operations are also performed from farm to fridges and from fridges to citrus markets.

In this regard, a Mixed-Integer Non-Linear Programming (MINLP) model is formulated for the current problem. The main aim of this study is to minimise the total costs (transportation, inventory holding, warehouse, waste, and operational costs) and the released CO2 emissions through transportation systems. In the proposed model, the optimal amount of the production of the agricultural foods (citrus fruits) in the farms according to the location of the fridges for citrus holding and the allocation of citrus fruits to them according to the shelf life of the citrus fruits are considered. In the AFSCN, the demand areas are citrus fruit markets (in urban areas), the supply areas are large and small farms can be considered as centralised areas in each urban area. Another novelty in this paper, based on the production planning problem, is a new method for monitoring the perishability of citrus fruit and combining locations. In addition, a performance coefficient for saving agricultural land and water consumption is considered for the first time. To solve the AFSCN model and find Pareto solutions, simulated data as test problems including small-, medium-, and large-sized problems are considered. Therefore, the CPLEX method using GAMS software is employed to solve the presented model in the small-sized problem. Then, meta-heuristic algorithms containing multi-objective ant colony optimisation (MOACO), Fast PGA, non-dominated sorting genetic algorithm II (NSGA-II), and multi-objective simulated annealing (MOSA) algorithms are used to solve the mathematical model in different size problems. Additionally, a novel hybrid heuristic method based on multi-objective ant colony optimisation (ACO) and multi-objective simulated annealing (SA) algorithms, called the HACO-SA algorithm, is developed in this paper. In summary, the present paper has the following novelties:

In summary, the present paper has the following key contributions:

  • Designing a bi-objective mathematical model for production-allocation-inventory problem in the AFSCN for citrus fruit and analysing it,

  • Formulation of an MINLP model considering time windows for servicing to citrus fruit markets in the AFSCN problem and analysing it,

  • Considering CO2 emissions as environmental effects through used vehicles in the proposed network and analysing it,

  • Employing the performance coefficient of the citrus fruit and needed water for the production of the citrus fruit in the suggested model,

  • Considering the capability to re-cultivate on the agricultural land after citrus fruit harvest,

  • Developing a new heuristic method based on a meta-heuristic algorithm (HACO-SA algorithm) to solve the mathematical model and comparing it with other methods.

The rest of this work is as follows: Sect. 2 presents the literature review. The problem description and mathematical model along with notations, objective function, and constraints are demonstrated in Sect. 3. The solution methodologies to solve the AFSCN model are stated in Sect. 4. The implementation of the presented model along with the numerical results and sensitivity analysis of the models are proposed in Sect. 5. Finally, Sect. 6 presents the conclusions and future work.

2 Literature review

A review of existing studies shows that literature related to the agri-food supply chain networks is scant. This section, therefore, reviews the previous studies in the fields of (I) the agri-food supply chain network along with meta-heuristic algorithms and (II) the agri-food supply chain management.

2.1 Meta-heuristic algorithms in agri-food supply chain network

Osvald and Stirn (2008) presented an algorithm for the distribution of fresh vegetables. They provided a vehicle routing problem with time windows and time-dependent travel times where the travel times between two locations depended on both the distance and the time of the day. Validi et al. (2014a) designed an integrated low-carbon/green distribution system. The DoE-guided MOPSO approach was used to solve their proposed model. A scenario analysis of the realistic vehicle routes is provided by considering alternative possible outcomes for further guidance to DMs. In another work, Validi et al. (2014b) presented a sustainable multi-objective model for food supply chain management. They evaluated distribution and routing problems in a two layers green supply chain network. Their main goal was to minimise supply chain costs and carbon dioxide emissions. They investigated a real case study in the Irish dairy. Moreover, Validi et al. (2015) developed a sustainable distribution model to minimise costs and carbon dioxide emissions in the Irish dairy processing industry supply chain. In their study, experts prioritised realistic routes with alternate transportation scenarios. Masson et al. (2016) developed a two-stage approach according to an adaptive large neighbourhood search to solve the dairy supply chain network model. The first phase solved the transportation problem and the second phase ensured that the optimisation of plant assignment was performed. Mogale et al. (2017a) investigated the multi-period multi-modal bulk wheat transportation and storage problem in a two-stage supply chain network of public distribution system while seeking to minimise the total costs. They developed an MINLP after studying the Indian wheat supply chain scenario. In addition, they suggested and used two meta-heuristic algorithms. Mogale et al. (2017b) designed a multi-period inventory transportation model for tactical planning of the food grain supply chain. Then, they formulated an MINLP model, while their main aim was the minimisation of the cost of the food grain supply chain. Cheraghalipour et al. (2019) considered the rice supply chain and also presented a bi-level optimisation model for the rice supply chain. They designed an MILP mathematical model, while investigating minimising total cost concerning the two decision-makers' opinions. Thus, two meta-heuristic algorithms were presented and two hybrid algorithms developed. Finally, they studied a real case study in Iran. Sahebjamnia et al. (2020) developed a new multi-objective integer non-linear programming model for designing citrus fruit three-echelon supply chain network. They considered two objective functions including minimising network costs and maximising the network’s profits. To solve their model, firstly, the model was converted to a linear programming model. Then, they used three multi-objective meta-heuristic algorithms for finding efficient solutions. The outcomes of their proposed algorithms were compared by several assessment criteria including a number of Pareto solutions, maximum spread, mean ideal distance, and diversification metric. Dwivedi et al. (2020) proposed an MINLP model for a two-echelon food grain supply chain along with sustainability aspects (carbon emissions). Their main aim was minimising the total transportation cost and carbon emission tax in gathering food grains from farmers to the hubs and later to the selected demand points (warehouses). They used two meta-heuristic algorithms to solve their model and also adopted simulated data to validate their model. Validi et al. (2020) proposed a robust mathematical model for routing and location in the three-echelon supply chain network. Their proposed chain included factories, distribution centres, and retailers. The DoE-guided Multiple-Objective Genetic Algorithm of type II (MOGA-II) was used to solve their proposed model. They considered a real case study in the east of Ireland. Liao et al. (2020) developed a new MILP model for a citrus fruit closed-loop supply chain along with minimising total costs and air pollution. They used meta-heuristic algorithms to solve their model. Assessment criteria were used to compare their proposed algorithms. To validate their model, they conducted a real case study. In this regard, a four-echelon citrus fruit supply chain including gardeners, distribution centres, citrus fruit storage, and fruit market was developed by Fakhrzad and Goodarzian, (2021). They formulated an MINLP model, which showed minimising the total cost and maximising the profit of the citrus fruit supply chain network. To solve their model, ant colony optimisation and simulated annealing algorithms were used. Gómez-Lagos et al. (2021) proposed an MILP model for a fresh fruit supply chain network based on supporting tactical decisions during the harvest season in order to reduce total costs. They used a greedy randomised adaptive search procedure meta-heuristic algorithm to solve their model. Finally, they provided a real case study in the fruit market field. Validi et al. (2021) presented a bi-objective mathematical model for location, routing, and distribution centres in a three-echelon sustainable supply chain. The main purpose of their proposed model was to minimise supply chain costs while minimising carbon dioxide emissions. DoE-guided meta-heuristic-based model was used to solve their proposed model. Due to the two objectives of their proposed model, they used the TOPSIS approach to prioritise Pareto points and meta-heuristic algorithms to solve their model.

2.2 Agri-food supply chain network

Rong et al. (2011) developed an optimisation method to manage fresh food quality in the food supply chain network. Additionally, they integrated food quality in decision-making on production and distribution in a food supply chain. Also, they designed a mixed-integer linear programming (MILP) model for production and distribution planning problems. Their resulting model is used in a real case study and is able to use to design and operate food distribution systems, utilising both food quality and cost criteria. Etemadnia et al. (2015) designed a fruit and vegetable supply chain network based on an optimal national wholesale or hub location network for serving food consumption markets through efficient connections with production sites. Therefore, they formulated an MILP model that decreases the total costs including transportation costs of products and the location of the facilities. To solve their model, heuristic and exact methods are presented. A scenario study is utilised to investigate their model's sensitivity to parameter changes. An application is made to the U.S. fruit and vegetable industry. Bortolini et al. (2016) suggested a three-objective distribution problem for fresh food network along with minimising carbon footprint, operating cost, and delivery time. They formulated a linear programming (LP) model considering the farmer production capacities, the food quality dependence on the delivery time and the geographically distributed market demand. To validate their model, they applied a real case study dealing with the distribution of fresh fruits and vegetables in Italy. Lamsal et al. (2016) proposed an integrated linear programming (ILP) model for planning the movement of the crop from the farm to the processing plant. They considered economically significant agricultural systems in the United States including sugarcane in Louisiana and sugar beets in the northern areas of the United States. Eventually, they demonstrated that their model is computationally tractable by introducing new datasets according to the sugarcane industry in Louisiana. Hyland et al. (2016) introduced conceptual and mathematical models of the domestic grain supply chain including elevator storage, rail transportation, and trucking. They compared conventional rail service supported by country elevators with shuttle service supported by terminal elevators with critical transportation service dimensions divided into three categories incorporating capacity, travel time, and cost. The time and cost model outcomes showed that shuttle service carried grain faster and diminished supply chain costs more than conventional service, even after considering trucking and elevator storage. Their rail capacity model outcomes demonstrated that shifting grain from conventional to shuttle service considerably raises rail capacity. Accorsi et al. (2016) considered the food supply chain as an ecosystem and defined more inclusive boundaries. They presented a design framework that supports strategic decision-making on agriculture and food distribution issues while addressing climate stability. They merged an original land-network problem with localised and large-sized decisions as land-use allocation and location-allocation problems in an agro-food network. A linear programming model that optimises infrastructure, agriculture, and logistics costs was formulated as well as carbon emissions within the agro-food ecosystem were balanced. They suggested a regional potato supply chain for the effectiveness of their model. Jabarzadeh et al. (2020) presented a closed-loop supply chain network optimisation problem for a perishable agricultural product to attain three pillars of sustainability. They formulated a multi-objective MILP model and used the LP-Metric and weighted Tchebycheff approaches to solve their model. A set of experiment problems was provided to validate their model. Motevalli-Taher et al. (2020) optimised a sustainable wheat supply chain network and formulated a multi-objective mathematical model in order to minimise network costs and water consumption and maximise job opportunities. They used the meta-goal programming approach for transforming the multi-objective to the single-objective problem. Thus, they employed a simulation approach to cope with the uncertainty of the demand for wheat flour. Finally, a real case study was provided to assess their proposed model. Mehrbanfar et al. (2020) presented an efficient network of greenhouses, agricultural lands, processing plants, and agricultural distribution centres (farmer’s markets) according to sustainability concepts. Also, they used an augmented ε-constraint approach to solve the model and provided a real case study in Iran to investigate their model. Varas et al. (2020) proposed a new multi-objective MILP model to support wine grape harvesting. They considered the opposing nature of operational cost minimisation and grape quality maximisation, subject to several constraints, such as grape requirements and routing decisions. They developed a negotiation protocol that can lead to an agreed final harvest schedule according to the operations of a winery they worked with. Based on the augmented weighted Tchebycheff method, the protocol included an initial obtained Pareto optimal solution. To solve their model, they used the augmented ∊ -constraint approach. Yakavenka et al. (2020) extended a multi-objective model for sustainable supply chains. They formulated an MILP model for this network of perishable food products while minimising the total costs, social-time, and emissions. Finally, their model was implemented in the case of a fruit importer in the North-Eastern European region considering its geographical settings.

Table 1 shows the summary of the examined papers in the literature review.

Table 1 The summary of the examined papers (S  sustainable, SCN  supply chain network, CLCN  closed-loop supply chain)

According to the examined papers, the main gaps are explained as follows:

  • In terms of mathematical modelling, they didn't pay attention to a multi-period and bi-objective model for designing an AFSCN considering CO2 emission and time windows that are designed in this paper.

  • Other differences between our paper and studied papers are, first of all, the considered citrus fruit is orange, which is a perishable product. For this reason, the waste cost is considered in this network. Secondly, some studies have examined a single-objective mathematical model (minimising total costs), but a bi-objective model including minimising total cost and minimising CO2 emission is formulated and designed simultaneously in this paper. Thirdly, the performance coefficient and needed water for the production of the citrus fruit (orange) in the AFSCN model are considered. Additionally, the capability to re-cultivate in the agricultural land after the orange harvest is considered in this paper which the examined papers did not.

  • Unlike previous models, which are defined as separate variables and constraints for staying in the fridge/cold storage and imposing perishability constraints, this variable considers simultaneously both the sent amount to the fridge/cold store and the duration of staying in the fridge/cold storage. In this paper, planning for the citrus fruit (orange) is performed with irrigated cultivation. Due to limited water resources in some regions, water resource constraints are considered that were ignored in the examined papers in the literature review.

  • In terms of solution methodology, a significant gap is specified. Since the AFSCN model is an NP-hard problem, exact solvers cannot solve large-scale problems in a reasonable time. Several studied papers have used metaheuristic algorithms, including NSGA-II, MOGA, GA, MOPSO, CRO, TS, and SADE algorithms, to solve their model.

3 Problem description

In this paper, the proposed AFSCN problem considers three echelons including (I) citrus fruit (orange) farms that can be planted in different periods, (II) the fridges to hold citrus fruit, which must be selected among a number of candidate points, and (III) citrus fruit markets (demand points) where the location of supply the products to the final customer and then their location and demand for each period is determined in advance. In this regard, the orange harvest season is from late November to mid-February, which indicates the multi-period nature of the proposed model. The types of flow in the presented network are considered as follows (see Fig. 1):

  • The flow of product transfer from farms to fridges and citrus fruit markets,

  • The flow of product transfer from the fridge to the citrus fruit markets, including products that are stored in the fridge for at least a period, or products that are not stored, and only due to the lower cost of transportation through the fridge to the fruit market is transported.

Fig. 1
figure 1

The structure of the proposed network in this paper

The main aims of the proposed model include minimising the total costs (production, transportation, fridge rental, and inventory holding) and the released carbon dioxide emissions through vehicles as environmental effects simultaneously, which are considered in this paper. The Ministry of Agriculture Jihad (MAJ), Iran, is considered as operators (managers) at all stages of the chain. Therefore, the production of the citrus fruit (orange) is the responsibility of the farmers, but it is assumed that the MAJ pays the farmers the cost of production and the profit of the farmers in the form of the guaranteed purchase at harvest time. Also, this is considered a production cost for the MAJ. In the presented model, the planning time evaluates for the cultivation and transportation of citrus fruit and considers other agricultural technical requirements. In this paper, some of them are mentioned as follows:

  • Perishable concept: according to this concept, the shelf life of the citrus fruit (orange) from the harvest time to the time of reaching the demand points should be less than the perishable time of the product.

  • Processing time: the time from cultivating to harvest is called the processing time of the product; therefore, post-cultivating, the products must pass the necessary processing time until harvest.

  • Possibility of re-use of the citrus fruit garden: one of the characteristics of citrus fruit cultivating is the possibility of re-use of the citrus fruit garden, which based on post-harvest the citrus fruit from the citrus fruit garden can be again cultivated on the citrus fruit garden. All the papers related to the supply chain of perishable products, because of planning for a period of cultivation, have ignored this feature.

  • Cultivation capability: cultivation of products in each region is allowed in certain periods; for example, for the citrus fruit (orange) product in Mazandaran province, Iran, there is the ability to cultivate during the autumn and winter seasons and there is no ability to cultivate in other seasons.

  • Water resources: in this paper, planning for the citrus fruit (orange) is performed with irrigated cultivation. Due to limited water resources in some regions, water resource constraints are considered in the AFSCN model as a novelty.

  • Fridge/cold storage: in the proposed model, there are a specific number of fridges, in which a number of them are to be selected. The capacity of each fridge is limited. Each fridge has a fixed cost called the fridge rental during the planning horizon.

  • Accessible budget: the accessible budget for fridge rental is limited.

  • Minimum income of farmers: in order to motivate farmers to follow the provided planning by the AFSCN model, earning the minimum income for farmers is considered.

  • The minimum acceptable amount of cultivating: according to this limitation, cultivating the citrus fruit in each period will be unauthorised if less than a low limit.

The reasons for the f agricultural technical requirements in this network are divided into several categories including one aspect of the impact of fridges (cold storage) and its direct impact on reducing citrus fruit perishes/waste. Also, the existence of cold storage to store citrus fruit and processed products in one area can affect various economic and environmental conditions. Another influential and important parameter in the presented model is the available budget for fridge/cold storage rental, which controls the number of fridges/cold storages that can be rented. As water is the most essential agricultural requirement, it’s important to consider this in the cultivation planning of citrus fruit.

The details of the assumptions of the proposed mathematical model are explained as follows:

  • The cultivation of the agricultural products (citrus fruit/orange) is considered as an irrigated cultivation.

  • For each amount of citrus fruit (orange) cultivated, after the processing period, it is harvested to the same required amount.

  • A time window is considered to respond to citrus fruit market demands.

  • After harvesting and before transferring the products to the citrus fruit market or fridge/cold storage, the decayed orange is removed.

  • The planning horizon is included a number of specified periods.

  • The products become completely corrupt and unusable after the end of the shelf life.

  • If there is a cultivation or harvesting operation in a period, this operation is performed at the beginning of the period.

  • The required time to transport the products is assumed to be negligible.

  • The transportation cost of the oranges depends on their weight.

  • The holding cost of the oranges depends on the weight and duration of holding.

  • The transport fleet is considered homogeneous. The problem dealt with a homogeneous fleet and assumes that each vehicle has the same characteristics, such as capacities and cost.,

  • In the proposed supply chain in the process of production and holding of inventory, CO2 emission is not considered as it only occurs in the distribution process,. The reason for this is that the amount of carbon released during the production and holding phase is very small compared to the distribution process and can be ignored.

3.1 Notations

3.2 Indices

l :

The indices of citrus fruit markets (demand points) l: 1,…, L

i :

The indices of agriculture farms i: 1,…, I

j :

The indices of the fridge (cold storage) candidate points j: 1,…, J

m :

The indices of products m: 1,…, M

n :

The shelf life time of the products in the fridge n: 1,…, N

t :

The indices of the time periods (the time horizon T) t: 1,…, T

3.3 Parameters

CP mit :

Production cost (guaranteed purchase price) per unit of product m on the farm i at the period t (10 Rials (R)/ton)

CV mit :

Fixed cost for cultivating (launching cultivating) product m on the farm i at the period t

CH mjt :

Holding cost per unit product m in the fridge j at the period t (10R/ ton)

CA mij :

Transportation cost per unit product m from farm i to fridge j per each distance (10 R/ ton /KM)

CB mjl :

Transportation cost per unit product m from fridge j to citrus fruit markets l per each distance (10 R/ ton /KM)

CC mil :

Transportation cost per unit product m from farm i to citrus fruit markets l per each distance (10 R/ ton /KM)

Ωl:

Violation rate of the vehicle time window for servicing to the citrus fruit market l

dA ij :

Distance between farm i to fridge j (KM)

dB jl :

Distance between fridge j to citrus fruit market l (KM)

dC il :

Distance between farm i to citrus fruit market l (KM)

F jT :

Fridge rental cost j(fixed cost at the planning horizon T) (10R)

K it :

Maximum accessible land on the farm i at the beginning of the planning horizon t(Hectare)

α mit :

Percentage of waste harvested products from farm i at the period t of product m

S j :

Fridge capacity j (ton)

σ :

Vehicle capacity

B:

Maximum accessible budget for fridge rental (10R)

L mit :

The minimum acceptable amount of the cultivating product m on the farm i at the period t

g mi :

The coefficient of conversion of the amount of product m (tons) to the occupied area on the farm i per hectare (hectares/ton) (this parameter is the inverse of the performance coefficient)

q m :

The required time to produce (from cultivation to harvest) the product m (Month)

w m :

The amount of required water to produce the product m per hectare (cubic meters/hectare)

MinP i :

Minimum acceptable income for farm farmers i (10Rials)

ζ:

The CO2 emissions coefficient of vehicle fuel

η:

Required fuel for the vehicle at each kilometer of transporting

\([\overline{\omega}_{l},\,{\mu}_{l}]\) :

The time window for servicing to citrus fruit markets l

Δl:

Duration of service to the citrus fruit market l

\(\tau_{l}\) :

Arrive time of the vehicle to the citrus fruit market l

ϕ l :

The amount of waiting for the vehicle for serving to the citrus fruit market l

I mjt :

The amount of product m in the fridge j at the end of the period t (tons)

TW i :

The total amount of accessible water resources for cultivating the desired products on the farm i in the whole planning horizon (cubic meters)

M :

A big number

λ mit :

1if there is a capability of cultivating product m on the farm i at the period t; Otherwise 0;

3.4 Decision variables

X mit :

The amount of total production (harvest) product m on the farm i at the period t (tons)

Xmit:

The amount of total cultivating product m on the farm i at the period t (tons)

xa mijt :

The amount of product m transported from farm i to fridge j at the period t (tons)

xc milt :

The amount of product m transported from farm i to citrus fruit market l at the period t (tons)

xbd mjlt :

The amount of product m that entered to fridge j at the period t and after n period stays in the fridge, at the period t + n from fridge to citrus fruit markets l is transported (tons)

xb mjlt :

The amount of product m that without delay from fridge j to citrus fruit market l at the period t is transported (ton)

R jT :

1 if fridge j rented for the whole of the planning horizon T, Otherwise 0

V mit :

1 if cultivating product m performed on the farm i at the period t, Otherwise 0

3.5 The AFSCN model

$$ \begin{aligned} MinZ_{1} = & \sum\limits_{m} {\sum\limits_{i} {\sum\limits_{t} {(CP_{{mit}} } } } X_{{mit}} + CV_{{mit}} V_{{mit}} ) + \sum\limits_{m} {\sum\limits_{i} {\sum\limits_{j} {CA_{{mij}} } } } dA_{{ij}} \left( {\sum\limits_{t} {xa_{{mijt}} } } \right) \\ + & \sum\limits_{m} {\sum\limits_{j} {\sum\limits_{l} {CB_{{mjl}} } } } dB_{{jl}} \left( {\sum\limits_{t} {\left( {xb_{{mjlt}} + \sum\limits_{n} {xbd_{{mjl\left( {t - n} \right)n}} } } \right)} } \right) + \sum\limits_{m} {\sum\limits_{i} {\sum\limits_{l} {CC_{{mil}} } } } dC_{{il}} \left( {\sum\limits_{t} {xc_{{milt}} } } \right) \\ + & \sum\limits_{j} {\sum\limits_{T} {F_{{jT}} } } R_{{jT}} + \sum\limits_{m} {\sum\limits_{j} {\sum\limits_{t} {I_{{mjt}} } } } CH_{{mjt}} \\ \end{aligned} $$
(1)

The first objective function (1) refers to the minimisation of the total costs including fixed costs for cultivating produces, the transportation cost from farms to fridges, from fridges to citrus fruit markets, and from farms to citrus fruit markets, the cost of fridge rental, production cost, and holding cost of the citrus fruit in the fridge.

$$\begin{aligned} Min\,\,Z_{2} &= \zeta \left( \sum\limits_{m} \sum\limits_{i} \sum\limits_{j} \sum\limits_{t} dA_{ij} \times \frac{{xa_{mijt} }}{\sigma } \times \eta\right.\\ &\left. + \sum\limits_{m} {\sum\limits_{j} {\sum\limits_{l} {\sum\limits_{t} {dB_{jl} \times \frac{{xb_{mjlt} }}{\sigma } \times \eta + \sum\limits_{m} {\sum\limits_{i} {\sum\limits_{l} {\sum\limits_{t} {dC_{il} \times \frac{{xc_{milt} }}{\sigma } \times \eta } } } } } } } } \right)\end{aligned} $$
(2)

The second objective function (2) seeks to minimise the total released CO2 emissions by vehicles which are formulated based on distances between networks, required fuel for the vehicle at each kilometre of transporting, vehicle capacity, and the amount of transported citrus fruit.

$$ \begin{gathered} Constraint K_{it} = K_{{i\left( {t - 1} \right)}} - \mathop \sum \limits_{m} X^{\prime}_{{mi\left( {t - 1} \right)}} g_{mi} + \mathop \sum \limits_{m} X^{\prime}_{mit} g_{mi} \quad \forall i,t \hfill \\ \end{gathered} $$
(3)

Constraint (3) shows the cultivable level on the farm \(i\) to the period \(t\). Therefore, this amount is equal to the cultivable level in the previous period minus the occupied level by cultivation in the previous period plus the released level from the harvested products at the beginning of the same period.

$$ X_{mit} = X^{\prime}_{{mi\left( {t - q_{m} } \right)}} \quad \forall i,m,t $$
(4)

The amount of obtained product (\({X}_{mit}\)) at the period t is equal to the amount of the planted product (\({{X}^{^{\prime}}}_{mi(t-{q}_{m})}\)) at the period \(t-{q}_{m}\) is indicated in constraint (4).

$$ \mathop \sum \limits_{m} X^{\prime}_{mit} g_{mi} \le K_{it} \quad ,t $$
(5)

Constraint (5) indicates the cultivating capacity of the citrus fruit and also guarantees the cultivating rate of different products on each farm in each period should be less than the cultivable surface capacity.

$$ X_{mit} \left( {1 - \alpha_{mit} } \right) = \mathop \sum \limits_{l} xc_{milt} + \mathop \sum \limits_{j} xa_{mijt} \quad \forall i,t,m $$
(6)

Constraint (6) depicts that the produced product will be allocated to fridges or fruit markets after deducting the percentage of waste. As such, the balance of the product on the farm is determined.

$$ I_{mjt} = I_{{mj\left( {t - 1} \right)}} \mathop \sum \limits_{j} xa_{mijt} - \left( {\mathop \sum \limits_{l} \left( {xb_{mjlt} + \mathop \sum \limits_{n} xbd_{{mjl\left( {t - n} \right)n}} } \right)} \right)\quad \forall j,t,m $$
(7)

In addition, the amount of inventory of each fridge in each period and per each product is shown by constraint (7). This amount is equal to the inventory of the previous period plus the total input of the product in that period minus the total output of the product in that period.

$$ \mathop \sum \limits_{i} xa_{mijt} = \mathop \sum \limits_{l} \left( {xb_{mjlt} + \mathop \sum \limits_{n} xbd_{{mjl\left( {t - n} \right)n}} } \right)\quad \forall j,t,m $$
(8)

Constraint (8) proves a balance in fridges that, in each period, the total entered product from all farms into one fridge is equal to the amount transferred to the citrus fruit markets without storage plus the amount of products entering the fridge for holding.

$$ \mathop \sum \limits_{l} \mathop \sum \limits_{m} \left( {CP_{mit} X_{mit} + CV_{mit} V_{mit} } \right) \ge MinP_{i} \quad \forall i $$
(9)

The minimum income of farmers is shown in constraint (9). In order to motivate farmers to follow the presented programme through this model, earning a minimum income for farmers is considered.

$$ \mathop \sum \limits_{m} I_{mjt} \le S_{j} \mathop \sum \limits_{T} R_{jT} \quad \forall j.t $$
(10)

Constraint (10) also states that the amount of storage of all products in the fridge j in each period must be less than the capacity of that fridge if the \({R}_{j}\) variable equal is 1, and that fridge has been rented over the total planning horizon.

$$ \mathop \sum \limits_{T} R_{jT} F_{jT} \le B\quad \forall j $$
(11)

Constraint (11) guarantees that the renting cost of selected fridges should not more than the maximum available budget (B). The maximum total budget is also determined based on the opinion of the government and distributors.

$$ xa_{mijt} \le \mathop \sum \limits_{T} R_{jT} M\quad \forall i,j,t,m $$
(12)

Constraint (12) states that the product can be sent only if the fridge is selected. Based on the statement of MAJ and experts, citrus fruit (oranges) will be transported to the selected fridges/cold storages and, usually, only one fridge/cold storage is selected for a farm.

$$ L_{mit} V_{mit} \lambda_{mit} \le X^{\prime}_{mit} \le V_{mit} \lambda_{mit} M\quad \forall i,t,m $$
(13)

Constraint (13) ensures that the planting of the product per each farm and in each period based on the setting up of the planting was performed and the ability to plant the product. Additionally, if the planting of the product is performed, the amount must be greater than the specified minimum value. The minimum acceptable amount of the cultivating product on the farm is based on the opinion and agreement of the environmental organisation and the farmer.

$$ \varphi_{l} = \max \left( {\left( {\varpi_{l} - \tau_{l} } \right),0} \right)\,\quad \forall l $$
(14)
$$ \Omega_{l} = \max \left( {\left( {\tau_{l} + \Delta_{l} - \mu_{l} } \right),0} \right)\,\,\forall l $$
(15)
$$ \mathop \sum \limits_{t} \mathop \sum \limits_{m} X^{\prime}_{mit} w_{m} g_{mi} \le TW_{i} \quad \forall i $$
(16)

Constraints (14) and (15) determine the rate of violation of the time window for servicing to citrus fruit markets. The logic of these two constraints (i.e., 14 and 15) is based on the upper and the lower limit of the time window. If the arrival time of the vehicle to the citrus fruit market is greater than the range of time window, the amount of violation is greater. Constraint (16) indicates the amount of accessible water per farm.

$$ R_{jT} ,V_{mit} ,\lambda_{mit } \in \left\{ {0, 1} \right\}\quad \forall j,T,m,i,t $$
(17)
$$ X_{mit} ,X^{\prime}_{mit} ,xa_{mijt} ,xc_{milt} ,xbd_{mjltn} ,xb_{mjlt} ,\emptyset_{l} ,{\Omega }_{l} ,I_{mjt} ,K_{it} \quad \forall l $$
(18)

Constraints (17) and (18) display the binary and continuous variables in the AFSCN model, respectively.

4 Solution approach

It has been proven that supply chain models are NP-hard (Goodarzian et al., ). Thus, meta-heuristic algorithms are used to solve these NP-hard problems. In this regard, to solve the proposed model, four well-known meta-heuristic algorithms, including MOACO, MOSA, NSGA-II, and Fast PGA algorithms, along with a hybrid method based on multi-objective SA and multi-objective ACO, are suggested. Then, the multi-objective solution technique (augmented Epsilon-constraint method), encoding scheme, and mentioned algorithms are discussed. It should be noted that, in this research, the augmented Epsilon-constraint approach along with meta-heuristic approaches has been used to solve small- and medium-scale problems. Then, the accuracy of meta-heuristic algorithms is compared with the augmented Epsilon-constraint algorithm. If the accuracy of the model is proved, the proposed model will be solved in large sizes using meta-heuristic approaches. The explanations of the MOSA, MOACO, NSGA-II, and Fast PGA are stated in Appendix AD. Then, there are several reasons to use the presented meta-heuristic algorithms including the proposed algorithms that use elitism and a crowded comparison operator to keep diversity without specifying any additional parameters. These algorithms can cope with continuous search spaces. In addition, distributed computation in the suggested algorithms evades premature convergence. All presented algorithms utilise a population-based evolutionary algorithm. The presented algorithms are especially useful when time and resource constraints permit only a small number of solution evaluations, which cause to boost the algorithm’s efficiency in finding Pareto optimal solutions while decreasing computational effort. The represented algorithms can converge to a true global optimum and also are useful in scientific research and engineering. Additionally, the proposed meta-heuristic algorithms have a very fast rate of convergence and reduced computational time. For this reason, these algorithms are used for comparison together in this paper.

4.1 Augmented Epsilon-constraint method

In multi-objective design, a single solution does not necessarily optimise all-objective functions at a time. For this reason, non-dominated solutions that address Pareto optimal are used for the concept of optimality. The Pareto set refers to the generated feasible solution for a problem. The decision-makers choose a solution from this set according to the dominance principle. There is a necessity to generate these optimal fronts iteratively. This work utilises the augmented-constraint method to generate the optimal fronts (Mavrotas, 2009). The most important observation in this method is that one objective is kept as a constraint while optimising the other objective. This method starts with computing the Payoff Table in lexicographic order. The Augmented Epsilon-constraint approach solves many of the problems of the Epsilon-constraint approach. One of the advantages of this approach is the ease of value selection for the ε vector. Another advantage of this approach over the Epsilon constraint is that it solves the problem with a more logical number of iterations than the traditional epsilon constraint method (Cooper et al., 2017; Mavrotas & Florios, 2013; Yang et al., 2014, 2021; Zhang & Reimann, 2014;). The information of this method is shown in Fig. 2. \({z}_{1}^{*}\) denotes the optimal value for maximising -\({F}_{1}\) and \({m}_{2}\) denote the maximum value of \({F}_{2}\) for constraint-\({F}_{1}(X)\ge {z}_{1}^{*}\). The same procedure is repeated for \({z}_{2}^{*}\) and \({m}_{1}\). Here, \(\left[{n}_{2},{z}_{1}^{*}\right]\) and \(\left[{n}_{1},{z}_{2}^{*}\right]\) are ranges for \({F}_{1}\) and \({F}_{2}\) respectively. Figure 3 represents the process procedure (Pseudocode) of the augmented Epsilon-constraint method.

Fig. 2
figure 2

The pseudo-code of the payoff Table in lexicographic order

Fig. 3
figure 3

The pseudo-code of the augmented Epsilon-constraint method

4.2 HACO-SA algorithm

In this section, to solve the proposed model, a heuristic method based on meta-heuristic algorithms called HACO-SA is developed for solving the bi-objective production–distribution-inventory holding problem. The details of the developed algorithm are proposed as follows:

In the developed algorithm, HACO-SA focuses on an initial temperature of the multi-objective SA operator in the main loop of the multi-objective ACO algorithm. Moreover, the initial temperature causes HACO-SA to have a much higher convergence speed and efficiency than other suggested algorithms. The multi-objective ACO algorithm is improved by using the initial temperature operator of the multi-objective SA algorithm. Furthermore, the proposed algorithm causes an ant that isn't limited to the pheromone trail which is already aggregated in its march process; however, it utilises both the search (exploration) of new routes and the information (exploitation) which has been detected. This increases the system's robustness significantly. On the other hand, the multi-objective ACO algorithm still falls into a local, at least in the trial process. In this regard, this process causes a route that doesn't lead the ant to expand the Pareto solution so that it provides the total system premature convergence. When the local minimum situation appears, we add an initial temperature in the main loop to avoid this premature convergence and run away from a local minimum. Generally, by adding initial temperature in the main loop of the multi-objective ACO algorithm, HACO-SA causes the near-optimal solutions to run away from the trap of a local minimum and to expand in the direction of the optimisation.

Hybrid meta-heuristic algorithms show an important role in improving the search capability of algorithms. In terms of hybridisation, the main goal of the hybridisation of the meta-heuristic algorithm is to decrease any significant disadvantage (Poorzahedy et al., 2007). Generally, the results of the hybridisation can perform several improvements according to the accuracy or computational speed. In a hybrid algorithm, two or more meta-heuristic algorithms are cooperatively and collectively solving a predefined problem (Poorzahedy et al., 2007). In order to find better solutions by searching for the optimal parameter for better performance, the hybrid meta-heuristic algorithm is developed in this paper. Another reason is that the presented hybrid algorithm improves the results of the overall convergence speed and accuracy and acts better than other suggested meta-heuristic algorithms. In the following, the details of the developed algorithm are explained.

  • Step 1 The parameters of the ACO and SA algorithms are adjusted (Appendix E and F) and the pheromone trails are initialised.

  • Step 2 Until the stop condition isn't met the below steps should be implemented:

    • Start to generate the candidate solutions (construct ant solution stage). In this step, a set of ant m generates candidate solutions to the hybridised optimisation problem using elements of a finite set of candidate solution components available to \(F=\left\{{f}_{rt}\right\},r=1,\dots ,n, t=1,\dots ,\left|{D}_{r}\right|\). The generation phase of candidate solutions begins with the generation of a partial candidate \( w^{s} = \phi \) solution. Then, in the next steps of generating candidate solutions, the partial candidate solution generated by \({w}^{s}\) is expanded by adding a component from the sets of feasible neighbours \({N(w}^{s})\subseteq F\).

      The process of generating candidate solutions can be considered as a path in the structural graph \({G}_{F}=(Q, U)\). In other words, the purpose of extending the optimal solution is to determine the possible paths for the ant in the structure graph of the pheromone model. In this way, the Neighborhood areas of the partial candidate solution are searched to determine the best path to the optimal global solution. Permitted paths in graph \({G}_{F}\) are implicitly defined by the solution generation mechanism. The solution generation mechanism defines the set of possible neighbours \({N(w}^{s})\subseteq F\) for each of the partial solutions separately.

      At each stage of the generation of candidate solutions, the method of selecting the components of the possible neighbour set to extend the partial candidate solution is performed quite possible. The rules for selecting a component from a set of possible neighbours differ in different implementations of the ant colony algorithm. However, the most well-known rule concerns the ant system algorithm, which is formulated as Eq. (19).

      $$p\left({f}_{rt}\left|{w}^{s}\right.\right)=\frac{{\delta }_{rt}^{\alpha }.\mu {\left({f}_{rt}\right)}^{\beta }}{\sum_{{f}_{re\in {N(w}^{s})}}{\delta }_{re}^{\alpha }.\mu {\left({f}_{re}\right)}^{\beta }},\forall {f}_{rt}\in {N(w}^{s})$$
      (19)

      In Eq. (19), \({\delta }_{rt}\) shows values ​​of the pheromone to component \({f}_{rt}\). \(\mu (0)\) is a function that assigns a so-called "heuristic" value to each candidate solution \({f}_{rt\in {N(w}^{s})}\) at each stage of candidate solution generation. The heuristic values ​​generated by the function \(\mu (0)\) are called "heuristic information". Also, the parameters \(\alpha \) and \(\beta \) indicate parameters with positive values ​​whose values ​​demonstrate the relative importance (weight) of pheromone information (values ​​of the variables of a candidate solution), and heuristic information determines the generation of the potential value based on Eq. (19).

    • Start local search stage. At this stage, local optimal solutions are used to determine which pheromones need to be updated.

    • Do the pheromone update stage. The goal of the pheromone update stage is to increase the pheromone values related to the good and optimal candidate solutions and to decrease the pheromone values related to the bad solutions. This is done through two main processes:

      1. 1.

        Decreasing the pheromone values related to all candidate solutions through the "Pheromone Evaporation" process

      2. 2.

        Increasing the pheromone values related to the candidate solutions of the "good solution" or \({W}_{upd}\)

    • Therefore, the two above processes are controlled by Eq. (20). Equation (20) is called the rule of the pheromone update.

      $${\delta }_{rt}\leftarrow \left(1-\pi \right){\delta }_{rt}+\pi \sum_{w\in {W}_{upd}\left|{f}_{rt\in w}\right.}Z(w)$$
      (20)

      The first part of this equation controls the pheromone evaporation process (reducing the pheromone content of all candidate solutions). The second part only increases the pheromone values ​​related to the candidate solutions of the good solutions or \({W}_{upd}\). In this regard, \({W}_{upd}\) includes a set of candidate solutions that are highly appropriate; That is, they are closer to the optimal global solution. Parameter \(\pi \in \left(0\right.,\left.1\right]\) shows the "Evaporation Rate" and \(Z:W\to {R}_{0}^{+}\) indicates the "Fitness Function".

      Accordingly, this process leads to an increase in the number of pheromones related to the ants that are in the best available paths to the optimal solution (closer to the optimal solution) and have higher fitness (lower cost or higher profit). As a result, other ants converge in this direction.

  • Step 3 Algorithm MOSA uses a probabilistic function to accept neighbouring solutions that have dominated the current solution of the algorithm to escape the local optimisation. Hence, this function is shown in Eq. (21).

    $$P\left(\nabla E, T\right)={e}^{\frac{-\sum_{i}\nabla E({f}_{i})}{T}}$$
    (21)

    In Eq. (21), \(\nabla E\) shows the difference between the objective function of the current solution and the neighbour solution, and \(T\) indicates a parameter called temperature. Equation (21) demonstrates the probability of accepting solutions that are dominated by the current solution. In other words, the worse solution is directly related to the temperature \(T\) and the sum of the changes in the objective functions. If the temperature of \(T\) is considered too high, the probability of accepting these types of solutions increases, and almost all the solutions are accepted and the algorithm transforms to a probabilistic algorithm. If \(T\) is considered too low, only solutions that do not dominate the current solution will be accepted and become a heuristic algorithm. Therefore, at the beginning of the algorithm, the initial temperature must be chosen so that a ratio of bad solutions is accepted. There are several methods to reduce the temperature and the dynamic geometric method is used in this paper (see Eq. 22).

    $${T}_{i+1}=\alpha {T}_{i}, 0\nless \alpha <1$$
    (22)
  • Step 4 In this study, in addition to the condition of stopping reaching the final temperature, another condition, called the number of times that the objective functions are calculated, is considered for termination. If one of these two conditions is met, the algorithm finishes. Otherwise, do the steps again.

The schematic steps and pseudo-code of the developed algorithm are shown in Appendix E and F.

The presence of the pheromone evaporation in TS and temperature in SA parameters is necessary to prevent the "Rapid and Premature Convergence" of the proposed algorithm. These two parameters provide a kind of "forgetting" mechanism in the optimisation process and cause more emphasis on searching and exploring new areas in the search space of implemented algorithm.

The proposed approach aims to enhance the exploitation of the multi-objective ACO algorithm. In the hybrid method, multi-objective SA is utilised as a component in the multi-objective ACO algorithm. Hence, when showing the optimal local minimum, initial temperature is applied to avoid this premature convergence and running away from a local minimum. Therefore, to find the best solution, the process of implementation of the multi-objective SA is applied in the main loop of the multi-objective ACO algorithm after the stop condition of this algorithm.

5 Computational experiments

In this section, first of all, the experiment instances for the AFSCN model are generated. Four evaluation metrics are proposed to evaluate the quality of Pareto or non-dominated solutions of the algorithms based on two objective functions. Additionally, the Taguchi technique is employed to set the appropriate values for the proposed algorithm’s parameters. As a result, the best trade-off among two objective functions is addressed by a comparison of presented algorithms and the best algorithms are chosen. Also, various sorts of algorithms are compared and hybridised together to find the best strategy for the developed problem. Finally, a set of sensitivity analyses is employed to investigate the validation of the AFSCN model. It should be mentioned that the relevant code was written in MATLAB on a laptop with Corei7, 3.6 GHz in CPU, and 8 GB RAM and utilising Windows 10.

5.1 Numerical example

In this subsection, based on the novelty of the proposed AFSCN model, no available paper or research has used a similar problem in the examined studies. Moreover, the accessible benchmarks in the literature review are not available for the problem and a strategy is required to design the experiment instances. To evaluate the efficiency of the proposed algorithms, the proposed model is solved with several problems in different sizes. Table 2 shows the parameters of the AFSCN model. Each problem instance is characterised by the number of citrus fruit markets (\(l\)), number of farms (\(i\)), number of fridge/cold storage (\(j\)), number of products (\(m\)), the shelf life time of the citrus fruit in the fridge (\(n\)), and number of periods (\(t\)). The detailed delineation of all the problems is mentioned in Table 3. In addition, all the problem instances are classified into three groups according to the total number of decision variables of the problem instances. Furthermore, all the test instances are divided into three categories, including small, medium, and large. It is necessary to mention that available data in Tables 2 and 3 are shown according to the simulated data.

Table 2 Data ranges of used parameters in the AFSCN model
Table 3 The size of the test problems

5.2 Evaluation metrics for Pareto optimum solutions

Generally, the comparison of multi-objective algorithms with each other is difficult. In this regard, researchers have proposed several metrics to evaluate the quality of Pareto fronts for the algorithms (Devika et al., 2014; Fathollahi-Fard et al., 2020). Therefore, four famous evaluation metrics including the number of Pareto solution (NPS) (Goodarzian et al., 2021b), mean ideal distance (MID) (Govindan et al., 2015), the spread of non-dominance solution (SNS) (Esmaeilikia et al., 2016; Govindan et al., 2015), and maximum spread (MS) (Fathollahi-Fard et al., 2020) are employed in this paper.

  • NPS: The number of Pareto optimal solutions is ascertained using this metric. The maximum number of Pareto solutions is indicative of the algorithm's high performance.

  • MID: MID measures the gap between the Pareto solution and the ideal solution. Lesser MID is always preferred to achieve better performance.

    $$MID= \frac{{\sum }_{i=1}^{n}\sqrt{{\left(\frac{{f}_{1i}-{f}_{1}^{best}}{{f}_{1,total}^{max}-{f}_{1,total}^{min}}\right)}^{2}+{\left(\frac{{f}_{2i}-{f}_{2}^{best}}{{f}_{2,total}^{max}-{f}_{2,total}^{min}}\right)}^{2}}}{n}$$
    (23)

    where \(n\) represents a total non-dominated set, \({f}_{i,total}^{max}\) and \({f}_{i,total}^{min}\) shows the maximum and minimum values of the \({i}^{th}\) objective function in non-dominated solutions obtained from algorithms, and \({f}_{i}^{best}\) indicates the ideal solution of \({i}^{th}\) objective function.

  • SNS: SNS is used to measure the spread of ideal solutions and non-dominance solutions. It is calculated with the help of the average distance between the ideal solution and the optimal solution. The higher value of SNS ensures the spread of optimal solutions.

    $$SNS= \sqrt{\frac{{\sum }_{i=1}^{n}{\left(\overline{c}-{c}_{i}\right)}^{2}}{n-1}}$$
    (24)
  • Here, \({c}_{i}=\Vert \overrightarrow{{f}_{i}}-\overrightarrow{{f}_{ideal}}\Vert , \overline{c} = \frac{{c}_{i}}{n} , \overrightarrow{{f}_{ideal}}=\{\mathrm{min}\left({f}_{1}\right),\mathrm{min}\left({f}_{2}\right)\dots \dots \dots \mathrm{min}\left({f}_{k}\right)\}\)

  • MS: MS measures the distance between solutions with respect to the true Pareto front in the objective space. The higher value of MS reflects larger area is covered by the true Pareto front. The expression for MS can be given as:

    $$MS= \frac{1}{M}\sum_{m=1}^{M}{\left(\frac{\mathrm{min}({F}_{i,known}^{max},{F}_{i,true}^{max})-\mathrm{max}({F}_{i,known}^{min},{F}_{i,true}^{min})}{{F}_{i,true}^{max}-{F}_{i,true}^{min}}\right)}^{2}$$
    (25)

    where \(M\) shows the considered total objective, \({F}_{i,known}^{max}\) and \({F}_{i,known}^{min}\) represent the maximum and minimum of \({i}^{th}\) function value in \({F}_{known}\) respectively, and \({F}_{i,true}^{max}\) and \({F}_{i,true}^{min}\) are the maximum and minimum of \({i}^{th}\) function value in \({F}_{true}\) respectively.

5.3 Taguchi technique: parameter tuning

One of the goals of test design is to be able to observe and identify output changes by consciously making changes to process input variables. There are several ways to design an experiment using the Taguchi method. Since the proposed meta-heuristic algorithms have a set of control parameters, a plan should be provided to tune the algorithms’ parameters. If the algorithms are not controlled well, the proposed algorithm behaviour would be inefficient. In this paper, the Taguchi technique is employed to control and tune the presented algorithm's parameters. The Taguchi method involves reducing the variation in a process through the robust design of experiments. Taguchi developed a method for designing experiments to investigate how different parameters affect the mean and variance of a process performance characteristic, which defines how well the process is functioning. The experimental design proposed by Taguchi involves using orthogonal arrays to organise the parameters affecting the process and the levels at which they should be varied. Instead of having to test all possible combinations like the factorial design, the Taguchi method tests pairs of combinations (Goodarzian et al., 2021a). Taguchi (1986) introduced this method to decrease the number of tests for quality engineering scopes. Then, several studies have suggested seeing the application of the Taguchi method and its definition and introduction for interested scholars (see Devika et al., 2014; Cheraghalipour et al., 2019; Fathollahi-Fard et al., 2020; Goodarzian et al., 2020, 2021a, 2021c). The characteristics of the comparison of the Taguchi approach are divided into two categories: control and noise factors. The approach considers calculating the variation value of response according to the signal-to-noise ratio to attain the aim to tune and control the proposed algorithm's parameters. Signal-to-noise ratio (SNR or S/N) is a measure used in science and engineering that compares the level of a desired signal to the level of background noise. SNR is defined as the ratio of signal power to the noise power, often expressed in decibels. A ratio higher than 1:1 (greater than 0 dB) indicates more signal than noise. Additionally, the Taguchi approach mechanism focuses on the response type. The presented response is divided into three categories including the nominal is better, the smaller is better, and the larger the better type. Whereas the presented response of this paper is a minimisation sort, "the smaller is better" is utilised to tune and control each algorithm's parameters. Then, the S/N ratio is formulated based on Eq. (26).

$$S/N=-10\times \mathrm{log}\left(\frac{{\sum }_{i=1}^{n}{Y}_{i}^{2}}{n}\right)$$
(26)

where \({Y}_{i}\) shows the value of the response for \(i\) th orthogonal array and \(n\) is the number of orthogonal arrays.

In this paper, the MOSA, MOACO, NSGA-II, and Fast PGA are used and HACO-SA is developed. The algorithms’ parameters are considered as the factors for each algorithm. Furthermore, the proposed levels along with the factors are described in Table 4. Hence, a maximum of three levels is provided to the algorithms’ factors. Accordingly, the Taguchi approach reduces the total number of experiments by suggesting a set of orthogonal arrays to control and calibrate the algorithms in a reasonable time. Thus, the Taguchi method suggests L8 for MOSA and L16 for HACO-SA, Fast PGA, NSGA-II, and MOACO using MINITAB 16 Statistical Software, as reported in Tables 5, 6, 7, 8. In Taguchi's approach, three levels are selected for the parameter change range. Then three numbers are randomly selected in the desired range. In each instance, these three combinations are repeated. Govindan et al. (2019) and Roghanian et al. (2019) have also used this approach.

Table 4 The values of parameter setting of the presented algorithms
Table 5 The orthogonal array L16 for the NSGA-II and MOACO
Table 6 The orthogonal array L8 for the MOSA
Table 7 The orthogonal array L16 for the Fast PGA
Table 8 The orthogonal array L16 for the HACO-SA

Therefore, the results of the S/N ratio should be analysed to find the best levels of each algorithm. The results of the S/N are shown in Figs.4, 5, 6, 7, 8. Based on the signal-to-noise figures, the level with the lowest value will be selected. According to Figs.4, 5, 6, 7, 8, A2, B3, C1, D2, and E2 for NSGA-II, A3, B2, C1, D2, and E1 for MOACO, A2, B3, C1, and D3 for MOSA, A3, B3, C3, D2, E3, and F1 for Fast PGA, and A1, B3, C2, D3, E3, F3, G3, and H2 for HACO-SA are selected to run of the proposed algorithms in different size problems based on S/N.

Fig. 4
figure 4

The results of the S/N ratio of the NSGA-II

Fig. 5
figure 5

The results of the S/N ratio of the MOACO

Fig. 6
figure 6

The results of the S/N ratio of the MOSA

Fig. 7
figure 7

The results of the S/N ratio of the Fast PGA

Fig. 8
figure 8

The results of the S/N ratio of the HACO-SA

5.4 Pareto optimum analysis: comparison of metaheuristics

In this section, the MOACO, NSGA-II, Fast PGA, MOSA, HACO-SA algorithms are employed to solve the Pareto solutions and the AFSCN model as well as to find a good and suitable solution. To evaluate the performance of the proposed algorithms, the obtained solutions from these three algorithms are investigated in three different sizes. Also, the optimal response values, the best response in three different sizes, the average, and their standard deviation (SD) of objective functions are provided in Table 9.

Table 9 The results obtained are the objective functions and CPU time (based on seconds) of the methods

According to the proposed four evaluation metrics of Pareto optimum analyses, meta-heuristic algorithms are compared with each other. Finally, to improve the reliability of meta-heuristic algorithms, the average of the results for thirty run times is considered to utilise in this section. The behaviour of meta-heuristic algorithms based on computational (CPU) time is indicated in Fig. 9. From Fig. 9 and Table 9, it is clear that the HACO-SA meta-heuristic is quicker and more robust than other suggested algorithms. Thus, the CPU time of HACO-SA is less than other proposed meta-heuristics on different problems. Therefore, HACO-SA has a minimum average of CPU time (378.2967 s), but the MOSA has the maximum rate of this item (623.3133 s). As can be seen in Table 9, the CPU time in the Augmented Epsilon-constraint approach increases exponentially as the sizes of the problem increase. Therefore, according to the CPU time, it can be understood that the proposed model is NP-hard.

Fig. 9
figure 9

The behavior of the proposed methods based on CPU time

As it is seen, in using the MOSA algorithm, all of the suggested problems have been able to achieve the Pareto solutions in different problems. Moreover, the expected states of problem size, SDs increase as the problem size increases. It is also clear that the SD of the first problem (SS1) in the small size equal is zero, which indicates algorithms have reached the optimal Pareto solutions.

The MOACO and HACO-SA algorithms have achieved the optimal Pareto solutions in proposed test problems. Likewise, the HACO-SA has achieved the optimal Pareto solution, with the difference that, in three problems (SS1, MS4 and LS7), the SD is equal to zero. This means that in three categories of these experiment problems, exactly each run is optimised for the solutions, while for the MOSA, only in the first problem (SS1) have all the repetitions reached the optimal Pareto solutions. Therefore, HACO-SA is more reliable than other proposed algorithms. The NSGA-II and Fast PGA algorithms have also been able to achieve optimal Pareto solutions in proposed test problems, but only in two samples of these problems, which are optimised for each run.

The presented meta-heuristic algorithms were also evaluated based on CPU time. It is clear that, in Table 9, the minimum CPU time is related to the HACO-SA algorithm, in which the MOSA, NSGA-II, MOACO, and Fast PGA algorithms need more CPU time than the HACO-SA algorithm. Then, according to the measure of the average, the standard deviation, and the CPU time, the HACO-SA algorithm has high efficiency and better performance than the other suggested algorithms in the different sizes. According to Fig. 9, it is clear that the CPU times of the NSGA-II and Fast PGA are very similar and close together. The trend of the HACO-SA indicates that, in the small size, there is almost a routine, but, by increasing size, the CPU time increases significantly. In addition, as shown in Fig. 9, as the size of the problem increases, the CPU time of the Augmented Epsilon-constraint approach increases exponentially. It should be noted that all CPU times for the Augmented Epsilon-constraint approach are divided by 10. Also, the Augmented Epsilon-constraint approach is not able to solve problems in large size due to its long CPU time. Therefore, the NP-hard of the proposed model is proved.

The performance of proposed algorithms is examined by four evaluation metrics including NPS, MID, MS, and SNS as the comparison metrics for obtained Pareto sets under every experiment problem. Then, the outputs of the assessment metrics are indicated in Tables 10, 11, 12, 13 where it is clear that the HACO-SA algorithm is more reliable, better, and more powerful than other presented algorithms in this paper. In terms of NPS, SNS, and MS, a higher value of these three criteria indicates a greater ability of algorithm solutions, efficiency and performance, and a higher quality of non-dominated solutions. It should be noted that the lowest value of the MID criterion shows the better quality of the algorithm's solutions. As a result, in the four assessment metrics, HACO-SA has the best quality and more efficiency than other suggested meta-heuristics.

Table 10 NPS’s computational outputs for proposed algorithms
Table 11 MID’s computational outputs for proposed algorithms
Table 12 MS’s computational outputs for metaheuristic algorithms
Table 13 SNS’s computational outputs for proposed algorithms

Figure 10 indicates non-dominated (Pareto solutions) solutions of proposed algorithms in different experiment problems. The Pareto performance of the solutions is examined to assess the efficiency of the presented algorithms and their “number of Pareto solutions” results. Here, non-dominated solutions of algorithms in SS1, MS4, and LS8 (in Table 10) test problems based on the number of Pareto solutions (NPS) are indicated. The Pareto frontiers from the NSGA-II, Fast PGA, MOACO and MOSA algorithms were also compared with HACO-SA. In terms of efficiency, the HACO-SA has the more effective Pareto front than the other presented algorithms. The obtained outcomes based on NPS for the HACO-SA algorithm follow the Pareto optimality and are effective. In terms of the selected results in the NPS assessment metric, the number of Pareto front solutions is computed for each algorithm. It is clear that whatever the number of Pareto solution in each algorithm is further, it is more favourable. Therefore, the results of the objective functions based on the NPS metric are examined in Fig. 10. Here it is evident that iNSGA-II shows the worst performance, and HACO-SA is mostly overcome by the other algorithms. Consequently, HACO-SA indicates a more powerful Pareto efficiency than other suggested algorithms to solve the AFSCN problem and to find Pareto frontier solutions.

Fig. 10
figure 10

Pareto frontier of the presented meta-heuristics algorithms in the sizes SS1, MS4, and LS8

Thus, this paper performs a set of statistical comparisons between the suggested algorithms according to the Pareto optimal analyses by assessment metrics to detect the better algorithm. Hence, the outputs that were reported in Tables 10, 11, 12, 13 are converted to a well-known metric called Relative Deviation Index (RDI) using the following formula (Devika et al., 2014):

$$RDI=\frac{\left|{Alg}_{sol}-{Best}_{sol}\right|}{{Max}_{sol}-{Min}_{sol}}$$
(27)

where \({Min}_{sol}\) and \({Max}_{sol}\) illustrate the minimum and maximum values between all the obtained values of the presented algorithms. \({Best}_{sol}\) shows the best solution between the suggested algorithms and also \({Alg}_{sol}\) indicates the value of the obtained objective function by an assessment metric of the meta-heuristic algorithms.

All approaches to calculating the quality of metaheuristic solutions measure the Pareto point distance obtained to the ideal point. Therefore, the quality of the solutions is not related to the parameters and variables of the problem. In the RDI approach, the distance of the calculated Pareto points from the Pareto points obtained from the exact solution is calculated. It is clear that a lower value of RDI shows a higher quality of meta-heuristics. Accordingly, the confidence interval of 95% for the performance assessment metrics in the proposed algorithms to statistically analyse the effectiveness of algorithms is performed (Naderi & Ruiz, 2010). Accordingly, Least Significant Difference (LSD) and the means plot are considered for the used meta-heuristics. The run results by Minitab 16 software are presented in Fig. 11.

Fig. 11
figure 11

ANOVA plots for the assessment metrics in term of RDI for presented algorithms

According to Fig. 11, in terms of the NPS, MID, MS, and SNS assessment metrics, the developed meta-heuristic algorithm, i.e., HACO-SA, is more successful, of high quality, and more effective than the other suggested algorithms, while MOSA has worse performance than the NSGA-II, MOACO, Fast PGA and HACO-SA in all suggested metrics.

5.5 Performance analysis of the meta-heuristic algorithms

The optimisation performance related to the convergence of the proposed algorithms is evaluated based on convergence plots. Then, the convergence plots for NSGA-II, MOSA, MOACO, Fast PGA and HACO-SA algorithms, respectively, according to two objective functions in 50 iterations are shown in Figs. 12, 13, 14, 15, 16. All presented algorithms converge in a stable and fixed trend. The convergence of all of the suggested meta-heuristic algorithms is evaluated and, accordingly, their final outcomes are compared. The maximum iteration is generated based on Table 4 for the proposed algorithms, in which the maximum iteration is investigated the same for the presented algorithms. It is clear that, in Figs. 12, 13, 14, 15, NSGA-II, MOSA, MOACO and Fast PGA algorithms are converging in 50 iterations, but the HACO-SA algorithm converges after 43 iterations in a steady line. Consequently, HACO-SA has more efficiency and high performance than the other proposed algorithms. The purpose of showing the convergence of meta-heuristic algorithms was to prove the superiority of the hybrid algorithm over other algorithms. Therefore, due to the fact that the hybrid algorithm has reached convergence in less than 55 iterations, the convergence display of other algorithms has been omitted.

Fig. 12
figure 12

Comparative convergence of the NSGA-II for costs and environmental effects

Fig. 13
figure 13

Comparative convergence of the MOSA for costs and environmental effects

Fig. 14
figure 14

Comparative convergence of the MOACO for costs and environmental effects

Fig. 15
figure 15

Comparative convergence of the Fast PGA for costs and environmental effects

Fig. 16
figure 16

Comparative convergence of the HACO-SA for costs and environmental effects

5.6 Sensitivity analysis

In this section, sensitivity analysis is carried out to demonstrate the effects of the convergence behaviour on the main parameters of the AFSCN model on the values of the objective functions. According to the HACO-SA algorithm that is the best algorithm based on the obtained results, then this algorithm is utilised to perform the sensitivity analysis.

5.6.1 Changing the behaviour of the objective functions based on transportation costs

In terms of transportation costs, sensitivity analyses have been carried out by raising the amount of this parameter. To recognise the behaviour of both objective functions (total cost and environmental effects), the obtained values are provided in this comparison as depicted in Fig. 17. The outcomes illustrate that, by raising the amount of this parameter, the first objective function is raised slightly. Also, the value of the second objective decreased steadily.

Fig. 17
figure 17

The behavior of objective functions on transportation costs

5.6.2 Changing the behaviour of the objective functions based on parameter \(\zeta \)

In this subsection, in order to analyse the effect of the amount of the \({CO}_{2}\) emissions coefficient through vehicle fuel on the objective function values, various values of the \(\zeta \) are employed. The outcomes of Fig. 18 and Table 14 indicate that the rise in the amount of the CO2 emissions coefficient of vehicle fuel lead to increase gradually the second objective function. The value of the first objective function also increases. The slope of the increase in total costs is less than the slope of the environmental impact chart.

Fig. 18
figure 18

The trend of objective functions based on changing \(\zeta \)

Table 14 The results of the sensitivity analysis related to the number of vehicles

5.6.3 Changing the behaviour of the objective functions based on performance coefficient

In this subsection, in order to analyse the effect of the performance coefficient on the objective function values, various values of the \({g}_{mi}\) are employed. The outcomes of Fig. 19 and Table 15 indicate that the rise in the amount of the \({g}_{mi}\) lead to increase gradually the first objective function. The value of the second objective function also increases. The slope of the increase in environmental impact is less than the slope of the total costs chart.

Fig. 19
figure 19

The trend of objective functions based on changing \({g}_{mi}\)

Table 15 The results of the sensitivity analysis related to the performance coefficient

5.6.4 Changing the behaviour of the objective functions based on capability to re-cultivate

In this subsection, in order to analyse the effect of capability to re-cultivate on the objective function values, various values of the \({\lambda }_{mit}\) are employed. As can be seen in Table 16 and Fig. 20, with increasing capability of cultivating product, the values of objective functions increase. Also, the first objective function increases faster than the second objective function.

Table 16 The results of the sensitivity analysis related to the capability of cultivating product
Fig. 20
figure 20

The trend of objective functions based on changing \({\lambda }_{mit}\)

5.6.5 Changing the behaviour of the objective functions based on total amount of accessible water resources

In this subsection, in order to analyse the effect of the total amount of accessible water resources for cultivating the desired products on the farm on the objective function values, various values of the \({TW}_{i}\) are employed. As can be seen in Table 17 and Fig. 21, with increasing total amount of accessible water resources for cultivating the desired products on the farm, the total cost decreases and the environmental effect increases.

Table 17 The results of the sensitivity analysis related to the total amount of accessible water resources
Fig. 21
figure 21

The trend of objective functions based on changing \({TW}_{i}\)

5.7 Managerial insight

The different groups in the AFSCN including farmers, citrus fruit markets, farms, fridges/cold storage, and transportations get several insights from this paper. According to the increase of citrus fruit stock and the addition of the capacity, decision-makers or policymakers have to evaluate the storage capacity gap by establishing sufficient cold storage across the country. The proposed mathematical model can be utilised for the feasibility analysis of the different potential locations that support avoiding the loss of huge capital investment. The establishment of fridges/cold storage in surplus centres of the country will be useful for the brisk transition of citrus fruit (orange) from farms to citrus fruit markets. This will lead to an increase in procurement from farmers and can be used for the benefit of MSP, which improves their welfare and economic growth. Likewise, the construction of state and district-level fridges/cold storage will be useful for the holding of citrus fruit by satisfying the demand of the people in shortage states and decreasing malnutrition. Due to the establishment of new fridges/cold storage, farmers and other actors traverse fewer distances to reach the nearby fridges/cold storage. This leads to a decrease in transportation costs and related emissions among various stages. Moreover, less emission of CO2 will be instrumental to reduce the carbon tax on transportation activities. The holding losses of citrus fruit/orange by holding it in the surplus fridge/cold storage will be considerably decreased because of the establishment of new fridges/cold storage. All in all, the majority of the problems relevant to the holding, transportation, post-harvest losses, and carry overcharges can resolve after the accessibility of adequate storage capacity. Also, decision-makers can control and limit the release of CO2 emission according to the holding of the citrus fruit and activities related to transportation by keeping the optimal citrus fruit/oranges in various fridges/cold storages. The obtained Pareto optimal solutions are beneficial for the decision-makers to maintain the suitable trade-off between cost and released CO2 emission. Also considering the capability to re-cultivate the agricultural land after citrus fruit harvest, and developing a new heuristic method based on a meta-heuristic to solve the mathematical model are the other contributions of this paper.

Examination of the amount of released CO2 through vehicles shows that there are usually two peaks per year, one in winter and the other in summer. The maximum winter incidence is usually in October, November, or December and the maximum summer incidence is in June, July, or August. The existence of maximum in the winter season can occur due to increased ignition consumption of fuels caused by thermal functions and vehicles along with meteorological conditions such as increased air pressure, air stability, and temperature inversion. Also, the maximum presence of released CO2 in summer can be due to several factors including surface temperature inversion, reduced rain and failure to wash pollutants. Therefore, in this study, the released CO2 by vehicles in summer is considered.

6 Conclusions and future research directions

In this paper, a bi-objective mathematical model for a multi-period, three-echelon, single product (orange) AFSCN is designed. Additionally, an MINLP model under time windows for servicing citrus fruit markets in the AFSCN along with minimising the total costs and the CO2 emissions are provided. The water resource gap, considering simultaneously both the sent amount of orange to the fridge/cold storage and the duration of staying in the fridge/cold storage and time windows to service in citrus fruit market are some of the main motivations for this paper. In the current model, the capability to re-cultivate the agricultural land after citrus fruit harvest is considered. The formulated mathematical model is competent enough to demonstrate the trade-offs between cost and CO2 emission. Small-, medium-, and large-sized problem examples stimulated from the agri-food supply chain are solved utilising four meta-heuristic algorithms including NSGA-II, MOSA, MOACO and Fast PGA. Additionally, a new hybrid algorithm called HACO-SA is developed to solve the presented model. Based on the obtained results, HACO-SA is superior compared to other proposed algorithms. Sensitivity analysis results show that the decision-makers should establish an adequate number of fridges/cold storage and the amount of water source in the production, holding, and consumption processes by maintaining a proper balance among the two objective functions.

Therefore, a fuzzy, stochastic, or robust multi-objective model can be formulated in the near future to obtain the uncertainty in cost parameters. The proposed model integrates the economic (total costs) aspects and environmental effects (CO2 emissions) considering time windows. Therefore, interested researchers can use social effects such as the responsibility of farmers and workers in the proposed model. The current paper considered a single product (orange) and future research can look into the multi-product scenario. The quantification of the post-harvest losses can be another area to explore. The presented meta-heuristic algorithms can be applied to other problems such as location-routing, allocation-location-inventory, and scheduling-routing problems in the other supply chain network and healthcare network to assess their effectiveness. Finally, interested scholars can utilise other improved or hybrid meta-heuristic algorithms, exact methods such as the Lagrangian approach, and heuristic methods. The limitations of the research are as follows: As there was no official database for some parts of cost elements, the drivers’ estimations were asked to help. The questions about the transportation costs have been categorised and the estimated costs have been entered into the mathematical model. Meta-heuristic algorithms are not able to calculate the global optimum and calculate the local optimum. The final solution in NSGA-II, MOSA, MOACO and Fast PGA algorithms depends on the coder’s skill in defining chromosomes and the initial value of its parameters.

Similar to the other studies, the current paper has a few limitations, as follows:

  • One of the limitations of the research is the lack of access to accurate information about transportation costs. Moreover, in this study, only the information of drivers and experts about transportation costs has been considered.

  • In order to implement HACO-SA, NSGA-II, MOSA, MOACO and Fast PGA algorithms, high RAM and CPU hardware and software features such as MATLAB software are required.