Introduction

Land suitability evaluation is a powerful tool to support decision-making in land use planning; it deals with the assessment of the (most likely) response of land when used for specified purposes; it requires the execution and interpretation of surveys of climate, soil, vegetation and other aspects of land in terms of the requirements of alternative forms of land use. Land evaluation is carried out to estimate the suitability of land for a specific use such as arable farming or irrigated agriculture. Land evaluation can be carried out on the basis of biophysical parameters and/or socio-economic conditions of an area (FAO 1976). Biophysical factors tend to remain stable, whereas socio-economic factors that are affected by social, economic and political performances (Dent and Young 1981; Triantafilis et al. 2001). Thus, qualitative land suitability evaluation is a prerequisite for land-use planning and development (Sys 1985; Van Ranst et al. 1996). It provides information on the constraints and opportunities for the use of the land and therefore guides decisions on optimal utilization of land resources (FAO 1984). The FAO (1976) defines land evaluation as “a process of assessment of land performance when the land is used for specified purposes”. A qualitative land evaluation takes into account two key elements, the soil qualities/characteristics and the crop requirements (FAO 1976). The latter refers to “a set of land characteristics that can determine the production and management conditions of a kind of land use”. The outcome of the suitability assessment for a particular crop which is the final result of a land assessment depends on whether the land characteristics match with the crop requirements. Land suitability assessment can be regarded as a specific case of land evaluation: it is an appraisal of land characteristics in terms of their suitability for a specific use (FAO 1976). The basic concept behind land suitability evaluation is that suitability for a specific and sustainable use of the land is the synthetic result of complex relationships between different land environmental qualities (e.g., climate, soil characteristics and slope). Suitability for a specific use is therefore evaluated by matching requirements for that use with characteristics and qualities of land components. Land suitability is usually expressed by a hierarchical system organized into orders and classes (FAO 1976). The parametric approach is considered as a transitional phase between qualitative methods, which are entirely based on empirical expert judgments and standard mathematical models that would be the real quantitative systems. In parametric approach different classes of land suitability are defined as completely separate and discrete groups and are separated from each other by distinguished and consistent range. The non-certainty in the output results obtained by parametric approach can solve by evolutionary learning and the nonlinear mapping ability of the neural networks. Artificial neural networks (ANN) are a form of artificial intelligence, which by means of their architecture attempt to simulate the biological structure of the human brain and nervous system (Zurada 1992). A neural network consists of simple synchronous processing elements, called neurons, which are inspired by biological nerve system (Malinova and Guo 2004). The mathematical model of a neural network comprises of a set of simple functions linked together by weights. The network consists of a set of input units x, output units y and hidden units z, which link the inputs to outputs. The learning algorithm of neural networks used the values obtained by parametric approach. Through evolutionary learning from samples a neural network adapts its connection weights to approximate the desired output. Then, successfully trained neural networks can accomplish the suitability analysis task. Neural networks can identify subtle patterns in input training data, which may be missed by conventional statistical analysis. In contrast to statistical regression models, neural networks do not require knowledge of the functional relationships between the input and the output variables. Moreover, neural networks are nonlinear, and therefore may handle very complex data patterns, which make mathematical modeling unattainable. Another advantage of neural networks is that all kinds of data-continuous, near-continuous, and categorical or binary can be input without violating model assumptions, as well as the ability to model multi-output phenomena. Decision making issue in evaluating land suitability is very complex and complicated because of several decision indicators and criteria. Fuzzy set methodologies have been proposed as a method for overcoming problems related to vagueness in definition and other uncertainties. The use of fuzzy set methodologies in land suitability evaluation allows imprecise representations of vague, incomplete and uncertain information. Fuzzy land evaluations define continuous suitability classes rather than “true” or “false” as in the Boolean model (e.g. Burrough 1989; Sicat et al. 2005; Ziadat 2007; Keshavarzi et al. 2010). Fuzzy set methodologies have the potential to provide better land evaluations compared to Boolean approaches because they are able to accommodate attribute values and properties which are close to category boundaries. The fuzzy methods are able to address and explore the uncertainties associated with land resources, especially if they are integrated with fuzzy set models (Xiang et al. 1992; Ceballos-Silva and Lopez-Blanco 2003; Prakash 2003; Guerrero and Moreno 2007; Chaddad et al. 2009). However fuzzy methods are still relatively unknown in land suitability evaluations for agricultural crops. Fuzzy sets theory for the first time defined by Zadeh (1965) in order to quantitative defining and determining of some classes that are expressed vaguely such as “very important” and so on. In fuzzy thinking, determination of specific border is difficult and belonging of various elements to various concepts and issues are relative. In fuzzy theory, the membership was not two-valued, but it can allocate the range of numbers from zero to one. A function that expresses degree of membership is called membership function. In the present study a Gaussian combination membership function was used. Fuzzy model has been used by many researchers in land suitability evaluation (Tang et al. 1991; Van Ranst et al. 1996; Keshavarzi and Sarmadian 2009). Most of the researchers, have been compared the results of this evaluation with other conventional methods such as maximum limitation, parametric and multiple regression methods in order to predicting the yield of production. Fritz and See (2005) studied the comparison of land cover maps using fuzzy agreement. The spatial fuzzy agreement between the two land cover products is provided. The results showed that fuzzy agreement can be used to improve the overall confidence in a land cover product. Tang et al. (1991) used fuzzy method for evaluation of Hamen lands in Liaoning province in China in order to cultivating corn. These researchers were obtained the final matrix of suitability with constituting weight matrix and land characteristics matrix and multiplication of them and finally were calculated the land index. In this study, they were used multiple regression to determination of weights. The aim of the present study is to evaluate land suitability for soybean production in a semi-arid region by Parametric-based neural networks and fuzzy models and comparing land index values of both models with the observed yield of the crop to validate the accuracy of the models in practice.

Materials and methods

General characteristics of the study area

The present study was conducted in Neyshabour plain, Khorasan-e-Razavi Province, Northeast Iran (Fig. 1). The study area is located between latitude 35°41′N to 36°39′N and longitude 58°13′E to 59°30′E including lands less than 2933 m asl. The general physiographic trend of the plain extends in a NW–SE direction. The total surface of the study area comprises 5060 km2. The elevation values of the study area vary between 1100 and 1700 m asl, with an average of 1400 m asl. The main land use practice in the study area is irrigated farming. The climate of the study area is semi-arid with mean annual precipitation of 247.4 mm and the mean annual temperature of 13.9 °C.

Fig. 1
figure 1

The Geographical location of the study area (Neyshabour plain)

Parametric approach

The process of evaluation is based on the FAO qualitative land evaluation system (FAO 1976, 1983, 1985), which applies climatic conditions and land qualities/characteristics including topography, flooding hazard, drainage condition, soil texture, gravel percent, soil depth, calcium carbonate, gypsum, organic carbon, pH, soil salinity and alkalinity requirements for soybean production (Sys et al. 1991a, b, 1993). Based on morphological and physical/chemical properties of soil profiles some 41 land units were identified in the study area. For determining the mean values of soil physical, chemical and terrain parameters for the upper 60 cm of soil depth, the profile was subdivided into two equal sections and weighting factors of 1.75 and 1.25 were attributed for the above and below sections, respectively (Sys et al. 1991c). A qualitative land suitability evaluation indicates the degree of suitability for specific land use, without respect to economic conditions. It emphasizes the relatively permanent aspects of suitability, such as climate and soil qualities/characteristics, rather than changeable ones, such as prices. Applying parametric learning neural networks in land evaluation consists of numerical rating of different limitation levels of land characteristics according to a numerical scale between the maximum (normalized as 100) and the minimum value of zero. Finally, the climatic index, as well as the land index, is calculated from these individual ratings. On this basis, Boolean classification was implemented in a way that for classified (qualitative) values (e.g. soil texture/structure = SL) the higher score of the class is given (e.g. 85) while, for continues (quantitative) values a linear interpolation function used to assign a score. The data provided from a soil survey are often continuous data and therefore it is necessary to apply a classification scheme that assigns scores to individual land qualities/characteristics. This scheme is based on linear interpolation functions that map value intervals to score intervals. If the observed value is x and it falls into the interval [ab] it needs to get a score y that falls into the interval [cd]. The formula to calculate y is:

$$y = a + \frac{(b - a)( x - c)}{(d - c)}$$
(1)

Each class-determining factor is first matched individually. Critical limits indicate how suitable a land unit is for a given Land utilization type (LUT) in terms of that factor. For example, if one of the class determining factors for the LUT irrigated soybean is soil texture and the critical limits are to be represented in terms of soil texture corresponds to S1, S2, S3, N1 and N2 suitability levels. The soil texture recorded for each land unit will fall within one of these five ranges and the appropriate one is selected as the factor rating. In combining the factor ratings of several individual factors in order to decide the appropriate land suitability class to assign, the possibility of interactions should be taken into account. In a broad interpretation of the meaning of the word interaction it can be readily appreciated that many factors interact in the resultant land index which is the integral of their effects.

Climate evaluation

Climate data related to different stages of soybean growth were taken from 30 years of meteorological data of the region (1981–2010) and the climatic requirements of the crop were extracted from Sys et al. (1993). Based on crop climatic requirements, the climate index (CI), climatic rate (CR) was determined as implemented factors in estimating land index (Table 1).

Table 1 The values of climate characteristics and the degree of limitation for Soybean production in the study area based on Sys et al. (1993)

Estimating land suitability index

The proposed method is a parametric approach developed by Bagherzadeh and Paymard (2015) to estimate the land suitability index. On this basis the land index of each land unit is calculated by multiplying geometrical mean value of the scores given to each land quality/characteristic and climate rate in the interaction of the square root values of scores according to the following formula:

$$LI = \mathop \prod \limits_{i = 1}^{n} x_{i}^{{\left( {\frac{1}{n}} \right)}} \times \sqrt[n]{{\frac{{\mathop \prod \nolimits_{i = 1}^{n} x_{i} }}{{100^{n} }}}}$$
(2)

where, LI is the land index, X is the score given to each land quality/characteristic, n is the number of land qualities/characteristics.

Neural networks modeling

The neural network model is derived from a simulation of the human brain. The basic computation unit in a neural network is a neuron. A neuron performs the simple weighted summation and nonlinear mapping (Fig. 2), where w0 is a threshold and f is usually a sigmoid function: i.e.,

$$f\left( z \right) = \frac{1}{{1 + e^{ - z} }}$$
(3)
Fig. 2
figure 2

Artificial neural network neuron (w 0 is a threshold)

A neural network has many neurons. The way neurons connect determines the structure of a neural network. A type of neural network called multilayer perceptron (MP), or feed forward network, consists of a sequence of layers of neurons with full connections between successive layers. Two layers of MP have connections to the outside world: the input and output layer. There are one or more hidden layers between the input and output layer. Information sequentially passes through the input, hidden, and output layers. A feed forward network with one or more hidden layers can form any shape of decision boundaries or approximate any continuous function, given sufficient hidden neurons (De Villers and Barnard 1992; Kreinovich and Sirisaengtaskin 1993). A neural network usually has two distinctive phases: learning and recall. Currently, the most popular learning algorithm for feed-forward network is back-propagation (BP) (Rumelhart and Hinton 1986). We will refer to a feed-forward network using a BP learning algorithm as a BP network in this paper. With the BP algorithm, a set of training samples with the desired output is required. It is a procedure which iteratively adjusts the weights of the connections in the gradient descent direction so as to minimize a measure of the difference between the actual output vector of the network and the desired output vector. The difference is usually measured by the error function:

$$E = \frac{1}{2} \mathop \sum \limits_{c} \mathop \sum \limits_{j} (y_{j.c} - d_{j.c} )^{2}$$
(4)

where, c is an index over cases (input–output pairs), j is an index over output units, y is the actual state of the output unit, and d is its desired state. The simplest version of gradient descent is to change each weight by an amount proportional to the accumulated ∂E/∂w: i.e.,

$$\Delta w = - \frac{\eta \partial E}{\partial w}$$
(5)

where, η is the learning rate.

A neural network’s generalization ability, or the power to handle unseen data, is crucial. The generalization ability can be measured by a set of testing samples in the recall phase. If a trained neural network generalizes well, it can be safely used to process the whole data set.

The schematic of the neural networks and its components in the present study has been illustrated in Fig. 3.

Fig. 3
figure 3

The diagram of neural networks and its components

Fuzzy logic modeling

The Fuzzy set theory (Zadeh 1965) is a body of concepts and a technique that gives a form of mathematical precision to human thought processes that are imprecise and ambiguous in many ways. At the present study, we considered the fuzzy approach on the Singleton fuzzyfier, Centroid defuzzyfier and minimum Mamdani inference engine in MATLAB Ver. R2012a software. The fuzzy system has the following block diagram (Fig. 4). Accordingly, the application of the fuzzy set theory to determine the impact of land qualities for irrigated soybean comprises several steps:

Fig. 4
figure 4

A block diagram of the fuzzy system

  • Defining the inputs and output of fuzzy system;

  • Determination of membership functions;

  • Determination of membership values;

  • Determination of fuzzy If and Then rules;

  • Defuzzyfication of all membership values and determination of land index for each land unit.

In this study the membership function was defined as Gaussian combination membership function as follows:

$${\text{Syntax:}}\quad {\text{y}} = {\text{gauss}}2{\text{mf (x}},[{\text{sig}}1{\text{ c}}1{\text{ sig}}2{\text{ c}}2] )$$
(6)

The Gaussian function depends on two parameters sig and c as given by:

$$f(x;\sigma ,c) = e^{{\frac{{ - (x - c)^{2} }}{{2\sigma^{2} }}}}$$
(7)

The function gauss2mf is a combination of two of these two parameters. The first function, specified by sig1 and c1, determines the shape of the left-most curve. The second function specified by sig2 and c2 determines the shape of the right-most curve. Whenever c1 < c2, the gauss2mf function reaches a maximum value of 1. Otherwise, the maximum value is less than one. The parameters are listed in the order: [sig1, c1, sig2, c2] (Fig. 5).

Fig. 5
figure 5

Gaussian combination membership function (Gauss2mf)

The zonation of land suitability

An interpolation technique using the ArcGIS ver.10.3 helped in managing the spatial data and visualizing the land index values using parametric learning neural networks and fuzzy models for preparing the final land suitability evaluation maps for both models. According to Sys et al. (1991c) the land suitability classes, the intensity of limitation and the land index ranges are demonstrated in Table 2.

Table 2 Land suitability classes according to land index (Sys et al. 1991c)

Results and discussion

Neural networks model in land suitability evaluation

Suitability is largely a matter of producing yield with relatively low inputs. There are two stages in finding the land suited to a specific crop. The first stage focuses on being aware of the requirements of the crop, or alternatively what soil and site attributes adversely influence the crop. The second stage is to identify and delineate the land with the desirable attributes. In the present study, the specific soil and climate requirements for irrigated soybean were determined based on Sys et al., guidelines (1991a, b, 1993). There was a relatively low climate rate ranged from 37.0 to 38.0 in most parts of the study area which characterized the region marginally suitable (S3 class) for irrigated soybean production (Table 1). The statistical values for degrees of limitation of each soil quality/characteristic in depth of 0–60 cm at the study area indicated higher standard deviation and coefficient of variation for soil salinity and alkalinity, which affect the land index variations by both models more effectively than other soil parameters (Table 3). The values of land indexes using parametric learning neural networks varied between 29.77 and 57.45 with an average of 48.05 (Table 4). The land suitability classes for soybean were categorized into marginally suitable of S3 and moderate suitable class of S2. The zonation map of land suitability revealed that 73.52 % (3618.65 km2) of the surface area were marginally suitable and 26.48 % (1440.92 km2) were moderate suitable for soybean production (Table 5). The most important limiting factors for soybean production in the study area were climate conditions, soil pH and soil organic carbon. The moderate suitability class of S2 was mainly distributed in west and north parts of the plain, while the middle and east parts of the study area exhibited marginal suitability for irrigated soybean production (Fig. 6).

Table 3 The statistical values for degrees of limitation of soil qualities/characteristics in depth of 0–60 cm at the study area
Table 4 The values of land indexes and land suitability classes by neural networks and fuzzy models for Soybean production in the study area
Table 5 Land suitability classes, land indexes, the surface area and the percent of each suitability class by neural networks and fuzzy models in the study area
Fig. 6
figure 6

The zonation of land suitability for Soybean production by neural networks model in Neyshabour plain

Fuzzy model in land suitability evaluation

Land suitability analysis deals with many factors that are continuous in nature, like soil characteristics, and climatic parameters. Using parametric approach makes it impossible to model such a vagueness and imprecision of environmental factors. Fuzzy logic aids in most precise representation of such imprecise, incomplete and vague information. The values of land indexes based on fuzzy model altered from 16.10 to 47.80, which categorized the region into marginally not suitable class of N1 to marginally suitable class of S3 (Table 4). The major limitations for soybean cultivation were found to be climate conditions, soil pH and soil organic carbon. Hence, emphasis should be placed on soil management techniques that conserve organic matter and enhance nutrient and water-holding capacity of the soil. The produced maps of land suitability by fuzzy logic approach showed that 255.03 km2 equal 5.04 % of the study area has marginally not suitability and 4804.54 km2 equal 94.96 % of the plain has marginally suitability for soybean production (Table 5). The marginal suitability class of S3 was distributed all over the plain, while scattered parts in south, southeast and middle of the plain exhibited marginal not suitability for irrigated soybean production (Fig. 7).

Fig. 7
figure 7

The zonation of land suitability for Soybean productions by Fuzzy model in Neyshabour plain

Model validation

The land index values from neural networks and fuzzy models were compared by calculating the coefficient of determination (R 2) defined by Nash and Sutcliffe (1970) which is calculated as follow:

$$R^{2} = 1 - \frac{{\left[ {\mathop \sum \nolimits_{i = 1}^{n} (LI_{fuzzy} - LI_{ANN} )^{2} } \right]}}{{\left[ {\mathop \sum \nolimits_{i = 1}^{n} (LI_{fuzzy} - \overline{LI}_{ANN} )} \right]^{2} }}$$
(8)

where: LI fuzzy and LI parametric are computed values of sample i, based on the fuzzy system and parametric model, respectively, and LI parametric is the mean of measured values. The coefficient of determination (R 2) estimated from the above formula for our study was R 2 = 0.966 which shows a high correlation between the estimated land index values from both models. The values of regression coefficient (R 2) between the estimated land index values by both models with the observed soybean yield in each land unit varied between 0.610 and 0.514 for neural networks and fuzzy approaches, respectively, which indicate similar trend of both models in estimating land suitability for soybean production in the study area (Figs. 8, 9).

Fig. 8
figure 8

Exponential regression between the observed yield of Soybean and the land index values by parametric-based neural networks model in Neyshabour plain

Fig. 9
figure 9

Exponential regression between the observed yield of Soybean and the Fuzzy preference values in Neyshabour plain