Introduction

Worldwide urban and regional development has undergone many changes over the past 50 years (CHU and Tang 2005). Continued urban growth and development has put major challenges on the structure and functioning of sub-systems in the city and the region (Natale et al. 2015) Due to the responsibility of policy makers in the area of regional planning (Greenhalgh and Gudgeon 2004), it is necessary to pay attention to the development process in the management of complex systems (Barredo et al. 2003) at the city and district level. As a result, exploring urban development is important. Based on the view of different people, various tools and methods for simulating spatial and urban dynamics, such as cellular automation simulation models, fractal model, linear regression or logistic, technique multivariate evaluation (Barreira-González et al. 2015), dynamic model, baseline model (Qiu-Hao and Yun-Long 2006), conventional models have problems with analysis, due to the lack of dynamic information, because they cannot meet the needs for spatial–temporal simulation (He et al. 2015). Using of dynamic models which include spatial–temporal information are useful for simulating complex geographic systems (He et al. 2015). Among the models mentioned, the model of AHP cellular automation is due to its specific nature and benefits such as simplicity (because at the beginning, the parameter values are automatically obtained by complementary models efficiency due to the modification and development of several models) independent variables (Yang et al. 2013; Barreira-González et al. 2015) have become more widely used over the past two decades. Von Neumann and Stanislaw Ulam (Junfeng 2003) presented the concept of automated cells, in the field of computer science, for the first time in 1940, then Conway developed the concept by the theory of game of life. The outcome of the Tobler’s work in the 1970s at the University of Michigan was the entrance of the CA models into the geography (Firouzabadi et al. 2009). 1990s was one of the best decades for the development of urban CA models. In the past 20 years, CA has been very useful for many researchers in the urban studies (Clarke et al. 1997; White and Engelen 1997; Wu and Webster 1998; Batty et al. 1999; Li and Yeh 2000, 2002). A regular grid of cells where each cell can be given a certain value according to its location forms CA. The changing in These values and the values of neighboring cells in a discrete time interval is accomplished according to the rules, defined in the model as the transition rules, (Wolfram 1984). Due to the bottom-up approach of this model, it is able to simulate the overall behavior of the urban growth by considering the behavior of all the urban cells which are affected by the current conditions of the central cell and the neighboring cells (Al-Kheder 2006; Feng et al. 2011). The behavior of a sophisticated system, such as an urban system, can be simulated using transition rules in the cellular automata. The probability of the conversion of a cell as a function of the driving forces of urban growth and position of all neighboring cells calculated according to Transition rules (Al-Kheder 2006; Feng et al. 2011). Spatial parameters of the transition rules have often a severe spatial correlation where the precision and accuracy of modeling can reduce. Nowadays, one of the most important research topics is the manner managing of these variables and propose solutions to minimize the effects of correlation (Li and Yeh 2002; Al-Kheder 2006). So far, many articles have been published to determine the factors affecting urban growth. One of the most important articles of the article is (Wahyudi and Liu 2013) in this article Over a hundred articles published between 1993 and 2012 were selected and reviewed. The driving factors were extracted from the transition rules and classified according to their similarity and mechanism in influencing urban growth. (Table 1) they analysis shows that researches between 1993 and 2000 mainly focus on using geomorphological factors while recent studies tend to also include socio-economic factors, resulting in more sophisticated urban CA models. Nevertheless, the human-behavior factors impacting urban growth are generally under-represented (Wahyudi and Liu 2013).

Table 1 Key driving factors to urban growth extracted from existing CA models (Wahyudi and Liu 2013)

In addition to this article, many other articles have also been reviewed. The results of the reviewed articles are generally presented in Table 2. In this table, all the factors affecting urban development have been identified.

Table 2 Factors affecting urban growth based on resources provided in table

Materials and methods

Urban sprawl

Sprawl definition could be the increase in built-up and paved area with impact on loss of agricultural land, open space and ecologically sensitive habitats (Liu and Phinn 2001). The sprawl mainly causes population growth, economy and the template and preparation of infrastructure (Sudhira et al. 2003; Sudhira 2004) sprawl development is accomplished in three ways which are ribbon sprawl, leapfrog sprawl and low-density (radial) sprawl. Consumptive use of land is highly affected by provision of public infrastructures such as water, sewer, power, Telecommunication and roads. The ribbon sprawl is developed along the main transportation network that joins urban areas. The main point is that the Lands without direct access to the network remains rural or undeveloped. The leapfrog sprawl is a discontinuous pattern of urbanization in which the developed lands are widely separated from each other and it costs more than two other ways. The inherent causal and dynamics involved in the rapid changes of land-use because of urban sprawl are the most important factors, which are considered as a fit case to apply Cellular Automata models and AHP logic for simulating future scenarios (Sudhira et al. 2004).

Cellular automata

Cellular automata (CA) are a computational method, capable of simulating growth process by describing a complex system through a set of simple rules (Mantelas et al. 2008). A cellular automata model was designed for urban growth to simulate the process of urbanization in a hypothetical region. This model consists of a set of rules that describe the spatial interaction of cells and a set of parameters that lead to explore different urban forms. CA ability in the simulation of urban growth, land use change and population expansion had become suitable for simulating complex geographical process (Feng et al. 2011). One of the best utilities of CA for geographic modeling includes its capability to support a very large parameter spaces for simulation (Torren and Benenson 2007). CA-based models behave according to the neighborhood relationships, which makes them to have dynamic and discrete systems. In the CA, space is defined as a grid, which each part of that is called cell (Moghadam and Helbich 2013) CA models including four parts: lattice, cell states, neighborhood and transition rules (Batty et al. 1999) Lattice is a grid, which is able to adopt various geometrical forms like square, hexagonal, and Trapezius shapes (Fig. 1).

Fig. 1
figure 1

a MOR’s neighbourhood, b developed neighbourhood, c Van’s neighbourhood in the two-dimensional CA model, d neighbourhood in a one-dimensional CA mode (Lee et al. 2009)

CA modelling is one of the suitable ways for urban sprawl modelling (Feng et al. 2011).

Lattice: according the shape similarity to square at raster data in GIS, square shape of lattice was selected in which the size of each cell equals 10 m by 10 m.

Cell state: the definition of three cell states is in the offered model, the urban state, constraint state and the non-urban state with the values of 1, 0 and between 0 and 1 respectively.

Neighbourhood: the Moore neighbourhood was selected for this study.

The definition of transition rules is as follow:

  1. 1.

    There is not any changing in state of the cell at simulation periods if the state of a cell is urban.

  2. 2.

    There is not any changing in state of the cell at simulation periods if the state of a cell is constrained.

  3. 3.

    Probability of a cell for transformation to the urban state will increase if affecting factors are closer to the cell.

  4. 4.

    The probability for transformation to urban state will increase if cells have more urban state neighbours.

  5. 5.

    The state of the cell will transform to urban state if calculated of a cell has the maximum value between the other cells (Mohammad et al. 2013).

Analytic hierarchy process (AHP) model

In general, it has a set of issues that are evaluated on the basis of metrics. Multiple criteria decision making (MCDM) analysis is a set of analysis methods which helps decision makers to solving complex problems with poor or incomplete structure and use their knowledge to solve these problems (Sardari and Rafieian 2008).

Analytic hierarchy process is one of the most famous multiple criteria decision making methods (Saaty 1980) which has been suggested in year time and It has been widely used in various sciences as yet (Zebardast 2001). The process consists of four steps of creating hierarchy, determining the Importance factor of the criteria and sub-criteria, determine the Importance factor of the options, final rating and compatibility check in judgments (Zebardast 2001). By using this method, the appropriate weights of the affecting factors for urban growth are calculated and it is used in the production of land suitability map and formulation of transfer laws in the automated cell model

Entropy model

By using this model, we can understand the equilibrium rate of space for population clearance and the number of cities at the urban, state, regional, and country level. The whole structure of model is: In the entropy model, trending to zero, will means the increase of centralization or more centralization or unbalance in population distribution between cities, and movement toward one or higher than shows unbalance distribution in area level (Hekmatnia and Mousavi 2006). Definition of centralization degree or geographical phenomenon has been accomplished by this index (Wilson 1996).

Shannon entropy model

“Shannon entropy model is used for analysis and determination of the amount of urban disruption phenomenon” or, this indicator can be used to analyse information and the level of organization of a system (Mehrgan and Rahmani 2013). The overall structure of the model is as follow: Eq. (1) Shannon entropy formula

$$ {\text{H}} = \sum\limits_{{i = 1}}^{n} {pi^{ * } {\text{Ln}}({\text{Pi}})}$$
(1)

In the above formula:

H = Shannon entropy value; Pi = ratio of built area (total residential density) i = area to the total built area; N = total areas. The value of the entropy from zero to Ln, (n) represents a sprawl urban development. When the value of entropy is greater than in (n), urban sprawl growth has occurred (Hekmatnia and Mousavi 2006).

Holdern entropy model

Holdern is one of the basic methods for determining uneven urban growth. It is possible to determine the amount of city growth based on population growth and the amount of uneven urban growth with this method. This model first was applied to calculate the ratio population to any other source by Holdern in 1991. According to Beck et al. (2003).

In the holdern method, using the per capita formula, it is, \( a = \frac{{A\left( {area} \right)}}{p} \) determined how much the growth was due to population growth and how much was due to the increased per capita. Equation (2) holdern model formula

$$ \begin{aligned}& {\text{A}} = {\text{ p}}*{\text{a}} \hfill \\& {\text{A}} + \Delta {\text{A}} = \, \left( {{\text{p}} + \Delta {\text{p}}} \right) \, \left( {{\text{a}} + \Delta {\text{a}}} \right) \hfill \\& {\text{A}} + \Delta {\text{A}} = {\text{pa }} + {\text{p }}\Delta {\text{a }} + {\text{a }}\Delta {\text{p }} + \, \Delta {\text{p }}\Delta {\text{a}} \hfill \\& \Delta {\text{A}} = {\text{ p }}\Delta {\text{a }} + {\text{ a }}\Delta P \to \frac{{\Delta {\text{A}}}}{\text{A}} = \frac{{\Delta {\text{a}}}}{\text{a}} + \frac{{\Delta {\text{p}}}}{\text{p}} \hfill \\ \end{aligned} $$
(2)

Note: holdern \( A = \pi r^{2} \) found a general model based on population growth model Eq. (3)

$$ \begin{aligned}& p_{t} = p_{0} \left( {1 + y_{p} } \right)^{t} \hfill \\& p_{t} = p_{0} \left( {1 + g_{p} } \right)^{t} \hfill \\& \ln \left( {1 = g_{p} } \right) = \frac{1}{t}\ln \left( {\frac{{p_{t} }}{{p_{0} }}} \right) \hfill \\ \end{aligned} $$
(3)

(Condition A): for the amount lesser than, X is equals to X. Equation (4)

$$ \begin{aligned} g_{p} = \frac{1}{t}\ln \left( {\frac{{p_{t} }}{{p_{0} }}} \right) \hfill \\ g_{a} = \frac{1}{t}\ln \left( {\frac{{a_{t} }}{{a_{0} }}} \right)\quad \to g_{A} = g_{p} + g_{a} \hfill \\ g_{A} = \frac{1}{t}\ln \left( {\frac{{A_{t} }}{{A_{0} }}} \right)\quad \ln \left( {\frac{{A_{t} }}{{A_{o} }} } \right) = \ln \left( {\frac{{p_{t} }}{{p_{o} }} } \right) + \left( {\frac{{a_{t} }}{{a_{o} }} } \right) \hfill \\ \end{aligned} $$
(4)

Kappa coefficient analysis

Calculating the kappa coefficient

The Kappa coefficient and the overall accuracy are used to assess the proposed model. For this purpose, error matrix is computed as shown in Table 3 (Monserud and Leemans 1992; Pontius 2001).

Table 3 Method for calculating the Kappa coefficient (Congalton et al. 1983; Rosenfield and Fitzpatrick-Lins 1986)

Kappa is declared to have two advantages over raw accuracy:

  • Kappa considers all the cells of an error matrix and thus incorporates more information (Rosenfield and Fitzpatrick-Lins 1986; Fung and LeDrew 1988; Dicks and Lo 1990; Janssen and Vanderwel 1994);

  • Kappa is suitable for comparison between different error matrices because it removes chance agreement (Congalton et al. 1983; Congalton 1991).

Congalton et al. (1983) proposed an application of the Kappa analysis, (as described by Cohen 1960; Bishop et al. 1989), as a means of improving the interpretation of the error matrix. Kappa analysis yields a Khat coefficient that measures the difference between the observed agreement of the two maps and the agreement that might be attained by chance matching (Hurley et al. 1997). The Khat coefficient is computed as follows: Eq. (5)

$$ {\text{K}}\left( { {\text{hat}}} \right) = \frac{{{\text{P}}\left( {\text{A}} \right) - {\text{P}}\left( {\text{E}} \right)}}{{1 - {\text{P}}\left( {\text{E}} \right)}} = \frac{{\mathop \sum \nolimits_{{{\text{i}} = 1}}^{\text{C}} {\text{Pii }} - \mathop \sum \nolimits_{{{\text{i}} = 1}}^{\text{c}} {\text{PiT PTi}}}}{{1 - \mathop \sum \nolimits_{{{\text{i}} = 1}}^{\text{c}} {\text{PiT PTi}}}} $$
(5)
  • C: is the number of categories in the error matrix

  • Pii is the number of cells in row i and column i

  • PIT is the total number of cells in row i (shown as the row total in the matrix)

  • PTI is the total number of cells in column i (shown as the column total in the matrix)

  • Where Pa is the percent correct for model output, and PE is the expected percent correct due merely to chance.

Research methodology

The research method of this article is practical which its theoretical bases are collected by documentary method and quantitative and comparative methods have been used for its analysis. In this study, first, the automated cell model is checked and then the effective factors on urban growth are determined by using resource review and aerial photography of the study area at different time intervals. Subsequently, analytical maps related to the effective factors on urban growth in GIS software are produced and the weights obtained through the hierarchical analysis process of comparing the development stimuli relative to each other in the applied layers and overlapping the different layers and Considering the limitations of developing land suitability map as inputs for the automated cell model is produced. Finally, the model is conceptually designed and integrated with the hierarchical analysis process which using the model definition of the automated cell model. Then the conceptual model has been converted into software codes and used to predict city growth in the surrounding lands.

Recognition of the studied area

Qazvin is the city canter and it is Qazvin province. Qazvin is located 50-north longitude and 36.15 east latitude of the Greenwich meridian. The area of the city is 3701 hectares. At present, Qazvin has three regions areas, with the total population of the three regions equal to 450 thousand (Fig. 2).

Fig. 2
figure 2

Location of Qazvin city in province, county and country

Data and preparation of them

The data used in this research are satellite images of Landsat Qazvin. Figures 3 and 4 show Landsat images in 1996 and 2016. The Specifications of satellite images related to the city of Qazvin are shown in Tables 4, 5.

Fig. 3
figure 3

Image of Landsat Qazvin 1996

Fig. 4
figure 4

Image of Landsat Qazvin _2016

Table 4 Specifications satellite images of Qazvin city (http://glovis.usgs.gov)
Table 5 Specifications DEM Qazvin city

Sprawl study in Qazvin city

Entropy model

Investigation of urban sprawl growth in Qazvin using the holdern model (for the years 1956–2016)

The result of this investigation is in Tables 6, 7, 8.

Table 6 Calculation of Shannon entropy for the year 1996 in Qazvin
Table 7 Calculation of Shannon entropy for the year 2016 in Qazvin
Table 8 The investigation of sprawl growth of Qazvin using Holdern model

According to the result of two Shannon and Holdern Modern, it is observed that Qazvin has grown physically increasingly due to over concentration on population and activity.

Findings from the studying of Qazvin’s documents and reports, natural and artificial factors affecting the physical development of Qazvin

The results of reviewing the documents and reports on the Factors and obstacles of physical expansion in Qazvin are presented in Table 9. Location, studies factors in Qazvin city.

Table 9 Obstacles, directions and reasons for lack of physical development in Qazvin

Studies factors in Qazvin city

Allow construction, and vision and perspective.

Distance from the river, distance from fault, contour line, type of soil, accessibility to urban roads and land prices.

At the beginning of the simulation, urban growth has been done for the year 2016. After the prediction of urban growth by 2016 and comparing it with the current situation’s map of that year, it can be concluded that the urban simulation (for 2016) has a high overlap with the aerial map of that year. In addition, in order to check the accuracy of the model, the Kappa coefficient is used, and the results of that are illustrated in Table 10. Regarding this issue that the value of the Kappa coefficient is 92.3%, it can be stated that the performed simulation is correct for 2016. In the next step, Fig. 5 is used to simulate urban growth by 2026, and the results of the final simulation are shown in figure.

Table 10 The quantitative results of Kappa
Fig. 5
figure 5

Location, studies factors in Qazvin city

Results of kappa

The quantitative results of the model are illustrated Table 10.

How to implement AHP and cellular automate models in Qazvin

First step: At this step, first, layers associated with the effective factors on urban growth are prepared and after repackaging each layer, the weights calculated by the hierarchical analysis process are applied to the layers and overlapped. At this step, the probability of each cell’s status being transformed into a cell-dependent status is rely on the effective factors on urban growth and the effect of each factor on the weight of each cell is considered as weight shape.

This probability is calculated by the following relation where PAHF is Probability of cell change indicates the ‘ in this equation \( s_{n} \) is equaled to \( w_{1} \times s_{1} + w_{2} \times s_{2} + \cdots + w_{n} \times s_{n} \) and \( \sum\nolimits_{1}^{n} {w_{n} s_{n} } \) growth factor is the ‘ value of the class in which the intended cell is growth to the growth factors and \( w_{n} \).

Weight of Eq. (6)

$$ p_{ahf} = 1 - \frac{1}{{\mathop \sum \nolimits_{1}^{n} w_{n} \times s_{n} }} $$
(6)

Second step: At this step, according to the components, the automated cells are divided into four main parts, the cell, the cell status, the neighborhood, the transfer rules and the sub-part of time and at this step the automated cell model is defined as its components.

Third step: This will be calculated by checking the population and the number of cells which built in previous years. Because of using of this limiting factor, this type of automated cells is called as auto-restricted cells.

Fourth step: In the combination part, first determine the cell type and status and accordingly all the calculations which performed in the first and third steps will combine by using the following equation and then calculate with the conversion rate and enter to the model and computer program as a control factor.

$$ {\text{P}}_{\text{total}} = {\text{P}}_{\text{AHP}} \times\Omega $$

Data preparation

In order to calculate the weights and degree of importance of the effective factors on urban growth, a hierarchical analysis process has been used in this study. At first, the matrix of pairwise comparisons of factors to criteria and criteria to each other is formed and criteria scoring to each other and factors to criteria is performed using expert in Delphi technique and using the field of view. For calculating the weight of each factor, the Expert choice software has been used in hierarchical analysis process method. The results are implemented on analytical maps of the effective factors on urban growth (Figs. 6, 7).

Fig. 6
figure 6

Final simulation in Qazvin city

Fig. 7
figure 7

Fuzzy maps provided by the criteria for effective physical development of Qazvin

Then, after calculating the weight of each of the effective factors in urban growth, using the hierarchical process and preparation of layers associated with the factors of growth of layers’ overlap by applying the weight of each layer. Then the weighted probability of transforming the cell status is calculated. The results are prepared in the form of land suitability map and as input to the model. In order to calculate the land conversion rate, population and cell number of the state constructed in different periods, the correlation coefficient value was calculated by deviating from the mean for two population variables and the number of land pixels constructed.

In order to calculate the land conversion rate, population and cell number, the status has been determining at different time periods and then the Pearson correlation coefficient has been calculated by the way of deviating from the mean for the two population variables X and the number of land pixels Y calculated. The calculated value of Pearson’s correlation coefficient was 8.83 which indicating a strong correlation between the two variables. After determining the existence of a direct and strong correlation between the two variables x and y, linear Regression method has been used to predict the number of convertible pixels in different time periods and predicted year.

At the end, the model becomes software codes. Due to the high volume of data and computations in the model designed according to the time and data flow in the model, the formulas and the model work, the computer has been used to implement it. In order to implement the model in the computer, each supply step and model components are separately converted into software codes and then linked together for initial testing and final output. The computer codes of this program are written in MATLAB programming language.

Then the validation and model results are performed. In order to evaluate the validity of the designed model, first the land growth map in 2016 is modeled by using data available in 1996 and to evaluate the degree of agreement of a model with reality, the Kappa coefficient reality, visual matching overlap and observation are used. The calculated kappa coefficient is 0.95.

This value represents a 92% compliance with the modeled reality map which is based on the provided standard by the United States Geological Survey institute that considers the minimum acceptable value for the Kappa coefficient to be 85%. It can be said that the designed model has sufficient reliability and validity to modeling the urban growth in the surrounding lands. According to the acceptable results of the model, it has been used to predict the urban growth in the surrounding lands. Then, all the happened steps in the data of 2016 are implemented as input data and the growth simulation of Qazvin city up to 2026 is performed which the results are presented in map 10.

Conclusion

Nowadays, cities are experiencing rapid urban growth that destroys farmland, the formation of private housing, and the irregular expansion of the city. Expanding cities always changes the use of different lands to urban use. Due to the inevitable changes, awareness of this trend is important for its guidance towards the desired direction. Therefore, there is a fundamental need to realistically predict cities growth in the future, to determine the impact of various planning scenarios on the urban growth process and to answer the questions ((what if) any) questions. Due to the inevitable changes, awareness of this trend is important for its guidance towards the desired direction. For this reason, so far, many models have been created and used to simulate the development of the city and to expand the use of urban lands. In this research, the automated cellular model.

This model uses a suitable method for searching and selecting land for development. The implemented model was applied using data for 2016 to predict the development of 2026 year.

One of the characteristics of this model is its implementation on a wide area with a regional scale. The results of this research revealed that although urban development is really difficult to predict, but with the proper selection of the most important factors affecting it and the use of a model that has high adaptability to the conditions, which it can be more possible to predict the future development and took steps in order to a better management and directing them. Also, this research showed that cellular automate model as a low to high level model can simulate many human decision-making behaviours properly and by obtaining results, shows the general trend of social phenomena such as land use development. In addition, GIS has proven its capabilities as a suitable way to preparing factors, modelling their behaviour and analysing the results from their activities. The model developed in this study is very flexible due to its numerous and effective parameters for adaptation to the environment. Due to its favourable results, it can be used in other areas as well. Finally, the future of urban development was predicted by using the AHP model and automatic cells in the GIS environment. As can be seen in the (Fig. 8), the optimal growth of Qazvin city is projected in the East and West directions. The results of the research help a lot of planners and urban decision makers to understand the perspectives of sustainable urban development.

Fig. 8
figure 8

Proposed map of Qazvin city development