Background

Since the first theoretical approaches to CA-based models for simulating urban sprawl were proposed in 1980s and the first operational urban CA models were applied to real-world urban system in 1990s, lots of studies have been conducted to modify CA structure so as to address complex urban systems and Land use cover changes (LUCC) (Almeida et al. 2008; Batty 1997; Gong et al. 2009; Stevens et al. 2007; Torrens and O'Sullivan 2001; Verburg et al. 2004; Wu 1996, 1998a, b; Yang et al. 2008). The advantages of CA-based models including their simplicity, flexibility and intuitiveness and particularly their ability to incorporate the spatial and temporal dimensions of the processes have been agreed and the studies in this issue are still proliferating.

Temporal and spatial complexities of urban land use change can be well modeled by properly defining transition rules in CA models. To obtain appropriate transition rules, besides the conventional statistical methods (De Almeida and Gleriani 2005; He et al. 2008; Wu 2002), various approaches have also been taken including fuzzy logic (Al-Ahmadi et al. 2009; Al-kheder et al. 2008; Liu and Phinn 2003; Wu 1996 and Wu 1998a), support vector machine (Yang et al. 2008) and Artificial Neural Networks (ANNs) especially Back Propagation (BP) networks (Li and Yeh 2001 and Li and Yeh 2002; Li and Liu 2006; Liu et al. 2008a; Liu et al. 2008b; Pijanowski et al. 2002; Yeh and Li 2003 and Yeh and Li 2004; Wang et al. 2011).

In the most previous studies, land use types of urban and non-urban were simulated (He et al. 2008; Li and Yeh 2001; Li and Liu 2006; Liu et al. 2008a and Liu et al. 2008b; Pijanowski et al. 2002; Wu 2002; Yeh and Li 2003). In recent years, city residents pay more and more attention to their living environment, therefore, in simulation, other land use types especially those with high ecological services such as forest, water body and wetland (Li et al. 2010) should not be generalized as non-urban any more. However, less than 40% among the 33 CA-based urban models reviewed by Santé et al. (2010) were capable of dealing with multiple land uses. On the other hand, land use change results from the complex interaction of many factors including policy, management, economics, culture, human behaviors and the environment. Whether a cell changes its state is not only determined by transition probability but also bears randomness. In this view, transition rules can include a stochastic component and deterministic rules (Santé et al. 2010). The most previous studies considered the deterministic rules by maximum probability. A few of them treated the stochastic component, for instance, Li and Yeh (2002) added a random variable into the probability function in ANN-CA model, Wu (2002) combined Monte Carlo approach with probability generalized by statistical method in urban and non-urban simulations.

With Shenzhen City, a typical rapid urbanization area in China or even in the whole world, as the case, this study aims to simulate multiple land use dynamics by combining Monte Carlo approach to CA-ANN model, and to choose appropriate model parameters with sensitivity analysis on predicating variables, neighborhood size and cell size.

Results and discussions

Model performance

The simulation accuracy was evaluated based on cell-by-cell comparison which was typically used (Li and Yeh 2002; Yang et al. 2008). It evaluates the similarity between the actual and simulated situations at the scale of a single cell. The results of the cell-by-cell comparison with the test cells for three scenarios were given in Table 1.

Table 1 Class kappa coefficients and overall accuracy for each scenarios

For Scenarios I, II and III, the overall accuracy is 81.91%, 83.13% and 85.33% respectively. The kappa coefficients were calculated to quantify the actual degree of agreement (Congalton 1991). The coefficient was 0.71, 0.73 and 0.76 for Scenarios I, II and III in the same order. The similar accuracies for the two periods suggested that the land use change mechanism is relatively stable in this region and that the models trained by different senarios can be used to forecast future land use changes. Yang et al. (2008) used SVM and CA to simulate urban and non-urban land use in Shenzhen with different data sets (1993 and 2004) and reported that the overall accuracy were 87.25% and 84.90% and kappa coefficients were 0.70 and 0.68. Though their overall accuracies are a little higher than our results, but their kappa coefficients are all lower than those of the proposed model.

The actual maps and simulated maps are shown in Figures 1 and 2. Generally, the simulated maps are spatially conformity to the actual maps. The simulated maps in Figure 2 showed that Build-up and Woodland dominated the study area with their areal percentage being both around 44%, while the Water body, Cropland and Wetland occupied around 8%, 2% and 2% of the whole study area. They are generally consistent with the real land use maps. In the Figure 2 (I), the kappa coefficients of the Wetland, Water body and Woodland were all great than 0.70, suggesting these types of land use have been well simulated. In Figure 2 (II), the kappa coefficients of Woodland, Water body and Build-up were all great than 0.70. In Figure 2 (III), the coefficients of Cropland is 0.46 while those of the other four land uses were all greater than 0.70. On the other hand, the random land use type in Figure 2 (II) and Figure 2 (III) have decreased compared to Figure 2 (I).

Figure 1
figure 1

The actual land use map of 1996, 2000 and 2004.

Figure 2
figure 2

The simulated land use maps by three scenarios.

Sensitivity analysis

With the model trained by Scenarios III which produced relatively best performance, the impacts of spatial variables, neighborhood size and cell size were discussed.

Spatial variables

The input layer of BP network contained fifteen neurons associated with the variables, while the output layer contained five neurons associated with the map of transition probabilities for the considered types of land use (Table 2). In order to investigate the effects of spatial variables on model performance, we compared the model fitness of the nine versions of the reduced-variable model with that with the full predictor variables. In Table 3, row “p” represented the model performance with complete predictor variables, while row “pi = 0” represented that with variable “pi” removed. The relative contribution of each predictor variable on the model performance was also ranked according to their differences between goodness of fit for the model with full variables and that for each of the reduced-variable models.

Table 2 Input and output of the BP network
Table 3 Effects of spatial variables on model performance

According to Table 3, the overall accuracy and kappa coefficients with complete predictor variables were higher than those with removed variables, indicating that the variables selected in this study had positive effect on the model performance. The contribution of each predictor variable on the model performance was ranked as: p7 (Proximity to railways) > p13 (Proximity to city center) > p9 (Proximity to roads) > p14 (Proximity to towns) > p10 (Proximity to streets) > p8 (Proximity to expressways) > p12 (Proximity to lakes or reservoirs) > p15 (population density) > p11 (Proximity to rivers). This result showed that Proximities to railways and city center were the main factors which have driven the dynamics of land use. This result was also approved in Shi’s study (Shi et al. 2000).

Neighborhood size

Neighborhood interactions represent one of the main driving factors in a large group of land use change models based on cellular automata (Verburg et al. 2004). In many studies, the size of the neighborhood was chosen arbitrarily and only the direct neighborhood of a location was taken into account (e.g. Von Neumann or Moore neighborhoods), while others have argued that human activities are influenced by wider spaces, this makes an appropriate definition of the neighborhood essential (Verburg et al. 2004; White and Engelen 2000). In this study, three sizes of Moore neighborhoods, 3 × 3, 5 × 5, and 7 × 7, were used to understand the effects of neighborhood size on the model performance.

The simulation map of land use in 2004 was cell-to-cell compared to the actual map to evaluate the fitness of models with different neighborhood sizes. According to Table 4, the overall accuracy and Kappa coefficients increased slightly as the neighborhood size increased. However, this increase is too slight to make a significant impact. Considering the computing time cost, the smaller neighborhood size can be used.

Table 4 Fitness of models with different neighborhood sizes

Cell size

The spatial scale is also an important factor which should be taken into account in the modeling of spatial problems. Thus, the fitness of models for different cell sizes (100 m × 100 m, 250 m × 250 m, and 500 m × 500 m) was evaluated based on cell-to-cell comparison as shown in Table 5.

Table 5 Simulation accuracies with different cell sizes

In Table 5, the individual kappa coefficient for each type of land use increased as the cell size increased, and so did the overall accuracy and kappa coefficients, which indicated that increasing the cell size may slightly improve the fitness of the model. However, it was not favorite for guiding land use planning if the cell size was too large. As a result, a tradeoff was made between model performance and high spatial resolution output maps in actual simulations. In the following simulation, the cell size of 100 m × 100 m was adopted because it will produce more detailed land use maps without apparently lowering predicting accuracy.

Land use simulation

With the map of land use in 2000 used as the input map and with the model parameters selected by the previous section (cell size = 100 m × 100 m, neighborhood size = 3 × 3, 9 GIS layers of predictor variables or constraints), the tested model (Scenario II ) ran for 5 cycles to simulate the land use changes from 2004 to 2020 in Shenzhen in every four years. The areal changing trends of the five land use types is shown in Figure 3.

Figure 3
figure 3

Land use changes simulated from 2004 to 2020.

Figure 3 show that build-up will become the dominating land use type in 2020, its area exceeding any of the other land use types. Decrease of farmland was the most fast, with the rate of 5.52% per year, followed by wetland with the decreasing rate of 2.59% per year. This result is consistent with the results of a prior study in Shenzhen (Bai 2000), in which a System Dynamics (SD) approach was used and predicted that Shenzhen would experience rapid urban growth from 1980 to 2030 and that areas of build-up would increase significantly while cropland would decline to almost zero.

Conclusions

The Monte Carlo approach was introduced to combine with transition probabilities generalized by ANN to decide the states of cells. The modified ANN-CA model proved capable of simulating multiple land use changes in Shenzhen with good performance. The variables selected in this study had positive effects on the model performance, among which the proximities to transportation and to city center were recognized as the major factors which drove the dynamics of land use. Moreover, model performance increased slightly as either neighborhood size or cell size increased. However, a compromise should be made between model performance, provision of spatial information details and computing time. The cell size of 100 m × 100 m and 3 × 3 neighborhood size was chosen for the simulation of land use changes in the study area and land use maps from 2008 to 2020 with 4 year interval were forecasted and showed that the areas of woodland, cropland, wetland and water body tend to decline while the area of built-up tend to increase through 2004 to 2020. These results can be referenced by local decision makers in pursuing ecological and human-centered urbanization.

Methods

Cellular automata in land use simulation

A cellular automata system consists of a regular grid of cells, each of which can be in one of a finite number of possible states, updated synchronously in discrete time steps according to a local, identical interaction rule. The state of a cell is determined by the previous states of a surrounding neighborhood of cells (Wolfram 1984). The principle of cellular automata can be illustrated by Eq. 1:

$$ {S}_{t+1}=f\left({S}_t,N\right) $$
(1)

Where S t and S t+1 represents one of the finite number of possible states at time t and time (t + 1) respectively, N represents the combined states of the neighborhood, and f represents the transition rules.

A basic CA model consists of five fundamental elements: lattice, cell states, neighborhood, transition rules, and time steps. Because of the complexity of urban systems, the framework of basic CA is too simplified and constrained to represent real cities. To simulate complex geographical phenomena such as urban systems, it is necessary do some modification to the basic CA model, especially to modify the elicitation process of transition rules (Santé et al. 2010; Torrens 2000).

The lattice is the space where all of the cells are located. Theoretically, the lattice can be the Euclidean space with any dimension and any geometric shape though the lattice of rectangular grids is widely used.

The cell is the minimum unit in the CA system and it is regularly located on the Euclidean space, with discrete and finite states. To simulate the dynamics of multiple land use types, the state “s” can be defined as s = {1, 2, 3,…, n}, representing n types of land use.

There are three common types of neighborhoods in conventional CA models: Von Neumann, Moore, and Circular neighborhoods (Kocabas and Dragicevic 2006; Liu and Phinn 2003).

The transition rules can be defined as the theoretical functions which determine the state of each cell according to the previous states of its own and its neighbors. Since changes of land use may result from the complex interaction of many factors, including policy, management, economics, culture, human behavior, and the environment (Pijanowski et al. 2002), the classic CA transition rules should be modified. However, it is not possible to recognize and incorporate all of the factors which drive land use changes. In practice, therefore, variables were usually chosen with the help of literature review and with considering the data availability.

ANN-based CA model

When ANNs were integrated to CA models, their predictive abilities can be improved (De Almeida and Gleriani 2005; Li and Yeh 2001; Pijanowski et al. 2002). In this study, the widely used CA-ANN model is modified in three aspects, namely, the multiple land use maps can be input and predicted, a threshold parameter is introduced to make the model controllable and the Monte Carlo approach is introduced to determine the final state of the cell. The learning aspect was based on a BP network involving training and testing with different datasets. Figure 4 illustrates the framework of the proposed model.

Figure 4
figure 4

Framework of the model (modified on Li and Yeh 2002 ).

Monte Carlo

Since the process of land use change in the real world is complex and stochastic, the deterministic transition rules in conventional CA models were modified to probability functions (Eq. 2), and the converted states of each cell in the next step were obtained using the Monte Carlo approach (Eq. 3).

$$ {S}_i^{t+1}=\left\{{p}_1,{p}_2,\dots, {p}_j,\dots, {p}_n\right\} $$
(2)
$$ S{\prime}_i^{t+1}= MonteCarlo\left({S}_i^{t+1}\right) $$
(3)

p j is the probabilities that a certain cell would transition to jth land use type. A random number θ between 0 and 1 was obtained in the Monte Carlo approach.

For the state of a cell can be determined by the following rules, given p i > up, i ϵ{1,2,3,…, n}.

$$ S{\prime}_i^{t+1}= MonteCarlo\left({S}_i^{t+1}\right)=1\;\mathrm{if}\;0\le \theta <{p}_1 $$
(4)
$$ S{\prime}_i^{t+1}= MonteCarlo\left({S}_i^{t+1}\right)=j\ \mathrm{if}\;{\displaystyle {\sum}_{k=1}^{j-1}{p}_k\le \theta <{\displaystyle {\sum}_{k=1}^j{p}_k}} $$
(5)

When combining Monte Carlo approach with transition potentials generalized by ANN, this study introduced a parameter “up” to make the model controllable. This parameter controls the number of cells on which Monte Carlo approach will be used. For all the land use types, if the transition probability generalized by BP network to one land use type of a cell is smaller than “up”, then the state of the cell would remain the same in the iterative cycle. Otherwise, the state of the cell was then determined by Monte Carlo approach. Thus, if “up” is set zero, all of the cells will be treated according to the rules of combination of Monte Carlo approach and transition probabilities. If “up” is set one, none of the cells will be converted. This process was iterated and land use map was predicted. Since only those cells with transition probabilities larger than “up” would be treated by the Monte Carlo approach and transition probabilities, as the value of “up” increases, the effect of Monte Carlo approach will wane, the output land use map will have less insular land use patches mainly near the boundary between land use types. Theoretically, if a cell has the same possibilities of changing into (or remaining unchanged) other land use types, the probability value should be the reciprocal of the number of land use types. To compromise, a value around the reciprocal of the number of land use types is recommended for “up”. The exact value of “up” can be set by trial simulations for a certain study area.

CA will evolve in a sequence of discrete time steps. The real time of each step can be calibrated by historical data. Suppose that the periods of input and output data for CA training be T1 and T2, respectively. The interval “N” between two consecutive time steps can be calculated: N = T2-T1. Thus, the sequent time of each step can be calculated: T3 = T2 + N, T4 = T3 + N, T5 = T4 + N….After the structure, parameter, and transition rules are defined, the CA model will evolve dynamically.

The proposed model was implemented through the programming language of Visual Basic (VB) 6.0 and the technology of Component of Object Model (COM) on the platform MATLAB 6.5.

The study area

Shenzhen (22°27′ N ~22°52′ N and 113°46′ E ~ 114°37′ E) is a coastal city in the Pearl River Delta Region, southern China and locates directly north of Hong Kong (Figure 5). The total terrestrial area of Shenzhen is 0.19 million ha, and the coastline of the city totals 229.96 kilometers. Shenzhen has a mild, subtropical maritime climate with plenty of rain and sunshine. Since it was established in 1979 as the China’s first Special Economic Region, Shenzhen has experienced rapid urbanization and economic development, and its land use structure has experienced, and continues to experience dramatic changes.

Figure 5
figure 5

The study area.

Data sets and preprocessing

The spatial data selected for this study consisted of land use datasets (on a scale of 1:50,000) in the year of 1996, 2000 and 2004, transportation network maps, river system and lake distribution maps, and administrative borderline maps (including the city center, towns, and residential blocks). These datasets were officially acquired mainly by aerial photograph interpretation aided by ground surveys and historical land use maps. In the data preparation, the same data interpretation methods and the same classification categories were adopted. The data were then edited, calibrated, and converted into raster data in ArcGIS for the subsequent spatial analysis.

The original land use categories included two levels of Chinese National Standard Categories. In this study, the datasets were reclassified into five categories: woodland, cropland, wetland, water body, and build-up. Therefore, the values of raster grids were coded as 1, 2, 3, 4, and 5, representing the five land use types in the order mentioned above.

Moore neighborhood, which is the most frequent used, was adopted in this case study.

To build the transaction rules, variables were chosen with the help of literature review (Kocabas and Dragicevic 2007; Santé et al. 2010; Singh 2003; Shi et al. 2000; Yang et al. 2008) with considering the data availability. They are Tr0 (proximity to railways), Tr1 (proximity to expressways), Tr2 (proximity to roads), Tr3 (proximity to streets), C (proximity to city centers), T (proximity to towns), and P (population density). Because major rivers and lakes are ecologically sensitive areas in the study area, they should be protected and any developing activities should not be restricted in these areas. To this end, two proximities, R (proximity to rivers) and L (proximity to lakes), were selected as constraints of the model. The modified CA rules can be described by Eq. 6. As these spatial variables range extensively, they tend to be normalized to the same data range (e.g. integer values from 1 to 9).

$$ {S}_i^{t+1}=f\left({S}_i^t,{N}_i^t,Tr{0}_i^t,Tr{1}_i^t,Tr{2}_i^t,Tr{3}_i^t,{R}_i^t,{L}_i^t,{C}_i^t,{T}_i^t,{P}_i^t\right) $$
(6)

i refers to cell “i”, t refers to time steps, S refers to cell states, and N refers to the combined states of the neighborhood, i.e. the pixel numbers of each land use category in the whole neighborhood.

For the eight proximity variables, the minimum Euclidean distance to each feature was calculated in ArcGIS and eight raster maps were produced. Since it was highly difficult to acquire the population for each cell, it was assumed that population locations complied with residential blocks. Thus, the variable of population density was obtained indirectly by calculation of residential block density based on the Kernel density. As these proximity variables range extensively and have different dimensions, these proximity and density variables were reclassified into 9 classes by Natural Breaks in ArcGIS and denoted by nine integers from 1 to 9.

Since only those cells with transition probabilities larger than up would change their states in the next step through the Monte Carlo approach, the bigger value of up would produce fewer fragments while reduce the effect of Monte Carlo approach. Compromise is necessary, as above discussed. Because 5 land use types are simulated in this study, the value of up can be around 0.2, after several trial simulations, up was set to 0.15.

Model training

The CA transition potentials were learned through a BP network with training data. In this study, the proposed BP network had four layers: one input layer, two hidden layers, and one output layer. The neurons of input layer and output layer were define in Table 2. Through trials and experiments, the number of neurons in the first and second hidden layers was 15 and 8 respectively. Since the output of the neural network represented the transition probabilities for land use types, the Log-Sigmoid transfer function was adopted in order to guarantee of the output falling into the close range of [0, 1].

It is more appropriate to normalize and convert the input data into the close range of [0,1] using Eq. 7 since this scaling procedure makes the input values compatible with the Sigmoid activation function that produces a value between 0 and 1 (Gong 1996).

$$ {P}_i^{\hbox{'}}=\frac{P_i-{P}_{\min }}{P_{\max }-{P}_{\min }} $$
(7)

The land use maps of 1996, 2000 and 2004 were combined into two groups, namely, from 1996 to 2000 (Group A) and from 2000 to 2004 (Group B). The neighborhood size was set as 3 × 3, and the cell size was set to 100 m × 100 m. There are 194,989 cells in each map, of which 140,852 were chosen by stratified random sampling for training and the rest 54,137 cells were used for model test.

The algorithm of scaled conjugate gradient (SCG) (Moller 1993) was used to conduct the training. After about 2,000 running cycles, the ANN was stabilized and the error was reduced to 0.049 and 0.038 for Group A, B respectively (Figure 6).

Figure 6
figure 6

Training cycles and prediction error for two training groups (panel A for Group A and panel B for Group B).