Skip to main content

Dynamic Agent Based Simulation of an Urban Disaster Using Synthetic Big Data

  • Chapter
  • First Online:

Part of the book series: Springer Geography ((SPRINGERGEOGR))

Abstract

This paper illustrates how synthetic big data can be generated from standard administrative small data. Small areal statistical units are decomposed into households and individuals using a GIS buildings data layer. Households and individuals are then profiled with socio-economic attributes and combined with an agent based simulation model in order to create dynamics. The resultant data is ‘big’ in terms of volume, variety and versatility. It allows for different layers of spatial information to be populated and embellished with synthetic attributes. The data decomposition process involves moving from a database describing only hundreds or thousands of spatial units to one containing records of millions of buildings and individuals over time. The method is illustrated in the context of a hypothetical earthquake in downtown Jerusalem. Agents interact with each other and their built environment. Buildings are characterized in terms of land-use, floor-space and value. Agents are characterized in terms of income and socio-demographic attributes and are allocated to buildings. Simple behavioral rules and a dynamic house pricing system inform residential location preferences and land use change, yielding a detailed account of urban spatial and temporal dynamics. These techniques allow for the bottom-up formulation of the behavior of an entire urban system. Outputs relate to land use change, change in capital stock and socio-economic vulnerability.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   279.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    While not calling this ‘big data’ as such, Torrens (2014) notes that the volume of locations/vectors to resolve for each object moved in the simulation is of the order of 1012–1014.

  2. 2.

    A statistical area (SA) is a uniform administrative spatial unit defined by the Israeli Central Bureau of Statistics (CBS) corresponding to a census tract. It has a relatively homogenous population of roughly 3000 persons. Municipalities of over 10,000 population are subdivided into SA’s.

  3. 3.

    We also use coarser, regional data on non-residential plant and equipment stock to calculate non-residential building value. The estimating procedure for this data is presented elsewhere (Beenstock et al. 2011).

  4. 4.

    Calculated from 2012 immigration data in the Statistical Yearbook for Jerusalem 2014 (Jerusalem Institute for Israel Studies).

  5. 5.

    The algorithm works as follows: at each step, junctions adjacent to the current junction are scanned and the junction with the shortest aerial distance to the destination is flagged as the current junction for the next step. If a loop is encountered, it is deleted from the path. The algorithm ends when agents arrive at the junction closest to the destination or when all junctions accessible from the origin are scanned.

  6. 6.

    Social vulnerability by household (V hh ) is defined as: \( {V}_{hh}=0.5*{Z}_{i_{hh}}-0.2*{Z}_{ag{e}_{hh}}-0.2*{Z}_{\%di{s}_{hh}}+0.1*{Z}_{car} \) where: Z is the normalized value of a variable, i is household income for household hh, age is the average age group of members of household hh, %dis is the percent of disabled members of all members in household hh, car is car ownership for household hh.

References

  • Batty M, Hudson-Smith A, Milton R, Crooks A (2010) Map mashups, Web 2.0 and the GIS revolution. Ann GIS 16(1):1–13

    Article  Google Scholar 

  • Beenstock M, Felsenstein D, Ben Zeev NB (2011) Capital deepening and regional inequality: an empirical analysis. Ann Reg Sci 47:599–617

    Article  Google Scholar 

  • Chang SE (2003) Transportation planning for disasters: an accessibility approach. Environ Plan A 35(6):1051–1072

    Article  Google Scholar 

  • Chen X, Zhan FB (2008) Agent-based modeling and simulation of urban evacuation: relative effectiveness of simultaneous and staged evacuation strategies. J Oper Res Soc 59:25–33

    Article  Google Scholar 

  • Chen X, Meaker JW, Zhan FB (2006) Agent-based modeling and analysis of hurricane evacuation procedures for the Florida Keys. Nat Hazards 38:321–338

    Article  Google Scholar 

  • Crooks AT, Wise S (2013) GIS and agent based models for humanitarian assistance. Comput Environ Urban Syst 41:100–111

    Article  Google Scholar 

  • Dawson RJ, Peppe R, Wang M (2011) An agent-based model for risk-based flood incident management. Nat Hazards 59:167–189

    Article  Google Scholar 

  • Eicher CL, Brewer CA (2001) Dasymetric mapping and areal interpolation: implementation and evaluation. Cartogr Geogr Inf Sci 28(2):125–138

    Article  Google Scholar 

  • Felsenstein D, Lichter M (2014) Social and economic vulnerability of coastal communities to sea level rise and extreme flooding. Nat Hazards 71:463–491

    Article  Google Scholar 

  • Fiedrich F, Burghardt P (2007) Agent-based systems for disaster management. Commun ACM 50(3):41–42

    Article  Google Scholar 

  • Grinberger AY, Felsenstein D (2014) Bouncing back or bouncing forward? Simulating urban resilience. Urban Des Plan 167(3):115–124

    Google Scholar 

  • Grinberger AY, Felsenstein D (2017) A tale of two earthquakes: dynamic agent-based simulation of urban resilience. In: Lombard J, Stern E, Clarke G (eds) Applied spatial modeling and planning. Routledge, Abingdon, forthcoming

    Google Scholar 

  • Harper G, Mayhew L (2012a) Using administrative data to count local populations. J Appl Spat Anal Policy 5(2):97–122

    Article  Google Scholar 

  • Harper G, Mayhew L (2012b) Re-thinking households—using administrative data to count and classify households with some applications. Actuarial research paper, no. 198. Cass Business School, London

    Google Scholar 

  • Keon D, Steinberg B, Yeh H, Pancake CM, Wright D (2014) Web-based spatiotemporal simulation modeling and visualization of tsunami inundation and potential human response. Int J Geogr Inf Sci 28(5):987–1009

    Article  Google Scholar 

  • Kwan MP, Lee J (2005) Emergency response after 9/11: the potential of realtime 3D GIS for quick emergency response in micro-spatial environments. Comput Environ Urban Syst 29:93–113

    Article  Google Scholar 

  • Lichter M, Felsenstein D (2012) Assessing the costs of sea level rise and extreme flooding at the local level: a GIS-based approach. Ocean Coast Manag 59:47–62

    Article  Google Scholar 

  • Linard C, Gilbert M, Tatem AJ (2011) Assessing the use of global land cover data for guiding large area population distribution modeling. GeoJournal 76(5):525–538

    Article  Google Scholar 

  • Macal CM, North MJ (2005) Tutorial on agent-based modeling and simulation. In: Proceedings of the 37th conference on winter simulation, WSC’05. pp 2–15

    Google Scholar 

  • Mennis J (2003) Generating surface model of population using dasymetric mapping. Prof Geogr 55(1):31–42

    Google Scholar 

  • Ngo TA, See L (2012) Calibration and validation of agent based models of land cover change. In: Heppenstall AJ, Crooks AT, See LM, Batty M (eds) Agent-based model of geographical systems. Springer, Dordrecht, pp 191–199

    Google Scholar 

  • Ogawa Y, Akiyama Y, Kanasugi H, Sengoku H, Shibasaki R (2015) Evaluating the damage of great earthquakes in aggregate units based on detailed population distribution for each time frame. Paper presented at the 14th international conference on computers in urban planning and urban management (CUPUM 2015), Boston MA. http://web.mit.edu/cron/project/CUPUM2015/proceedings/Content/index.html

  • Oliveira MGS, Vovsha P, Wolf J, Birotker Y, Givon D, Paasche J (2011) Global positioning system-assisted prompted recall household travel survey to support development of advanced travel model in Jerusalem, Israel. Transp Res Rec J Transp Res Board 2246(1):16–23

    Article  Google Scholar 

  • Rashed T, Weeks J, Couclelis H, Herold M (2007) An integrative GIS and remote sensing model for place-based urban vulnerability analysis. In: Mesev V (ed) Integration of GIS and remote sensing. Wiley, New York, pp 199–231

    Google Scholar 

  • Reibel M, Bufalino ME (2005) Street-weighted interpolation techniques for demographic count estimation in incompatible zone systems. Environ Plan A 37:127–139

    Article  Google Scholar 

  • Salamon A, Katz O, Crouvi O (2010) Zones of required investigation for earthquake-related hazards in Jerusalem. Nat Hazards 53(2):375–406

    Article  Google Scholar 

  • Schettkat R (1996) Flows in labor markets: concepts and international comparative results. In: Schettkat R (ed) The flow analysis of labor markets. Routledge, London, pp 14–36

    Chapter  Google Scholar 

  • Simoes JA (2012) An agent based/ network approach to spatial epidemics. In: Heppenstall AJ, Crooks AT, See LM, Batty M (eds) Agent-based models of geographical systems. Springer, Dordrecht, pp 591–610

    Chapter  Google Scholar 

  • Torrens PM (2014) High-resolution space-time processes for agents at the built-human interface of urban earthquakes. Int J Geogr Inf Sci 28(5):964–986

    Article  Google Scholar 

  • Zhu Y, Ferreira J Jr (2015) Data integration to create large-scale spatially detailed synthetic populations. In: Geertman S, Ferreira J Jr, Goodspeed R, Stillwell J (eds) Planning support systems and smart cities. Springer, Dordrecht, pp 121–141

    Chapter  Google Scholar 

  • Zou Y, Torrens PM, Ghanem RG, Kevrekidis IG (2012) Accelerating agent-based computation of complex urban systems. Int J Geogr Inf Sci 26(10):1917–1937

    Article  Google Scholar 

Download references

Acknowledgements

This research is partially based on work done in the DESURBS (Designing Safer Urban Spaces) research project funded by the European Commission FP7 Program under Grant Agreement # 261652. The authors thank the JTMT for granting access to the HTS database.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Yair Grinberger .

Editor information

Editors and Affiliations

Appendices

Appendix 1: Data Disaggregation Method

This appendix describes the disaggregation procedure of spatially aggregated alpha-numeric real-estate values and populations and their socio-economic attributes into discrete spatial units at the building level. It then proceeds to describe the validation of this allocation method.

The number of floors in residential buildings (F R ), is calculated by dividing building height by average floor height of 5 m:

$$ {F}_R = \frac{H_B}{5} $$

In the case of non-residential buildings, the number of floors (F N ) is estimated as the building height divided by average floor height of 5 m:

$$ {F}_N = \frac{H_B}{7} $$

Floor space for each building (S B ) is then calculated by multiplying the number of floors in each building by its polygon area representing roof space:

$$ {S}_B={S}_R\times F $$

where:

  • S R  = Building polygon footprint

  • F = Building number of floors

The GIS buildings layer and building type serve as the basis for the calculation of residential building value, non-residential building and equipment value. To create estimates of residential building value we use average house prices per m2 2008–2013 (in real 2009 prices). In cases where no transactions exist in a specific SA over that period we use a regional estimate for residential property prices.

  1. 1.

    Value of residential buildings (P BR ) is calculated as follows:

    $$ {P}_{BR}={P}_{SR}\times {S}_{BR} $$

    where:

    P SR  = Average SA price per m2

    S BR  = Residential building floor space.

  2. 2.

    Value of Non residential buildings is calculated as follows:

    Non residential value per m2 by region (P RN ):

    $$ {P}_{RN} = \frac{V_{RN}}{S_{RN}} $$

    where:

    S RN  = Total regional non residential floor space.

    V RN  = Total regional non residential building stock

    Non residential building value per m2 for each region is multiplied by the floor space of each non-residential building to produce non-residential building values (P BN ):

    $$ {P}_{BN}={P}_{RN}\times {S}_{BN} $$

    where:

    S BN  = non residential building floor space.

    Regional non-residential stock estimates have been calculated for nine aggregate regions elsewhere (Beenstock et al. 2011).

  3. 3.

    Value of Equipment and Machinery (P RE ) is calculated as follows:

    $$ {P}_{RE} = \frac{V_{RE}}{S_{RN}} $$

    where:

    V RE  = Total regional non residential equipment stock

    The equipment stock per m2 for each region is multiplied by the floor space of each non-residential building to produce equipment stock totals by building (P BE ):

    $$ {P}_{BE}={P}_{RE}\times {S}_{BN} $$

    where:

    S BN  = non residential building floor space.

    The source for regional estimates of regional equipment and machinery stock is as above (Beenstock et al. 2011).

    The buildings layer also allows for the spatial allocation of aggregated households, and population counts (see Table 1) into building level households and inhabitant totals. Given the existence of these spatial estimates, the distribution of aggregate average monthly earnings, participation in the work force, employment sector, disabilities and age (see Table 1) into a building level distribution is implemented.

  4. 4.

    Household density by SA (households per m 2) of residential floor space in a statistical area (H SR ) is calculated as follows:

    $$ {H}_{SR} = \frac{H_S}{S_{SR}} $$

    where:

    H S  = Total population per statistical area (IV in Table 1).

    S SR  = Total statistical area residential floor space.

    The number of households per building (H B ) is calculated as follows:

    $$ {H}_B={H}_{SR}\times {S}_{BR} $$
  5. 5.

    Average number of inhabitants per m 2 of residential floor space in a statistical area (I SR ) is calculated as follows:

    $$ {I}_{SR} = \frac{I_S}{S_{SR}} $$

    where:

    S SR  = Total statistical area residential floor space.

    I S  = Total population per statistical area.

    Population counts per building (I B ) are then calculated as follows:

    $$ {I}_B={I}_{SR}\times {S}_{BR} $$
  6. 6.

    Total earnings per building (M B ) is calculated as follows:

    $$ {M}_B={M}_{SI}\times {H}_B $$

    where:

    M SI  = Average monthly earnings per household by SA.

    H B  = Total number of households in a building.

  7. 7.

    Number of inhabitants in each building participating in the labor force 2008 (I W ) is calculated by multiplying the number of inhabitants in a building by the labor participation rate in the corresponding SA.

    $$ {I}_W={W}_S\times {I}_B $$

    where:

    W S  = % of inhabitants participating in the labor force in an SA

    I B  = Population count per building

  8. 8.

    Number of inhabitants per building by employment sector (I O ) is calculated by multiplying the percentage of inhabitants employed by sector (commercial, governmental, industrial or home-based) per statistical area by the number of inhabitants in each building.

    $$ {I}_O={O}_S\times {I}_B $$

    where:

    O S  = % of inhabitants employed in an employment category.

    I B  = Population counts per building

  9. 9.

    Number of disabled inhabitants in each building (I D ) is calculated by multiplying the number of inhabitants in a building by the percentage of disabled in the corresponding SA.

    $$ {I}_D={D}_S\times {I}_B $$

    where:

    D S  = % of disabled inhabitants in an SA

    I B  = Population count per building

  10. 10.

    Number of inhabitants in each age category (I A ) is calculated by multiplying the number of inhabitants in a building by the percentage of inhabitants in each age category in the corresponding SA.

    $$ {I}_A={A}_S\times {I}_B $$

    where:

    A S  = % of inhabitants in each age group category in an SA

    I B  = Population count per building.

1.1 Validation of Data Disaggregation

To validate the data disaggregation procedure, we compare population counts and demographic attributes of our synthetic distribution with the Household Travel Survey (HTS) conducted in 2010 by the Jerusalem Transport Masterplan Team (JTMT; Oliveira et al. 2011). The idea behind the validation procedure is to re-aggregate the discrete counts yielded by our distribution to the spatial units used by the survey. The survey uses Aggregated Transportation Area Zones (ATAZs) which have a different spatial configuration to those of the SAs on which our distribution is predicated.

The HTS is based on representative samples of the population in the Jerusalem metropolitan area. It uses an expansion factor by which the population and attributes of each household are multiplied to replicate their representation in the ATAZs.

ATAZs covered by the HTS but not fully covered by our distribution (for example, ATAZs in East Jerusalem) are excluded from the analysis. This is because the HTS data are aggregates and we cannot account for their distribution within the ATAZ, as we can with our discrete data. Note also that some ATAZs are fully sampled in the HTS and others contain much lower sampling coverage. This can cause misrepresentation of populations and demographics in some ATAZs. This is taken into account in the validation procedure. Validation is performed for the following attributes: number of households, number of persons, persons ages 0–18, 19–64 and number of cars.

Table 2 presents the validation results. MAPE statistics are calculated for two subsets of the original ATAZ dataset. Results are compared between our synthetic distribution with those ATAZs with complete coverage and those with a sampling coverage of 70+%. Mean errors across the two sources are within the range of 18–25 %.

Table 2 Validation results for key variables

Appendix 2: Behavioral Rules for the ABM

  1. 1.

    Residential Location Choice is derived as follows:

    $$ {h}_h={b}_j\Rightarrow \left[\frac{I_h}{3}>H{P}_j\right]\ast \left[{k}_h>S\left({b}_j\right)\right]=1 $$

    where:

    h h is the new residential location for household h,

    b j is the building considered,

    \( \left[\right] \) is a binary expression with value of 1 if true and 0 otherwise,

    I h is household h’s monthly income,

    HP j is monthly housing cost of an average apartment in building j,

    k h is a random number between [0,1] indicating tolerance to change in residential environment incurred by relocation,

    S(b j ) is a similarity score for building j in relation to current place of residence, calculated as follows:

    $$ S\left({b}_j\right)=\frac{\Phi \left(\frac{{\overline{I}}_j-{\overline{I}}_h}{I_{\sigma_h}}\right)+\Phi \left(\frac{{\overline{A}}_j-{\overline{A}}_h}{A_{\sigma_h}}\right)}{2} $$

    where:

    Φ is the standard normal cumulative probability function,

    Ī j , Ā j are the average household income and average age of individuals in building j, respectively

    Ī h , Ā h are average household income and average age of individuals in residential buildings within 100 m of current residential location of household h,

    \( {I}_{\sigma_h} \), \( {A}_{\sigma_h} \) are standard deviations of household income and of resident age in residential buildings within 100 m from current home location of household h, respectively.

  2. 2.

    Choice of sequence of activities: occurs in two stages. First, the number of activities is fixed and then activities are allocated to locations:

    $$ \begin{array}{ll}\hfill & \#A{c}_i=\left\Vert a*\left(\frac{k_i}{0.5}\right)*\left(1+ca{r}_h*0.33\right)*\left(1-di{s}_i*0.33\right)*\left(1+\left[ ag{e}_i=2\right]*0.33\right)\right.\\ {}& *\left.\left(1-\left[ ag{e}_i\ne 2\right]*0.33\right)\right\Vert + employe{d}_i* her{e}_i\hfill \end{array} $$

    where:

    # Ac i is the number of activities for resident i,

    k i is a randomly drawn number between [0, 1] reflecting preferences regarding number of activities,

    car h is a binary variable equal to 1 if the household h owns a car and 0 otherwise,

    dis i is a binary variable equal to 1 if individual i is disabled and 0 otherwise,

    age i is the age group of individual i,

    employed i is a binary variable equal to 1 if i is employed and 0 otherwise,

    here i is a binary variable equal to 1 when i’s workplace is located within the study area and 0 otherwise,

    x‖ indicates the nearest integer number to x,

    a is the average number of activities based on employment status; equals 2.5 for employed residents and 3 for non-employed.

    $$ {a}_{t+1,i}={b}_j\Rightarrow \left\lfloor {b}_j\ne {a}_{t,i}\right\rfloor *\left\lfloor {k}_i\ge Att\left({b}_j\right)\right\rfloor =1 $$

    where:

    a t,i is the current location of individual i,

    \( {a}_{t+1,i} \) is the next location of activity of individual i,

    k i is a randomly drawn number between [0, 1] reflecting activity location preferences,

    Att(b j ) is the attractiveness score for building j, calculated as follows:

    $$ \begin{array}{l}Att\left({b}_j\right)\\ {}=\frac{1-\raisebox{1ex}{$\Sigma {E}_j$}\!\left/ \!\raisebox{-1ex}{$\Sigma {B}_j$}\right.+1-\raisebox{1ex}{${D}_{ij}$}\!\left/ \!\raisebox{-1ex}{$ \max {D}_i$}\right.*\left(1+0.33*\left(-ca{r}_h+di{s}_i+\left[ ag{e}_i=3\right]\right)\right)+\left[L{U}_j=non\mathrm{R}\mathrm{e}s\right]*\raisebox{1ex}{$F{S}_j$}\!\left/ \!\raisebox{-1ex}{$ \max FS$}\right.}{2+\left[L{U}_j=non\mathrm{R}\mathrm{e}s\right]}\end{array} $$

    where:

    ΣE j is the number of non occupied buildings within a 100 m buffer of building j,

    ΣB j is the number of all buildings within a 100 m buffer of building j,

    D ij is the distance of building j from the current location of individual i,

    max D i is the distance of the building within the study area furthest away from the current location of individual i,

    LU j is the land-use of building j,

    nonRes is non-residential use,

    FS j is the floor-space volume of building j,

    maxFS is the floor-space volume of the largest non-residential building within the study area.

  3. 3.

    Choice of workplace location is calculated similarly to the choice of activity location:

    $$ W{P}_i={b}_j\Rightarrow \left[L{U}_j= EL{U}_i\right]*\left[{k}_i>\frac{\raisebox{1ex}{${D}_{ij}$}\!\left/ \!\raisebox{-1ex}{$ \max {D}_i$}\right.+1-\raisebox{1ex}{$F{S}_j$}\!\left/ \!\raisebox{-1ex}{$ \max FS$}\right.}{2}\right]=1 $$

    where:

    WP i is the workplace location of individual i,

    ELU i is the employment-sector-related land-use for individual i,

    k i is a randomly drawn number between [0, 1] representing workplace location preferences,

    D ij is the distance between building j and individual i’s place of residence,

    max D i is the distance of the building within the study area furthest away from individual i’s place of residence.

  4. 4.

    Building values and the monthly cost of living in a dwelling unit are derived in a 3-stage process. First, daily change in average house price per SA is calculated. Then, values of individual buildings are derived and finally the price of the single, average dwelling unit is calculated. For non-residential buildings, the calculation of individual building values is similar.

    $$ \begin{array}{l}AH{P}_{z,t+1}=AH{P}_{z,t}*\left(1+ \log \left(\frac{\raisebox{1ex}{$po{p}_{z,t+1}$}\!\left/ \!\raisebox{-1ex}{$po{p}_{z,t}$}\right.+\raisebox{1ex}{$re{s}_{z,t}$}\!\left/ \!\raisebox{-1ex}{$re{s}_{z,t+1}$}\right.+\raisebox{1ex}{$n\mathrm{R}\mathrm{e}{s}_{z,t+1}$}\!\left/ \!\raisebox{-1ex}{$n\mathrm{R}\mathrm{e}{s}_{z,t}$}\right.}{3}\right)\right)\hfill \\ {}ANR{V}_{z,t+1}=ANR{V}_{z,t}*\left(1+ \log \left(\frac{n\mathrm{R}\mathrm{e}{s}_{z,t}}{n\mathrm{R}\mathrm{e}{s}_{z,t+1}}\right)\right)\hfill \end{array} $$

    where:

    AHP z,t is average housing price per meter in SA z at time t,

    pop z,t is population in SA z at time t,

    res z,t is the number of residential buildings in SA z at time t,

    nRes z,t is the number of non-residential buildings in SA z at time t,

    ANRV z,t is the average non-residential value per meter in SA z at time t,

    $$ \begin{array}{c}H{P}_{j,t}=AH{P}_{z,t}*F{S}_j*\raisebox{1ex}{$S{L}_{j,t}$}\!\left/ \!\raisebox{-1ex}{$S{L}_{z,t}$}\right.\\ {}{V}_{j,t}=ANR{V}_{z,t}*F{S}_j\end{array} $$

    where:

    HP j,t is the house price of a dwelling unit in building j at time t,

    SL s,t is the service level within area s at time t—the ratio of non-residential buildings to residential buildings in this perimeter,

    V j,t is the non-residential value of building j.

    $$ {P}_{du,t}=\frac{{\overline{I}}_t*\left(1+\frac{\raisebox{1ex}{$H{P}_{j,t}$}\!\left/ \!\raisebox{-1ex}{$\Sigma A{p}_j$}\right.-\raisebox{1ex}{${\displaystyle \sum_{l=1}^{L_t}H{P}_{l,t}}$}\!\left/ \!\raisebox{-1ex}{${\displaystyle \sum_{l=1}^{L_t}\Sigma A{p}_l}$}\right.}{P_{\sigma_t}}\right)}{c} $$

    where:

    P du,t is the monthly cost of living in dwelling unit du at time t,

    Ī t is the average household income in the study area at time t,

    ΣAp is the number of dwelling units within a building. If the building is initially of residential use, this is equal to its initial population size, otherwise it is the floor-space volume of the building divided by 90 (assumed to be average dwelling unit size in meters),

    L t is the number of residential buildings in the study area at time t,

    \( {P}_{\sigma_t} \) is the standard deviation of dwelling unit prices within the study area at time t,

    c is a constant.

  5. 5.

    Land-use changes, from residential to commercial and from commercial to unoccupied are based on the congruence between the building floor-space volume and the average intensity of traffic on roads within a 100 m radius over the preceding 30 days. Both these values are compared with the (assumed) exponential distribution of all values in the study area. This is done by computing the logistic probability of the relative difference in their locations in the distribution:

    $$ {P}_{j,t}\left(\Delta {x}_{j,t}\right)=\frac{e^{-\Delta {x}_{j,t}}}{1+{e}^{-\Delta {x}_{j,t}}} $$
    $$ \Delta {x}_{j,t}=\frac{z_{T{R}_{j,t}}-{z}_{F{S}_{t,t}}}{\left|{z}_{FS}\right|} $$
    $$ {z}_{y_{t,j}}=\frac{e^{-\raisebox{1ex}{${y}_{j,t}$}\!\left/ \!\raisebox{-1ex}{${\overline{y}}_t$}\right.}-{e}^{-\raisebox{1ex}{${y}_{me{d}_t}$}\!\left/ \!\raisebox{-1ex}{${\overline{y}}_t$}\right.}}{{\overline{y}}_t} $$

    where:

    P j,t is the probability of land-use change for building j at time t,

    Δx j,t is the relative difference in position of traffic load and floor-space for building j at time t,

    z yj,t is the position of value y in the exponential distribution, relative to the median for building j at time t,

    \( \frac{e^{-\raisebox{1ex}{${y}_{j,t}$}\!\left/ \!\raisebox{-1ex}{${\overline{y}}_t$}\right.}}{{\overline{y}}_t} \) is the exponential probability density value for \( y\left(\frac{1}{{\overline{y}}_t}={\widehat{\lambda}}_t\right) \) for building j at time t,

    y med t is the median of y at time t.

    If P > 0.99 for residential use, it changes to commercial. If the value is in the range [P(1) − 0.01, P(1)] for commercial uses, the building becomes unoccupied. This functional form and criteria values reduce the sensitivity of large commercial uses and small residential uses to traffic volume. Consequently, the process of traffic-related land-use change is not biased by a tendency to inflate initial land uses.

  6. 6.

    Earthquake impact is calculated as follows:

    $$ {\mathrm{Im}}_j=\frac{c*{10}^{mag}}{D_j*\left| \log \left({D}_j\right)\right|*{F}_j} $$

    where:

    Im j is the impact building j suffers, c is a constant, mag is the earthquake magnitude (similar to Richter scale), D j is distance of building j from the earthquake epicenter, F j is number of floors in building j.

Appendix 3: Sensitivity Analysis

The hard-to-predict nature of model dynamics is a substantive issue in agent-based simulation (Ngo and See 2012). To test for parameter stability, we test two key model parameters:

  1. 1.

    Land-use sensitivity (‘lu’)—in this scenario, the sensitivity of actual and potential commercial land-use to traffic volumes is set in relation the nearest road and not all roads within 100 m as in the baseline case.

  2. 2.

    Housing budget distribution (‘hb’)—instead of defining a rigid parameter share for housing service costs out of total monthly income (0.33), variation within the willingness to pay for housing parameter is allowed. A unique value is drawn for each household from a normal distribution centered on the average expenditure on housing in Israel (23.4 % of total income), see http://www1.cbs.gov.il/reader/newhodaot/hodaa_template_eng.html?hodaa=201415290.

Each scenario is simulated 25 times with extreme results discarded (two cases for the lu scenario and one for the hb scenario). All other parameters remain unchanged. To obtain high resolution results we compute the most frequent land-use and the average residential value, non-residential value and vulnerability index for each building at discrete time points. For land-use we compute the share of times a different land-use is registered in relation to the baseline scenario. For other variables we compute the Median Absolute Percentage Error (MAPE). The results are presented in Fig. 14.

Fig. 14
figure 14

Sensitivity tests for (a) lu scenario and (b) hb scenario.

The results illustrate morphological stability with the same land-use over scenario registered for almost 90 % of the buildings at all time points. The other variables also exhibit parameter stability (which is greater in the lu scenario). The only sensitive variable is average non-residential value. This is due to two changes exerting influence on the spatial distribution of commercial functions. The lu scenario constrains agglomeration tendencies and the hb scenario affects residential, and hence traffic, patterns. As commercial stock is small relative to residential stock these changes do not exhibit strong morphological influence but register greater and more fluctuating differences than in the other variables. While general patterns indicate parameter stability, micro-level differences are still observed.

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Grinberger, A.Y., Lichter, M., Felsenstein, D. (2017). Dynamic Agent Based Simulation of an Urban Disaster Using Synthetic Big Data. In: Thakuriah, P., Tilahun, N., Zellner, M. (eds) Seeing Cities Through Big Data. Springer Geography. Springer, Cham. https://doi.org/10.1007/978-3-319-40902-3_20

Download citation

Publish with us

Policies and ethics