Because we dispose of only one explanatory variable h, it is possible to run the analysis directly with the counts of buildings for each value of h where measurements are available. As such, the sample size n indicated in Table 2 corresponds to the total number of points used for the regression, the total number of buildings in each class being given by N.
Table 2 AIC (5) obtained for the buildings in the P area, with values corresponding to the best fitted model are in bold
Plain
20,682 buildings were surveyed in the P area of Ishinomaki City, after the considerations highlighted in Sect. 2.1 and removal of incomplete or erroneous data (e.g. missing information on building material, damage unexplained by flow depth), 15,736 buildings were analyzed.
Table 2 shows the different AIC values, by link function chosen and construction material. The fragility curves corresponding to the models with the smallest AIC are plotted in Fig. 5, along with the corresponding data. An initial examination of the curves shows that the vulnerability of wooden and masonry structures is higher than the vulnerability of RC and steel buildings. However, we can also see that the behavior of the data is erratic for RC buildings, extremely scattered for steel buildings, whereas the trend is much more obvious for wood and masonry structures. For the latter types of buildings, there is very little or no data points classified as DS4, resulting in equal estimations of the probability of damage for both DS4 and DS5. It is very likely that many buildings which had actually reached level DS4 were classified as DS5 in the field, due to the slightly subjective description of damage provided for these levels. For example, “Heavy damage to several walls and some columns” (DS4) can easily be classified as being “Destructive damage to walls (more than half of wall density) and several columns (bend or destroyed)” (DS5). To an extent, the definitions of DS4 and DS3 can trigger a similar issue (“damage to some walls”—DS3, “damage to several walls”—DS4).
The diagnostics plots in Fig. 6 reveal that indeed the model’s fit to the data is poor for RC and steel buildings, which is expected given the amount of scatter in the data and indicates that flow depth is not a good predictor of tsunami damage for these types of structures. The differences between the observed and expected probabilities become more pronounced as the damage level increases, the worse estimations corresponding to damage states that are representative of structural damage (DS4 and DS5). For wooden buildings, the observed and predicted probabilities are consistent however a trend is present, particularly obvious in the high probability regions (μ > 0.6) with the model systematically overestimating damage probability for non-structural damage (DS2 and DS3), and systematically underestimating damage probability for structural damage (DS4 and DS5). The opposite is true in the low probability region (μ < 0.6). This is likely due to the action of one or several missing variables, which if known should be included in the model (2). This hypothesis is supported by the observations from Yu et al. (2013), who noted in the context of flood damage analysis that sediment flow velocity, flood duration and sediment load have are likely to influence damage estimations. The underestimations may be due to the action of debris, as mentioned in Sect. 2.2 they were likely to have a significant influence on at least non-structural elements (photographic evidence), possibly also for structural damage and collapse, although visual evidence for this is harder to detect on post-tsunami survey images. The potential misclassifications highlighted previously are also likely to influence such trend, for example we can observe that some of the non structural high damage probability data in Fig. 5 is shifted to the right (leading to overestimation of DS2 and DS3), while it is shifted to the left for DS5. Finally, the diagnostic plots in Fig. 6 show a very good fit for DS1 for all structures, with a probability of 1. This is because the probability of a building to experience at least minor flooding (see Table 1) is intrinsically linked to the inundation depth and will reach its maximum as soon as the flow interacts with any building.
Terrain
22,810 buildings were surveyed in the T area of Ishinomaki City, after the considerations highlighted in Sect. 2.1 and removal of incomplete data, 18,289 buildings were analyzed.
Table 3 indicates the AIC values for different building construction types in the T area, and different link functions. The fragility curves corresponding to the models with the smallest AIC are plotted in Fig. 7, along with the corresponding data.
Table 3 AIC (5) obtained for the buildings in the T area, with values corresponding to the best fitted model are in bold
Again the probability of damage given by the model is higher for wooden and masonry structures (in comparison with the other structural types), whereas scatter in the data for RC and steel buildings is important. Similarly to the fragility curves derived for the P area, there is little or no difference between the damage probabilities corresponding to DS4 and DS5. The exact same remarks made for the diagnostics of the P area (Sect. 3.1) can be made for the diagnostics of the T area (Fig. 8).
River
13,458 buildings were surveyed in T area of Ishinomaki City, after the considerations highlighted in Sect. 2.1 and removal of incomplete data, 11,150 buildings were analyzed.
Table 4 indicates the AIC values for different building construction types in the R area, and different link functions. The fragility curves corresponding to the models with the smallest AIC are plotted in Fig. 9, along with the corresponding data.
Table 4 AIC (5) obtained for the buildings in the R area, with values corresponding to the best fitted model are in bold
In this area, scatter in the data for RC and steel buildings is still important, and the model cannot provide a satisfactory fit to the data, as shown also by the large departure from the perfect estimations line in Fig. 10. However, from this figure we can also see that there are less model misclassifications for all damage states in comparison with the results obtained for the plain and terrain areas (RC and steel buildings in Figs. 6 and 8, respectively), yielding a slightly improved damage probability estimation. In addition, the trend that was visible for the wooden buildings of the aforementioned areas is no longer present, despite some underestimation of damage probability for higher damage states (Fig. 10). This indicates that flow depth, while still not a satisfactory predictor of tsunami damage, performs visibly better in the R area. A likely reason for this might be the dominant mechanism of inundation along the river banks, namely dyke overtopping (as mentioned in Sect. 2.2). Indeed, while the tsunami height and velocities may increase in the river channel, the velocities of the water inundating the shores and beyond will be mainly determined by the head difference between the overtopping water surface and the ground, following a process similar to river flooding. As such, the flow velocity would be related to flow depth, which would allow the model to capture this effect through h and explain the slightly enhanced goodness-of-fit. Similarly to the fragility curves derived for the P and T areas, there is little or no difference between the damage probabilities corresponding to DS4 and DS5; and the estimations for DS1 are again very satisfactory.
Fragility comparisons between the three geographical areas in Ishinomaki city
The results of this study show that for all three areas, the correlation between flow depth and damage probability observations for steel and RC buildings is low, yielding a poor fit of the fragility curves, particularly in the case of structural damage. The scatter is less pronounced for masonry buildings, and best for wooden buildings, despite a trend being present around the perfect predictions line in the diagnostics plot.
Therefore, in order to assess if the different geographical characteristics of Ishinomaki City (i.e. plain, terrain and river) significantly altered building damage probability, we choose to compare the fragility curves corresponding to the structural material for which the most reliable estimations have been obtained, namely wooden buildings. Representative damage levels for comparison are DS3 and DS5, because they express probabilities for extensive non-structural and structural damage, respectively.
A first examination of the fragility functions in Fig. 11 shows that the most vulnerable area to tsunami damage, both structural and non-structural, appears to be the plain; whereas the probability of damage for the buildings bordering the river is visibly lower than both in the plain and terrain areas. A common assumption usually made for binomial and multinomial distributions is that the theoretical dispersion parameter ϕ associated with the variance function takes the value of 1 (Eq. (3), Sect. 2.3.1), so the resulting variance is independent of any deviations from the fit and could be underestimated. Because of the systematic deviations observed for wooden buildings in Figs. 6, 8, and 10, and to prevent misleadingly narrow confidence intervals, we have chosen to use instead an estimated dispersion parameter \( \hat{\phi } \) (Fahrmeir and Tutz 2001), expressed as:
$$ \hat{\phi } = \frac{1}{N - p}\sum\limits_{m = 1}^{N} {\sum\limits_{k = 1}^{n} {\hat{r}_{mk}^{2} } } . $$
(6)
In Eq. (6), \( \hat{r} \) represents the Pearson residuals (see Mc Cullagh and Nelder 1989; Fahrmeir and Tutz 2001), which similarly to deviance, are a measure of the model’s error. In the case of DS5, the confidence intervals for the plain and terrain areas overlap, indicating that the probability of a wooden building to suffer heavy structural damage (collapse) is similar in both areas. In the case of DS3, the buildings of the plain area appear significantly more vulnerable to tsunami-induced non-structural damage for flow depths higher than 0.5 m, whereas for flow depths higher than 1 m the confidence intervals corresponding to the fragility curves of the buildings from the terrain and river areas start to overlap. This may indicate that buildings from the terrain and river areas are possibly equally likely to suffer non structural damage for higher tsunami flow depths.
This result may at first appear to be in slight contradiction with the results obtained by Suppasri et al. (2013a, b), who highlighted a higher damage probability for the buildings of the “ria” coast, (in comparison with the “plain” coast), due to the propensity of this type of coastline (saw-toothed) to amplify tsunami waves. The present analysis focuses on the main city of Ishinomaki, not the ria coast to the North. The T area in this study displays a similar inland topography (i.e. mountainous), however, only a small proportion of the buildings in the city of Ishinomaki analyzed in this study are bordering a ria coastline (to the southeast in Fig. 3). The rest of the city is facing Ishinomaki Bay and is characterized by a relatively smooth coastline.
In addition, despite the relatively higher flow depths measured in the T in comparison with the P area, the former benefited from coastal protection along most of the seafront (breakwater, seawell and control forest). These visibly contributed to reduce flow depths and velocities inland, which could have contributed to reduce the severity of tsunami damage.
Other areas
The fragility analysis was also conducted separately for small areas which were thought to be particularly susceptible to tsunami damage (and for which enough points were available), namely:
-
The river island approximately 1 km from the river mouth, in the direct path of the fast tsunami flow travelling along the river and the river banks which were not protected by a dyke,
-
Terrain A for it is unprotected, backed up by high topography blocking the advancement of the tsunami and close to the river mouth,
-
Terrain B for it is located on the border of a canal and backed up by high topography.
These areas are represented in Fig. 12, only wooden buildings were used due to the low number of data points available for other types of structures. The results (Fig. 13) show that the curves are driven by a majority of data points corresponding to a 100 % damage probability exceedance for all damage levels. In other words, in these areas the probability of reaching or exceeding structural damage levels (wood) is very high. For example, there is certainty of collapse for the buildings located in Terrain A for water depths as low as 2 m. Non-structural damage is almost certain (approx. 90 %) for wooden buildings located in the other aforementioned areas, for water depths as low as 0.5 m, as well as collapse from about 3 m.