Classifying buildings according to seismic vulnerability using Cluster-ANN techniques: application to the city of Murcia, Spain

Meyers-Angulo, J. Eduardo; Martínez-Cuevas, Sandra; Gaspar-Escribano, Jorge M.

doi:10.1007/s10518-023-01671-5

Original Article
Open access
Published: 03 April 2023

Volume 21, pages 3581–3622, (2023)
Bulletin of Earthquake Engineering

Abstract

The seismic vulnerability of a city is a measure of its intrinsic susceptibility or predisposition to sustain damage or losses stemming from seismic events. In terms of physical vulnerability, one of the most important factors for assessing seismic risk, especially for estimating losses, is the exposure of structures, particularly those intended for residential use. The present article outlines a methodology for classifying residential buildings based on the structural and non-structural components that ultimately determine the building typology and control the seismic performance. The proposed methodology is divided into three steps. First, spatial data are analysed using an official database that is supplemented by remote field work to verify, validate, and identify construction typologies and urban modifiers after incorporating the new observable data. Second, machine learning techniques based on Two-Step cluster analysis and neural networks are used to identify building typologies, using a multilayer perceptron to assess the representativeness of the building typologies identified. Finally, each building typology is defined, a vulnerability assessment is carried out, and vulnerability classes are ranked based on the macroseismic scale. These steps were applied to 7631 residential buildings in the city of Murcia, Spain. The methodology is scalable and may be automated, so it may be replicated in other urban areas with similar characteristics or adapted to different urban settings. This may help save time and reduce the cost of carrying out seismic risk studies, providing valuable information for both civil protection and regional and local governments.

1 Introduction

The seismic vulnerability of a building or group of buildings reflects its intrinsic predisposition to sustaining damage in the event of a seismic motion and is directly correlated with its physical properties, structural design and soil-structure interaction (Barbat et al. 1998). In order to further a multidisciplinary and comprehensive approach to seismic risk assessment, we must first evaluate the potential physical damage that may result from the combination of hazard and physical vulnerability for exposed elements (Carreño 2006). Seismic risk assessment at the urban level must contemplate a large number of structures with their corresponding potential levels of damage and probabilities of sustaining said damage, while also bearing in mind that different buildings serve different purposes that are more or less relevant to city life (Basaglia et al. 2018). In this context, data quality and availability directly affect a realistic risk assessment, which serves as the basis for implementing informed disaster risk reduction actions, in line with the international agenda and the global targets set in the Sendai Framework, such as those that seek to make risk data and information more available to local communities and organizations (Torres et al. 2019).

The details of a building’s construction make each building unique. In order to systematically characterize the different elements of the building, it is essential to record and update the data with the structural and non-structural attributes that can affect the seismic behaviour of the building. All these attributes are represented in an exposure model for vulnerability and seismic risk assessments, which can be adapted to a suitable taxonomy. This step poses the most significant challenges. For Dell’Acqua et al. (2013), the pitfalls of taking a detailed inventory of building stock include: (i) high investment in terms of time and money; (ii) access to data or public information that is incomplete or scattered across several entities; (iii) possible observation errors due to uncertainty; (iv) variable data formats or poor homogenization. Thus, the ongoing challenge of recording and enriching spatial data has led many researchers to propose methodologies based on primary data sources such as population and housing censuses, cadastres, and other official sources, combined with direct observations or computer-based surveys.

Examples of the above include the exposure model presented by Yepes-Estrada et al. (2017)—which, as part of the Global Earthquake Model’s (GEM) South America Risk Assessment (SARA) Project, classifies typical building structures in seven South American countries based on population and housing census data—and the classification of structures in Central and South Asian countries presented by Lang et al. (2017), in which the authors managed to identify key building typologies based on data from rapid visual surveys. At the country level, the exposure model proposed by Santa María et al. (2017) provides a classification of residential building structures in Chile using census data as a primary source and identifying structures based on remote surveys. Another example of methodology, this time based on geomatics and statistical techniques, is the study by Torres et al. (2019) in which the researchers propose procedures for estimating seismic exposure and vulnerability by applying algorithms to remote-sensing data, thus providing accurate automation for these types of studies. Consequently, one current challenge is to adopt new automated methodologies for classifying building structures before assessing vulnerability.

Several global initiatives propose methods for assessing seismic vulnerability based on Building Typology Models (BTMs) that correspond to predefined building classes with similar structural systems and seismic behaviour. The most representative of these initiatives include: the taxonomy proposed by the Federal Emergency Management Agency (FEMA-154 1988, 2002), which contains 15 different typological classes or BTMs based on building specifications typical in the United States; HAZUS (HAZUS-MH 2003), created by FEMA as a method for estimating seismic risk, with 15 BTMs subdivided according to the number of storeys; EMS-98 (Grünthal 1998), a European building classification based on empirical macroseismic intensity scales that identifies 15 BTMs based on the level of vulnerability in the event of a major earthquake; Risk-UE (Mouroux et al. 2004; Mouroux and Le Brun 2006), which integrates and homogenizes seismic risk projects in Europe, identifying 23 BTMs based on the most prevalent construction specifications; SYNER-G (2009), a comprehensive methodology for assessing the seismic vulnerability of buildings in Europe that contemplates the urban system and its interrelation with other systems and proposes new taxonomies for reinforced concrete and masonry; and GEM Taxonomy v2.0 (Brzev et al. 2013), which comprises 13 attributes that describe the structural system, roofing, flooring, building envelope, and use, among other things. These attributes are divided into sub-levels corresponding to specific characteristics of the main attribute, thus providing a more detailed physical description of the building and positioning the system as a global taxonomy. In this context, and in line with the Sendai Framework’s goal of promoting availability of multi-hazard systems, recent taxonomies such as GED4ALL (Silva et al. 2018), which presents a taxonomy based on GEM v2.0, have taken a multi-risk approach, clearly distinguishing between the concepts of exposure (common to all risks) and vulnerability (specific to each risk). Other use cases include European projects such as SERA (2017), urban exposure studies (Pittore et al. 2018), and studies based on modelling building typologies (Esteghamati et al. 2020).

Hence, the main objective of any building vulnerability rating is to identify those building typologies that might respond differently to seismic shaking and group them into classes to evaluate their response and the estimated extent of potential losses. Ongoing studies rate the variables that affect building vulnerability and formulate a statistical model using a discrimination index (Martínez-Cuevas et al. 2020) that makes it possible to identify habitable and non-habitable buildings. Recently published studies in the field of machine learning apply artificial neural networks (ANN) to the field of seismic engineering. Several authors (e.g., Stefanini et al. 2022; Ferreira et al. 2020) have used techniques based on artificial intelligence to evaluate seismic response and estimate damage. Others (Vazirizade et al. 2017; Tang et al. 2021) have applied machine learning to structural reliability assessments and rapid assessments of seismic risk and potential building loss.

This paper focuses on using data mining and machine learning techniques to obtain data and ultimately classify the predominant building construction patterns (including structural and non-structural aspects) in an urban area. The method was applied to the city of Murcia, which has one of the highest seismic risks of any city in Spain (IGN-UPM 2013; Gaspar-Escribano et al. 2015). The last significant earthquake in this region occurred in 2011 in the city of Lorca, 105 km from the city of Murcia, and the two cities have similar urban settings. The literature on Lorca’s post-earthquake vulnerability includes studies on seismic behaviour in masonry and reinforced concrete buildings (Basset-Salom and Guardiola-Víllora 2014; Gomez-Martinez et al. 2015; Ródenas et al. 2018) and analyses of building typologies and urban modifiers based on empirical data and the re-evaluation of macroseismic intensity estimates (Martínez-Cuevas and Gaspar-Escribano 2016; Martinez-Cuevas et al. 2017).

The proposed methodology uses multivariate statistical techniques and machine learning to classify building types according to seismic vulnerability. The study was carried out by performing data analysis at different resolutions, starting with primary data sources, and then verifying and collecting data via remote surveys using online map viewers, digital cartography analysis, and Geographical Information System (GIS) tools to obtain a high-quality final database. Based on the data obtained, an initial building typology classification was carried out by applying a two-step cluster analysis and a multilayer perceptron ANN to the final evaluation of key building typologies in the study areas. Finally, seismic vulnerability was assessed and classified according to the EMS-98 macroseismic scale (Grünthal 1998).

2 General methodology

The present study proposes a methodology (Fig. 1) that is divided into three steps, with specific workflow stages in each phase.

The first step consisted of selecting a study area in the city of Murcia, compiling building data extracted from the Cadastre (https://www.sedecatastro.gob.es/), and setting up an initial database and geographic information system. This allowed us to preliminarily identify the different Construction Typologies (CT) and their variability within the selected study area. Preliminary data were checked for potential identification errors by means of a typology membership probability matrix for Murcia (RISMUR 2014). In parallel, remote field work was carried out to identify urban modifiers for the buildings in the study area (Martinez-Cuevas et al. 2017). In this way, the initial database was enriched with new attributes (urban modifiers) coded according to the GEM taxonomy (https://taxonomy.openquake.org/). The initial CTs completed with the corresponding urban modifiers are called Building Typologies (BT) in this work.

The second step involved multivariate statistical techniques. A two-step cluster analysis was used to identify clusters, correlating previously identified construction typologies with the urban modifiers present in each building. For each CT, the resulting clusters, called Building Cluster Typologies (BCT) in this work, were internally similar but dissimilar compared to other clusters. The various BCT were evaluated using a multilayer perceptron neural network (ANN-MLP), which allowed us to assess the representativeness of the natural clusters obtained using the multivariate technique.
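One plausible way to read "representativeness" is the share of buildings in each cluster that a trained classifier assigns back to that same cluster. The sketch below computes this per-cluster agreement from two label lists. It is an illustration only: the function name, the labels, and the choice of agreement rate as the metric are assumptions, and neither the two-step clustering nor the network training is reproduced here.

```python
from collections import Counter


def per_cluster_agreement(cluster_labels, predicted_labels):
    """Return, for each cluster, the fraction of its members that the
    classifier assigned back to the same cluster."""
    if len(cluster_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    totals = Counter(cluster_labels)
    hits = Counter(
        c for c, p in zip(cluster_labels, predicted_labels) if c == p
    )
    return {c: hits[c] / totals[c] for c in totals}


# Hypothetical labels: cluster membership versus classifier output.
clusters = ["BCT1", "BCT1", "BCT2", "BCT2", "BCT2"]
predicted = ["BCT1", "BCT2", "BCT2", "BCT2", "BCT2"]
agreement = per_cluster_agreement(clusters, predicted)
```

A low agreement rate for a cluster would suggest that the input variables do not separate it well from the others.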

The third step consisted of conducting a vulnerability study of structural and non-structural components, developing a building taxonomy based on the BCT evaluated, calculating average vulnerability at the neighbourhood and census tract levels, and establishing vulnerability index ranges for each cluster analysed. The European Macroseismic Scale (EMS-98) was used to classify vulnerability classes. In addition, the corresponding vulnerability curves were prepared for each BCT.
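The averaging over neighbourhoods and census tracts can be sketched as a simple group-by mean. The field names (`neighbourhood`, `tract`, `iv`) and the index values below are hypothetical, and the unweighted arithmetic mean is an assumption, since this section does not say whether any weighting is applied.

```python
from collections import defaultdict
from statistics import mean


def mean_index_by(buildings, key):
    """Average the vulnerability index 'iv' over buildings sharing `key`."""
    groups = defaultdict(list)
    for building in buildings:
        groups[building[key]].append(building["iv"])
    return {group: mean(values) for group, values in groups.items()}


# Hypothetical records, one per building.
buildings = [
    {"neighbourhood": "N1", "tract": "T1", "iv": 0.5},
    {"neighbourhood": "N1", "tract": "T2", "iv": 0.75},
    {"neighbourhood": "N2", "tract": "T3", "iv": 0.25},
]
by_neighbourhood = mean_index_by(buildings, "neighbourhood")
by_tract = mean_index_by(buildings, "tract")
```

The same function serves both aggregation levels; only the grouping key changes.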

2.1 Study area and data

The study area comprises the urban area of the city of Murcia, whose local administration is divided into 28 neighbourhoods and 158 census blocks (CB). The former represents the largest scale at the urban level while the latter are lower-level territorial subdivisions that are useful for disseminating statistical data. The total area covered by the sum of all neighbourhoods analysed is approximately 11.93 km². The study area contains high-density residential buildings with unique urban characteristics and construction typologies attributable to the date of construction, urban planning, and the way construction techniques have evolved. The total building stock consists of 8698 buildings, of which 7631 are residential buildings. These buildings constitute the basis of this study, corresponding to 100% of the residential buildings within the city of Murcia. Figure 2 shows the study area and the correlation between neighbourhoods, census blocks, and the number of residential buildings analysed.

2.2 Database and geographical information

Initially, a building database and a GIS were set up using the primary data (shapefiles Building and BuildingPart) obtained from the Spanish Cadastre (https://www.catastro.minhap.es/webinspire/index.html) according to the INSPIRE Directive. The attributes extracted for each building included: identifiers for each building and building part, cadastral reference, geometries, year of construction, state of conservation, total number of building units, number of dwellings, number of storeys above and below ground level, reforms, areas, centroid coordinates, and precision. This database was cleaned by assigning different IDs to buildings sharing the same cadastral reference, by correcting errors in polygon outlines, and by removing data not essential to this work (retaining only the number of storeys, year of construction, building use, number of dwellings, and renovation). This process produced the CT database, which was complemented with the attributes related to urban modifiers to obtain the BT database.
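Two of the cleaning operations described above, assigning distinct IDs to buildings that share a cadastral reference and dropping non-essential attributes, can be sketched as follows. The field names and the ID format are hypothetical and do not reflect the actual Cadastre schema. The correction of polygon outlines is not covered.

```python
from collections import defaultdict

# Hypothetical names for the five attributes the text says were retained.
KEEP = ("storeys", "year_built", "use", "dwellings", "renovated")


def clean_records(records):
    """Assign a unique ID per building and keep only the retained fields."""
    seen = defaultdict(int)  # counts buildings per cadastral reference
    cleaned = []
    for rec in records:
        ref = rec["cadastral_ref"]
        seen[ref] += 1
        row = {"building_id": f"{ref}_{seen[ref]}", "cadastral_ref": ref}
        row.update({field: rec.get(field) for field in KEEP})
        cleaned.append(row)
    return cleaned


# Two hypothetical buildings sharing one cadastral reference.
raw = [
    {"cadastral_ref": "REF001", "storeys": 4, "year_built": 1975,
     "use": "residential", "dwellings": 8, "renovated": False,
     "precision": 0.1},
    {"cadastral_ref": "REF001", "storeys": 2, "year_built": 1975,
     "use": "residential", "dwellings": 2, "renovated": True,
     "precision": 0.1},
]
ct_rows = clean_records(raw)
```

Appending a running counter to the cadastral reference keeps the link to the source record while making each building individually addressable.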

The first step was to apply a building membership probability matrix (RISMUR 2014) that links the year of construction with the approximate validity periods of the corresponding earthquake-resistant building codes for the entire region of Murcia (Table 1). The initial matrix was applied randomly to the entire building database to obtain an initial identification of CTs. The CTs were classified as low-rise, medium-rise, and high-rise buildings according to the number of storeys. Depending on the year of construction and the applicable building code, concrete buildings were classified as pre-code or low-code (buildings with a higher level of seismic design are not present in the study area).
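A minimal sketch of how a membership probability matrix might be applied randomly: each building's construction year selects a row of probabilities, from which a typology is drawn, and the number of storeys sets the height class. Every number below (year ranges, probabilities, storey thresholds) is a placeholder invented for illustration; the values from RISMUR (2014) and Table 1 are not reproduced in this section.

```python
import random

# PLACEHOLDER periods and probabilities, not the RISMUR (2014) values.
MEMBERSHIP = [
    (1900, 1962, {"masonry": 0.9, "rc_pre_code": 0.1}),
    (1963, 1995, {"masonry": 0.3, "rc_pre_code": 0.7}),
    (1996, 2100, {"rc_low_code": 1.0}),
]


def height_class(storeys, low_max=2, mid_max=5):
    """Classify by storeys; both thresholds are placeholder assumptions."""
    if storeys <= low_max:
        return "low-rise"
    if storeys <= mid_max:
        return "medium-rise"
    return "high-rise"


def assign_ct(year, storeys, rng):
    """Draw a typology for the construction year, then add height class."""
    for start, end, probs in MEMBERSHIP:
        if start <= year <= end:
            typology = rng.choices(
                list(probs), weights=list(probs.values())
            )[0]
            return typology, height_class(storeys)
    raise ValueError(f"no membership row covers year {year}")


rng = random.Random(0)  # fixed seed so the draw is reproducible
ct = assign_ct(2005, 3, rng)
```

Because the assignment is a random draw, individual buildings can be misclassified, which is consistent with the text describing this as an initial identification only.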

Table 1 Evolution of seismic codes applicable in Spain
