1 Introduction

Concrete gravity dams are key to flood control, energy production, and the industrial and agricultural supply. Most of the existing concrete gravity dams were designed without considering their dynamic behaviour, and often their shapes were optimised according to static concepts. Moreover, the levels of reliability required to important infrastructures have been recently increased in order to meet the requests of communities for greater safety, also in light of the revaluation of some areas as seismic [1]. Together with the well-known issues related to the quantification of dam seismic safety [e.g., calibration of load-and-resistance-factor approaches, definition of limit states (LSs)] [2], this justifies the need for strategies that preserve such structures. In this context, the implementation of structural health monitoring (SHM) is very important, as it improves dam control and reduces the uncertainties involved in predicting their dynamic behaviour. However, SHM in dam engineering is very rare, mainly because of the large computational burden of predictive models, as discussed in Sect. 2.

Concrete dams are equipped with static monitoring systems, which usually record displacements at a few points in the dam body, basin levels and temperatures of the air and water. As shown in Sevieri et al. [3] this information can be used to calibrate predictive models of the structure for control purposes. However, in view of seismic events, dam control can be performed more effectively through dynamic monitoring systems. Sevieri et al. [4] show how ambient vibrations, and in particular their elaboration through operational modal analysis (OMA), can be successfully used to update the parameters of predictive models of concrete gravity dams’ dynamic behaviour, thus reducing epistemic uncertainties.

This research work presents a new probabilistic ambient vibration-based SHM framework for real-time control of concrete gravity dams. The procedure exploits experimental modal characteristics deduced from the analysis of ambient vibrations via OMA. It consists of two phases: Training and Detection. The architecture of the system and the hierarchical statistical relationship between ambient vibrations, environmental measurements and model parameters enable analysts both to detect damage and to update the parameters of predictive models. Dynamic measurements thus provide structural control, thereby improving the prediction of the dam’s structural behaviour. This latter is possible thanks to the Bayesian setting, thus exploiting the information contained in the new observations to update the state of knowledge of the model parameters. This approach also helps to reduce the uncertainties that affect the estimate of the residual life expectancy of the structure.

Predicting dam behaviour requires a massive use of complex numerical models including both soil-structure interaction (SSI) [5] and fluid-structure interaction (FSI) [6], whose computational burden is prohibitive without high-performance computing. Therefore, the general polynomial chaos expansion (gPCE) [7] is used to propagate uncertainties through the proposed predictive models of the dam behaviour and to reduce the computational burden by surrogating finite element analysis (FEA) results in the solution of the inverse problem (i.e., definition of gPCE-based predictive models).

In addition, modelling the SSI leads to a large number of numerical modes without experimental correlation, which only act as disruption factors for dynamic SHMs. Sevieri at al. [4] discuss the problem of matching between numerical and experimental modes in the modal analysis of concrete gravity dams and propose an algorithm based on the Markov Chain Monte Carlo (MCMC) method for the selection of numerical modes with experimental correlation. In the present work this algorithm is used to solve the mode matching problem in both the Training and Detection phases, as discussed in Sect. 4.

The novelty of this study lies in the probabilistic nature and the particular architecture of the proposed framework, which exploits modal information derived from ambient vibrations both to control the structural health state and to reduce the uncertainties in the prediction of the dam behaviour. In fact, the literature review presented in Sect. 2 shows that recent research tends to focus on implementing ambient vibration-based control systems for concrete gravity dams, without proposing a real SHM framework.

Finally the case study proposed in Sect. 5 shows the feasibility of implementing the proposed method for the on-line control of concrete gravity dams.

2 Structural health monitoring systems for concrete gravity dams

SHM is the process of assessing the health state of a structure. The complete definition of a SHM system entails sensor technology, materials technology, numerical modelling and diagnostic techniques [8]. SHM is made up of both hardware components (e.g., instrumentation) and software components (e.g., damage modelling, detection algorithms). Selecting the structural quantity of interest (QI) to be monitored and the definition of damage levels are fundamental steps. They represent the “structural behaviour” and its “threshold”, respectively, the latter is clearly related to a particular damage scenario (DS).

The selected “structural behaviour” must be sensitive to damage propagation (i.e., high value of information embedded in the QI) and be easily measurable with a good degree of accuracy. The “threshold”, which reflects a particular damage scenario, in this context can be defined with respect to a specific limit state (LS) or damage index (DI). The definition of a threshold should also consider the error related to both the measurements and structural health state predictions. Calibrated predictive models of structural behaviour can be successfully used for this purpose [4]. SHM frameworks defined in probabilistic settings and based on calibrated predictive models can detect abnormalities by comparing observations and predictions [9].

SHM systems are composed of the Diagnosis and the Prognosis phases. During the diagnosis, the health state of the structure is assessed locally, at the component level, and globally, in order to detect and locate damaged areas. During the Prognosis, the effects of damage on the structural behaviour are computed, and the residual life expectancy of the construction is thus assessed.

Concrete gravity dams are always equipped with monitoring systems for control purposes, which record environmental measurements, such as the basin level and air and water temperatures. Structural behaviour can be monitored through static or dynamic systems. Static monitoring systems record daily displacements in some reference points of the dam body in both the upstream-downstream and cross-valley directions [10] related to changing environmental conditions. The velocity of this phenomena and the magnitude of the corresponding displacements enable the problem to be addressed as quasi-static.

Dynamic monitoring systems in dam engineering are rarer than static ones and their application is generally limited to research purposes. Due to the large size and mass of the dams, dynamic SHM are generally based on ambient vibrations which are recorded by a network of accelerometers or velocimeters. The data recorded can be directly used to control the health state of the structure or can be processed through OMA techniques [11].

Bukenya et al. [12] reviewed on application of static and dynamic monitoring systems in dam engineering and highlighted that static measurements are commonly used to calibrate predictive models of the structural behaviour in a deterministic setting. On the other hand, only a few applications regard dynamic monitoring systems for dams and no study has presented an SHM framework where calibrated predictive models are actually used for control purposes. Of the various applications that explore the use of ambient vibrations for structural control, some studies present the calibration of dam predictive models in a deterministic setting, but none have proposed a framework for the dynamic SHM of concrete gravity dams.

In addition to the papers mentioned in Bukenya et al. [12], other important research works have been published in recent years. The static SHM systems recently proposed for gravity dams are inspired by the progress made in machine learning. Different architectures have been used to define predictive models of dam static displacements [3, 13,14,15,16,17,18,19,20] in order to improve the accuracy of the prediction and to reduce the computational burden of the classical approaches based on functional approximation [21]. Artificial neural network, support vector machine learning, extreme machine learning, multiple linear regression and general polynomial chaos expansion have been successfully used for the approximation of static dam behaviour.

Another important aspect in the definition of an SHM is the training phase of the predictive model. Most studies are defined in a deterministic setting and regressive methods are used to calibrate dam predictive models. Different approaches have been proposed to improve the performance of predictive models: the hybrid simplex artificial bee colony algorithm (HSABCA) [22], boosted regression trees [23], multilevel-recursive method [17], dynamic time warping (DTW) method, local outlier factor (LOF) [24], chaotic residual errors [19], Random Forest Regression (RFR) [25] and Bayesian inference [3] have been successfully applied in the scientific literature.

Several interesting research works that focus on the definition of static SHM frameworks are described in more detail in the following. Prakash et al. [26] use the principal component analysis for the reduction in the dimension of static predictive models (e.g., displacements, strains). Predictive model coefficients are then calibrated using the least squares method, while the threshold of abnormal behaviour is continuously calibrated through new observations, thus avoiding erroneous warnings. The study of a Bulgarian arch dam shows the feasibility of the proposed procedure.

Su et al. [27] combine the Dempster–Shafer theory of evidence (DST) and Set Pair Theory (SPT), in order to propose a structural health control method for concrete dams, which also supports decision-makers in dam management. This combines multi-source space-time information, such as monitoring systems, and inspections. The study of a hydro-power station completed in 1953 proves the efficiency of the method both to control the structural behaviour and to support decision-makers in choosing the best retrofitting intervention.

Hu et al. [28] study the leakages of the Shimantan Reservoir, a concrete dam with penetrating cracks. The authors show that systematic field inspections are fundamental in understanding the cause of the leakages. They also present a predictive model for dam displacements based on a modified version of the Navier-Stokes equations, which is particularly suitable for dams that experience leakages and penetrating cracks.

Lin et al. [20] propose a procedure for the separation of dam displacements into two parts: one relating to the dam behaviour and another relating to the foundation deformation. The idea of partitioned finite element model (FEM) is used to define hybrid equations that enable these two contributions to be separated. Observations recorded by the static monitoring system are used to calibrate the partitioned model, thus estimating the material mechanical parameters. The analysis of a concrete dam is used to verify the validity of the separation method.

Kang et al. [29] propose a static framework for the structural control of concrete dams using long-term air temperature for the simulation of thermal effects. They use the radial basis function network (RBFN) to quantify temperature effects from a long-term air temperature series. The RBFN is then applied instead of the Fourier series, which are commonly adopted for the prediction of thermal effects in dam engineering. The application on a real concrete gravity dam demonstrates the performance of their method.

Dynamic monitoring systems in dam engineering are rarer than static ones. Issues related to SSI and FSI modelling in the dynamic numerical models of concrete dams have hitherto prevented the development of dynamic predictive models and consequently dynamic SHMs [4]. However, there are some studies on the use of ambient vibrations for model calibration purposes.

Cheng et al. [30] propose a SHM approach for concrete dams based on ambient vibrations in which the kernel principle analysis (KPCA) is used to remove the effect of environmental condition variations. Damage is detected by comparing the \(L_2\) norm of the KPCA error calculated during normal conditions with the \(L_2\) norm calculated for the current state of the dam. After detecting the damaged areas, the coordinate modal assurance criterion (COMAC) is used to locate them. The authors do not tackle the mode matching problem and do not define any probabilistic framework.

Hariri-Ardebili et al. [31] study the modal behaviour of an arch dam, highlighting the effect of mechanical parameter uncertainties on the numerical results. The authors present a calibration procedure based on regression to the model parameters, which employs experimental modal characteristics. Again, the procedure is not defined within an SHM framework.

A hierarchical Bayesian method based on ambient vibrations for the dynamic parameter calibration of dam numerical models is proposed by Sevieri et al. [4]. They present a modified version of the Markov Chain Monte Carlo (MCMC) to solve the mode matching problem related to SSI modelling. They use hybrid meta models to surrogate FEA results, thus enabling the Bayesian inference to be applied to dam engineering. The application of the proposed procedure to the case of a real dam demonstrates the validity of this study.

3 Failure mechanisms and damage development in concrete dams

The first step in defining the SHM system is to analyse the damage propagation and failure mechanisms of the structural system. This is carried out in order to select the QIs to be detected and real case histories, numerical models and expert judgements are exploited. Since there are no case histories on concrete dam failures after seismic events [32, 33], numerical models are the only way to predict how such structures deal with collapse [2, 33]. Due to the quasi-brittle nature of the material, collapse mechanisms of concrete dams are mostly driven by tensile crack development [34]. The opening of cracks weakens the structure, thus leading to sliding or rocking.

Crack paths start from both the upstream and the downstream face of the dam and then propagate toward the core [35]. There are four main critical locations where cracks tend to start: (A) the neck area, where there is a change in the downstream face slope; (B) along lift joints at various elevations; (C) along the dam-foundation interface both near to the toe and the dam heel; (D) and at the foundation in near field of the dam. These areas are shown in Fig. 1 adapted from Léger [35].

Fig. 1
figure 1

Crack path critical locations of potential seismic failure modes in the section of concrete dams

Three-dimensional models of the dam enable analysts to highlight other failure mechanisms (e.g., those involving vertical contraction joints) which may lead to uncontrolled water release [36] and which cannot be detected through two-dimensional models.

Failure mechanisms of dam cross-sections have been investigated in several studies. Omidi et al. [37] analyse the seismic cracking of concrete gravity dams using a plastic-damage model and assuming different damping approaches. Their results show a high damage concentration in areas A and C in Fig. 1.

Hariri-Ardebili et al. [2] present a state-of-the-art review of seismic fragility analyses of concrete gravity dams. Most studies assume that sliding or rocking only arise in areas A and C in Fig. 1.

In a previous work, De Falco et al. [38] propose an extended version of the “Masonry like” material [39] as a new damage constitutive model for dam concrete. The Koyna dam is studied in terms of both seismic and overflow conditions, and the results are compared with the literature. The study revealed a high concentration of crack paths in areas A and C in Fig. 1.

Ansari et al. [40] propose a damage index for concrete gravity dams based on crack propagation and sliding safety factors. They demonstrate that this index can be a reliable parameter for the control of the structural health state.

Most studies show that the main failure mechanisms are sliding and rocking due to tensile cracking. A reliable SHM system for concrete gravity dams therefore needs to provide early detection of any variations in a QI with the crack propagation.

4 Proposed SHM framework

4.1 General overview

The present paper proposes a probabilistic SHM framework for concrete gravity dams. Its particular architecture together with the use of gPCE-based predictive models allow for real-time structural control and a better estimation of the remaining life expectancy of the dam. The use of ambient vibrations (Fig. 2) as an information source enables damaged areas to be detected and the dynamic parameters of dam predictive models to be updated within the SHM itself. Epistemic uncertainties related to the mechanical parameters of the materials are thus reduced, and the prediction of the dam behaviour is improved. Both damage detection and SHM are based on a comparison between QIs related to the healthy state of the structure and the current state, i.e., experimental and predicted mode shapes. Since QIs representing structural healthy states vary depending on the environmental conditions, anthropic loads, random factors and so on, probabilistic predictive models are needed to forecast their evolution. Predictive models also need to be accurate and fast, to enable the control system to be in real time. In this case, the accuracy of the system is improved by reducing epistemic uncertainties via the Bayesian inference. Using gPCE-based meta models, rather than FE models, leads to more rapid predictions.

The proposed probabilistic framework contributes both to the diagnosis and prognosis steps of the SHM. In fact, the continuous updating of the proposed predictive model reduces the errors, thus refining the predictions themselves and consequently the detection and location of the damage. Likewise, uncertainties are also reduced in the prognosis step, thus improving the estimation of the residual life expectancy of the dam.

The proposed approach has two phases in the hierarchical Bayesian framework [41]: Training and Detection. During the Training phase, predictive models are calibrated using the observations recorded by the monitoring system during the normal operations of the dam. During the Detection phase, the structural health state is controlled. The monitoring system records ambient vibrations and environmental measurements, such as basin levels, air humidity rate, water and air temperatures. The structural response to ambient vibrations is analysed through the OMA to determine the system’s modal characteristics. In particular, dam mode shapes are used as QI because of their particular sensitivity to damage development.

Fig. 2
figure 2

Flow chart of SHM framework for concrete dams

An additive multivariate probabilistic model [41, 42] is used to statistically describe the relationship among the mechanical parameters of materials, environmental conditions, observed and predicted QIs, and error terms. For the sake of simplicity environmental conditions are collected in \(\mathbf{x }\), while the unknown elastic mechanical parameters of materials are collected in \(\pmb {\uptheta }_{\text {el}}\). Assuming the system mode shapes as reference QIs, only the elastic parameters of the materials need to be considered as random variables. A parametrization of the problem in terms of the bulk modulus K and shear modulus G [43] reduces the variance of the solution [3]. It is worth noting that the static and dynamic behaviour of dams strictly depend on the environmental conditions [3, 44,45,46,47,48,49].

The predictive model proposed by Sevieri et al. [4] is used to represent the mode shapes of the system dam-basin-soil. Let us consider q modes of a dam numerical model with m dynamic degrees of freedom (\(q \le m\)). By ordering the components of the mode shapes matrix \(\pmb {\Phi }=\left[ \pmb {\upphi }_1,\ldots ,\pmb {\upphi }_k,\ldots ,\pmb {\upphi }_q\right]\) in only one vector \(\pmb {\upphi }^{\text {total}}\) with dimension \(m \cdot q \times 1\), an index h with \(1 \le h \le m \cdot q\) can be defined, thus leading to a simplification from a computational point of view [4]. Consequently, the statistical relationship between the ith observation of the hth component of the experimental mode shape \(\phi _{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}},\pmb {\Sigma }_{\upphi }\right)\) and the corresponding predicted value \({\hat{\phi }}_{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}}\right)\) is

$$\begin{aligned}&\phi _{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi }\right) = {\hat{\phi }}_{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}} \right) +\sigma _{\upphi _h}\epsilon _{\upphi _{h,i}}, \nonumber \\&\qquad h=1,\ldots ,m \cdot q \quad i=1,\ldots ,l. \end{aligned}$$
(1)

\(\pmb {\Sigma }_{\upphi }\) in Eq. 1 is the covariance matrix of the error terms \(\sigma _{\upphi _h}\epsilon _{\upphi _{h,i}}\). In order to improve the prediction capability of the SHM also the error term is updated, i.e. the components of \(\pmb {\Sigma }_{\upphi }\). The dimension of the probabilistic problem, and thus the computational burden, strongly increases. Therefore, by assuming that only components that belong to the same mode are correlated, the covariance matrix assumes a block form [4], that is

(2)

The computational burden can be further reduced by using covariance functions to describe the components of \(\pmb {\Sigma }_{\upphi }\).

By assuming an exponential form for the covariance function:

$$\begin{aligned} \text {COV}\left( \phi _r,\phi _s\right) =\frac{1}{\lambda ^n} \exp \left[ -\left( w_d^n d_{\phi _r,\phi _s}\right) \right] , \end{aligned}$$
(3)

two components \(\phi _r\) and \(\phi _s\) of the same nth mode can be conceived as being correlated through their Euclidean distance \(d_{\phi _r,\phi _s}\). The unknown combination coefficients \(\lambda ^n\) and \(w_d^n\), which are collected in the vectors \(\pmb {\lambda }\) and \(\mathbf{w }_d\), respectively, are also updated.

The use of the Bayesian inference within SHM frameworks is particularly effective because of the large amount of recorded data. In the field of engineering modelling, the updating process aims to find the parameter set that leads to the best fit between an observed QI and its prediction obtained with a numerical model [50]. The determination of the parameter values is an inverse ill-posed problem in the Hadamard’ sense [51], so a regularisation is needed. The Bayesian inference allows this to be achieved by introducing information about the parameters, namely the prior distributions [52].

As discussed by Sevieri et al. [4], a hierarchical formulation of the updating rule [53] is preferable because it enables the calibration of mean values and standard deviations of parameter distributions. This is fundamental for improving the estimation of the structural fragility during the prognosis phase.

Let us define the vector of the new observations \(\mathbf{y }\), which collects the experimental mode shapes obtained by processing the ambient vibrations through OMA, and the vector of the elastic hyper-parameters \(\pmb {\Xi }_{\text {el}}\). The prior distribution can thus be written as

$$\begin{aligned} p\left( \pmb {\uptheta }_{\text {el}},\pmb {\Sigma }_{\upphi },\pmb {\Xi }_{\text {el}}\right) = p\left( \pmb {\uptheta }_{\text {el}},\pmb {\Sigma }_{\upphi }|\pmb {\Xi }_{\text {el}}\right) p\left( \pmb {\Xi }_{\text {el}}\right) . \end{aligned}$$
(4)

By defining the likelihood function \(L\left( \mathbf{x }, \pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi }, \pmb {\Xi }_{\text {el}}|\mathbf{y }\right)\) which contains the new observations, the posterior distribution \(p\left( \pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi }, \pmb {\Xi }_{\text {el}}|\mathbf{y }\right)\) is obtained through the updating rule

$$\begin{aligned} \begin{aligned}&p\left( \pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi }, \pmb {\Xi }_{\text {el}}|\mathbf{y }\right) \\&\quad =\kappa L\left( \mathbf{x }, \pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi }, \pmb {\Xi }_{\text {el}}|\mathbf{y }\right) p\left( \pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi },\pmb {\Xi }_{\text {el}}\right) \\&\quad =\kappa L\left( \mathbf{x },\pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi },\pmb {\Xi }_{\text {el}}|\mathbf{y }\right) p\left( \pmb {\uptheta }_{\text {el}},\pmb {\Sigma }_{\upphi }| \pmb {\Xi }_{\text {el}}\right) p\left( \pmb {\Xi }_{\text {el}}\right) . \end{aligned} \end{aligned}$$
(5)

Updating procedures based on the Bayesian inference entails defining the prior distribution of the unknown parameters and selecting a computational approach. Calculating the posterior distribution in real problems (i.e., the solution of Eq. 5) is complex [53, 54] because of the integration on manifolds and the large dimension of the problems. The modified version of the numerical algorithm MCMC proposed by Sevieri et al. [4] is herein used to determine the posterior distribution. Given that MCMC is a sampling approach, it requires a large number of analyses, which are computationally expensive. When complex numerical models are used to simulate the structural behaviour, as in the case of concrete dams, the computational cost becomes prohibitive. The use of meta models within the numerical algorithm, rather than exploiting the FEA results, reduces the computational burden [55].

In this study, gPCE-based meta models of the system mode shapes \({\hat{\phi }}^\mathrm{gPCE}_{h,i}\left( \mathbf{x },\pmb {\uptheta }^\mathrm{gPCE}\left( \pmb {\uptheta }_{\text {el}}\right) \right)\) surrogate the FEA solutions \({\hat{\phi }}_{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}}\right)\) (Eq. 6). In order to build the surrogate model, orthogonal basis functions \(\pmb {\Psi }_{\alpha }\left( \pmb {\uptheta }^\mathrm{gPCE}\left( \pmb {\uptheta }_{\text {el}}\right) \right)\) are combined using coefficients collected in \(\mathbf{u }^{\left( \alpha \right) } \left( \mathbf{x } \right)\) [7].

$$\begin{aligned} {\hat{\phi }}_{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}}\right)&\approx \, {} {\hat{\phi }}^\mathrm{gPCE}_{h,i}\left( \mathbf{x },\pmb {\uptheta }^\mathrm{gPCE} \left( \pmb {\uptheta }_{\text {el}}\right) \right) \nonumber \\= & {} \sum _{\alpha \in {\mathbf {I}}}\mathbf{u }^{\left( \alpha \right) } \left( \mathbf{x } \right) \pmb {\Psi }_{\alpha }\left( \pmb {\uptheta }^\mathrm{gPCE} \left( \pmb {\uptheta }_{\text {el}}\right) \right) \end{aligned}$$
(6)

Constructing the response surface also entails defining the gPCE variables \(\pmb {\uptheta }^\mathrm{gPCE}\left( \pmb {\uptheta }_{\text {el}}\right)\) (which are a function of the elastic parameters of the material) and the finite multi-index set \({\mathbf {I}}\). The combination coefficients \(\mathbf{u }^{\left( \alpha \right) } \left( \mathbf{x } \right)\) are determined from the FEA solutions, by applying regression techniques [7] or the Bayesian approach [56]. By exploiting the orthogonality condition of the basis functions, the FEA output statistics (i.e., forward problem solution) can easily be derived through algebraic operations on the combination coefficients [7]. Likewise, a variance-based sensitivity analysis (i.e., Sobol’ indices) can easily be conducted by combining the coefficients collected in \(\mathbf{u }^{\left( \alpha \right) } \left( \mathbf{x } \right)\) [57], thus quantifying how each random parameter of \(\pmb {\uptheta }_{\text {el}}\) affects the variation of the final result.

Vibration-based damage assessment defined in the Bayesian setting has been used before in the field of SHM [58]. However, in the proposed framework the combination of Bayesian inference, gPCE-based meta models and the modified version of MCMC [4] provides a robust procedure that is applicable to concrete gravity dams.

Each step of the proposed SHM framework is detailed below.

4.2 The training phase

During the Training phase, observations recorded by the monitoring system are used to update through the Bayesian Inference the parameters of the predictive model of the mode shapes, which are related to the healthy state of the dam.

Considering l new observations, the likelihood function of Eq. 5 can be written as

$$\begin{aligned}&L\left( \mathbf{x },\pmb {\uptheta }_{\text {el}},\pmb {\Sigma }_{\upphi }, \pmb {\Xi }_{\text {el}}\right) \nonumber \\&\quad \propto \prod _{i=1}^l \frac{\exp \left[ -\frac{1}{2} {\mathbf{r }_i^{\upphi }}^\mathrm{T} \left( \mathbf{x },\pmb {\uptheta }_{\text {el}}, \pmb {\Xi }_{\text {el}}\right) \pmb {\Sigma }_{\upphi }^{-1}{\mathbf{r }_i^{\upphi }} \left( \mathbf{x },\pmb {\uptheta }_{\text {el}},\pmb {\Xi }_{\text {el}} \right) \right] }{\sqrt{|2 \pi \pmb {\Sigma }_{\upphi }|}}, \end{aligned}$$
(7)

where \({\mathbf{r }_i^{\upphi }} \left( \mathbf{x }, \pmb {\uptheta }_{\text {el}}, \pmb {\Xi }_{\text {el}}\right) = \left[ r_{1,i}^{\upphi ^{\text {U}}},\ldots ,r_{h,i}^{\upphi },\ldots ,r_{q \cdot m,i}^{\upphi }\right] ^\mathrm{T}\) are the residuals which express the difference between the ith observation of the experimental mode shapes and the related numerical prediction,

$$\begin{aligned} r_{h,i}^{\upphi } = {\phi }_{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}}, \pmb {\Sigma }_{\upphi ,\text {el}}\right) -{\hat{\phi }}_{h,i} \left( \mathbf{x },\pmb {\uptheta }_{\text {el}},\pmb {\Xi }_{\text {el}}\right) . \end{aligned}$$
(8)

The prior distributions of the material elastic parameters (Eq. 5) can be derived from the material test results, which are usually available for existing concrete dams. On the other hand, non-informative prior distributions can be used for parameters whit no physical meaning, such as \(\pmb {\lambda }\) and \(\mathbf{w }_{d}\) [41].

4.3 The detection phase

The Detection phase detects damage by comparing experimental mode shapes, currently observed by processing ambient vibrations with OMA, with those of the healthy dam simulated by the calibrated predictive model. The computational speed achieved through the surrogate model allows for the real-time structural control in this step.

Three fundamental aspects must be considered in the definition of the Detection phase: selection of the OMA technique, selection of the control frequency, and the definition of thresholds that reflect structural abnormal behaviours.

Signal processing and selection of the most appropriate OMA technique are fundamental steps for the accurate estimation of experimental mode shapes and frequencies. Even tough this topic is outside the scope of the present study, it can be stated that automated OMA techniques [11] are certainly appropriate for the installation of a continuous SHM system.

The control frequency is the time period between two subsequent controls of the health state of the structure. This important aspect is usually related to the computational power available and the storage capacity. The gPCE-based models adopted in this study considerably reduce the computational burden, thus the computational power required is no longer a problem. On the other hand, continuous control systems require more storage capacities than those related to event-triggered systems. Continuous SHMs with relatively high control frequency thus require more storage capacity than daily ones. Clearly the choice of the best control frequency cannot be generalised and it must be accurately evaluated by balancing available storage capacity and required structural safety.

Defining a suitable threshold related to a particular damage state involves the physical phenomenon and the accuracy of the observations and predictions. Regarding this latter, the proposed approach allows analysts to estimate the trust level in the discrepancy between recorded and predicted QI by updating the covariance matrix \(\pmb {\Sigma }_{\upphi }\) of the error terms. A probabilistic study of the physical phenomenon is required in order to correlate a variation in the monitored QI with a particular damage scenario. In fact, if there are no case studies, only numerical models can help to investigate how concrete dams react during a collapse. Due to the many uncertainties involved in this calculation, such a study should be conducted within a robust probabilistic framework. Since each concrete gravity dam is characterised by its own construction features and geometric characteristics, scenario analyses for the determination of suitable thresholds should be performed for every individual dam.

5 Case of study

5.1 Dam description

The proposed SHM framework is applied to a large concrete dam in Italy for which material test results and ambient vibration records are available. Since the observations were recorded during the normal use of the dam, they are related to a healthy state of the structure in this study. However, given that there are no information on how the dam would behave when damage occurs, a high-fidelity model is used to simulate such conditions.

The dam is situated in a high seismic area in Italy. It is 55 m tall and 199.20 m in length, with no vertical contraction joints. The near-field of the dam is made up of two different types of soil: an arenaceous mass and a marl mass.

As outlined in Sect. 4, the elastic parameters of the materials (i.e., the bulk moduli K and the shear moduli G) are treated as random variables. The concrete parameters are indicated with the subscript C, while those of the arenaceous soil and those of the marl mass are indicated with A and M subscripts, respectively.

Therefore, in this study the elastic model parameters are collected in \(\pmb {\uptheta }_{\text {el}}=\left[ K_C,G_C,K_A,G_A,K_M,G_M\right] ^\mathrm{T}\). In addition, assuming that they are log-normally distributed with mean values collected in \(\pmb {\xi }^{\upmu }_{\text {el}} =\left[ \upxi _{K_C}^{\upmu },\upxi _{G_C}^{\upmu },\upxi _{K_A}^{\upmu },\upxi _{G_A}^{\upmu },\upxi _{K_M}^{\upmu },\upxi _{G_M}^{\upmu }\right] ^\mathrm{T}\) and variances collected in \(\pmb {\xi }^{\upsigma }_{\text {el}} =\left[ \upxi _{K_C}^{\upsigma },\upxi _{G_C}^{\upsigma },\upxi _{K_A}^{\upsigma },\upxi _{G_A}^{\upsigma },\upxi _{K_M}^{\upsigma },\upxi _{G_M}^{\upsigma }\right] ^\mathrm{T}\), their hyper-parameter vector becomes \(\pmb {\Xi }_{\text {el}}=\left[ \pmb {\xi }^{\upmu }_{\text {el}},\pmb {\xi }^{\upsigma }_{\text {el}}\right] ^\mathrm{T}\).

Only the basin level variation is considered as a measurable variable \(\mathbf{x }\) in this application, because no information on temperatures and humidity was available. Moreover, the results of only one ambient vibration experimental campaign are available. The temperatures and humidity can thus be assumed to be constant during the test, and consequently their effects on the dam behaviour can be neglected.

5.2 Dynamic experimental campaign

The Training phase is performed by using the information acquired from the Italian Civil Protection in 2016 during an experimental campaign. The aim of the survey was the modal identification of the system using its modal characteristics obtained by processing ambient vibrations using OMA techniques. Fifteen high performance 1 Hz Seismometer LE-3Dlite (Lennartz Electronic GmbH), placed in the upper part of the structure, were used to record ambient vibrations. Two-hour recordings with a sampling frequency of \(f_s =200 \, \hbox {Hz}\) were performed. The data and results are presented in the experimental campaign report [59] which is available on-line [60]. In particular, the first three modes were identified (Fig. 3) by applying the Enhanced Frequency Domain Decomposition (EFDD) method with a basin level equal to 856.2 m.a.s.l., i.e. 40.8 m from the bottom of the dam.

Fig. 3
figure 3

Experimental mode shapes

5.3 Material test results and prior distributions

Several different experimental campaigns were carried out over time to investigate the quality of the concrete and soil. The results of the 2012 experimental campaign (Table 1) are used in this work for the definition of the hyper-prior distributions. The hyper-prior distributions of the mechanical characteristics of the material are assumed log-normal: those of the hyper-parameters \(\pmb {\xi }^{\upmu }_{\text {el}}\) are directly derived from the material test results, while those of the hyper-parameters \(\pmb {\xi }^{\upsigma }_{\text {el}}\) have mean values equal to the standard deviations obtained during experimental tests and coefficient of variations (CoVs) equal to 10%. The assumed values of CoVs avoid numerical problems in the algorithm for the inverse problem solution, therefore, in the absence of any information on them, it seems to be a reasonable choice. In addition, only \(\pmb {\xi }^{\upmu }_{\text {el}}\) are updated in the proposed procedure, then every reasonable choice on \(\pmb {\xi }^{\upsigma }_{\text {el}}\) does not affect the final result. The hyper-prior distributions of the mechanical parameters are reported in Table 2. The prior distributions of the parameters with no physical meaning, such as those of the covariance function, are modelled as non-informative.

Table 1 Material test results
Table 2 Hyper-prior distributions of the mechanical parameters

5.4 Construction of the FE model and gPCE surrogate model

The predictive models \({\hat{\phi }}_{h,i}\left( \mathbf{x },\pmb {\uptheta }_{\text {el}},\pmb {\Xi }_{\text {el}}\right)\) of the mode shapes of the dam (Sect. 4) are based on the gPCE, which is trained on the solution of the modal analysis of a FE model.

Starting from the topographic maps and the original technical drawings of the dam, the system dam-basin-soil model is constructed in a CAD program. The final geometry is then imported into ABAQUS 6.14 [61], the software used for the FEAs. The SSI is considered by modelling the near-field with as an elastic medium and using infinite elements [5] to reproduce the unboundedness of the soil domain. Acoustic elements are used to model the basin and its interaction with the dam body and the soil domain as well. Low reflecting boundary conditions [6] are used at the edges of the fluid domain to avoid the reflection of seismic waves.

The FE model is composed of 13,280 quadratic tetrahedral mechanical elements (C3D10) for the dam body and the foundation soil, and 237 linear hexahedral infinite elements (CIN3D8) for the soil boundary conditions. The number of the 4-node linear tetrahedron fluid elements (AC3D4) of the reservoir depends on the water level that is a measurable variable. Materials are assumed linear elastic, since the calculation of gPCE combination coefficients is based on the modal analysis.

Once the FE model is created, the gPCE must defined. Hermitian polynomials are used for the definition of the orthogonal basis functions, and the combination coefficients of the polynomial expansion (Sect. 4) are determined on the FEA outputs. The parameters sets of theses FEAs are randomly sampled from the prior distributions of the mechanical parameters, while the basin levels \(\mathbf{x }\) is deterministically discretised. The prior distributions used to for this task are assumed log-normally distributed with parameters derived from the mean values of \(\pmb {\xi }^{\upmu }_{\text {el}}\) and \(\pmb {\xi }^{\upsigma }_{\text {el}}\) (Table 2). The basin level varies from the bottom of the dam to the maximum level (55 m), with steps of 10 m, and, in order to improve the accuracy of the solution, also basin levels equal to 40.8 m (basin level during the experimental campaign) 55 m (maximum level) are considered. The discretization of the measurable variables depends on the aims of the study, then such a coarse discretization is acceptable only to show the feasibility of the method, while a much more refined discretization is well recommended for the implementation of a dynamic SHM. A total of 150 numerical analyses, characterised by different parameters sets randomly sampled from the prior distributions, are finally performed for each basin level. The FEA results are then used for the calibration of the gPCE coefficients by applying the procedure proposed by Rosić et al. [56]. A sensitivity analysis is performed to select the degree of the polynomial expansion, which minimise the gPCE error. After 200 new analyses, performed by randomly sampling from the previously defined prior distributions and from the set of the measurable variables, the mean values of the gPCE error are calculated for the first 17 frequencies of the system (Fig. 4) considering 3th, 4th and 5th polynomial orders. The error is herein defined as the difference between the FEA result and the related gPCE prediction divided by the FEA result itself. The sensitivity analysis shows that the 5th order polynomial expansion provides the smallest error both in terms of frequencies and mode shapes.

Fig. 4
figure 4

Relative errors of the mode shapes meta model of the Scandarello dam

5.5 The high-fidelity FE model

The damaged behaviour of the dam is simulated through a high-fidelity model having the same geometry of the one described in Sect. 5.4, but with a much more refined mesh and a non-linear constitutive model for the concrete. The mesh is composed of 15,936 quadratic tetrahedral mechanical elements (C3D10) for the dam body and the soil domain, and 6873 linear tetrahedral acoustic elements (AC3D4) for the basin. Moreover, 474 linear hexahedral infinite elements (CIN3D8) are used to model the unboundedness of the soil domain, while low reflecting boundary conditions are applied at the edges of the basin. The minimum mesh size is one tenth of the smallest wavelength of the seismic motion used in the seismic analysis described in the next sections.

As the high-fidelity model is deterministic, the basin level is at the maximum height, i.e. 55.5 m, thus considering the worst case scenario for the seismic analysis.

By assuming that the failure can occur only in the dam body due to tensile crack propagation, an elastic-plastic damage model (the concrete damage plasticity model) [62] is adopted for the dam concrete, while the soil is assumed linear-elastic. The parameters of the damage law and those of the plastic flow can be found in the scientific literature [37] and they are reported in Table 3. More specifically, \(\psi\) is the dilation angle, \(\varepsilon\) is is the eccentricity that defines the rate at which the flow potential approaches the asymptote, \({\sigma _{b0}}/{\sigma _{c0}}\) is the ratio of initial equibiaxial compressive yield stress to initial uniaxial compressive yield stress and \(K_\mathrm{c}\) is the ratio of the second stress invariant on the tensile meridian to that on the compressive meridian. In this study, the tensile strength is fixed equal to 1.45 MPa, while the fracture energy is assumed equal to 150 N/m.

Table 3 Plastic flow parameters

5.6 Training phase

In the Training phase, the experimental mode shapes described in Sect. 5.2 are used to calibrate the gPCE-based predictive models defined in Sect. 5.4. This fundamental step aims to reduce the uncertainties involved in the prediction of the dam behaviour and to estimate its error as well. In a real control system, the procedure described in this Section, which is applied only to a single observation, is repeated every time new data are available.

The predictive model parameters are updated by following the procedure described in Sect. 4.2. The procedure proposed by Sevieri et al. [4] is used to solve the inverse problem. This numerical algorithm, based on MCMC, requires setting up few parameters: the initial mechanical parameter set, the number of steps and the proposal distributions. The mean values of the hyper-prior distributions of \(\pmb {\xi }^{\upmu }_{\text {el}}\) are used to define the initial mechanical parameter set. Normal distributions with mean values equal to zero and standard deviations equal to 10% of the prior distribution ones are used as proposal distributions. The convergence of the algorithm, checked through the probabilistic metric proposed by Brooks et al. [63], is achieved in 10,000 steps (1000 burn-in).

Fig. 5
figure 5

Comparison between prior and posterior distributions of K and G

Table 4 Posterior distributions of the elastic parameters

Fig. 5 shows the comparison between prior and posterior distributions of K and G, while the posterior statistics are reported in Table 4. The posterior distributions reported in Fig. 5 are used for the definition of the gPCE (Sect. 5.4). The comparison between experimental and numerical frequencies shown in Table 5 confirms the good agreement between them. The good agreement between experimental and predicted mode shapes can be observed in terms of modal assurance criterion (MAC) [11]. Each MAC matrix element represents the correlation between an experimental mode and its prediction. MAC values close to 1 indicate a high correlation between the two modes. Figure 6 shows that only three modes have a sufficiently high correlation, while most numerical modes have low correlation with the experimental ones.

Table 5 Comparison between experimental and numerical frequencies

The inverse analysis algorithm adopted in this work automatically selects the correlated modes, thus solving the previously discussed mode matching problem.

Fig. 6
figure 6

MAC matrix between experimental frequencies and numerical ones calculated using the mode shapes predictive model

5.7 Detection phase

During the Detection phase, the observed experimental mode shapes are compared to those obtained from the gPCE-based predictive models. In this application, the comparison is the point-to-point difference between experimental and predicted mode shapes. As discussed in the Sect. 4.3 the comparison criteria, as well as the definition of the thresholds, depends on case study and the available information. In the absence of real records providing the modal characteristics of the damaged dam, the high-fidelity FE model described in Sect. 5.5) is used to simulate the experimental mode shapes related to three different damage scenarios.

The three DSs considered in this study are defined with regard to the development of tensile crack paths. Only crack paths with tensile damage variable of the concrete damage plasticity model \(d_t\) higher than 0.3 are considered. DS1 is thus achieved when cracks appear in one of the critical areas of Fig. 1 (localised damage), DL2 is achieved when crack paths grow toward the core of the dam (diffuse damage), and DL3 is achieved when widespread damage or crossing crack path appear.

The reference seismic event for the construction of the DSs is the magnitude 6.2 earthquake occurred on 24 August 2016 at 01:36 UTC in Central Italy. The first period spectral acceleration \(S_\mathrm{a}\left( T_1\right)\) is used herein as reference intensity measure of the ground motion. The seismic input recorded by the Amatrice station, characterised by \(S_\mathrm{a}\left( T_1\right)\) equal to 1.5622 g, is considered and scaled to the attainment of the three defined DSs. The scaled action is applied as boundary condition at the base of the near-field after a deconvolution step [64].

DS1 is achieved for \(S_\mathrm{a}\left( T_1\right) =0.2107\)g, DS2 is achieved for \(S_\mathrm{a}\left( T_1\right) =0.4328\)g and DS3 is achieved for \(S_\mathrm{a}\left( T_1\right) =0.472\)g. The three preselected DSs can be achieved only by reducing the original seismic input. The tensile strength, assumed equal to 1.45 MPa, is a conservative value if compared to the material test results, this fact justifies the results of the seismic analyses.

Fig. 7
figure 7

Damage scenarios, Detection phase

The DSs are reported in Fig. 7. Following the procedure discussed in Sect. 4.3, structural damage is detected by comparing the experimental mode shapes related to different DSs (obtained in this study from the high-fidelity mode) with the corresponding predictions (derived from the calibrated gPCE-based predictive models). Figure 8 shows the comparison between observations and predictions. The devices layout is the same adopted during the experimental campaign described in Sect. 5.2.

Fig. 8
figure 8

Comparison between experimental mode shapes and predictions, Detection phase

The results highlight that an abnormal behaviour of the system can be easily detected from the first three principal modes of the dam for DS2 and DS3 (Fig. 8). In both these cases, observed and predicted mode shapes show large differences. Whereas, for DS1 predictions and observations are relatively close, even if their difference is higher than the error of the prediction itself. The damage detection in case of DS1 is of primary importance for the structural control. Figure 9 shows the histogram of the discrepancy between predictions and observations for each device in case of DS1. The discrepancy is herein defined as the percentage variation of the prediction plus its standard deviation. Clearly the higher modes better indicate abnormal behaviour, even tough the maximum discrepancy reaches 1% in the middle of the dam crest. The monitoring system, including hardware and software components, should be designed in order to achieve this accuracy.

Fig. 9
figure 9

Discrepancy between predictions and experimental mode shapes for each device (DL1)

6 Concluding remarks

Concrete gravity dams are fundamental infrastructure for the sustainability of a country. Most of them were designed/built without concerns for their dynamic behaviour, therefore do not meet the seismic reliability levels required by our society.

Dynamic structural health monitoring (SHM) systems play a fundamental role both in terms of controlling concrete dams and in terms of reducing the uncertainties involved in the seismic assessment of such structures. Exploiting ambient vibrations as source of information is an interesting option in dam engineering. In fact, in this strategy continuous monitoring systems can be installed without requiring any additional device for the excitation of the dam, thus reducing the overall cost. However, only few applications of ambient vibration-based monitoring systems to concrete gravity dams are available in the scientific literature. None of them propose probabilistic frameworks.

A new probabilistic framework for ambient vibration-based SHM for concrete dams is proposed in this paper. With the proposed SHM, the structural health state of the dam can be controlled and the mechanical parameters of the numerical model can be updated. The uncertainties involved in the predictive models of the dam are reduced, thus improving the structural control itself and the estimation of the remaining life expectancy of the structure. The particular SHM architecture is made up of two different steps defined in the Bayesian setting: the Training phase and the Detection phase. During the Training phase, ambient vibrations are first processed through Operational Modal Analysis (OMA) methods to determine the modal characteristics of the system. These characteristics are then used to calibrate predictive models of the structural behaviour based on the general Polynomial Chaos Expansion (gPCE). The use of the gPCE for surrogating numerical models of the dam means that the procedure can be defined in the Hierarchical Bayesian framework without high-performance computing. In addition, the definition of the framework in the Bayesian setting reduces the uncertainties involved in the predictive models themselves.

The calibrated predictive models are then used in the Detection phase in order to assess damaged areas. This phase is based on comparing predictions with current experimental observations. The computational speed and accuracy achieved with the use of gPCE-based predictive models enables the proposed dynamic SHM to be used for real-time control of the structures. Another important aspect in this fundamental step is the definition of a threshold for the detection of abnormal behaviour in the dam. However, this topic is not among the aims of the present paper, which is more focused on the definition of the SHM framework.

We applied the proposed framework to a real large concrete dam located in a high-seismic area in Italy, for which real ambient vibration records and material test results are available. The case study shows the feasibility of our approach in real cases and has important implications. The use of gPCE-based meta models is fundamental for the implementation of the real time control, and the inverse analysis algorithm used for the solution of the Bayesian problem allows the mode matching problem to be solved, which is a crucial issue in dam engineering. The idea of defining a threshold with regard to different Damage Scenarios (DSs) is an interesting idea which allows intervention strategies to be set up. The accuracy of the SHM system should be increased as much as possible by optimising the device layout (i.e., decrease the required accuracy level for detecting a particular DS) or by using a more sensitive quantity of interest (such as the derivatives of mode shapes). We plan to investigate these topics in future works.