Introduction

Tsunami data assimilation is a promising approach for tsunami forecasting. It predicts the tsunami waveform by assimilating offshore observed data into a numerical simulation, without calculating the initial sea surface height at the source (Maeda et al. 2015). An optimum interpolation method (Kalnay 2003) is adopted in data assimilation to compute the tsunami wavefield and to forecast the tsunami arrival time and maximum amplitude along the coast. This method has been successfully applied to observed tsunami waveforms of the 2012 Haida Gwaii Earthquake (Gusman et al. 2016) using bottom pressure gauge array data in the Cascadia subduction zone and a hypothetic tsunami in the Tohoku region (Maeda et al. 2015) using synthetic data based on the Seafloor Observation Network for Earthquakes and Tsunamis (S-net). If the observations are located in source regions, a new assimilation method developed by Tanioka (2018) can be used to solve the problem of non-hydrostatic effects and reproduce tsunami height distribution accurately.

In previous applications of the tsunami data assimilation method, the linear long-wave (LLW) tsunami propagation model was used (Maeda et al. 2015; Gusman et al. 2016; Wang et al. 2017). The LLW model is based on the long-wave approximation (Satake 2015). When the horizontal scale of motion, or the wavelength of the tsunami, is much larger than the water depth, the vertical acceleration of water is negligible compared with gravity. The horizontal motion of the water mass is a good approximation uniform from the ocean bottom to the surface. Then, the phase velocity only depends on the depth of the ocean.

However, the long-wave approximation breaks down when the wavelength of the water height distribution is not much greater than the water depth (Saito et al. 2010). For example, if an outer-rise earthquake fault has a large dip angle, the initial sea surface distribution will be enriched in short-wavelength components, which could not be simulated properly with the LLW model (Saito and Furumura 2009; Zhou et al. 2012). Moreover, landside tsunamis have smaller horizontal scales than tsunamis generated by earthquakes, and they present more evident dispersion effects (Marchenko et al. 2012). The LLW model may forecast the arrival time of a tsunami peak to be earlier than the real tsunami and may overestimate the maximum height of the tsunami waveform (Watada et al. 2014; Gusman et al. 2015; Ho et al. 2017). Therefore, a dispersive (DSP) tsunami model based on the Boussinesq equations should be used to compute tsunami waveforms with dispersive characteristics. The DSP tsunami model has been successfully used in forward tsunami simulation (Furumura and Saito 2009; Zhou et al. 2012) and tsunami waveform inversion analysis to estimate the initial water height distribution (Saito et al. 2010, 2011).

Until now, there has been no application of the DSP model in tsunami data assimilation. Because the tsunami data assimilation method proposed by Maeda et al. (2015) calculates the wavefield at every time step, an application of the DSP model with this method would lead to an extremely high computational cost and therefore fail to provide timely tsunami early warnings. To speed up the data assimilation process, Green’s function-based tsunami data assimilation (GFTDA) was proposed to reduce the computational time for tsunami early warning (Wang et al. 2017). The Green’s functions for data assimilation can be calculated and stored in advance. During the assimilation process, the waveforms at points of interest (PoIs) can be calculated by a simple matrix manipulation. The GFTDA enables us to conduct tsunami data assimilation with a linear dispersive model.

Great earthquakes have recurred along the megathrust fault between the Philippine Sea Plate and the Eurasian Plate along the Nankai Trough (Saito et al. 2010; Furumura et al. 2011). Such great earthquakes of recent years include the 1946 Tonankai (M7.9) and 1944 Nankai (M8.0) earthquakes. In addition, a large (M 7.4) earthquake occurred off the Kii Peninsula within the subducting Philippine Sea Plate in 2004. The tsunami generated by the 2004 off the Kii Peninsula earthquake exhibited evident dispersion characteristics at offshore stations such as MPG1 and MPG2 (Saito et al. 2010). One of the important features of tsunami generation is that the dispersive waves have a strong directional dependence with respect to the fault strike (Saito et al. 2010). Therefore, the dispersive tsunami developed efficiently toward the direction of the above offshore stations.

In order to monitor the earthquakes and tsunamis in the Nankai region, Japan Agency for Marine-Earth Science and Technology (JAMSTEC) set up the Dense Oceanfloor Network System for Earthquakes and Tsunamis (DONET). It is an elaborate system of cables and various devices and consists of 52 observation points in DONET1 and DONET2 (Kaneda 2010). Such a dense observation network enables us to forecast a tsunami along the Nankai coast by the method of tsunami data assimilation.

In this paper, we propose to use GFTDA to forecast the tsunami based on the linear DSP model. We compute synthetic tsunami observations in the Nankai region using the source parameters of the 2004 earthquake and forecast the tsunami along the coast of Shikoku Island and Kyushu Island. We also compare the forecasted results of arrival time and maximum height by assimilation using the LLW and DSP models.

Methodology

Linear DSP tsunami model

Numerical simulations of 2-D DSP tsunami equations have been conducted on high-performance computers and personal computers to simulate dispersive tsunami waves (Saito et al. 2010). The equations are derived from the continuity equation and the equation of motion for water waves. In Cartesian coordinate, they are:

$$\begin{aligned} & \frac{\partial h}{\partial t} = - \frac{\partial M}{\partial x} - \frac{\partial N}{\partial y}, \\ & \frac{\partial M}{\partial t} + gd\frac{\partial h}{\partial x} = \frac{1}{3}h^{2} \frac{{\partial^{2} }}{\partial x\partial t}\left( {\frac{\partial M}{\partial x} + \frac{\partial N}{\partial y}} \right), \\ & \frac{\partial N}{\partial t} + gd\frac{\partial h}{\partial y} = \frac{1}{3}h^{2} \frac{{\partial^{2} }}{\partial y\partial t}\left( {\frac{\partial M}{\partial x} + \frac{\partial N}{\partial y}} \right), \\ \end{aligned}$$
(1)

where M and N are the x and y components of flux, or velocity integrated along the vertical direction from the sea bottom to the sea surface, h is the tsunami height at the sea surface, d is the water depth, and g is the gravitational acceleration.

The right-hand sides of the second and third equations are linear dispersive terms, which cause the dispersion effect (Saito and Furumura 2009; Saito et al. 2010; Maeda et al. 2016). In the LLW model, we neglect the dispersion terms, so the right-hand sides of these two equations become zero. Although the linear DSP model is more complicated, the linearity still enables us to use GFTDA in our research.

Optimal interpolation method

We adopt the optimal interpolation method for tsunami data assimilation, as in the previous studies of Kalnay (2003) and Maeda et al. (2015). The details of the method are described in these two papers. In this method, we assume the total number of computational grid points is L, and the total observation number is m. The tsunami wavefield at the nth time step is represented as a \(\left( {3L \times 1} \right)\) column vector \(\varvec{x}_{\varvec{n}} = \left( {h\left( {n\Delta t,x,y} \right),M\left( {n\Delta t,x,y} \right),N\left( {n\Delta t,x,y} \right)} \right)^{\text{T}}\).

The data assimilation process consists of two steps: a propagation step and an assimilation step. The propagation step is expressed as

$$\varvec{x}_{n}^{\text{f}} = \varvec{Fx}_{n - 1}^{a} ,$$
(2)

which means that at every time step, the forecasted tsunami wavefield \(\varvec{x}_{n}^{\text{f}}\) is simulated by solving the tsunami propagation equations using the assimilated wavefield in the last time step \(\varvec{x}_{n - 1}^{a}\). Here, \(\varvec{F}\) refers to the tsunami propagation matrix (\(3L \times 3L\)) that corresponds to the tsunami propagation model. The assimilation step is expressed as

$$\varvec{x}_{n}^{a} = \varvec{x}_{n}^{\text{f}} + \varvec{W}\left( {\varvec{y}_{n} - \varvec{Hx}_{n}^{\text{f}} } \right),$$
(3)

where the observation matrix \(\varvec{H}\) (\(m \times 3L\)) is a sparse matrix that extracts the forecasted tsunami heights at the m points from the entire simulated tsunami wavefield. The residual is calculated by comparing it with the real observed tsunami height \(\varvec{y}_{n}\). The weight matrix \(\varvec{W}\) (\(3L \times m\)) is an important controlling factor for the quality of tsunami assimilation (Maeda et al. 2015). It is calculated by minimizing the covariance matrix as a solution of the linear system

$$\varvec{W}\left( {\varvec{R} + \varvec{HP}^{\text{f}} \varvec{H}^{\text{T}} } \right) = \varvec{P}^{\text{f}} \varvec{H}^{\text{T}} ,$$
(4)

where \(\varvec{P}^{\text{f}} = \left\langle {\varepsilon^{\text{f}} \varepsilon^{\text{fT}} } \right\rangle\) and \(\varvec{R} = \left\langle {\varepsilon^{\text{O}} \varepsilon^{\text{OT}} } \right\rangle\) are the covariance matrices of the forward numerical simulation and the observations, respectively. Here, ɛf represents the errors in numerical forecasts between two computational grids and ɛO represents the observational errors (Kalnay 2003; Maeda et al. 2015). The weight matrix is then multiplied by the residual to bring the assimilated tsunami wavefield closer to the observed wavefield. By alternatively repeating the propagation and assimilation steps, the tsunami wavefield is assimilated.

Green’s function-based tsunami data assimilation (GFTDA)

In order to speed up the data assimilation process and use the linear DSP model, we use the GFTDA proposed by Wang et al. (2017). The Green’s function Gi,j is defined as the waveform at the jth grid point resulting from the propagation of the ith station’s assimilation response. We compute the Green’s functions between the fixed observation stations and PoIs with both the LLW model and the linear DSP model. The computation of Green’s functions might be quite time-consuming, but this is done in advance. Moreover, this will not affect the efficiency of the data assimilation process. Then, during the assimilation process, we can directly synthesize the forecasted tsunami waveforms by multiplying the residual by the corresponding Green’s functions.

Green’s function

The bathymetry and topography dataset are derived from the General Bathymetric Chart of the Ocean (GEBCO). The grid data released in 2014 (GEBCO_2014) with a grid spacing of 30 arc s (Weatherall et al. 2015) are used.

The finite difference method (FDM) with the implicit scheme is used for numerical simulation. We use a grid spacing of 30 arc s and a time step of 1 s. The target area is 30°N–35°N, 130°E–140°E, so the total grid number is 600 × 1200 = 720,000.

The JAGURS tsunami code (Baba et al. 2015) is used to compute the Green’s functions between the observation points and PoIs and also between different observation points. We use 15 observation stations and nine PoIs, as described in the next section. Hence, the total number of Green’s functions is 15 × (9 + 15) = 360. The parameters for optimal interpolation are similar as those used in a study by Maeda et al. (2015).

Application and results

Observation stations and forecast points

We used the proposed method to simulated data for the Nankai Trough. DONET has 13 science nodes, and each node is linked with several bottom pressure observation points. In order to build an evenly distributed observation network for data assimilation, we take one point for each node except for Node C, for which we take two observation points. In addition, we use the submarine cable stations PG1 and PG2, also maintained by JAMSTEC. In total, 15 observation stations are used for data assimilation (Fig. 1). However, because DONET1 was completed in 2011 and DONET2 started operation in 2015, we lack a real tsunami record for the 2004 off the Kii Peninsula earthquake. To assess the ability of our data assimilation approach, we use synthetic tsunami data from the 2004 earthquake source model. In real practice, tsunami data could be obtained by removing the tide signal and high-frequency seismic signal from the observation record. The method of Tanioka (2018) can also be applied to the stations in or around the source area.

Fig. 1
figure 1

The observation network for data assimilation and near-shore PoIs. The observation points contain 13 DONET stations, PG1 and PG2, which are marked with red circles. Nine PoIs near Shikoku Island and Kyushu Island are marked with blue triangles. The focal mechanism is plotted according to Yamanaka (2004)

The nine points of interest (PoIs) are selected near population centers on the coast of Shikoku Island and Kyushu Island (Fig. 1), which are under the potential threat of tsunami disaster. They are used to compare simulated waveforms and waveforms predicted by data assimilation.

Source models

In our numerical simulation, we use the mainshock source model of the 2004 off the Kii Peninsula earthquake. The fault parameters are set according to the unpublished results of Yamanaka (data available at http://wwweic.eri.u-tokyo.ac.jp/sanchu/Seismo_Note/2004/EIC153.html) by analysis of the teleseismic body waves. The epicenter is 137.142°E, 33.143°N, and the depth is 10.0 km. The fault direction is perpendicular to the trough axis with a strike of 135°. The dip angle is 40°, and the rake angle is 123°. The length and width of the rectangular fault are 50.0 km and 30.0 km, respectively. The fault slip is 6.5 m, which is consistent with the magnitude of the main shock (M7.4).

We use Okada’s model to calculate the initial sea surface elevation in an elastic half-space (Okada 1985), which can be used as the initial condition for numerical simulation. Here, we only consider the vertical displacement. If the tsunami source is on a steep seafloor and the horizontal motion is much larger than the vertical motion, horizontal displacement will become important for tsunami generation (Tanioka and Satake 1996).

Assimilation setting

We use the JAGURS tsunami code to calculate the tsunami propagation. We consider only linear terms and assume reflective boundary at the coastline. To make the tsunami propagation closer to the real situation, we apply the linear DSP model, similar to that used to compute the dispersive Green’s functions.

We set the earthquake origin time as t = 0. The observation stations of the assimilation network are not far from the epicenter of the Kii Peninsula earthquakes. Hence, the tsunami arrives at the nearby stations of KMC09, KMC21, and KMD13 soon after the earthquake (Fig. 2). We set that the data assimilation process to begin at t = 0. The assimilation time window is defined as the period during which we use synthetic observation for assimilation. During the assimilation time window, the waveforms at PoIs are synthesized with the Green’s functions and the simulated observation.

Fig. 2
figure 2

Distribution of 15 observation points and the waveforms of synthetic tsunamis at each point. The tsunami arrives at the stations of KMC09, KMC21, and KMD13 soon after the earthquake. The data assimilation process begins at the time of earthquake

We apply the GFTDA algorithm with both the LLW and linear DSP models. The length of the time windows is set to range from 2 to 24 min, with an interval of 2 min. The calculation time for the data assimilation process is less than 10 s, almost negligible on the EIC computer system at the Earthquake Research Institute, the University of Tokyo.

Waveform comparison

Figure 3 demonstrates the comparison between simulated waveforms and waveforms predicted with an assimilation time window of 20 min. The waveforms predicted using both the LLW model and the linear DSP model agree well with the simulated waveforms. This proves the validity of data assimilation based on the observation network of DONET, PG1 and PG2. For the forecasting the first tsunami peak amplitude, the LLW model and the linear DSP model have similar performances. In the coastal PoIs of Murotomisaki, Awaji, and Abuyuki, the maximum amplitude forecasted by the linear DSP model tends to be closer to the simulation than that forecasted by the LLW model, though the differences are quite small. The main difference between the two models lies in the arrival time. In almost every PoI, especially Tosashimizu and Murotomisaki, the predicted waveform using the linear DSP model has a more accurate arrival time of the first tsunami peak.

Fig. 3
figure 3

Simulated waveforms (black lines) and waveforms forecasted by data assimilation at nine near-shore PoIs. The water depth of each PoI is provided. The forecasted waveforms are calculated by GFTDA using the LLW model (blue lines) and the linear DSP model (red lines). The time window of data assimilation is 20 min

Quantitative evaluation

To quantitatively evaluate the performance of two models in data assimilation, we calculate the tsunami forecast accuracy (Gusman et al. 2016) for different assimilation time windows. It is based on the geometric mean ratio (K) of the observed (Oi) and simulated (Si) maximum amplitude of the first tsunami peak for the ith station (Aida 1978),

$$\log \left( K \right) = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} { \log }\left( {\frac{{O_{i} }}{{S_{i} }}} \right),$$
(5)
$${\text{Accuracy}}\,\left( \% \right) = \frac{1}{K} \times 100\% \,\left( {K \ge 1} \right) \;{\text{or}}\; K \times 100\% \,(K < 1),$$
(6)

where N is the number of stations. Generally, a high accuracy value could indicate accurate forecasting of the tsunami data assimilation.

The accuracy for various assimilation time windows is plotted in Fig. 4a. The shapes of the forecasting accuracy curves for both the LLW model and the linear DSP model are quite similar. In the beginning, the time window is only 2 min, and soon after the earthquake, the accuracy of both models is very low. Because the first tsunami peak has not passed through any observation stations of our network, the data length used for assimilation is too short to provide accurate forecasts. Then, at 4 min, there is a sharp increase in the accuracy for both models, with the forecast accuracy exceeding 85% (Fig. 4). After that, the forecast accuracy varies slightly but exhibits a rising trend in general. Although there is not a large difference in forecast accuracy between the LLW model and the linear DSP model, the forecast accuracy of the LLW model is slightly higher. After the time window of 20 min, the first tsunami peak has already passed all of the observation stations. The forecast accuracy becomes saturated and stops increasing. Here, the values of forecasting accuracy calculated by both models are also similar, but the linear DSP model performs slightly better.

Fig. 4
figure 4

Forecast accuracy a and time lag b of the two models for various assimilation time windows. The forecast accuracy is used to evaluate the forecasted maximum amplitude of the first tsunami peak (Aida 1978). The time lag is used to examine the forecasted arrival time (Tsushima et al. 2012)

The difference in arrival time of the first tsunami peak between the two models is more evident. To quantitatively analyze the accuracy of the forecasted arrival time, we use the method of calculating time lag proposed by Tsushima et al. (2012). The time lag of the ith coastal station is defined as:

$$\Delta T_{i} = t_{i}^{S} - t_{i}^{O} ,$$
(7)

where t O i is the arrival time of the first tsunami peak observed at the ith station and t S i is the arrival time forecasted by data assimilation. A negative time lag indicates that the forecasted arrival time is earlier than the observation. A small absolute value of time lag indicates accurate forecasting of the arrival time. We calculate the time lag at all PoIs and calculate the average value.

Figure 4b clearly shows that the time lags calculated by the two models are clearly negative, which means that both the LLW model and the linear DSP model forecast the tsunami arrival times earlier than observed. Moreover, as the assimilation time window increases, the time lag becomes closer to zero. The tendency of the variation of time lag is the same as that of forecasting accuracy. As more observed data are used in data assimilation, the absolute value of time lag decreases quickly from the end of the 2-min time window to the 4-min time window. Then, it decreases slowly. After the time window of 20 min, the variation finally becomes very small. It is important to note that the difference between the LLW model and the linear DSP model is noticeable in the figure. The linear DSP model leads to a much smaller time lag than the LLW model, indicating that the linear DSP model performs better in forecasting tsunami arrival time.

Discussion and conclusion

In our application, we use two linear models, the LLW model and the linear DSP model. The accuracy of the forecasted maximum amplitude and arrival time depends on the length of the assimilation time window. It is important to choose a proper assimilation time window for both models. In our application to the 2004 off the Kii Peninsula earthquake, a time window of 14 min is a practical choice in order to produce a reliable and early tsunami forecast. At this time, the forecast accuracy of both the LLW and linear DSP models exceeds 96%, and the average time lag of tsunami arrival time is − 58.1 s (LLW) and − 25.7 s (DSP). Figure 2 shows that the first tsunami peak reaches the nearest PoI (Murotomisaki) around 35 min after the earthquake and reaches other PoIs more than 40 min after the earthquake. Using the tsunami height and arrival time of PoIs as input parameters, the shoreline tsunami height or inundation forecasts will be possible by applying the tsunami run-up models (Liu et al. 2009).

Because the calculation time in the GFTDA process is negligible, the tsunami forecast can be made 14 min after the earthquake based on the data assimilation. On the other hand, it is not practical to use a time window of more than 20 min. Though the forecasting accuracy becomes even higher and the time lag becomes even smaller, the tsunami forecast may not be useful if it is made shortly before the tsunami arrival.

The results also suggest that the tsunami propagation model could affect the accuracy of tsunami forecasting by data assimilation. For the maximum amplitude of the first tsunami peak, two models perform similarly. However, with respect to the arrival time, the linear DSP model has a better accuracy than the LLW model. The average time lag calculated by the linear DSP model is evidently smaller. For individual stations, if the station is located near Shikoku Island, which is close to the observation network, the difference in lag time is not so large. However, if the station is located near Kyushu Island which is approximately 200 km from the observation network, the difference in lag time becomes noticeable. This is caused by the limitation of the long-wave approximation. The limitations of the long-wave approximation may not be so evident when the wavelength of tsunami is sufficiently long. However, for the Kii Peninsula earthquake considered in this paper, the large dip results in short-wavelength components of the tsunami, and such an approximation would overestimate the velocity of tsunami propagation (Saito et al. 2010). Thus, the arrival time forecasted by the LLW model becomes quite earlier than the DSP model results, especially for PoIs far from the observation network, because a longer propagation distance will increase such errors.

Compared with the previous data assimilation method, the simple assimilation process of GFTDA enables us to apply more complicated models, apart from the LLW model. As long as linearity is assumed, the GFTDA is proven to be mathematically equivalent to the previous data assimilation approach (Wang et al. 2017). For potential tsunamis with more dispersion characteristics, or tsunamis that propagate over a longer distance from the observation network, the difference between dispersive and non-dispersive models is more important. Therefore, the linear DSP model would be able to make more improvements in forecasting the arrival time accurately and to mitigate the disasters of potential destructive tsunamis generated by outer-rise earthquakes.