1 Introduction

At 05:46:18.1 UTC, March 11, the 2011 Tohoku earthquake (Mw 9.0) occurred as the largest earthquake in Japanese instrumental observation history. It triggered large tsunami that caused enormous damage to the Pacific coast of the Tohoku region of northeastern Japan, including 19,765 fatalities, 6242 reported injuries, and 2553 people reported missing (according to the Fire and Disaster Management Agency, as of March 1, 2023). The seismic waves, crustal deformation, and tsunami were recorded by the Global Seismographic Network, GNSS Earth Observation NETwork system, and Deep-ocean Assessment and Reporting of Tsunamis, respectively. The initial source models of the 2011 Tohoku-Oki earthquake have been proposed from instrumental observations (e.g., Ide et al. 2011; Ozawa et al. 2011; Satake et al. 2013). These models have the following common features: There was a large slip area off the coast of Miyagi Prefecture, with a maximum slip amount of 30–70 m, and the duration of fault rupture was approximately 3–5 min. After the earthquake and tsunami, tsunami surveys were conducted by joint research groups with the participation of 299 tsunami researchers, such as coastal engineers, seismologists, and geologists from 64 universities and institutes throughout Japan. As of December 29, 2012, a total of 5907 tsunami trace heights have been reported from the Pacific coast of Hokkaido to Kagoshima Prefectures (Fig. 1, Mori et al. 2011).

Fig. 1
figure 1

Final slip distribution of the 2011 Tohoku-Oki earthquake (Satake et al. 2013) and spatial distribution of tsunami trace heights (Mori et al. 2011). Gray bars, black dots, and error bars show the observations, mean, and standard deviation averaged at 0.1° intervals, respectively

The tsunami associated with this earthquake left tsunami deposits in the coastal lowlands along the Pacific coast of the Tohoku region. Thickness, sedimentological structure, and grain size distribution of the deposit have been carefully investigated and summarized by tsunami deposit researchers (e.g., Goto et al. 2011; Kusumoto et al. 2018). The sediment thickness and grain size of tsunami deposits vary greatly depending on the survey site, and they are closely related to the tsunami source and topography/bathymetry (e.g., Goto et al. 2014). Coastal vegetation also affects the character of tsunami deposits. For example, Kusumoto et al. (2020) quantitatively evaluated changes in deposition volume of tsunami deposits in areas with/without coastal forest. In addition, the research on tsunami-induced sediment transport has progressed greatly since the 2011 Tohoku-Oki event. Tsunami hydraulic experiments have been performed to establish transport formula for fine- to medium-grained sands (e.g., Takahashi et al. 2011), and large-scale calculations using a supercomputer have enabled the simulation of tsunami sediment transport in regional scale (e.g., Yamashita et al. 2016). The numerical simulation of tsunami sediment transport can be now utilized to study past tsunami events (e.g., Sugawara et al. 2019).

To forecast characteristics of future megathrust earthquakes occurring in subduction zones, it is important to know how far the fault rupture extended or how much strain energy was released during past events. However, it is difficult to estimate the magnitude of past tsunamis because it is often impossible to measure the run-up heights or distances and exact localities of the observation at the time of the tsunami attack, especially for events that occurred before instrumental observation. For those events, historical documents and geological evidence are exclusive records. Therefore, a technique for estimating the source of historical tsunami events from such non-instrumental tsunami records is needed. In this study, we examined whether it is possible to estimate the initial tsunami source model from comparisons of numerical simulations of tsunami propagation and tsunami sediment transport, tsunami trace heights, and sediment thickness distribution of tsunami deposits for the 2011 Tohoku-Oki earthquake. The 2011 event provides a valuable modern analog to validate the proposed method because the tsunami source has been estimated from instrumental records, and tsunami data such as trace heights and deposits are also available.

2 Data and methods

2.1 Documental records

Among the tsunami trace points surveyed by the Tohoku Earthquake Tsunami Joint Survey Group, 2364 tsunami inundation heights with the highest reliability (i.e., reliability rank A) are located in the range of 35° to 41° N. Imai et al. (2013) have investigated the dependence of alongshore distribution of tsunami trace data for tsunami source estimation and concluded that deviation from an optimal solution can be minimized if the alongshore data distribution is approximately the same as the size of the tsunami source, and the tsunami data are obtained at intervals of about 5% with respect to the size of the tsunami source. In this study, the tsunami data are averaged at 0.1° intervals (approximately 5% for fault length; Imai et al. 2013), yielding 60 ranges (Figs. 1 and 2).

Fig. 2
figure 2

Tsunami trace heights (upper panels), topography (middle panels), and the observed sediment thickness (lower panels) of the 2011 Tohoku-Oki tsunami deposits along each transect. a Idagawa Lowland, Fukushima Prefecture; b Sendai Plain, Miyagi Prefecture; and c Numanohama Marsh, Iwate Prefecture, Japan. Circles on the map indicate sampling locations, and the dotted curves show the limits of the tsunami inundation

2.2 Geological records

Numerical simulation of tsunami sediment transport is expected to reproduce the depositional process for a simple topography with few artificial structures. We chose three sites from previous researches on tsunami deposits with different topography: a reclaimed lagoon (Idagawa Lowland, Minami-soma City, Fukushima Prefecture; Kusumoto et al. 2018), an alluvial plain (Sendai Plain, Sendai City, Miyagi Prefecture; Abe et al. 2012), and a wetland in a V-shaped valley (Numanohama Marsh, Miyako City, Iwate Prefecture; Goto et al. 2017) (Figs. 1, 2). Table 1 summarizes the characteristics of the 2011 Tohoku-Oki earthquake tsunami deposits at each survey site. The deposits of the Idagawa Lowland and Sendai Plain are mainly composed of medium to coarse sand, while that of the Numanohama Marsh is mostly composed of gravel. The sediment thickness is ~ 40 cm at all study sites and abruptly becomes thinner with increasing distance from the coast.

Table 1 The 2011 Tohoku-Oki tsunami deposits

2.3 Numerical simulations

Tsunami inundation heights on the Pacific coast of Tohoku were calculated using the high-performance tsunami calculation code JAGURS (Baba et al. 2015, 2017). The two-dimensional nonlinear long-wave equations were solved for the bathymetry and topography of the entire Pacific coast of Tohoku. The topographic and bathymetric grid interval was set at 2 arcsec (Chikasada 2020), without using a nesting grid, and the computational time was 3 h from the occurrence of the earthquake. The computational time interval was set at 0.1 s to satisfy the Courant–Friedrichs–Lewy (CFL) condition.

For numerical simulation of tsunami sediment transport, an existing code (TUNAMI-STM; Yamashita et al. 2016), which consists of the two-dimensional nonlinear long-wave equations, and the advection equation of suspended load and continuity equations for bedload and suspended load layers (e.g., Takahashi et al. 1999, 2011), was adopted. Sand particles were mostly transported in suspension; therefore, the sediment deposition and erosion were closely related to flow velocity of a tsunami. An increase in flow velocity eroded the basement, and a decrease in flow velocity caused sand depositions.

Topographic and bathymetric data were resampled from 450 to 5.56 m for the nested grids based on the M7000 series (about 20 m grid intervals) and the digital elevation model (5 m grid intervals) published by the Geospatial Information Authority of Japan. The computational time was set to 3 h from the occurrence of the earthquake, and the computational time interval was set to 0.1 s to satisfy the CFL condition.

Saturation concentration, which gives variable limits to pickup rate of the suspended load, is considered using the turbulence energy balance model (e.g., Yamashita et al. 2016; Sugawara et al. 2019). The mean grain size was set to 0.267 mm at all survey sites, and the grain size-dependent parameters (i.e., coefficients for bedload rate (4.0) and pickup rate (4.40 × 10–5), settling velocity (0.0392 m/s), and critical shear stress (0.0383)) were based on Takahashi et al. (2011) and Gusman et al. (2018).

Coastal forest is distributed along the coastline of the Sendai Plain. The forest affected not only the flow velocity but also sediment transport by the tsunami (e.g., Kusumoto et al. 2020). Therefore, a composite equivalent roughness coefficient to include the tsunami damping effect by the forests (e.g., Aburaya and Imamura 2002; Kusumoto et al. 2020) was adopted in this study. Here, the forest cover and trunk width of the coastal forest were set at 0.39% and 0.13 m, respectively, based on survey results before the 2011 Tohoku-Oki earthquake (e.g., Harada et al. 2000; Imai et al. 2009).

2.4 Tsunami source models

The coseismic seafloor displacement is calculated from the fault parameters of the tsunami source model of the 2011 Tohoku earthquake, which was estimated from a comparison of the simulated and observed waveforms at wave gauge stations near Japan (Okada 1985; Tanioka and Satake 1996; Satake et al. 2013). The Kajiura filter (Kajiura 1963) is applied to calculate the initial sea surface displacement from the coseismic seafloor displacement.

The 55-subfault model with the delayed slip was set as a reference model (Model 1). Based on the reference model, other 11 models (Models 2–12) were prepared (Fig. 3, Table 2). The fault ruptures in Models 3 and 4 extended only to the deep and shallow parts of the plate interface, respectively. Models 5–12 had different numbers of subfaults. In Models 5–9, the slip amounts of each subfault were averaged from Model 1. In Models 9–12, the average slip amount was multiplied by 1.9, 2.0, 1.6, and 1.5, respectively, so that the tsunami trace height could be reproduced by the calculated tsunami height of Models 5–8. Because distribution and thickness of tsunami deposits are controlled by flow velocity, these models do not necessarily reproduce the tsunami deposits. The rise time was set at 300 s with variable start times of rupture on each subfault in Model 1, and in the other models, it was set at 30 s (Satake et al. 2013). The reproducibility of the simulation was evaluated by the geometric mean and standard deviation (K and κ; Aida 1978). The simulations match with the observations when the geometric mean or standard deviation is close to 1, an overestimation if the geometric mean is less than 1, and an underestimation if it is greater than 1.

Fig. 3
figure 3

The initial tsunami source models examined in this study. All models are based on the 55-subfault reference model (Satake et al. 2013)

Table 2 The initial tsunami source models in this study

3 Results and discussion

3.1 Trace heights

Figure 4 illustrates a comparison of observed tsunami trace heights and the simulated heights for each model. The simulated heights from Model 1 well match the observations. Although Model 2 cannot explain the maximum trace heights around 40° N, 37.5° N, and 36° N, the simulated heights are roughly consistent with observations at the other locations. The difference between Models 1 and 2 is attributed to the effects of rupture propagation which is represented by rise time of 30–300 s and variable start times of rupture. Both Models 3 and 4 underestimated the observations, indicating that the rupture should cover both shallow and deep segments at the plate boundary to explain the tsunami trace heights, in addition to the fact that these models have smaller overall seismic moment. The comparison of the simulated results from Models 5–8 shows that the reproducibility is low with a smaller number of subfaults. The geometric means of Models 9–12 are close to 1, but the standard deviations range from 1.35 to 1.49, indicating high variability compared with Model 1.

Fig. 4
figure 4

Comparison of the observed tsunami trace heights (mean: black dots, standard deviation: error bars) and the simulated heights (red bars). K and \(\kappa\) show the geometric mean and standard deviation of Aida (1978), respectively

Additional file 1: Figure S1 shows a comparison of observed tsunami inundation heights and the simulated heights for each model around the Idagawa Lowland and Sendai Plain. For the Idagawa lowland, Model 2 underestimated the tsunami inundation height near the coastline, compared to Model 1. Conversely, Model 2 overestimated the heights for Sendai Plain. This can also be attributed to the difference in fault rupture propagation.

3.2 Sediment thickness of tsunami deposits

Figure 5 shows the comparisons of the observed and simulated sediment thickness at the three sites (Fig. 2). The 2011 Tohoku-Oki tsunami deposits in the Idagawa Lowland were characterized by being thin in the center of the lowland (approximately 1.2–1.4 km from the coast) and being thick at both sides (~ 1.0 km and ~ 1.5 km) (Fig. 2, Kusumoto et al. 2018). The simulated sediment thickness mostly produced mountain-shaped distribution with a peak at around 1.5 km from the coast, and the above observed feature (thin in the center) could not be reproduced. Model 2 had the geometric mean closest to 1, followed by Model 1. Models 4, 6, 7, and 8 showed a characteristic trend of landward thinning of sediment. In these models, sediments were not transported into the lowland, probably because of smaller slip amounts in the southern fault area. The result of Models 9–12 indicates that a large slip is required in the southern part of the tsunami source for sediment to be transported inland. The geometric means of Models 9–12 approached 1 when the average slip amount was increased, but they were still greater than 1, indicating underestimation on average.

Fig. 5
figure 5

Comparison of the observed and simulated sediment thickness for 0.267 mm in mean diameter at a Idagawa Lowland, b Sendai Plain, and c Numanohama Marsh

The 2011 Tohoku-Oki tsunami deposits on the Sendai Plain were characterized by being thickest near the coastline and thinning rapidly with distance from the coast (Fig. 2, 5b). The simulated sediment thickness in Models 1 and 2 well matched the observations. Model 4, which only ruptured the subfaults near the trench axis, produced the deposition only about 1 km from the coast; therefore, it is not suitable as an initial tsunami wave source from a viewpoint of sediment thickness of tsunami deposits. Among the models with a smaller number of subfaults (Models 5–8), the geometric mean was closest to 1 for Model 8. The reproductivity may depend strongly on wavelength of the tsunami. The tsunami generated by Model 7 is dominated by short wavelengths, so the inundation area is limited to the coastline and sediment movement is also limited. Conversely, the tsunami generated by Model 8 has a relatively long wavelength. Therefore, the tsunami-inundated deep inland and simulated sediment thickness match the observations. The geometric means of Models 9–12 were all less than 1, indicating that the simulated results overestimated the observations. This result is opposite to that for the Idagawa Lowland, suggesting that the sediment thickness distribution of tsunami deposits strongly depends on the topography of the survey site.

The 2011 Tohoku-Oki tsunami deposits of the Numanohama Marsh were about 20 cm thick up to 0.6 km from the coast. This feature cannot be explained by the simulated results, although Model 2 produces sediment transportation into the land. This is because the tsunami deposits were composed mostly of gravels, and the calculation formula for suspended load and bedload layers adopted in TUNAMI-STM may not be applicable to gravels.

3.3 Combination of trace heights and tsunami deposits

Table 3 summarizes the results presented in Sects. 3.1 and 3.2 for the Sendai Plain and the Idagawa Lowland. Models 1 and 2 successfully reproduced both the tsunami trace heights and the sediment thickness distribution of the tsunami deposits. Model 1 considered the slip propagation toward the north based on analyses of tsunami waveforms. However, the trace heights and tsunami deposits contain no information on temporal changes of tsunamis, so the model with instantaneous slip (Model 2) may be more appropriate for estimating the initial tsunami source from trace heights and tsunami deposits. Other models with the same moment magnitude but smaller numbers of subfaults (Models 5–9) could not reproduce the observed trace heights, and the reproducibility of the sediment thickness distribution strongly depended on the slip distribution, as indicated by Models 8 and 11. Models 9–12, which simply increased the slip amounts of Models 5–8 (and the resultant moment magnitude of 9.1), could reproduce the coastal trace heights; however, the simulated sediment thickness was underestimated for the Idagawa Lowland while overestimated for the Sendai Plain.

Table 3 Comparison of the geometric means of observed trace heights or tsunami deposits to the simulation results

Finally, we discuss the applicability of the methods to historical tsunami events. Although less information is available for older tsunami events, our finding that a smaller number of subfaults lead to lower reproducibility indicates the importance of adopting subfaults and increasing the number of observed data points, for example, by reexamining historical materials and by conducting tsunami deposit surveys. Rather than simply increasing the amount of slip, it will be necessary to set a large slip area and compare the observations and numerical simulation results. Gravel-dominant tsunami deposits are outside the applicability of the formulas for suspended sediments and bedload incorporated in TUNAMI-STM. Therefore, the survey sites used for estimating the initial tsunami wave source should be limited to those with sand-dominant tsunami deposits. If a gravel-dominant survey site is used, a formula for the mixed sediment transport of gravels and sand should be used.

4 Conclusions

We examined whether it is possible to estimate an initial tsunami source model for the 2011 Tohoku-Oki event by comparing numerical simulation results for tsunami propagation and sediment transport with measured trace heights and observed sediment thickness of tsunami deposits. Models 1 and 2, based on 55-subfault models obtained by a previous tsunami waveform inversion (Satake et al. 2013), could successfully explain both the tsunami trace heights and the sediment thickness distribution of the tsunami deposits. In other words, the accuracy of the initial tsunami source estimation could be improved by combining numerical simulations of tsunami propagation and sediment transport with tsunami trace heights and sediment thickness of tsunami deposits. The models with fewer subfaults had lower reproducibility, even if the overall moment magnitude was the same. Therefore, it is important to increase the number of subfaults as well as increasing the number of trace points, for example, by reexamining historical materials and conducting tsunami deposit surveys. We conclude that the combination of trace heights, sediment thickness of tsunami deposits, and numerical simulations of tsunami propagation and sediment transport can be used to estimate historical earthquakes and tsunamis.