Predicting site-specific storm wave run-up

Storm wave run-up causes beach erosion, wave overtopping, and street flooding. Extreme runup estimates may be improved, relative to predictions from general empirical formulae with default parameter values, by using historical storm waves and eroded profiles in numerical runup simulations. A climatology of storm wave run-up at Imperial Beach, California is developed using the numerical model SWASH, and over a decade of hindcast spectral waves and observed depth profiles. For use in a local flood warning system, the relationship between incident wave energy spectra E(f) and SWASH-modeled shoreline water levels is approximated with the numerically simple integrated power law approximation (IPA). Broad and multi-peaked E(f) are accommodated by characterizing wave forcing with frequency-weighted integrals of E(f). This integral approach improves runup estimates compared to the more commonly used bulk parameterization using deep water wave height H0\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$H_0$$\end{document} and deep water wavelength L0\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_0$$\end{document} Hunt (Trans Am Soc Civ Eng 126(4):542–570, 1961) and Stockdon et al. (Coast Eng 53(7):573–588, 2006. 10.1016/j.coastaleng.2005.12.005). Scaling of energy and frequency contributions in IPA, determined by searching parameter space for the best fit to SWASH, show an H0L0\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$H_0L_0$$\end{document} scaling is near optimal. IPA performance is tested with LiDAR observations of storm run-up, which reached 2.5 m above the offshore water level, overtopped backshore riprap, and eroded the foreshore beach slope. Driven with estimates from a regional wave model and observed βf\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta _f$$\end{document}, the IPA reproduced observed run-up with <30%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$<30\%$$\end{document} error. However, errors in model physics, depth profile, and incoming wave predictions partially cancelled. IPA (or alternative empirical forms) can be calibrated (using SWASH or similar) for sites where historical waves and eroded bathymetry are available.


Introduction
As sea level rises, high shoreline water levels will increase in frequency and severity. Total shoreline water level (TWL) contains components with wide ranges of time and space scales, including sea-level rise, climatic and seasonal cycles, tides, oceanic eddies, storm surge, and local waves (e.g., Idier et al. 2019;Woodworth et al. 2019). When several components co-occur, high TWL threatens coastal infrastructure. In northern hemisphere mid-latitudes, continental west coast TWL hazards largely come from high waves combined with high tides, rather than from tropical and extratropical storm surges important on east coasts (Rueda et al. 2017). On US West Coast beaches, wave run-up and setup are estimated to contribute more than 50% to extreme TWL (Serafin et al. 2017). In consideration of the significant impact and associated uncertainties, the wave component of TWL has been identified by the California Climate Change Assessment (Kalansky et al. 2018) as research needed for coastal management and adaptation.
The most southern city in California, Imperial Beach (IB), is historically impacted by flooding driven by coincident high waves and tides. IB lies at the tip of a once natural sand spit formed by sediment from the nearby Tijuana River (Flick 1993). To counter wave-related erosion and flooding, approximately 344,000 m 3 of offshore sand was added to the city's shoreline in Fall 2012 (SANDAG 2011;Ludka et al. 2018). Despite this adaptation strategy, IB continues to experience erosion and street flooding during high waves and tides. Gallien (2016) observed and modeled a 2014 flooding event from a winter storm with moderate waves and long period swell, and found large errors in the flooding extent predicted by traditional empirical models.
We develop an IB storm wave run-up (the wave-driven component of TWL) climatology by forcing the numerical model SWASH with waves from a 17-year offshore wave hindcast, on a typical winter depth profile based on 10 years of bathymetry observations. SWASH has been extensively tested in the laboratory and field on gentle and steep slopes (e.g., Nicolae Lerma et al. 2017;Smit et al. 2014;Suzuki et al. 2017;Fiedler et al. 2018, and others). Lacking site-specific observations of storm wave run-up, we create a model runup data set to calibrate an integrated power law approximation (IPA), relating frequency-weighted integrals of the incident sea-swell spectrum to bulk runup statistics. IPA is an emulator, a numerically simple empirical model of a more complex (here SWASH) model (e.g., Reichert et al. 2011). Uncertainty in offshore boundary conditions and depth h(x) translate into IPA uncertainty. IPA can be used for runup forecasts at sites where a spectral storm wave climatology and typical eroded bathymetry are available.
Runup parameterizations are reviewed and the IPA developed in Sect. 2. The wave and bathymetry climatology, and storm wave selection, are discussed in Sect. 3. The SWASH model configuration and the key role of boundary conditions are described in Sect. 4. IPA results are discussed in Sect. 5, and compared with few hours of observed storm run-up in Sect. 6. Model uncertainty and approaches to error reduction are discussed in Sect. 7.

Review
Runup models range from highly complex (Smoothed-Particle Hydrodynamics and Volume of Fluid solvers) to empirical equations based on bulk wave parameters. Highly accurate dynamical models solve the Reynolds-averaged Navier-Stokes (RANS) equations, with some approximations for breaking (e.g., Lara et al. 2011;Torres-Freyermuth et al. 2010). Laboratory experiments in wave flumes with known boundary conditions have demonstrated the accuracy of these advanced models, but at high computational cost, for example requiring 30 h to compute 300 s of a numerical simulation (Lara et al. 2011). Phase-resolving models solving the nonlinear shallow water (NLSW) equations include nonlinear random wave physics and are numerically faster but less exact. Despite limitations, phase resolving models have been shown to accurately simulate wave shoaling, breaking, and run-up in both the field and laboratory (e.g., Rijnsdorp et al. 2015;Pomeroy et al. 2012;Suzuki et al. 2017, and others). Although numerical models for run-up have been validated in controlled laboratory and field settings, they require information often poorly known in practical applications. For example, specification of shoreward propagating infragravity waves at the model offshore boundary, and the beach profile both above and below the water line. Empirical models, on the other end of the accuracy and ease-of-use scale, can estimate run-up using only bulk properties of incident sea and swell (wave height and period), with surfzone bathymetry h(x) characterized by at most a single foreshore or surfzone slope (e.g., Hunt 1961;Battjes 1974;Holman 1986;Stockdon et al. 2006;Gomes da Silva et al. 2020, and others). Wave effects at the shoreline are usually separated into wave setup ( , a super-elevation of the mean shoreline water level), and swash (S, the oscillatory up-and down-rush of waves). Swash is often further decomposed into two frequency bands: sea-swell (SS, 4-20 s) and infragravity (IG, 20-250 s). Assuming Gaussian distributed run-up, , S IG , and S SS are combined to estimate a wave-induced total water R 2%,G (the level exceeded by 2% of wave up-rushes), Total water level (TWL) superposes wave effects on SWL, the still water level without waves. In field studies, SWL is often taken as the water level observed seaward of the surfzone, so that SWL includes both tides and other processes (e.g., subtidal shelf waves) effecting coastal water levels, Using video observations on 11 beaches, Stockdon et al. (2006) (hereafter S06) calibrated empirical relationships for wave runup components that depend on √ H 0 L 0 and f , where H 0 and L 0 = (g∕2 )f −2 p are the deepwater significant wave height and wave length, respectively, g is gravity, f p is the peak frequency, and f is the foreshore beach slope: (2) TWL = R 2%,G + SWL. (3) where 1 , 2 , 3 are best-fit dimensionless constants. All 3 terms depend on , and S SS and are proportional to f . Substitution in (Eq. 1) with i from S06 yields where the factor of 1.1 corrects for non-Gaussian runup statistics. In the present work, in both observations and models, R 2%,G is based on swash spectra (Eq. 1) without a correction factor, and is considered a metric of extreme run-up. Individual wave run-ups, defined for example with zero upcrossings, are not considered. Equation 4 is widely used, including in the US Federal Emergency Management Authority's flood mapping guidance (Federal Emergency Management Agency (FEMA) 2015).
Empirical formulae for run-up and setup are abundant, with many alternate forms proposed prior to and since S06. In their "non-exhaustive" list, Dodet et al. (2019) Hunt 1961) to characterize the SS runup response to incident wave forcing (e.g., Eq. 4). There is less agreement on the functional form for S IG and . Best-fit parameter values for a given model vary between beaches, and no single parameterization is uniformly superior on all beaches.
In Eq. 3, the E(f) peak frequency f p is used to estimate L o , and bimodal E ss yield unrealistic (unstable) results when f p varies in time between the sea and swell peak. Definitions of "seaswell frequency" based on integrals of E(f) are stable, with different definitions weighting E(f) differently via a ratio of spectral moments (e.g., O'Reilly et al. 2016;Poate et al. 2016, and others). However, H 0 is still based on combined sea and swell energy. In Sect. 2.2 below, the (generalized) spectral equivalent of H 0 L 0 is effectively integrated over sea-swell frequencies.
Simple empirical models are necessarily scattered about observations owing to the extreme simplification of incident waves and beach parameters (e.g., Stockdon et al. 2006;Dodet et al. 2019). For example, setup depends on h(x) across the surfzone, and not only on f (e.g., Longuet-Higgins and Stewart 1962;Stephens et al. 2011). Additionally, only bulk H 0 and f (peak, mean, or otherwise) appear in empirical S IG , but S IG depends on incident wave frequency-directional spectra (e.g., Guza and Feddersen 2012;Ruju et al. 2019, and others). Despite these shortcomings, empirical equations have predictive skill, especially if has been calibrated at the prediction beach. In this case, i is essentially tuned to the site h(x) and wave characteristics (e.g., frequency and directional spread).

Frequency-integrated power-law approximation (IPA)
A generic empirical power law form for the energy of the wave-driven components of TWL at the shoreline is Equation 5 includes many existing empirical runup formulas as special cases. For example, with the commonly used S IG = √ H 0 L 0 (Eq. 3), the corresponding shoreline E IG ≈ S 2 IG ≈ H 0 L 0 and l = 0, m = 0.5, n = −2 (Eq. 6). In S06, S SS depends linearly on f , so shoreline E SS has l = 2, m = 0.5, n = −2 . With setup proportional to H 0 , independent of f and (e.g., Guza and Thornton 1982;Ruggiero et al. 2001), l = 0, m = 1∕2, n = 0 . Runup energy E IG proportional to offshore energy flux EC g (Inch et al. 2017) corresponds to m = 1, n = −1 . Ji et al. (2018) parameterized setup with l = 0.54 , m = 0.31 , and n = −0.74 . Very low-frequency IG energy ( 0.0005 < f < 0.003 Hz) observed in a harbor was best fit with m = 0.9 , and n = −3 near the harbor mouth (Cuomo and Guza 2017). Note for run-up E IG and E SS , is dimensionless only when m = 1 and n = 0 . For setup, is dimensionless when m = 1∕2 and n = 0 . The different dynamics and length scales of S SS , S IG , and can result in different (l,m,n) for each component. Here, Eq. 5 is used in an integrated power law approximation (hereafter IPA), where Best-fit values of , m and n are determined separately for each component ( , IG, and SS, Sect. 5). Values of l are either 2 or 0, following (Stockdon et al. 2006). The integral on the left side of Eq. 6 corresponds to the SWASH-modeled runup spectrum at the 10-cm water depth threshold and the right hand side uses offshore sea-swell from a regional wave model. Skill R 2 = 1-NMSE, where with o the observations and p the prediction.

Historical waves and bathymetry at Imperial Beach
The study transect, oriented approximately East-West directly offshore of Cortez Ave ( Fig. 1), is included in an ongoing long-term monthly monitoring program by Scripps Institution of Oceanography (SIO) (Ludka et al. 2019

Idealized bathymetry
The Cortez Ave cross-shore transect (MOP line D0045) has been surveyed 95 times between 2008 and 2016. Observations were interpolated and smoothed (mapped) as described in Ludka et al. (2019). A typical eroded profile is defined as the average of the mapped eroded profiles (n = 26), which have a horizontal mean sea level (MSL) shoreline position landward of the mean MSL shoreline position ( x = 0 m) by 10m or more. Eroded profiles occurred before the 2012 nourishment, and a few years after nourishment. Run-up on recently nourished profiles is not considered here. For use in the SWASH model, the average eroded profile (Fig. 1c) was extended to h = −15 m with constant slope ( = 0.01 ), and smoothed with a 50-m Gaussian filter from x = −450 m to −20 m, where the mean sampling is relatively coarse, and with 20-m smoothing elsewhere. The riprap above ∼4 m is not included in the model h(x). Instead, the beach slope at Fig. 1 a Location map, with MOP D0045 line (red) extending ∼1.2 km offshore. b overwashing of riprap (rock structures fronting condos) during the high water level on January 18, 2019. D0045 is shown in red. The image is mirrored for consistency with other subplots, such that true south appears north. Photograph: Rob Grenzeback. c Depth versus cross-shore distance on D0045; representative eroded profile (purple), typical lidar profile (red), historical eroded profiles (light gray). Dotted lines indicate tide levels and extrapolation above the rip rap ( x > 45 m ). An ADV ("PUV") was located at x ∼ −300 m. d Lidar-derived subaerial bed level versus cross-shore location for different times on January 18, 2019. The curve colors correspond to vertical shading in the inset. (Inset) La Jolla pier end water level (left axis, red) and foreshore slopes (blue dots, right axis) versus time elevations greater than 4 m is extended above 4 m (thick dashed lines in Fig. 1). In doing so, the riprap is assumed to be an extension of the sandy beach; porosity, infiltration, and overtopping are not considered. Two concave up foreshore profiles are used; (1) an averaged eroded profile (average f =0.08), and (2) a steeper (average f =0.12), lidar-derived profile from a January 2019 storm. The profiles are identical from x = −50 m seaward (Fig. 1c).

Idealized storm waves
Spectral wave hindcasts ( f = 0.04 to 0.25 Hz) are obtained from the MOPS model (O'Reilly et al. 2016). MOPS, a linear wave propagation model, uses the CDIP buoy network to produce hourly hindcasts of coastal waves. Tides are not included. We used hourly 1D (frequency) wave spectra from January 1, 2000 to Dec 31 2017, 157,800 total hours. In Southern California, MOPS wave hindcasts are on the 10-m depth contour seaward of the typical surfzone (MOPS does not include wave breaking). MOPS simulates spatially variable island shadowing, sheltering by changing shoreline orientation, and the local effects of shallow shoals and reefs. Ideally, the linear MOPS is used for non-breaking waves over possibly 2D bathymetry seaward of the surfzone, and SWASH 1D is applied to 1D, nonlinear surfzone and run-up. For this application to the most energetic few percent of waves ( Fig. 2), MOPS is used to initialize SWASH 1D in 15m depth (see Sect. 4.2). The effect of 2D bathymetry shallower than 15 m is not included. Deep water wave height H 0 is obtained by reverse shoaling the E(f) spectrum at 10m water depth. Deep water wavelength is Frequency spread f sp is used as a metric of E(f) narrow bandedness, For large offshore √ H 0 L 0 or H 0 (Fig. 2), the spread f sp is usually relatively small. SWASH is used to simulate run-up for the low f p (Fig. 2a) and small f sp typical of the largest IB winter storms. During the 17-year hindcast, less than 1% ( ∼ 1427 h) of the total have A total of 439 storm cases were selected, 295 with high √ H 0 L 0 > 25 m and an additional 144 with high H > 2 m (and √ H 0 L 0 < 25 m). For each √ H 0 L 0 > 25 m data bin, the centroid frequencies were randomly sampled to obtain up to 20 unique values spanning the minimum to maximum observed f c . The high H s > 2 m cases were binned in 0.5 m wide bins, and unique peak frequencies selected randomly from each bin, excluding the already selected cases with

Model configuration
SWASH is run in 2D mode with a narrow, 2-m alongshore mesh, effectively emulating a 1D profile but with more detail resolved by the variable grid spacing available in 2D mode. The grid spacing decreased from 2 m offshore to 25 cm in the swash zone. As a compromise between computational efficiency and model accuracy, the model uses 2 vertical layers (e.g., Suzuki et al. 2017;. The water level is constant at mean high water (MHW, 1.34 m NAVD88). Tidal effects are not included. To avoid repeating a random wave sequence in the 60-min simulation, the coarse E(f) SS spectrum from MOP (variable df= 0.005-0.05 hz) was interpolated to finer frequency resolution df = 0.00028 hz, (1/df = 60 min). Background viscosity of 0.1e −4 m 2 /s improved numerical stability. Bottom friction was parameterized using the Manning-Strickler formulation, with the default Manning roughness coefficient of 0.019 s m −1∕3 , a value that worked well at a nearby Southern California beach . Breaking parameters were = 0.6, = 0.3 and = 0.25 (Smit et al. 2014). A firstorder upwind scheme is used for vertical advective terms, with high-order interpolation for second-order accuracy in the horizontal advective terms using the MUSCL limiter. The ILU preconditioner with the Keller Box scheme was used to capture more accurately the physics of large waves. All model runs were 60 min with 10 additional minutes of spin up.
Runup statistics were obtained from the cross-shore and vertical location of the runup edge, defined using 1-h records and a 10-cm threshold depth. Varying the threshold has small effect on SS and IG swash, but increased threshold reduces setup (Appendix 1). Vertical runup spectra were computed with 10-min Hanning windows with 50% overlap, typically translating into roughly 10% statistical chatter in run-up H IG (Elgar 1987;. This statistical noise is small relative to boundary condition and other uncertainties.

Offshore boundary condition
SWASH requires specification at the offshore boundary of shoreward propagating sea-swell and IG waves. The nonlinearity of long surface gravity waves in shallow water is characterized with the Ursell number Ur = (a∕h)∕(kh 2 ) , with a, k, and h the wave amplitude, wavenumber, and water depth, respectively (Ursell 1952). In 10-m water depth, the average Ursell number of the considered storm waves is 0.53. The applicability range of the underlying theory that Ur << 1 was sometimes substantially exceeded, leading to unrealistically large 1D-bound waves for SWASH initialization. Initializing in 15-m water depth reduced the average to Ur = 0.23 . We use the 1D MOP energy spectrum with random SS phases. H 0 L 0 and frequency spread, c counts of √ H 0 L 0 (over all spreads) versus √ H 0 L 0 , d mean frequency (see color bar) versus significant H 0 (back-refracted to deepwater) and frequency spread, e counts (over all spreads) versus √ H 0 L 0 . The January 2019 waves (Fig. 1), star in b and d, was relatively energetic, low frequency with moderate spread. Red lines in b-e indicate the threshold used for modeled storm events ( Offshore E IG was either set to zero, or to a second-order, 1D boundwave (Hasselmann et al. 1963). These boundary conditions can be inaccurate in the field, but more accurate information for IG waves is generally lacking (Fiedler et al. 2019). In detailed laboratory and field studies, shoreward propagating sea-swell and IG waves at the offshore model boundary observations are known, and used to initialize SWASH (e.g., Rijnsdorp et al. 2015). Weakly nonlinear wavemaker theory has been used to generate the theoretical 1D-bound IG wave given by Stokes-like perturbation theory (Hasselmann 1962). However, in the field, infragravity waves seaward of the surfzone are a mix of free and forced directionally spread (2D) waves (Okihiro and Guza 1995), complicating specification of IG boundary conditions. A co-located pressure-current meter at the offshore boundary provides an approximate 1D boundary condition initialization that includes phase-coupled and free infragravity waves (Fiedler et al. 2019). However, in the present application, incident sea-swell spectra are provided by the MOPS regional wave propagation model. MOPS does not include IG waves, and IG waves are not measured accurately by local wave buoys.
Lacking the IG components of the incident wave spectrum at the offshore boundary (15-m depth), two contrasting BC are implemented here; a theoretical 1D boundwave ( E IG = Bound), and zero incident IG energy E IG = 0 . Fiedler et al. (2019) showed that, with energetic sea-swell, 1D bound theory over-estimated the observed E IG at the 11-m depth boundary by as much as a factor of 4, while E IG = 0 (of course) underestimated the BC observations. Given proper initial conditions, the phase-resolving model is able to accurately capture the transfers of energy throughout the surfzone between sea-swell and infragravity waves. The over-and underestimations at the offshore BC converged to a common E IG in the model inner surfzone, and run-up was not very sensitive to the offshore IG BC. At IB, variations in run-up predicted using these two different BC are therefore considered part of prediction uncertainty.

Results
The 439 large H and √ H 0 L 0 storm wave cases are run on the two different profiles with two different IG-component boundary conditions for a total of 1756 SWASH model runs. Each incident wave case and f yields different SWASH results for E IG = 0 and E IG = Bound, indicating sensitivity to the unknown offshore BC.

Components: IG, SS, and
Contours of IPA skill in (m,n) space for shoreline , E ss and E IG (Fig. 3) show that with offshore bathymetry and tide level held constant, a range of (m, n) values have high skill (e.g., dark red in Fig. 3). Note that the skill numbers shown here encompass all boundary conditions and slopes. An example that further explores slope dependence for IG swash is shown in Fig. 4, and slope and boundary condition input in Fig. 5a-c. Optimal (m, n) values do not vary wildly between shoreline , E IG and E SS , i.e., the red zones are similarly located in (m,n) space. The optimal (m,n) values (A in Fig. 3a-c) correspond to for IG, SS and , respectively, where E(f ) r refers to the "runup" energy. The in (Eq. 5) are dimensional. The left-hand side units are m 2 in Eqs. 10 and 11, and m in Eq. 12. The units of result in the same units for the right-hand side. Deviation in SWASH, via slope dependence or boundary input, degrades the skill of the IPA. The optimal fit for (m,n) maximizes model skill for each runup component (parameterization A (Fig. 3). Parameterization D ( m = 1, n = −1 ), a proportionality with SS incident energy flux (Inch et al. 2017), works well for IG, but poorly for and SS. Perhaps coincidentally, the harbor seiche parameterization C ( m = 0.45, n = −1.5 ) (Cuomo and Guza 2017) performs well for all components. Performance similar to optimal is provided by the integral equivalent of √ H 0 L 0 (Parameterization B), where m = 0.5, n = −2 for E IG and E SS (equivalent to Eqs. 10 and 11, and m = 0.25, n = −1 for : 3), based on bulk parameters, also predicts a ratio that depends on only , but with a different constant ( E SS /E IG ≈ 140 2 using default parameter values). The values in Eqs. 13-15 are specific to the low-frequency swell and beach profiles characteristic of Imperial Beach winter storms. When values in S06 (Eq. 3) are optimized, the fit improves above the skill using default S06 values, but is still lower than with integral forms (Table 1, discussed below). Using tuned parameter values, E SS /E IG = 65 2 . Miche (1951), Carrier and Greenspan (1958), Hunt (1961), Stockdon et al. (2006) and many others have shown that SS swash generally depends on beach slope (the exception is Here, held at constant tide stage, f , defined as ±2 around the mean wave setup, varies by less than 5% about the mean values (0.12 and 0.08) owing to the variable cross-shore swash extent. Including the f term in the SS parameterization (Eq. 11) yields R 2 = 0.57 , whereas assuming no f dependence yields R 2 = 0.20 . During a January 2019 storm (Sect. 6), the lidar-observed f varied between 0.14 to 0.10 over a single high tide, and the observed run-up S SS ∕2 decreased approximately proportionally from about 1.6 to 1.0m, further supporting the f term in (Eq. 11) . Wave setup, a super-elevation of the mean water level, is supported by the cross-shore decrease in the wave radiation stress associated with wave breaking (Longuet-Higgins and Stewart 1962, and many others). Shoreline setup depends on surfzone bathymetry,

Fig. 5
Empirical models (Eq. 12) versus the SWASH modeled extreme run-up ( R 2%,G ) using (left column) the E IG = 0 boundary condition and (middle column) the Boundwave boundary condition, and (right column) combined conditions, colored by foreshore slope ( f either 0.08, purple or 0.12, red). The 1:1 line (black) is bounded in light gray by ± rmse, binned in 20-cm increments. The left and middle columns are fit to their respective data sets, using the IPA parameterization, derived by fitting to each boundary condition separately. E IG = 0 at the offshore BC shows more scatter than the boundwave. These BC are lower and plausible upper bounds on the incident IG energy; dotted lines indicate a 20% error threshold in the parameterization. Also shown are the tuned S06 (Table 1,  including sand bars and terraces (e.g., Raubenheimer 2001;Stephens et al. 2011). Here, was approximated with the same bulk parameters H 0 , L 0 , f as SS and IG. The best-fit integral form ( R 2 = 0.85 , not shown), unlike S06, does not depend on foreshore beach slope. Including a beach slope term reduced the skill from 0.85 to 0.54 because the model setup is determined by h(x) across the entire surfzone, and is little altered by these spatially limited changes in f (Fig. 1). The dependence of on a typical eroded h(x) is included implicitly in (Eq. 12).

Total run-up R 2%,G
The best-fit IPA , n, m for all components of total run-up yields R 2 =0.83 and rmse=0.20 m (Fig. 5c, Table 1). The H 0 L 0 IPA only modestly alters skill (0.70) and rmse (0.26m) from  (light), as shown in legend. Red and purple colors indicate the steep and shallow slopes ( f ∼ 0.12 or f ∼ 0.08 , respectively). Note that RMSE is based on S/2, and vertical scale on R 2%,G (d) differs from the individual components (a-c). Tuning (lighter colors) reduces S06 model error using default parameter values (darker colors). In c, the vertical stratification of errors by f (compare red and purple) is consistent with errors in the S06 f dependence (Eq. 3), motivating the present use of Eq. 12, that lacks a f term optimal (Fig. 5i), reinforcing previous findings that H 0 L 0 parameterizations work reasonably well for swash. The separation of total run-up by slope reveals a slight divergence of fits, especially for the calibrated S06 model. This can be attributed to the poor fits for setup on the two different slopes (Fig. 6c). Calibrating the S06 model improves, but cannot resolve, the error stemming from the f term in setup. Uncalibrated S06 has the largest errors for the steeper slope (dark blues, Fig. 6). Tuning S06 to the Cortez location reduces the R 2% rmse to 0.44m , but does not altogether remove the stratification of setup and SSswash components by bathymetry. Note that f = 0.14 is steeper than used to calibrate S06. In addition to the slope errors, the two different boundary conditions yield slightly different fits, which when combined can be attributed to errors in unknown IG forcing offshore. Using both boundary conditions yields a roughly 20% prediction error in total runup level (black dotted lines, Fig. 5).

Application to a January 2019 storm event
Run-up was measured with lidar line scans at Cortez Ave (Fig. 1b), during the peak and ebb of a spring high tide on January 18, 2019 from 12:55 to 17:30 UTM (Fig. 7). Observations were split into nine 28-min long records. Lidar returns were gridded by median value into 10-cm horizontal bins, following Brodie et al. (2012). The estimated runup lines were extracted at the 10-cm depth threshold. Noise from breaking wave foam and splashing were smoothed with a Savitzky-Golay filter. When waves occasionally overtopped the rip-rap, Table 1 Goodness of fit statistics fitting the various parameterizations to the SWASH modeled run-up for incident swash ( S SS ), IG swash ( S IG ), setup ( ), and extreme run-up R 2%,G The statistics are the root-mean-square error (rmse), the skill ( R 2 , Eq.7), slope ( ) of the best-fit linear regression (Eq. 3). Note that the , l, m, and n values for the spectral parameterizations correspond to the E SS , E IG , and values from Eqs. 10-15 1 3 a continuous smooth runup (and rundown) line was extrapolated using an objective map with Gaussian covariance. The swash zone beach profile for each 28-min record was defined by the minimum vertical elevation in each horizontal bin. Cross-shore locations with vertical variance greater than 1 cm of the roughly 100 s moving minimum value were classified as water and removed from the profile. The foreshore beach slope f decreased from 0.14 to 0.10 over the 5-h record (Fig. 8d), where f is determined with bathymetry around ± 2 , with 2 the runup variance (Stockdon et al. 2006).
Hourly MOP waves (D0046) were used to estimate E(f), H 0 , L 0 , and f (H 0 L 0 ) 1∕2 for each 28-min lidar record (Fig. 8). These steep slopes yield an Iribarren number 0 = f (H 0 ∕L 0 ) −0.5 ∼ 1.5, corresponding to plunging breakers, not the spilling waves present further offshore, where is much lower (e.g., 0.01). Using E(f) in IPA (Eqs. 10-12) yields estimates of S ss ∕2 and that both typically differ by < 0.2 m from observations (Fig. 7a, c). Although S IG is over-estimated by as much as a factor of 2 (Fig. 7b), S IG < S SS and S ss is predicted relatively accurately.  Table 1) with MOP input (variable f , dotted black; and constant f = 0.1 , yellow), tuned S06 parameterization with MOPS input with variable f (dotted light blue) and f = 0.1 (green). The vertical bars on observations correspond to 95% statistical confidence Thus, relatively large errors in S IG do not necessarily translate into large R 2%,G errors. Furthermore, errors of opposite sign in and S IG partially cancel, and total waveinduced water level R 2%,G is predicted with rms error = 0.25 m, about 10% (Fig. 7d). For this event, calibrated S06 ('S06t') and IPA yield very similar results. S06t uses best-fit values (Eq. 3), given in Table 1. With constant f =0.1, rather than the observed variable f , the observed variation in S ss is not simulated (dashed curve in Fig. 7). The falling tide (SWL in Fig. 8c) and decreasing f (and hence decreasing S SS , Fig. 7a) both contribute to the decrease in R 2%,G after high tide, Fig. 7e).

Discussion
During the event for which onsite observations were available, IPA over-prediction of S IG (Fig. 7b) is eliminated by using incident (IG and SS) waves observed in 7 m depth, rather than estimated with the MOP model and E IG = 0 or 1D bound, as above. However, uncertainty in subaqueous h(x) continues to limit SWASH accuracy, and errors in R 2%,G are similar using the observed PUV to drive SWASH, and the MOPS-driven IPA emulator.
Lidar profiles (Fig. 9) were merged with the offshore portion of the representative eroded profile (Fig. 1) to obtain h(x) to 8 m depth (6 m, navd88, Fig. 9), where SWASH is initialized. Three plausible offshore profiles were tested, based on historical surveys, with each 28-min model run using the time-varying lidar-derived foreshore (Fig. 9). Profile 2 is monotonic, with a constant depth terrace-like feature from x = 0 to x = −100 m. Profile 3 represents extreme erosion for this site, and is from an El Niño year with a well-developed offshore bar. Profile 1 has a much smaller bar, located closer to shore.
In situ PUV (between 7 and 8 m depth) show H s = 2.0-2.6 m with peak frequency varying from 0.055 to 0.062 Hz. MOP hindcasts in 15 m depth predict H s = 3.01-3.74 m with constant f p = 0.06 Hz (starred locations in Figs. 2, 8a). The PUV was likely sometimes in the surfzone, perhaps contributing to the reduced H s compared with MOPS. Differences may also be the result of poor MOPS model skill for this particular event.
Modeled and observed bulk run-up agreed well when PUV input is used with Profiles 2 and 3 (Fig. 10, red and yellow lines), rmse = 18.3 cm). However, the shallow bar in Profile 1 (blue line in Fig. 9) resulted in large underprediction of sea-swell swash and setup Three depth profiles (thick curves) used for SWASH modeling of the January event. Profile 3 has a large offshore bar characteristic of a strongly erosive winter, whereas Profiles 1 and 2 represent more moderate winters. All profiles use the time varying lidar-observed profiles observed in the swash zone (dashed box at x, y > 0 m, detailed in Fig. 1d). Swash is initialized in either 6m (PUV) or 15m depth (Fig. 1c). Dotted horizontal lines denote the observed highest and lowest mean offshore water levels (e.g., SWL) (blue lines in Fig. 10), perhaps because the bar sheltered the shoreline. Similar to previous studies (e.g., Raubenheimer 2001), setup and SS swash can be sensitive to inner surfzone bathymetry. Suppressing lidar-observed changes in swash bathymetry using the lidar first Observed runup components (28-min records) versus time for 5 h: a sea-swell swash SS/2, b IG swash IG/2, c setup and d total run-up R 2%,G , and e SWL and foreshore beach slope f . Shown are lidar observations (black circles), SWASH model with PUV input (Profiles 1, blue; 2, red; and 3, yellow), and the new IPA parameterization with MOPS input (dashed black line). Numerical models (colored lines) use the time-varying lidar-observed foreshore beach slope (e, solid magenta), whereas the IPA uses a constant slope of 0.1 (dotted magenta line) record ( Fig. 8a) with Profile 2 (Fig. 8b) introduces error in IG swash and setup that propagate into R 2%,G (not shown), increasing the rmse from 18.3 to 41 cm.
Using the lidar-observed f , the integrated power law approach for R 2%,G performs well on the January 2019 storm event (Fig. 7), with rmse = 18.3 cm (Sk = 0.55). The site-specific tuned S06 model yields similar rmse = 19.6 cm (Sk = 0.49). Note that f = 0.14 used in IPA is steeper than the slopes (0.08 and 0.12) used for model calibration. The sizeable bias in IG swash for both empirical models could result from using a representative (rather than observed) offshore h(x) bathymetry in IPA calibration, and from MOPS model overestimation of SS waves at the offshore boundary.
Using the observed f , uncalibrated S06 (Eq. 4) over-predicts wave run-up with approximate bias of ∼ 1.4 m (not shown), primarily from setup error. The resulting skill is negative, with rmse=1.16 m. Although the choice of runup depth threshold alters estimates of the observed runup parameters, this overestimate is larger than can be explained by decreasing the threshold depth from 10 to 3 cm (Appendix 2).

Conclusion
Improved prediction of shoreline erosion, wave overtopping, and oceanfront flooding will improve the efficiency of both emergency response and long-term planning. On the US West coast, tides and waves are the largest contributors to elevated nearshore water levels. Sea-level rise is a confounding future factor. Tidal amplitudes and phases are well-known, and seasonal and local deviations (e.g., "tidal residuals") are routinely available. In comparison, wave-driven contributions to total water level are difficult to predict in both magnitude and timing (Kalansky et al. 2018). The MOPS-based training data set for IPA (Fig. 2) is intended only to mimic the range of historically plausible events, not to simulate accurately any individual event. An offshore water level of MHHW is assumed, and tides are not considered further. In practical application, tides are necessarily included in R 2%,G . The accuracy of R 2%,G predictions (IPA or otherwise) therefore depends on both storm wave timing and spectral characteristics. Storm impacts can vary dramatically depending on the Mean % Difference, relative to = 3 cm , f = 0.12 , f = 0.08 S/2, f = 0.12 S/2, f = 0.08 tide stage (e.g., spring versus neap and high versus low). The present discussion (Figs. 3, 4, 5, 6) considers model scatter caused by uncertainty in the IG boundary conditions and h(x). Uncertainty in SS predictions is not included. More complete error bars, including errors in MOP estimates, will be included in the future.
Setup depends on h(x) across the surfzone (Bowen et al. 1968;Longuet-Higgins and Stewart 1962;Raubenheimer 2001). Simulations of setup using an accurate h(x) will outperform parametric models using only beach face f . Swash also depends on surfzone h(x), especially inner surf zone sandbars that shelter the beach face (Fig. 9). With a wide range of historical h(x) IB storm profiles (Fig. 1), IPA scatter will be large. Runup forecasts can be improved with more accurate h(x), perhaps from observations and/or modeling (e.g., equilibrium Ludka et al. 2015) based on historical profiles. The bound wave and E IG = 0 energy boundary conditions used here can be improved using an approximate, empirical formula (including phase coupling) based on PUV observations in 8-15m depth.
An underlying assumption is that, given incident wave at the offshore boundary and the beach profile, 1D SWASH runup simulations are sufficiently accurate to inform overtopping warnings at Imperial Beach. Lidar observations at Imperial Beach support this assumption. Scatter in 1D SWASH simulations, for the same incident sea-swell waves, is caused by uncertainties in bathymetry and incident infragravity waves. For use in a local flood warning system, (https ://clima teada pt.ucsd.edu/imper ialbe ach/forec ast/), the relationship between incident wave energy spectra E(f) and SWASH-modeled shoreline water levels is approximated with IPA (integrated power law approximation). The fit to SWASH runup estimates is better with IPA than with the commonly used bulk parameterizations using deep water wave height H 0 and deep water wavelength L 0 (Hunt 1961;Stockdon et al. 2006). However, an IPA using H 0 L 0 scaling performs well and is essentially an extension of (Hunt 1961;Stockdon et al. 2006) bulk formulae to spectra.
The range of bathymetries and wave conditions for which 1D SWASH model, and the corresponding IPA, are applicable are unknown. Calibration and validation are integral components of improved predictive skill. Many beaches, however, lack the detailed historical spectral wave and bathymetry data required by the IPA. At locations without these data, local calibration to bulk wave statistics is likely the optimal approach. Future validation with observations, as well as uncertainty quantification, will aid in IPA application to other flood-prone sites. permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creat iveco mmons .org/licen ses/by/4.0/.

Appendix 1: Runup threshold depths
Model and lidar runup statistics are sensitive to the depth threshold , above set at 10 cm, as in Fiedler et al. (2015). Video-based measurements, such as those used in S06, do not explicitly define a threshold. Using the phase-averaged XBeach model, Stockdon et al. (2014) found modeled wave setup decreased approximately linearly 2 cm to every 1 cm increase in on an intermediate beach in high waves.
The IB storm SWASH model results were evaluated at =3, 5, 10 and 15 cm. Holding the = 3 cm line as the baseline, we find swash (combined S IG and S SS , squares in Fig. 11) shows less than 5% change (equivalent to ∼ 5 cm) for varying from 3 to 15 cm. Setup on the steeper lidar bathymetry ( = 0.12 ) decreases approximately 1 cm for every 1 cm increase in the runup threshold depth. Using = 10 cm instead of = 3 cm reduces modeled setup by an average of 14%. On the slightly less steep eroded profile ( = 0.08 ), setup differences are more pronounced; using = 15 cm instead of = 3 cm leads to an average 35% reduction in "measured" setup. For both h(x), setup differences between different exhibit a small increase dependent on magnitude of total run-up, such that larger setup equates to larger differences. Using = 3 cm changes the parameters (Eqs. 10-12) to with an rmse of 21.6 cm and R 2 = 0.81 . Using the = 3 cm threshold with the parameterization created for the 10 cm runup line yields an rmse of 28.5 cm and R 2 = 0.68.