The PoliTO–UniRoma1 database of cyclic and dynamic laboratory tests: assessment of empirical predictive models

The soil nonlinear hysteretic behaviour is usually described, in the moderate strain range, through the shear modulus reduction and material damping ratio (MRD) curves. In common practice, in absence of specific laboratory tests, the curves are estimated by employing empirical regression models. Such predictive models, typically calibrated on large experimental datasets, correlate the soil response to its physical properties. This research fits within this context, presenting a comprehensive database of cyclic and dynamic laboratory tests conducted on natural Italian soils. The database, publicly available as supplementary data of the paper, contains the results of the tests conducted by the geotechnical laboratories of the Politecnico di Torino (Turin, Italy) and the Sapienza Università di Roma (Rome, Italy) over the past 30 years. The experimental data are employed to assess the performance of some widely used empirical models in predicting the MRD curves of natural uncemented fine-grained soils, emphasizing the importance of using an independent dataset for conducting a reliable statistical analysis. The results show that the use of many soil parameters as proxies for predicting the soil response does not necessarily lead to an improvement in the performance of the model. Therefore, according to Occam’s razor principle, simple models are to be preferred.


D 0
Small-strain damping ratio D s Diameter of the specimen Global normalized root-mean-square error G S ∕G 0 , D Normalized root-mean-square errors respectively for G S ∕G 0 and D e Void ratio e i , e c Initial and after the consolidation void ratios f Loading frequency f 0 First torsional resonance frequency of the sample in resonant column tests f 1 , f 2 Frequencies associated with an amplitude equal to Excess pore-water-pressure V s Shear wave velocity w l Liquid limit w p Plastic limit w n Natural water content ΔW Energy dissipated by the unit volume of soil within one loading cycle W Elastic energy stored by the unit volume of soil within one loading cycle Y i , Ŷ i Measured and predicted values of the ith dependent variable Y Observed mean of the dependent variable z n , z n+1 Successive peak amplitudes during the free vibrations in resonant column tests

Introduction
The response of soils to cyclic and dynamic loadings is characterized by pronounced nonlinearity, energy dissipation, degradation of mechanical properties with cycles and coupling between shear and volumetric strains. In engineering practice, such complex behaviour is conveniently described in the small-to-medium shear strain range referring to the equivalent linear viscoelastic parameters, namely the secant shear modulus G S and the material damping ratio D . By considering an idealized shear stress-strain − cycle (Fig. 1), G S is defined as the slope of the line connecting the edges of the loop, while D quantifies the amount of energy dissipated. The latter is typically computed in analogy with the equivalent viscous damping ratio of a single-degree-of-freedom system, as originally derived by Jacobsen (1930): where: ΔW is the energy dissipated by the unit volume of the soil within one cycle, and W = c c 2 represents the elastic energy, being c and c the cyclic shear stress and strain amplitudes, respectively. At very small strains, the material response is almost linear and, therefore, G S is practically constant and equal to its initial, maximum, value G 0 (Hardin and Black 1968). Within this shear strain range, a small amount of energy is dissipated by the soil due to viscosity and friction between particles. The material thus exhibits a minimum, almost constant, small-strain damping ratio D 0 (e.g. Shibuya et al. 1995;Lanzo and Vucetic 1999). For c larger than the linear threshold shear strain tl , the nonlinear nature of soils becomes relevant and the dynamic parameters are typically described through the normalized modulus reduction G S ∕G 0 and damping ratio D curves (hereafter, MRD curves), firstly introduced by Seed and Idriss (1970).
At large strains, the soil experiences a gradual degradation of the mechanical properties, resulting in either pore water pressure build-up or permanent changes in the microstructure, depending on the drainage conditions (Silver and Seed 1971;Youd 1972;Stoll and Kald 1977). Such behaviour is typically associated with c larger than the volumetric threshold shear strain tv (Lo Presti 1991;Vucetic 1994). The G 0 is known to be influenced by both physical properties-such as soil type, particle size distribution, and grain angularity-and state parameters usually expressed in terms of effective confining pressure p ′ and current void ratio e (e.g. Hardin and Black 1968;Kokusho et al. 1982;Menq 2003). In addition, the soil fabric and the stress history, usually expressed in terms of overconsolidation ratio, strongly affect the material's small-strain behaviour.
In absence of specific cyclic or dynamic laboratory tests conducted on the soil under consideration, it is possible to adopt empirical models to predict its nonlinear hysteretic response (e.g. Seed and Idriss 1970;Vucetic and Dobry 1991;EPRI 1993;Ishibashi and Zhang 1993;Darendeli 2001;Menq 2003;Zhang et al. 2005;Oztoprak and Bolton 2013;Ciancimino et al. 2020;Wang and Stokoe 2022). Such models, usually calibrated on databases of experimental data, allow for the estimation of the MRD curves based on some input parameters. As a consequence, several studies have been devoted in the past years to the development of large databases of experimental data (e.g., Vardanega and Bolton 2013;Wang and Stokoe 2022), some of them publicly available (e.g., the VEL project database, Giusti et al. 2021, andthe Facciorusso 2021, archive). For fine-grained soils, the main input parameters are plasticity index PI , effective confining pressure p ′ , over-consolidation ratio OCR , and the number of loading cycles N (Kokusho et al. 1982;Vucetic and Dobry 1991;Shibuya et al. 1995;Darendeli 1997;Lanzo et al. 1997;Lo Presti et al. 1997). Additionally, the loading frequency f is recognized to affect the response of fine-grained soils, as a consequence of its strain-rate dependency (e.g., Lo Presti et al. 1997;Tatsuoka et al. 2000;Matešić and Vucetic 2003;Mortezaie and Vucetic 2013). Such an influence is therefore connected to the PI of the material and, for a given soil, to the shear strain level (Lo Presti et al. 1997;Tatsuoka et al. 2000). For coarse-grained materials, particle size distribution and void ratio e also play a role (Seed et al. 1986;Menq 2003;Wang and Stokoe 2022).
Despite a large amount of research carried out in the past years on this subject, predicting the MRD of natural soils is still challenging (e.g. Kishida 2016;Ciancimino et al. 2019). This research intends to contribute to the discussion, providing insights regarding the use of empirical predictive models. To this aim, a wide database of cyclic and dynamic laboratory tests is assembled. The database comprises the results of tests conducted on natural, uncemented, Italian soils by the geotechnical laboratories of the Politecnico di Torino (hereafter PoliTO) and the Sapienza Università di Roma (hereafter UniRoma1) over the past 30 years. The data set, publicly available as supplementary data for this paper, also includes the physical properties of the investigated soils. It represents a significant resource for both scientific research and practical applications.
The testing procedures are firstly presented, along with the data interpretation methods. Then, after a general presentation on the structure and organization of the experimental data, a subset of the database is used to assess the performance of empirical models in predicting the nonlinear hysteretic behaviour of Italian soils. Specifically, the models considered are those proposed by: (1) Vucetic and Dobry (1991); (2) Darendeli (2001); (3) Ciancimino et al. (2020); and (4) Wang and Stokoe (2022). Statistical analysis is performed to highlight the abilities-and, potentially, the drawbacks-of the models based on reliable independent experimental data, which have not been used for calibrating these empirical relationships.

Testing procedures and data interpretation
The compiled dataset includes the results of cyclic and dynamic laboratory tests performed by PoliTO and UniRoma1. Specifically, Cyclic Double Specimen Direct Simple Shear (CDSDSS) tests were performed by UniRoma1, whereas Resonant Column (RC) tests were carried out by PoliTO. Although the tests were conducted over a long period, the main testing procedures, as well as the techniques employed to interpret the experimental results, remained consistent over the years.

Cyclic double specimen direct simple shear test
The CDSDSS tests were carried out using a modified version of the standard direct simple shear device developed by the Norwegian Geotechnical Institute (Bjerrum and Landva 1966). The double specimen configuration was specifically designed at the University of California in Los Angeles to overcome the issues associated with false deformations and system compliance which affects the measured soil response at small strains. The apparatus of the geotechnical laboratory of UniRoma1 was built based on the original prototype designed by Doroudian and Vucetic (1995). A full description of the experimental device can be found in D' Elia et al. (2003). It can investigate the soil cyclic response in a wide strain range, varying between 3 × 10 -4 % and 7%, allowing it to measure both the smallstrain parameters and the MRD curves (Lanzo et al. 2009).
In the framework of an International Round Robin Test, comparisons between CDSDSS and RC/TS tests were carried out on the undisturbed Italian Augusta clay, showing a very satisfactory agreement in terms of MRD curves (Cavallaro et al. 2003).
The tests are performed on saturated samples, consolidated up to the desired vertical effective stress ′ v under pseudo-oedometric conditions owing to the lateral confinement exerted by wire-reinforced membranes. The samples are subjected to several steps of strain-controlled shearing cycles through a horizontal piston. Each step is generally constituted by 10 sinusoidal loading cycles, applied at the same shear strain amplitude c with an almost constant frequency of 0.25 Hz. The cyclic shearing is applied under constantvolume conditions by preventing sample height variations. Such a procedure, firstly suggested by Bjerrum and Landva (1966), is practically equivalent to testing samples under fully undrained conditions (Dyvik et al. 1987).
The applied horizontal displacement and the corresponding horizontal force are used to compute the shear stress and strain , which describe the cyclic response of the material. Figure 2 shows the stress-strain cycles as measured from CDSDSS tests at increasing levels of c . The figure refers to a test conducted on a sandy sample, for which typical S-shaped loops are observed at large strains (Fig. 2d). For each strain amplitude, G S and D are directly inferred from the − loops (as shown in Fig. 1); in particular, the values employed in the analyses are obtained as the average of cycles n. 2-3-4 for each constant c step.

Resonant column test
The RC tests were carried out using the free-fixed device of the geotechnical laboratory of PoliTO (Lo Presti et al. 1993, 1997. The apparatus can perform both resonant column or cyclic torsional shear tests, although in this paper reference is made only to RC tests. It is equipped with an electromagnetic driving system constituted of eight coils and four magnets (SBEL, Arizona). The motor can apply, under loading control, sinusoidal loading torques with a maximum amplitude of 1 Nm. The radial and axial strains are measured using proximity sensors. The soil response under dynamic loading conditions is tracked by an accelerometer mounted on the top of the specimen, whereas the sample rotation in torsional shear tests is measured by proximity transducers with targets integral to the driving system. Tests can be performed on both solid and hollow specimens, either isotropically or anisotropically consolidated. Nevertheless, the vast majority of the tests presented in the database were performed on solid specimens under isotropic conditions. The samples are firstly saturated and consolidated up to the desired p ′ . The soil response is then investigated for c ranging from 10 -5 % to about 0.6% by applying cyclic torques with increasing amplitude. Specifically, a frequency sweep (typically a 40 Hz range with a frequency-step of 0.1 Hz) is applied for each loading amplitude to identify the resonance condition of the first torsional mode of the specimen. For each testing frequency, 20 cycles of forced vibrations are usually followed by 10 cycles of free vibrations (Fig. 3a). The amplitude, characteristic of the sample response under forced vibrations, is computed as the Root Mean Square (RMS) of the output amplitude A measured by the accelerometer. The computation is repeated for each loading frequency and the amplitude versus frequency curve is plotted to identify the first torsional resonance frequency of the sample f 0 , corresponding to the maximum measured amplitude (Fig. 3b). Once obtained f 0 , the shear wave velocity V s is obtained via the theory of torsional waves propagation in a linear viscoelastic medium under steady-state conditions (Richart et al. 1970): where I is the mass polar moment of inertia of the specimen, I t is the driving system polar moment of inertia and H s is the height of the specimen. Based on the density of the soil , the secant shear modulus G S is thus computed as: The reference cyclic shear strain amplitude c is equal to 2/3 of the maximum shear strain max (Hardin and Drnevich 1972). The latter can be computed, in a fixed-free device, as (Woods 1978): where s is the maximum rotation of the sample obtained by double integrating the angular acceleration defined through the accelerometer, while D s and H s are the diameter and the height of the specimen, respectively. (2) Typical results of a RC test for a given loading amplitude: a time-history of the output amplitude A for one loading frequency; b amplitude A versus frequency f response curve; and c free-vibration decay method 1 3 The damping ratio can be evaluated through either the half-power bandwidth or the free-vibration decay method. The former is based on the bandwidth of the amplitude versus frequency A − f response curve (Fig. 3b). For relatively small D values, the equivalent viscous damping can be approximated by: being f 1 and f 2 the frequencies associated with an amplitude equal to √ 2 � 2 times the maximum one (Fig. 3b). The free-vibration decay method is instead applied considering the 10 cycles of freedamped vibrations at the end of the loading cycles (Fig. 3c). By knowing two successive peak amplitudes z n and z n+1 , the logarithmic decrement n+1 is computed as: The average value is used to compute the damping ratio as: The two aforementioned methods were applied in most of the tests. For a few, old, tests D was instead obtained through the resonance factor method, based on the ratio between the output rotation amplitude at resonance to the pseudo-static rotation amplitude (Drnevich et al. 1978). The electromagnetic driving system is known to be responsible for an additional small amount of equipment-generated damping Meng and Rix 2003;Wang et al. 2003), which is reduced if the input current is switched off, as during the free damped vibrations. Despite recent studies highlighting that D measurements are not excessively affected by this bias (e.g. Senetakis et al. 2015), measurements from the free-vibration decay were therefore preferred, when available, to data coming from steady-state vibration. Moreover, data obtained from the half-power bandwidth method were included only for c amplitudes lower than 0.1%, due to the well-known limitations of the method in the large strain range.

Database of natural soils
The PoliTO-UniRoma1 database includes the results of cyclic and dynamic laboratory tests performed by the two Universities in the past 30 years. It comprises a total of 252 tests: 110 RC tests and 142 CDSDSS tests, carried out on natural soil samples. Figure 4 shows the spatial distribution of the investigated sites, with different markers according to the laboratory that has performed the test.
The database represents a reliable set of experimental results, suitable for conducting statistical analyses on the variability of the cyclic response of natural soils. The latter is particularly relevant given the growing attention of the scientific community towards the influence of uncertainties inherent in the MRD curves on the outcomes of ground response analyses (e.g. Bahrampouri et al. 2019; Aimar et al. 2020). Additionally, the experimental data can be used as reference values for practical applications, in absence of a specific characterization of the site.

Structure of the compiled dataset
The data are compiled in the form of a structured-variable, developed within the Matlab (2020) environment. The variable is available as open-access supplementary data for this paper. In addition, the data archive is also reported in a spreadsheet. The structure of the database is presented in Fig. 5. Each test is identified with a unique "Sample ID" composed of a progressive number and an identifier of the laboratory that carried out the test (e.g., "006_POLITO"). The first field of the variable contains the "General information" about the samples in terms of site information (namely: the approximate location from which the sample was retrieved along with the sampling depth) and soil material type, as resulting from the Unified Soil Classification System (USCS, ASTM International 2017). When available, it is also reported the field small-strain shear modulus G 0,field computed via Eq. (3) as a function of the field V S of the soil, the latter being inferred through geophysical investigations.
The database includes information on the "Physical properties" of the tested materials. Specifically, it reports the unit weight, the main fractions coming from the Particle Size Distribution (PSD), the natural water content w n , and the index properties of the samples, namely: the plasticity index PI , the liquid limit w l , and the plastic limit w p . It is worth mentioning that the information is incomplete for some tests. However, the database includes only materials for which at least PI is available, as it is recognized as the main parameter controlling the cyclic behaviour of natural fine-grained soils (Kokusho et al. 1982;Dobry and Vucetic 1987;Vucetic and Dobry 1991).
The results of cyclic and dynamic laboratory tests are stored in the "Testing data" field. The latter includes the initial and post-consolidation void ratios ( e i and e c ) along with either the effective confining pressure p ′ or the vertical stress ′ v , respectively for RC and CDSDSS tests. When available, it is also reported the overconsolidation ratio OCR at which the test was performed. The experimental data are saved in the RC/CDSDSS subfield, according to the laboratory which has performed the test. For RC tests, the excess pore-water-pressure u w and the testing frequencies f (which are instead almost constant and equal to 0.25 Hz for CDSDSS tests) are presented in addition to the MRD curves. Moreover, it is also reported the subfield "D method", which contains information on the experimental approach adopted for estimating the damping ratio.
"Appendix 1" contains general information about the compiled data, including sample ID, site information, soil type (when available) according to USCS (ASTM International 2017), PI , and p ′ or ′ v (according to the type of test conducted).

Main characteristics of the investigated soils
The database comprises mainly the results of tests conducted on fine-grained soils, although some experimental data concerning the response of natural silty sands are also included. According to the Casagrande (Fig. 6a) and activity ( Fig. 6b) charts, the finegrained soils are mainly classifiable as low-to-normal active clays and silts. Only one of the investigated soils (red marker in Fig. 6) is very active silt, with a PI = 122. Figure 7 reports the statistical distributions of the main characteristics of the investigated samples. The specimens were retrieved mainly from depths comprised between 0 and 30 m, whereas just 22% of the materials come from depths larger than 30 m. The 48% of the investigated soils are characterized by 15% < PI < 30% , while the remaining materials are almost equally distributed between lightly ( PI < 15% ) and highly ( PI > 30% ) plastic soils. The laboratory tests were conducted at p ′ varying from 20 kPa to about 1100 kPa. For CDSDSS tests, p ′ is estimated based on ′ v considering a coefficient of earth pressure K = 0.5 . The corresponding G 0,lab values, defined as the maximum G S value measured in a given test, range from 7 to 341 MPa, despite the vast majority of the materials has 25 MPa < G 0,lab < 200 MPa.
The G 0 is strongly dependent on the soil structure and the stress history. As a result, laboratory tests quite frequently lead to underpredicted (or, more rarely, overpredicted) G 0 values compared to field measurements. As recognized by several studies, this laboratory underestimation is mainly due to sample disturbance effects (Anderson and Woods 1975;Stokoe and Santamarina 2000;Pagliaroli et al. 2014;Ciancimino et al. 2020). A comparison between the laboratory, G 0,lab , and the field, G 0,field , small-strain shear moduli is presented in Fig. 8 for 55 samples for which the laboratory tests were conducted at p ′ coherent with the in situ geostatic stress. It is quite evident that as the G 0,field increases, the difference between G 0,field and G 0,lab also increases, shifting the points from the diagonal of the plot. In other words, the stiffer is the soil, the larger will be the sample disturbance effect. The latter is consistent with previous studies (e.g., Stokoe and Santamarina 2000;Pagliaroli et al. 2014), which highlighted the relevance of the sampling procedure on the soil small-strain response. Therefore, it is once again confirmed the best practice of measuring G 0 through field tests and then computing the G S curve by multiplying the normalized G S ∕G 0 curve measured in the laboratory by G 0,field .

Experimental results
The results of RC and CDSDSS tests are shown in Fig. 9 in terms of MRD of the investigated soils as a function of PI . The experimental data are in good agreement with  previous findings (e.g. Vucetic and Dobry 1991;EPRI 1993;Darendeli 2001): the almost linear strain range tends to increase with increasing PI , shifting the nonlinear strain range towards larger c (Fig. 9a); similarly, an increase of PI implies a slower increase of D with c (Fig. 9b). In the small-strain range, soils characterized by large PI values usually show larger D 0 . As a result, the D curves present a cross-over shear strain between about 10 -4 % and 10 -2 % consistent with previous experimental results (EPRI 1993;Stokoe et al. 1995;Lanzo and Vucetic 1999). The latter separates the small-strain field, where highly plastic soils show larger D values, from the nonlinear strain field.
The dependency of the nonlinear soil response from PI is highlighted in Fig. 10, which reports the linear tl and the volumetric tv shear strain thresholds of the samples contained in the database, with over imposed the trends obtained by Vucetic (1994). The tl is here defined as c corresponding to G S ∕G 0 = 0.99 (after Vucetic 1994). The definition of tv is instead more problematic, as no information is available about cyclic degradation and the pore-water pressure is monitored only during RC tests. Consequently, the tv is obtained as the strain level at the onset of the pore pressure build-up (i.e. u w p � = 2% ) for the RC tests, whereas it is equal to the c corresponding to G S ∕G 0 = 0.65 for the cyclic DSDSS tests. Such a criterion is defined based on the study by Vucetic (1994) confirmed also by Ciancimino et al. (2019), which suggested that G S has to be reduced by approximately the 35% before that tv is reached.
The data points are in good accordance with the trends defined by Vucetic (1994), highlighting an increase of both tl and tv with PI (Fig. 10). To put it in another way, highly plastic soils tend to show a larger practically linear strain range, shifting the nonlinear range towards larger c . Conversely, sands and nonplastic silts tend to show a faster decay of the shear modulus and rapid degradation of their structure, leading to pore pressure build-up under undrained conditions. Such results are completely consistent with previous findings (e.g., Silver and Seed 1971;Youd 1972;Vucetic 1994;Tabata and Vucetic 2010;Mortezaie and Vucetic 2016), confirming also the quality of the compiled dataset.
The influence of p ′ is analyzed in Fig. 11, which presents the MRD curves obtained on soils with 15% < PI < 25% . As recognized by previous studies (e.g., Seed and Idriss 1970;Ishibashi and Zhang 1993;Darendeli 2001;Zhang et al. 2005) an increase of p ′ implies a larger almost linear strain range and, in turn, a slower increase of D with c . Nevertheless, it can be observed from Fig. 11 that its importance is limited for fine-grained soils. As also highlighted by Lanzo et al. (1997), the effect of p ′ on the MRD curves tends to vanish for medium to large plasticity soils.

Performance of empirical predictive models
Statistical analysis is conducted to investigate the performance of widely used empirical models in predicting the MRD curves of Italian soils. It is worth noting that the database only contains results of tests conducted on natural soils, mainly consisting of clays and silts with just a few sandy samples. The latter are characterized however by a not negligible fine content. Consequently, the analysis is performed with reference to empirical models suitable for predicting All these models take into account the dependency of the MRD curves on the soil plasticity, but some also consider the influence of other parameters defining the soil material as well as the loading conditions. In addition, also the mathematical structure of the equations adopted to describe the MRD curves varies from model to model. A summary of the main equations is presented in Table 1 along with the input parameters required to predict the nonlinear soil behaviour. A brief description of the structure of each model is provided in "Appendix 2".
The statistical analysis is performed referring to a subset of the database composed of tests for which all the required input parameters are available, namely: PI , FC , OCR , e , w n , f , and p ′ . It is worth mentioning that the Darendeli (2001) model also includes the number of loading cycles N as a parameter influencing the damping ratio curves. Within this study, its minor (especially in the medium strain range) influence was however neglected by adopting N = 10 , as it is not straightforward to define N for a RC test. Moreover, only plastic soils with a fine content FC > 12% are included, considering that the tested empirical models have been specifically developed to study the response of fine-grained soils. The tests originally used to calibrate the model by Ciancimino et al. (2020) are also excluded from the subset to guarantee the reliability of the statistical analysis. The independence of the regression models from the experimental data used for the verification is indeed crucial to properly assess their predictive capabilities.
The subset for the statistical analysis includes eventually 99 tests (49 RC and 50 CDS-DSS tests) conducted on samples with PI ranging from 6 to 53%. The details of the experimental data used for the statistical analysis are given in "Appendix 1".

Modulus reduction curve
The comparison between measured and predicted G S ∕G 0 values is presented in Fig. 12 for the four empirical models. The figure also reports the R 2 values associated with each model, computed as: being Y i and Ŷ i respectively the measured and predicted values of the ith dependent variable ( G S ∕G 0 in this case), and Y the observed average variable.
The R 2 is a statistical measure of the goodness-of-fit of the models which indicates how much variation is explained by the independent variables adopted in the regression. Consequently, it is a proper index of the performance of a model in predicting a given dependent variable.
By looking at the results, it is evident that the comparison is acceptable for practically all the models (Fig. 12). The Wang and Stokoe (2022) equation along with the Vucetic and Dobry (1991) charts are characterized by the highest R 2 = 0.91 , giving therefore the best Fig. 12 Comparison between measured and predicted modulus reduction G S ∕G 0 curves computed according to: a Vucetic and Dobry (1991); b Darendeli (2001); c Ciancimino et al. (2020); and d Wang and Stokoe (2022) prediction of the G S ∕G 0 curves for the investigated soils ( Fig. 12a-d). Interestingly, despite their simplicity, the Vucetic and Dobry (1991) charts are effective in predicting the G S ∕G 0 curves of fine-grained soils (Fig. 12a). The small differences observed for the Darendeli (2001) and Ciancimino et al. (2020) models can partially be explained by referring to two minor biases which take place in the small-strain field, close to the linearity threshold tl , and in the very large strain range. As pointed out by Wang and Stokoe (2022), these two misfit ranges are due to the single curvature modified hyperbolic relationship-Eq. (11) in "Appendix 2"-adopted by the two models to describe the G S ∕G 0 curves. The use of such a relationship results indeed in a slight underprediction of G S ∕G 0 at small strains and a faster decay at very large strains. Conversely, the double curvature equation proposed by Wang and Stokoe (2022)-Eq. (13) in "Appendix 2"-seems to better capture these strain fields (Fig. 12d). Nevertheless, such biases have a practically negligible effect on the overall performance of the empirical relationships, which are therefore both characterized by R 2 equal to 0.89.
The small-strain misfit can however become significant in problems involving the soil response in the proximity of tl . Direct visualization of the issue is given in Fig. 13, which shows the comparison between measured and predicted tl values, with the latter being defined as c corresponding to G S ∕G 0 = 0.99 (Vucetic 1994). The Vucetic and Dobry (1991) model predicts four tl values, corresponding to the four curves describing the PI range investigated in the tests (i.e. from 0% to about 50%). The predicted values are generally in good accordance with the experimental ones (Fig. 13a). The Darendeli (2001) and Ciancimino et al. (2020) models instead systematically underpredict tl , providing values in a quite narrow range comprised between 2•10 -4 % and 10 -3 % (Fig. 13b, c). Conversely, the threshold is well-captured by the double curvature equation adopted by Wang and Stokoe (2022), which guarantees a larger degree of flexibility (Fig. 13d).

Small-strain damping ratio
An accurate prediction of D 0 is necessary to evaluate the soil response at small strains. Its evaluation is however quite problematic, given its intrinsic variability (Foti et al. 2021). Figure 14 shows the performance of the models in predicting D 0 , differentiating the experimental data according to the laboratory which performed the test, and, in turn, the type of test conducted. The measured D 0 values are influenced by the different frequency range applied in RC or CDSDSS tests (Shibuya et al. 1995;d'Onofrio et al. 1999).
For the Vucetic and Dobry (1991) model a constant value, equal to 1%, is used as predicted value, resulting in a systematic underprediction of D 0 (Fig. 14a). Such a value derives from the discretization of the charts, as commonly adopted in software for site response analyses-namely Deepsoil 7 (Hashash et al. 2020) and Strata (Kottke and Rathje 2019). The Authors however originally plotted a dashed zone in the charts due to insufficient experimental data in the small-strain field, mentioning a range of measured values varying from 0.5% to about 5.5%. EPRI (1993) and Lanzo and Vucetic (1999) subsequently clarified the dependency of D 0 from PI , which is responsible for the cross-over shear strain of the D curves (Fig. 9b).
The Darendeli (2001) relationship for D 0 explicitly considers the influence of f , additionally to PI , p ′ and OCR (Table 1). The effect of f does not seem to be well-captured by the model, inducing a significant underestimation of D 0 for the experimental data coming from CDSDSS tests (Fig. 14b), typically conducted at f ≈ 0.25Hz . A similar empirical relationship is also adopted by Ciancimino et al. (2020). The Authors however evaluated the calibration parameters considering, in the original dataset, also D 0 values measured at small frequencies. Therefore, the predictions given by the model are satisfying both for RC and CDSDSS tests (Fig. 14c). Finally, the equation for D 0 proposed by Wang and Stokoe (2022) does not consider f as a parameter (Table 1), while it includes several additional soil parameters (e.g. e , w n , and FC ). Data coming from low-frequency cyclic tests are then frequently overpredicted by the model, as shown in Fig. 14d.

Damping ratio curve
The performances of the models in predicting the measured D curves are shown in Fig. 15, which also reports the R 2 values. The prediction of the D curves is more complex Fig. 13 Comparison between measured and predicted linear threshold shear strains tl computed according to: a Vucetic and Dobry (1991); b Darendeli (2001); c Ciancimino et al. (2020); and d Wang and Stokoe (2022) concerning the G S ∕G 0 curves. Consequently, the observed R 2 are significantly lower than the ones obtained for G S ∕G 0 , ranging from 0.67 to 0.76 (Fig. 12).
By looking more in-depth into the different models, it can be observed that the Vucetic and Dobry (1991) charts show the lowest R 2 value, equal to 0.67 (Fig. 15a). Such a poor prediction is strongly influenced by the assumed constant D 0 value, which appears to be relatively low. The performance of the other models is instead quite similar, with R 2 = 0.74 ÷ 0.76 (Fig. 15b-d). The models are however characterized by different structures. The Darendeli (2001) model links D to G S ∕G 0 -Eq. (12) in "Appendix 2"-as a function of a calibration parameter depending on N . Conversely, Wang and Stokoe (2022) provide a relationship-Eq. (14) in "Appendix 2"-that depends on several soil properties (i.e. p ′ , e , w n , FC , PI , and OCR ) according to the soil type considered. Finally, Ciancimino et al. (2020) adopted the same approach proposed by Darendeli (2001) but neglected the influence of N on the calibration parameter. The latter provides the best estimation for the Fig. 14 Comparison between measured and predicted small-strain damping ratios D 0 computed according to: a Vucetic and Dobry (1991); b Darendeli (2001); c Ciancimino et al. (2020); and d Wang and Stokoe (2022) D curves, with R 2 = 0.76 (Fig. 15c), probably also as a result of the good prediction provided by the relationship suggested for D 0 (Fig. 14c).

Discussion
The R 2 is a good indicator to evaluate the ability of an empirical model in predicting a specific dependent variable. However, to assess the overall performance of the models it is useful to define a unique, normalized, indicator. To this end, the global normalized rootmean-square error for each model is computed as: Fig. 15 Comparison between measured and predicted damping ratio D curves computed according to: a Vucetic and Dobry (1991); b Darendeli (2001); c Ciancimino et al. (2020); and d Wang and Stokoe (2022) where G S ∕G 0 and D are the normalized root-mean-square errors respectively for G S ∕G 0 and D , obtained as: The results are presented in Table 2. The specific errors G S ∕G 0 and D are consistent with the observed R 2 values: the predictions are generally satisfying in terms of G S ∕G 0 curves while the models struggle in predicting the D curves. As a consequence, the performances of the models are strongly influenced by the D predictions when looking at the global error . The Vucetic and Dobry (1991)  It is interesting to notice that the computed errors for the different models are very close one each other. For instance, although the double curvature equation adopted by Wang and Stokoe (2022) has proven to be effective in reducing the prediction biases on the G S ∕G 0 curves (Fig. 12), the corresponding G S ∕G 0 is not substantially lower than the other models (Table 2). At the same time, for the same model, the use of several soil parameters does not lead to a substantial improvement of D , which is instead slightly larger than the value obtained through the relationship proposed by Ciancimino et al. (2020), based just on a few parameters. This suggests that the introduction of further soil parameters as proxies for the MRD curves does not necessarily imply a reduction of the associated uncertainties. Such a result is particularly important given the independence of the tested models on the experimental dataset. Indeed this is the most robust (and perhaps the only) way to assess if the introduction of more complicated relationships would lead to an improvement of the model predictions or not.

Conclusions
This paper has presented a wide, comprehensive, database of cyclic and dynamic laboratory tests on natural Italian soils. The experimental data include the main physical properties of the investigated soils along with the results of 110 RC and 142 CDSDSS tests conducted, respectively, by the geotechnical laboratories of the Politecnico di Torino and the Sapienza Università di Roma. The database is made publicly available as supplementary data for this paper, and it represents a valuable resource for both scientific studies on nonlinear soil behaviour and more practical applications related to site response analyses. The database was then used to assess the performance of empirical models-specifically the Vucetic and Dobry (1991), the Darendeli (2001), the Ciancimino et al. (2020), and the Wang and Stokoe (2022) models-in predicting the MRD curves of fine-grained soils. A subset of the dataset was selected to this end, excluding both the tests conducted on materials without sufficient available information to apply the models and the tests previously used to calibrate the model by Ciancimino et al. (2020). The independence of the empirical models from the experimental data employed to test their performance is indeed a crucial point to conduct a reliable statistical analysis.
The predictions in terms of G S ∕G 0 curves are generally satisfying for all the models analyzed. Among the models, the one proposed by Wang and Stokoe (2022), along with the Vucetic and Dobry (1991) charts, have been shown to provide the lowest prediction error. In particular, the Wang and Stokoe (2022) double curvature modified hyperbolic relationship seems to be effective in predicting the soil linearity threshold tl , which is quite significant for problems involving small to moderate shear strains. The models are instead less effective in predicting D . The intrinsic uncertainties related to this parameter lead unavoidably to larger prediction errors. The best predictions for D 0 are provided by the equation proposed by Ciancimino et al. (2020), which can predict the experimental data coming from both RC and CDSDSS tests. The best estimate of the D curves is, also in this case, provided by the Ciancimino et al. (2020) model.
The statistical analysis shows that the performance is not significantly different from model to model. Conversely, the number of soil parameters required to predict the MRD curves can vary significantly. For instance, three input parameters-PI , p ′ , and f -are needed to apply the Ciancimino et al. (2020) relationships, with just PI as physical soil property. Conversely, the applicability of the last-stage Wang and Stokoe (2022) model requires the estimation of several parameters-p ′ , e , w n , FC , PI , and OCR-some of them not frequently available. As a consequence, practitioners may be induced to estimate such parameters through empirical correlations, introducing further uncertainties in the evaluation of the MRD curves. When dealing with models to be used in common practice, not only the performance but also the user-friendliness of the developed framework should be considered. It is therefore advisable to find a good balance between the applicability and accuracy of the models, as complexities in applying the model equations may induce a reduction in the reliability of the predictions. being mr the reference shear strain corresponding to G s ∕G 0 = 0.5 b . The second curvature parameter b is added to improve the fitting of the experimental data. In particular, Eq. (13) is claimed to better capture the linearity threshold tl and the mismatch in the highly nonlinear shear strain range. A three-parameter modified hyperbolic model is instead adopted for the D curve:

Appendix 1: Summary of the compiled database
where c and d are model parameters, and D is the reference shear strain for which D = d + D 0 2 . As opposed to the Darendeli (2001) and Ciancimino et al. (2020) models, the D curve is therefore independent of the G s ∕G 0 curve. The model parameters, as well as the D 0 equation, can be computed according to a combination of soil properties depending on the soil type as proposed by USCS (ASTM International 2017). Table 1 reports the model equations obtained by Wang and Stokoe (2022) for the last-stage models developed for the soil type considered in this research, i.e. clayey materials. According to the model, the MRD curves of clayey soils depend on six parameters: p ′ ; e ; w n ; FC ; PI ; and OCR.