Background

Based on published data for 17 ecosystem services in 16 biomes, Costanza et al. [1] estimated the value of ecosystem services at the level of the whole biosphere. They found a lower bound in the range of US$16–54 trillion (1012) per year, with an average of US$33 trillion per year. Marine systems produced near 63% of the annual value. Almost half derived from coastal ecosystems. Approximately 25% of this share related to algal beds and seagrasses. This contribution to human welfare is deem relevant. Thus maintaining the health of marine ecosystems is subject of scientific concerns. Currently, increasing anthropogenic pressures pose important threats. And global climate change could also threaten future viability of seagrass meadows [2, 3]. For instance, water quality and other local stressors promote unprecedented meadow loss [4,5,6]. This reduces mitigation of wave action [7] and filtration [8]. Diminishes food and shelter for a myriad of organisms [9,10,11]. Weakens nutrient cycling [12, 13], erosion abatement and shoreline stabilization [14,15,16]. Moderates support for detrital food web foundation [17]. And inhibits carbon sequestration [18, 19].

Eelgrass (Zostera marina L.) is a dominant along the coasts of both the North Pacific and North Atlantic [20]. This species supports communities known as among the richest and most diverse in sea life [21]. Contribution of organic materials for food webs in shallow environments [22] is noticeable. Indeed, eelgrass produced up to 64% of the whole primary production of an estuarine system [23]. Current deleterious effects of anthropogenic influences on eelgrass prompted special restoration strategies. Among remediation efforts replanting plays an important role [24,25,26,27]. Transplant success amounts to reinstatement of ecological functions of natural populations. Evaluation relies on monitoring standing stock and productivity of transplanted plants. Then comparing with assessments of a reference population, which usually settle nearby [28].

Combined biomass of leaves in shoots is an important component of standing stock. Assessments rely on the estimation of the biomass of individual leaves. This requires shoot removal followed by dry weight measurement procedures in the laboratory. Elimination of shoots could infringe damage to natural eelgrass populations [29]. And reduced shoot density makes these effects even more perceptible for transplanted plots. Allometric methods make it possible simplified-indirect estimations of eelgrass productivity and standing stock. Echavarría-Heras et al. [30] considered an allometric representation for eelgrass leaf biomass and related length. Agreeing with Solana et al. [31], the involved parameters are invariant within a given geographical region. Estimates and leaf length measurements grant nondestructive approximations of observed leaf biomass values. This way, leaf length measurements grant nondestructive approximations for observed leaf biomass values. Leaf growth rates estimation relies on successive measurements of leaf biomass. Then the allometric model in [30] entails nondestructive assessments of eelgrass productivity. But, invariance does not impede local factors to imply variability of parameter estimates. Besides, local influences other factors could explain numerical differences in parameter estimates. There are methodological influences that may render biased parameter estimates. Analysis method, sample size, and data quality can influence scaling results (e.g. [32, 33]). And, since scaling relationships are particularly sensitive to parametric uncertainties, Echavarría-Heras et al. [30] concluded that the actual precision of derived allometric surrogates requires clarification.

Here we deal with allometric surrogates for average leaf biomass in eelgrass shoots. These derive from the model w(t) = βa(t)α for leaf biomass w(t) and area a(t) measured at time t, and α and β parameters. Leaf area is more informative of eelgrass leaf biomass than corresponding length. Thus, the present scaling endures a boost in precision of parameter estimates by the model in [30]. This could increase the accuracy of derived surrogates for leaf biomass in shoots. Besides, eelgrass leaf area and length admit an isometric representation [34, 35]. Then, the time invariance found by Solana-Arellano et al. [31] also holds for parameter estimates of the present scaling. This by the way imbeds a nondestructive advantage to the present shoot-biomass substitutes. But, agreeing with Echavarría-Heras et al. [30], we must examine influences on precision of estimates for suitability of projections. Since, such an analysis was not produced before, we took here the try of filling that gap. Achieving the related goals, required the assemblage of an extensive data set. It comprises coupled measurements of eelgrass leaf biomasses and related areas. This is further called “raw data set”. A data cleaning procedure adapted from Echavarría-Heras et al. [30] removed inconsistent leaf biomass replicates from the raw data. Thereby forming what we call a “processed data set”. Differences in reproducibility strength allowed to assess data quality effects in precision. A similar procedure evaluated sample size effects. And a sensitivity analysis evaluated robustness of the projection method. This supports consistent, cost-effective allometric projections of observed values from raw data. But, this depends on nonlinear regression as an analysis method. Besides, sample size must be optimal. Data quality as expected improved reproducibility strength of the allometric projection method. But, this factor was more relevant in optimizing sample size. A detailed explanation of used procedures appears in the methods section. The results section is not only devoted to the presentation of our findings. It also examines the relative strengths of factors influencing the precision of proxies. A Discussion section emphasizes on the gains and the limitations of the method. Appendix 1 deals with the model selection problem. Appendix 2 is about data processing methodology. Appendix 3 presents the procedure for sensitivity assessment.

Methods

The symbol w m (t) will stand for average leaf dry weight of shoots collected at sampling date t. An the average of these values over all sampling dates symbolized through 〈w m (t)〉. Formal representations of these variables appear in Appendix 3 (cf. Eq. (15) and Eq. (16)). The symbols w(t) and a(t) will one to one stand for biomass and area of an individual leaf collected at a sampling time t. We assume that these variables are linked through the scaling relationship

$$ w(t)=\beta a{(t)}^{\alpha }. $$
(1)

The present raw data come from a coastal lagoon located in San Quintin Bay, México [30]. This comprises 10,412 leaves and measured lengths [mm], widths [mm] and dry weights (g). The product of length times width provided estimations of leaf area [mm2] [36]. In what follows the symbol n ra stands for number of leaves in raw data. Processed data results by applying direct and statistical data cleaning techniques. The direct hinges on the consistency of allometric models for eelgrass leaf biomass. Leaf length or area are allometric descriptors of eelgrass leaf biomass [30, 31, 34]. A model selection exploration corroborated a power function like trend assumption for the present data. Details appear in Appendix 1. Severe deviations, from the mean response curve, are inconsistent and must be removed. This took care of sets containing less than ten leaf dry weight replicates. The statistical procedure worked on sets with a larger number of replicates. It centers on properties of the median of a group of data. This is immune to sample size and also a robust estimator of scale. The adapted Median Absolute Deviation (MAD) data cleaning procedure [37] appears in Appendix 2. Processing data resulted in a number of n qa  = 6094 pairs of leaf dry weights and areas.

Parameter estimates \( \widehat{\alpha} \) and \( \widehat{\beta} \) and leaf area values yield allometric proxies \( {w}_m\left(\widehat{\alpha},\widehat{\beta},t\right) \) for w m (t). (cf. Eq. (19)). The symbol \( \left\langle {w}_m\left(\widehat{\alpha},\widehat{\beta},t\right)\right\rangle \) (cf. Eq. (20)) stands for the pertinent average over sampling dates. We use Lin’s Concordance Correlation Coefficient (CCC) [38] as an evaluation of reproducibility. This meant as the extent to which two connected variables fall on a line through the origin and with a slope of one. We represent this statistic by means of the symbol \( \widehat{\rho} \). Agreement defined as poor whenever \( \widehat{\rho}<0.90 \), moderate for \( 0.90\le \hat{\rho}<0.95 \), good for \( 0.95\le \hat{\rho}\le 0.99 \) or excellent for \( \widehat{\rho} \)>0.99 [39]. Values of \( \widehat{\rho} \) gave an evaluation of the strength of the \( {w}_m\left(\widehat{\alpha},\widehat{\beta},t\right) \) devise to reproduce observed values.

In getting parameter estimates \( \widehat{\alpha} \) and \( \widehat{\beta} \) we relied on two procedures. The traditional analysis method of allometry and nonlinear regression. Assessing analysis method effects on reproducibility strength of \( {w}_m\left(\widehat{\alpha},\widehat{\beta},t\right) \) depended on testing differences in \( \widehat{\rho} \). The traditional approach involves a linear regression equation (cf. Eq. (4)). This obtained through logarithmic transformation of response and descriptor in Eq. (1). The nonlinear regression analysis method relied on maximum likelihood [40, 41]. This approach fitted the model of Eq. (1) in a direct way in the original arithmetical scale. The nonlinear fit allowed the consideration of homoscedasticity or heteroscedasticity (cf. Eqs. (5) and (6)). All the required fittings for both raw and processed data depended on the use of the R software.

We also fitted the model of Eq. (1) to samples of different sizes taken out from primary and processed data sets. Each sample of size k; with 100 ≤ k ≤ n ra produced estimates \( \widehat{\alpha}(k) \) for α and \( \widehat{\beta}(k) \) for β, and resulting \( {w}_m\left(\widehat{\alpha}(k),\widehat{\beta}(k),t\right) \) projections. The symbol \( \widehat{\rho}(k) \) denotes the value of \( \widehat{\rho} \) for a sample of size k. Differences in \( \widehat{\rho}(k) \) allow exploring sample size influences in reproducibility.

Deviations ∆α q and ∆β r convey fluctuating values α q  = α + ∆α q and β r  = β + ∆β r for the parameters α and β one to one. The modulus of the vector of parametric changes (∆α q ∆β r ) defines a tolerance range θ(q, r). And the value of θ(q, r) determined by the standard errors of parameter estimates denoted by mean of θ ste . A fixed value of θ(q, r) leads to four possible characterizations of the pair (∆α q ∆β r ). Each one associates to a trajectory w m (α q , β r , t) shifting from a reference one w m (α, β, t). The symbol δw (α q , β r , t) (cf. Eq. (42)) denotes deviations between reference and average of shifting trajectories at sampling dates. And the average of δw (α q , β r , t) values taken over all sampling dates denoted through 〈δw (α q , β r , t)〉 (cf. Eq. (43)). The absolute value of the ratio of 〈δw (α q , β r , t)〉 to 〈w m (α, β, t)〉 defines a relative deviation index ϑ(θ). It measures sensitivity of 〈w m (α, β, t)〉 to fluctuations of tolerance θ(q, r) on α and β. Appendix 3 presents detailed formulae.

Results

Figure 1 shows the variation of leaf dry weight and area observed in the raw data. Smallest and largest leaf areas were 2 mm2 and 7868 mm2 respectively. Associated dry weights were 1 × 10−5 g and 0.1588 g one to one. The time average of mean leaf dry weight in shoots was 〈w m (t) 〉 = 0.01461g (cf. Eq. (16)). Each leaf area measurement associate to several replicates of leaf biomass. Number of replicates increased from a single association up to a largest value of 84. Dispersion masks a power function like trend. Contents of Appendix 1 corroborate this at formal level. And exploration of dispersion reveals severe deviations from the inherent power function-like trend. Inconsistencies are more visible for leaves with areas under 350 mm2 and also for those over 5000 mm2. This hints about the relevance of data quality.

Fig. 1
figure 1

Distribution of eelgrass leaf dry weight and linked area values in raw data. Dispersion shows a masked power function-like trend. Deviations from this trend are more manifest for areas under 350 mm2 and also for those bigger than 5000 mm2. This suggests data quality effects on accuracy of allometric projections

Figure 2 exhibits the spreading of leaf dry weight after data quality control. About 40% of the replicates in the raw data were eliminated. A power function like trend appears more depicted than that showing on Fig. 1. But dispersion still shows significant deviations around the expected power function like trend. This suggests lack of standardized routines for data gathering. In this work the length times width proxy [36] approximated leaf area. Errors in estimation of area of older damaged leaves could explain uneven replicates. Faulty equipment, or incorrect recording could explicate inconsistencies for small leaves.

Fig. 2
figure 2

Plot of processed data. Distribution of eelgrass leaf dry weight and area values remaining after data quality control procedures. Although about 40% of the replicates in the raw data were found inconsistent and eliminated, this plot still shows significant residual variability around an expected power function like trend

Table 1 gives estimates\( \widehat{\alpha} \) and \( \widehat{\beta} \)for the parameters α and β and corresponding standard errors. Assuming heteroscedasticity in the model of Eqs. (5) and (6) did not affect estimates. Thus, presentation of results of nonlinear regression refers to the homoscedastic case of the model of Eqs. (5) and (6). Figure 3 displays mean response curves fitted using raw data. Figure 4 shows those associated to quality controlled data. Results for the log-linear transformation method included correction for bias of retransformation [42]. The smearing estimate of bias of Duan [43] provided the form of the correction factor.

Table 1 Parameter estimates \( \widehat{\alpha} \) and \( \widehat{\beta} \) associated standard errors (\( ste\left(\widehat{\alpha}\right), ste\Big(\widehat{\beta} \))) found by fitting the model of Eq. (1). Nonlinear regression estimates associate to the homoscedastic case of the model of Eqs. (5) and (6) (see Appendix 1). Values of \( \widehat{\rho} \) give an evaluation of reproducibility strength of the proxy of Eq. (1)
Fig. 3
figure 3

Fit of the model of Eq. (1) to raw data. Panel a Fitting results of the model of Eq. (1) by the log-linear transformation method. Distribution of replicates around the mean response curve shows a significant bias. This entails poor reproducibility (\( \widehat{\uprho} \)=0.8910) of leaf dry weight values. Panel b Shows fitting results for nonlinear regression and raw data. For this arrangement parameter estimates and Eq. (1) produced \( \widehat{\uprho} \)=0.9307. Thus, nonlinear regression stands a gain in reproducibility strength

Fig. 4
figure 4

Fit of the model of Eq. (1) to processed data. Panel a Fitting results for model of Eq. (1) via traditional log-linear transformations. Though data processing improved goodness of fit, still a notorious bias remains. Panel b Fitting results by taking nonlinear regression as an analysis method. Shown spreading of replicates around the mean response curve is fair. Hence, \( \widehat{\uprho} \)=0.9777 entails suitable reproducibility of observed values via Eq. (1)

For raw data the log-linear transformation method produced \( \widehat{\rho}=0.8910 \), entailing poor reproducibility. This explains a biased distribution of replicates around the mean response curve (Fig. 3a). Meanwhile, estimates acquired by nonlinear regression from raw data conveyed adequate reproducibility (\( \widehat{\rho}=0.9307\Big) \). This explains a displayed fair distribution of projected leaf biomass values (Fig. 3b).

Estimates via log-linear transformation for processed data seemed enhance reproducibility (\( \widehat{\rho}=0.9777 \) ̂=0.9455). But, Fig. 4a, reveals a bulk of inconsistent replicates for leaves areas under 5000 mm2. Notice that this subset of replicates distributes almost evenly around the mean response curve. Yet replicate spread for areas beyond 5000 mm2 shows significant bias (Fig. 4a). Meanwhile, nonlinear regression and processed data associate to \( \widehat{\rho}=0.9777 \). This corresponded to good reproducibility strength. Indeed, spread of replicates around the mean response is fair (Fig. 4b).

Estimates acquired from raw data via the traditional analysis method of allometry returned a value of \( \widehat{\rho}=0.9285 \) for \( {w}_m\left(\hat{\alpha},\hat{\beta},t\right) \) projections (Table 2). This seems to correspond to moderate reproducibility. Yet, corresponding rms = 0.01265 was largest among analysis method- data set combinations (Table 2). Figure 3a shows a relative wider bias in spread around the mean response curve for larger leaves. This explains resulting inconsistencies in reproducibility of \( {w}_m\left(\widehat{\alpha},\widehat{\beta},t\right) \) projections shown in Fig. 5a. Display reveals biased \( {w}_m\left(\hat{\alpha},\hat{\beta},t\right) \) projections for near 50% of sampling dates. This, led to a smallest value of 0.8436 for the ratio of projected to observed averages.

Table 2 Reproducibility results for w m (α, β, t). Entries include, Lin’s concordance correlation coefficients \( \left(\hat{\rho}\right) \), root mean square deviations (rms) and ratios of 〈w m (α, β, t)〉 to 〈w m (t)〉 averages
Fig. 5
figure 5

Effects of analysis method on reproducibility of w m (α,β,t) projections (raw data). Continuous lines display w m (t) averages of leaf dry weight in shoots. Dashed lines in panel a show w m (α,β,t) projections produced by log-linear transformation. Dashed lines in panel b display those projected via nonlinear regression. Nonlinear regression estimates support greater reproducibility of observed w m (t) values through w m (α,β,t) proxies (Table 2)

Instead, nonlinear regression and raw data produced a value of \( \widehat{\rho}=0.9915. \) And root mean squared deviation attained a value of rms = 0.00460 (Table 2). This suggest a remarkable reproducibility strength for \( {w}_m\left(\hat{\alpha},\hat{\beta},t\right) \) projections (Table 2). Correspondence between projected and observed values, shown in Fig. 5b. corroborates high agreement. Moreover, the ratio of projected to observed leaf dry weight averages attained an outstanding value of 0.9773 (Table 2).

Processed data and log-linear transformation analysis produced \( \hat{\rho}=0.9489 \) for w m (α, β, t) projections. This figure is bigger than corresponding to raw data for this method. Nevertheless, lines in Fig. 6a show that this result does not correspond to a real gain in reproducibility. Besides data quality could not significantly reduce calculated root mean squared deviation (Table 2). Also, a value of 0.8588 for a ratio of projected to observed leaf dry weight averages is still low for suitable agreement (Table 2). Thus, regardless data quality, log-linear analysis failed to produce consistent w m (α, β, t) projections of w m (t) averages.

Fig. 6
figure 6

Effects of analysis method on reproducibility of w m (α,β,t) projections (processed data). Continuous lines depict observed w m (t) averages. Dashed lines in panel a) are projections yield by log-linear analysis. Dashed lines in panel b) stand for projections linked to nonlinear regression

In turn, w m (α, β, t) projections made by nonlinear regression and processed data yield the highest value of \( \hat{\rho}=0.9976 \). (Table 2). And also the smallest root mean squared deviation among analysis method–data set combinations (Table 2). As shown by Fig. 6b this corresponds to a fairly good reproducibility strength. Additionally, data quality and nonlinear regression led to an outstanding value of 0.9975 for the ratio of projected 〈w m (α, β, t)〉 to observed 〈w m (t)〉 averages.

Results exhibit that log-linear transformations failed to produce consistent projections for observed w m (t) averages. In contraposition, nonlinear regression entailed parameter estimates of noteworthy reliability. Thus studying sample size effects on reproducibility addressed only this analysis method. Figure 7 exhibits variation of CCC values depending on sample size k. This is expressed as a percentage of the extent of data set ( n ra  = 10412 for raw) or (n qa  = 6094 for processed). For raw data, a sample sized k = 0.20n ra endures reasonable reproducibility \( \left(\widehat{\rho}(k)=0.9889\right) \). But samples beyond this threshold would not induce a significant change in reproducibility. Meanwhile, for the quality controlled data, the sample size threshold for excellent reproducibility was k = 0.10n rq . This leads to a high reproducibility strength of \( \widehat{\rho}(k)=0.9929 \). Thus, a sample 10% the size of processed data set produced excellent reproducibility. Again sample sizes beyond this threshold failed to increase \( \widehat{\rho}(k) \) values.

Fig. 7
figure 7

The effects of sample size on reproducibility of w(α,β,t). For raw data a sample of size k = 0.20n ra (or near 2000 leaves) yields reasonable reproducibility (\( \widehat{\uprho} \)(k) = 0.9889). But, for quality controlled data similar reproducibility associates to only k = 0.10n rq (about 1000 leaves). Larger sample would not induce a significant changes in the values of \( \widehat{\uprho} \)(k)

In Appendix 3 we consider variations α q  = α + ∆α q and β r  = β + ∆β r . We found that shifting trajectories, w m (α q , β r , t) overestimated reference projections w m (α, β, t) whenever ∆α q  > 0 and ∆β r  > 0. In correspondence underestimation of w m (α, β, t) occurs for −α < ∆α q  < 0 and −β < ∆β r  < 0. Fig. 8 explains that for ∆α q  ∙ ∆β r  > 0, shifting trajectories overestimate (see red lines in panel a)) or underestimate the reference one (see blue lines in panel a)). We can also make certain that relatively smaller deviations between w m (α q , β r , t) and w m (α, β, t), values occur for the case, ∆α q  ∙ ∆β r  < 0, (see red and blue lines in panel b)).

Fig. 8
figure 8

Examples of changing trajectories w m (α q , β r ,t). Black lines a reference trajectory w m (α,β,t). This produced by raw data and nonlinear regression as an analysis method. For ∆α q . ∆β r  > 0, shifting trajectories w m (α q , β r ,t) overestimate or underestimate w m (α,β,t) projections (see red or blue lines in panel a)). The case ∆α q ∙∆β r  < 0, leads to relative smaller deviations between w m (α q , β r ,t) and w m (α,β,t) (see red and blue lines in panel b))

The simulation code of Eqs. (39) through (44) explored the sensitivity of the w m (α, β, t) projection method, to numerical variation of parameters α and β. Available parameter estimates yield reference values for α and β (Table 2). Again, since nonlinear regression associates to highest reproducibility strength, for easier presentation, we only explain results using this analysis method.

The variation of the absolute deviation index for raw data is shown in Fig. 9. A range 0 ≤ θ(q, r) ≤ θ ste , places the relative ϑ(θ) deviation index within the domain 0≤ ϑ(θ) ≤ 0.0205 (Table 3). Therefore, for a bound of θ(q, r) set by the standard errors of estimates largest absolute deviation between w m (α q , β r , t) and w m (t) amounts to about 2% of 〈w m (t) 〉. Moreover, a range 0 ≤ θ(q, r) ≤ 2θ ste produces 0 ≤ ϑ(θ) ≤ 0.031. This leads to an equivalent 3% of 〈w m (t) 〉. Figure 10 displays the dynamics of ϑ(θ) depending on θ(q, r) for processed data. We have that θ(q, r) varying in a range of 0 ≤ θ(q, r) ≤ θ ste implies 0 ≤ ϑ(θ) ≤ 0.003 (Table 3). This time largest absolute deviation was only 0.03% of 〈w m (t)〉. Comparing with results for raw data, we ascertained remarkable gain in precision of w m (α, β, t) projections. This exploration highlights on importance of data quality control as a procedure leading the consistent w m (α, β, t) projections. But, results show that the projection method is robust even when parameter estimates are obtained from raw data.

Fig. 9
figure 9

The variation of the relative deviation index ϑ(θ) (raw data). The standard errors of estimates acquired by nonlinear regression from raw data, produced ϑ(θ ste ) = 0.0205. And for a range 0 ≤ θ(q,r) ≤ θ ste the largest value of the absolute deviation between w m (α q , β r ,t) and w m (t) is around 2% of 〈w m (t) 〉 (see Appendix 3 for details)

Table 3 Sensitivity of the w m (α, β, t) projections to changes in estimates of the parameters α and β. Included are calculated θ ste values. This gives θ(q, r) as determined by the standard errors of estimates. We also present corresponding values of the relative deviation index ϑ(θ ste )
Fig. 10
figure 10

The variation of the relative deviation index ϑ(θ) (processed data). The standard errors of estimates acquired by nonlinear regression from processed data, produced θ ste =3.954 × 10− 3). This set a range 0 ≤ ϑ(θ) ≤ 0.003. Thus, the largest absolute deviation between w m (α q , β r ,t) and w m (t) amounts to about 0.3% of the 〈w m (t) 〉 average. Data quality control could be a factor improving accuracy of w m (α,β,t) projections

Discussion

Results of Solana-Arellano et al. [31] explain invariance of the allometric parameters α and β in Eq. (1). This suggest w m (α, β, t) proxies as possible nondestructive estimations of the average leaf dry weight in eelgrass shoots. These assessments are essential for monitoring the efficiency of transplanted eelgrass plots, fundamental in remediation aims. The present examination shows that the w m (α, β, t) proxies could in fact offer reliable and cost-effective assessments. This on condition that practitioners take in to account our guidelines. For instance, our results typify the extent on which accuracy of estimates of the parameters α and β influences the predictive power of the w m (α, β, t) projections. And, our findings clarify that there are methodological factors affecting the accuracy of estimates. Related influences could spread significant bias in approximations supported by the w m (α, β, t) device. Indeed, analysis method turned into a main factor inducing bias in parameter estimates of the model of Eq. (1). Moreover, only parameter estimates acquired by nonlinear regression yield consistency of the model of Eq. (1) (Table 1 and lines in Fig. 3b and Fig. 4b). And, only these estimates upheld conclusive predictive power of the w m (α, β, t) proxies (Table 2, as well as, lines in Fig. 5b and Fig. 6b). Our results also show that data quality could not improve the performance of w m (α, β, t) projections acquired via log-linear transformations. Without doubt, parameter estimates acquired from processed data by this method still led to significant bias in w m (α, β, t) projections (Fig. 6a). Meanwhile, data processing improved reproducibility of projections built for raw data using nonlinear regression (Table 2 and lines in Fig. 5b and Fig. 6b). Besides, relevance of data quality was also evident for optimizing sample size. Indeed, while for raw data, a sample of approximately 2000 leaves shows reasonable reproducibility, for the quality controlled data this threshold drops to near 1000. However, samples sized beyond these thresholds would not induce a noteworthy gain in reproducibility. This result on its own, ties to efficiency of the w m (α, β, t) projection method. Undoubtedly routines for leaf dry weight assessment are tedious and time consuming. So, reducing size of data set for parameter estimation increases cost-effectiveness of the w m (α, β, t) projection method.

Nonlinear regression estimation also showed advantages in sensitivity over the log-linear analysis counterpart. Estimates from raw data led to a largest absolute deviation between w m (α, β, t) and w m (t) values amounting only 3% of the average of w m (t) over sampling dates. And, for processed data, the fluctuation range for equivalent sensitivity widened to 2.8 times the range for standard errors of estimates. But, on spite of data quality relevance, sensitivity results for raw data reveal that the w m (α, β, t) projection method is robust relative to expected fluctuations in parameter estimates.

Our results show that both the accuracy and cost-effectiveness of projections can be enhanced by the addition of data quality control procedures. However, including data processing can become a weakness for the w m (α, β, t) projection method. Indeed, data cleaning procedures convey niceties that relate in a fundamental way to detection and rejection of inconsistent replicates. Also, compromising about which particular rejection edge should work, is hard to determine. Thus, the use of any data processing will endure a doubt, that the examiner selects an arrangement producing the most probable results [37]. In that order of ideas, when attempting to enhance the reproducibility power of w m (α, β, t) projections it is desirable to avoid depending in any form of data processing. For that aim, prior to data assembly, we must bear in mind standardized routines yielding accurate measurements for w(t) and a(t). This will favor direct identification of the model of Eq. (1) in a consistent way. It is of a fundamental importance to be aware, that errors in leaf dry weight or area assessment differentiate in terms of leaf size. Certainly, leaves produced anew normally present a complete and undamaged span. But, they normally yield very reduced dry weight values. Therefore, we can expect estimation errors imputable to the precision of the analytical scale for individual leaf dry weight assessments. To this, we may add errors in the reading and/or recording of the scale output. These issues could explain a perceptible accumulation of inconsistent replicates for leaves with areas between 2 mm2 and 350 mm2 (Fig. 1). And, even after data cleaning procedures, leaf dry weight replicate spread for leaves bigger than 2000 mm2 shows significant residual variability (Fig. 2). Likewise, as far as, bigger and older leaves is concerned, there are issues on dry weight estimation errors. These seem to relate to damage caused by exposure to environmental factors. The fact that we estimated leaf area by means of the product of related length and width could explain these effects. For complete undamaged eelgrass leaves, the use of a leaf times width proxy grants simplified and accurate estimations of leaf area [36].

But, this approach could deliver inaccurate estimations for long and damaged leaves. Actually, bigger leaves remain exposed during significant periods of time to environmental influences such as drag forces or herbivory. This could remove large amounts of leaf tissue while leaving length unaffected. Thus, causing overestimation of true leaf area when using a width and length product estimation. At the same time, lost portions of a leaf will produce a smaller dry weight than expected for an overestimated area. These effects will bring dry weight replicates that deviate from the power function–like trend associated to the model of Eq. (1). Estimation bias for the dry weights of smaller and longer leaves could explain the anomalous proliferation of inconsistencies (around 41 % ) found while applying the proposed data cleaning procedure to the present raw data. These effects will propagate uncertainty of parameter estimates of the model of Eq. (1), influencing accuracy of the w m (α, β, t) projections. Hence, for the sake of consistent reproducibility of observed values via w m (α, β, t) projections, we need to be aware of these effects. And as elaborated, a good starting point for granting consistency, is appropriate gathering of primary data for the identification of the model of Eq. (1). This will make subsequent data cleaning procedures unnecessary.

Conclusion

This research show that precise estimates of allometric parameters in Eq. (1) grant accurate non-destructive projections of the average leaf dry weight in eelgrass shoots. These projections offer efficient appraisal of eelgrass restoration projects, thereby contributing to the conservation of this important seagrass species. Our findings support views in Hui and Jackson [32], Packard and Birchard [33] and Packard et al. [44], on the relevance of analysis method in scaling studies. Indeed, we found that for assuring suitability of the w m (α, β, t) proxies, the use of nonlinear regression is crucial. On spite of claims that the use of the traditional log-linear analysis method is a must in allometric examination [45], exploration of spread of present crude data reveals curvature [46]. This explaining failure of the traditional analysis method to produce unbiased results for the present data. Besides proxies supported by nonlinear regression and raw data, are robust.

Data cleaning could only marginally enhance the accuracy of proxies produced by nonlinear regression and raw data. But results underline a relevant influence of data quality in setting optimal sample size for a suitable precision of parameter estimates. Nevertheless, the use of data cleaning procedures leads to intricacies. They in a fundamental way relate to choosing thresholds for rejection of detected inconsistencies, which are often regarded as subjective. Thus, instead of using later data cleaning, data gathering should seek for suitability. Special care must be taken when processing bigger and older leaves. These are often damaged or even trimmed so that their dry weights do not conform to a true weight to area relationship. Irregularities in raw data may also associate to biased estimation of leaf length or width. Moreover, in a lesser way faulty equipment for leaf dry weight assessment, rounding off, or even incorrect data recording could as well contribute.

Taking advantage of a time invariance of the parameters in Eq. (1) the w m (α, β, t) device could offer to the eelgrass conservation practitioner highly consistent and truly nondestructive assessments of the average value of leaf dry weight in shoots. But the explained guidelines on analysis method, sample size and data appropriateness are mandatory for cost-effectiveness. Moreover, the present results suggest that the use of the w m (α, β, t) method could be extended to other seagrasses species with similar leaf architecture to eelgrass.