A cost-precision model for marine environmental monitoring, based on time-integrated averages
Ongoing marine monitoring programs are seldom designed to detect changes in the environment between different years, mainly due to the high number of samples required for a sufficient statistical precision. We here show that pooling over time (time integration) of seasonal measurements provides an efficient method of reducing variability, thereby improving the precision and power in detecting inter-annual differences. Such data from weekly environmental sensor profiles at 21 stations in the northern Bothnian Sea was used in a cost-precision spatio-temporal allocation model. Time-integrated averages for six different variables over 6 months from a rather heterogeneous area showed low variability between stations (coefficient of variation, CV, range of 0.6–12.4%) compared to variability between stations in a single day (CV range 2.4–88.6%), or variability over time for a single station (CV range 0.4–110.7%). Reduced sampling frequency from weekly to approximately monthly sampling did not change the results markedly, whereas lower frequency differed more from results with weekly sampling. With monthly sampling, high precision and power of estimates could therefore be achieved with a low number of stations. With input of cost factors like ship time, labor, and analyses, the model can predict the cost for a given required precision in the time-integrated average of each variable by optimizing sampling allocation. A following power analysis can provide information on minimum sample size to detect differences between years with a required power. Alternatively, the model can predict the precision of annual means for the included variables when the program has a pre-defined budget. Use of time-integrated results from sampling stations with different areal coverage and environmental heterogeneity can thus be an efficient strategy to detect environmental differences between single years, as well as a long-term temporal trend. Use of the presented allocation model will then help to minimize the cost and effort of a monitoring program.
KeywordsEnvironmental surveys Cost of precision Optimal allocation Seasonal variability Marine environments Coastal ecology
Ongoing marine environmental monitoring programs, following the European Water Framework Directive, are designed to detect long-term trends over several years and to classify the environmental state into one of five ecological quality classes, ranging from bad to high quality (Heiskanen et al. 2004; Carstensen 2007). Marine environmental monitoring programs usually include only very few stations which are supposed to be representative of large areas. Carstensen (2007) used data from Limfjorden in Denmark over the period 1989–2004 and showed that 1291 samples of chlorophyll a was needed to define the quality classification within 5% with 80% power when a single station was monitored. Ongoing monitoring programs usually only include a small fraction per year of this number, e.g., one per month, which then would require more than 100 years to achieve sufficient data for a corresponding classification. Carstensen (2007) discusses methods to reduce the residual error by using for example seasonal adjustment models and covariates in the statistical analyses. Environmental data usually also needs to be transformed to achieve approximately normal distribution and sequential sampling over time tends to be autocorrelated (e.g., Priestley 1982), which causes problems in the statistical treatments. We show in the present paper that the sampling program and statistical procedures can be feasible when the goal is to only get an estimate of the annual or seasonal average of a variable, without an evaluation of within-year fluctuations. Such a program will be suitable for comparing the environmental condition between single years. This is highly needed in environmental research, where there is a common problem to find logic explanations for changes in for example primary production (e.g., Rydberg et al. 2006), zooplankton abundance (e.g., Moller et al. 2015), and recruitment of fish (e.g., Pecuchet et al. 2015) between consecutive years in the ecosystem.
Since aquatic environments are three-dimensional, more so than terrestrial environments, it might require measurements along a depth scale, dependent on the aim of the study. The pelagic habitat and its ecosystem are by definition not stationary in space, thereby introducing variability in a fixed sampling station due to e.g., migration of organisms and water mass exchange.
Since economy often constrains the scope of planned monitoring programs, it is necessary to optimize the design in a way that will give sufficient precision of the estimated parameters for the available budget. The cost for research ships for offshore operations may reach 10,000 € or more per day, making it especially important to optimize the field program. Our presented allocation model can be a guideline to design such a sampling program based on a given budget.
Technical developments in moored environmental sensor buoys, autonomous underwater vehicles (AUVs) and free-floating sensor buoys, and wireless communication have greatly improved the possibilities of efficiently monitoring environmental variables (e.g., Bogue 2011, Marcelli et al. 2014, Xu et al. 2014). However, there are still constraints, especially in their ability to measure biological variables (Gibbs 2012), and their accuracy is often lower than laboratory based analyses can provide. However, improvements in biological sensors (Moustahfid et al. 2012) have opened up for including more biological variables on the different sensor platforms. But a remaining problem is their need of regular maintenance, e.g., due to biofouling and power limitations.
Material and methods
where the terms are defined below:
Ntime = number of surveys per station over the year
Nspace = number of stations per survey
A = cost per day for research ship
n = average number of stations per cruise day
mi = cost for sampling and analysis of variable i
B = labor costs necessary for field and laboratory work, data compilation and reporting, and overhead cost and cost of other administrative work
The model can of course be made more detailed, including for example costs for land transportation, overtime costs, instrumentation, and equipment, and such factors can easily be included to get a good representation of the actual costs in each specific case. If they are proportional to the number of stations included, they add to the factors within the brackets in Eq. (1), otherwise they add to the factor B in the end of the equation.
The description above relates to a single variable, whereas a monitoring program usually includes several variables, each of them with their specific variability in time and space. As long as we have estimated their CV%, the model can also provide the precision as CI% for them, either through the formula CI% = t0.05 × CV% ∗ (N)−0.5 or by using a spreadsheet or statistical software to directly calculate CI% through input of CV%, N, and confidence level (here 95%, i.e., α = 0.05).
Material for model tests
For all statistical calculations and tests, we used the statistical software package Systat 13.1 (Systat software Inc., www.systat.com). Weekly vertical profiles of environmental variables were taken at 21 stations in an estuarine system in northern Bothnia Sea between 63.47°–63.52° N and 19.74°–19.86° E (Fig. 1) from June to December 2013. A logging instrument (Aanderaa SeaGuard, Seewww.aanderaa.com) with six different sensors were slowly lowered from the surface to the bottom with a measuring frequency of 1 per second. Sensors measured salinity, temperature, light (PAR, photosynthetic active radiation, 400–700 nm spectrum), oxygen saturation and concentration, chlorophyll a, and CDOM (chromatic dissolved organic material). The light profiles were used to calculate the light extinction coefficients (k in the equation Id = I0 ∗ e(−k ∗ d), where Id is the irradiance at d meter depth, I0 is the irradiance at the surface, and k is the slope in the exponential equation), whereas average values for 0–3-m depth profiles were used for the other variables. For each of the 21 stations, we calculated the arithmetic average for the whole period, and these 21 values were used to calculate the time-integrated arithmetic average and coefficient of variation, CV%, for each variable. These average CV% values for the different variables were used to calculate the necessary sample size to detect a 5% change, using power analysis at 95% significance level and a power of 80% (α = 0.05; β = 0.2).
The procedure of only using the time-integrated average of each station can be seen as pooling over time of measurements, and the problems of autocorrelation, seasonal trends, and non-normal distribution over time have no impact on the results. Test of normality of these temporal averages of the 21 stations, using Shapiro-Wilk test statistic (Shapiro and Wilk 1965) showed no significant deviation from normality for the six variables. In order to show how this procedure reduces variability in the estimates, we made calculations on temporal variability of each station and spatial variability at each time point. This was done for original data as well as for log-transformed data (improving normal distribution).
In order to define an optimal number of samples for the time-integrated average that should give a high precision with lowest cost, we calculated the arithmetic average and 95% confidence interval with sequentially reduced number of samples, going from one per week (24 samples) and down to one per 6 weeks (four samples). The mean and confidence interval were based on all 21 stations and standardized to show the result for 24 time points as 100%, in this context defined as “true value.” We used ±5% as maximum acceptable deviation from the true value, i.e., the result for 24 time points.
We used power analysis (one sample t test) on the time-integrated averages for the 21 stations in order to define sample size requirements for detecting 5 or 10% change in time-integrated averages between years. We then used our estimated time-integrated averages and CV% values as input, assuming the same CV% for 2 hypothetical years, and presented the result as graphs of statistical power versus sample size.
Weekly vertical sensor profiles
Time-integrated arithmetic average, range, and coefficient of variation of time-integrated average (given as CV% = SD / mean × 100%) within the area investigated based on all 21 stations and all 24 time points. The equation to calculate the precision (CI%) of the annual mean is also given
CV% of averages
Salinity (g dm−3)
12.48 × N−0.5
1.95 × N−0.5
PAR extinction coefficient (k, m−1)
24.83 × N−0.5
Oxygen saturation (%)
1.10 × N−0.5
CDOM (μg dm−3)
21.76 × N−0.5
Chlorophyll a (μg dm−3)
9.56 × N−0.5
Estimates of time-integrated arithmetic average of different variables, mean and range of coefficient of variation (CV% = SD / mean × 100), and sample size necessary to detect a 5% change with a power of 80% and a significance level of 95% (α = 0.05; β = 0.2). Estimates based on the original values from 21 stations and 24 sampling occasions. “Over time” means that the parameters are based on the spatial average of the 21 stations at each sampling occasion, i.e., average of 24 CV% values. “Over stations” means that the parameters are based on the temporal average for each station over the 24 time points, i.e., average of 21 CV% values. “Normal” means that original data from each individual station and time point has been used, and “log-normal” means that log transformed data was used
The optimal allocation model
We have used the results on variability, expressed as CV%, from the weekly vertical profiles of the six variables from 21 stations (See Table 1) in the model. The relationship between precision, given as CI%, and number of stations included is given in Table 1. By using these equations, we can see that oxygen and temperature can reach a precision better than 2% already with two stations included, whereas the extinction coefficient and CDOM require 25, respectively, 19 stations to reach a precision of 5%.
In this paper, we used a dataset of environmental variables covering only half a year. A logic time-unit to use is 1 full year. In high-latitude environments, there is a cyclic variation in the physio-chemical environment on a 1-year time scale. The phytoplankton succession and thereby other trophic levels in the planktonic ecosystem are usually adapted to this and hence show cyclic successions on a 1-year time scale. Therefore, annual means of environmental variables should reflect the average environmental condition for that year. Our dataset is mainly used to illustrate the methodological work and test the model, so the restricted cover of the year that we have used has no implications for the suggestions and conclusions. Furthermore, our dataset is dominated by physical and chemical variables, and even if biological variables in some cases might show higher variability, the basic principles for designing a monitoring program and using the allocation model still holds for such variables.
Inter-annual variability in the marine environment is typical for natural systems and are overlaid the slow changes related to the ongoing climate change occurring over several decades. Spatio-temporal variability within a year is also high, as exemplified in the range of individual records in Table 1 and CV ranges in Table 2. By only using time-integrated averages for each station in the analyses, we minimize effects of seasonal and short-term variations and thereby also reduce variability between stations. Such an effect can be seen in previous published results. One example is the study by Rantajarvi et al. (1998) who recorded chlorophyll a along transects from a passenger ferry in the Arkona Sea and the western Gulf of Finland. Data sampled over a 1-year period with sampling frequency varying from one per month to two per week gave a coefficient of variation ranging from 53 to 82% in the Arkona Sea and between 89 and 144% for western Gulf of Finland, as calculated from their Table 1. However, looking at the average chlorophyll concentration for the whole period, it ranged between 2.8 and 3.0 (mean 2.8) μg L−1 with a coefficient of variation of 4.4% in the Arkona Sea and between 2.7 and 4.5 (mean 3.9) μg L−1 with a coefficient of variation of 11.5% in the western Gulf of Finland. Thus, time-integrating station measurements, ideally over a full year, will reduce variability dramatically between the stations and a high precision in estimates of annual or seasonal parameters for the investigated area can be obtained.
Using pooled data, like we have done through the time integration, has been criticized for loss of important information. This critique is adequate when pooling data from individual organisms like single-fish specimens (Bignert et al. 1994, 2014). In such a case, different co-variates like age, gender, and body-fat level might be of relevance in explaining for example the level of eco-toxins and can be used to divide the population into more homogenous groups. When dealing with plankton ecosystems, each analytical sample is usually based on measurements on hundreds (mesozooplankton) to millions (phytoplankton, bacteria) of individuals and in reality then represent a pooled sample. Furthermore, the result from each time point will be present, and although each single data point should not be used in statistical tests, they will give a good indication of the seasonal variation in the stations.
Since most variables in aquatic environments in general show a vertical gradient, the results also depend on the depth of sampling. Usually variability decreases with depth and sampling only in the surface water will therefore maximize variability. In addition, we usually also find the largest seasonal variability in the surface water, where most of the seasonally variable primary production occurs. In our examples in this paper, we have integrated over 0–3-m depth, and the calculated PAR extinction coefficient showed that 1% of surface level (approximately euphotic zone, e.g., Valiela 1995) on average (±SD) was at 6.04 ± 0.71-m depth, meaning that we covered the upper half of the euphotic zone.
Since labor costs and cost for ship time usually are constraining budgetary factors for running a monitoring program with sufficient precision in estimated parameters, alternatives to the traditional cruise-based programs should be considered. During the last decades, the technical development in remote sensing, environmental sensors and platforms, and wireless sensor networks has been dramatic (e.g., Blondeau-Patissier et al. 2014, Moustahfid et al. 2012, Aguzzi et al. 2011, Albaladejo et al. 2010, Trevathan et al. 2012, Xu et al. 2013, 2014, Thosteson et al. 2009), and such techniques are used for a variety of environmental issues. The largest research program based on autonomous sensor platforms is the global Argo project (See http://argo.jcommops.org), where more than 3900 float units are presently spread over the world ocean, each one regularly sending data to host servers. However, as pointed out by Gibbs (2012), sensors for biological quantities and processes are far behind in development compared to sensors for physical and chemical measurements. The consequence of the limited ability to measure different variables, especially biological ones that are also given priority in current international directives (e.g., the Water Framework Directive, e.g., Ferreira et al. 2011), is that autonomous sensor systems still cannot replace traditional ship-based surveys. However, for specific research questions where a high-temporal resolution in data collection is crucial, moored sensor platforms might be the right solution.
Despite the choice of practical design of a monitoring program between traditional ship-based, use of autonomous platforms or combination of both, the use of a cost-precision model would optimize the use of resources. In our presented model, we have used time-integrated averages for each station which, in the studied area, gave normal distributed data and thereby a basis for sound and simple statistics when calculating precision and power. If only a single station is used to represent for example a given area, a water mass or a basin, the sampling size needed to detect a statistically significant change will be high. Data from a station in Limfjorden, Denmark was used by Carstensen (2007) in order to calculate necessary sample size to detect a deviation from a classification boundary level, and for 5% deviation, the necessary sample size ranged from 124 to 3400 (α = 0.05, β = 0.2) for the seven variables studied, with chlorophyll a requiring 1291 samples. If we assume monthly sampling and necessary number of stations to detect a 5% change in annual mean, we should come up with sample sizes between 24 (oxygen) and 504 (PAR extinction coefficient). Chlorophyll a, as a potential environmental quality indicator, would require 96 annual samples. If we reduce our requirements to detect a 10% deviation from a boundary level, we would need to take between 24 (temperature and oxygen) and 144 (PAR extinction coefficient) samples, i.e., less than 1/10 of the number presented from a station in Limfjorden by Carstensen (2007).
When using our model, it is necessary to decide which variable or set of variables should define the sampling allocation and which minimum precision we require for them. The sample allocation thereby given will then define the precision obtained for the remaining variables. Alternatively, the model can be used to estimate precision with a pre-defined budget and to calculate the resulting statistical power to detect differences between years. Note that in our examples of the model results, we have used arbitrary constants for cost of ship time, analysis costs, labor costs, number of stations per cruise day, etc. Any changes in these values will change the model results. Based on our own results from the six variables, we have used 12 sampling times in the model. Also, this number might be adjusted, up or down, if available information indicates that. Variables with a clear seasonal trend and where the variance change in parallel with changed level of the measure also require caution in the resource allocation process. Examples of such variables are biological production estimates, e.g., primary production, bacterial production or production at any higher trophic level, as well as abundance of species or taxonomic groups. Allocating more sampling during periods with high levels and high variability of measurements and reducing sampling frequency during periods with low levels and low variability will improve the precision in time-integrated estimates. This usually implies that the period March–September should be sampled more frequently than the period October to February in high northern latitudes, for example with nine of the 12 sampling times used for the first period. However, this might vary with the objective of the program.
Time integration of environmental data over a year from a set of sampling stations representing a given area can be used in environmental monitoring surveys to compare inter-annual differences and detect long-term trends with high precision. Such a data treatment will eliminate the problem of non-normal distribution and autocorrelation between samples in time series. The use of a simple allocation model helps the user to optimize the precision-cost relationship in such a survey.
This study was financed by the Strategic Marine Environmental Research programs Ecosystem dynamics in the Baltic Sea in a changing climate perspective (ECOCHANGE). We thank the crew onboard R/V Lotty for their contribution in the field work and Umeå Marine Sciences Centre for working facilities and support.
- Blondeau-Patissier, D., Gower, J. F. R., Dekker, A. G., Phinn, S. R., & Brando, V. E. (2014). A review of ocean color remote sensing methods and statistical techniques for the detection, mapping and analysis of phytoplankton blooms in coastal and open oceans. Progress in Oceanography, 123, 123–144.CrossRefGoogle Scholar
- Ferreira, J. G., Andersen, J. H., Borja, A., Bricker, S. B., Camp, J., da Silva, M. C., Garces, E., Heiskanen, A. S., Humborg, C., Ignatiades, L., Lancelot, C., Menesguen, A., Tett, P., Hoepffner, N., & Claussen, U. (2011). Overview of eutrophication indicators to assess environmental status within the European Marine Strategy Framework Directive. Estuarine Coastal and Shelf Science, 93, 117–131.CrossRefGoogle Scholar
- Heiskanen, A. S., Van De Bund, W., Cardoso, A. C., & Noges, P. (2004). Towards good ecological status of surface waters in Europe—interpretation and harmonisation of the concept. Water Science and Technology, 49, 169–177.Google Scholar
- Moustahfid, H., Jech, M. M., Weise, M. J., Horne, J. K., O'dor, R., Alexander, C. and Ieee (2012) Advancing “bio” sensor integration with ocean observing systems to support ecosystem based approaches. 2012 Oceans.Google Scholar
- Priestley, M. B. (1982). Spectral analysis and time series. London, New York: Academic Press.Google Scholar
- Rantajarvi, E., Olsonen, R., Hallfors, S., Leppanen, J. M., & Raateoja, M. (1998). Effect of sampling frequency on detection of natural variability in phytoplankton: unattended high-frequency measurements on board ferries in the Baltic Sea. ICES Journal of Marine Science, 55, 697–704.CrossRefGoogle Scholar
- Thosteson, E. D., Widder, E. A., Cimaglia, C. A., Taylor, J. W., Burns, B. C., Paglen, K. J., & Ieee. (2009). New Technology for ecosystem-based management: marine monitoring with the ORCA Kilroy Network. Oceans 2009 - Europe, 1 and 2, 1114–1120.Google Scholar
- Xu, C., Zheng, Z., & Ji, C. Q. (2013). Practical deployments of SEMAT on wireless sensor networks in the marine environment. 2013 I.E. Ninth International Conference on Mobile Ad-Hoc and Sensor Networks (Msn 2013), 442–448.Google Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.