Study area
The Baltic Sea is an inland sea with a surface area of 415,200 km2 and is one of the largest brackish-water basins in the world. It is commonly divided into several sub-basins separated by sills, including a transition zone to the North Sea consisting of the Kattegat and the Belt Sea (#11–17 in Fig. 1). These sub-areas differ considerably in several physical characteristics including ice cover, temperature, salinity, and residence time of the water (Leppäranta and Myrberg 2009). Surface salinity provides an illustrative example: while it is normally 20–25 in the Kattegat area, it is only 6–8 in the central Baltic Sea and drops below 2 in the northern and eastern extremities of the Bothnian Bay and the Gulf of Finland. As a result the composition of the biota changes considerably along these gradients (HELCOM 2007b; Feistel et al. 2008).
The human population in the catchment is 85 million, and human activities display a similar, distinctive north–south, east–west pattern. Population density outside main cities varies from more than 100 persons per km2 in the southern and south-western parts to less than 1 person per km2 in the northern and north-eastern parts of the catchment area (CIESN & CIAT 2005). In terms of land use there is a high proportion of agricultural land in the south-eastern and south-western parts, while boreal forest, wetlands and barren areas dominate in the north (Anon. 2001).
The long residence times (Leppäranta and Myrberg 2009) and the strong saline stratification of the water column, including natural hypoxia in the deep basins (Conley et al. 2009a), make large parts of the Baltic Sea sensitive to nutrient enrichment and eutrophication. Human activities and settlement, including e.g. agriculture, urban and industrial waste water, energy production and transport result in greatly increased loads of nutrients (nitrogen and phosphorus) from the (relatively large) 1,700,000 km2 catchment area entering the Baltic Sea (HELCOM 2004; Schernewski and Neumann 2005; Savchuk et al. 2008; HELCOM 2009).
Data sources
Three types of data are used in this study: (1) monitoring data for 2001–2006 (in some cases only 2001–2005 or 2001–2004), (2) information on reference conditions (RefCon), and (3) ‘target levels’ defined as acceptable deviation (AcDev) from RefCon.
Most of the monitoring data representing actual status (AcStat) originate from the HELCOM Cooperative Monitoring in the Baltic Marine Environment Programme (HELCOM COMBINE, see HELCOM (2008) for details and note that the Kattegat is included under both HELCOM and OSPAR) carried out in cooperation between the Baltic countries, and partly from national monitoring and assessment activities (e.g. Svendsen et al. 2005; OSPAR 2008).
Data representing long-term trends in inputs of nutrients (nitrogen and phosphorus) to the Baltic Sea are derived from the HELCOM Fifth Pollution Load Compilation (HELCOM 2010). All measurements and analytical methods used as well as quality assurance procedures are described in details in the HELCOM COMBINE Manual, Parts A, B and C (HELCOM 2008).
In this study, specific focus has been placed on indicators relevant to HELCOM objectives (HELCOM 2007b; Backer and Leppänen 2008), in particular nutrients (objective i), chlorophyll-a (objective ii), water transparency (objective iii), benthic invertebrates and submerged aquatic vegetation (SAV) (objective iv). For the description of the AcStat all Baltic Sea states have used the 2001–2006 period, except Denmark, which used the period 2001–2005 for the Kattegat and Great Belt and 2001–2004 for all other areas.
RefCon
RefCon, which are “… a description of the biological quality elements that exist, or would exist, at high status, that is, with no, or very minor disturbance from human activities” (Anon. 2000) are used to quantify the degree of disturbance observed in the environment. Furthermore, they should represent part of nature′s continuum and must reflect variability. Three principles for making the concept of RefCon operational are (1) reference sites, (2) historical data, and (3) modelling. Expert judgement can be used as a supplement when spatially based (option 1 and 2), modelled (option 3) or combinations of 1, 2 and 3 are not possible. In this study, the RefCon are mostly based on historical data and modelling, since reference sites no longer exist in the Baltic Sea and the use of expert judgement is occasionally less transparent. The RefCon’s for nutrients (dissolved inorganic nitrogen (DIN) and dissolved inorganic phosphorus (DIP)), chlorophyll-a, water transparency (Secchi depth) and benthic invertebrates in the open parts of the Baltic Sea, obtained from various sources described below, are shown in Fig. 2.
For nutrients, chlorophyll-a and water transparency, RefCon’s are basin specific and mostly based on historical data (HELCOM 2006; Fleming-Lehtinen 2007; Fleming-Lehtinen et al. 2008; Henriksen 2009). Modelled and site-specific RefCon’s have been used for parts of the Danish Straits (OSPAR 2008). The reference values used are largely in line with those presented by other sources, e.g. Sanden and Håkansson (1996), Aarup (2002) and Schernewski and Neumann (2005).
RefCon’s for benthic invertebrate diversity in open water basins, measured as gamma diversity, i.e. the average number of species in a sub-basin per year, were calculated based upon data from 1965 to 2006 (HELCOM 2009). RefCon’s varied by an order of magnitude between the Arkona Basin and the Bothnian Bay due to the salinity gradient, which constrains species distributions (Bonsdorff and Pearsson 1999). For the coastal water assessments, different national indices have been used; see HELCOM (2009) for details.
For SAV in coastal waters, namely depth distribution of Fucus vesiculosus and Zostera marina, which constitute monitoring species in coastal waters only, RefCon’s are based on historical records, e.g. Reinke (1889), Waern (1952) and von Wachenfeldt (1975) as well as Boström et al. (2003), Martin (1999), and Krause-Jensen et al. (2003).
AcDev
For the open basins of the Baltic Sea, AcDev values are set basin-wise for each indicator. Two different principles are used for setting the AcDev, according to whether indicators show a positive response (increasing in value) to increases in nutrient inputs or a negative response (decreasing in value). For an indicator showing positive response (e.g. nutrient concentrations and chlorophyll-a), AcDev has an upper limit of +50% deviation from RefCon (HELCOM 2009). Setting AcDev to 50% implies that low levels of disturbance (defined as less than +50% deviation) resulting from human activity are considered acceptable while moderate (i.e. greater than +50%) deviations are not considered acceptable for the body of water in question. However, in exceptional cases the +50% AcDev can be exceeded if scientifically justified. For indicators responding negatively to increases in nutrient input (e.g. Secchi depth and depth limit of SAV) the AcDev’s have in principle a limit of −25% (HELCOM 2009), although AcDev’s used for benthic invertebrates are slightly greater in magnitude, ranging from −27 to −40% (HELCOM 2009). Whereas an indicator with positive response can theoretically show unlimited deviation, indicators showing negative response have a maximum deviation of −100% and a deviation of −25% is, in most cases, interpreted as the boundary between low and moderate levels of disturbance. These +50% and −25% “principles” are under discussion, but these initial and pragmatic values are in accordance with the WFD (Anon. 2000, 2005) and other eutrophication assessment approaches (Bricker et al. 2003; HELCOM 2006; NOAA 2007; OSPAR 2008; Bricker et al. 2008; Claussen et al. 2009). The AcDev’s used for the coastal waters are largely defined by the WFD implementation process, in particular the WFD intercalibration activity in the Baltic Sea (Anon. 2008b).
Assessment principles and methods
The methodology used in this study to assess eutrophication status of a water body, the HEAT, is based on indicators, grouped according to a predefined manner. The grouping method used follows the WFD (Anon. 2000, 2005) quality elements (physical–chemical features, phytoplankton, SAV, benthic invertebrates) corresponding to HELCOM eutrophication objectives i, ii, iii (physical–chemical features), iv (phytoplankton) and v (SAV & benthic invertebrates); subsequently combined into a final classification of ‘eutrophication status’.
Using the described RefCon, AcDev and AcStat concepts, the basic assessment principle becomes: RefCon ± AcDev = EutroQO, where the latter is a “eutrophication quality objective” (or target) corresponding to the boundary between good and moderate ecological status. When the AcStat data exceed the EutroQO or target, the areas in question is regarded as ‘affected by eutrophication” cf. the BSAP.
Thus, following the basis assessment principle described above, a selection of indicators with RefCon and AcDev values turns qualitative goals like HELCOM’s five eutrophication objectives into operational targets, on which objective and transparent assessments of eutrophication status can be based. While the RefCon’s can be considered the “anchors” of the assessment, AcDev’s from RefCon’s are the necessary “yardsticks” while AcStat is actual indicator status. The assessment principles used by HEAT are summarised in Fig. 3.
The HEAT tool integrates all the elements described above and is based on: (1) Indicators representing well documented eutrophication effects with synoptic information on RefCon, AcDevs, and AcStat, (2) Quality Elements sensu the WFD, (3) HELCOM Ecological Objectives, (4) weighting of indicators within quality elements, and (5) integration of the Quality Elements used into a final assessment based on the ‘One out—all out’ principle sensu the WFD.
Step 1: Indicators and boundary setting
The EQR is a dimensionless measure of the observed value (AcStat) of an indicator compared with the reference value (RefCon). The ratio is equal to 1.00 if AcStat is better than or equal to RefCon and approaches 0.00 as deviation from RefCon becomes large.
Step 1A: Indicators with a positive numerical relationship to nutrient input
For an indicator showing positive response to nutrient input, the EQR is defined by:
$$ {\text{EQR}} = {\text{RefCon}}/{\text{AcStat}} $$
(1)
$$ 0 \le {\text{EQR}} \le 1 $$
(2)
where the observed value of the indicator (AcStat) is equal to or less than the reference value, then the EQR is equal to the maximum achievable, 1.00. For a given reference value, increasing values of AcStat give lower EQR, with EQR approaching zero as the status value becomes infinitely large (Fig. 4a).
The value of EQR is used to assign a quality class to the observed status. The classes in descending order of quality are RefCon, High, Good, Moderate, Poor, Bad. The central definition of the quality classes is given by the value of AcDev. The boundary between Good and Moderate status is defined as being where the deviation from RefCon is equal to the AcDev. That is:
$$ {\text{AcStat}} = \left( {1 + {\text{AcDev}}} \right) \times {\text{RefCon}} $$
(3)
Substituting for AcStat in (1) gives:
$$ {\text{EQR}}_{{{\text{Good}}/{\text{Moderate}}}} = 1/\left( {1 + {\text{AcDev}}} \right) $$
(4)
The EQR boundary between High and Reference status is always set equal to 0.95. If EQR is above 0.95, it is implicitly assumed that the indicator has a status equal to RefCon. This deviation is allowed in order to take into account a degree of uncertainty in the observations of RefCon and present status as well. Thus, this permissible deviation from RefCon (5%) represents a generic estimate of the uncertainty margin for all indicators. The quality class of “Reference” will rarely be used and quality class “High” therefore, in practice, represents the highest achievable status. However, the High/Ref boundary is employed in determining boundaries between the other classes.
The values for the boundary between Reference/High status and the boundary between Good/Moderate status constitute fixed points from which the remaining boundary values are calculated. For practical reasons the span of the two highest classes and the next two classes have equal width, i.e.:
$$ {\text{EQR}}_{{{\text{RefCon}}/{\text{High}}}} - {\text{EQR}}_{{{\text{Good}}/{\text{Moderate}}}} = {\text{EQR}}_{{{\text{Good}}/{\text{Moderate}}}} - {\text{EQR}}_{{{\text{Poor}}/{\text{Bad}}}} $$
(5)
That is, the difference between the values of EQR defining the Reference/High and Good/Moderate boundaries is equal to the difference between the Good/Moderate and Poor/Bad boundary values. This Eq. 10 can be rearranged to give the value for the boundary between Poor and Bad status:
$$ {\text{EQR}}_{{{\text{Poor}}/{\text{Bad}}}} = 2{\text{EQR}}_{{{\text{Good}}/{\text{Moderate}}}} - {\text{EQR}}_{{{\text{RefCon}}/{\text{High}}}} $$
(6)
For example, consider a case where the AcDev from RefCon is 50%. The boundary between Good and Moderate status is 1/(1 + 0.5) = 0.667. And according to (6), the boundary between Poor and Bad status lies at 0.383 (Fig. 3a).
This leaves two remaining boundaries to be defined, the boundary between Good and High status and the boundary between Poor and Moderate Status. These boundaries are defined as the midpoints between the two adjacent boundaries:
$$ {\text{EQR}}_{{{\text{High}}/{\text{Good}}}} = 0.5{\text{EQR}}_{{{\text{RefCon}}/{\text{High}}}} + 0.5{\text{EQR}}_{{{\text{Good}}/{\text{Moderate }}}} $$
(7)
$$ {\text{EQR}}_{{{\text{Moderate}}/{\text{Poor}}}} = 0.5{\text{EQR}}_{{{\text{Good}}/{\text{Moderate}}}} + 0.5{\text{EQR}}_{{{\text{Poor}}/{\text{Bad}}}} $$
(8)
For the example of AcDev equal to 50% the values for the High/Good and Moderate/Poor boundaries equal 0.808 and 0.525, respectively. Figure 3a shows how the value of EQR for the boundary between the classes varies with the AcDev from RefCon.
The method used for calculating class boundaries does not allow for use of AcDev greater than 110% for indicators with a positive response to nutrient input, as the Poor/Bad boundary would otherwise become negative (Fig. 3a). Consequently, it would therefore become impossible to obtain a “Bad” status as an EQR cannot be negative, irrespective of the extent to which the observed status exceeds RefCon.
Step 1B: Indicators with a numerical negative relationship to nutrient input
For an indicator showing a negative response to nutrient input, e.g. depth limit of SAV or Secchi depth, the EQR is defined as:
$$ {\text{EQR}} = {\text{AcStat}}/{\text{RefCon}} $$
(9)
$$ 0 \le {\text{EQR}} \le 1 $$
(10)
Here, for a given reference value, the EQR is directly proportional to the observed value, and is equal to the maximum value of 1.00 if the AcStat equals or exceeds the reference value.
As for the case of positive response, the AcDev from RefCon is used to define class boundaries for classification according to EQR value. Again, the Good/Moderate boundary lies where the deviation from RefCon is equal to the AcDev (3).
Using (3) to substitute for AcStat in (9), and remembering that AcDev is negative, gives:
$$ {\text{EQR}}_{\rm Good/Moderate} = (1-\text{AcDev})$$
(11)
For an AcDev of 50%, the boundary for Good/Moderate status is 0.5. Figure 4b shows how the class boundaries vary with the AcDev. Given the value for the Good/Moderate boundary and the Ref/High boundary (0.95), the values for the remaining boundaries are calculated in the same manner as described above for indicators with a positive response to nutrient input. Figure 4b is useful in illustrating the limit on allowable AcDev for an indicator with negative response. Choosing an AcDev greater than 52.5% would mean that according to the previously described method of calculating class boundaries, the Bad/Poor boundary becomes negative (Fig. 4b) and it is therefore impossible to arrive at a classification of Bad, no matter how far from RefCon the observed status is.
Step 2: Quality elements and final classification
An EQR value and a set of class boundaries are calculated for each indicator, but the overall status classification depends on a combination of indicators. First, indicator EQR values are combined to give an EQR value for a specific Quality Element (QE), and similarly the indicator class boundaries are combined to give the class boundaries for the QE. In the simplest case, where all indicators within a QE have equal weights, the EQR for the QE is the average of the indicators’ EQRs within the QE and each QE class boundary (e.g. Moderate/Good boundary) is found as the average of the class boundary values for all indicators representing that specific QE.
Within a QE, it is also possible to assign weighting factors to indicators according to expert judgement. The classification of the QE is then given by comparison of the weighted averages of the EQRs with the weighted averages of the individual class boundaries. Thus, the same weighting is applied both in calculation of the EQR for the specific QE as well as QE class boundary values.
The lowest rated of the QEs will because of the ‘One out—all out’ principle determine to final status classification. This principle is employed for two reasons: (1) all five HELCOM objectives for the open basins are required to be met independently, and (2) this principle is stated in the WFD (Anon. 2000) for assessing ecological status of coastal waters.