Background

Animal tracking with GPS has become an important and widely used technique for wildlife research in recent years [1, 2]. It has matured from being used in simple home range studies to diverse applications examining habitat selection, animal migration, behaviour and physiology [36]. This might partly be due to the improved performance of GPS circuits, but also the addition to the tags of auxiliary sensors, like depth metres, light sensors or accelerometers [7, 8]. Accelerometers in particular are revolutionising the field of animal energetics by enabling researchers to determine animal behaviour remotely [912]. Recently, it has even been shown that acceleration data are suitable to determine an animal’s internal states, like disease status or stress level [13].

Since the first years of animal tracking, researchers have been aware that study outcomes might be affected by the quality of the tags [1417] and how the tags affected the animals [1823]. Fix rate (amount of successfully acquired positions), GPS accuracy (deviation from true location) and precision (measurement reproducibility) were considered repeatedly [24, 25], especially for studies in closed or mountainous habitats [26, 27] or for those on fine-scale habitat use [28], social interactions for disease transmission [29] or GPS-based behaviour segmentation and classification [3032].

In order to improve GPS accuracy, which is one focus of this paper, GPS receiver modules apply smoothing algorithms to the location data before they are output to the user. Typically, these algorithms are based on the extended Kalman filters [33]. These filters rely on a variety of movement models and are sometimes referred to as platform settings [34], the choice of which depends on the expected movements of the object being tracked (e.g. stationary, pedestrian walking, motor vehicle). The constraints imposed by these models can be problematic if different behaviours are to be extracted from the GPS tracking data of free-ranging animals. Only one type of filter can be used for the whole data set, and a pedestrian walking filter, for example, might faultily introduce movement at times that the animal is resting. Other options for accuracy improvement are differential GPS systems [35], i.e. ground- and satellite-based augmentation systems (GBAS and SBAS), which can be applied to the data directly or by post-processing raw GPS data, i.e. satellite pseudo-ranges, protocol RXM-RAW [34]. The latter system is still under development in many countries and presently evaluated [3638].

GPS fix rate and accuracy have also been shown to depend on the design of the tag, especially on the gain and orientation of the antenna [39]. However, this is only one factor to consider during tag development, especially with respect to affecting the behaviour, physiology or demography of the animal [20, 21]. Tag weight has received a lot of attention, especially in flying or diving animals [40, 41]. It is generally agreed that tag weight should be <5 % (or <3 % as recently proposed) of the body weight of the animal [4244]. However, tag shape, attachment and placement can be of equal importance and influence maximum suitable tag weight [18, 4549]. Depending on body structure, tags are mostly placed on the back (close to the centre of gravity), neck or tail of terrestrial animals and attached by harnesses, collars or glue. For specific research questions, especially involving accelerometer sensors measuring specific body movements, other tag placements have been used (e.g. head [50], scapulae [51]). In this way, the 3D orientation and dynamic movement of specific body parts can be quantified, allowing conclusions about the use of muscles or other motile body parts for certain behaviours or physiological processes.

Until very recently, due to the large size of traditional GPS tags, only the largest birds have been studied with GPS telemetry, most notably swans, geese and large raptors, often focussing on their migration ecology [5256]. As some goose species are presently of conservation concern, while others are considered pests, the need to improve our knowledge about these species has increased [57]. Swans and geese have been shown to display a variety of adverse behaviours during being handled and after tagging, e.g. increased preening or biting the tag/harness [54, 5861]. To decrease negative effects of tag aversion [62], it is necessary to shorten handling time and evaluate different types of tag placement and attachment methods for this species group [58, 6366]. Welfare assessments should be made alongside considerations of data quality in order to balance animal discomfort against the amount and quality of scientific knowledge gained from the respective study [67]. Backpack tags attached with harnesses are the most widely used way of tracking large waterbirds [59, 60], but with the recent miniaturisation of GPS tags, it is now possible to also attach or integrate them into neckbands. Numbered plastic neckbands have been successfully used for many years for individual resighting of wild geese and swans [6871], and for a few years, these have included radio or GPS tags [54, 72].

Here, we present a comparison of the performance of neckband and backpack GPS tags in captive geese during early habituation. During six successive days after deployment, we have quantified the effect of both tag types on the birds’ behaviour, namely how much their behaviours deviated from control individuals without a tag. Then, we have evaluated GPS accuracy (with GPS platform setting ‘pedestrian walking’ and/or SBAS) of both tag types, expecting worse performance of the neckband tags, because antenna orientation deviates more often from upwards. Furthermore, the success of classification of different behaviours from accelerometer data was compared and related to the attachment method. Finally, considering different research questions, a framework is presented to inform tag design and placement for future tracking of large waterbirds like swans and geese.

Methods

Tests with captive geese

Our experimental animals were six captive, at the time flightless Canada geese (Branta c. canadensis) that were held in a group of ten at the outside waterbird facility of the Netherlands Institute of Ecology (NIOO-KNAW). The geese were sub-adults and had not yet formed stable couples. The six focal birds (three males and three females) were selected based on body size, low resistance to being handled and medium to high social position within the group. They weighed on average 5.2 kg (range 4.2–6.9 kg) so that tags up to a weight of 150 g would be within the more conservative 3 % margin [18] generally accepted for animal tracking.

The tests were performed as three trials, each lasting 6 days, in January–April 2013. During the trial periods, the geese were split into two groups of five individuals, each including three experimental birds. The groups were assembled in a rotational design to account for individual variability of the geese. On the day before the start of a trial, of each group of experimental birds, one was equipped with a neckband tag (70 g, including neck ring, able to freely rotate around the goose’s neck, Fig. 1a), one with a backpack tag (80 g, including harness, Fig. 1b) and one was left without a tag as control. The tags were prototypes, programmed to continuously collect GPS positions (1 Hz) and tri-axial accelerometer measures (50 Hz) for 2 h/day, one group in the morning (8:00–10:00 GMT) and the other in the afternoon (11:00–13:00 GMT). They were fitted with helical antennas (Sarantel GeoHelix), which were vertical (pointing upward) on the neckbands and horizontal (pointing forward) on the backpacks. The omnidirectional reception pattern and physical shape of a helical antenna enabled more similar performance between neckband and backpack tags than would be possible with a more conventional patch antenna. By setting different receiver protocols, the tags were activated to receive SBAS signals (from EGNOS satellites) for improved position accuracy as well as to collect raw GPS data for post-processing.

Fig. 1
figure 1

Photography by AK

Prototype tags used in this study, each mounted on a Canada goose: a neckband tag, b backpack tag. Note the orientation of the tags. The tags were developed by Biotrack Ltd. in the collaborative project E-Track (www.etrack-project.eu).

Every day of the trials, the goose groups were observed in a 26 × 26 m fenced field located in a wider open area at the times that the tags were recording data. Each of the three experimental animals was observed and its behaviour recorded during three periods of 10 min/day; the timing of those observations was designed in a balanced rotational grid to correct for the influence of time of day on behaviour. Goose behaviour was recorded live using the Observer XT version 11 software (Noldus IT), and we discerned six main behaviours: feeding, resting, walking/running, vigilance, shaking and preening. Other behaviours were also scored, but were not included in the analyses because of low frequencies. During each 10-min observation period, a second observer recorded the distance (±2 cm) and angle (±1°) to the lighter breast region of the focal goose (ca. 10–20 cm from the deployed GPS tag) about 5–15 times using a Geosystems Total Station (TCR 307 version 350.24). The device was at a fixed position that had previously been accurately located (±2 cm) with a DGPS instrument (Ashtech ProMark 800).

Analysis of behavioural observations

In total, each of the six experimental geese was observed 54 times for 10 min, apart from one bird that had to be excluded during the last 3 days because of feather wear below a small part of the harness. Thus, our data set comprised 315 observation periods. For indication of tag-induced behaviour, we extracted the total duration of preening (including pecking the tag) and frequency of shaking (the head or body) per 10-min observation period. To evaluate likely impacts of tag or tag type on other behaviours, we further examined the total durations of feeding, vigilance and resting per observation period. Effects on these behaviours and possible habituation with time of deployment were tested by comparing generalised linear mixed models (GLMMs, R package ‘lme4’) with and without tag type (backpack BP, neckband NB and control C) and day since deployment as fixed factors and date, individual and sex as random factors by a likelihood ratio test (LRT).

Processing and evaluation of GPS data

Because of several tag failures, we were only able to use 9 days of GPS data from the backpack tags and 7 days of data from the neckband tags, including 32 of the 10-min observation periods. Thus, the data set for the following analyses comprised 18 h (61,873 positions) of backpack and 14 h (47,178 positions) of neckband GPS data, both normally processed locations (aka NMEA data; see list of abbreviations) as well as raw GPS data. The NMEA positions were improved by the GPS module under the platform settings of ‘pedestrian walking’ [34]. Of the NMEA positions, 29.9 % (BP) and 29.6 % (NB) had incorporated SBAS (EGNOS) signals for improved accuracy.

For evaluation of the effect of SBAS improvement independently of the GPS platform settings, we post-processed the raw GPS data with archived SBAS correction files (from the EDAS service, ftp://igs.ensg.ign.fr/pub/igs/products/), using RTKlib version 2.4.2 with single point positioning mode, an elevation mask of 15°, IONEX TEC Ionosphere Correction, SBAS Satellite Correction and Precise Satellite Ephemeris. The position data set generated was called precise point positions (PPP; see list of abbreviations). The settings were selected as to obtain independent positions without any inference on movement type and on the basis of stationary tests with different elevation masks. Because of gaps in the EDAS data set, only 29.3 % (BP) and 25.9 % (NB) of the GPS positions could be augmented to PPP data.

To determine GPS accuracy of the different GPS position types (NMEA and PPP), we compared them with the rather exact positions that were calculated from the measurements obtained with the Geosystems Total Station. This resulted in the following data sets that coincided in time with Total Station positions: 148 (BP) and 127 (NB) NMEA positions without SBAS augmentation (referred to as NMEA 1; see list of abbreviations), 79 (BP) and 53 (NB) NMEA positions with SBAS augmentation (NMEA 2) and 73 (BP) and 50 (NB) PPP positions. After projection of all positions into the appropriate UTM (Universal Transverse Mercator coordinate system) zone 31, distances between time-overlapping positions were calculated, indicating accuracy of the respective GPS positions: NMEA 1 (with ‘pedestrian walking’ filter, without SBAS), NMEA 2 (with ‘pedestrian walking’ filter, with SBAS) and PPP (without ‘pedestrian walking’ filter, with SBAS). We also calculated minimum convex polygon areas of NMEA and PPP tracks for each 10-min period and compared how SBAS augmentation and the GPS module platform settings (i.e. ‘pedestrian walking’ filter) influenced the spatial extent of the data sets. Distances and polygon areas were compared between different GPS data sets and tag designs using GLMMs (see above).

Behaviour classification from acceleration data

Similar to the GPS data, we obtained a data set of 9 days (18 h) of accelerometer measurements from the backpack tags and 7 days (14 h) from the neckband tags. The acceleration data could not be recorded continuously at 50 Hz, because of time required intermittently to write data to the tag’s memory. Therefore, we down-sampled it to a continuous 20 Hz, which is widely used for studies to classify animal behaviours. All accelerometer data sets were then divided into static acceleration (as moving averages over a sliding window of 2 s width) and dynamic acceleration (difference between raw acceleration and static acceleration).

Following the workflow of [73], we overlaid accelerometer data and behavioural observations and split the data set into acceleration bursts of the same behaviour. Because of observation delays of about 1 s (range 0.5–2 s), we selected only bursts where the animals showed the same behaviour for ≥5 s (most used periods were longer than 5 s, and using a longer period of ≥10 s did not qualitatively change the results). Thus, the main part of each selected burst showed the targeted behaviour so that influences of delayed scoring were minimised. The selection resulted in a data reduction to 38.5 % (BP) and 25.7 % (NB) of all bursts for analysis, leaving us with 848 (BP) and 537 (NB) behavioural bursts, totalling to 5.63 h (BP) and 4.21 h (NB) of behaviour-annotated accelerometer data. Due to this selection, the sample size of shaking (naturally of short duration) became extremely low and we excluded it from these analyses.

For each burst, we calculated the proposed acceleration statistics (see Table 2 of [73], excluding speed) and analysed them with a recursive classification tree algorithm (R package ‘rpart’), split by tag type. To avoid overfitting, we pruned both of the resulting classification trees (BP and NB) to their minimum complexity parameter. Their predictive power was finally quantified by prediction accuracy (proportion correctly classified bursts), overall and for each of the behaviours separately.

Results

Effects of tags on observed behaviour

Our analyses showed that both tags had an effect on the behaviour of the experimental geese (Fig. 2). Especially discomfort behaviours, like preening and shaking, were significantly increased if the geese were carrying a tag. A goose without tag (control C) would be preening for about 18 s within a 10-min period, whereas a goose with a backpack (BP) would preen and peck the tag for c. 83 s and a goose with neckband (NB) for c. 82 s (LRT, χ2 = 37.0, df = 2, p < 0.001). Frequency of shaking (in times per 10-min period) increased somewhat for neckband tags (C: 0.3 times, BP: 0.4 times, NB: 1.0 times; LRT, χ2 = 26.4, df = 2, p < 0.001). This can be explained by the fact that shaking the head to get rid of the neckband was scored as ‘shaking’. The extra time spent on shaking when geese wear the neckband was very small relative to the increased preening time. Thus, both types of tags caused extra discomfort to the birds. The geese reacted to backpacks and neckbands differently, but overall used the same amount of time for extra discomfort behaviour.

Fig. 2
figure 2

Bar plots (mean ± SD) of discomfort levels of geese carrying a backpack tag (BP), neckband tag (NB) or no tag (control C). Behaviours indicating discomfort are a duration of preening (and pecking the tag), b frequency of shaking (the body or the head), c duration of feeding, d duration of being vigilant and e duration of resting behaviour, each within 10-min periods

Furthermore, a goose with a tag would feed less (C: 193 s, BP: 152 s, NB: 148 s; LRT, χ2 = 11.1, df = 2, p = 0.004), but be no less vigilant (LRT, χ2 = 0.2, df = 2, p = 0.92) nor rest for shorter times (LRT, χ2 = 0.8, df = 2, p = 0.67). There were no differences between the two tag types in terms of duration of feeding, vigilance and resting, indicating that they were similar in the degree to which they affected goose comfort. In addition, there was no effect of time since deployment on the extent of any of the behaviours (LRT, χ2 < 1, df = 1, p > 0.30), showing that the birds were not yet getting habituated to the tags.

GPS accuracy

There were various effects of tag type and GPS position type on the accuracy in terms of distance to the (exact) positions as obtained by the Geosystems Total Station (Fig. 3). For NMEA 1 positions, modelled inaccuracy (GLMM model estimate of distance to exact position) was smaller for neckband tags (1.8 m) than backpacks (3.5 m; LRT, χ2 = 16.3, df = 1, p < 0.001), whereas the inaccuracies of NMEA 2 were similar for the two tag types (BP: 2.4 m, NB: 2.2 m; LRT, χ2 = 0.7, df = 1, p = 0.39). Also for PPP positions, the inaccuracy was smaller for neckbands (BP: 3.7 m, NB: 1.9 m; LRT, χ2 = 5.4, df = 1, p = 0.02). Consequently, tag type lost its influence on GPS accuracy only if the platform setting (‘pedestrian walking’) and SBAS enhancement were applied simultaneously. If only one or the other was applied, the neckbands were more accurate than the backpack tags.

Fig. 3
figure 3

Example tracks and accuracy statistics for GPS data of backpack (BP) and neckband (NB) tags on geese. a NMEA data (red), PPP data (blue) and distances of time-overlapping NMEA or PPP data with exact measures of the Geosystems Total Station (green) for a 10-min track of one goose with a backpack tag. b Same as a, but for a goose with a neckband tag. c Distances of NMEA 1, NMEA 2 and PPP data to exact positions, split for BP and NB tags. d Minimum convex polygon area of NMEA and PPP tracks for both tag types. Please see the list of abbreviations for explanations of NMEA 1/2 and PPP. Note that measures in the box plots are not model parameters (as reported in the text), but raw data, and do not account for random factors

When looking at each tag type separately, only the backpacks revealed an effect of GPS position type on accuracy, with the pedestrian walking filter improving accuracy (NMEA 1: 3.3 m, NMEA 2: 3.2 m, PPP: 3.8 m; LRT, χ2 = 7.1, df = 2, p = 0.03). GPS accuracy did not differ by GPS position type, and effect sizes were smaller for neckbands (NMEA 1: 2.5 m, NMEA 2: 2.1 m, PPP: 2.4 m; LRT, χ2 = 2.4, df = 2, p = 0.31).

Our data did not show significant differences in minimum convex polygon area with respect to tag type, neither for NMEA data (BP: 464 m2, NB: 650 m2; LRT, χ2 = 0.9, df = 1, p = 0.33), nor for PPP data tracks (BP: 579 m2, NB: 632 m2; LRT, χ2 = 0.1, df = 1, p = 0.77). However, note that sample size was very low. For backpacks, polygon areas were larger for PPP than for NMEA positions (NMEA: 547 m2, PPP: 663 m2; LRT, χ2 = 5.5, df = 1, p = 0.02), but there was no difference in neckband tags (NMEA: 566 m2, NB: 548 m2; LRT, χ2 = 0.01, df = 1, p = 0.94). Thus, for backpacks the PPP positions were more spread out.

Behavioural classification from accelerometer data

In the behaviour-annotated examples of static and dynamic acceleration for the backpack tag (Fig. 4a, b) and neckband tag (Fig. 4c, d), the variability in alignment (static acceleration) was less pronounced in backpack than in neckband tags and did not as easily match with behaviours (but see long feeding burst in Fig. 4a). For the neckband, e.g. feeding events were clearly depicted by peaks in the x-axis static acceleration. Note that the x-axis pointed towards the head in the neckbands and was not affected by the regularly occurring, movement induced events of the neckband rotation. In the dynamic acceleration patterns, resting was clearly visible for the backpack as well as the neckband (dynamic acceleration of all axes = 0). Feeding could in this example not be well discerned from dynamic acceleration of the backpack: similar to walking, it was characterised by a regular wave pattern in the x-axis dynamic acceleration. On the other hand, the dynamic acceleration of the neckband showed rather unique high amplitudes during feeding. However, there are several other peaks that were not easy to explain.

Fig. 4
figure 4

Example data of static and dynamic acceleration data for both tag types, down-sampled to 20 Hz resolution. a Static and b dynamic acceleration in all three axes (x-blue, y-green, z-red) of a goose with backpack tag with an overlaid bar of observed behaviour inserted (black rest, blue walk, green feed, red preen, pink vigilance). c, d Same as a, b, but for a goose with neckband tag. Note the differences in scale and differentiability of behaviours by tag type. The accelerometer was fitted into the tags so that for the backpacks x is the reverse of surge, y is the sway and z is the heave. For the neckband on a raised goose neck that means x is the reverse of heave and y and z indicate surge and sway depending on how the tag is turned

When examining the classification trees, the pitch in the x-axis for backpack tags (Fig. 5a) was a main statistic for the first split of resting/feeding (high pitch) from walking/vigilance (low pitch), indicating that body angle (leaning back or forward) was the best initial classification criteria. Then, on one part of the tree, resting was split off by low x-axis overall dynamic body acceleration (odbaX), i.e. little front/back movement of the tag. At the other part of the tree, vigilance was split off by low odba (very little overall movement) and walking was further split off by high x-axis frequency at the dominant power spectrum (strong wave pattern; fdpsX). The only indication visible for feeding was low x-axis maximum dynamic body acceleration (mdbaX).

Fig. 5
figure 5

Final, pruned classification trees of six main behaviours of observed geese as calculated by accelerometer statistics. a Classification tree for backpack tags with a legend of cross-validation success rates for each behaviour separately and an overall value. b Same as a, but for neckband tag data. The acceleration statistics used in the analyses are: pitchX—body angle along x-axis, pitchZ—body angle along z-axis, rollY—body angle along y-axis, mdbaX/mdbaY/mdbaZ—maximum dynamic body acceleration along the x-/y-/z-axes, odbaX/odbaY/odbaZ—mean dynamic body acceleration along the x-/y-/z-axes, odba—overall dynamic body acceleration (sum of the previous), dpsX/dpsY/dpsZ—maximum power spectral density of dynamic acceleration along x-/y-/z-axes, fdpsX/fdpsY/fdpsZ—frequency at the maximum power spectral density along the x-/y-/z-axes (for more explanation, see [73])

For the neckband tag, the classification tree looked very different (Fig. 5b). First, feeding, preening and walking were split off by high odba, indicating a high level of general tag movement. Further, splits by mdbaZ and rollY indicated that preening contained very strong right/left positional and angular changes. Note that right/left and front/back movements (y- and z-axes) are not easily discernible, because the neckband tag can freely rotate around the goose’s neck. Walking was split off by low mdbaZ and high odbaY, showing the right/left swaying walk of Canada geese. Feeding showed low angular change in the y-axis (rollY). On the other side of the classification, tree resting and vigilance were discerned by pitchX, indicating that the difference between the two rather inert behaviours was in body/head angle.

The overall classification success rates of the fitted trees (Fig. 5) were similarly high for both tag types (BP: 72.7 %, NB: 74.6 %). However, feeding and vigilance behaviours were better classified for neckbands than for backpacks, whereas preening, resting and walking were better detected in backpack data (Fig. 5, for details see Additional file 1: Table A1). Thus, neckbands were better at classifying behaviours that were mainly performed with the head, whereas backpacks seem better able to map whole-body behaviours.

Discussion

We have compared two of the most widely used types of attachment of GPS/accelerometer tags on large waterbirds, for the first time in a way that integrated the quantification of tag-induced adverse behaviour during early habituation, GPS position accuracy and behaviour classification success from accelerometer data. In general, both tag types showed a similar short-term discomfort effect on the birds, GPS accuracy was only slightly better for neckband tags, and overall behavioural classification success from accelerometer data was similar. Behaviour-specific classification results differed between the two types of tags; neckbands showed better results for behaviours involving head movement such as feeding or vigilance due to their position close to the head, and backpacks were more successful in detecting behaviours such as walking or resting, for which it is important that the tag is closely fixed to the body (not freely moving around the neck). Thus, a decision on the use of neckband or backpack tags for large waterbirds cannot be based on early habituation discomfort of the birds or GPS position accuracy, but should depend on possibly differential long-term habituation and the research question.

Apart from showing that short-term, tag-induced discomfort was similar for both tag types, we have also seen that time since deployment did not influence the geese’ behaviours during the first 6 days. This indicates that previously observed habituation to the tags takes longer, up to several weeks or months [54], and might then differ between different tag attachments. This can be important to consider, because one issue of many tracking research is the necessary duration of the study, for how long the animal shall carry a tag and collect data (e.g. foraging movement vs. lifetime tracking). Furthermore, it is important to understand for which time frame tracking data are affected by discomfort of the animal and when seemingly normal movement data can be observed.

In the light of animal welfare, a tag should only be mounted on the animal as long as it is working properly [67]. Therefore, drop-off mechanisms and weak links in harnesses have become more widely used [74]. Simple glue-on-feathers has been used for short-term deployments in the past [75], but can damage the feathers or skin of animals. However, if it is desirable that the tag stays on the animal for a long time, the material and attachment methods should be adapted, taking into account habitat conditions and the destructiveness of the animals. Some species of geese are known to destroy harnesses and backpack tags within a short time [61], and there is advice not to use a harness for this species group [58]. However, plastic neckbands are known to have a long retention time [71] and are less accessible for the wearer to inflict damage with its bill. Initial concern regarding neckband icing [76] has been lessened by studies showing that icing is exceptionally rare and does not have long-term fitness consequences for geese [70, 77]. Furthermore, the fact that unique IDs can be inscribed on the outside of a neckband for visual observations is a large advantage when quantifying survival and tag functionality.

External tags have been reported to have no significant long-term effect on animals [59, 71, 78], but there have also been cases showing various negative effects on animal behaviour and survival [22, 43, 63, 65, 79, 80]. Such effects can be intensified if the animal is flying or moving through water, depending on tag placement [47, 48]. It is possible that neckband tags have a higher aerodynamic resistance during flight and that their placement away from the centre of gravity might affect the bird’s balance, leading to higher flight costs. On the other hand, the harness of a backpack tag is likely to cause abrasions and hamper flapping of the wings, which is especially important for geese that almost exclusively use flapping flight. It was not possible for us to incorporate flight behaviour in this study, and there are, as far as we know, no other studies that compare the differences of negative effects of neckbands versus backpacks during flight or diving. However, from field experience it seems that the longevity of wild geese with neckband tags is higher, possibly due to better manoeuvrability during flight (AK, unpublished data). Furthermore, fat accumulation for migration can negatively affect body harness fit of backpacks, but is not problematic for neckbands, as neck size does not change.

Habituation to tags might also depend on the handling time and procedures during deployment, in which context we consider neckbands more suitable as they are more ‘standardised’ and quickly to attach. However, it is important to realise that our study was performed with captive animals and that the effects of being handled and carrying the tags on the behaviour of wild birds might differ. They are usually more constrained in food availability, and the need for vigilance for predators is higher. Therefore, tag-induced discomfort might not be affecting their time budget as much, and our findings of extra time spent preening and shaking are conservative measures. On the other hand, they might be initially more stressed by the tag than captive geese that are somewhat used to being handled. However, we are confident that the general conclusions of our comparative study can be transferred to wild waterbirds.

With awareness that external devices are most likely affecting animals (at least short term during habituation [54]), it is even more important to ensure the highest possible quantity and quality of collected data. For long-term studies, the extension of time in functionality has successfully been achieved by including solar cells for energy provision, so that tag running times are not time-limited by battery power. Regarding data quality, our results suggest that GPS accuracy from the particular backpack tags of this study was generally lower and more strongly improved by filtering and SBAS augmentation than neckband tags. Thus, for data sets that are not continuously of NMEA 2 type, it seems advisable to prefer neckband tags to obtain higher GPS accuracy. These findings differ from earlier results on ARGOS reception and lower accuracy of neckbands than backpacks [58]. However, in that study the antennas of the neckbands pointed down the neck of the goose, which has been shown to be problematic [39].

Furthermore, signal frequency, antenna type and orientation will have a profound effect on device performance, making comparisons difficult. The signal reception patterns of our tags’ helical antennas in relation to the GPS satellites would have affected the device performance in ways that are too complex to explore here, but our results are probably influenced most strongly by antenna characteristics. Helical antennas were chosen because their reception pattern is omnidirectional, and their orientation has less effect on received signal strength than would the patch antennas that are normally used with GPS receivers because they have higher gain. We did not test the effect of antenna orientation during flight. However, if the orientation of the neckband tag during flight results in the antenna hanging under the bird’s outstretched neck, the ability of the GPS receiver to acquire satellites is likely to be reduced. In contrast, the orientation of a back-mounted tag is likely to remain much the same during flight, and indeed the height of the bird and clear line of sight to GPS satellites would probably improve receiver performance.

As more global satellite navigation systems join America’s GPS (e.g. Russia’s GLONASS and Europe’s Galileo) in becoming available for widespread use, and augmentation of GPS accuracy is possible in different ways (differential GPS, SBAS, filtering), improved position data and their applicability for ecological research should be evaluated. For different research questions, GPS fix rate, accuracy or precision is of varying importance. For example, studies on habitat selection require GPS positions of high accuracy to allow for correct overlap with e.g. remote sensing data, whereas studies about individual behaviour or group movement need high GPS precision. Here, we take a first step to also raise ecologists’ attention to the likelihood that GPS positions from standard devices are augmented by some form of smoothing algorithm (probably based on an extended Kalman filter) before being output from the device. Rapidly sampled individual locations output from a GPS tag are unlikely to be statistically independent, and the type of filter/platform setting applied (e.g. stationary, pedestrian walking, motor vehicle) will influence the data. As the smoothing algorithms depend on fast sampling rates, their influence on infrequent GPS locations is less, probably negligible. However, if a fast sampling rate is used to test the accuracy of a stationary GPS tag, the results may not be representative of the performance on the animal [81]. Therefore, the use of raw GPS data might be advisable for high-frequency GPS tracking studies.

In most studies, especially working with small species, tag weight is most important [42, 43, 49] and often limits data quality and quantity options. However, we want to stress here that tag weight cannot be considered independent of the tag design and tag attachment method [47]. A device mounted on body appendages such as leg, tail or head should ideally be somewhat lighter than a tag attached to the back of an animal or implanted [64, 82], which is supported by the whole body mass. For large waterbirds, this seems to be less of an issue, the tags used here are quite heavy in absolute terms, but still <1.5 % of the body weight.

The integration of extra sensors into GPS tags is improving usability of position data for many applications. Accelerometer measurements are one example that is gaining more attention [12, 13], and we show that for the best use of accelerometer data it is critical to compare different means of attachment and placement [51, 83]. An accelerometer records the movements of the parts of the body directly underneath the tag, and as body parts move differently for the various behaviours, it is important to have a clear idea which tag placement can give the most significant acceleration data to detect the respective behaviour. Thus, depending on which type of behaviour is relevant for the attempted study, unconventional placement of small tags might be most effective and will improve the scope of accelerometer data sets even more.

We are aware that there might always be limitations to animal tracking [1, 20]. Some species simply are too small, do not tolerate handling stress or are hard to catch. However, by pushing the technological limits, animal tracking will be refined into a truly revolutionary tool for wildlife research. Ecologists should use the smallest tags with the least effect on the animal’s behaviour giving the best quality data for answering the research questions posed by various disciplines. By extracting natural, objective time budgets and discerning small-scale changes in movement and other behaviours besides the animal’s location, we will be able to explore an animal’s true natural behaviour and apply this knowledge to conservation, management or models of disease spread.

Conclusion

We have shown that captive Canada geese with backpack or neckband tags exhibit discomfort behaviours at a similar level during a short habituation period. GPS accuracy and general behaviour classification success based on accelerometer data from both tag types were similar. However, some behaviour types were better recognised by neckbands, others by backpacks. Therefore, we advise that the selection of either type of tag attachment method for large waterbirds should depend on the research question, including focal behaviours and necessary tag retention time.