Background

Global biodiversity loss is one of the biggest crises currently threatening the natural world [1,2,3]. Approximately 40% of plant species, 23% of invertebrates, and 18% of vertebrates assessed are considered to be threatened [4]. For mammals, 22% of described species [4] and 26% of assessed carnivores are considered to be threatened [5]. Some of the primary threats to carnivores include reductions in prey [6] and habitat [7], human-wildlife conflict (primarily in terms of livestock losses) [8,9,10,11], and illegal trade in animals or animal parts [11,12,13,14,15]. Conservation is therefore at the forefront of policy-making decisions worldwide [16,17,18,19].

To implement effective conservation and species management strategies, an understanding of target species populations, ecology, and behaviour is important to provide an insight into the status of the species, its needs, and putative causes of decline. When monitored over time, population data can indicate trends in particular groups of animals or in species as a whole [20, 21] or the efficacy of conservation efforts in comparison to control areas [22]. Remote sensing technologies such as camera traps have aided in species population assessments [23,24,25,26], as well as in our understanding of species ecology, and behaviour [23]. Advances in Global Positioning System (GPS) devices have further contributed towards understanding species ecology by providing insights into movements and habitat use. Knowledge of behaviour in space and time can provide insights into the importance of particular habitats and microhabitats for a species. For example, Wege et al. [27] identified novel foraging sites used by fur seals, contributing towards conservation policy-making as these sites were heavily utilised during the winter and were not previously considered when making assessments for potential marine protected areas (MPAs), where summer use is considered to be more important.

Accelerometer data loggers have been used independent of [28, 29] and in combination with [30,31,32] other remote sensing technologies such as GPS devices, magnetometers, and gyroscopes. Tri-axial accelerometers measure acceleration in three orthogonal axes (heave, surge, and sway), providing information on omnidirectional dynamic movement of an animal, as well as its posture (via static acceleration) [33, 34]. When accelerometers are used alongside devices such as GPS loggers, detailed behaviour patterns in space and time can be elucidated (e.g. [35, 36]). Unlike other remote sensing technology such as camera traps, loggers are fitted to the animals of interest (either directly or via collars or harnesses), providing data on the individual for the entire deployment period, not simply when activated. This feature is particularly useful for assessing the behaviours of cryptic species with large home ranges or that utilise difficult-to-monitor habitats (e.g. dense forests/bush, burrows, or expansive deserts). Although, the relative affordability and ease with which loggers can be deployed has led to their widespread use, less consideration appears to be given to device selection and subsequent downstream data processing. Most applications of animal-borne accelerometers have been to examine behaviours (e.g. [31, 37,38,39,40]), with several resulting in the categorisation of coarse-scale descriptions (i.e. three or four different behaviours) [28, 33, 40, 41]. While several studies have categorised behaviours manually by coarsely examining the acceleration traces generated (e.g. [33, 42]), others have implemented machine-learning techniques (many described in [43]), including random forests (RFs) [29, 31, 37,38,39,40, 43, 44], to classify behaviours to datasets using training and test data. Other approaches, such as the use of magnetometers, have proven successful in the determination of specific behaviours (e.g. biting and chewing in grazing herbivores) [45].

Cheetahs (Acinonyx jubatus) are medium-large felids inhabiting Africa and Iran [46,47,48]. They are classified as ‘Vulnerable’ by the IUCN [46] with the most recent population assessment (2014) suggesting just under 7100 adolescent and adult cheetah remain in the wild [47]. Population strongholds exist in southern and eastern Africa [46, 47]. Whilst conservation measures such as confiscation of traded animals and parts and reducing conflict with humans have been put in place, cheetah populations continue to decrease, with habitat loss, persecution, and illegal hunting and trade comprising major threats [7, 11, 47]. As such, detailed monitoring of cheetah movements, habitat use, and behaviour can assist with conservation efforts to ensure stringent monitoring of frequently used areas to reduce poaching and the adequate provision of resources to meet the needs of the species. To date, only coarse behaviours (active, inactive, and feeding) have been defined for cheetahs using remote sensing technology (accelerometers) [33, 41]. However, other ecological information such as different hunting strategies they may adopt and the associated costs of chasing prey [30, 32, 49, 50] (using GPS and accelerometers) have also been elucidated. However, while fine-scale behaviours, such as stalks (which may not result in a hunt), different movement gaits (e.g. walking vs. sprinting), and resting, have yet to be described for cheetahs, such data are available in other species (e.g. [31, 37,38,39, 43, 44]). Cheetahs are considered to be “extreme” movers, potentially reaching top speeds of 64mph (103kph) in a matter of seconds [51]. Therefore, the ability to distinguish between fine-scale behaviours may help to define the ecological needs of cheetahs, including hunting success rate, and, thus, contribute to conservation efforts. However, due to the high power and accelerations attained by cheetahs, monitoring their behaviour remotely may be limited by the capacity of individual devices.

The overall aim of the current study was to ground-truth behaviours performed by cheetahs against data collected using tri-axial accelerometers. Specifically, we wanted to (1) determine the accuracy with which a suite of behaviours in a cheetah’s repertoire could be defined; (2) determine whether this could be affected by the technical specifications of two different accelerometer devices, and; (3) provide a framework in the form of a vignette containing “R” code to develop behaviour categorisation models for other species of policy or conservation interest.

Methods

Study animals and collar preparation

This study was carried out in October 2012 at the Cheetah Conservation Fund (CCF) research centre near Otjiwarongo, Namibia (− 20.447763° N, 16.677918° E). Five resident adult cheetahs (three males and two females) were fitted with their own neck collars (nylon dog collars with plastic clip buckle: mass = 75 g, length = 570 mm, width = 20 mm) equipped with two tri-axial accelerometer data loggers: 1. G6, CEFAS Technology Limited, Lowestoft, UK (maximum = 2.3 g, size = 40 × 28 × 15 mm (L × B × D), mass = 18 g including urethane encasement, recording frequency = 30 Hz); 2. X8M-3, Gulf Coast Data Concepts (GCDC), LLC, Waveland, MS, USA (maximum = 8.6 g, resolution = 0.001 g, size = 50 × 30 × 12 mm (L × B × D), mass = 21.6 g including epoxy encasement, recording frequency = 50 Hz). To ensure the collar remained centred on the ventral side of the neck, an additional weight comprising four steel nuts (120 g) was added. The total weight of the fully equipped collars was approximately 235 g (see Additional file 3: Figure S1a for constructed collar design). Prior to being fitted to cheetahs, collars were hung on a metal rail with the accelerometers located at the bottom of the collar to allow for the devices to be calibrated (see “Data processing—accelerometers” below).

Exercise arena and video capture

Cheetahs were exercised by chasing a lure (cloth rag) attached to ~ 285 m of cord around a pre-determined track. The lure machine, powered by an electric motor, was remotely controlled by a keeper, such that the speed and direction of the lure could be altered at will. The keeper changed the direction of the lure strategically to attempt to outwit the chasing animals and prevent capture of the lure. The chasing animals were thereby encouraged to employ different strategies to try to catch the lure, including stalking behaviour and high-speed pursuits. Each cheetah was exercised individually and behaviour was recorded using a video camera (Canon PowerShot SX230 HS; Canon, Japan). Typically exercise bouts lasted 10–15 min and consisted of three or four active chases (e.g. running, stalking) punctuated by two or three lower intensity rest periods (e.g. lying down, walking, standing). Collars were retrieved when the animal had finished exercising. As exercise bouts comprised periods of activity and inactivity, data associated with both hunting and resting were collected and ground-truthed against video footage.

Data processing—accelerometers

Following exercise bouts, data loggers were removed from collars and data were downloaded. The data collected for both devices were calibrated to correct for non-centred mounting of the devices on the collars using the region of the dataset where the collars had been attached to the metal rail (see Additional file 1: Study details, collar calibration, and calculations). The data corresponding to the times of captured video footage were selected and the rest of the data were removed. Static acceleration (acceleration due to gravity; Additional file 5: Figure S2, static acceleration diagram) was derived for each axis from the corrected heave (acceleration in vertical axis), surge (acceleration in longitudinal axis), and sway (acceleration in transverse axis) data by calculating a rolling mean over a two-second window [52]. Dynamic acceleration was then calculated for each axis as the absolute result of subtracting static acceleration for a particular axis from its raw acceleration. Vectorial Dynamic Body Acceleration (VeDBA), Vectorial Static Body Acceleration (VeSBA), animal static acceleration (Anim.stat), pitch, and roll were also determined (Additional file 1: Study details, collar calibration, and calculations).

Data processing—video footage

All video footage (approximately 58 min; 103,869 CEFAS logging events; 174,185 GCDC logging events) was synchronised with its complementary accelerometer datasets. Video footage was assessed frame-by-frame (Avidemux software; Developer: Mean) and cheetah behaviour was matched with the accelerometer data. Initially, 22 behaviours and behaviour combinations were identified (Table 1). Any other behaviour was recorded as ‘other’ and instances where behaviour could not be assigned (e.g. if an object obstructed a clear view of the animal) were removed from the dataset as we could not be certain of categorisation, resulting in a loss of approximately six minutes’ worth of data. Each labelled dataset was amalgamated to give two master spreadsheets of labelled accelerometer data; one for each model of accelerometer device (CEFAS and GCDC).

Table 1 Behaviours performed by cheetahs that were used to label accelerometer data

Data analysis

Data analysis was carried out in ‘R’ version 3.4.3 [54] using the ‘h2o’ package version 3.16.0.2 [53]. RF analysis (Additional file 2: Code) was conducted on the datasets labelled with behaviours. The datasets were split into three, such that 60% of cases were selected at random to entrain models (training dataset), 20% of cases were selected at random to validate the model (validation dataset), and the remaining 20% were used to test model performance (test dataset). The training data were used to entrain the RF model to categorise specific behaviours (see Table 2 for behaviour list). The validation dataset was then used to assess the performance of the model via model accuracy, (root) mean square error (RMSE and MSE), and r2. The validation data were also used to refine the model by altering model parameters and comparing the metrics listed above. The test data were only used once at the end of the process to compare model accuracy after validation to the outputs of the training dataset.

Table 2 Structure of fine-, medium-, and coarse-scale random forest (RF) behaviour models and frequency of occurrence of each

Model structure

Initially, models were entrained to categorise 17 behaviours (Table 2, fine-scale). The predictor variables were: heave, surge, sway, static heave, static surge, static sway, dynamic heave, dynamic surge, dynamic sway, VeDBA, VeSBA, Anim.stat, pitch, and roll. A stopping criterion (stopping-rounds = 2) was implemented to optimise the duration for which models were run. A stopping criterion of two stops fitting the model when the two-tree average is within 0.1% accuracy of the previous two-tree average. If this criterion is increased, the average is taken over the specified number of trees. Models were refined by changing their depth and comparing their overall accuracy (percentage of correctly categorised behaviours divided by percentage of incorrectly categorised behaviours). The model with the highest accuracy was retained. Models were re-run using coarser behavioural categories (Table 2). For each model, cross-validation was performed using five folds and comparing mean accuracy, RMSE, MSE, and r2 to the training dataset. ‘R’ code for RF model constructs and additional model information are provided in the supplement (Additional file 2: Code). Logger performances were compared for the categorisation of each individual behaviour using chi-squared tests.

Results

There was no indication of significant overfitting when cross-validation of models was carried out (Table 3).

Table 3 Mean cross-validation and training data metrics

CEFAS loggers

In the first model, behaviours were categorised on a fine scale. The behaviours sought to be categorised were: crouch, lie, sit, stand, head movement, crouching stalk, lying stalk, sitting stalk, standing stalk, walking stalk, trotting stalk, walk, trot, canter, gallop, pounce, and other. The overall accuracy of the model was 83.3% (MSE = 0.18, RMSE = 0.42, r2 = 0.99). However, as ‘other’ was an uninformative category, which didn’t require correct positive categorisation in the training dataset as it comprised a ‘rag bag’ of various movements, its categorisation could be disregarded (but the variable still remained in the model). Once disregarded, the accuracy of the model increased to 84.2%. Sitting stalk, lying stalk, lying, and standing were categorised with over 90% accuracy. Behaviours with < 50% categorisation accuracy included pouncing, crouching, trotting, and trotting stalk (see Table 4 ; Fig. 1 for full description of classification accuracy). Crouching behaviour was most often confused with lying (14.5%), other (27.6%), and standing (47.4%). Trotting and trotting stalk were most often confused with cantering (trotting: 21.1%; trotting stalk: 19.4%), galloping (trotting: 6.1%; trotting stalk: 13.2%), other (trotting: 42.2%; trotting stalk: 45.8%), and walking (trotting: 19.7%; trotting stalk: 9.7%). In addition, trotting stalk was confused with walking stalk (6.3%) (Fig. 2A). In terms of predictor variables, static acceleration in all three axes was most important in categorising behaviours (heave: scaled importance (improvement of MSE relative to maximum improvement across all predictors) = 100%, explanatory power = 14.4%; sway: scaled importance = 80.0%, explanatory power = 11.5%; surge: scaled importance = 71.1%, explanatory power = 10.2%), followed by VeDBA (scaled importance = 53.3%, explanatory power = 7.7%), roll (scaled importance = 52.0%, explanatory power = 7.5%), and heave acceleration (scaled importance = 51.6%, explanatory power = 7.4%). In all, these six predictors explained 58.7% of the RF model variance.

Table 4 Behaviour categorisation accuracy (%) for each accelerometer type and for each model resolution
Fig. 1
figure 1

Heatplots of behaviour categorisation in fine-, medium-, and coarse-scale models. Behaviours in the green zone indicate those that were categorised well by both CEFAS and GCDC loggers, whereas behaviours in the red zone were categorised poorly by both

Fig. 2
figure 2

Confusion matrices for CEFAS accelerometer data loggers. A Confusion matrix for fine-scale behaviour model; B confusion matrix for medium scale behaviour model; C confusion matrix for coarse scale behaviour model. All matrices represent validation datasets. Darker shades of red represent higher classification values and shades that are more white indicate lower values. Values in each cell indicate frequency of classification

In the second, coarser model, several behaviours from the previous model were combined in an attempt to reduce the error rate. ‘Pounce’ was entered as ‘other’ as it could not be categorised reliably and was often confused with several other behaviours. Behaviours in this model included: Sedentary (‘crouch’, ‘lie’, ‘sit’, ‘stand’), head movement, moving stalk (‘trotting stalk’, ‘walking stalk’), crouching stalk, sitting stalk, lying stalk, standing stalk, gallop, canter, trot, and walk. The overall accuracy of this model was 84.7% (MSE = 0.16, RMSE = 0.40, r2 = 0.98), which increased to 86.6% when ‘other’ behaviours were removed. Sedentary, sitting stalk and lying stalk were the only behaviours where the prediction accuracy surpassed 90%. The prediction accuracy for two behaviours was lower than 50%; cantering and trotting (see Table 4 and Fig. 1 for full description of classification accuracy). Cantering was most often confused with galloping (43.8%), other (34.4%), and trotting (8.0%), while trotting was most often confused with cantering (20.1%), other (40.9%), and walking (20.8%) (Fig. 2B). Once again, static accelerations (heave: scaled importance = 100%, explanatory power = 13.0%; surge: scaled importance = 87.8%, explanatory power = 11.5%; sway: scaled importance = 76.2%, explanatory power = 9.9%), VeDBA (scaled importance = 73.7%, explanatory power = 9.6%), and VeSBA (scaled importance = 50.8%, explanatory power = 6.6%) featured among the important predictor variables. Overall, the top five predictors explained 50.7% of the RF model variance.

The final, coarsest model comprised a simplified RF where behaviours were either deemed to be active (‘walk’, ‘trot’, ‘canter’, ‘gallop’, ‘walking stalk’, ‘trotting stalk’, ‘pounce’), inactive (‘crouch’, ‘lie’, ‘sit’, ‘stand’, ‘crouching stalk’, ‘lying stalk’, ‘sitting stalk’, ‘standing stalk’), head movement, or other. The accuracy of this final model was 88.2% (MSE = 0.11, RMSE = 0.32, r2 = 0.89). When ‘other’ was removed, model accuracy increased to 92.7%. Inactivity was predicted to the highest degree of accuracy and head movement with the lowest. Activity was predicted with 86.9% accuracy (see Table 4; Fig. 1 for full description of classification accuracy). Whilst head movement was most often confused with all three behaviour categories; inactivity (47.3%), activity (28.8%), and other (24.0%), activity was most often confused with inactivity (58.9%) and ‘other’ behaviours (40.2%) (Fig. 2C). The most important predictors in this model were VeDBA (scaled importance = 100%, explanatory power = 15.1%), static acceleration in the heave axis (scaled importance = 84.1%, explanatory power = 12.7%), and dynamic acceleration in the sway (scaled importance = 63.0%, explanatory power = 9.5%), heave (scaled importance = 56.8%, explanatory power = 8.6%), and surge (scaled importance = 52.5%, explanatory power = 7.9%) axes as well as static acceleration in the surge axis (scaled importance = 50.4%, explanatory power = 7.6%). In total, the top six variables explained 61.6% of the model variance.

GCDC loggers

The models outlined above were repeated for the GCDC data loggers. The model containing the finest-scale behaviours was 85.5% accurate (MSE = 0.17, RMSE = 0.41, r2 = 0.99). Accuracy increased to 85.8% when the category ‘other’ was omitted. The sedentary behaviours of lying, lying stalk, and sitting stalk were categorised with > 90% accuracy. Crouching was the only behaviour that was categorised with < 50% accuracy (see Table 4 and Fig. 1 for full description of classification accuracy); it was most often confused with standing (41.1%) and other behaviours (41.1%) (Fig. 3A). Static acceleration in all three axes was the most important predictor for behaviour (heave: scaled importance = 100%, explanatory power = 16.0%; sway: scaled importance = 80.1%, explanatory power = 12.8%; surge: scaled importance = 72.8%, explanatory power = 11.6%). In total, static acceleration variables explained 40.4% of the model variance.

Fig. 3
figure 3

Confusion matrices for GCDC accelerometer data loggers. A Confusion matrix for fine-scale behaviour model; B confusion matrix for medium scale behaviour model; C confusion matrix for coarse scale behaviour model. All matrices represent validation datasets. Darker shades of red represent higher classification values and shades that are more white indicate lower values. Values in each cell indicate frequency of classification

When the second model outlined above was reproduced for the GCDC logger, it had an accuracy of 86.2% (MSE = 0.15, RMSE = 0.39, r2 = 0.98), which increased to 87.4% when the ‘other’ category was removed. Behaviours categorised with > 90% accuracy were sedentary, lying stalk, and sitting stalk. No behaviour categorisation was < 50% accurate; those behaviours that were most difficult to categorise were trotting, cantering, and head movement (see Table 4, Fig. 1 for full description of classification accuracy, and Fig. 3B for confusion matrix). In terms of predictor variables, static acceleration in all axes (heave: scaled importance = 100%, explanatory power = 14.7%; surge: scaled importance = 78.6%, explanatory power = 11.6%; sway: scaled importance = 78.2%, explanatory power = 11.5%), VeSBA (scaled importance = 51.8%, explanatory power = 7.6%), and Anim.stat (scaled importance = 51.5%, explanatory power = 7.6%) were most important in determining behaviours in this model. The top five predictor variables explained 53.1% of the model variance.

In the final RF model for the GCDC data loggers, activity, inactivity, head movement, and other behaviours were categorised. This model performed with a categorisation accuracy of 90.2% (MSE = 0.09, RMSE = 0.31, r2 = 0.90), which increased to 92.9% when the ‘other’ behaviour category was omitted. Inactivity was most easily classified and no behaviour had a classification accuracy below 61.3% (see Table 4, Fig. 1 for full description of classification accuracy, and Fig. 3C for confusion matrix). In this model, static acceleration in the heave (scaled importance = 100%, explanatory power = 12.7%) and surge (scaled importance = 84.8%, explanatory power = 10.8%) axes were most important for categorisation, followed by VeDBA (scaled importance = 83.2%, explanatory power = 10.6%), static acceleration in the sway axis (scaled importance = 75.8%, explanatory power = 9.6%), Anim.stat (scaled importance = 61.1%, explanatory power = 7.8%), VeSBA (scaled importance = 57.4%, explanatory power = 7.3%), and dynamic acceleration in the heave (scaled importance = 52.0%, explanatory power = 6.6%) and sway (scaled importance = 52.0%, explanatory power = 6.6%) axes. Together these eight variables explain 79.1% of the model variance.

Logger comparison

Generally, the higher capacity loggers, with higher recoding frequency (GCDC) outperformed their lower capacity (CEFAS) counterparts when determining cheetah behaviour (Table 5; Fig. 1). This was particularly evident during medium- and fine-scale behaviour categorisation. Of the 30 different behaviour-model combinations run, the CEFAS loggers significantly outperformed the GCDC loggers only four times: standing2 = 11.63, df = 1, p < 0.001) and galloping2 = 19.46, df = 1, p < 0.001) in the fine-scale model, galloping2 = 23.15, df = 1, p < 0.001) in the medium-scale model, and inactivity2 = 4.42, df = 1, p = 0.035) in the coarse model (Table 5; Fig. 1). Conversely, the GCDC loggers were significantly better than the CEFAS loggers at behaviour categorisation on 11 occasions (Table 5; Fig. 1), notably when defining trotting, moving stalks, head movement, pouncing, and, in the fine-scale model, most sedentary behaviours. Whilst the GCDC loggers were better overall at defining behaviours on a fine- and medium-scale, there was no significant difference between the two devices when categorising behaviours on a coarse-scale (Table 5).

Table 5 Comparison of device performance (percent difference) for each behaviour in fine-, medium-, and coarse-scale models

Discussion

Recent, ongoing, and imminent species declines have prompted conservation focused research outputs (e.g. [55,56,57,58]). To make worthwhile steps towards successful conservation, it is important to not only know about populations and habitat requirements, but also about species’ behaviour, to ensure all ecological and biological needs can be met. To ensure that natural behaviours are not compromised, monitoring techniques should be minimally invasive [59], for example through the use of remote sensing technologies (such as lightweight GPS and accelerometer devices). Whilst the use of such devices has been gaining momentum for decades, interpretation of their outputs for behavioural categorisation is relatively recent, especially when high resolution and precision are desired (e.g. [31, 37,38,39,40]).

The cheetah (Acinonyx jubatus) is listed as ‘vulnerable’ [46] with wild populations purportedly decreasing [47]. However, descriptions of their movements and behaviour (particularly fine-scale behaviour) remain scarce [7, 30, 41, 49]. Accelerometry has been used to describe coarse behaviours in cheetahs; Grünewälder et al. [41] determined “mobile”, “stationary” and “feeding” with 84–94% accuracy and Shepard et al. [33] provide (without categorisation metrics) acceleration traces of walking, chasing, and trotting behaviours. In the current study, three RF models were constructed for fine-, medium-, and coarse-scale behaviour determination on high (“GCDC”, Maximum acceleration: ~ 8 g; Frequency: 50 Hz) and lower (“CEFAS”, Maximum acceleration: ~ 2 g; Frequency: 30 Hz) capacity devices. Using coarse modelling approaches (behaviours categorised as “Active”, “Inactive”, or “Head movement”.), RF analysis rendered consistent results (93% accuracy) between the two devices. Both devices categorised inactive behaviours well (GCDC = 95.0% accuracy and CEFAS = 95.5% accuracy), with the CEFAS logger performing best (Table 5). However, head movement could be described with just over 50% accuracy using CEFAS loggers (over 10% lower than GCDC devices), often confused with inactivity, which may be due to the core body remaining stationary. This finding suggests that even the use of collars does not guarantee reliable detection of head movement, which may, in fact, be beneficial if coarsely categorising behaviours. Head movement categorisation was better with GCDC devices (GCDC = 61.3% and CEFAS = 50.3% accuracy), which is probably due to their higher frequency recording so slight, short-lived movements were more likely to be detected [60]. In this model, dynamic acceleration (VeDBA and heave) was consistently important across both devices, which is unsurprising given the disparity in dynamic motion between the three categories. However, static acceleration in the heave and surge axes were also important parameters for the CEFAS logger, and several additional measures of static acceleration were important for the GCDC loggers (e.g. VeSBA), suggesting that postural changes may also play a significant role, especially as logger sensitivity increases. Practically, it is important to ensure consistent logger attachment, and device capacity and configuration should be considered when using results from previous studies to underpin novel research. The results of the current study are consistent with the only other study to categorise cheetah behaviour remotely using accelerometers [41]; “stationary” (“inactive”) behaviours were most accurately classified, followed by “mobile” (“active”) behaviours (Table 6). Feeding was specifically measured in this study rather than more generic “head movement” so the two categories may not be directly comparable. Classification of active behaviour was better in the current study and the overall performance was slightly better, which may be due to differences between the loggers used (bi-axial versus tri-axial), logger configuration, or analytical approach (SVM versus RF).

Table 6 Comparison of coarse behaviour model performance in the current study to Grünewälder et al. [41]

Fine-scale behaviours

One objective of the current study was to determine whether fine-scale cheetah behaviours could be categorised using accelerometers. Such data could provide information on cheetah ecology and assist conservation efforts. For example, if foraging requirements (indicated by chases and stalks), the frequency of abandoned hunts (by identification of stalks with no subsequent pursuit), or changes in behaviour associated with life history such as rearing offspring could be identified accurately, specific ecological needs could be addressed by ensuring prey and habitat requirements were met. In the current study, a fine- and medium-scale behaviour categorisation model was produced for each accelerometer device; the finest-scale model included all behaviours that could be derived from video footage, whilst the medium-scale model collapsed several of these categories together, resulting in marginally coarser classification. Although both performed less well than coarse (active/inactive/head movement) models, there was little difference between the fine- and medium-scale models themselves. As such, it may be prudent to categorise behaviours on the finest or coarsest scales as they are more accurate (coarsest) or the benefit of additional behavioural information outweighs the marginal cost in accuracy (finest). To our knowledge, this study represents the most ambitious attempt to elucidate cheetah behaviour, with the highest resolution, fine-scale models incorporating 16 behaviours, and the coarser, medium-scale models including 11. Across both sets of models and both devices lying, lying stalks, and sitting stalks were always classified with over 90% accuracy; in fact, sitting stalks were classified with 100% accuracy on the GCDC loggers. This is the first time that these behaviours have been classified remotely with such accuracy in cheetahs. Stalks usually occur prior to pursuits of prey in cheetahs [61] so knowledge of the habitats that may facilitate stalks and successful hunts could be of great importance for survival. As such, acceleration data combined with GPS locations could provide vital insights for conservation. Furthermore, sedentary behaviours were categorised with a high degree of accuracy, which, when combined with other approaches, may provide insights into cheetahs’ physiological and habitat requirements.

The lowest (walking) and highest intensity (galloping) locomotory behaviours were categorised best with a higher error rate for intermediate trotting and cantering. Nevertheless, classification accuracy of walking and galloping was always between 68 and 78%. As footfall and rhythmicity of each locomotory gait varies (Fig. 4), incorporating periodicity may beneficial to differentiate them [28] but may be limited by rapid transitions between them and a lack of continuous measurements of any one in isolation. Correct identification of each gait could assist conservation efforts by providing insights into hunting and evasion, potentially facilitating the identification of areas favoured for hunting or resting, or those where cheetahs may be threatened by other species. Incorporation of lab-based techniques, such as indirect calorimetry, would allow us to determine the relative energetic cost of each behaviour and the overall proportion of their daily energy expenditure attributable to each [62]. Such an approach would inform management strategies, potentially reducing conflict with livestock owners [10].

Fig. 4
figure 4

Depiction of locomotory gaits and relative duration of each footfall in the gait cycle. a Walking gait, b Trotting gait, c cantering gait, and d galloping gait. In each plot FR = front/fore right foot; FL = front/fore left foot; BR = back/hind right foot; BL = back/hind left foot. Shaded areas indicate when foot is in contact with the ground; unshaded areas indicate when foot is raised

Pouncing represented the worst categorised behaviour, with only 4.8% accuracy on the CEFAS logger (Table 4). This poor performance is likely due to a combination of low recording frequency, the instantaneous nature of the behaviour, and its rarity. However, pouncing is likely to be uncommon in free-ranging adult cheetahs, which primarily implement stalk-and-chase hunting strategies [61]. As such, the low classification accuracy in this context is not concerning but may be problematic when trying to define the behaviour in ambush hunters.

It is worth noting as a caveat that certain behaviours were underrepresented in the datasets e.g. pouncing and stalking, with some others overrepresented (e.g. sedentary behaviours such as lying). This imbalance may have affected how the data were split into training, validation, and test data and, ultimately, the models. However, with the approach taken here ecologically important behaviours such as stalking could be incorporated into the models and was likely to be randomly selected for a split based on its representivity. The result was reasonably reliable models (according to accuracy, MSE, RMSE, and r2) with several under-, over- and well-represented behaviours being predicted accurately,

Application and experimental design

Generally, there was good consistency in model accuracy between CEFAS and GCDC accelerometer data. However, it is important to note that whilst CEFAS loggers categorised more behaviours with > 90% accuracy (n = 8; Table 4), than the GCDC loggers (n = 7), the latter categorised fewer behaviours with < 50% accuracy (GCDC: n = 1; CEFAS: n = 6). It is therefore important to determine a priori, where possible, the scale at which behavioural categorisation is desired and select devices and analytical models accordingly.

Whilst reliable categorisation was established for some cheetah behaviours, the GCDC logger outperformed the CEFAS logger in both fine- and medium-scale models (Table 5). Two potential reasons may explain this better performance: GCDC loggers could record higher accelerations (~ 8 g versus the ~ 2 g capacity of the CEFAS loggers) and were set to record at higher frequencies (50 Hz vs. 30 Hz). During high intensity galloping the CEFAS loggers reached maximum capacity, which may have led to a high frequency of correct categorisation for this behaviour but it may also have contributed to a high false positive rate for other, relatively high intensity behaviours such as trotting. Whilst locomotory behaviours were most often confused with adjacent behaviours for both loggers (e.g. cantering was most likely to be confused with trotting or galloping), trotting, cantering, and trotting stalks were the only locomotory behaviours identified with < 50% accuracy (occurring on the CEFAS loggers). Recording frequency may be important as higher logging rates will generate more data, rendering more information for entrainment of RF models. Recording frequency may be particularly important for rarely occurring and short-lived behaviours such as pouncing. Whilst a multitude of variables were used to entrain RF models, it is possible that others may also assist in categorising behaviours, for example, periodicity (movement rhythmicity) may be useful in discriminating between various locomotory gaits [28]. As there were significant performance differences between the two devices, there is an onus on researchers to select those with an appropriate specification for their study species, or, indeed, to use multiple different loggers in tandem on the same individual.

Behaviours have previously been categorised using accelerometer data loggers without the use of complementary video capture in cheetahs [33] and other species [42]. In such studies, behaviours are usually differentiated via variations in dynamic body accelerations and posture. However, if such an approach were implemented here, fine-scale sedentary behaviours would have been erroneously categorised, resulting in misinterpretation of species behavioural ecology. For example, resting behaviours such as standing or lying down could have been confused with sedentary stalks, where the former would signify true resting but the latter would indicate an attempted hunt. It is therefore recommended that data collected on accelerometer devices are synchronised with an extensive behavioural repertoire for the species.

Conclusions

In this study we found that the ability to categorise behaviours differed significantly between data loggers. The results of the current study can be used to form the basis of remotely monitoring coarse- and fine-scale behaviours of the vulnerable cheetah. Knowledge of their behaviours can inform cheetah biology and ecology, particularly when combined with other loggers such as GPS. Once the basic needs of the cheetah have been firmly established, the efficacy of conservation and management practices can be maximised, and strategies can be implemented to mitigate human-cheetah conflict. The approach taken here may be adopted in remote-sensing studies of other species but careful consideration of logger capacity and recording frequency is recommended.