Introduction

The golden jackal (Canis aureus) is a canid species spreading into central and northern Europe from southern and eastern countries such as Bulgaria, Romania, Serbia, and Hungary and is abundant in countries in the Middle East and Asia (Krofel et al. 2017; Spassov and Acosta-Pankov 2019). One method for monitoring this species’ presence is bioacoustic monitoring (Giannatos et al. 2005), which can help to gain information of animal abundance (Marques et al. 2013) and has already been used to study species across different taxa (Dawson and Efford 2009; Suter et al. 2017). Golden jackals perform an array of different vocalization types, for example long distance calls for territorial defense, social cohesion, or to find a mating partner (Jhala and Moehlman 2004; Comazzi et al. 2016). Their highly complex and long (29.9 s ± 3.7 s) vocalization repertoires (Comazzi et al. 2016) could theoretically vary between individual packs and meta-populations (Hennelly et al. 2017). During acoustic monitoring, which is usually done from dusk until dawn, researchers mostly estimate the distance to, and the composition of the group based on the heard acoustic signal emitted by the jackals (Giannatos et al. 2005; Szabó et al. 2009; Hatlauf and Hackländer 2016). Passive bioacoustic monitoring, i.e., placing audio recording units in the field to capture vocalizations of the focal species, is used for researching canid species in general (Suter et al. 2017) and also for golden jackals specifically (Comazzi et al. 2016). For confirmation of presence, a fixed recorder might be especially useful when conditions are sub-optimal for researchers to reach the study area frequently (Marques et al. 2013). Precise measurements are necessary to estimate population trends and pack size (Comazzi et al. 2016). Imprecise distance estimation potentially leads to under- or overestimation of density indices due to human bias (Yip et al. 2019).

Scott et al. (1981) as well as Nadeau and Conway (2012) noticed that humans are prone to errors in distance estimations of acoustic signals. The given distance estimations were up to 30 times the true distance. This error in distance estimation is likely to increase with the distance to the sound source. Sebastián-González et al. (2018) and Yip et al. (2019) found that the relative sound level (RSL) presents a more reliable distance estimation, thus removing human subjectivity from distance estimations. The RSL is the relative measured emitted energy of a signal in an audio recording that is measured with specific audio software (Yip et al. 2019). As the RSL—which is typically measured in decibels (dB)—has a predictable relationship with distance it can be an estimator for distance.

Recent studies that focus on distance estimation with autonomous recording units (ARUs) mostly used Song Meter devices by Wildlife Acoustics (Suter et al. 2017; Sebastián-González et al. 2018; Yip et al. 2019). In this study, we applied a low-budget alternative, the AudioMoth devices by Open Acoustic Devices. Hill et al. (2018) showed that AudioMoth devices can successfully detect gunshots on distances of over 1 km. This prompted us to investigate AudioMoths for opportunities or weaknesses specifically within golden jackal monitoring.

For our general aim to evaluate the application of AudioMoth devices in distance estimation for golden jackal monitoring with RSL, the broadcasted playback needs to be at a similar sound pressure level (SPL) as a live golden jackal howl. Therefore, we firstly evaluated the SPL of golden jackal howls and secondly analyzed the maximum possible distance for the detection of golden jackal howls using AudioMoth devices. We hypothesized that golden jackal howls (contrary to gunshots) cannot reliably be detected by AudioMoths over a distance of more than 1 km and that distance and number estimation becomes less precise with increasing distance. Finally, with the predicted distances of the RSL, we suggest adapted minimum grid sizes of AudioMoth devices, to not only be able to confirm jackal presence but also reliably predict distances.

Methods

Study design

We used 11 AudioMoth devices (five devices of version 1.0.0 and six devices of version 1.1.0) in this study. We calibrated microphones using a 1 kHz sinus wave at a known SPL. To analyze the SPL of golden jackal howls we mounted multiple AudioMoths in an enclosure in combination with a camera trap to evaluate the distance from the howling animal to the deployed AudioMoth. The SPL of a single howl on 1 m distance from the source was calculated, using the formula \({L}_{2}={L}_{1}-\left|20\right.*\mathrm{log}(\frac{r1}{r2})\left|\mathrm{dB}\right.\) with the sound source of known SPL as reference. On the basis of this, we proceeded to calculate the SPL of a group howl using the formula \({L}_{\varepsilon }=10*{\mathrm{log}}_{10 }\left({10}^\frac{L1}{10}+{10}^{\frac{{L}_{2}}{10}}+\dots + {10}^\frac{Ln}{10}\right)\mathrm{dB}\), assuming that the animals howl close to each other and at the same SPL, with L = previously calculated SPL and r = distance. We calculated the SPL of a group howl with four individuals, as four fundamental frequencies were shown at the same time in the spectrogram the standard howling (used previously in monitoring). We used a sound level meter (Peak Meter PM6708) to calibrate the megaphone in the field trials, according to the calculated SPL.

The study area was located in a lowland rural area in eastern Austria without nearby forests and 121 m above sea level (see study design in Fig. 1 of the Online Resource). We mounted the devices in a linear transect with approximately 100 m distance between devices and chose additional 20 positions along the transect to be calling stations (CS). The position of each device and each CS was determined using a global positioning device (GPS) device. We covered the devices in plastic bags and fixed them at a height of approximately 1.3 m to thin metal poles. The lowest distance from an AudioMoth to CS was 8.81 m; the maximal distance was 1540 m. We programmed the AudioMoths to record at a “high gain” and a “sample rate” of 8 kHz. A recorded howl of four individuals was played back using a megaphone, set to the previously calculated SPL. The researcher holding the megaphone was kneeling and holding the megaphone approximately 50 cm above ground to simulate the height of a golden jackal. We played the group howl once at each CS with a megaphone (Pyle PMP57LIA, 50 W), followed by a single howl, imitated by a human (the corresponding author).

Wave data analysis

Distance estimation with RSL

We analyzed the data in Raven Pro v. 1.5 (Center for Conservation Bioacoustics 2014), using a 512-point Hamming window spectrogram for visualization. We examined RSL using the “max power” function. For better visualization (and thus, detectability), the recordings were amplified by 12 dB, using the “batch amplifier” function. We marked the smallest area possible around the signal as a selection, following the method described by Yip et al. (2019). Selections generated in the amplified sound files were then copied back to the original sound files, keeping the original RSL.

Estimating the number of responding animals

We estimated number of responding animals by counting the simultaneously visible fundamental frequencies in the spectrogram of each detected group howl.

Statistical analysis

We transferred the selections into the statistic software R 4.0.4 (R Core Team 2021) using the package “Rraven” (Araya-Salas 2017). We then calculated the distance between each CS and the ARU’s using the package “sp (Pebesma and Bivand 2005). Likewise, the bearing of a vocalization relative to the microphone of the ARU was calculated using the package “geosphere” (Hijmans 2019) and classified in variables “front” and “back” depending on the relative direction of the emitted howl relative to the microphone. We tested the known distance of the vocalizations in the resulting dataset (i.e., the selections generated in Raven Pro 1.5.) (Graf and Hatlauf 2021) for normal distribution using a Shapiro–Wilk normality test. No normal distribution was found, thus we tested for differences between the measured SPL and the two call types (“group howl” and “researcher”, directions (“back” and “front”) and device versions (“1.0.0” and “1.1.0”) utilizing a Mann–Whitney U-test. We fitted linear models, using the known distance as the response variable.

Analysis of RSL

We ranked multiple models using an Akaike Information Criterion corrected for small sampling sizes (AICc), only selecting the most parsimonious model for further analysis. To estimate the optimal recording distances and enabling reliable distance predictions, we repeatedly removed the top 5% of the howls with the longest known distance until the Nagelkerke–R2 was higher than R2 = 0.8, using R2 as an estimator for model prediction quality, resulting in five subsets of data to our models. We ran a k-fold means cross validation (k = 10) on each model of each data subset with the lowest AICc-score, using the mean absolute error (MAE) as a further estimate of model prediction quality. We calculated the distance estimation error as the value of the predicted distance subtracted by the known distance.

Analysis of number of responding animals

To test whether the number of responding animals can reliably be estimated in spectrograms over distance, we fitted two linear models (one for the full dataset and one for dataset showing the best considered model after AICc-ranking), with the estimated number of responding animals as the response variable and the known distance as the predictor variable.

Results

The calculated SPL of a single golden jackal howl was 101.8 dB. Using the formula for multiple, coherent emitting sound sources, the determined SPL of four simultaneously howling golden jackals resulted in 107.0 dB. Therefore, the megaphone was calibrated to play the recorded group howl at the closest possible SPL. The measured RSL in Raven Pro showed similar maximal values for the recorded group howl (112.5 dB) and group howl emitted by the megaphone during the field test (113.7 dB).

Ten of 11 AudioMoth devices recorded wave files as expected, one device had a corrupted file, thus leaving n = 400 vocalizations to be detected in the spectrograms. 57.5% of those vocalizations (ntotal = 230; nresearcher = 120; ngroup = 110) were detected in the spectrograms. The longest distance to successfully identify vocalizations by means of visually scanning the spectrogram was 814.57 m for both call types.

No significant differences between “call types” emitted by the megaphone or the howling researcher and “distance” (W = 6330.5; p = 0.593) (see Fig. 2 in Online Resource 1), nor between the different “device versions” and “distance” (W = 6226; p = 0.5315) were found. Therefore, both “call types” and “device versions” were therefore included in further analyses and not analyzed separately or excluded. Significant differences between the “directions” and “distance” (W = 3239; p < 0.001) were found, thus all calls with variable “back” (n = 55) (where the microphone was aligned opposite from the calling direction) were left out from further analysis, leaving nhowls = 175 for further analysis.

For each considered dataset, the AICc-score was lowest when the RSL was modeled as a second order polynomial term (all for the AICc model selection considered models can be found in Table 1). For the full dataset model, the k-fold means cross validation showed a MAE = 82.57 m (± 14.37 m). Reducing the maximum distance of the dataset by 5% per step showed, after removing 20% of the data, the maximal distance was 400.86 m, R2 was above 80% (R2 = 0.803) and the MAE reduced to 39.02 m (± 7.41 m). The distance estimation error increased with growing distance to the ARU (Fig. 1) and had a maximum of − 576.32 m for the full dataset model and − 146.63 m at the 80% dataset model.

Table 1 Considered models in AICc model selection. The bold models with the lowest AICc-score for each respective dataset were considered for the k-fold means cross validation. Models were sorted after their AICc-score and their R2; thus, the model with lowest AICc is at the top. As another measure of model prediction quality, the mean absolute error (MAE) is listed
Fig. 1
figure 1

Direct relationships between distance estimation error and distance in both datasets; the x-axis shows the distance and the y-axis the estimation error calculated as the value of the predicted distances subtracted by the known distance. The RSL predicted distance error increases with increasing known distance to the ARU. Left is the plot of the full model (n = 175); the right plot is the reduced “80% model” (n = 140). Points represent plotted raw data. Gray bands indicate confidence bands

The estimated number of responding animals showed a significant negative relationship with distance for the full dataset (− 0.004; p < 0.001) as well as for the best considered model after AICc model selection (− 0.012; p < 0.001).

Discussion

Wave files recorded by AudioMoth devices in our study allowed for visual detection of golden jackal group howls in spectrograms at distances of over 800 m, confirming our first hypothesis, that howls can’t be detected at over 1 km distance. Our best model reliably predicts the distance to the recording AudioMoth up to 400 m. Estimated number of responding animals showed a strong negative relationship with the distance in both considered models, indicating that abundance of golden jackals (at higher distances) might therefore be underestimated in a “real-world” analysis, where the group size of the golden jackal pack is not previously known. Estimation of group size and group composition can be done differently using other passive monitoring applications, like DNA analyses through scats (Hatlauf et al. 2021). Nevertheless, we found that our models reliably estimate distances of howling golden jackal packs. Though occasionally the distance estimations given by our models showed a high MAE, they are likely still more accurate than those given by human researchers (Scott et al. 1981; Nadeau and Conway 2012). Future research should emphasize more on this assumption and include direct comparisons between estimations of researchers in the field and known distances and group compositions. However, from personal experience in fieldwork, the distance estimation by researchers is prone to high diversity and often leads to discussions with insecurities and usually with lack of any real confirmations or proof. Learning from our study, it could be an option to implement the (easy portable) recording devices into active bioacoustic monitoring to remove human subjectivity in distance estimation (Yip et al. 2019).

To further improve RSL-based study designs for golden jackal research, it is suggested to study a greater number of howls with known distance to calculate the SPL of a single golden jackal howl and to capture possible plasticity of SPLs. However, Suter et al. (2017) showed the mean SPL of a howling wolf to be ~ 107.8 dB. Our calculated SPL of a golden jackal howl being 101.8 dB is within good biological reason. Furthermore, we did not include possible covariates in this study, such as topographical relief, wind situation, vegetation and habitat type or time of day, what could possibly have influenced the model quality (Halfwerk et al. 2011; Maciej et al. 2011; Morrill et al. 2013). Thus, we recommend the training of location specific models, as distance prediction quality might be influenced by surrounding habitats or the time of day. Furthermore, different weather conditions (e.g., strong winds; heavy rain) or insects near the ARU might further alter the RSL. To account for these potentially negative effects, quality thresholds, such as the signal to noise ratio (SNR), can be integrated.

Installing multiple AudioMoths in an area can be an effective tool for bioacoustic monitoring. They are able to capture howling responses that are inaudible for the researchers during active bioacoustic monitoring. Home range size of golden jackals is highly variable and dependent on different factors like prey availability or hiding options. In an area of 2–10 km2 (Giannatos et al. 2005; Rotem et al. 2011; Lanszki et al. 2019), a considerable amount of AudioMoth devices would be needed to successfully predict distances of golden jackals using RSL. AudioMoth devices are built with monodirectional microphones and given a 400 m optimal prediction distance between each device, two devices (if not four, to reduce potential dB loss on the sides of the devices) per recording station would be needed, to account for the significant underperformance from the “back” of the device. This is reflected in our significant differences between the variables “front” and “back” leading to exclusion of the latter. There are considerable pros and cons for the application of AudioMoths—further studies should test the influence of the signal reaching the AudioMoths laterally to examine the optimal number of devices per recording station. However, installing multiple AudioMoths would again increase the costs and the time spent on analyzing the spectrograms—but could improve overall data quality. Using other autonomous recording units (with a more sensitive microphone) might further increase the recording quality (Suter et al. 2017; Yip et al. 2019) but may also considerably increase the costs per recording station. The long and complex vocalizations of golden jackals (29.9 s ± 3.7 s) (Comazzi et al. 2016) on the other hand could impede implementation of recognizer software or codes. Successful implementation of these would further improve the overall accuracy of distance estimation and again reduce working hours (Yip et al. 2019). Installing the AudioMoths in a defined grid would furthermore allow the acoustic triangulation of howling packs, by using the known GPS-positions of the AudioMoths as reference points for triangulation (Kershenbaum et al. 2019; O’Gara et al. 2020).

Our study highlights the possible applications, strengths, and weaknesses of AudioMoths specifically in bioacoustic golden jackal monitoring. We conclude that the AudioMoth recording devices are a helpful addition to improve data quality. Especially when research teams are small or they work in large study areas, deploying multiple AudioMoths might increase overall data quantity. Besides the help within passive acoustic monitoring, an AudioMoth device can even be a simple inclusion during active bioacoustic monitoring for a later confirmation of the estimated distance and removing estimation bias.