1 Introduction

The quick acquisition of accurate estimates of the source of a Chemical, Biological, Radiological and Nuclear (CBRN) release is vital for minimising the impact of the resulting hazard and allowing first responders to manage the situation quickly. Doing so with manual sensor probes places human operators at high risk of life-threatening situations, especially as the state of such environments can be highly uncertain. To mitigate this risk, the use of mobile robotic sensors has seen increasing interest as they can be deployed quickly in areas that are inaccessible to humans (Murphy et al., 2012).

Whilst manually driven CBRN robots have already seen use in the field, for example in settings such as nuclear plant decommissioning (Tsitsimpelis et al., 2019), these vehicles must be operated by trained users. If a trained operator is not available or close by at the onset of a disaster event, the manual nature of the system adds further delay to a process that is heavily time critical. The obvious next step is to automate the task of data collection, allowing robotic agents to be deployed autonomously.

When considering an autonomous system for a source search task, there are several basic functionalities that the agent needs to possess. Firstly, it should be able to estimate source term parameters as it collects data, so that the system can update its belief about the source and use this information to dictate its next course of action, e.g. collecting data for recursive inference. Secondly, the agent should be able to plan valid trajectories to data sampling locations that help achieve the task of finding the source. This process in feature rich environments is denoted as informative path planning (IPP).

Many works, including the pioneering work by Ristic et al. (2016), have focused on the first task of source term estimation and only briefly incorporate some form of IPP, thus having little appreciation for feature rich environments (which cannot be overlooked for real-world CBRN incidents). Most of the existing solutions also use a myopic path planner wherein the utility of only neighbouring locations is considered (discussed in Sects. 2.2 & 2.3), which may limit the searching efficacy. Therefore, this paper seeks to develop an improved IPP solution to the source term estimation problem that is capable of navigating complex environments whilst efficiently carrying out its task of localising an unknown source.

It should be noted that although path planning in complex environments has generated solutions that consider long term trajectories and the overall goal of the agent in a wide array of scenarios, it is not straightforward to bring them into the source term estimation framework. Algorithms that use the rapidly-exploring random tree (RRT) structure are popular in the literature due to their flexibility with respect to dynamic constraints, with their spanning trees capable of expanding into any free space. However, directly applying this free spanning feature may not produce good results for source search in an urban environment. In this case, it is intuitive that sampling downwind of buildings is less likely to yield predictable concentration data than sampling in open, wind-exposed areas, due to the influence of buildings on the plume structure (see Sect. 7 for CFD examples of such a plume structure). Therefore, more samples should be collected in preferential areas compared to obscured areas, so that simple analytic dispersion models can be used to interpret the measurements. This principle may be violated by the one-sample-per-batch approach of RRT, as branches are grown heuristically towards a sampled state and can therefore be grown into undesirable regions, regardless of the desired sampling frequency of said region. FMT\(^*\) (Janson et al., 2015) and batch informed trees (BIT\(^*\)) (Gammell et al., 2015) are two more recent sampling based approaches that sample the whole environment in a single batch to create a random geometric graph (RGG) in which spanning trees are expanded. Such multi-sample batch methods allow a custom sampling distribution to be enforced so that undesirable regions are sampled with a lower frequency, and therefore a variant of this technique will be explored in the proposed informed tree approach. By employing a sampling based path planner, the system frees itself of deterministic path planning choices and performs adaptively across varying environment scales and complexity.

Another requirement to be considered is the efficiency of the search, which requires a good balance between exploration and exploitation. In a conventional path planner, the goal location is specified explicitly, so that it can be directly exploited; for example, in (Gammell et al., 2015) the use of a goal set is postulated alongside the single goal state. When combined with a Bayesian inference framework to estimate the source location (i.e. the navigation goal), there is no single goal location (or goal set); instead, the goal is described by a probability density function (PDF). To this end, inspired by the dual control principle introduced in (Chen et al., 2021), the goal state used by the proposed informed tree search algorithm is modified so that the tree can be spanned iteratively towards the source while accounting for the uncertainty of the source term estimation.

By addressing the above technical challenges, this paper develops a more powerful and more applicable IPP framework for searching for an unknown CBRN source in an urban environment. This paper is organised as follows. In Sect. 2, relevant works are reviewed to justify the novelty of this work. Section 3 formulates the problem to be considered, followed by technical solutions in Sects. 4–6. The proposed algorithm is tested and verified in Sect. 7 using a high fidelity dataset and the conclusions are provided in Sect. 8.

2 State of the art

CBRN related robotics has seen a swell of research interest in recent years, as summarised in (Monroy and Gonzalez-Jimenez, 2019), due to the ever increasing computational capabilities of small onboard chipsets and chemical sensors that can be easily fitted to mobile platforms, including small UAVs. Coupled with their mobility, which allows large amounts of data to be collected at any location, these systems are highly beneficial compared to the traditional approach of sparse static sensors running alongside complex CFD models that can take several days to resolve. To enable these mobile systems for source localisation tasks, both online estimation and motion planning functions need to be developed.

2.1 Estimation

Probabilistic estimation algorithms can be split into two categories, i.e., those using parametric models and those using non-parametric models. In parametric estimation algorithms, the PDF of the underlying parameters of the atmospheric transport and diffusion (ATD) model is established. Examples of light-weight ATD models that have been applied to mobile robots include the Gaussian plume (GP) model (Wang et al., 2018) and the isotropic plume (IP) model (Vergassola et al., 2007). These models describe the expected concentration at a given location under defined source terms and environmental conditions. These simple models have drawbacks in that they make strong assumptions about the source (such as a single source, constant release and uniform wind fields); however, their computational efficiency lends itself to inclusion in probabilistic frameworks.

Non-parametric models for gas localisation include Gaussian processes (Hutchinson et al., 2019a), Kernel DM+V (Lilienthal et al., 2009) and Gaussian Markov random fields (Monroy et al., 2016). The number of parameters in these models is not fixed and therefore fewer assumptions are made about the gas distributions. To account for the transportation of particles, GMRF and Kernel DM have further extensions that account for wind direction, namely GW-GMRF (Gongora et al., 2020) and Kernel DM+V/W (Asadi et al., 2017). This leads to the ability to account for multi-modal distributions and the inclusion of obstacles in the environment. A major drawback of these methods is that they are poor at estimating outside of sampled locations and require a large and varied set of data to accurately estimate the distribution. For source term estimation, this means that if the area near the source location itself cannot be sampled (as may be the case in an urban environment), then non-parametric models tend to be unable to accurately estimate the source. Furthermore, these models tend to be computationally expensive to calculate iteratively. Note that the framework proposed in this paper can use both model types (e.g. (Rhodes et al., 2020)), therefore leaving flexibility in the system. However, a parametric model is used in this work, not only due to its computational efficiency and widespread adoption in the literature, but also because one of the motivations of this work is to show that such simple ATD models are adequate to inform robotic source localisation in urban environments, given that features can be accommodated in the path planning algorithm.

2.2 Source search in simple environments

Motion planning of mobile robots plays a key role in many IPP frameworks for environment monitoring. However, in the literature, motion planning for source search is generally limited to goal selection, i.e., which place should be sampled next. To solve the goal selection problem inherent in an autonomous system, three classes of algorithms are employed: coverage based, bio-inspired and information theoretic. Coverage based planners (e.g. (Hombal et al., 2010; Galceran and Carreras, 2013)) rely on predetermined trajectories to maximise coverage of the search area in a systematic manner. These algorithms are efficient and easy to implement but are decoupled from the estimation side of the system; therefore, they can be ineffective and difficult to scale. Bio-inspired methods such as Anemotaxis (Harvey et al., 2008) and Chemotaxis (Dhariwal and Sukhatme, 2004; Russell et al., 2003) use instantaneous concentration and anemometry measurements to guide robots based on the local concentration gradients. These methods are computationally lightweight and are often employed in swarm robotics (Marjovi and Marques, 2014; Jatmiko et al., 2007) where resources are limited. However, they are heavily reliant on the presence of data and do not perform well in large scale scenarios or with sparse measurements, since they only consider immediate reward in their locality.

The third class, and the method leveraged in the proposed system, is the information theoretic approach. Information theoretic approaches exploit the belief of the system state and try to take actions that reduce the uncertainty of a given estimate. Given this property, information theoretic approaches are inherently coupled to the estimation process and require some form of metric to quantify uncertainty. Within source term estimation, both Infotaxis (Vergassola et al., 2007) and Entrotaxis (Hutchinson et al., 2018) have been successfully employed for sparse search tasks. Infotaxis is concerned with reducing entropy based on the expectation of the posterior distribution, whereas Entrotaxis considers the entropy reduction based on the predictive measurement distribution. In a comparison between bio-inspired and information theoretic searches (Voges et al., 2014), the information theoretic solution was found to be more effective in problems that exhibit sparse data, and thus performs well in real-world experiments (Hutchinson et al., 2019a, 2020). Such sparse measurement conditions are characteristic of urban environments, since complex geometry can obscure the plume from much of the search domain.

It is noted that the vast majority of research on the motion planning aspect of source search considers an open environment, and this gap between open and urban scenarios is the key motivation for the research presented.

2.3 Source search in complex environments

Path planning in complex environments for source search has been studied, but many works consider heavily constrained environments and are therefore not optimised for the challenges of real urban scenarios. In (Marjovi and Marques, 2011), multi robot mapping and source localisation is performed using an anemotaxis approach wherein simultaneous localisation and mapping (SLAM) is performed until a threshold concentration is found, upon which the robot switches to an anemotaxis search. As with all gradient-based approaches, this system requires an increasing number of agents to cope with large scale situations. In (Zou et al., 2014), a particle swarm optimisation (PSO) method is proposed that accounts for obstacles that each sensing agent may encounter by proposing new directions that do not intersect with the obstacles. This simplistic approach to obstacle avoidance has clear success with swarm implementations; however, to enable a single agent to efficiently cover a large area within a time budget, we argue more advanced methods are needed that plan adaptively over the longer term as opposed to maximising within a deterministic set of neighbouring points.

With the informed tree search algorithm, we seek to address the scaling problem through adaptive sampling of the environment, so that the system is broadly independent of scale. This also means that a multi agent approach is not required to attain positive results. However, it is appreciated that multi agent approaches are generally more time efficient than their single agent counterparts (at the expense of increased resources).

In model-based search techniques, (Khodayi-Mehr et al., 2019) propose a solution that uses an ATD model solved via partial differential equations to identify a source. This work uses the Fisher information matrix of the source parameters to select a sequence of future waypoints and is shown to be capable of operating in small non-convex domains, but it is not proven for larger urban scenarios.

In (Zhao et al., 2020b), the Entrotaxis-Jump algorithm is proposed for source search in a large-scale road network. Entrotaxis-Jump combines Entrotaxis with an intermittent search strategy that allows a myopic agent to traverse around obstacles if the utility of sampling in the direction of the obstacle is high. Whilst this method successfully increases the performance of Entrotaxis in urban environments, it assumes that a simple Gaussian-like dispersion model adequately reflects the dispersion characteristics of a source release in a dense urban environment. Moreover, trajectory generation is not addressed in the search process, and therefore for more complex geometries (i.e. not a road network), it is unknown whether this method will be able to successfully navigate towards a source. Nevertheless, based on the findings of Zhao et al. (2020b), it is clear that intelligent goal selection methods are beneficial in urban scenarios compared to those in classical source search motion planning.

Further to this work, Zhao et al. also propose a searching method based on Entrotaxis for escaping forbidden zones in a source search scenario (Zhao et al., 2020a). This paper proposes a planning technique, based on the degrees of free travel around a location, to avoid the sampling agent becoming trapped in its locality. The technique is shown to be effective in a block discretised scenario, but it is not shown how it could be applied to more complicated geometric scenarios (such as the urban case). In contrast, the BIT\(^*\) path planning technique can also be used to plan escape routes around obstacles and can do so around non-block structures.

The information theoretic approach has also been seen in other applications involving complex environments. For example, Schmid et al. (2020) developed a sampling based path planner using RRT to evaluate the utility of sampling the environment at a specified location along a trajectory. This method allows a sampling agent to determine efficient sampling locations whilst navigating in complex environments, inherently coupling these two aspects. However, the aim of that work is to explore the unknown environment, rather than to locate the release source.

Recently, An et al. presented an urban source search algorithm, namely receding-horizon RRT-Infotaxis (An et al., 2022). This work leverages the standard RRT path planning technique along with the Infotaxis method of determining future sample location utility (similar in framework to our BIT\(^*\) and Entrotaxis technique). In addition, this method also attempts to predict the sampling utility along a multi-step trajectory in a receding-horizon fashion by summing potentials. Whilst predicting utility multiple steps into the future may offer more efficient movement choices, the formulation of the utility calculation demands a fixed step size between consecutive samples (to cancel out traversal cost considerations), which is incompatible with our chosen batch sampling planner. We opt instead for more robust obstacle avoidance and the capability to incorporate custom sampling distributions.

By employing more advanced path planning techniques that have been adopted in other fields and customising these methods to suit the challenges of the source search problem, we aim to bring a new level of operational efficiency and robustness to the field of CBRN related robotics.

2.4 Contributions

Based on the literature review, we present the contributions of this paper that address the issues raised above.

Our first contribution is to show that simple atmospheric dispersion models (such as the IP model) can be used to guide a robot to localise a release source in a complicated urban environment. To aid in this task, we propose a novel sampling distribution that identifies regions in the wake of buildings where a large model discrepancy between the modelled plume and the actual flow would be seen. By sampling in these regions with a lower frequency, we can collect samples that will more reliably inform the inference engine of the true source. This also shows how preferential sampling can help make the search process more efficient and leaves the door open for future work investigating other informed distributions under the same planning framework.

The second major contribution of this paper is the proposed informed tree search algorithm. Based on the BIT\(^*\) concept, the novel tree search method creates branches that extend toward the expected source location (increasing convergence speed) whilst navigating through areas in the wake of buildings with a lower frequency. The informed tree is then either pruned or blossomed to meet the computational requirements of the information utility function, providing obstacle free informative trajectories for the autonomous agent to follow. This second contribution moves away from previous works that rely on sampling at deterministic future locations (such as \(\uparrow , \rightarrow , \downarrow , \leftarrow \)) and introduces an adaptive sampling framework that balances the trade-off between exploration and exploitation in desirable regions.

3 Problem statement

The source search problem is formulated under the IPP framework in this section.

Let \({\mathbf {X}} \subset {\mathbb {R}}^2\) be the state space of the search and planning problem and \({\mathbf {X}}_{\mathrm {obs}} \subset {\mathbf {X}}\) be the states in collision with obstacles. Thus, the set of admissible states can be expressed as \({\mathbf {X}}_{\mathrm {free}} := {\mathbf {X}} \setminus {\mathbf {X}}_{\mathrm {obs}}\). Let \({\mathbf {s}} \in {\mathbf {X}} \) be the source location, \({\mathbf {x}}_{k} \in {\mathbf {X}}_{\mathrm {free}}\) be the robot position at sampling instant k and \({\mathbf {X}}_{\mathrm {goal}} \subset {\mathbf {X}}_{\mathrm {free}}\) be the goal region. A collision-free path is a continuous mapping \(\sigma : {\mathbb {R}} \xrightarrow {} {\mathbf {X}}_{\mathrm {free}}\). Specifically, we define \(\sigma _{i}^{j}(s)\), \( s \in [0, \,1]\), as a path from \(\sigma _{i}^{j}(0) = {\mathbf {x}}_{i}\) to \(\sigma _{i}^{j}(1) = {\mathbf {x}}_{j}\). The traversal length of the path is denoted as \(c(\sigma _{i}^{j})\).

The problem considered in this work is to guide the robot to explore the free space to find the source location \({\mathbf {s}}\) and eventually navigate to the goal region containing the source location, such that \({\mathbf {x}}_{k} \in {\mathbf {X}}_{\mathrm {goal}}({\mathbf {s}})\). However, directly finding an optimal, or even feasible, path from the initial position \({\mathbf {x}}_{init}\) to \({\mathbf {s}}\) is not possible, since the source location is unknown to the robot. A recursive IPP framework is therefore structured to address this problem.

Problem 1

(source term estimation) At each sampling time k, the robot takes a measurement of the local chemical concentration \(z_{k}({\mathbf {x}}_{k})\), which in conjunction with historical readings \({\mathcal {Z}}_{k} = \{ z_{k}({\mathbf {x}}_{k}), {\mathcal {Z}}_{k-1} \}\), can be used to estimate the source term \(\Theta \), in the form of its posterior distribution, i.e., \(p(\Theta |{\mathcal {Z}}_k)\).

The term \(\Theta \) normally consists of the source location \({\mathbf {s}}\), release rate Q and other relevant parameters that can be used to characterise an airborne release. In this study we utilise the IP model, the parametrisation of which is shown in Eq. (1) (see (Hutchinson et al., 2019a) for more details).

$$\begin{aligned} \Theta _k=\big [{\mathbf {s}}^{T} \; Q \; u \; \phi \; d \; \tau \big ]^{T} \end{aligned}$$
(1)

where Q is the release rate of the source (g/s), u is the wind field speed (m/s) with direction \(\phi \) (deg), d is the diffusivity of the hazard in air (m\(^2\)/s) and \(\tau \) is the average lifetime of the emitted particle (s). Using this model, for a given \(\Theta _k\), the expected concentration that a sensor will record at position \({\mathbf {x}}_k\) is calculated using:

$$\begin{aligned} C({\mathbf {x}}_k|\Theta _k)= & {} \frac{Q}{4\pi d\Vert {\mathbf {x}}_k-{\mathbf {s}}\Vert _2}\exp \bigg [\frac{-\Vert {\mathbf {x}}_k-{\mathbf {s}}\Vert _2}{\lambda }\bigg ] \nonumber \\&\times \exp \bigg [\frac{-(x_k-x_s) u\cos \phi }{2d}\bigg ]\nonumber \\&\times \exp \bigg [\frac{-(y_k-y_s) u\sin \phi }{2d}\bigg ] \end{aligned}$$
(2)

where, \(\lambda =\sqrt{\frac{d\tau }{1+(u^2\tau )/(4d)}}\). Note that to facilitate the discussion we use \(\theta _s^{(k)}\) to denote the estimated source location at sampling instant k.
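To make the model concrete, the following short sketch evaluates Eqs. (1)–(2) for a candidate source term. It is an illustrative implementation only: the function name, the dictionary-based parametrisation and the example values are assumptions made for exposition, not the authors' code.

```python
import numpy as np

def ip_concentration(x, theta):
    """Expected concentration C(x | Theta) of the isotropic plume (IP) model, Eq. (2).

    x     : sensor position [x, y] in metres
    theta : dict of source term parameters as in Eq. (1):
            s (source location, m), Q (g/s), u (m/s), phi (rad), d (m^2/s), tau (s)
    """
    s = np.asarray(theta["s"], dtype=float)
    Q, u, phi, d, tau = theta["Q"], theta["u"], theta["phi"], theta["d"], theta["tau"]

    r = np.linalg.norm(np.asarray(x, dtype=float) - s)        # ||x_k - s||_2
    lam = np.sqrt(d * tau / (1.0 + (u ** 2 * tau) / (4.0 * d)))

    c = Q / (4.0 * np.pi * d * r)                              # 1/r spreading term
    c *= np.exp(-r / lam)                                      # finite particle lifetime
    c *= np.exp(-(x[0] - s[0]) * u * np.cos(phi) / (2.0 * d))  # advection along x
    c *= np.exp(-(x[1] - s[1]) * u * np.sin(phi) / (2.0 * d))  # advection along y
    return c

# Example call with illustrative parameter values (not taken from the paper).
theta = dict(s=[0.0, 0.0], Q=5.0, u=2.0, phi=np.pi, d=1.0, tau=100.0)
print(ip_concentration([30.0, 0.0], theta))
```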

Problem 2

(IPP) Let \(\sigma _{k}^{k+1}\) be a collision-free path that can be executed by the robot, starting from the robot's current location \({\mathbf {x}}_{k}\) to an end location \({\mathbf {x}}_{k+1}\). Let \(\Sigma \) be the set of such non-trivial paths to be constructed. The IPP problem is then formally defined as the search for a path, \(\sigma ^{*} \in \Sigma \), that minimises a utility function \(\Psi (\cdot )\), such that

$$\begin{aligned} \sigma ^{*} := \arg \min _{\sigma \in \Sigma } \{ \Psi (\sigma ) | \sigma (0) = {\mathbf {x}}_{k}, \, c(\sigma ) \le {\bar{c}} \} \end{aligned}$$
(3)

where \({\bar{c}}\) is the upper bound of the path length.

Note that the objective of the path planning problem is to find the most informative sampling location at the end of the path \(\sigma ^{*}\). This is because the chemical sensing robot normally takes point measurements to accommodate the response time of the chemical sensor.

The proposed framework recursively solves Problems 1 and 2 such that the robot can be guided to the source region \({\mathbf {X}}_{\mathrm {goal}}({\mathbf {s}})\). At each sampling instant k, the robot takes the sensor reading \(z_{k}\) to update the source term estimate \(p(\Theta |{\mathcal {Z}}_k)\). Such a posterior distribution can be used to inform the design of the set \(\Sigma \), so that Problem 2 can be constructed and subsequently solved to generate the next sampling location at the end of \(\sigma ^{*}_{k}\).

In this work, Problem 1 is solved using an established particle filter developed in (Hutchinson et al., 2019a), so its implementation detail is skipped for the sake of brevity. The key challenge that remains open is how to efficiently construct the set of candidate paths \(\Sigma \) in Problem 2, which should (1) reduce the chance of taking samples downwind of buildings, (2) guarantee collision-free control actions and (3) strike a good balance between exploitation and exploration. To this end, an informed tree search algorithm is developed and integrated into the proposed framework as outlined in Fig. 1. Each part of the system, running from top to bottom in Fig. 1, is explained in further detail in the order in which it is performed during one iteration of the planning loop.
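The overall loop of Fig. 1 can be summarised in a short control-loop sketch. All function names below (`take_measurement`, `update_posterior`, `build_informed_tree`, `entrotaxis_utility`, `execute_path`) are placeholders for the components described in Sects. 4–6 and are not an actual API.

```python
# High-level sketch of the recursive STE/IPP loop of Fig. 1 (names are placeholders).
def source_search_loop(x0, prior, time_budget,
                       take_measurement, update_posterior,
                       build_informed_tree, entrotaxis_utility, execute_path):
    x_k, posterior, elapsed = x0, prior, 0.0
    while elapsed < time_budget:
        # Problem 1: take a point measurement and update p(Theta | Z_k).
        z_k = take_measurement(x_k)
        posterior = update_posterior(posterior, z_k, x_k)

        # Problem 2: build the candidate path set Sigma with the informed tree
        # search, then pick the path minimising the utility Psi as in Eq. (3).
        sigma_set = build_informed_tree(x_k, posterior)
        sigma_star = min(sigma_set, key=lambda s: entrotaxis_utility(s, posterior))

        # Execute the chosen path; its end point is the next sampling location.
        x_k, dt = execute_path(sigma_star)
        elapsed += dt
    return posterior
```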

Fig. 1 Architecture of the proposed source search algorithm

4 Generation of sampling locations

Many source term estimation (STE) algorithms for predicting the source parameter distribution (e.g. Ristic et al. (2016); Hutchinson et al. (2018); Zhao et al. (2020b)), whilst proven in ideal open environments, have drawbacks when used in feature rich environments. Because obstacle interactions with the gas dispersion are not accounted for (and the use of a computationally expensive model that does account for them is unsuitable for mobile robotics), the intelligent choice of sampling locations can be leveraged to mitigate the incongruity between the model and the physical system.

As defined in the problem statement, the set of admissible states that are considered for the search and planning problem is defined as \({\mathbf {X}}_{\mathrm {free}}\). When using a sampling based path planner, traditionally a single sample state \({\mathbf {x}}_m\) is drawn such that \({\mathbf {x}}_m \leftarrow \{{\mathbf {D}}\sim {\mathbf {U}}({\mathbf {X}}_\mathrm {free})\}\), where \({\mathbf {D}}\) represents the uniform distribution of states that exist within free space. We also define the notation \({\mathbf {X}}_m \leftarrow \{{\mathbf {D}}\sim {\mathbf {U}}({\mathbf {X}}_{\mathrm {free}})\}_{1:N}\), where \({\mathbf {X}}_m\) is a set of samples with size N, uniformly drawn from free space.

In general traversability planning, the definition of \({\mathbf {X}}_\mathrm {free}\) is adequate for dictating how sample states should be drawn and therefore a uniform distribution is most often used. However, as stated in (Karaman and Frazzoli, 2011), the sampling framework extends to any distribution with a density bounded away from 0 on \({\mathbf {X}}_\mathrm {free}\). In source term estimation (as discussed in the introduction), the robot should sample in areas where it is most likely to interact with the target source and also in areas where the model is more likely to accurately predict the source. When obstacles are present in the flow field, this preference is not uniform over the free space, since there is a modelling discrepancy between the IP model and the actual flow.

Obstacle interactions with wind fields lead to complex flow dynamics that take significant resources to resolve. However, it is clear that a particle in a laminar wind field will generally move in the wind direction \(\phi \), unless obstructed by an obstacle. Obstacles create isolated areas (wakes) behind them that disrupt the flow and create a disparity between what the model predicts and the real flow. It is in these areas that a robot is less likely to sample the source plume predictably, since the estimation model implemented does not account for obstacle interactions (due to computational constraints). Therefore, a sample distribution \({\mathbf {D}}_\phi \) should be attained that encourages the robot to sample less frequently in these areas that are likely to produce contradictory measurements.

\({\mathbf {D}}_\phi \) is derived (similarly to (Bellingham et al., 2002)) by calculating the divergent effect that an obstacle would have on a particle entering the search space, using Dijkstra's search (detailed below and shown in Fig. 2). The sample generation technique, whilst being significantly quicker to compute than CFD modelling, can be expensive to calculate for large maps and therefore should be performed at a reasonable resolution. Samples can be drawn repeatedly from the same distribution under the conditions that the wind direction \(\phi \) does not change significantly and that the obstacle map is static. The sample generation methodology comprises one of the main new contributions to the field of source search and feeds directly into the second new contribution, the informed tree search. It should be noted that this method is adopted for its ability to generate \({\mathbf {D}}_\phi \) quickly and with little a priori environmental information. Given an infinite budget, similar approaches using more complex modelling could also be implemented, which may improve the search efficiency further. Due to the design choice of implementing the BIT\(^*\) method, any informed distribution for preferential sampling may be exploited.

4.1 Sample distribution algorithm

To generate a probability distribution that reflects the obstacle interactions with the wind field, firstly a set of starting states that represents the inlet of a particulate to the search space \({\mathbf {X}}\) is defined as \({\mathbf {X}}_{\mathrm {inlet}}\) (red line in Fig. 2). These inlet states are akin to the inlet condition of a CFD model, and thus their location is dependent on the wind field direction \(\phi \), which can be initialised with the prior \({\mathbb {E}}(\theta _\phi )\) from the Bayesian inference. Dijkstra's search is then performed on the discrete state obstacle map (e.g. an occupancy grid) using \({\mathbf {X}}_{\mathrm {inlet}}\) as the starting conditions. The vertices of the Dijkstra network \({\mathbf {V}}_{\mathrm {obs}}\) are defined as all \({\mathbf {x}} \in {\mathbf {X}}_{\mathrm {free}}\) and edges of traversal \({\mathbf {E}}_{\mathrm {obs}}\) lie between obstacle free adjacent vertices. This generates the average cost \({\mathbf {C}}_{\mathrm {obs}}\) of getting from all \({\mathbf {x}} \in {\mathbf {X}}_{\mathrm {inlet}}\) to all \({\mathbf {x}} \in {\mathbf {X}}_{\mathrm {free}}\). A second cost map, \({\mathbf {C}}_{\mathrm {open}}\), is also calculated using Dijkstra's search on the obstacle free map using the same \({\mathbf {X}}_{\mathrm {inlet}}\) condition. The vertices of the second network \({\mathbf {V}}_{\mathrm {open}}\) are defined as all \({\mathbf {x}} \in {\mathbf {X}}\) and edges of traversal \({\mathbf {E}}_{\mathrm {open}}\) lie between any adjacent vertices. This second cost map represents how an inlet particle would traverse the domain uninterrupted by the obstacles. \({\mathbf {C}}_{\mathrm {open}}\) is then subtracted from \({\mathbf {C}}_{\mathrm {obs}}\), leaving a final cost map \({\mathbf {C}}_\phi \) that represents how the obstacles have negatively interrupted the wind field. \({\mathbf {C}}_\phi \) can then be used to give the probability distribution \({\mathbf {D}}_\phi \), by adding the minimum value of \({\mathbf {C}}_\phi \). \({\mathbf {D}}_\phi \) is now bounded away from 0 on \({\mathbf {X}}_{\mathrm {free}}\) and can be used to draw samples from which to grow the informed tree. This process is summarised in Algorithm 1.
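A minimal sketch of this construction on an occupancy grid is given below. The two Dijkstra passes follow the description above; the final inversion and normalisation step (so that wake regions receive low but non-zero weight) is one plausible reading of the construction and should be treated as an assumption, as should all function names.

```python
import heapq
import numpy as np

def dijkstra_cost(grid_free, sources):
    """Shortest-path cost from a set of source cells to every cell of a 2-D grid
    (4-connected, unit edge cost). Unreachable cells keep a cost of inf."""
    cost = np.full(grid_free.shape, np.inf)
    heap = []
    for rc in sources:
        cost[rc] = 0.0
        heapq.heappush(heap, (0.0, rc))
    while heap:
        c, (r, col) = heapq.heappop(heap)
        if c > cost[r, col]:
            continue
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, col + dc
            if (0 <= nr < grid_free.shape[0] and 0 <= nc < grid_free.shape[1]
                    and grid_free[nr, nc] and c + 1.0 < cost[nr, nc]):
                cost[nr, nc] = c + 1.0
                heapq.heappush(heap, (c + 1.0, (nr, nc)))
    return cost

def sample_distribution(occupancy, inlet_cells, eps=1e-3):
    """Sketch of Algorithm 1: weight free cells by how little the obstacles divert
    the flow arriving from the wind-dependent inlet cells X_inlet."""
    free = ~occupancy                                           # X_free
    open_map = np.ones_like(free)                               # obstacle-free domain X
    # Average cost over all inlet states, with and without obstacles.
    c_obs = np.mean([dijkstra_cost(free, [c]) for c in inlet_cells], axis=0)
    c_open = np.mean([dijkstra_cost(open_map, [c]) for c in inlet_cells], axis=0)
    c_phi = c_obs - c_open                                      # flow disruption cost
    finite = free & np.isfinite(c_phi)
    # Assumed final step: invert and shift so wake cells get low, non-zero weight.
    w = np.where(finite, c_phi[finite].max() - c_phi + eps, 0.0)
    return w / w.sum()                                          # discrete D_phi over the grid

# Example: a 30x40 grid with one rectangular building and an inlet along the left edge.
occ = np.zeros((30, 40), dtype=bool)
occ[10:20, 15:20] = True
d_phi = sample_distribution(occ, inlet_cells=[(r, 0) for r in range(30)])
```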

Fig. 2 Sample distribution of the likely trajectory of an inlet gas particle, attained from Dijkstra's search on an occupancy grid. Yellow indicates a high likelihood whilst blue indicates a low likelihood. Red arrow shows the wind direction (Color figure online)

Algorithm 1

5 IPP Informed tree search

Fig. 3 Informed tree search with a starting location \({\mathbf {x}}_k=[5,30]\) (yellow circle) and goal location \({\mathbf {x}}_{goal}=[70,20]\) (cyan circle). a The initial batch of sample states \({\mathbf {x}} \in {\mathbf {G}}\) (blue dots) drawn from the discrete probability distribution \({\mathbf {D}}_\phi \) where \(N=100\) and \(\theta _\phi \) is in the negative x-direction. b Tree expansion procedure until the goal is found, tree vertices \({\mathbf {v}} \in {\mathbf {V}}\) shown with red circles, tree edges \(\mathbf {(v,w)} \in {\mathbf {E}}\) shown with blue lines and the shortest path (of length \({\mathbf {c}}_{best}\)) to \({\mathbf {x}}_{goal}\) shown by yellow edges and circles. c Tree pruning with remaining vertices with green circles and the ellipse that represents the pruning criterion shown with a blue dashed line. d Downsampling of the pruned tree to contain the first \(|\Sigma |=15\) vertices in the tree (Color figure online)

Fig. 4 Informed tree search with a starting location \({\mathbf {x}}_k=[50,15]\) (yellow circle) and goal location \({\mathbf {x}}_{goal}=[70,20]\) (cyan circle). a The initial batch of sample states \({\mathbf {x}} \in {\mathbf {G}}\) (blue dots) drawn from the discrete probability distribution \({\mathbf {D}}_\phi \) where \(N=100\) and \(\theta _\phi \) is in the negative x-direction. b Tree expansion procedure until the goal is found, tree vertices \({\mathbf {v}} \in {\mathbf {V}}\) shown with red circles, tree edges \(\mathbf {(v,w)} \in {\mathbf {E}}\) shown with blue lines and the shortest path (of length \({\mathbf {c}}_{best}\)) to \({\mathbf {x}}_{goal}\) shown by yellow edges and circles. c Tree pruning with remaining vertices with green circles and the ellipse that represents the pruning criterion shown with a blue dashed line. d Addition of further samples within the expanded ellipse (dark blue solid line) given a source position uncertainty \(\sigma _{\theta _s}=5\,\mathrm{m}\). New samples added are shown with magenta circles (Color figure online)

The second main contribution of the paper is the informed tree procedure, which acquires a set of obstacle free trajectories that help the robot achieve its goal of localising an unknown source within a feature rich search space. The informed tree search procedure can be separated into two distinct parts: tree expansion and path selection. Tree expansion is concerned with finding admissible paths to the goal location whereas path selection decides the set of paths to be evaluated by the utility function. Both parts are described below and further detail is given in Sect. 5.2.

To establish possible gas sampling locations and an initial set of admissible paths, an RGG is defined which contains a set of sampled states, \({\mathbf {G}} \subset {\mathbf {X}}_{\mathrm {free}}\) (Figs. 3a, 4a). A batch of N samples is drawn from weighted free space, with the weighting derived from the likely gas particle path distribution \({\mathbf {D}}_\phi \) described in Sect. 4. The parameter N is chosen such that the informed tree search can fully explore the area \({\mathbf {X}}_{\mathrm {free}}\). The base of the tree is set to \({\mathbf {x}}_k\), and the goal set is defined as the \(k_n\)-nearest \({\mathbf {x}} \in {\mathbf {G}}\) to the best estimate of the source location (other definitions of goal sets are also applicable). For the rest of the paper, \({\mathbb {E}}({\mathbf {s}})\) is used as the best estimate.

The tree is then heuristically constructed towards the goal set, in a similar manner to (Gammell et al., 2015), until a valid path is found (Figs. 3b, 4b). However, unlike (Gammell et al., 2015), after one batch search has been completed, another batch is not initiated. This is because we are not trying to acquire the shortest trajectory towards the estimated source location. Instead, we need only a set of admissible and exploratory paths to the source that can guide the search in a way that will converge on the source location.

Once a path to the goal set is found, the tree is pruned (Figs. 3c, 4c), and then must be either upsampled or downsampled in order to match the computational requirements of the IPP. This is driven by the desired number of paths \(|\Sigma |\) to be calculated in (3). If the tree contains at least \(|\Sigma |\) paths, then the pruned tree is downsampled, as shown in Fig. 3d, to the first \(|\Sigma |\) paths stemming from \({\mathbf {x}}_k\). If more samples are required, then the tree is expanded by taking further samples (again from weighted free space) that extend the tree, as shown in Fig. 4d. This expansion is bounded within an ellipse that accounts for the uncertainty of the source estimate in \(\theta _s\) (derived in Sect. 5.2.3). Once the path set \(\Sigma \) has been established, it is passed for utility calculation. This process is then repeated each time a concentration measurement updates the particle filter, as \({\mathbb {E}}({\mathbf {s}})\), and therefore the goal set, will change after each update (since \({\mathbb {E}}({\mathbf {s}})\) is given as a weighted expectation). The previously sampled set of nodes in the network, \({\mathbf {G}}\), can be recycled for efficiency or resampled for diversity. The detailed derivation of the informed tree search is provided in this section.

5.1 Notation

To facilitate the derivation of the informed tree search algorithm, some notation is defined in this sub-section. The function \(\mathbf {{\widehat{g}}(x)}\) is the estimated cost to come from the start (i.e., \({\mathbf {x}}_{k}\)) to a state \({\mathbf {x}} \in {\mathbf {G}}\), and \(\mathbf {{\widehat{h}}(x)}\) is the estimated cost to go from the state to the goal set \({\mathbf {X}}_{goal}\). These functions can be evaluated using the Euclidean distance between two states. Then, \(\mathbf {{\widehat{f}}(x)}\) is the estimated cost from the start \({\mathbf {x}}_k\) to the goal set, given that the path passes through \({\mathbf {x}}\), i.e. \(\mathbf {{\widehat{f}}(x)}:=\mathbf {{\widehat{g}}(x)}+\mathbf {{\widehat{h}}(x)}\). If the current best solution to the goal set is defined as \({\mathbf {c}}_{best}\), then this function can define a subset of states that could potentially give a better solution to the goal, i.e. \(\mathbf {X_{{\widehat{f}}}} := \big \{ {\mathbf {x}} \in {\mathbf {G}} \big | {\widehat{f}}({\mathbf {x}}) \le {\mathbf {c}}_{best} \big \}\).
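A compact sketch of these Euclidean heuristics is shown below; the container choices and function names are illustrative assumptions.

```python
import numpy as np

# Heuristic cost estimates of Sect. 5.1 (Euclidean, as stated above).
def g_hat(x, x_start):
    """Estimated cost-to-come from the start x_k to state x."""
    return float(np.linalg.norm(np.asarray(x) - np.asarray(x_start)))

def h_hat(x, goal_set):
    """Estimated cost-to-go from state x to the nearest state of the goal set."""
    goals = np.asarray(goal_set)
    return float(np.min(np.linalg.norm(goals - np.asarray(x), axis=1)))

def f_hat(x, x_start, goal_set):
    """Estimated total cost of a path from x_k to the goal set passing through x."""
    return g_hat(x, x_start) + h_hat(x, goal_set)

def informed_subset(samples, x_start, goal_set, c_best):
    """States X_f_hat that could still improve on the current best solution c_best."""
    return [x for x in samples if f_hat(x, x_start, goal_set) <= c_best]
```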

The tree with vertices \({\mathbf {V}} \subseteq {\mathbf {G}}\) and edges \({\mathbf {E}} = {({\mathbf {v}},{\mathbf {w}})}\) for vertices \({\mathbf {v}}\in {\mathbf {V}}\) and \({\mathbf {w}} \in {\mathbf {V}}\), is defined as \(\tau := ({\mathbf {V}},{\mathbf {E}})\). From the current tree, the cost to come to a state \({\mathbf {x}} \in {\mathbf {G}}\) is given by the function \(\mathbf {g}_{\varvec{\tau }}{} \mathbf{(x)}\). Any state that is not in the tree is assumed to have a cost \(\infty \). If the optimal cost to come to a state is defined as \(\mathbf {g(x)}\), then it is seen that, \(\forall {\mathbf {x}} \in {\mathbf {G}}, \mathbf {{\widehat{g}}(x)} \le \mathbf {g(x)} \le \mathbf {g}_{\varvec{\tau }}{} \mathbf{(x)}\).

The cost of an edge between states \({\mathbf {x}},{\mathbf {y}} \in {\mathbf {G}}\) and the estimated cost of said edge are defined as \(\mathbf {c(x,y)}\) and \(\mathbf {{\widehat{c}}(x,y)}\), respectively. Any edge that intersects an obstacle is assumed to have an infinite cost, thus \(\forall {\mathbf {x}},{\mathbf {y}} \in {\mathbf {G}}, \mathbf {{\widehat{c}}(x,y)} \le \mathbf {c(x,y)} \le \infty \). Calculating \({\mathbf {c}}(\cdot )\) is computationally expensive due to the need to account for obstacle collisions and dynamic constraints, and therefore the heuristic estimate \(\mathbf {{\widehat{c}}(\cdot )}\) is used to delay this calculation where possible. In the scenario where there are no dynamic constraints on the system, then for an obstacle free edge \(\mathbf {{\widehat{c}}(\cdot )} = \mathbf {c(\cdot )}\).

The Lebesgue measure of a set is written as \(\lambda (\cdot )\), and the Lebesgue measure of an n-dimensional unit ball is \(\zeta _n\). \(|\cdot |\) refers to the cardinality of a set. \(X\xleftarrow {+}\{x\}\) and \(X\xleftarrow {-}\{x\}\) are shorthand notation for \(X=X\cup \{x\}\) and \(X=X\setminus \{x\}\) respectively.

5.2 Informed tree algorithms

Algorithm 2

Algorithm 2 outlines the tree expansion procedure during a single query event, given the sampled states \({\mathbf {G}}\), the robot's current location \({\mathbf {x}}_k\), and the current PDF of the source location \(\theta _s^{(k)}\).

5.2.1 Initialisation (Alg 2, Lines 1:5)

To initialise, the goal state \({\mathbf {x}}_{goal}\) is defined as \({\mathbb {E}}(\theta _{s}^{(k)})\), from which the goal set \({\mathbf {X}}_{goal}\) is defined as the \(k_n\) nearest \({\mathbf {x}} \in {\mathbf {G}}\). The tree vertex set \({\mathbf {V}}\) is set to \({\mathbf {x}}_k\), the tree edges \({\mathbf {E}}\) and edge queue \({\mathbf {Q}}_e\) are set to empty, and the vertex queue \({\mathbf {Q}}_v\) is set to \({\mathbf {V}}\). The queues exist to track which vertex and edge should be processed for adding to the tree. The radius r can also be defined during initialisation, since only a single batch is being performed. The radius is calculated using the scaling parameter \(\kappa \), the problem dimensionality n and the number of sampled states \(|{\mathbf {G}}|\), as described in (Gammell et al., 2015).
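For reference, one common form of the RGG connection radius used by BIT\(^*\)-style planners is sketched below. The exact expression and constants used in this work are governed by \(\kappa \) in Table 2, so the formula here should be read as an assumption drawn from (Gammell et al., 2015) rather than the authors' precise implementation.

```python
import math

def rgg_radius(q, n, lebesgue_xfhat, kappa):
    """Connection radius r for a batch of q samples in an n-dimensional problem.

    Assumed BIT*-style form: r = 2*kappa*((1 + 1/n) * (lambda(X_f_hat) / zeta_n)
    * (log(q) / q))**(1/n), where zeta_n is the Lebesgue measure of the
    n-dimensional unit ball.
    """
    zeta_n = math.pi ** (n / 2.0) / math.gamma(n / 2.0 + 1.0)  # unit-ball measure
    return 2.0 * kappa * ((1.0 + 1.0 / n) * (lebesgue_xfhat / zeta_n)
                          * (math.log(q) / q)) ** (1.0 / n)

# Example: 100 planar samples over a 500 m^2 informed region with kappa = 1.1.
print(rgg_radius(q=100, n=2, lebesgue_xfhat=500.0, kappa=1.1))
```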

5.2.2 Tree expansion (Alg 2, Lines 6:25)

The tree is expanded until \({\mathbf {Q}}_v\) is empty (i.e. all \({\mathbf {x}} \in {\mathbf {G}}\) have been checked), or one of the states \({\mathbf {x}} \in {\mathbf {X}}_{goal}\) has a valid path in the tree. To determine which node should be selected for expansion, \({\mathbf {v}}_m\), the state \({\mathbf {x}} \in {\mathbf {Q}}_v\) with the lowest estimated cost from the start to the goal, \(\widehat{{\mathbf {f}}}({\mathbf {x}})\), is selected. The set \({\mathbf {V}}_{near}\) is defined around the expansion node and contains all states \({\mathbf {w}}\) in \({\mathbf {G}}\) that are within the radius r of \({\mathbf {v}}_m\). The edges that connect all \({\mathbf {w}}\) to \({\mathbf {v}}_m\) are then added to the queue \({\mathbf {Q}}_e\).

Once the edge queue \({\mathbf {Q}}_e\) has been defined, each edge is processed by selecting the \({\mathbf {w}} \in {\mathbf {Q}}_e\) with the lowest estimated cost from the expanded vertex \({\mathbf {v}}_m\) to the goal that passes through \({\mathbf {w}}\) (Alg 2, Line 12). The chosen edge is then removed from \({\mathbf {Q}}_e\). The edge will be added to the tree subject to the condition in Alg 2, Line 14, namely that the actual cost (including collisions) to \({\mathbf {w}}_m\) via \({\mathbf {v}}_m\) is less than the tree cost \(\mathbf {g}_{\varvec{\tau }}({\mathbf {w}}_m)\) (if \({\mathbf {w}}_m\) is not already in the tree then this is guaranteed).

If \({\mathbf {w}}_m\) is already in \({\mathbf {V}}\), then its existing edge \(({\mathbf {v}},{\mathbf {w}}_m)\) is removed from \({\mathbf {E}}\) and the new edge \(({\mathbf {v}}_m,{\mathbf {w}}_m)\) is added. If \({\mathbf {w}}_m\) is not in the tree then it is added to \({\mathbf {V}}\) and also to the vertex queue \({\mathbf {Q}}_v\) (for future vertex expansion). The vertex expansion process ends when all edges of \({\mathbf {v}}_m\) have been checked, i.e. \({\mathbf {Q}}_e\) is empty. When all vertices have been expanded or a path to the goal has been found, the tree \(\mathbf {\tau (V,E)}\) is returned for candidate selection (as shown in Figs. 3b, 4b).
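The expansion logic of Alg 2 can be paraphrased in the following sketch. Data structures, helper names and the stopping rule (the loop stops once a goal state reaches the front of the vertex queue) are simplifying assumptions, and dynamic constraints are ignored.

```python
import math

def expand_tree(G, x_k, goal_set, r, h_hat, true_cost):
    """Sketch of the Alg 2 expansion loop.

    G         : list of sampled 2-D states (tuples), the RGG samples
    goal_set  : subset of G defining X_goal
    r         : connection radius
    h_hat     : heuristic cost-to-go to the goal set (e.g. Euclidean)
    true_cost : collision-checked edge cost, returning math.inf on collision
    """
    dist = lambda a, b: math.hypot(a[0] - b[0], a[1] - b[1])
    V, E = {x_k}, {}                    # tree vertices and child -> parent edges
    g_tree = {x_k: 0.0}                 # cost-to-come through the current tree
    Qv = [x_k]                          # vertex queue

    while Qv:
        # Expand the queued vertex with the lowest estimated solution cost.
        v = min(Qv, key=lambda x: g_tree[x] + h_hat(x))
        Qv.remove(v)
        if v in goal_set:               # a goal state is connected: stop expanding
            break
        # Consider edges from v to every sampled state within the radius r.
        near = [w for w in G if w != v and dist(v, w) <= r]
        for w in sorted(near, key=lambda w: dist(v, w) + h_hat(w)):
            c = true_cost(v, w)         # inf if the edge intersects an obstacle
            if g_tree[v] + c < g_tree.get(w, math.inf):
                g_tree[w] = g_tree[v] + c   # add w to the tree or rewire it through v
                E[w] = v
                if w not in V:
                    V.add(w)
                    Qv.append(w)
    return V, E, g_tree
```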

Algorithm 3

5.2.3 Sample set acquisition (Alg 3)

The full tree \(\tau \) from the initial search is groomed in Alg 3 in order to meet the computational requirements of the IPP. Firstly, the tree is pruned so that only vertices in \({\mathbf {V}}\) and edges in \({\mathbf {E}}\) which can possibly improve (or are part of) the current solution are kept for consideration (Alg 3, Lines 1:2). This defines the ellipse of Figs. 3c, 4c. In the case where no solution is found, i.e. \({\mathbf {c}}_{best}=\infty \), \({\mathbf {V}}\) and \({\mathbf {E}}\) remain unchanged.

After pruning, tree blossoming or tree culling will occur. The tree will be further expanded (blossoming, Fig. 4d) if there are not enough vertices in \({\mathbf {V}}\) to match \(|\Sigma |\) (Alg 3, Lines 4:21), or further reduced (culling, Fig. 3d) if there are more vertices in \({\mathbf {V}}\) than can be efficiently computed for their utility (Alg 3, Lines 23:25).

For a given tree \(\tau \) where \(|{\mathbf {V}}|<|\Sigma |\), the tree is expanded with a new search radius r defined by the current state of the tree (Alg 3, Line 4). To give more potential sampling locations from which to evaluate utility, a new query state \({\mathbf {x}}_m\) is sampled from weighted free space \({\mathbf {x}} \in {\mathbf {X}}_{free} | {\mathbf {D}}_{\phi }\) subject to the constraint that \(\widehat{{\mathbf {f}}}({\mathbf {x}}) \le {\mathbf {c}}_{best}+2\sigma _{\theta _s^{(k)}}\) (Alg 3, Line 6). This defines the ellipse of Fig. 4d. Including the source location uncertainty is necessary because the true \({\mathbf {x}}_s\) is probabilistically likely to lie within two standard deviations of the goal \({\mathbf {x}}_{goal}\), and therefore new exploratory samples should be drawn accounting for this. Furthermore, as \({\mathbf {c}}_{best} \rightarrow 0\), the admissible sampling set \({\mathbf {x}} \in {\mathbf {X}}_{free} \rightarrow \emptyset \), and therefore, to avoid this local minimum, the robot should attempt to search in an area relative to the uncertainty of its estimate. Alg 3, Lines 7:20 closely follow the tree expansion process of Alg 2, Lines 9:23, the difference being that we are now attempting to connect the unconnected state \({\mathbf {x}}_m\) to one of the vertices in the tree (as opposed to expanding the tree into the unconnected set). If a sampled state \({\mathbf {x}}_m\) cannot be connected to the tree, then the criterion at Alg 3, Line 12 will always fail and a new state will be initiated, with \({\mathbf {V}}\) and \({\mathbf {E}}\) remaining unchanged. Once the number of vertices in the tree equals \(|\Sigma |\), the final state of \(\tau \) is returned for utility calculation.
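One way to realise the constrained sampling of Alg 3, Line 6 is simple rejection sampling from \({\mathbf {D}}_\phi \), as sketched below; the function signature and the grid-based representation of free space are assumptions.

```python
import numpy as np

def draw_blossom_sample(grid_centres, d_phi, f_hat, c_best, sigma_s,
                        rng=None, max_tries=1000):
    """Draw one extra sample for tree blossoming by rejection sampling: draw from
    the weighted free-space distribution D_phi and accept only if
    f_hat(x) <= c_best + 2*sigma_s, i.e. inside the expanded ellipse of Fig. 4d.

    grid_centres : (M, 2) array of free-space cell centres
    d_phi        : length-M array of sampling probabilities (sums to 1)
    sigma_s      : standard deviation of the current source-location estimate
    """
    rng = rng or np.random.default_rng()
    for _ in range(max_tries):
        idx = rng.choice(len(grid_centres), p=d_phi)
        x = grid_centres[idx]
        if f_hat(x) <= c_best + 2.0 * sigma_s:
            return x
    return None  # admissible region may be (nearly) empty; caller keeps V, E unchanged
```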

If, for a given tree \(\tau \), \(|{\mathbf {V}}|\ge |\Sigma |\), the tree is culled as per Alg 3, Lines 23:25. A query set of vertices \({\mathbf {V}}_m\) is defined as the \(|\Sigma |\) vertices \({\mathbf {v}} \in {\mathbf {V}}\) with the lowest tree cost. Since \({\mathbf {V}}\) has already been pruned, the remaining branches in \({\mathbf {V}}_m\) are directed towards the goal whilst allowing for some exploratory states around the path to the goal. \({\mathbf {V}}\) is then updated to the new set \({\mathbf {V}}_m\) and any edges that contain old vertices are removed from \({\mathbf {E}}\) (Alg 3, Line 25). In the scenario where \({\mathbf {c}}_{best}=\infty \), \({\mathbf {V}}_m\) will be a non-directional set of nodes that were connected to \({\mathbf {x}}_k\) during tree expansion.
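The culling step can be written compactly as below; the dictionary-based tree representation is an assumption carried over from the earlier sketches.

```python
import math

def cull_tree(V, E, g_tree, x_k, n_paths):
    """Sketch of the culling step (Alg 3, Lines 23:25): keep the n_paths = |Sigma|
    vertices with the lowest tree cost-to-come and drop edges touching removed ones.

    V      : set of tree vertices
    E      : dict mapping child vertex -> parent vertex
    g_tree : dict of cost-to-come values for vertices in the tree
    """
    keep = sorted(V, key=lambda v: g_tree.get(v, math.inf))[:n_paths]
    V_m = set(keep) | {x_k}                              # always retain the root
    E_m = {w: v for w, v in E.items() if w in V_m and v in V_m}
    return V_m, E_m
```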

6 IPP utility calculation

Given that the robot has acquired a set of candidate trajectories \(\Sigma \), it needs to select the trajectory and the sample location that minimise Eq. (3). Several definitions of the utility function \(\Psi (\cdot )\) can be used together with parametric modelling techniques such as the Bayesian inference used here. Based on the results attained in (Hutchinson et al., 2018), the Entrotaxis measure of information gain has proven to be effective in source search and is therefore the chosen metric when defining the utility function. Entrotaxis attempts to find the most informative location by considering the entropy of the predictive measurement distribution at a sample location; we therefore define this location as \(\sigma (1)\), given the start of the trajectory \(\sigma (0)\). In Entrotaxis, the Shannon entropy \({\mathbf {H}}(\cdot )\) is used as the expected information measure as follows:

$$\begin{aligned} \Psi (\sigma )=-\int P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|{\mathbf {z}}_{1:k}) \log P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|{\mathbf {z}}_{1:k})\,d\widehat{{\mathbf {z}}}_{k+1} \end{aligned}$$
(4)

where \(\widehat{{\mathbf {z}}}_{k+1}(\sigma )\) refers to the unknown measurement at the potential sampling position of \(\sigma (1) \in \Sigma \). This measurement will not be known until the location is physically sampled, and therefore the probability of the expected number of particle encounters \(P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|{\mathbf {z}}_{1:k})\) is derived using the posterior distribution of the source \(\Theta _{k}\):

$$\begin{aligned}&P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|{\mathbf {z}}_{1:k}) \nonumber \\&\quad = \int _{\Theta _{k+1}} P(\widehat{{\mathbf {z}}}_{k+1}(\sigma ),\Theta _{k+1}|{\mathbf {z}}_{1:k})d\Theta _{k+1}\nonumber \\&\quad =\int _{\Theta _{k+1}}P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|\Theta _{k+1})P(\Theta _{k+1}|{\mathbf {z}}_{1:k})d\Theta _{k+1} \nonumber \\&\quad \approx \sum _{i=1}^{n} w_k^{(i)} \cdot P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|\Theta _{k+1}^{(i)}) \end{aligned}$$
(5)

where the weighted samples \(\{\Theta _{k}^{(i)}, w_{k}^{(i)}\}_{i=1}^{n}\) constitute the posterior distribution \(P(\Theta _{k}|{\mathbf {z}}_{1:k})\) and \(\Theta _{k+1}^{(i)}\) = \(\Theta _{k}^{(i)}\) (Hutchinson et al., 2019a). To reduce the computational load, a much smaller number of samples \(\{\Theta _k^{(l)},1/n_z\}_{l=1}^{n_z}\), where \(n_z \ll n\), is resampled from the full posterior to give a possible future measurement set \(\{\widehat{{\mathbf {z}}}_{k+1}^{(l)}\}_{l=1}^{n_z}\) and reduce Eq. (5) to:

$$\begin{aligned} P(\widehat{{\mathbf {z}}}_{k+1}(\sigma )|{\mathbf {z}}_{1:k}) \approx \frac{1}{n_z}\sum _{l=1}^{n_z} \delta \big (\widehat{{\mathbf {z}}}_{k+1}-\widehat{{\mathbf {z}}}_{k+1}^{(l)}\big ) \end{aligned}$$
(6)
Fig. 5 Snapshot concentration colour map for each of the 3 tested sources, superimposed onto the occupancy map (time \(=0\) s). Concentrations are coloured as \(\log (\text {Kg/m}^3)\) with concentrations below \(10\,\mu \text {g/m}^3\) not coloured (source 3's plume has been artificially inflated by \(10^3\) for visual aid). Source location is shown as a magenta circle. Sources 1 & 2 are vented sources with a perturbation of 1 m/s and source 3 is a passive source with no excitation

Substituting Eq. (6) into Eq. (4) allows the entropy of performing the trajectory, \(\Psi (\sigma )\), to be approximated by summing over the possible future measurements.

$$\begin{aligned} \Psi (\sigma ) \approx \frac{1}{n_z} \sum _{l=1}^{n_z} {\widehat{w}}_{k+1}^{(i,l)} \log {\widehat{w}}_{k+1}^{(i,l)} \end{aligned}$$
(7)

\(\Psi (\cdot )\) is then calculated for all \(\sigma \in \Sigma \) and minimised as per Eq. (3) to give the optimal trajectory \(\sigma ^*_k\). The Entrotaxis reward function is calculated every time a new set \(\Sigma \) is defined from the informed tree search.
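A minimal sketch of evaluating this utility at a candidate sampling location \(\sigma (1)\) is shown below. It resamples \(n_z\) particles from the posterior and simulates hypothetical noisy readings in the spirit of Eqs. (5)–(6); the histogram-based entropy estimate in the final lines is a simplification chosen for brevity, whereas the paper uses the weighted-particle form of Eq. (7), so treat the estimator, the noise model and all names as assumptions.

```python
import numpy as np

def entrotaxis_utility(sigma_end, particles, weights, expected_conc, noise_std,
                       n_z=20, n_bins=10, rng=None):
    """Approximate utility Psi(sigma) at the candidate sampling location sigma(1).

    particles     : (n, p) array of posterior source-term samples Theta^(i)
    weights       : (n,) array of particle weights w^(i) (sums to 1)
    expected_conc : callable (x, theta) -> expected concentration, e.g. Eq. (2)
    """
    rng = rng or np.random.default_rng()
    idx = rng.choice(len(particles), size=n_z, p=weights)      # resample n_z particles
    z_hat = np.array([expected_conc(sigma_end, particles[i]) for i in idx])
    z_hat = z_hat + rng.normal(0.0, noise_std, size=n_z)       # hypothetical readings

    # Histogram estimate of the predictive measurement distribution.
    counts, _ = np.histogram(z_hat, bins=n_bins)
    p = counts[counts > 0] / n_z
    # Returns the negative Shannon entropy; minimising it follows the sign of Eq. (7).
    return float(np.sum(p * np.log(p)))
```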

Whilst the Entrotaxis utility function has been utilised, any information theoretic measure may be used in its place. The exploratory effect of such measures ensures that the searching agent does not get stuck in minima around the goal set and leads the agent to continue picking samples that minimise the estimation uncertainty (encouraged by the tree blossoming effect). This feature also helps in the case of a misleading prior, where there is a mismatch between \({\mathbf {s}}\) and \({\mathbb {E}}(\theta _{\mathbf {s}})\), since the Entrotaxis utility function will pick samples that update the source estimate away from the incorrect region.

At this point in the system, the IPP has taken an informed set of potential trajectories from the informed tree search, performed predictive modelling on this subset of samples to calculate predicted information gain for each trajectory, and then chosen the optimal solution to be executed by a low-level path planner.

This defines a single control loop of the proposed source search system. The inference, informed tree search and utility calculation procedures are then repeated iteratively as per Fig. 1 until an end constraint on the system is met, e.g., a time budget.

7 Simulation

A set of simulation studies has been carried out to verify the proposed autonomous search algorithm in a large scale, outdoor, feature rich environment. The setting for the study is the DAPPLE dispersion scenario (Martin et al., 2010), which is generated by an experimentally validated CFD simulation of a source release under steady wind conditions in urban London. This dataset contains a complex series of urban canyons causing local wind field instabilities, which not only makes it challenging for the algorithms to predict the source, but also makes it suitable to test the efficient path finding ability of the main contribution of this paper, i.e. the informed tree search.

There are 3 source locations within the same DAPPLE domain that can be tested, as shown in Fig. 5. Sources 1 & 2 are modelled as active vent releases with an outlet velocity of 1 m/s, whilst source 3 is a passive release where the main method of transportation is the local wind field formed inside the urban environment. Sources 2 & 3 are modelled as ground releases whilst source 1 is a release on top of a building. The corresponding occupancy grid for source 1 is less dense, due to being at a higher altitude than some of the surrounding buildings, and consequently the corresponding sample distribution is also slightly different. At the initialisation of each source, i.e. \(k=0\) s, the plume structure is quasi-stable and therefore only fluctuating locally, whilst the main plume structures are stable. All sources are modelled as constant releases, matching the assumption made in the estimation model; however, each source configuration matches the estimation model with varying accuracy, as will be shown in the results.

The popular Entrotaxis approach is used as the benchmark in this paper, from which conclusions about the proposed algorithm will be drawn. To ensure functionality in a feature rich environment, Entrotaxis is slightly modified for simple obstacle avoidance, so that any candidate trajectory \(\sigma \) that intersects \({\mathbf {X}}_{\mathrm {obs}}\) is discarded before utility calculation.

In addition to the benchmark algorithm, Entrotaxis-Jump (Zhao et al., 2020b) is also compared in the study. Similar to the proposed algorithm, Entrotaxis-Jump is based on the Entrotaxis utility function and has also been designed for use in dense urban environments. Entrotaxis-Jump is originally presented with four deterministic path planning actions \(\{\uparrow , \rightarrow , \downarrow , \leftarrow \}\); however, we extend the algorithm to match the planning horizon of standard Entrotaxis in Table 2 (the detailed implementation can be found in Appendix 1).

Fig. 6 Left: 960,000 m\(^2\) search area showing obstacles (black), 5 starting positions (coloured stars), 3 source locations (magenta circles) and the initial prior source location area defined in \( 2 \sigma _{\theta _s}\) (green circle). Right: Sample generation distribution attained from Dijkstra's search with \({\mathbf {X}}_{inlet}=[0,0:800]\) (for sources 2 & 3). Yellow colouring is a high likelihood whilst blue colouring is a low likelihood (Color figure online)

This paper presents two unique additions to the source search process: informed tree search and sample generation. As discussed previously, the sample generation technique is proposed to minimise the effect of buildings on the sampling process and, as such, it is expected that the sample generation technique will give the largest improvement for sources that do not match the IP plume of the estimation model well. To prove the efficacy of the sample generation methodology, the system is also tested without this feature (termed the uniform tree search) and compared alongside the fully informed tree search algorithm.

7.1 Test setup

Each control strategy is tested across all 3 sources with the same prior parameters of \(\Theta \) initiated for each source, as shown in Table 1. Prior distributions are implemented following examples from the literature as typical starting conditions for source search (Hutchinson et al., 2019b; Ristic et al., 2017). Gaussian distributions are set on the source location to implement the domain knowledge that the source is most likely to be at the centre of the search area (uniform distributions may also be used given no user domain information). Testing three sources ensures that results can be evaluated for their robustness in differing configurations, as opposed to the circumstance where a particular method favours a single type of source release.

To further test the adaptability of the proposed algorithm, 5 starting locations are also tested for each source (shown in Fig. 6). Locations 1–5 are situated at [1100, 50], [1100, 325], [1100, 650], [100, 50] & [100, 650] respectively. Locations 1–3 are downwind of the plume (aligned along the x-axis) and represent typical favourable starting locations for source search; these downwind locations are used to determine the general performance of each control strategy. Although source search is typically started downwind, two further upwind locations representing unfavourable starting conditions are also tested to assess the robustness of the algorithms.

Model parameters for the estimation engine are shown in Table 1 alongside the ground truth values of the sources. The locations of sources 1, 2 & 3 are [466, 392], [475, 376] & [534, 300] respectively.

Table 1 Prior parameters for the estimation engine alongside the ground truth source values

Key parameters for Entrotaxis, Entrotaxis-Jump and the informed tree search are outlined in Table 2. To ensure a fair comparison, all three control methods are subject to the same number of utility calculations per step (equal to \(|\Sigma |\)), and the number of predictive measurements of the particle filter, \(n_z\), also remains constant. The sample generation distribution for the informed tree search, derived from the obstacle map of the DAPPLE domain, is shown in Fig. 6.

Table 2 Parameters for IPP

To model the sampling robot, we assume a mobile sensor (e.g. an unmanned ground or aerial vehicle) with the parameters outlined in Table 3. The vehicle is fitted with a single fast response chemical sensor capable of detecting concentrations down to 0.1 g/m\(^3\). Due to the limitations of the dataset, concentration data is only available at a sampling height of 5 m for sources 2 & 3 and 15 m for source 1, so the robot is assumed to have a fixed sampling height. The sensing robot is assumed to travel at a constant velocity. The allowed time budget is more conducive to a ground vehicle; however, conclusions about suitability for a UAV can still be drawn by analysing the presented results at a lower time budget.
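
For reference, these operational parameters can be grouped into a simple configuration object; the travel speed below is an assumed placeholder, the time budget is inferred from the plotted simulation horizon, and the remaining values follow the text rather than Table 3 itself.

from dataclasses import dataclass

@dataclass
class RobotConfig:
    """Illustrative operational parameters for the simulated sensing robot."""
    speed: float = 2.0            # constant travel speed (m/s) -- assumed value
    sampling_height: float = 5.0  # fixed sensing height (m); 15 m for source 1
    min_detectable: float = 0.1   # sensor detection threshold (g/m^3)
    time_budget: float = 3600.0   # simulation time budget (s) -- inferred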

Table 3 Operational parameters for simulated robot

7.2 Results

To analyse the performance of a searching strategy, the weighted root mean square error RMSE between the source location \({\mathbf {s}}_{x,y}\) and the current posterior estimate of the Bayesian inference, represented by the weighted particle set \(\{\theta _{s}^{(i)}, w_k^{(i)}\}_{i=1}^{n}\), is recorded after each sampling event. The weighted RMSE is calculated as:

$$\begin{aligned} \mathrm {RMSE}_k=\sqrt{\sum _{i=1}^{n}{w}_k^{(i)}\,\Vert \theta _{s}^{(i)}-{\mathbf {s}}\Vert _2^2} \end{aligned}$$
(8)
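
A minimal sketch of evaluating Eq. (8) from a weighted particle set is given below (array names are illustrative):

import numpy as np

def weighted_rmse(particles_xy, weights, source_xy):
    """Weighted RMSE between the particle approximation of the source location
    posterior and the true source position, following Eq. (8)."""
    sq_err = np.sum((np.asarray(particles_xy) - np.asarray(source_xy)) ** 2, axis=1)
    return float(np.sqrt(np.sum(np.asarray(weights) * sq_err)))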

The success rate SR and mean search time MST are also studied, since they are common criteria for evaluating source search algorithms. A source is defined as successfully resolved when the RMSE of the inference engine drops below 50 m during a single test run. The MST is defined as the time taken for the source to be resolved, averaged across all successful runs. As such, MST measures how efficient the searching process is, whilst SR measures the reliability of the search.
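
Both metrics can be derived directly from the per-run RMSE traces; the sketch below assumes the 50 m threshold stated above and illustrative array shapes.

import numpy as np

def success_rate_and_mst(rmse_runs, times, threshold=50.0):
    """rmse_runs : (n_runs, n_steps) RMSE recorded after each sampling event
       times     : (n_steps,) simulation time of each sampling event (s)
       A run is successful if its RMSE drops below the threshold; the MST is
       the first such time, averaged over the successful runs only."""
    runs = np.atleast_2d(np.asarray(rmse_runs, dtype=float))
    times = np.asarray(times, dtype=float)
    search_times = [times[np.flatnonzero(run < threshold)[0]]
                    for run in runs if np.any(run < threshold)]
    sr = len(search_times) / len(runs)
    mst = float(np.mean(search_times)) if search_times else float("nan")
    return sr, mst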

Fig. 7 RMSE over time averaged across all 360 Monte Carlo simulations for sources 1, 2 & 3 for the downwind start locations. One standard deviation bounds from the mean are shown in the corresponding mean line colour (Color figure online)

7.2.1 Individual source analysis

For the 3 different sources with the downwind starting locations, Fig. 7 shows the average RMSE at each time step, as well as the \(1\sigma \) upper and lower bounds of the 360 Monte Carlo runs per method. Only downwind starting locations are used for the individual source analysis, so that convergence performance can be shown without conflating it with each algorithm's robustness to unfavourable start conditions. Table 4 also shows the SR and MST of the same simulations. It can be seen that for all sources, the searching efficiency of the proposed tree search improves significantly over Entrotaxis and Entrotaxis-Jump (shown by a lower MST and a faster rate of RMSE reduction). Search reliability has also increased, as shown by the SR of the tree searches.

Table 4 Downwind location SR and MST for each of the three sources

When comparing between the sources, each source's properties must first be considered. Of the 3 tested sources, sources 1 & 2 match the IP model most closely, with source 3 being the least well modelled; this is because the location of source 3 is the most affected by the environment structure. Whilst this can be seen visually in Fig. 5, it is also reflected in the results when studying the final RMSE of each source individually. For sources 1 & 2, the average converged RMSE is 11 m & 25 m respectively, against an average of 67 m for source 3, confirming the greater mismatch between the estimation model and the actual source. Source 1 is the easiest from a path planning perspective since, as explained previously, its occupancy map is sparser, as shown by the mutual convergence of all the planners. Source 2 is more challenging from a path planning perspective, and the ability of the tree search to adaptively dictate sample steps based on the posterior variance is clearly shown by the disparity in converged RMSE.

In Sect. 4, it is argued that the preferential sampling distribution should yield an efficiency gain when estimating sources that are not well matched by the model. Based on this notion, the informed search is expected to see a larger efficiency gain over the uniform search on source 3, only a marginal gain on source 2, and similar performance on source 1. This is clearly shown in Fig. 7 as well as in the SR and MST values of Table 4. The result shows that the informed searching method can make the source search process more efficient (especially in complex dispersion scenarios) and motivates further research into how other prior sampling distributions may be incorporated for additional efficiency gains.

Analysing the performance of the Jump strategy, it can be seen that Jump performs well on the more difficult source 3 (comparable with the uniform search), but generally has a slower MST because jumps around buildings cause extended traversals between sampling events. For sources 1 & 2, performance comparable with Entrotaxis is seen. Since a single set of parameters is used across all sources to ensure a fair comparison, the Jump algorithm does not necessarily perform optimally in each case, which highlights the need for an adaptive algorithm such as the proposed method. This is seen on source 1, where the plume near the source has significantly fewer obstacles; Jump therefore unnecessarily jumps around some buildings, leading to a longer MST despite the same SR as Entrotaxis and indicating that a larger \(n_{jump}\) value is needed than for the other sources.

7.2.2 Overall evaluation

For the overall evaluation, the 2 upwind start locations are included to demonstrate each algorithm's general performance, including robustness to unfavourable start locations. Fig. 8 shows the average RMSE at each time step, as well as the \(1\sigma \) upper and lower bounds of the 600 Monte Carlo runs per method. Even with the upwind locations, the informed tree search shows a significant overall improvement over the standard Entrotaxis and Entrotaxis-Jump approaches. The informed search shows a much greater initial rate of error reduction, as well as a better average final RMSE at \(k=3600\) s (24 m vs 29 m vs 44 m vs 54 m). Furthermore, the variance of the proposed method is much narrower than that of the other methods, showing that the informed tree search gives more consistent performance. Table 5 shows that the proposed search method also drastically improves the SR as well as the MST over both Entrotaxis and Entrotaxis-Jump.

The overall results also support the findings of Zhao et al. (2020b) by showing that Entrotaxis-Jump outperforms the standard algorithm in urban environments, with a significantly improved SR despite a comparable MST. Fig. 8 shows that Jump and Entrotaxis initially have the same performance, because the initial search does not take place in a cluttered region. This indicates that Entrotaxis-Jump will only outperform Entrotaxis in a dense urban region (its designed purpose), whereas our proposed method also outperforms Entrotaxis in open regions because the tree search extends towards the likely source location, thus providing more potential sampling locations in an informed direction.

Table 5 SR and MST across all simulations
Fig. 8 RMSE with respect to time, averaged across all 600 Monte Carlo runs for each method over the three scenarios. One standard deviation bounds from the mean are shown in the corresponding mean line colour (Color figure online)

For context, example trajectories from a single run of the informed tree search and Entrotaxis are shown in Fig. 9. The efficiency benefit of using informed trees is clear: 1200 s into the simulation, the informed tree approach has navigated the robot to the vicinity of the source location, whereas Entrotaxis has not yet reached the main plume.

Fig. 9 Example trajectories for estimating source 2 with a downwind and an upwind starting position, from \(t=0\) to \(t=1200\) s of simulation time. The red line indicates the historic trajectory of the informed tree search and the green line represents the Entrotaxis trajectory (Color figure online)

8 Conclusion

In this paper, we integrate obstacle avoiding trajectory generation with a source term estimation engine to deliver autonomous search for an airborne release in urban environments. By combining a state-of-the-art parametric inference model with a single-batch informed tree search algorithm, we are able to efficiently navigate a sampling robot in an informed manner towards a source location. Furthermore, local minima are avoided by taking exploratory actions relative to the uncertainty of the source location estimate.

As demonstrated in the simulation studies, the presented approach is reliably capable of localising a source within a large-scale, feature-rich environment across a variety of source terms and start conditions. The informed tree search is shown to far outperform the baseline Entrotaxis and Entrotaxis-Jump approaches in an urban environment when evaluating the same number of possible future sampling locations, demonstrating the efficiency benefit of planning trajectories towards the goal. The addition of an informed sample generation algorithm, which attempts to account for the fundamental mismatch between the plume model and the actual complex flow around obstacles, is also shown to provide a further gain in the achievable rate of error reduction. Since the batch tree search can accept any prior sampling distribution, future work can investigate alternative distributions to make the search process even more efficient and robust. Overall, the results demonstrate that the proposed framework is capable of guiding a sensing robot to respond to emergency CBRN events in urban environments.