Skip to main content
Log in

Online planning for multi-robot active perception with self-organising maps

  • Published:
Autonomous Robots Aims and scope Submit manuscript

Abstract

We propose a self-organising map (SOM) algorithm as a solution to a new multi-goal path planning problem for active perception and data collection tasks. We optimise paths for a multi-robot team that aims to maximally observe a set of nodes in the environment. The selected nodes are observed by visiting associated viewpoint regions defined by a sensor model. The key problem characteristics are that the viewpoint regions are overlapping polygonal continuous regions, each node has an observation reward, and the robots are constrained by travel budgets. The SOM algorithm jointly selects and allocates nodes to the robots and finds favourable sequences of sensing locations. The algorithm has a runtime complexity that is polynomial in the number of nodes to be observed and the magnitude of the relative weighting of rewards. We show empirically the runtime is sublinear in the number of robots. We demonstrate feasibility for the active perception task of observing a set of 3D objects. The viewpoint regions consider sensing ranges and self-occlusions, and the rewards are measured as discriminability in the ensemble of shape functions feature space. Exploration objectives for online tasks where the environment is only partially known in advance are modelled by introducing goal regions in unexplored space. Online replanning is performed efficiently by adapting previous solutions as new information becomes available. Simulations were performed using a 3D point-cloud dataset from a real robot in a large outdoor environment. Our results show the proposed methods enable multi-robot planning for online active perception tasks with continuous sets of candidate viewpoints and long planning horizons.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  • Angéniol, B., de la Vaubois, C. G., & Texier, J. Y. L. (1988). Self-organizing feature maps and the travelling salesman problem. Neural Networks, 1(4), 289–293.

    Article  Google Scholar 

  • Archetti, C., Hertz, A., & Speranza, M. G. (2007). Metaheuristics for the team orienteering problem. Journal of Heuristics, 13(1), 49–76.

    Article  Google Scholar 

  • Atanasov, N., Ny, J. L., Daniilidis, K., & Pappas, G. J. (2015). Decentralized active information acquisition: Theory and application to multi-robot SLAM. In Proceedings of IEEE ICRA (pp. 4775–4782).

  • Atanasov, N., Sankaran, B., Le Ny, J., Pappas, G., & Daniilidis, K. (2014). Nonmyopic view planning for active object classification and pose estimation. IEEE Transactions on Robotics, 30(5), 1078–1090.

    Article  Google Scholar 

  • Bargoti, S., Underwood, J. P., Nieto, J. I., & Sukkarieh, S. (2015). A pipeline for trunk detection in trellis structured apple orchards. Journal of Field Robotics, 32(8), 1075–1094.

    Article  Google Scholar 

  • Becerra, I., Valentín-Coronado, L. M., Murrieta-Cid, R., & Latombe, J. C. (2016). Reliable confirmation of an object identity by a mobile robot: A mixed appearance/localization-driven motion approach. The International Journal of Robotics Research, 35(10), 1207–1233.

    Article  Google Scholar 

  • Bektas, T. (2006). The multiple traveling salesman problem: An overview of formulations and solution procedures. Omega, 34(3), 209–219.

    Article  MathSciNet  Google Scholar 

  • Best, G., Cliff, O., Patten, T., Mettu, R., & Fitch, R. (2016a). Decentralised Monte Carlo tree search for active perception. In Proceedings of the WAFR.

  • Best, G., Faigl, J., & Fitch, R. (2016b). Multi-robot path planning for budgeted active perception with self-organising maps. In Proceedings of IEEE/RSJ IROS (pp. 3164–3171).

  • Best, G., & Fitch, R. (2016). Probabilistic maximum set cover with path constraints for informative path planning. In: Proceedings of ARAA ACRA.

  • Best, G., Martens, W., & Fitch, R. (2017). Path planning with spatiotemporal optimal stopping for stochastic mission monitoring. IEEE Transactions on Robotics, 33(3), 629–646.

    Article  Google Scholar 

  • Binney, J., & Sukhatme, G. (2012). Branch and bound for informative path planning. In Proceedings of IEEE ICRA (pp. 2147–2154).

  • Bircher, A., Kamel, M., Alexis, K., Burri, M., Oettershagen, P., Omari, S., et al. (2016). Three-dimensional coverage path planning via viewpoint resampling and tour optimization for aerial robots. Autonomous Robots, 40(6), 1059–1078.

    Article  Google Scholar 

  • Bourgault, F., Makarenko, A., Williams, S., Grocholsky, B., & Durrant-Whyte, H. (2002). Information based adaptive robotic exploration. In Proceedings of IEEE/RSJ IROS (pp. 540–545).

  • Cao, N., Low, K. H., & Dolan, J. M. (2013). Multi-robot informative path planning for active sensing of environmental phenomena: A tale of two algorithms. In Proceedings of AAMAS (pp. 7–14).

  • Charrow, B. (2015). Information-theoretic active perception for multi-robot teams. Ph.D. thesis, University of Pennsylvania.

  • Chekuri, C., & Pal, M. (2005). A recursive greedy algorithm for walks in directed graphs. In Proceedings of IEEE FOCS (pp. 245–253).

  • Chen, S., Li, Y., & Kwok, N. M. (2011). Active vision in robotic systems: A survey of recent developments. The International Journal of Robotics Research, 30(11), 1343–1377.

    Article  Google Scholar 

  • Cochrane, E. M., & Beasley, J. E. (2003). The co-adaptive neural network approach to the euclidean travelling salesman problem. Neural Networks, 16(10), 1499–1525.

    Article  Google Scholar 

  • Corah, M., & Michael, N. (2017). Efficient online multi-robot exploration via distributed sequential greedy assignment. In Proceedings of robotics: science and systems.

  • Cunningham-Nelson, S., Moghadam, P., Roberts, J., & Elfes, A. (2015). Coverage-based next best view selection. In Proceedings of ARAA ACRA.

  • Dang, D. C., El-Hajj, R., & Moukrim, A. (2013a). A branch-and-cut algorithm for solving the team orienteering problem. In Proceedings of CPAIOR (pp. 332–339). Springer.

  • Dang, D. C., Guibadj, R. N., & Moukrim, A. (2013b). An effective PSO-inspired algorithm for the team orienteering problem. European Journal of Operational Research, 229(2), 332–344.

    Article  MATH  Google Scholar 

  • Dornhege, C., Kleiner, A., Hertle, A., & Kolling, A. (2016). Multirobot coverage search in three dimensions. Journal of Field Robotics, 33(4), 537–558.

    Article  Google Scholar 

  • Faigl, J. (2010). Approximate solution of the multiple watchman routes problem with restricted visibility range. IEEE Transactions on Neural Networks, 21(10), 1668–1679.

    Article  Google Scholar 

  • Faigl, J. (2016a). An application of self-organizing map for multirobot multigoal path planning with minmax objective. Computational Intelligence and Neuroscience. https://doi.org/10.1155/2016/2720630.

  • Faigl, J. (2016b). On self-organizing map and rapidly-exploring random graph in multi-goal planning. In Advances in self-organizing maps and learning vector quantization (pp. 143–153). Springer.

  • Faigl, J. (2017). On self-organizing maps for orienteering problems. In Proceedings of IJCNN (pp. 2611–2620).

  • Faigl, J., & Hollinger, G. (2014). Unifying multi-goal path planning for autonomous data collection. In Proceedings of IEEE/RSJ IROS (pp. 2937–2942).

  • Faigl, J., & Hollinger, G. A. (2017). Autonomous data collection using a self-organizing map. IEEE Transactions on Neural Networks and Learning Systems. https://doi.org/10.1109/TNNLS.2017.2678482.

  • Faigl, J., Kulich, M., & Přeučil, L. (2012). Goal assignment using distance cost in multi-robot exploration. In Proceedings of IEEE/RSJ IROS (pp. 3741–3746).

  • Faigl, J., Pěnička, R., & Best, G. (2016). Self-organizing map-based solution for the orienteering problem with neighborhoods. In Proceedings of IEEE SMC (pp. 1315–1321).

  • Faigl, J., & Váňa, P. (2016). Self-organizing map for data collection planning in persistent monitoring with spatial correlations. In Proceedings of IEEE SMC (pp. 3264–3269).

  • Galceran, E., & Carreras, M. (2013). A survey on coverage path planning for robotics. Robotics and Autonomous Systems, 61(12), 1258–1276.

    Article  Google Scholar 

  • Garg, S., & Ayanian, N. (2014). Persistent monitoring of stochastic spatio-temporal phenomena with a small team of robots. In Proceedings of robotics: science and systems.

  • Gunawan, A., Lau, H. C., & Vansteenwegen, P. (2016). Orienteering problem: A survey of recent variants, solution approaches and applications. European Journal of Operational Research, 255(2), 315–332.

    Article  MathSciNet  MATH  Google Scholar 

  • Helsgaun, K. (2000). An effective implementation of the Lin–Kernighan traveling salesman heuristic. European Journal of Operational Research, 126(1), 106–130.

    Article  MathSciNet  MATH  Google Scholar 

  • Hollinger, G., Singh, S., Djugash, J., & Kehagias, A. (2009). Efficient multi-robot search for a moving target. The International Journal of Robotics Research, 28(2), 201–219.

    Article  Google Scholar 

  • Hollinger, G. A., Mitra, U., & Sukhatme, G. S. (2011). Active classification: Theory and application to underwater inspection. In Proceedings of ISRR.

  • Hönig, W., & Ayanian, N. (2016). Dynamic multi-target coverage with robotic cameras. In Proceedings of IEEE/RSJ IROS (pp. 1871–1878).

  • Kassir, A., Fitch, R., & Sukkarieh, S. (2015). Communication-aware information gathering with dynamic information flow. The International Journal of Robotics Research, 34(2), 173–200.

    Article  Google Scholar 

  • Kriegel, S., Brucker, M., Marton, Z. C., Bodenmuller, T., & Suppa, M. (2013). Combining object modeling and recognition for active scene exploration. In Proceedings of IEEE/RSJ IROS (pp. 2384–2391).

  • Kulich, M., Faigl, J., & Přeučil, L. (2011). On distance utility in the exploration task. In Proceedings of IEEE ICRA (pp. 4455–4460).

  • Kulich, M., Sushkov, R., & Přeučil, L. (2016). Speed-up of self-organizing networks for routing problems in a polygonal domain. In Proceedings of IEEE/RSJ IROS 10th international workshop on cognitive robotics.

  • Lagoudakis, M. G., Markakis, E., Kempe, D., Keskinocak, P., Kleywegt, A. J., Koenig, S., Tovey, C. A., Meyerson, A., & Jain, S. (2005). Auction-based multi-robot routing. In Proceedings of robotics: science and systems.

  • Likhachev, M., Ferguson, D. I., Gordon, G. J., Stentz, A., & Thrun, S. (2005). Anytime dynamic A*: An anytime, replanning algorithm. In Proceedings of ICAPS (pp. 262–271).

  • Martens, W., Poffet, Y., Soria, P. R., Fitch, R., & Sukkarieh, S. (2017). Geometric priors for Gaussian process implicit surfaces. IEEE Robotics and Automation Letters, 2(2), 373–380.

    Article  Google Scholar 

  • Mathew, N., Smith, S., & Waslander, S. (2013). A graph-based approach to multi-robot rendezvous for recharging in persistent tasks. In Proceedings of IEEE ICRA (pp. 3497–3502).

  • McMahon, J., & Plaku, E. (2017). Autonomous data collection with limited time for underwater vehicles. IEEE Robotics and Automation Letters, 2(1), 112–119.

    Article  Google Scholar 

  • Noon, C. E., & Bean, J. C. (1989). An efficient transformation of the generalized traveling salesman problem. Technical report 89-36, Department of Industrial and Operations Engineering, University of Michigan.

  • Patten, T. (2017). Active object classification from 3D range data with mobile robots. Ph.D. thesis, The University of Sydney.

  • Patten, T., Kassir, A., Martens, W., Douillard, B., Fitch, R., & Sukkarieh, S. (2015). A Bayesian approach for time-constrained 3D outdoor object recognition. In: Proceedings of IEEE ICRA workshop on scaling up active perception.

  • Patten, T., Martens, W., & Fitch, R. (2017). Monte Carlo planning for active object classification. Autonomous Robots. https://doi.org/10.1007/s10514-017-9626-0.

  • Patten, T., Zillich, M., Fitch, R., Vincze, M., & Sukkarieh, S. (2016). Viewpoint evaluation for online 3-D active object classification. IEEE Robotics and Automation Letters, 1(1), 73–81.

    Article  Google Scholar 

  • Peng, C., Roy, P., Luby, J., & Isler, V. (2016). Semantic mapping of orchards. IFAC-PapersOnLine, 49(16), 85–89.

    Article  Google Scholar 

  • Quattrini Li, A., Cipolleschi, R., Giusto, M., & Amigoni, F. (2016). A semantically-informed multirobot system for exploration of relevant areas in search and rescue settings. Autonomous Robots, 40(4), 581–597.

    Article  Google Scholar 

  • Robin, C., & Lacroix, S. (2015). Multi-robot target detection and tracking: Taxonomy and survey. Autonomous Robots, 40(4), 729–760.

    Article  Google Scholar 

  • Singh, A., Krause, A., Guestrin, C., & Kaiser, W. J. (2009). Efficient informative sensing using multiple robots. Journal of Artificial Intelligence Research, 34(2), 707.

    MathSciNet  MATH  Google Scholar 

  • Smith, S. L., & Imeson, F. (2017). GLNS: An effective large neighborhood search heuristic for the generalized traveling salesman problem. Computers and Operations Research, 87, 1–19.

    Article  MathSciNet  Google Scholar 

  • Somhom, S., Modares, A., & Enkawa, T. (1997). A self-organising model for the travelling salesman problem. Journal of the Operational Research Society, 48(9), 919–928.

    Article  MATH  Google Scholar 

  • Somhom, S., Modares, A., & Enkawa, T. (1999). Competition-based neural network for the multiple travelling salesmen problem with minmax objective. Computers and Operations Research, 26(4), 395–407.

    Article  MathSciNet  MATH  Google Scholar 

  • Toth, P., & Vigo, D. (2001). The vehicle routing problem. New Delhi: SIAM.

    MATH  Google Scholar 

  • Tucci, M., & Raugi, M. (2010). Stability analysis of self-organizing maps and vector quantization algorithms. In Proceedings of IJCNN (pp. 1–5).

  • van Hoof, H., Kroemer, O., & Peters, J. (2014). Probabilistic segmentation and targeted exploration of objects in cluttered environments. IEEE Transactions on Robotics, 30(5), 1198–1209.

    Article  Google Scholar 

  • Vansteenwegen, P., Souffriau, W., & Oudheusden, D. V. (2011). The orienteering problem: A survey. European Journal of Operational Research, 209(1), 1–10.

    Article  MathSciNet  MATH  Google Scholar 

  • Wohlkinger, W., Aldoma, A., Rusu, R. B., & Vincze, M. (2012). 3DNet: Large-scale object class recognition from CAD models. In Proceedings of IEEE ICRA (pp. 5384–5391).

  • Wohlkinger, W., & Vincze, M. (2011). Ensemble of shape functions for 3D object classification. In Proceedings of IEEE ROBIO (pp. 2987–2992).

  • Wu, K., Ranasigne, R., & Dissanayake, G. (2015). Active recognition and pose estimation of household objects in clutter. In Proceedings of IEEE ICRA (pp. 4230–4237).

  • Xu, Z., Fitch, R., Underwood, J., & Sukkarieh, S. (2013). Decentralized coordinated tracking with mixed discrete-continuous decisions. Journal of Field Robotics, 30(5), 717–740.

    Article  Google Scholar 

  • Yu, J., Schwager, M., & Rus, D. (2016). Correlated orienteering problem and its application to persistent monitoring tasks. IEEE Transactions on Robotics, 32(5), 1106–1118.

    Article  Google Scholar 

  • Zhang, W. D., Bai, Y. P., & Hu, H. P. (2006). The incorporation of an efficient initialization method and parameter adaptation using self-organizing maps to solve the TSP. Applied Mathematics and Computation, 172(1), 603–623.

    Article  MathSciNet  MATH  Google Scholar 

  • Zlot, R., Stentz, A., Dias, M., & Thayer, S. (2002). Multi-robot exploration controlled by a market economy. In Proceedings of IEEE ICRA (Vol. 3, pp. 3016–3023).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Graeme Best.

Additional information

This work was supported in part by the Australian Centre for Field Robotics; the NSW Government; the Australian Research Council’s Discovery Project funding scheme (No. DP140104203); and the Faculty of Engineering & Information Technologies, The University of Sydney, under the Faculty Research Cluster Program. The work of Jan Faigl was supported by the Czech Science Foundation (GAČR) under research Project No. 15-09600Y.

This is one of the several papers published in Autonomous Robots comprising the Special Issue on Online Decision Making in Multi-Robot Coordination.

Appendix: Convergence of SOM

Appendix: Convergence of SOM

In this appendix, we elaborate on the convergence properties discussed in Remark 2 of Sect. 5.1.1 to provide insight into the behaviour of the algorithm.

There are two key phases of the algorithm to consider when analysing the convergence properties. After epoch \(i_{max}=1/\delta \), the neighbourhood function definition (1) ensures that no further adaptations occur, and thus the algorithm is guaranteed to have converged (Lemma 2). However, the algorithm will typically converge prior to \(i_{max}\), but after the neighbourhood function (1) reaches 0 for \(l>0\). Because such a convergence depends on the spatial configuration of the viewpoint regions, it is not possible to prove convergence prior to \(i_{max}\) for all possible cases. In fact, it is possible to show that for certain configurations, a particular waypoint of the network may oscillate between viewpoint locations during the learning epochs. However, these are specific cases and do not occur frequently; therefore, faster convergence occurs with a high probability. Furthermore, these oscillations do not occur to the best found solution, which is what is actually returned by the algorithm. In the remainder of this appendix, we discuss the intuition behind these claims of convergence. For simplicity and without loss of generality, we suppose a single robot problem (\(R=1\)) and the viewpoint regions are defined as discrete points.

The crucial property for determining convergence is that, after some epoch, only the winner is moved towards the viewpoint location while the neighbours of the winner are not adapted. This occurs when \(f(\sigma ,l)=0\) for \(l>0\), and thus only the waypoints with the cardinal distance \(l=0\) to \(x^\star \) can be adapted, i.e., only the winner waypoint \(x^\star \) is moved. In the case of \(\sigma _0=4\) and \(\delta =0.002\), this occurs at learning epoch \(i = 68\); this agrees with our empirical analysis of the convergence of solutions in Fig. 5.

The main intuition behind the convergence is related to the limited travel cost budget \(b^1\). The adaptation is performed only if the sequence of waypoints satisfies the budget constraint after the adaptation. Let the value of the neighbourhood function be non-zero only for the winner waypoint, as described above. In cases where the winner is a new waypoint \(x_e\) on an edge (Algorithm 2 line 13), the length of the path (sequence of waypoints \(X^i\)) must increase, as illustrated in Fig. 17. Thus, if edge nodes are selected as winners, the network will stop adapting once the budget limit is reached.

Fig. 17
figure 17

Illustration of a scenario where the winner is on an edge of the path. All possible adaptations result in an increased path length

Fig. 18
figure 18

Illustration of a scenario where the winner is an existing waypoint. The path after the adaptation to \(z^\star \) is shown as the dashed lines connecting \(x^i_j\) with \(z^\star \) and \(z^\star \) with \(x^i_{j'}\). This adaptation results in an increased path length

The other case to consider is when the winner is an existing waypoint \(x^\star \). Let the neighbouring waypoints to \(x^\star \) be \(x^i_j\) and \(x^i_{j'}\). This scenario is illustrated in Fig. 18. The shaded area represents possible locations for \(z^\star \) that would result in the existing waypoint \(x^\star \) being selected as the winner. This area is defined as the intersection of the half-planes with boundaries perpendicular to \((x^i_j,x^\star )\) and \((x^\star ,x^i_{j'})\). In the Fig. 18 adaptation scenario, the length of the path increases in a similar way to the edge waypoint \(x_e\) winner case. Thus, this type of adaptation can only occur up until the cost budget is met.

However, when an existing \(x^\star \) waypoint is selected as the winner, it is possible for the length of the path to decrease as a result of the adaptation. A simple example of such a situation is visualised in Fig. 19 with two fixed waypoints at \(z_1\) and \(z_2\). Assume the travel cost budget is such that the path can visit either \(z_3\) or \(z_4\), but it is impossible to visit both \(z_3\) and \(z_4\) without exceeding the budget. If the network is in the configuration shown in Fig. 19a, and the random permutation of the viewpoints be such that the node \(z_3\) is presented as the first node, then the winner waypoint at \(z_4\) will be adapted to \(z_3\). When \(z_4\) is presented, the network adapts back to \(z_4\) (Fig. 19b). In this configuration, the waypoint may continue to oscillate between \(z_4\) and \(z_3\) until \(f(\sigma ,0)\) finally reaches zero. However, it is rare for such a configuration of viewpoint locations to occur. Also, the network may also adapt towards other viewpoints, which will increase the path length until the cost budget is reached, causing the oscillations to eventually cease.

Fig. 19
figure 19

An example scenario that may cause the network to oscillate between two viewpoint locations \(z_3\) and \(z_4\)

We also note that the algorithm maintains the best found solution (Algorithm 1 line 15). When the network is oscillating as described above, the best found solution is likely to remain constant. This is the solution that is returned by the algorithm. An empirical verification of the presented intuition behind the solution convergence is reported in Fig. 6, where the solution does not change after 50 learning epochs. The network itself does not change after 70 epochs, which further supports the presented idea that the network typically converges much sooner than \(i_{max}=1/\delta \) learning epochs.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Best, G., Faigl, J. & Fitch, R. Online planning for multi-robot active perception with self-organising maps. Auton Robot 42, 715–738 (2018). https://doi.org/10.1007/s10514-017-9691-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10514-017-9691-4

Keywords

Navigation