Skip to main content
Log in

Near-optimal probabilistic search using spatial Fourier sparse set

  • Published:
Autonomous Robots Aims and scope Submit manuscript

Abstract

Autonomous search is an essential research topic for rescue and other robotic applications. However, searching for targets efficiently is still an unsolved problem. To achieve this objective, a robot needs to simultaneously maximize environmental coverage, maximize probability of detection (PD) and minimize motion cost. The problems associated with these objectives are NP-hard. This research reformulates the three objective functions as a maximum cumulative PD problem with motion cost. Since the PD function depends on the environment, the robot needs to both learn the PD function and the cost-to-go (CTG) function. This research proposes a reinforcement learning algorithm to learn the PD and CTG functions simultaneously. Since the PD function is sparse in the Fourier domain under certain subgoal patterns, spatial Fourier sparse set is proposed to learn PD functions based on the compressed sensing technique. The learned PD and CTG functions can then be used to generate subgoals that achieve \((1-1/e)\) of the optimum due to the submodularity. Experiments conducted with this algorithm demonstrate that the robot can search for the target faster than prior learning approaches (e.g., PMAC and FSS) and the benchmark model (e.g., PD).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21

Similar content being viewed by others

Notes

  1. The conditional probability table is computed based on the detection outcome. There are four cases: true positive, false negative, false positive, and true negative. For example, if the target is there and the sensor cannot detect, this is called false negative.

  2. The H-area is defined as the \(20 \times 20\) cm\(^2\) area around the highest probability cell. If the target is in the H-area and \(P(A)>90\%\), it is a positive decision.

  3. The value of glimpse function can be decided from the conditional probability table of the given detector. \(g=\frac{q-pq}{1-pq}\), where p denotes the probability of detected area. q denotes the probability of that the robot detects the target if the target is at the detected area. The derivation of glimpse function can be found in the appendix of Tseng and Mettler (2015).

  4. The number of episodes means the number of trials in reinforcement learning.

  5. In EX1, 12 subgoals can cover the environments over \(80\%\). In EX2, since the motion cost is considered, the robot needs more subgoals to cover over \(80\%\).

  6. \(3\%\) is a noisy measurement since the maximal coverage of one subgoal is less than \(15\%\).

  7. SFSS can converge within 2000–3000 samples but it needs to explore the environment. Hence, it takes 10+ episodes.

References

  • Aggarwal, A. (1984). The art gallery theorem: Its variations, applications, and algorithmic aspects,. Ph.D. thesis, Johns Hopkins University.

  • Balcan, M. F., & Harvey, N. J. (2011). Learning submodular functions. In Proceedings of the 43rd annual ACM symposium on theory of computing.

  • Beck, A., & Teboulle, M. (2009). A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences, 2, 183–202.

    Article  MathSciNet  MATH  Google Scholar 

  • Bellingham, J., Richards, A., & How, J. P. (2002). Receding horizon control of autonomous aerial vehicles. American Control Conference, 5, 3741–3746.

    Google Scholar 

  • Binney, J., & Sukhatme, G. S. (2012). Branch and bound for informative path planning. In IEEE international conference on robotics and automation (pp. 2147–2154).

  • Borenstein, J., & Koren, Y. (1991). The vector field histogram-fast obstacle avoidance for mobile robots. IEEE Transactions on Robotics and Automation, 7(3), 278–288.

    Article  Google Scholar 

  • Bourgault, F., Furukawa, T., & Durrant-Whyte, H. F. (2003). Coordinated decentralized search for a lost target in a bayesian world. In IEEE/RSJ international intelligent robots and systems (pp. 979–1000).

  • Candes, E., Romberg, J., & Tao, T. (2006). Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Transaction on Information Theory, 52(2), 489–509.

    Article  MathSciNet  MATH  Google Scholar 

  • Charrow, B., Liu, S., Kumar, V., & Michael, N. (2015). Information-theoretic mapping using Cauchy–Schwarz quadratic mutual information. In IEEE international conference on robotics and automation (pp. 4791–4798).

  • Chen, W., Rodrigues, M. R. D., & Wassell, I. J. (2011). Distributed compressive sensing reconstruction via common support discovery. In IEEE international conference on communications (pp. 1–5).

  • Chung, T. H., & Carpin, S. (2011). Multiscale search using probabilistic quadtrees. In IEEE international conference on robotics and automation (pp. 2546–2553).

  • Cortes, J., Martinez, S., Karatas, T., & Bullo, F. (2004). Coverage control for mobile sensing networks. In IEEE international conference on robotics and automation (pp. 243–255).

  • Eagle, J. N. (1984). The optimal search for a moving target when the search path is constrained. Operations Research, 32(5), 1107–1115.

    Article  MathSciNet  MATH  Google Scholar 

  • Feige, U. (1998). A threshold of ln n for approximating set cover. Journal of the ACM, 45(4), 634–652.

    Article  MathSciNet  MATH  Google Scholar 

  • Gerkey, B. P., Thrun, S., & Gordon, G. (2006). Visibility-based pursuit-evasion with limited field of view. The International Journal of Robotics Research, 25(4), 299–315.

    Article  Google Scholar 

  • Goerzen, C., Kong, Z., & Mettler, B. (2010). A survey of motion planning algorithms from the perspective of autonomous UAV guidance. Journal of Intelligent and Robotic Systems, 57, 65. doi:10.1007/s10846-009-9383-1.

    Article  MATH  Google Scholar 

  • Hayashi, K., Nagahara, M., & Tanaka, T. (2013). A user’s guide to compressed sensing for communications systems. In IEICE transactions on communications E96-B(3), 685–712.

  • Heng, L., Gotovos, A., Krause, A., & Pollefeys, M. (2015). Efficient visual exploration and coverage with a micro aerial vehicle in unknown environments. In IEEE international conference on robotics and automation (pp. 1071–1078).

  • Hollinger, G., Choudhuri, C., Mitra, U., & Sukhatme, G. S. (2013). Squared error distortion metrics for motion planning in robotic sensor networks. In Proceedings of the international workshop wireless networking for unmanned autonomous vehicles (pp. 1426–1431).

  • Hollinger, G., Kehagias, A., & Singh, S. (2010). Gsst: Anytime guaranteed search. Autonomous Robots, 29(1), 99–118.

    Article  Google Scholar 

  • Hollinger, G., & Singh, S. (2008). Proofs and experiments in scalable, near-optimal search by multiple robots. Robotics: Science and Systems,. doi:10.15607/RSS.2008.IV.027.

    Google Scholar 

  • Hollinger, G., & Sukhatme, G. S. (2013). Sampling-based motion planning for robotic information gathering. Robotics: Science and Systems Conference,. doi:10.15607/RSS.2013.IX.051.

    Google Scholar 

  • Kadane, J. B., & Simon, H. A. (1977). Optimal strategies for a class of constrained sequential problems. The Annals of Statistics, 5(2), 237–255.

    Article  MathSciNet  MATH  Google Scholar 

  • Khuller, S., Moss, A., & Naor, J. (1999). The budgeted maximum coverage problem. Information Processing Letters, 70(1), 39–45.

    Article  MathSciNet  MATH  Google Scholar 

  • King, J., & Kirkpatrick, D. (2011). Improved approximation for guarding simple galleries from the perimeter. Journal Discrete and Computational Geometry, 46(2), 252–269.

    Article  MathSciNet  MATH  Google Scholar 

  • Konolige, K. (1997). Improved occupancy grids for map building. Autonomous Robots, 4, 351–367.

    Article  Google Scholar 

  • Kosmatopoulos, E. B. (2009). An adaptive optimization scheme with satisfactory transient performance. Automatica, 45(3), 716–723.

    Article  MathSciNet  MATH  Google Scholar 

  • Krause, A., & Guestrin, C. (2005). Near-optimal nonmyopic value of information in graphical models. In Twenty-First conference on uncertainty in artificial intelligence (pp. 324–331).

  • Krause, A., Singh, A., & Guestrin, C. (2008). Near-optimal sensor placements in gaussian processes: Theory, efficient algorithms and empirical studies. The Journal of Machine Learning Research, 9, 235–284.

    MATH  Google Scholar 

  • Lau, H., Huang, S., & Dissanayake, G. (2006). Probabilistic search for a moving target in an indoor environment. In IEEE/RSJ international conference on intelligent robots and systems (pp. 3393–3398).

  • Lau, H., Huang, S., & Dissanayake, G. (2008). Discounted mean bound for the optimal searcher path problem with non-uniform travel times. European Journal of Operational Research, 190(2), 383–397.

    Article  MathSciNet  MATH  Google Scholar 

  • LaValle, S. M., & Kuffner, J. J. (1999). Randomized kinodynamic planning. IEEE International Conference on Robotics and Automation, 1, 473–479.

    Google Scholar 

  • Lloyd, S. P. (1982). Least-squares quantization in pcm. In IEEE transactions on information theory (pp. 129–137).

  • Lo, N., Berger, J., Noel, M. (2012). Toward optimizing static target search path planning. In IEEE symposium on computational intelligence for security and defence applications (pp. 1–7).

  • McCue, B. (1990). U-boats in the bay of biscay: An essay in operations analysis. Washington: National Defense University Press.

  • Mettler, B., & Kong, Z. (2008). Receding horizon trajectory optimization with a finite-state value function approximation. In American control conference (pp. 3810–3816)

  • Mettler, B., Tehrani, N. D., & Kong, Z. (2010). Agile autonomous guidance using spatial value functions. Control Engineering Practice, 18(7), 773–788.

    Article  Google Scholar 

  • Montemerlo, M., Thrun, S., Koller, D., Wegbreit, B. (2003). Fastslam 2.0: An improved particle filtering algorithm for simultaneous localization and mapping that provably converges. In Proceedings of the sixteenth international joint conference on artificial intelligence (pp. 1151–1156).

  • Narasimhan, M., & Bilmes, J. (2007). Local search for balanced submodular clusterings. In Proceedings of the 20th international joint conference on artifical intelligence (pp. 981–986).

  • Nemhauser, G. L., Wolsey, L. A., & Fisher, M. L. (1978). An analysis of approximations for maximizing submodular set functions. I. Mathematical Programming, 14(1), 265–294.

    Article  MathSciNet  MATH  Google Scholar 

  • ORourke, J., & Supowit, K. (1983). Some NP-hard polygon decomposition problems. IEEE Transactions on Information Theory, 29(2), 181–190.

    Article  MathSciNet  MATH  Google Scholar 

  • Pimenta, L. C. A., Schwager, M., Lindsey, Q., Kumar, V., Rus, D., Mesquita, R. C., et al. (2009). Simultaneous coverage and tracking (scat) of moving targets with robot networks, Algorithmic foundation of robotics VIII (pp. 85–99). Berlin: Springer.

  • Renzaglia, A., Doitsidis, L., Martinelli, A., Kosmatopoulos, E. B. (2010). Cognitive-based adaptive control for cooperative multi-robot coverage. In IEEE/RSJ international intelligent robots and systems (pp. 3314–3320).

  • Renzaglia, A., Doitsidis, L., Martinelli, A., Kosmatopoulos, E. B. (2011). Adaptive-based distributed cooperative multi-robot coverage. In American control conference (pp. 468–473).

  • Renzaglia, A., Doitsidis, L., Martinelli, A., & Kosmatopoulos, E. B. (2012). Multi-robot three dimensional coverage of unknown areas. The International Journal of Robotics Research, 31(6), 738–752.

    Article  Google Scholar 

  • Richards, A., Boyle, P. (2010). Combining planning and learning for autonomous vehicle navigation. In AIAA guidance, navigation and control conference.

  • Schwager, M., Julian, B. J., & Rus, D. (2009). Optimal coverage for multiple hovering robots with downward facing cameras. In IEEE international conference on robotics and automation (pp. 3515–3522).

  • Singh, A., Krause, A., Guestrin, C., Kaiser, W., & Batalin, M. (2007). Efficient planning of informative paths for multiple robots. In Proceedings of the 20th international joint conference on artificial intelligence (pp. 2204–2211).

  • Singh, A., Krause, A., & Kaiser, W. (2009). Nonmyopic adaptive informative path planning for multiple robots. In International joint conference on artificial intelligence (pp. 1843–1850).

  • Stobbe, P., & Krause, A. (2012). Learning Fourier sparse set functions. In Proceedings of the fifteenth international conference on artificial intelligence and statistics (pp. 1125–1133).

  • Stone, L. D. (1975). The theory of optimal search. Operations Research Society of America.

  • Tokekar, P., & Isler, V. (2014). Polygon guarding with orientation. In IEEE international conference on robotics and automation (pp. 1014–1019).

  • Trummel, K. E., & Weisinger, J. R. (1986). The complexity of the optimal searcher path problem. Operations Research, 34(2), 324–327.

    Article  MathSciNet  MATH  Google Scholar 

  • Tseng, K. S., & Mettler, B. (2015). Near-optimal probabilistic search via submodularity and sparse regression. Autonomous Robots,. doi:10.1007/s10514-015-9521-5.

    Google Scholar 

  • Vig, L., & Adams, J. A. (2007). Coalition formation: From software agents to robots. Journal of Intelligent and Robotic Systems, 1, 85–118.

    Article  Google Scholar 

Download references

Acknowledgements

This research was completed thanks to the financial support from ONR Grant 1361538 and NSF CAREER CMMI 1254906. Kuo-Shih would like to thank his daughter, Chin-Chun Tseng. The hide-and-seek game they played inspires the learning concept for search problems.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bérénice Mettler.

Additional information

This is one of several papers published in Autonomous Robots comprising the Special Issue on Active Perception.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (mp4 23269 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tseng, KS., Mettler, B. Near-optimal probabilistic search using spatial Fourier sparse set. Auton Robot 42, 329–351 (2018). https://doi.org/10.1007/s10514-017-9616-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10514-017-9616-2

Keywords

Navigation