Abstract
Purpose
The accuracy of real-time sound source localization is an important issue in acoustics. The steered sample algorithm (SSA), an algorithm developed based on the reciprocity of wave propagation, has a higher spatial resolution than the steered response power – phase transform (SRP-PHAT) algorithm. The algorithm can also render an accurate sound source localization under limited array elements and low signal-to-noise ratio. However, the actual implementation of the algorithm is usually based on an expensive grid search process, which makes the computational cost a serious problem.
Methods
An improved implementing algorithm based on beetle swarm optimization (BSO) is proposed for SSA, which can effectively reduce the computational cost. The modified algorithm has a good convergence speed as the SSA algorithm has fewer local extremums.
Results
Experimental results demonstrate that compared with steered sample algorithm (SSA), the proposed algorithm has almost the same localization performance and robustness with lower computational cost.
Conclusion
This paper proposes an improved algorithm about SSA algorithm. Compared with SRC algorithm, the amount of computation is reduced more than twice. Meanwhile, the proposed algorithm inherits the anti-noise performance of SSA. Under the condition of low SNR, the positioning success rate and RMSE performance are excellent. Under the condition of high reverberation, the improved algorithm needs more particles to ensure the positioning performance. When the number of particles is not less than 70, the localization success rate of the proposed algorithm is highly consistent with the conventional SSA, and the RMSE is slightly less than the conventional SSA. Compared with SRC algorithm, the proposed algorithm better inherits the robustness of original SSA and improves the positioning accuracy.
Similar content being viewed by others
Data availability
The data used to support the findings of this study are available from the corresponding author upon request.
References
Zhao XY, Tang J, Zhou L et al (2013) Accelerated steered response power method for sound source localization via clustering search. Sci China Phys Mech Astron 56(7):1329–1338
Cobos M, Marti A, Lopez JJ (2010) A modified SRP-PHAT functional for robust real-time sound source localization with scalable spatial sampling. IEEE Signal Process Lett 18(1):71–74
Sudo Y, Itoyama K, Nishida K et al (2020) Sound event aware environmental sound segmentation with Mask U-Net. Adv Robot 34(20):1280–1290
Tang B, Zhang D, Jiang H et al (2020) Optimization of energy management strategy for the EPS with hybrid power supply based on PSO algorithm. Energies 13(1):428
Brown AD, Tollin DJ (2018) Temporal constraints on neural and psychophysical sensitivity to interaural level difference cues to sound source location. J Acoust Soc Am 143(3):1785–1785
Nhu VH, Thao N, Pham TD et al (2020) A new hybrid firefly–PSO optimized random subspace tree intelligence for torrential rainfall-induced flash flood susceptible mapping. Remote Sens 12(17):2688
Grondin F, Glass J (2019) SVD-PHAT: A Fast Sound Source Localization Method, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)IEEE
Miyatake M, Veerachary M et al (2011) Maximum power point tracking of multiple photovoltaic arrays: a PSO approach. IEEE Trans Aerosp Electron Syst 47(1):367–380
Pastore MT, Yost W (2019) Auditory detection and sound source elevation. J Acoust Soc Am 145(3):1723–1723
Chen J, Takashima R, Guo X et al (2021) Multimodal fusion for indoor sound source localization. Pattern Recogn 115(3):107906
Seewald LA, Gonzaga L, Veronez MR et al (2014) Combining SRP-PHAT and two kinects for 3D sound source localization. Expert Syst Appl 41(16):7106–7113
Pertila P, Cakir E (2017) Robust direction estimation with convolutional neural networks based steered response power, IEEE International Conference on Acoustics, pp. 6125–6129
Liu B, Zhang L, Nie P et al (2020) Steered sample algorithm for acoustic source localization. PLoS ONE 15(10):e0241129–e0241129
Bemani A, Xiong Q, Baghban A et al (2020) Modeling of cetane number of biodiesel from fatty acid methyl ester (FAME) information using GA-, PSO-, and HGAPSO- LSSVM models. Renew Energy 150(1):924–934
Chen T, Zhu Y, Teng J (2018) Beetle swarm optimization for solving investment portfolio problems. J Eng 16:1600–1605
Acknowledgements
This work is supported by the Hubei Key Research and Development Program of China (2022BAA099)
Funding
This study was supported by Hubei Key Research and Development Program of China (2022BAA099).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, L., Liu, C., Song, X. et al. A Modified SSA Function for Real-Time Sound Source Localization. J. Vib. Eng. Technol. (2023). https://doi.org/10.1007/s42417-023-01168-0
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42417-023-01168-0