Skip to main content
Log in

Learning based speech compressive subsampling

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this paper, we present a learning-based approach to speech compressive subsampling. Prior work in the field has mainly used random or deterministic matrices, which are unaffected by the structure and nature of the signal to be sampled. As a result, recovery algorithms require more computational and power resources to recover the original signal from a small number of samples. The framework presented in this paper aims to propose an efficient methodology for optimised speech sensing matrix design using standard speech datasets via offline training. The effectiveness of the suggested method is assessed using the signal-to-noise ratio that is obtained after reconstructing the original speech signal from samples supplied by the developed matrix. The baseline sensing matrices chosen for this work are widely used random and deterministic matrices, including the Gaussian random matrix, the Bernoulli random matrix, the Toeplitz matrix, and the Hadamard matrix. Convex relaxation and linear reconstruction algorithms were used to perform the signal reconstruction, and the key performance indicators for the relevant sensing matrix are chosen to be reconstruction accuracy and speed. In this work, we observe almost 4% on average improvement in reconstruction accuracy when the proposed method is compared with one of the efficient sensing matrices.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data Availability

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

References

  1. Ahmed I, Ahmad N, Ali H, Ahmad G (2012) The development of isolated words pashto automatic speech recognition system. In: 2012 18th international conference on automation and computing (ICAC), IEEE, pp 1–4

  2. Ahmed I, Ali H, Ahmad N, Ahmad G (2012) The development of isolated words corpus of pashto for the automatic speech recognition research. In: 2012 International conference of robotics and artificial intelligence, IEEE, pp 139–143

  3. Ahmed I, Khan A (2022) Genetic algorithm based framework for optimized sensing matrix design in compressed sensing. Multimed Tools Appl, pp 1–26

  4. Ahmed I, Khan A, Ahmad N, Ali H (2020) Speech signal recovery using block sparse bayesian learning. Arab J Sci Eng 45(3):1567–1579

    Article  Google Scholar 

  5. Ahmed I, Khan A, Khan A, Mujahid K, Khan N (2021) Efficient measurement matrix for speech compressive sampling. Multimed Tools Appl 80(13):20327–20343

    Article  Google Scholar 

  6. Bala S, Arif M (2015) Performance comparison of discrete transforms on speech compressed sensing. In: 2015 IEEE international conference on computational intelligence & communication technology (CICT), IEEE, pp 632–637

  7. Baldassarre L, Li Yen-Huan, Scarlett J, Gözcü B, Bogunovic I, Cevher V (2016) Learning-based compressive subsampling. IEEE J Sel Top Signal Process 10(4):809–822

    Article  Google Scholar 

  8. Baraniuk R, Davenport MA, Duarte MF, Hegde C et al (2011) An introduction to compressive sensing. Connexions e-textbook, pp 24–76

  9. Bhatti UA, Huang M, Di W u, Zhang Y u, Mehmood A, Han H (2019) Recommendation system using feature extraction and pattern recognition in clinical care systems. Enterprise Inform Syst 13(3):329–351

    Article  Google Scholar 

  10. Bhatti UA, Huang M, Wang H, Zhang Y u, Mehmood A, Di W u (2018) Recommendation system for immunization coverage and monitoring. Hum Vaccines Immunother 14(1):165–171

    Article  Google Scholar 

  11. Bhatti UA, Yu Z, Chanussot J, Zeeshan Z, Yuan L, Luo W, Nawaz SA, Bhatti MA, Ain QU, Mehmood A (2021) Local similarity-based spatial–spectral fusion hyperspectral image classification with deep cnn and gabor filtering. IEEE Trans Geosci Remote Sens 60:1–15

    Article  Google Scholar 

  12. Bhatti UA, Yu Z, Hasnain A, Nawaz SA, Yuan L, Wen L, Bhatti MA (2022) Evaluating the impact of roads on the diversity pattern and density of trees to improve the conservation of species. Environ Sci Pollut Res 29 (10):14780–14790

    Article  Google Scholar 

  13. Bhatti UA, Zeeshan Z, Nizamani MM, Bazai S, Yu Z, Yuan L (2022) Assessing the change of ambient air quality patterns in jiangsu province of China pre-to post-covid-19. Chemosphere 288:132569

    Article  Google Scholar 

  14. Candes EJ (2008) The restricted isometry property and its implications for compressed sensing. CR Math 346(9-10):589–592

    MathSciNet  MATH  Google Scholar 

  15. Candès EJ, Wakin MB (2008) An introduction to compressive sampling. IEEE Signal Process Mag 25(2):21–30

    Article  Google Scholar 

  16. Donoho DL (2006) Compressed sensing. IEEE Trans Inform Theory 52(4):1289–1306

    Article  MathSciNet  MATH  Google Scholar 

  17. Liu F, Zhao L, Cheng X, Dai Q, Shi X, Qiao J (2020) Fine-grained action recognition by motion saliency and mid-level patches. Appl Sci 10 (8):2811

    Article  Google Scholar 

  18. Mahabadi RK, Aprile C, Cevher V (2018) Real-time dct learning-based reconstruction of neural signals. In: 2018 26Th european signal processing conference (EUSIPCO), IEEE, pp 1925–1929

  19. Parchami M, Amindavar H, Zhu W (2020) Learning-based design of random measurement matrix for compressed sensing with inter-column correlation using copula function. IET Signal Process 14(6):385–395

    Article  Google Scholar 

  20. Piotr I (2010) Sparse recovery using sparse random matrices. In: LATIN, pp 157

  21. Rani M, Dhok SB, Deshmukh RB (2018) A systematic review of compressive sensing: Concepts, implementations and applications. IEEE Access 6:4875–4894

    Article  Google Scholar 

  22. Srinivas K, Srinivas N, Kumar PK, Pradhan G (2018) Performance comparison of measurement matrices in compressive sensing. In: International conference on advances in computing and data sciences, Springer, pp 342–351

  23. Thapliyal M, Ahuja NJ, Shankar A, Cheng X, Kumar M (2022) A differentiated learning environment in domain model for learning disabled learners. J Comput High Educ 34(1):60–82

    Article  Google Scholar 

Download references

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Irfan Ahmed.

Ethics declarations

Conflict of Interests

The authors declare no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ahmed, I., Khan, A. Learning based speech compressive subsampling. Multimed Tools Appl 82, 15327–15343 (2023). https://doi.org/10.1007/s11042-022-14003-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-14003-7

Keywords

Navigation