Fractional Segmental Transform for Speech Enhancement

Ram, Rashmirekha; Palo, Hemanta Kumar; Mohanty, Mihir Narayan

doi:10.1007/978-981-13-2182-5_14

Rashmirekha Ram¹⁸,
Hemanta Kumar Palo¹⁸ &
Mihir Narayan Mohanty¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 846))

312 Accesses
1 Citations

Abstract

The Fractional Fourier Transform (FrFT) can be interpreted as a rotation in the time-frequency plane with an angle α. It describes the speech signal characteristics as the signal changes from time to frequency domain. However, to locate the fractional Fourier domain frequency contents and multicomponent analysis of nonlinear chirp like signals such as speech the Short-Time FrFT (SFrFT) can provide an improved time-frequency resolution. By representing the time and fractional frequency domain information simultaneously, the SFrFT can filter out cross terms and distortion in a signal adequately for better signal enhancement. The method has experienced with better Signal to Noise Ratio (SNR) and Perceptual Evaluation of Speech Quality (PESQ) under different noisy conditions as compared to the conventional FrFT in our results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Loizou, P.C.: ‘Speech Enhancement: Theory and Practice’ (CRC Press, Boca Raton, FL, USA, 2007).
Google Scholar
P. Mowlaee, J. Stahl, & J. Kulmer, “Iterative joint MAP single-channel speech enhancement given non-uniform phase prior”. Speech Communication, 86, pp. 85–96, 2017.
Google Scholar
H. Barfuss, C. Huemmer, A. Schwarz, & W. Kellermann, “Robust coherence-based spectral enhancement for speech recognition in adverse real-world environments”, Computer Speech & Language, 46, pp. 388–400, 2017.
Google Scholar
R. Ram, M. N. Mohanty, “Comparative Analysis of EMD and VMD Algorithm in Speech Enhancement”, Journal of Natural Computing Research (IJNCR), 6(1), pp. 17–35, 2017.
Google Scholar
A. Bhowmick, M. Chandra, “Speech enhancement using voiced speech probability based wavelet decomposition”, Computers & Electrical Engineering, pp. 1–13, 2017 (in press).
Google Scholar
R. Ram, H. K. Palo, M. N. Mohanty, “An Adaptive Method for Emotional Speech Enhancement and Recognition Using PNN”, International Journal of Control Theory and Applications, 8(5), pp. 2395–2403, 2015.
Google Scholar
Y. Xu, J. Du, L. R. Dai, C. H. Lee, “A regression approach to speech enhancement based on deep neural networks”, IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 23(1), pp. 7–19, 2015.
Google Scholar
R. Ram, M. N. Mohanty, “Use of Fractional Domain for Speech Enhancement”, Int. J. Of Imaging and Robotics, 18, pp. 85–93, 2018.
Google Scholar
R. Ram, M. N. Mohanty, “Design of Fractional Fourier Transform based Filter for Speech Enhancement”, International Journal of Control Theory and Applications, 10(7), pp. 235–243, 2017.
Google Scholar
J. Wang, “Speech Enhancement based on Fractional Fourier transform”, WSEAS Transactions on Signal Processing, 10, pp. 576–581, 2014.
Google Scholar
R. Tao, Y. Lei, Y. Wang, “Short-time fractional Fourier transform and its applications”, IEEE Transaction on Signal Processing, 58(5), pp. 2568–2580, 2010.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics and Communication Engineering, ITER, Siksha ‘O’ Anusandhan (Deemed to be University), Bhubaneswar, Odisha, India
Rashmirekha Ram, Hemanta Kumar Palo & Mihir Narayan Mohanty

Authors

Rashmirekha Ram
View author publications
You can also search for this author in PubMed Google Scholar
Hemanta Kumar Palo
View author publications
You can also search for this author in PubMed Google Scholar
Mihir Narayan Mohanty
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rashmirekha Ram .

Editor information

Editors and Affiliations

Department of Electrical and Electronics Engineering, Velammal Engineering College, Chennai, Tamil Nadu, India
M. Arun Bhaskar
Department of Electrical and Electronics Engineering, Government College of Engineering, Keonjhar, Odisha, India
Subhransu Sekhar Dash
Electronics and Communication Sciences Unit, Indian Statistical Institute, Kolkata, West Bengal, India
Swagatam Das
Department of Electrical Engineering, Indian Institute of Technology Delhi, New Delhi, Delhi, India
Bijaya Ketan Panigrahi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ram, R., Palo, H.K., Mohanty, M.N. (2019). Fractional Segmental Transform for Speech Enhancement. In: Bhaskar, M., Dash, S., Das, S., Panigrahi, B. (eds) International Conference on Intelligent Computing and Applications. Advances in Intelligent Systems and Computing, vol 846. Springer, Singapore. https://doi.org/10.1007/978-981-13-2182-5_14

Download citation

DOI: https://doi.org/10.1007/978-981-13-2182-5_14
Published: 09 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2181-8
Online ISBN: 978-981-13-2182-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics