Real-time spectrum estimation–based dual-channel speech-enhancement algorithm for cochlear implant
Improvement of the cochlear implant (CI) front-end signal acquisition is needed to increase speech recognition in noisy environments. To suppress the directional noise, we introduce a speech-enhancement algorithm based on microphone array beamforming and spectral estimation. The experimental results indicate that this method is robust to directional mobile noise and strongly enhances the desired speech, thereby improving the performance of CI devices in a noisy environment.
The spectrum estimation and the array beamforming methods were combined to suppress the ambient noise. The directivity coefficient was estimated in the noise-only intervals, and was updated to fit for the mobile noise.
The proposed algorithm was realized in the CI speech strategy. For actual parameters, we use Maxflat filter to obtain fractional sampling points and cepstrum method to differentiate the desired speech frame and the noise frame. The broadband adjustment coefficients were added to compensate the energy loss in the low frequency band.
The approximation of the directivity coefficient is tested and the errors are discussed. We also analyze the algorithm constraint for noise estimation and distortion in CI processing. The performance of the proposed algorithm is analyzed and further be compared with other prevalent methods.
The hardware platform was constructed for the experiments. The speech-enhancement results showed that our algorithm can suppresses the non-stationary noise with high SNR. Excellent performance of the proposed algorithm was obtained in the speech enhancement experiments and mobile testing. And signal distortion results indicate that this algorithm is robust with high SNR improvement and low speech distortion.
- Chung K, Zeng FG: Using hearing aid adaptive directional microphones to enhance cochlear implant performance. Hear Res 2009, 250:27–37. CrossRef
- Nelson PB, Jin SB, Carney AE: Understanding speech in modulated interference: cochlaer implant users and normal-hearing listeners. J Acoust Soc Am 2003,113(2):961–968. CrossRef
- Zeng FG, Nie K, Stickney GS, Kong YY, Vongphoe M, Bhargave A, Wei C, Cao K: Speech recognition with amplitude and frequency modulations. Proc Natl Acad Sci U S A 2005,102(7):2293–2298. CrossRef
- Donaldson GS, Kreft HA, Litvak L: Place-pitch discrimination of single- versus dual-electrode stimuli by cochlear implant users (L). J Acoust Soc Am 2005,118(2):623–626. CrossRef
- Kwon BJ, van den Honert C: Dual-electrode pitch discrimination with sequential interleaved stimulation by cochlear implant users. J Acoust Soc Am 2006,120(1):1–6. CrossRef
- Izzo AD, Richter CP, Jansen ED, et al.: Laser stimulation of the auditory nerve. Lasers Surg Med 2006,38(8):745–753. CrossRef
- Chung K, Zeng FG, Acker KN: Effects of directional microphone and adaptive multichannel noise reduction algorithm on cochlear implant performance. J Acoust Soc Am 2006, 120:2216–2227. CrossRef
- Spriet A, Van Deun L, Eftaxiadis K, Laneau J, Moonen M, Van Dijk B, Van Wieringen A, Wouters J: Speech understanding in background noise with the two-microphone adaptive beamforming BETM in the Nucleus Freedom cochlear implant system. Ear Hear 2007,28(1):62–72. CrossRef
- Boll SF: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech Signal Process 1979,27(2):113–120. CrossRef
- Kamath S, Loizou P: A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, Florida 2002, 675–678.
- Scalart P, Filho JV: Speech enhancement based on a priori signal to noise estimation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, volume 2, Atlanta, GA 1996, 629–632. CrossRef
- Ephraim Y, Van Trees H: A signal subspace approach for speech enhancement. IEEE Trans Speech Audio Process 1995,3(4):251–266. CrossRef
- Bitzer J, Simmer KU, Kammeyer KD: Theoretical noise reductoin limits of the generalized sidelobe canceller (GSC) for speech enhancement. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Phoenix, AZ 1999, 2965–2968.
- Flanagan JL: Computer-steered microphone arrays for sound transduction in large rooms. J Acoust Soc Am 1985, 78:1508–1518. CrossRef
- Marciano JS, Vu TB: Reduced complexity beam space broadband frequency invariant beamforming. Electron Lett 2000, 36:682–683. CrossRef
- Elko GW, Pong ATN: A simple adaptive first-order differential microphone. Proceedings of IEEE workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA 1995, 169–172.
- Fischer S, Simmer KU: Beamforming microphone arrays for speech acquisition in noisy environments. Speech Commun 1996,20(3):215–227. CrossRef
- Luo FL, Yang J, Pavlovic C, et al.: Adaptive null-forming scheme in digital hearing aids. IEEE Trans Acoust Speech Signal Process 2002,50(7):1583–1590.
- Frost OL: An algorithm for linearly constrained adaptive array processing. Proc IEEE 1972, 60:926–935. CrossRef
- Van Veen BD, Buckley KM: Beamforming: a versatile approach to spatial filtering. IEEE Trans Acoust Speech Signal Process 1988, 5:4–24.
- Miyoshi M, Kaneda Y: I nverse filtering of room acoustics . IEEE Trans Acoust Speech Signal Process 1988,2(36):145–152. CrossRef
- Capon J: High resolution frequency-wavenumber spectrum analysis. Proc IEEE 1969, 57:1408–1418. CrossRef
- Johnson DH: The application of spectral estimation methods to bearing estimation problems. Proc IEEE 1982, 70:1018–1028. CrossRef
- Lockwood ME, Jones DL, et al.: Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. J Acoust Soc Am 2004,115(1):379–391. CrossRef
- Kates JM, Weiss MR: A comparison of hearing-aid array-processing techniques. J Acoust Soc Am 1996, 99:3138–3148. CrossRef
- Qin G, Yousheng C: Parameter section methods of delay and beamforming for cochlear implant speech enhancement. Acoust Phys 2011,57(4):542–550. CrossRef
- Thiran JP: Recursive digital filters with maximally flat group delay. IEEE Trans Circuits Theory 1971,18(6):659–664. CrossRef
- Cooklev T, Nishihara A: Maximally flat FIR filters. Proc. IEEE Int. Symp. Circuits Syst. Chicago, USA 1993, 96–99.
- Valimaki V, Laakso TI: Principles of fractional delay filters. IEEE ICASSP’00 2000, 3870–3873.
- Martin E, Arild L: Maximally flat FIR and IIR fractional delay filters with expanded bandwidth. European Signal Processing Conference. Poznan, Poland 2007, 1038–1042.
- Oppenheim AV, Schfer R: Dig ital Signal Processing. Prentice-Hall, Englewood Cliffs, NJ; 1975.
- Furui S: Cepstral analysis technique for automatics peaker verification [J]. IEEE Trans Acoust Speech Signal Process 1981,29(2):254–272. CrossRef
- Viikki O, Bye D, Laurila K: A recursive feature vector normalization approach for robust speech recognition in noise. In Proceedings of ICASSP' 98. IEEE Acoustics, Speech and Signal Processing Society, Seattle, WA, USA; 1998:733–736.
- Yousheng C, Qin G: A normalized beamforming algorithm for broadband speech using a continuous interleaved sampling strategy. IEEE Trans Audio Speech Language Process 2011,20(3):868–874.
- Wilson BS, Lawson DT, Zerbi M, et al.: Design and evaluation of a Continuous Interleaved Sampling(CIS) processing strategy for multichannel cochlear implants. J Rehabil Res Dev 1993,30(1):110.
- Psarros CE, Plant KL, Lee K, et al.: Conversion from the SPEAK to the ACE strategy in children using the Nucleus 24 cochlear implant system: speech perception and speech production outcomes. Ear Hear 2002,23(18):18. CrossRef
- Greenwood DD: A cochlear frequency-position function for several species-29 years later. J Acoust Soc Am 1990,87(6):2593–2605. CrossRef
- Greenberg JE, Peterson PM: Intelligibility-weighted measures of speech-to-interference and speech system performance. J Acoust Soc Am 1993,94(5):3009–3010. CrossRef
- Acoustical Society of American: American national standard mehthods for calculation of the speech inteligibility index. 1997. ANSI S3.5
- Chen JD, Benesty J, Huang YT: New insights into the noise reduction wiener filter. IEEE Trans Audio Speech Language Process 2006,14(4):1218–1234. CrossRef
- Benesty J, Chen JD, Huang YT: Binaural noise reduction in the time domain with a stereo setup. IEEE Trans Audio Speech Language Process 2011,19(8):2260–2272. CrossRef
- Chen JD, Benesty J, Huang Y: On the optimal linear filtering techniques for noise reduction. Speech Commun 2007, 49:305–316. CrossRef
- Real-time spectrum estimation–based dual-channel speech-enhancement algorithm for cochlear implant
- Open Access
- Available under Open Access This content is freely available online to anyone, anywhere at any time.
BioMedical Engineering OnLine
- Online Date
- September 2012
- Online ISSN
- BioMed Central
- Additional Links