Real-time spectrum estimation–based dual-channel speech-enhancement algorithm for cochlear implant
Improvement of the cochlear implant (CI) front-end signal acquisition is needed to increase speech recognition in noisy environments. To suppress the directional noise, we introduce a speech-enhancement algorithm based on microphone array beamforming and spectral estimation. The experimental results indicate that this method is robust to directional mobile noise and strongly enhances the desired speech, thereby improving the performance of CI devices in a noisy environment.
The spectrum estimation and the array beamforming methods were combined to suppress the ambient noise. The directivity coefficient was estimated in the noise-only intervals, and was updated to fit for the mobile noise.
The proposed algorithm was realized in the CI speech strategy. For actual parameters, we use Maxflat filter to obtain fractional sampling points and cepstrum method to differentiate the desired speech frame and the noise frame. The broadband adjustment coefficients were added to compensate the energy loss in the low frequency band.
The approximation of the directivity coefficient is tested and the errors are discussed. We also analyze the algorithm constraint for noise estimation and distortion in CI processing. The performance of the proposed algorithm is analyzed and further be compared with other prevalent methods.
The hardware platform was constructed for the experiments. The speech-enhancement results showed that our algorithm can suppresses the non-stationary noise with high SNR. Excellent performance of the proposed algorithm was obtained in the speech enhancement experiments and mobile testing. And signal distortion results indicate that this algorithm is robust with high SNR improvement and low speech distortion.
- Chung, K, Zeng, FG (2009) Using hearing aid adaptive directional microphones to enhance cochlear implant performance. Hear Res 250: pp. 27-37 CrossRef
- Nelson, PB, Jin, SB, Carney, AE (2003) Understanding speech in modulated interference: cochlaer implant users and normal-hearing listeners. J Acoust Soc Am 113: pp. 961-968 CrossRef
- Zeng, FG, Nie, K, Stickney, GS, Kong, YY, Vongphoe, M, Bhargave, A, Wei, C, Cao, K (2005) Speech recognition with amplitude and frequency modulations. Proc Natl Acad Sci U S A 102: pp. 2293-2298 CrossRef
- Donaldson, GS, Kreft, HA, Litvak, L (2005) Place-pitch discrimination of single- versus dual-electrode stimuli by cochlear implant users (L). J Acoust Soc Am 118: pp. 623-626 CrossRef
- Kwon, BJ, van den Honert, C (2006) Dual-electrode pitch discrimination with sequential interleaved stimulation by cochlear implant users. J Acoust Soc Am 120: pp. 1-6 CrossRef
- Izzo, AD, Richter, CP, Jansen, ED (2006) Laser stimulation of the auditory nerve. Lasers Surg Med 38: pp. 745-753 CrossRef
- Chung, K, Zeng, FG, Acker, KN (2006) Effects of directional microphone and adaptive multichannel noise reduction algorithm on cochlear implant performance. J Acoust Soc Am 120: pp. 2216-2227 CrossRef
- Spriet, A, Van Deun, L, Eftaxiadis, K, Laneau, J, Moonen, M, Van Dijk, B, Van Wieringen, A, Wouters, J (2007) Speech understanding in background noise with the two-microphone adaptive beamforming BETM in the Nucleus Freedom cochlear implant system. Ear Hear 28: pp. 62-72 CrossRef
- Boll, SF (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech Signal Process 27: pp. 113-120 CrossRef
- Kamath, S, Loizou, P (2002) A multi-band spectral subtraction method for enhancing speech corrupted by colored noise. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, Florida. pp. 675-678
- Scalart, P, Filho, JV (1996) Speech enhancement based on a priori signal to noise estimation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, volume 2, Atlanta, GA. pp. 629-632 CrossRef
- Ephraim, Y, Van Trees, H (1995) A signal subspace approach for speech enhancement. IEEE Trans Speech Audio Process 3: pp. 251-266 CrossRef
- Bitzer, J, Simmer, KU, Kammeyer, KD (1999) Theoretical noise reductoin limits of the generalized sidelobe canceller (GSC) for speech enhancement. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Phoenix, AZ. pp. 2965-2968
- Flanagan, JL (1985) Computer-steered microphone arrays for sound transduction in large rooms. J Acoust Soc Am 78: pp. 1508-1518 CrossRef
- Marciano, JS, Vu, TB (2000) Reduced complexity beam space broadband frequency invariant beamforming. Electron Lett 36: pp. 682-683 CrossRef
- Elko, GW, Pong, ATN (1995) A simple adaptive first-order differential microphone. Proceedings of IEEE workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA. pp. 169-172
- Fischer, S, Simmer, KU (1996) Beamforming microphone arrays for speech acquisition in noisy environments. Speech Commun 20: pp. 215-227 CrossRef
- Luo, FL, Yang, J, Pavlovic, C (2002) Adaptive null-forming scheme in digital hearing aids. IEEE Trans Acoust Speech Signal Process 50: pp. 1583-1590
- Frost, OL (1972) An algorithm for linearly constrained adaptive array processing. Proc IEEE 60: pp. 926-935 CrossRef
- Van Veen, BD, Buckley, KM (1988) Beamforming: a versatile approach to spatial filtering. IEEE Trans Acoust Speech Signal Process 5: pp. 4-24
- Miyoshi, M, Kaneda, Y (1988) Inverse filtering of room acoustics. IEEE Trans Acoust Speech Signal Process 2: pp. 145-152 CrossRef
- Capon, J (1969) High resolution frequency-wavenumber spectrum analysis. Proc IEEE 57: pp. 1408-1418 CrossRef
- Johnson, DH (1982) The application of spectral estimation methods to bearing estimation problems. Proc IEEE 70: pp. 1018-1028 CrossRef
- Lockwood, ME, Jones, DL (2004) Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. J Acoust Soc Am 115: pp. 379-391 CrossRef
- Kates, JM, Weiss, MR (1996) A comparison of hearing-aid array-processing techniques. J Acoust Soc Am 99: pp. 3138-3148 CrossRef
- Qin, G, Yousheng, C (2011) Parameter section methods of delay and beamforming for cochlear implant speech enhancement. Acoust Phys 57: pp. 542-550 CrossRef
- Thiran, JP (1971) Recursive digital filters with maximally flat group delay. IEEE Trans Circuits Theory 18: pp. 659-664 CrossRef
- Cooklev, T, Nishihara, A (1993) Maximally flat FIR filters. Proc. IEEE Int. Symp. Circuits Syst. Chicago, USA. pp. 96-99
- Valimaki, V, Laakso, TI (2000) Principles of fractional delay filters. IEEE ICASSP’00. pp. 3870-3873
- Martin, E, Arild, L (2007) Maximally flat FIR and IIR fractional delay filters with expanded bandwidth. European Signal Processing Conference. Poznan, Poland. pp. 1038-1042
- Oppenheim, AV, Schfer, R (1975) Dig ital Signal Processing. Prentice-Hall, Englewood Cliffs, NJ
- Furui, S (1981) Cepstral analysis technique for automatics peaker verification [J]. IEEE Trans Acoust Speech Signal Process 29: pp. 254-272 CrossRef
- Viikki, O, Bye, D, Laurila, K (1998) A recursive feature vector normalization approach for robust speech recognition in noise. Proceedings of ICASSP' 98. IEEE Acoustics, Speech and Signal Processing Society, Seattle, WA, USA, pp. 733-736
- Yousheng, C, Qin, G (2011) A normalized beamforming algorithm for broadband speech using a continuous interleaved sampling strategy. IEEE Trans Audio Speech Language Process 20: pp. 868-874
- Wilson, BS, Lawson, DT, Zerbi, M (1993) Design and evaluation of a Continuous Interleaved Sampling(CIS) processing strategy for multichannel cochlear implants. J Rehabil Res Dev 30: pp. 110
- Psarros, CE, Plant, KL, Lee, K (2002) Conversion from the SPEAK to the ACE strategy in children using the Nucleus 24 cochlear implant system: speech perception and speech production outcomes. Ear Hear 23: pp. 18 CrossRef
- Greenwood, DD (1990) A cochlear frequency-position function for several species-29 years later. J Acoust Soc Am 87: pp. 2593-2605 CrossRef
- Greenberg, JE, Peterson, PM (1993) Intelligibility-weighted measures of speech-to-interference and speech system performance. J Acoust Soc Am 94: pp. 3009-3010 CrossRef
- American national standard mehthods for calculation of the speech inteligibility index.
- Chen, JD, Benesty, J, Huang, YT (2006) New insights into the noise reduction wiener filter. IEEE Trans Audio Speech Language Process 14: pp. 1218-1234 CrossRef
- Benesty, J, Chen, JD, Huang, YT (2011) Binaural noise reduction in the time domain with a stereo setup. IEEE Trans Audio Speech Language Process 19: pp. 2260-2272 CrossRef
- Chen, JD, Benesty, J, Huang, Y (2007) On the optimal linear filtering techniques for noise reduction. Speech Commun 49: pp. 305-316 CrossRef
- Real-time spectrum estimation–based dual-channel speech-enhancement algorithm for cochlear implant
- Open Access
- Available under Open Access This content is freely available online to anyone, anywhere at any time.
BioMedical Engineering OnLine
- Online Date
- September 2012
- Online ISSN
- BioMed Central
- Additional Links