Skip to main content
Log in

Spatio-Temporal Resolution Enhancement of Vocal Tract MRI Sequences—A Comparison Among Wiener Filter Based Methods

  • Published:
Journal of Mathematical Imaging and Vision Aims and scope Submit manuscript

Abstract

Articulatory Synthesis consists in reproducing speech by means of models of the vocal tract and of articulatory processes. Recent advances in Magnetic Resonance Imaging (MRI) allowed for important improvements with respect to the speech comprehension and the forms taken by the vocal tract. However, one of the main challenges in the field is the fast and at the same time high-quality acquisition of image sequences. Since adopting more powerful acquisition devices might be financially inviable, we propose a method for the spatio-temporal resolution enhancement of the obtained sequences using only digital image processing techniques. The approach involves two stages: (1) the temporal resolution enhancement by means of a motion compensated interpolation technique; and (2) the spatial resolution enhancement by means of a super resolution image reconstruction technique. Considering the spatial resolution enhancement, inspired by two methods available in the literature, three adaptations of the Wiener filter were proposed: the statistical interpolation, the multi-temporal approach, and the adaptive Wiener filter. In all cases, a separable Markovian model and an isotropic model were compared for the characterization of the spatial correlation structures. Considering all Wiener filter-based approaches, the adaptive Wiener filter outperformed all other approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

References

  1. Bresch, E., Narayanan, S.: Region segmentation in the frequency domain applied to upper airway Real-Time magnetic resonance images. IEEE Trans. Med. Imaging 28(3), 323–338 (2009)

    Article  Google Scholar 

  2. Baer, T., Gore, J.C., Gracco, L.C., Nye, P.W.: Analysis of vocal tract shape and dimensions using magnetic resonance imaging: vowels. J. Acoust. Soc. Am. 90(2), 799–828 (1991)

    Article  Google Scholar 

  3. Bresch, E., Kim, Y.-C., Nayak, K., Byrd, D., Narayanan, S.: Seeing speech: capturing vocal tract shaping using real-time magnetic resonance imaging [exploratory DSP]. IEEE Signal Process. Mag. 25(3), 123–132 (2008)

    Article  Google Scholar 

  4. Engwall, O.: Combining MRI, EMA & EPG measurements in a three-dimensional tongue model. In: Speech Communication, vol. 41, pp. 303–329 (2003)

    Google Scholar 

  5. Martins, A.L.D., Mascarenhas, N.D.A., Suazo, C.A.T.: Spatio-Temporal resolution enhancement of vocal tract MRI sequences based on image registration. Integr. Comput.-Aided Eng. 18(3), 143–155 (2011)

    Google Scholar 

  6. Rueckert, D., Sodona, L.I., Hayes, C., Hill, D.L.G., Leach, M.O., Hawkes, D.J.: Nonrigid registration using Free-Form deformations: application to breast MR images. IEEE Trans. Med. Imaging 18(8), 712–721 (1999)

    Article  Google Scholar 

  7. Hardie, R.A.: A fast image super-resolution algorithm using an adaptive Wiener flter. IEEE Trans. Image Process. 16(12), 2953–2964 (2007)

    Article  MathSciNet  Google Scholar 

  8. Mascarenhas, N.D.A., Banon, G.J.F., Candeias, A.L.B.: Multispectral image data fusion under bayesian approach. Int. J. Remote Sens. 17(8), 1457–1471 (1996)

    Article  Google Scholar 

  9. Rueckert, D., Aljabar, P.: Nonrigid registration of medical images: theory, methods, and applications. IEEE Signal Process. Mag. 27(4), 113–119 (2010)

    Article  Google Scholar 

  10. Tsai, R.Y., Huang, T.S.: Multi-frame image restoration and registration. Adv. Comput. Vis. Image Process. 317–339 (1984)

  11. Stark, H., Oskoui, P.: High-resolution image recovery from image-plane arrays, using convex projections. J. Opt. Soc. Am. A 6(11), 1715–1726 (1989)

    Article  Google Scholar 

  12. Katsaggelos, A.K., Molina, R., Mateos, J.: Super Resolution of Images and Video Morgan & Claypool, San Rafael (2007), 134 pp.

    Google Scholar 

  13. Schultz, R.R., Stevenson, R.L.: Extraction of highresolution frames from video sequences. IEEE Trans. Image Process. 5(6), 996–1011 (1996)

    Article  Google Scholar 

  14. Siegel, S., Castellan, N.J. Jr.: Nonparametric Statistics for the Behavioral Sciences, 2nd edn. McGraw-Hill, New York (1988), 399 pp.

    Google Scholar 

  15. Park, S.C., Park, M.K., Kang, M.G.: Super-resolution image reconstruction: a technical overview. IEEE Signal Process. Mag. 20(3), 21–36 (2003)

    Article  Google Scholar 

  16. Papoulis, A., Pillai, S.U.: Probability, Random Variables and Stochastic Processes, 4th edn. McGraw-Hill Europe, London (2002), 852 pp.

    Google Scholar 

  17. Pratt, W.K.: Digital Image Processing: PIKS Scientific Inside, 4th edn. Wiley-Interscience, New York (2007), 812 pp.

    Book  Google Scholar 

  18. Hardie, R.C.: Super-Resolution using adaptive wiener filters. In: Milanfar, P. (ed.) Super-Resolution Imaging, Cap. 2, pp. 35–61. CRC Press, Boca Raton (2010)

    Google Scholar 

Download references

Acknowledgements

We would like to thank professors Antonio Teixeira and Augusto Silva from Instituto de Engenharia Eletronica e Telematica de Aveiro (IEETA) of Universidade de Aveiro, Portugal, for the vocal tract images used in this work. These images are part of the HERON Project—A Framework for Portuguese Articulatory Synthesis Research, POSI/PLP/57680/2004. Ana L.D. Martins is supported by FAPESP, Brazil, under grant number 2008/01348-2.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ana L. D. Martins.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Martins, A.L.D., Mascarenhas, N.D.A. Spatio-Temporal Resolution Enhancement of Vocal Tract MRI Sequences—A Comparison Among Wiener Filter Based Methods. J Math Imaging Vis 45, 200–213 (2013). https://doi.org/10.1007/s10851-012-0389-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10851-012-0389-0

Keywords

Navigation