Journal of Signal Processing Systems

, Volume 52, Issue 3, pp 297–311 | Cite as

A Hardware Acceleration Platform for Digital Holographic Imaging

  • Thomas LenartEmail author
  • Mats Gustafsson
  • Viktor Öwall


This paper presents a hardware acceleration platform for image reconstruction in digital holographic imaging. The hardware accelerator executes a computationally demanding reconstruction algorithm which transforms an interference pattern captured on a digital image sensor into visible images. Focus in this work is to maximize computational efficiency, and to minimize the external memory transfer overhead, as well as required internal buffering. The paper presents an efficient processing datapath with a fast transpose unit and an interleaved memory storage scheme. The proposed architecture results in a speedup with a factor 3 compared with the traditional column/row approach for calculating the two-dimensional FFT. Memory sharing between the computational units reduces the on-chip memory requirements with over 50%. The custom hardware accelerator, extended with a microprocessor and a memory controller, has been implemented on a custom designed FPGA platform and integrated in a holographic microscope to reconstruct images. The proposed architecture targeting a 0.13 µm CMOS standard cell library achieves real-time image reconstruction with 20 frames per second.


digital holography flexible FFT data scaling hybrid floating-point matrix transpose burst oriented memory 


  1. 1.
    Schnars, U., & Jueptner, W. (2005). Digital holography. Berlin: Springer.Google Scholar
  2. 2.
    Gustafsson, M., et al. (2004). High resolution digital transmission microscopy—A Fourier holography approach. Optics and Lasers in Engineering, 41(3), 553–563 (March).CrossRefGoogle Scholar
  3. 3.
    Born, M., & Wolf, E. (1999). Principles of optics. Cambridge, UK: Cambridge University Press.Google Scholar
  4. 4.
    Demetrakopoulos, T. H., & Mittra, R. (1974). Digital and optical reconstruction of suboptical diffraction patterns. Applied Optics, 13, 665–670 (March).CrossRefGoogle Scholar
  5. 5.
    Wenbo, Xu, Jericho, M. H., Meinertzhagen, I. A., & Kreuzer, H. J. (2001). Digital in-line holography for biological applications. Cell Biology, 98, 11301–11305 (September).Google Scholar
  6. 6.
    MT48LC32M16A2-75, Micron Technology—SDRAM components.
  7. 7.
    Brigham, E. O. (1988). The fast fourier transform and its applications. Englewood Cliffs, NJ: Prentice-Hall.Google Scholar
  8. 8.
    McKee, S. A., et al. (1998). Smarter memory: Improving bandwidth for streamed references. Computer, 31(7), 54–63 (July).CrossRefGoogle Scholar
  9. 9.
    Parhami, B. (2000). Computer arithmetic. 198 Madison Avenue, New York 10016: Oxford University Press.Google Scholar
  10. 10.
    He, S., & Torkelson, M. (1998). Designing pipeline FFT processor for OFDM (de)modulation. In URSI international symposium on signals, systems, and electronics (pp. 257–262) (October).Google Scholar
  11. 11.
    Lenart, T., & Öwall, V. (2006). Architectures for dynamic data scaling in 2/4/8K pipeline FFT cores. IEEE Trans. VLSI Syst., 14(11), 1286–1290 (November).CrossRefGoogle Scholar
  12. 12.
    Bidet, E., Joanblanq, C., & Senn, P. (1995). A fast single-chip implementation of 8192 complex point FFT. IEEE Journal of Solid-State Circuits, 30, 300–305 (March).CrossRefGoogle Scholar
  13. 13.
    Del Toso, C., et al. (1998). 0.5-μm CMOS circuits for demodulation and decoding of an OFDM-based digital TV signal conforming to the European DVB-T standard. IEEE Journal of Solid-State Circuits, 33(11), 1781–1792 (November).CrossRefGoogle Scholar
  14. 14.
    Wosnitza, M., Cavadini, M., Thaler, M., & Tröster, G. (1998). A high precision 1024-point FFT processor for 2D convolution. In 1998 IEEE international solid-state circuits conference. Digest of technical papers, ISSCC (pp. 118–119) (February).Google Scholar
  15. 15.
    Kristensen, F., Nilsson, P., & Olsson, A. (2004). Reduced transceiver-delay for OFDM systems. In Proc. of vehicular technology conference, VTC 2004 Spring, (Vol. 3, pp. 1242–1245) (May).Google Scholar
  16. 16.
    Gaisler, J. (2002). A portable and fault-tolerant microprocessor based on the SPARC v8 architecture. In Proc. of dependable systems and networks (pp. 409–415) (June).Google Scholar
  17. 17.
    ARM Ltd. (1999). AMBA Specification—Advanced microcontroller bus architecture.
  18. 18.
    Miyamoto, N., Karnan, L., Maruo, K., Kotani, K., & Ohmi1 T. (2003). A small-area high-performance 512-point 2-dimensional FFT single-chip processor. In Proc. of European solid-state circuits (ESSCIRC’03) (pp. 603–606) (September).Google Scholar
  19. 19.
    Uzun, I., Amira, A., & Bensaali, F. (2003) A reconfigurable coprocessor for high-resolution image filtering in real time. In Proc. of the 10th IEEE international conference on electronics, circuits and systems (pp. 192–195) (December).Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2008

Authors and Affiliations

  1. 1.CCCD, Department of Electrical and Information TechnologyLund UniversityLundSweden

Personalised recommendations