An efficient tree architecture for modulo 2 n +1 multiplication
- 74 Downloads
- 45 Citations
Abstract
Modulo 2 n +1 multiplication plays an important role in the Fermat number transform and residue number systems; the diminished-1 representation of numbers has been found most suitable for representing the elements of the rings. Existing algorithms for modulo (2 n +1) multiplication either use recursive modulo (2 n +1) addition, or a regular binary multiplication integrated with the modulo reduction operation. Although most often adopted for largen, this latter approach requires conversions between the diminished-1 and binary representations. In this paper we propose a parallel fine-grained architecture, based on a Wallace tree, for modulo (2 n +1) multiplication which does not require any conversions; the use of a Wallace tree considerably improves the speed of the multiplier. This new architecture exhibits an extremely modular structure with associated VLSI implementation advantages. The critical path delay and the hardware requirements of the new multiplier are similar to that of a correspondingn×n bit binary multiplier.
Keywords
Partial Product Full Adder Residue Number System Fermat Number Critical Path DelayPreview
Unable to display preview. Download preview PDF.
References
- 1.R.C. Agarwal and C.S. Burrus, “Fast convolution using Fermat number transforms with applications to digital filtering,”IEEE Trans. Acoust. Speech, Signal Processing, Vol. ASSP-22, pp. 87–97, 1974.MathSciNetCrossRefGoogle Scholar
- 2.W. Luo, G.A. Jullien, N.M. Wigley, W.C. Miller, and Z. Wang, “An array processor for inner product computations using a Fermat number ALU,”Proc. 1995 Symposium on Application Specific Array Processors (in print).Google Scholar
- 3.L.M. Leibowitz, “A simplified binary arithmetic for the Fermat number transform,”IEEE Trans. Acoust. Speech, Signal Processing, Vol. ASSP-24, pp. 356–359, 1976.MathSciNetCrossRefGoogle Scholar
- 4.W.K. Jenkins, “The design of specialized residue classes for efficient recursive digital filter realization,”IEEE Trans. Acoust. Speech, Signal Processing, Vol. ASSP-30, pp. 370–380, 1982.CrossRefMATHGoogle Scholar
- 5.W.K. Jenkins, “Recent advance in residue number techniques for recursive digital filtering,”IEEE Trans. Acoust. Speech, Signal Processing, Vol. ASSP-27, pp. 19–30, 1979.CrossRefMATHGoogle Scholar
- 6.F.J. Taylor, “A VLSI residue arithmetic multiplier”IEEE Trans. Comput., Vol. C-31, pp. 540–546, 1982.CrossRefGoogle Scholar
- 7.J.J. Chang, T.K. Truong, H.M. Shao, I.S. Reed, and L.-S. Hsu, ‘The VLSI design of a single chip for the multiplication of integers modulo a Fermat number”,IEEE Trans. Acoust. Speech, Signal Processing, Vol. ASSP-33, pp. 1599–1602, 1985.CrossRefGoogle Scholar
- 8.M. Benaissa, A Pajayakrit, S.S. Dlay, and A.G.J. Holt, “VLSI design for diminished-1 multiplication of integers modulo a Fermat number,”IEE Proc., Vol. 135, Pt. E, pp. 161–164, 1988.Google Scholar
- 9.M. Benaissa, A. Bouridane, S.S. Dlay, and A.G.J. Holt, “Diminished-1 multiplier for a fast convolver and correlator using the Fermat number transform,”IEE Proc., Vol. 135, Pt. G, pp. 187–193, 1988.Google Scholar
- 10.A.V. Curiger, H. Bonnenberg, and H. Kaeslin, “Regular VLSI architecture for multiplication modulo (2n+1),”IEEE J. Solid-State Circuits, Vol. 26, pp. 990–994, 1991.CrossRefGoogle Scholar
- 11.A. Hiassat: “New memoryless mod (2n+1) residue multiplier,”Elec. Letts., Vol. 28, pp. 3124–3125, 1992.Google Scholar
- 12.C.S. Wallace, “A suggestion for a fast multiplier,”IEEE Trans. Electronic Computers, Vol. EC-13, pp. 14–17, 1964.CrossRefMATHGoogle Scholar
- 13.J.Y. Lee, H.L. Garvin, and C.W. Slayman, “A high-speed highdensity silicon 8×8-bit parallel multiplier,”IEEE J. Solid-State Circuits, Vol. SC-22, pp. 35–40, 1987.CrossRefGoogle Scholar
- 14.Z. Wang, G.A. Jullien, and W.C. Miller, “New design techniques for column compression multipliers,”IEEE Trans. Comput., Vol. C-44, No. 8, pp. 962–970, 1995.CrossRefMATHGoogle Scholar
- 15.R.P. Brent and H.T. Kung, “A regular layout for parallel adders,”IEEE Trans. Comput., Vol C-31, pp. 260–264, 1982.MathSciNetCrossRefMATHGoogle Scholar
- 16.V.G. Oklobdzija, D. Villeger, and S.S. Liu, “A method for speed optimized partial product reduction and generation of fast parallel multiplier using an algorithmic approach,”IEEE Trans. Computers., Vol. C-45, No. 3, pp. 294–306, 1996.CrossRefMATHGoogle Scholar