Abstract
This paper presents new low-power, high-speed unified and scalable word-based radix 8 architecture for Montgomery modular multiplication in GF(P) and GF(2n). This architecture has some similarities to the architecture of Huang, but it achieves more reduction in area and power consumption. To speed up the modular multiplication process, the hardware architecture employs carry-save addition to avoid carry propagation at each addition operation of the add-shift loop. To reduce power consumption, some latches called glitch blockers are employed at the outputs of some circuit modules to reduce the spurious transitions and the expected switching activities of high fan-out signals in the architecture. Also, we proposed a modified low-power dual-field 4-to-2 carry-save adder that has internal logic structure that reduces the chance of glitches occurrence. An ASIC implementation of the proposed architecture shows that it can perform 1,024-bit modular multiplication (for word size w = 32) in about 5.45 μs. Also, the results show that it has smaller Area × Time values compared to all unified and scalable designs by ratios ranging from 12.2 to 66.8 %, which makes it suitable for implementation where both area and performance are of concern. Also, it has higher throughput over them by ratios ranging from 6.0 to 80.7 %. In addition, it achieves a decrease in power consumption compared to these designs by ratios ranging from 18.8 to 52.6 %. By comparing to the designs that are not unified, it has slightly higher Area × Time and lower throughput values compared to some of them. However, it achieves a significant low-power consumption compared to all of them.
Similar content being viewed by others
References
Rivest L., Shamir A. and Adleman L.: A method for obtaining digital signatures and public-key cryptosystems. Commun. ACM. 21: 120–126 (1978)
National Institute for Standards and Technology, Digital signature standard. In: FIPS PUB 186–2 (2000)
Hellman, M.: New directions on cryptography. IEEE Trans. Inf. Theory 644–654 (1976)
Koblitz N.: Elliptic curve cryptosystems. Math. Comput. 48: 203–209 (1987)
Montgomery P.: Modular multiplication without trial division. Math. Comput. 44: 519–521 (1985)
Yanik T., Savas E. and Koc C.: Incomplete reduction in modular, arithmetic. Math. Comput. 149: 46–54 (2002)
Koc C. and Acar T.: Montgomery multiplication in GF(2k). Des. Codes Cryptogr. 14: 57–69 (1998)
Gang, F.: Design of modular multiplier based on improved Montgomery algorithm and systolic array. In: Proc. 1st Int. Multi-Symp. Comput. Comput. Sci. 356–359 (2006)
Ghoreishi, S.; Pourmina, M.; Bozorgi, H.; Dousti, M.: High speed RSA implementation based on modified Booths technique and Montgomery’s multiplication for FPGA platform. In: Proc. 2nd Int. Conf. Adv. Circuits, Electron. Micro-Electron. 86–93 (2009)
Sassaw, G.; Jimenez, C.; Valencia, M.: High radix implementation of Montgomery multipliers with CSA. In: Proc. Int. Conf. Microelectron, 315–318 (2010)
Neto, J.; Tenca, A.; Ruggiero, W.: A parallel k-partition method to perform Montgomery multiplication. In: Proc. IEEE Int. Conf. Appl. Specif. Syst., Arch. Process. 251–254 (2011)
Cilardo A., Mazzeo A., Romano L. and Saggese G.: Exploring the design-space for FPGA-based implementation of RSA. Microprocess. Microsyst. 28: 183–191 (2004)
Bayhan, D.; Ors, S.; Saldamli, G.: Analyzing and comparing the Montgomery multiplication algorithms for their power consumption. In: Proc. Int. Conf. Comput. Eng. Syst. 257–261 (2010)
Wang, X.; Noel, P.; Kwasniewski, T.: Low power design techniques for a Montgomery modular multiplier. In: Proc. Int. Symp. Intell. Signal Process. Commun. Syst. 449–452 (2005)
Son, H.; Oh S.: Design and implementation of scalable low-power Montgomery multiplier. In: International Conference on Design, ICCD 2004, pp. 524–531 (2004)
Kuang, S.; Wang, J.; Chang K.; Hsu, H.: Energy-Efficient High Throughput Montgomery modular multipliers for RSA cryptosystems. Accepted for inclusion in future issue of IEEE Trans. on VLSI systems, December (2012)
Carbognani F., Buergin F., Felber N. and Kaeslin H.: Transmission gates combined with level-restoring CMOS gates reduce glitches in low-power low-frequency multipliers. IEEE Trans. VLSI Systems. 16: 830–836 (2008)
Chirca, K.; Schulte, M.; Glossner, J; Wang, H.; Mamidi, X.; Balzola, P.; Vassiliadis, S.: A static low-power, high-performance 32-bit carry skip adder. In: Euromicro Symposium on Digital System Design, pp. 615–619 (2004)
Tenca A. and Koc C.: A scalable architecture for modular multiplication based on Montgomery’s algorithm. IEEE Trans. Comput. 52: 1215–1221 (2003)
Tenca A., Savas E. and Koc C.: A design framework for scalable and unified architectures that perform multiplication in gf(p) and gf(2m). Int. J. Comput. Res. 13: 68–83 (2004)
Todorov, T.; Tenca, A.; and Koc. C.: High-radix design of a scalable modular multiplier. In: Koc, C.; Naccache, D.; Paar, C. (eds.) Cryptographic Hardware and Embedded Systems, Lecture Notes in Computer Science No. 2162, Springer, Berlin, Germany, pp. 189–205 (2001)
Savas E., Tenca A., Ciftcibasi M. and Koc C.: Multiplier architectures for gf(p) and gf(2n). IEE Proc. Comput. Digit. Tech. 151: 147–160 (2004)
Harris, D.; Krishnamurthy, R.; Andersm, M.; Mathew, S.; Hsu, S.: An improved unified scalable radix-2 Montgomery multiplier. In: IEEE (ARITH-17), pp. 172–178 (2005)
Groschdl, J.: A bit-serial unified multiplier architecture for finite fields GF(p) and GF(2m). In: Proc. CHES 2001, pp. 202–218 (2001)
Savas, E.; Tenca, A.; Koc, C.: A scalable and unified multiplier architecture for finite fields GF(p) and GF(2m). In: Proceedings of CHES 2000, pp. 281–296 (2000)
Ibrahim, A.; Gebali, F.; Elsimary, H.; Nassar, A.: New processor array architecture for scalable radix 8 montgomery modular multiplication. In: Proc. Canadian Conference on Electrical and Computer Engineering (CCECE), pp. 389–394 (2011)
Todorov, G.; Tenca, A.: ASIC Design, Implementation and Analysis of a Scalable High-Radix Montgomery Multiplier. Master’s Thesis, Oregon State University, USA (2000)
Walter C.: Montgomery exponentiation needs no final subtractions. Electron. Lett. 35: 1831–1832 (2004)
Orup, H.: Simplifying quotient determination in high-radix modular multiplication. In: Proc. 12th IEEE Symp. Computer Arithmetic, pp. 193–199 (1995)
Hachez, G.; Quisquater, J.: Montgomery exponentiation with no final subtractions: Improved results. In: Koc, Ç.; Naccache, D.; Paar, C. (eds.) Cryptographic Hardware and Embedded Systems, Lecture Notes in Computer Science No. 1965, Springer, Berlin, Germany, pp. 293–301 (2000)
Tawalbeh, L.; Tenca, A.: Radix-4 ASIC Design of a Scalable Montgomery Modular Multiplier Using Encoding Techniques. Master’s Thesis, Oregon State University, USA (2002)
El-Guibaly F. and Tawfik A.: Mapping 3d iir digital filter onto systolic arrays. Multidimens. Syst. Signal Process. 7: 7–26 (1996)
Gebali F.: Algorithms and Parallel Computing. Wiley, New York (2011)
Huang M., Gaj K. and El-Ghazawi T.: New hardware architectures for montgomery modular multiplication algorithm. IEEE Trans. Comput. 7: 923–936 (2011)
Ibrahim, A.; Gebali, F.; El-Simary, H.; Nassar, A.: Design and VLSI Implementation of Fast Modular Multiplier Architectures for Cryptosystems. PhD’s Dissertation, Department of Electronics and Electrical Communication, Cairo University (2010)
Ibrahim A., Gebali F., El-Simary H. and Nassar A.: Processor array architectures for scalable radix 4 montgomery modular multiplication algorithm. IEEE Trans. Parallel Distrib. Syst. 22: 1142–1149 (2011)
Ibrahim, A.; Gebali, F.; El-Simary, H.: New and improved word-based unified and scalable architecture for radix 2 Montgomery modular multiplication algorithm. In: 2013 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM), pp. 153–158 (2013)
Sudhakar M., Kamala R. and Srinivas M.: New and improved architectures for montgomery modular multiplication. Mobile Netw Appl. 12: 750–755 (2007)
Wang, H.; Lin, W.; Ye, J.; Shieh, M.: Fast scalable radix-4 Montgomery modular multiplier. In: 2012 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 3049–3052 (2012)
Amberg, P.; Pinckney, N.; Harris, M.: Parallel high-radix montgomery multipliers. In: 42nd Asilomar Conference on Signals, Systems and Computers, pp. 772–776 (2008)
Ye, J.; Hung, T.; Shieh M.: Energy-efficient architecture for word-based montgomery modular multiplication algorithm. In: 2013 International Symposium on VLSI Design, Automation, and Test (VLSI-DAT), pp. 1–4 (2013)
Sassaw, G.; Carlos, J.; Valencia, M.: High radix implementation of montgomery multipliers with CSA. In: 22nd International Conference on Microelectronics (ICM 2010), pp. 315–318 (2010)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ibrahim, A., Elsimary, H. & Gebali, F. Low-Power, High-Speed Unified and Scalable Word-Based Radix 8 Architecture for Montgomery Modular Multiplication in GF(P) and GF(2n). Arab J Sci Eng 39, 7847–7863 (2014). https://doi.org/10.1007/s13369-014-1363-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13369-014-1363-5