SlideShare a Scribd company logo
Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 
www.ijera.com 210 | P a g e 
An Area-efficient Montgomery Modular Multiplier for Cryptosystems Anizha Radhakrishnan*, Seena George** *(M.tech scholar, Department of ECE, Sree Narayana Gurukulam College of Engineering , Kolenchery, India) 
** (Asst. Professor, Department of ECE, Sree Narayana Gurukulam College of Engineering , Kolenchery, India) 
Abstract RSA is one of the most widely adopted public key algorithms at present and it requires repeated modular multiplications to accomplish the computation of modular exponentiation. A famous approach to implement modular multiplication in hardware circuits is based on the Montgomery modular multiplication algorithm since it has many advantages. To speed up the encryption/decryption process, many high-speed Montgomery modular multiplication algorithms and hardware architectures employ carry-save addition. But CSA based architecture increases the area. In this paper, in order to reduce the area of the CSA based multiplier, an area-efficient algorithm called Double Add Reduce algorithm is introduced. Then the performance analysis of the new design with the previous had done for comparison. 
Keywords— Area-efficient architecture, Carry save addition, Cryptography, Double add reduce algorithm, Montgomery modular multiplier. 
I. INTRODUCTION 
The motivation for studying area-efficient and fast modular multiplication algorithms comes from their application in public key cryptography. Modular exponentiation is a critical operation in RSA [1], Diffie- Hellman key exchange etc. It is a popular operation for encryption in public key cryptosystems. The core operation in this method is repeated modular multiplication. Many algorithms are available for performing fast modular multiplication. These algorithms can be grouped in to two. That is, the first category algorithms perform modular reduction followed by multiplication. Classical, Karatsuba, Schonhage- Strassen and Barrett algorithms are examples of such multiplication and modular reduction algorithms [2], [3]. The second category performs modular reduction combined with multiplication. Montgomery algorithm [4] introduced in 1985, is a famous approach for carrying out fast modular multiplication and belongs to the second category. For a single modular multiplication Montgomery method is disadvantageous. But the advantage comes when it performs modular exponentiation. Montgomery algorithm performs modular multiplication without division. That is, it replaces division with simple bit shift operation. In many public key systems Montgomery method is used for obtaining fast modular multiplication. 
Many algorithms are introduced to perform Montgomery modular multiplication in a faster way. In this paper, we are introducing a new area reducing architecture for Montgomery modular multiplier. The remainder of this paper is organized as follows: Section II briefly reviews previous algorithms and their architectures for performing Montgomery multiplication. In section III, the mathematics behind the Montgomery algorithm is explained in detail. Then we have introduced a new area efficient Montgomery algorithm in section IV. Section V gives a detailed comparison between proposed method and the previous CSA based modular multiplier. Finally we concluded this paper in section VI. 
II. PREVIOUS METHODOLOGIES 
Montgomery multiplication algorithm is the most efficient algorithm available. The main advantage of Montgomery algorithm is that it replaces the division operation with shift operations. During two decades many alternative forms of Montgomery algorithms are introduced. These architectures use carry save addition. The work in [5] and [13] presented two types of Montgomery algorithms which use Carry Save Adder (CSA). One of the two types used four-to-two CSA and the other used five-to-CSA. They had given a brief comparison between these two versions of Montgomery multipliers. They had found that the multiplier using four-to-two CSA architecture has shorter critical path than that of five-to-two CSA multiplier. But extra storage elements and multiplexers are required for 4-to-2 architecture which probably increases the energy consumption. The work in [6] proposed a Montgomery multiplication algorithm using pipelined carry save addition to 
RESEARCH ARTICLE OPEN ACCESS
Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 
www.ijera.com 211 | P a g e 
shorten the critical path delay of five-to-two CSA. This method also required additional pipeline registers and multiplexers which will increase the area. Ming Der Shieh presented [7] a new algorithm for high speed modular multiplication. This new Montgomery multiplier performs modular reduction in a pipelined fashion, so that the critical path delay is reduced from the four-to-two to three-to-two carry- save addition. Then it requires additional pipeline registers to store intermediate values. All the above works were not discussed about the energy consumption. Several previous works [8], [9] have developed techniques to reduce the power/energy consumption of Montgomery multipliers. In [8], some latches named glitch blockers are located at the outputs of some circuit modules to reduce the spurious transitions and the expected switching activities of high fan-out signals in the radix-4 scalable Montgomery multiplier. Shiann Rong Kuang [9] tried to reduce the energy consumption of CSAs and registers in the CSA-based Montgomery multipliers via a new technique. They modified the CSA based Montgomery multiplier algorithm and as a result of this, the number of clock cycles required to complete the multiplication is largely decreased. To achieve further energy reduction, they have adjusted the internal structure of barrel register full adder and then applied the gated clock design technique. But this energy efficient algorithm increased the total area of the design. In order to reduce the area, we are presenting a new algorithm which can modify the CSA based algorithm in [9]. Before introducing the area efficient algorithm, let us see the mathematics behind the Montgomery modular multiplication. 
III. MONTGOMERY MODULAR MULTIPLICATION 
Modular multiplication of two integers X and Y, simply performs, Z = X . Y mod M (1) Here X, Y and M are n - bit numbers and M should be greater than X and Y. Instead of computing X. Y, the Montgomery multiplication [10] algorithm computes, Z’ = MP (X, Y, M) = X .Y. 2-n mod M (2) MP denotes Montgomery Product and sometimes the equation (2) can also be represented as shown below. Z’ = MP (X, Y, M) = X .Y. R-1 mod M (3) 
Here R = 2n and R-1 is the multiplicative inverse of R mod M. That is, 
R. R-1 mod M = 1 (4) In order to perform Montgomery multiplication the numbers should be converted to the Montgomery domain. The conversion to the Montgomery domain is very simple. The conversion from integer domain to Montgomery domain is shown below. X’ = X. R mod M (5) Y’ = Y. R mod M (6) Here X’ and Y’ are the Montgomery forms of X and Y. After completing the multiplication in Montgomery domain, we have to convert it back to the integer domain to obtain the final result. The conversion from the Montgomery product Z’ to Z is shown below: Z = MP (Z’, 1, M) = Z’ .1. R-1 mod M (7) These conversions are required only at the starting point and ending point. It is important to note here that to find the modular product the Montgomery algorithm does not perform any division. On the other hand, it performs just addition and bit shift operation. The pseudo code [11], [12] for finding Montgomery product S is given below. Algorithm 1 MP1(X,Y,M) { Initially S = 0; for i = 0 to n-1 { S = S + Xi . Y ; If S is odd then S = S + M ; S = S/2; } If S ≥ M then S = S – M; } Here we can see that algorithm requires division by 2. Division by 2 is equivalent to shifting the bits to right which is fast and easy in hardware. Thus we can avoid the division. Only requirement is to perform simple conversion from Montgomery domain done before and after multiplication. 
IV. PROPOSED METHODOLOGY 
The work in [9], proposed an energy-efficient four-to-two CSA based Montgomery algorithm. While reducing energy, area of the multiplier is slightly increased. In this paper, we propose an area efficient, fast and low power algorithm called Double Add Reduce (DAR) algorithm which can replace the CSA based multiplier.
Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 
www.ijera.com 212 | P a g e 
Fig.1. Block diagram of multiplier based on DAR algorithm Figure 1 shows the block diagram of Montgomery modular multiplier. Here main block is a double add reduce block. 
4.1 Double Add Reduce (DAR) algorithm 
The DAR block computes Montgomery product using DAR algorithm which is shown below: Algorithm 2 MP2(X,Y,M) { S-1 = 0; Y = 2 x Y; For i=0 to n { qi = Si-1 mod 2; (LSB of Si-1) Si = (Si-1 + qi.M + Xi.Y) / 2; } Return Sn; } The multiplicand X is shifted up by 1 bit to simplify the calculation of qi. This implies that the result is a factor of 2 too large, and so the for loop must be executed an additional time to eliminate this factor. 
4.2 Architecture 
The architecture based on DAR algorithm is shown in figure 2. That is, the algorithm can be executed with the help of two adders. To remove the dependency of qi on the addition of Xi.Y and Si-1, Y can be shifted up 1 bit, thus forcing the LSB of Si-1 + Xi. Y to always be zero, as illustrated in figure 2. Fig.2. Architecture based on DAR algorithm 
However, this extra factor of 2 must be removed by an extra iteration, meaning that the number of clock cycles is equal to n+1. 
V. RESULT ANALYSIS 
In order to verify the area efficiency, we have captured our new design in VHDL and implemented into the Xilinx Spartan 3E series of FPGA. For performance comparison, we have also captured the CSA based multiplier design in VHDL and implemented into the FPGA. Following section gives the performance results of these implementations. 
5.1 Detailed area analysis 
During implementation, we have generated device utilization report using Xilinx ISE 8.1i software tool. The design summary report generated by the Xilinx ISE software tool can be used to analyze the area utilized by the two versions of the Montgomery modular multiplier. Table I gives a detailed area analysis of the two implementations. 
TABLE I. AREA ANALYSIS OF THE TWO IMPLEMENTATIONS 
CSA based modular multiplier (8- bit) 
DAR based modular multiplier (8- bit) 
Number of slice flipflop used 
64 
23 
Number of 4 input LUTs used 
144 
60 
Total equivalent gate count for design 
1527 
595 
Table I shows that proposed DAR based modular multiplier (8- bit) considerably reduces the total area used by the design.
Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 
www.ijera.com 213 | P a g e 
5.2 Timing and power analysis 
From the simulation results of both CSA multiplier (8 bit) and DAR multiplier (8 bit), we have seen that after applying inputs the later one takes less time for producing the output. For same inputs Modified multiplier gives the result in less time. From the Figure 3, CSA based multiplier takes 2.185 μs for generating the output while from the Figure 4, DAR based multiplier takes only 1.745 μs. After applying inputs, DAR based modular multiplier takes less time to produce output. Fig.3. Simulation result for 4-to-2 CSA based modular multiplier Fig.4. Simulation result for DAR based modular multiplier The power summary provides information about the usage of voltage and other power resources for your entire design or for a single design module. During synthesis we have generated the power data choosing device type Spartan 3 and package tq144. These two multipliers consumes almost same power (18mW). 
VI. CONCLUSION 
This paper presented an efficient algorithm and its corresponding architecture to reduce the area without affecting the speed of a Montgomery modular multiplier. Experimental results showed that the proposed method is indeed capable of reducing the area of 8 bit Montgomery modular multiplier up to 61% compared to the previous CSA based design. In addition to this, our new design increased the speed to 20% and having almost same power consumption. Here we cannot observe significant change in power consumptions of the two versions of multipliers. In future, we will try to develop efficient architecture to carry out fast modular multiplication with low energy consumption. 
VII. ACKNOWLEDGMENT 
We would like to express our sincere gratitude to the Management and staff, SNG College of Engineering for providing the facilities. 
REFERENCES 
[1] Taek-Won Kwon, Chang-Seok You,‖ Two implementation methods of a 1024-bit RSA Cryptoprocessor based on modified Montgomery algorithm‖, 0-7803-6685- 9/01/$10.00 2001 IEEE. 
[2] Andre Weimerskirch and Christof Paar, ―Generalizations of the Karatsuba Algorithm for Efficient Implementations‖, IEEE Int.Conf. Comput. Design, 2005. 
[3] Antoon Bosselaers, Rene Govaerts and Joos Vandewalle,‖ Comparison of three modular reduction functions‖, ° Springer-Verlag 1993 oct 25. 
[4] Peter L. Montgomery, ―Modular multiplication without trivial division”, Mathematics of computation, Vol.44, No. 170. (Apr 1985) ,pp. 519-522. 
[5] C. McIvor, M. McLoone, and J. V. McCanny, ―Modified Montgomery modular multiplication and RSA exponentiation techniques,‖ IEE Proc. Comput. Digit. Tech., vol. 151, no. 6, pp. 402–408, Nov. 2004. 
[6] K. Manochehri and S. Pourmozafari, ―Fast Montgomery modular multiplication by pipelined CSA architecture,‖ in Proc. IEEE Int. Conf. Microelectron., Dec. 2004, pp. 144–147. 
[7] M.-D. Shieh, J.-H. Chen, W.-C. Lin, and H.- H.Wu, ―A new algorithm for high-speed modular multiplication design,‖ IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 56, no. 9, pp. 2009–2019, Sep. 2009. 
[8] H.-K. Son and S.-G. Oh, ―Design and implementation of scalable low-power Montgomery multiplier,‖ in Proc. IEEE Int. Conf. Comput. Design, 2004, pp. 524–531. 
[9] Shiann-Rong Kuang, Jiun-Ping Wang,‖ Energy-Efficient High-Throughput Montgomery Modular Multipliers for RSA Cryptosystems‖, IEEE Transactions on very large scale integration (VLSI) systems, vol. 21, no. 11, november 2013. 
[10] J. C. Neto, A. F. Tenca, and W. V. Ruggiero, ―A parallel k-partition method to perform Montgomery multiplication,‖ in Proc. IEEE Int. Conf. Appl.-Specif. Syst., Arch. Process., Sep. 2011, pp. 251–254.
Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com 
ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 
www.ijera.com 214 | P a g e 
[11] Ambika R, Hamsavahini R, ― A Survey on Hardware Architectures for Montgomery Modular Multiplication Algorithm‖, IJETCAS 13-342; © 2013. 
[12] Kooroush Manochehri, Saadat Pourmozafari,‖ Montgomery and RNS for RSA hardware implementation‖, Computing and Informatics, Vol. 29, 2010, 849–880. 
[13] Alan Daly, Ciaran McIvor, William Marnane,‖ Fast Montgomery Modular Multiplication and RSA Cryptographic Processor Architectures‖, November, 2003.

More Related Content

What's hot

Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...Angel Yogi
 
Design of 8-Bit Comparator Using 45nm CMOS Technology
Design of 8-Bit Comparator Using 45nm CMOS TechnologyDesign of 8-Bit Comparator Using 45nm CMOS Technology
Design of 8-Bit Comparator Using 45nm CMOS Technology
IJMER
 
IRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo Adder
IRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo AdderIRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo Adder
IRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo Adder
IRJET Journal
 
Layout Design Analysis of CMOS Comparator using 180nm Technology
Layout Design Analysis of CMOS Comparator using 180nm TechnologyLayout Design Analysis of CMOS Comparator using 180nm Technology
Layout Design Analysis of CMOS Comparator using 180nm Technology
IJEEE
 
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
IJERA Editor
 
Vedic multiplier
Vedic multiplierVedic multiplier
Vedic multiplier
BHUSHAN MHASKE
 
Paper id 37201520
Paper id 37201520Paper id 37201520
Paper id 37201520IJRAT
 
Iaetsd design and implementation of pseudo random number generator
Iaetsd design and implementation of pseudo random number generatorIaetsd design and implementation of pseudo random number generator
Iaetsd design and implementation of pseudo random number generator
Iaetsd Iaetsd
 
9.design of high speed area efficient low power vedic multiplier using revers...
9.design of high speed area efficient low power vedic multiplier using revers...9.design of high speed area efficient low power vedic multiplier using revers...
9.design of high speed area efficient low power vedic multiplier using revers...nareshbk
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
J045075661
J045075661J045075661
J045075661
IJERA Editor
 
Efficient Layout Design of CMOS Full Subtractor
Efficient Layout Design of CMOS Full SubtractorEfficient Layout Design of CMOS Full Subtractor
Efficient Layout Design of CMOS Full Subtractor
IJEEE
 
Area-Delay Efficient Binary Adders in QCA
Area-Delay Efficient Binary Adders in QCAArea-Delay Efficient Binary Adders in QCA
Area-Delay Efficient Binary Adders in QCA
IJERA Editor
 
G1103026268
G1103026268G1103026268
G1103026268
IOSR Journals
 
Ijarcet vol-2-issue-7-2357-2362
Ijarcet vol-2-issue-7-2357-2362Ijarcet vol-2-issue-7-2357-2362
Ijarcet vol-2-issue-7-2357-2362Editor IJARCET
 

What's hot (17)

Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
Design of High Performance 8,16,32-bit Vedic Multipliers using SCL PDK 180nm ...
 
Design of 8-Bit Comparator Using 45nm CMOS Technology
Design of 8-Bit Comparator Using 45nm CMOS TechnologyDesign of 8-Bit Comparator Using 45nm CMOS Technology
Design of 8-Bit Comparator Using 45nm CMOS Technology
 
IRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo Adder
IRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo AdderIRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo Adder
IRJET- Implementation of FIR Filter using Self Tested 2n-2k-1 Modulo Adder
 
Layout Design Analysis of CMOS Comparator using 180nm Technology
Layout Design Analysis of CMOS Comparator using 180nm TechnologyLayout Design Analysis of CMOS Comparator using 180nm Technology
Layout Design Analysis of CMOS Comparator using 180nm Technology
 
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
Hardware Implementation of Two’s Compliment Multiplier with Partial Product b...
 
Vedic multiplier
Vedic multiplierVedic multiplier
Vedic multiplier
 
Paper id 37201520
Paper id 37201520Paper id 37201520
Paper id 37201520
 
Id93
Id93Id93
Id93
 
N046018089
N046018089N046018089
N046018089
 
Iaetsd design and implementation of pseudo random number generator
Iaetsd design and implementation of pseudo random number generatorIaetsd design and implementation of pseudo random number generator
Iaetsd design and implementation of pseudo random number generator
 
9.design of high speed area efficient low power vedic multiplier using revers...
9.design of high speed area efficient low power vedic multiplier using revers...9.design of high speed area efficient low power vedic multiplier using revers...
9.design of high speed area efficient low power vedic multiplier using revers...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
J045075661
J045075661J045075661
J045075661
 
Efficient Layout Design of CMOS Full Subtractor
Efficient Layout Design of CMOS Full SubtractorEfficient Layout Design of CMOS Full Subtractor
Efficient Layout Design of CMOS Full Subtractor
 
Area-Delay Efficient Binary Adders in QCA
Area-Delay Efficient Binary Adders in QCAArea-Delay Efficient Binary Adders in QCA
Area-Delay Efficient Binary Adders in QCA
 
G1103026268
G1103026268G1103026268
G1103026268
 
Ijarcet vol-2-issue-7-2357-2362
Ijarcet vol-2-issue-7-2357-2362Ijarcet vol-2-issue-7-2357-2362
Ijarcet vol-2-issue-7-2357-2362
 

Viewers also liked

La música
La músicaLa música
La músicadc2x
 
Buscamos faltas
Buscamos faltasBuscamos faltas
Buscamos faltas
sergioluiscortesdiaz
 
Colores primarios
Colores primariosColores primarios
Colores primariosDOGOHURTADO
 
Ativ1 4 alinemoraes-tecnologias na minha escola
Ativ1 4 alinemoraes-tecnologias na minha escolaAtiv1 4 alinemoraes-tecnologias na minha escola
Ativ1 4 alinemoraes-tecnologias na minha escolaAline Moraes
 
Pspc r5 direcionadores estratégicos ipê rs
Pspc r5 direcionadores estratégicos ipê rsPspc r5 direcionadores estratégicos ipê rs
Pspc r5 direcionadores estratégicos ipê rsLeonardo Rajesh
 
JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...
JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...
JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...Ronaldo de Sousa
 
Madrid
MadridMadrid
Madrid
G56PC
 
Poster2_Hillary_PhysAct+Cogn (1)
Poster2_Hillary_PhysAct+Cogn (1)Poster2_Hillary_PhysAct+Cogn (1)
Poster2_Hillary_PhysAct+Cogn (1)Hillary Nguyen
 
Presentación1
Presentación1Presentación1
Presentación1orubis
 
I Social Media @UNIURB.IT
I Social Media @UNIURB.ITI Social Media @UNIURB.IT
I Social Media @UNIURB.IT
Università of Urbino Carlo Bo
 
Jacques louis david
Jacques louis davidJacques louis david
Jacques louis davidKlevan
 
Kick Start your Future Career flyer 20 Sept
Kick Start your Future Career flyer 20 SeptKick Start your Future Career flyer 20 Sept
Kick Start your Future Career flyer 20 SeptJulie Harbert
 
10120140507005
1012014050700510120140507005
10120140507005
IAEME Publication
 
Livro o touro e o homem
Livro o touro e o homemLivro o touro e o homem
Livro o touro e o homem
Jozi Mares
 
Tugas rmk susanti nurul ramadani ch 6
Tugas rmk susanti nurul ramadani ch 6Tugas rmk susanti nurul ramadani ch 6
Tugas rmk susanti nurul ramadani ch 6Erdha Reidha
 
Comparaciones entre buenos aires y puerto madryn
Comparaciones entre buenos aires y puerto madrynComparaciones entre buenos aires y puerto madryn
Comparaciones entre buenos aires y puerto madrynalcira olivera
 

Viewers also liked (20)

La música
La músicaLa música
La música
 
Buscamos faltas
Buscamos faltasBuscamos faltas
Buscamos faltas
 
Colores primarios
Colores primariosColores primarios
Colores primarios
 
Ativ1 4 alinemoraes-tecnologias na minha escola
Ativ1 4 alinemoraes-tecnologias na minha escolaAtiv1 4 alinemoraes-tecnologias na minha escola
Ativ1 4 alinemoraes-tecnologias na minha escola
 
Pspc r5 direcionadores estratégicos ipê rs
Pspc r5 direcionadores estratégicos ipê rsPspc r5 direcionadores estratégicos ipê rs
Pspc r5 direcionadores estratégicos ipê rs
 
JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...
JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...
JORNAL FORÇA JOVEM 4ª Edição (Nucleo Adalberto Studart Filho - Projovem Urban...
 
Madrid
MadridMadrid
Madrid
 
Poster2_Hillary_PhysAct+Cogn (1)
Poster2_Hillary_PhysAct+Cogn (1)Poster2_Hillary_PhysAct+Cogn (1)
Poster2_Hillary_PhysAct+Cogn (1)
 
Presentación1
Presentación1Presentación1
Presentación1
 
I Social Media @UNIURB.IT
I Social Media @UNIURB.ITI Social Media @UNIURB.IT
I Social Media @UNIURB.IT
 
Daniel
DanielDaniel
Daniel
 
Jacques louis david
Jacques louis davidJacques louis david
Jacques louis david
 
Lista do promorar
Lista do promorarLista do promorar
Lista do promorar
 
Kick Start your Future Career flyer 20 Sept
Kick Start your Future Career flyer 20 SeptKick Start your Future Career flyer 20 Sept
Kick Start your Future Career flyer 20 Sept
 
2 disjuntores moldada
2 disjuntores moldada2 disjuntores moldada
2 disjuntores moldada
 
10120140507005
1012014050700510120140507005
10120140507005
 
Livro o touro e o homem
Livro o touro e o homemLivro o touro e o homem
Livro o touro e o homem
 
Tugas rmk susanti nurul ramadani ch 6
Tugas rmk susanti nurul ramadani ch 6Tugas rmk susanti nurul ramadani ch 6
Tugas rmk susanti nurul ramadani ch 6
 
Comparaciones entre buenos aires y puerto madryn
Comparaciones entre buenos aires y puerto madrynComparaciones entre buenos aires y puerto madryn
Comparaciones entre buenos aires y puerto madryn
 
iPad Apple
iPad AppleiPad Apple
iPad Apple
 

Similar to An Area-efficient Montgomery Modular Multiplier for Cryptosystems

Modified montgomery modular multiplier for cryptosystems
Modified montgomery modular multiplier for cryptosystemsModified montgomery modular multiplier for cryptosystems
Modified montgomery modular multiplier for cryptosystems
IAEME Publication
 
Final Project Report
Final Project ReportFinal Project Report
Final Project ReportRiddhi Shah
 
IRJET- VLSI Architecture for Montgomery Modular Multiplication
IRJET-  	  VLSI Architecture for Montgomery Modular MultiplicationIRJET-  	  VLSI Architecture for Montgomery Modular Multiplication
IRJET- VLSI Architecture for Montgomery Modular Multiplication
IRJET Journal
 
A compact FPGA-based montgomery modular multiplier
A compact FPGA-based montgomery modular multiplierA compact FPGA-based montgomery modular multiplier
A compact FPGA-based montgomery modular multiplier
nooriasukmaningtyas
 
Arithmetic Operations in Multi-Valued Logic
Arithmetic Operations in Multi-Valued Logic   Arithmetic Operations in Multi-Valued Logic
Arithmetic Operations in Multi-Valued Logic
VLSICS Design
 
Design and testing of systolic array multiplier using fault injecting schemes
Design and testing of systolic array multiplier using fault injecting schemesDesign and testing of systolic array multiplier using fault injecting schemes
Design and testing of systolic array multiplier using fault injecting schemes
CSITiaesprime
 
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCMIRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET Journal
 
Low cost high-performance vlsi architecture for montgomery modular multiplica...
Low cost high-performance vlsi architecture for montgomery modular multiplica...Low cost high-performance vlsi architecture for montgomery modular multiplica...
Low cost high-performance vlsi architecture for montgomery modular multiplica...
jpstudcorner
 
High Performance MAC Unit for FFT Implementation
High Performance MAC Unit for FFT Implementation High Performance MAC Unit for FFT Implementation
High Performance MAC Unit for FFT Implementation
IJMER
 
Fpga based efficient multiplier for image processing applications using recur...
Fpga based efficient multiplier for image processing applications using recur...Fpga based efficient multiplier for image processing applications using recur...
Fpga based efficient multiplier for image processing applications using recur...
VLSICS Design
 
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMDESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
sipij
 
IJET-V3I1P14
IJET-V3I1P14IJET-V3I1P14
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
IJERD Editor
 
F1074145
F1074145F1074145
F1074145
IJERD Editor
 
Low Power VLSI Design of Modified Booth Multiplier
Low Power VLSI Design of Modified Booth MultiplierLow Power VLSI Design of Modified Booth Multiplier
Low Power VLSI Design of Modified Booth Multiplier
idescitation
 
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET Journal
 
IRJET- Efficient Design of Radix Booth Multiplier
IRJET- Efficient Design of Radix Booth MultiplierIRJET- Efficient Design of Radix Booth Multiplier
IRJET- Efficient Design of Radix Booth Multiplier
IRJET Journal
 

Similar to An Area-efficient Montgomery Modular Multiplier for Cryptosystems (20)

Modified montgomery modular multiplier for cryptosystems
Modified montgomery modular multiplier for cryptosystemsModified montgomery modular multiplier for cryptosystems
Modified montgomery modular multiplier for cryptosystems
 
Final Project Report
Final Project ReportFinal Project Report
Final Project Report
 
IRJET- VLSI Architecture for Montgomery Modular Multiplication
IRJET-  	  VLSI Architecture for Montgomery Modular MultiplicationIRJET-  	  VLSI Architecture for Montgomery Modular Multiplication
IRJET- VLSI Architecture for Montgomery Modular Multiplication
 
A compact FPGA-based montgomery modular multiplier
A compact FPGA-based montgomery modular multiplierA compact FPGA-based montgomery modular multiplier
A compact FPGA-based montgomery modular multiplier
 
Arithmetic Operations in Multi-Valued Logic
Arithmetic Operations in Multi-Valued Logic   Arithmetic Operations in Multi-Valued Logic
Arithmetic Operations in Multi-Valued Logic
 
Design and testing of systolic array multiplier using fault injecting schemes
Design and testing of systolic array multiplier using fault injecting schemesDesign and testing of systolic array multiplier using fault injecting schemes
Design and testing of systolic array multiplier using fault injecting schemes
 
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCMIRJET- The RTL Model of a Reconfigurable Pipelined MCM
IRJET- The RTL Model of a Reconfigurable Pipelined MCM
 
Low cost high-performance vlsi architecture for montgomery modular multiplica...
Low cost high-performance vlsi architecture for montgomery modular multiplica...Low cost high-performance vlsi architecture for montgomery modular multiplica...
Low cost high-performance vlsi architecture for montgomery modular multiplica...
 
High Performance MAC Unit for FFT Implementation
High Performance MAC Unit for FFT Implementation High Performance MAC Unit for FFT Implementation
High Performance MAC Unit for FFT Implementation
 
Fpga based efficient multiplier for image processing applications using recur...
Fpga based efficient multiplier for image processing applications using recur...Fpga based efficient multiplier for image processing applications using recur...
Fpga based efficient multiplier for image processing applications using recur...
 
I1035563
I1035563I1035563
I1035563
 
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORMDESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
DESIGN OF DELAY COMPUTATION METHOD FOR CYCLOTOMIC FAST FOURIER TRANSFORM
 
38 116-1-pb
38 116-1-pb38 116-1-pb
38 116-1-pb
 
IJET-V3I1P14
IJET-V3I1P14IJET-V3I1P14
IJET-V3I1P14
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
F1074145
F1074145F1074145
F1074145
 
Low Power VLSI Design of Modified Booth Multiplier
Low Power VLSI Design of Modified Booth MultiplierLow Power VLSI Design of Modified Booth Multiplier
Low Power VLSI Design of Modified Booth Multiplier
 
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
IRJET- MAC Unit by Efficient Grouping of Partial Products along with Circular...
 
Ax03303120316
Ax03303120316Ax03303120316
Ax03303120316
 
IRJET- Efficient Design of Radix Booth Multiplier
IRJET- Efficient Design of Radix Booth MultiplierIRJET- Efficient Design of Radix Booth Multiplier
IRJET- Efficient Design of Radix Booth Multiplier
 

Recently uploaded

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation & Control
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
ankuprajapati0525
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
Jayaprasanna4
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
VENKATESHvenky89705
 

Recently uploaded (20)

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
Water Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdfWater Industry Process Automation and Control Monthly - May 2024.pdf
Water Industry Process Automation and Control Monthly - May 2024.pdf
 
The role of big data in decision making.
The role of big data in decision making.The role of big data in decision making.
The role of big data in decision making.
 
ethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.pptethical hacking in wireless-hacking1.ppt
ethical hacking in wireless-hacking1.ppt
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
road safety engineering r s e unit 3.pdf
road safety engineering  r s e unit 3.pdfroad safety engineering  r s e unit 3.pdf
road safety engineering r s e unit 3.pdf
 

An Area-efficient Montgomery Modular Multiplier for Cryptosystems

  • 1. Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 www.ijera.com 210 | P a g e An Area-efficient Montgomery Modular Multiplier for Cryptosystems Anizha Radhakrishnan*, Seena George** *(M.tech scholar, Department of ECE, Sree Narayana Gurukulam College of Engineering , Kolenchery, India) ** (Asst. Professor, Department of ECE, Sree Narayana Gurukulam College of Engineering , Kolenchery, India) Abstract RSA is one of the most widely adopted public key algorithms at present and it requires repeated modular multiplications to accomplish the computation of modular exponentiation. A famous approach to implement modular multiplication in hardware circuits is based on the Montgomery modular multiplication algorithm since it has many advantages. To speed up the encryption/decryption process, many high-speed Montgomery modular multiplication algorithms and hardware architectures employ carry-save addition. But CSA based architecture increases the area. In this paper, in order to reduce the area of the CSA based multiplier, an area-efficient algorithm called Double Add Reduce algorithm is introduced. Then the performance analysis of the new design with the previous had done for comparison. Keywords— Area-efficient architecture, Carry save addition, Cryptography, Double add reduce algorithm, Montgomery modular multiplier. I. INTRODUCTION The motivation for studying area-efficient and fast modular multiplication algorithms comes from their application in public key cryptography. Modular exponentiation is a critical operation in RSA [1], Diffie- Hellman key exchange etc. It is a popular operation for encryption in public key cryptosystems. The core operation in this method is repeated modular multiplication. Many algorithms are available for performing fast modular multiplication. These algorithms can be grouped in to two. That is, the first category algorithms perform modular reduction followed by multiplication. Classical, Karatsuba, Schonhage- Strassen and Barrett algorithms are examples of such multiplication and modular reduction algorithms [2], [3]. The second category performs modular reduction combined with multiplication. Montgomery algorithm [4] introduced in 1985, is a famous approach for carrying out fast modular multiplication and belongs to the second category. For a single modular multiplication Montgomery method is disadvantageous. But the advantage comes when it performs modular exponentiation. Montgomery algorithm performs modular multiplication without division. That is, it replaces division with simple bit shift operation. In many public key systems Montgomery method is used for obtaining fast modular multiplication. Many algorithms are introduced to perform Montgomery modular multiplication in a faster way. In this paper, we are introducing a new area reducing architecture for Montgomery modular multiplier. The remainder of this paper is organized as follows: Section II briefly reviews previous algorithms and their architectures for performing Montgomery multiplication. In section III, the mathematics behind the Montgomery algorithm is explained in detail. Then we have introduced a new area efficient Montgomery algorithm in section IV. Section V gives a detailed comparison between proposed method and the previous CSA based modular multiplier. Finally we concluded this paper in section VI. II. PREVIOUS METHODOLOGIES Montgomery multiplication algorithm is the most efficient algorithm available. The main advantage of Montgomery algorithm is that it replaces the division operation with shift operations. During two decades many alternative forms of Montgomery algorithms are introduced. These architectures use carry save addition. The work in [5] and [13] presented two types of Montgomery algorithms which use Carry Save Adder (CSA). One of the two types used four-to-two CSA and the other used five-to-CSA. They had given a brief comparison between these two versions of Montgomery multipliers. They had found that the multiplier using four-to-two CSA architecture has shorter critical path than that of five-to-two CSA multiplier. But extra storage elements and multiplexers are required for 4-to-2 architecture which probably increases the energy consumption. The work in [6] proposed a Montgomery multiplication algorithm using pipelined carry save addition to RESEARCH ARTICLE OPEN ACCESS
  • 2. Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 www.ijera.com 211 | P a g e shorten the critical path delay of five-to-two CSA. This method also required additional pipeline registers and multiplexers which will increase the area. Ming Der Shieh presented [7] a new algorithm for high speed modular multiplication. This new Montgomery multiplier performs modular reduction in a pipelined fashion, so that the critical path delay is reduced from the four-to-two to three-to-two carry- save addition. Then it requires additional pipeline registers to store intermediate values. All the above works were not discussed about the energy consumption. Several previous works [8], [9] have developed techniques to reduce the power/energy consumption of Montgomery multipliers. In [8], some latches named glitch blockers are located at the outputs of some circuit modules to reduce the spurious transitions and the expected switching activities of high fan-out signals in the radix-4 scalable Montgomery multiplier. Shiann Rong Kuang [9] tried to reduce the energy consumption of CSAs and registers in the CSA-based Montgomery multipliers via a new technique. They modified the CSA based Montgomery multiplier algorithm and as a result of this, the number of clock cycles required to complete the multiplication is largely decreased. To achieve further energy reduction, they have adjusted the internal structure of barrel register full adder and then applied the gated clock design technique. But this energy efficient algorithm increased the total area of the design. In order to reduce the area, we are presenting a new algorithm which can modify the CSA based algorithm in [9]. Before introducing the area efficient algorithm, let us see the mathematics behind the Montgomery modular multiplication. III. MONTGOMERY MODULAR MULTIPLICATION Modular multiplication of two integers X and Y, simply performs, Z = X . Y mod M (1) Here X, Y and M are n - bit numbers and M should be greater than X and Y. Instead of computing X. Y, the Montgomery multiplication [10] algorithm computes, Z’ = MP (X, Y, M) = X .Y. 2-n mod M (2) MP denotes Montgomery Product and sometimes the equation (2) can also be represented as shown below. Z’ = MP (X, Y, M) = X .Y. R-1 mod M (3) Here R = 2n and R-1 is the multiplicative inverse of R mod M. That is, R. R-1 mod M = 1 (4) In order to perform Montgomery multiplication the numbers should be converted to the Montgomery domain. The conversion to the Montgomery domain is very simple. The conversion from integer domain to Montgomery domain is shown below. X’ = X. R mod M (5) Y’ = Y. R mod M (6) Here X’ and Y’ are the Montgomery forms of X and Y. After completing the multiplication in Montgomery domain, we have to convert it back to the integer domain to obtain the final result. The conversion from the Montgomery product Z’ to Z is shown below: Z = MP (Z’, 1, M) = Z’ .1. R-1 mod M (7) These conversions are required only at the starting point and ending point. It is important to note here that to find the modular product the Montgomery algorithm does not perform any division. On the other hand, it performs just addition and bit shift operation. The pseudo code [11], [12] for finding Montgomery product S is given below. Algorithm 1 MP1(X,Y,M) { Initially S = 0; for i = 0 to n-1 { S = S + Xi . Y ; If S is odd then S = S + M ; S = S/2; } If S ≥ M then S = S – M; } Here we can see that algorithm requires division by 2. Division by 2 is equivalent to shifting the bits to right which is fast and easy in hardware. Thus we can avoid the division. Only requirement is to perform simple conversion from Montgomery domain done before and after multiplication. IV. PROPOSED METHODOLOGY The work in [9], proposed an energy-efficient four-to-two CSA based Montgomery algorithm. While reducing energy, area of the multiplier is slightly increased. In this paper, we propose an area efficient, fast and low power algorithm called Double Add Reduce (DAR) algorithm which can replace the CSA based multiplier.
  • 3. Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 www.ijera.com 212 | P a g e Fig.1. Block diagram of multiplier based on DAR algorithm Figure 1 shows the block diagram of Montgomery modular multiplier. Here main block is a double add reduce block. 4.1 Double Add Reduce (DAR) algorithm The DAR block computes Montgomery product using DAR algorithm which is shown below: Algorithm 2 MP2(X,Y,M) { S-1 = 0; Y = 2 x Y; For i=0 to n { qi = Si-1 mod 2; (LSB of Si-1) Si = (Si-1 + qi.M + Xi.Y) / 2; } Return Sn; } The multiplicand X is shifted up by 1 bit to simplify the calculation of qi. This implies that the result is a factor of 2 too large, and so the for loop must be executed an additional time to eliminate this factor. 4.2 Architecture The architecture based on DAR algorithm is shown in figure 2. That is, the algorithm can be executed with the help of two adders. To remove the dependency of qi on the addition of Xi.Y and Si-1, Y can be shifted up 1 bit, thus forcing the LSB of Si-1 + Xi. Y to always be zero, as illustrated in figure 2. Fig.2. Architecture based on DAR algorithm However, this extra factor of 2 must be removed by an extra iteration, meaning that the number of clock cycles is equal to n+1. V. RESULT ANALYSIS In order to verify the area efficiency, we have captured our new design in VHDL and implemented into the Xilinx Spartan 3E series of FPGA. For performance comparison, we have also captured the CSA based multiplier design in VHDL and implemented into the FPGA. Following section gives the performance results of these implementations. 5.1 Detailed area analysis During implementation, we have generated device utilization report using Xilinx ISE 8.1i software tool. The design summary report generated by the Xilinx ISE software tool can be used to analyze the area utilized by the two versions of the Montgomery modular multiplier. Table I gives a detailed area analysis of the two implementations. TABLE I. AREA ANALYSIS OF THE TWO IMPLEMENTATIONS CSA based modular multiplier (8- bit) DAR based modular multiplier (8- bit) Number of slice flipflop used 64 23 Number of 4 input LUTs used 144 60 Total equivalent gate count for design 1527 595 Table I shows that proposed DAR based modular multiplier (8- bit) considerably reduces the total area used by the design.
  • 4. Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 www.ijera.com 213 | P a g e 5.2 Timing and power analysis From the simulation results of both CSA multiplier (8 bit) and DAR multiplier (8 bit), we have seen that after applying inputs the later one takes less time for producing the output. For same inputs Modified multiplier gives the result in less time. From the Figure 3, CSA based multiplier takes 2.185 μs for generating the output while from the Figure 4, DAR based multiplier takes only 1.745 μs. After applying inputs, DAR based modular multiplier takes less time to produce output. Fig.3. Simulation result for 4-to-2 CSA based modular multiplier Fig.4. Simulation result for DAR based modular multiplier The power summary provides information about the usage of voltage and other power resources for your entire design or for a single design module. During synthesis we have generated the power data choosing device type Spartan 3 and package tq144. These two multipliers consumes almost same power (18mW). VI. CONCLUSION This paper presented an efficient algorithm and its corresponding architecture to reduce the area without affecting the speed of a Montgomery modular multiplier. Experimental results showed that the proposed method is indeed capable of reducing the area of 8 bit Montgomery modular multiplier up to 61% compared to the previous CSA based design. In addition to this, our new design increased the speed to 20% and having almost same power consumption. Here we cannot observe significant change in power consumptions of the two versions of multipliers. In future, we will try to develop efficient architecture to carry out fast modular multiplication with low energy consumption. VII. ACKNOWLEDGMENT We would like to express our sincere gratitude to the Management and staff, SNG College of Engineering for providing the facilities. REFERENCES [1] Taek-Won Kwon, Chang-Seok You,‖ Two implementation methods of a 1024-bit RSA Cryptoprocessor based on modified Montgomery algorithm‖, 0-7803-6685- 9/01/$10.00 2001 IEEE. [2] Andre Weimerskirch and Christof Paar, ―Generalizations of the Karatsuba Algorithm for Efficient Implementations‖, IEEE Int.Conf. Comput. Design, 2005. [3] Antoon Bosselaers, Rene Govaerts and Joos Vandewalle,‖ Comparison of three modular reduction functions‖, ° Springer-Verlag 1993 oct 25. [4] Peter L. Montgomery, ―Modular multiplication without trivial division”, Mathematics of computation, Vol.44, No. 170. (Apr 1985) ,pp. 519-522. [5] C. McIvor, M. McLoone, and J. V. McCanny, ―Modified Montgomery modular multiplication and RSA exponentiation techniques,‖ IEE Proc. Comput. Digit. Tech., vol. 151, no. 6, pp. 402–408, Nov. 2004. [6] K. Manochehri and S. Pourmozafari, ―Fast Montgomery modular multiplication by pipelined CSA architecture,‖ in Proc. IEEE Int. Conf. Microelectron., Dec. 2004, pp. 144–147. [7] M.-D. Shieh, J.-H. Chen, W.-C. Lin, and H.- H.Wu, ―A new algorithm for high-speed modular multiplication design,‖ IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 56, no. 9, pp. 2009–2019, Sep. 2009. [8] H.-K. Son and S.-G. Oh, ―Design and implementation of scalable low-power Montgomery multiplier,‖ in Proc. IEEE Int. Conf. Comput. Design, 2004, pp. 524–531. [9] Shiann-Rong Kuang, Jiun-Ping Wang,‖ Energy-Efficient High-Throughput Montgomery Modular Multipliers for RSA Cryptosystems‖, IEEE Transactions on very large scale integration (VLSI) systems, vol. 21, no. 11, november 2013. [10] J. C. Neto, A. F. Tenca, and W. V. Ruggiero, ―A parallel k-partition method to perform Montgomery multiplication,‖ in Proc. IEEE Int. Conf. Appl.-Specif. Syst., Arch. Process., Sep. 2011, pp. 251–254.
  • 5. Anizha Radhakrishnan Int. Journal of Engineering Research and Applications www.ijera.com ISSN : 2248-9622, Vol. 4, Issue 9( Version 1), September 2014, pp.210-214 www.ijera.com 214 | P a g e [11] Ambika R, Hamsavahini R, ― A Survey on Hardware Architectures for Montgomery Modular Multiplication Algorithm‖, IJETCAS 13-342; © 2013. [12] Kooroush Manochehri, Saadat Pourmozafari,‖ Montgomery and RNS for RSA hardware implementation‖, Computing and Informatics, Vol. 29, 2010, 849–880. [13] Alan Daly, Ciaran McIvor, William Marnane,‖ Fast Montgomery Modular Multiplication and RSA Cryptographic Processor Architectures‖, November, 2003.