Filters








138 Hits in 2.4 sec

Long Number Bit-Serial Squarers

E. Chaniotakis, P. Kalivas, K.Z. Pekmestzi
17th IEEE Symposium on Computer Arithmetic (ARITH'05)  
doi:10.1109/arith.2005.28 dblp:conf/arith/ChaniotakisKP05 fatcat:aulvyj73njfjdm5dae3mui4zaq

A scalable and high performance elliptic curve processor with resistance to timing attacks

A. Hodjat, D.D. Hwang, I. Verbauwhede
2005 International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II  
The architecture of this processor is based on the Galois Field of GF(2 n ) and the bit-serial field multiplier and squarer are designed.  ...  The point multiplication algorithm (double-add-subtract) is modified so that the processor performs the same operations for every 3 bits of the scalar k independent of the bit pattern of the 3 bits.  ...  In our case we have chosen the bit-serial implementation of the GF multiplier and squarer operations.  ... 
doi:10.1109/itcc.2005.32 dblp:conf/itcc/HodjatHV05 fatcat:sybaz7ma7vdepfep76bk5kwjze

Square-rich fixed point polynomial evaluation on FPGAs

Simin Xu, Suhaib A. Fahmy, Ian V. McLoughlin
2014 Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays - FPGA '14  
The conventional algorithm, referred to as Horner's rule, involves the least number of steps but can lead to increased latency due to serial computation.  ...  By using a squarer design that is more efficient than general multiplication, this can result in polynomial evaluation with a 57.9% latency reduction over Horner's rule and 14.6% over Estrin's method,  ...  While the structure is simple, Horner's rule suffers from a long latency due to the serial arrangement of operations.  ... 
doi:10.1145/2554688.2554779 dblp:conf/fpga/XuFM14 fatcat:ratosehnwzfadcwx7j7qivsoc4

Fast Arithmetic Architectures for Public-Key Algorithms over Galois Fields GF((2n)m) [chapter]

Christof Paar, Pedro Soria-Rodriguez
1997 Lecture Notes in Computer Science  
The approach explores bit parallel arithmetic in the sub eld GF(2 n ), and serial processing for the extension eld arithmetic.  ...  In particular, the number of clock cycles for one eld multiplication, which is the atomic operation in most public-key schemes, can be reduced by a factor of n compared to all other known realizations.  ...  (XOR), registers (in bits), an number of clock cycles for one multiplication, respectively.  ... 
doi:10.1007/3-540-69053-0_25 fatcat:dsfdjlrvsnenvb5sppcv4yb6hu

Fast arithmetic for public-key algorithms in Galois fields with composite exponents

C. Paar, P. Fleischmann, P. Soria-Rodriguez
1999 IEEE transactions on computers  
The approach explores bit parallel arithmetic in the sub eld GF(2 n ), and serial processing for the extension eld arithmetic.  ...  The bit parallel squarer architectures have been completely revised. 1 optimizations are discussed. We provide two di erent approaches to squaring.  ...  The U operand is fed into the architectures in a bit serial manner, most signi cant bit rst.  ... 
doi:10.1109/12.805153 fatcat:bons5kby6ra6tbxw3xrsmovxhu

FPGA High Performance Pipelined Architecture Of Elliptic Scalar Multiplication Over GF(2m) for IOT

Kiran Sulthana S
2017 International Journal for Research in Applied Science and Engineering Technology  
I estimate the maximum number of different bit-width multiplier cores that could be mapped to the Virtex-6 and their performance.  ...  The number of pipeline stages in the architecture also critically affects the computation time. The optimal number of pipeline stages in the design.  ... 
doi:10.22214/ijraset.2017.4058 fatcat:6p3jp245cfhthbhl7r7rhi56ay

Efficient Hardware Implementation Of An Elliptic Curve Cryptographic Processor Over Gf (2 163)

Massoud Masoumi, Hosseyn Mahdizadeh
2012 Zenodo  
field with degree 163 in 11.92 s with the maximum achievable frequency of 251 MHz on Xilinx Virtex-4 (XC4VLX200) while 22% of the chip area is occupied, where G is the digit size of the underlying digit-serial  ...  So, we need to use a 2 to 1 multiplexer that is controlled with the key bits. Therefore, in order to avoid long critical path, another strategy should be considered.  ...  Output of this squarer together with a number of combinational gates such as AND, OR, and NOT gates are connected to the input of the multiplier.  ... 
doi:10.5281/zenodo.1329671 fatcat:g2mvalyzqrdatdihukdywpfcha

A Novel Low-Area Point Multiplication Architecture for Elliptic-Curve Cryptography

Muhammad Rashid, Mohammad Mazyad Hazzazi, Sikandar Zulqarnain Khan, Adel R. Alharbi, Asher Sajid, Amer Aljaedi
2021 Electronics  
The hardware resources are reduced with the use of a bit-serial (traditional schoolbook) multiplication method.  ...  For a pair of m bit polynomial multiplications, m clock cycles are needed in the bit-serial multiplication method.  ...  In this context, there exist four possibilities: bit-serial, digitserial, bit-parallel, and digit-parallel to compute polynomial multiplication.  ... 
doi:10.3390/electronics10212698 fatcat:xk36pjz6y5badliofhmzlq26om

Public Key Cryptography in Sensor Networks—Revisited [chapter]

Gunnar Gaubatz, Jens-Peter Kaps, Berk Sunar
2005 Lecture Notes in Computer Science  
algorithms-Rabin's Scheme and NtruEncrypt-and analyze their architecture and performance according to various established metrics like power consumption, area, delay, throughput, level of security and energy per bit  ...  Squarers can be implemented in many ways. As our main concern is to conserve power we chose a bit-serial approach.  ...  Hence, we built a squarer as a bit-serial multiplier, operating on the entire width of the 512 bit multiplicand and on a single bit of the multiplier at a time.  ... 
doi:10.1007/978-3-540-30496-8_2 fatcat:uhke7a4ng5bgfixn2pohezikia

A 26.9 K 314.5 Mb/s Soft (32400,32208) BCH Decoder Chip for DVB-S2 System

Yi-Min Lin, Chih-Lung Chen, Hsie-Chia Chang, Chen-Yi Lee
2010 IEEE Journal of Solid-State Circuits  
For the high-speed and long-distance data transmission, the BCH codes with long block length are specified to suppress the error floor due to iterative LDPC decoding.  ...  In contrast with the hard BCH decoder, the proposed soft BCH decoder that deals with least reliable bits can provide much lower complexity with similar error-correcting performance.  ...  Notice that, if the error-correcting capability is equal to 1, the number of multipliers and squarers is 0 because only will be computed.  ... 
doi:10.1109/jssc.2010.2065630 fatcat:oqcogags6nbw5iohdru3a6wvrm

Exploration of Design Space in ECDSA [chapter]

Jan Schmidt, Martin Novotný, Martin Jäger, Miloš Bečvář, Michal Jáchim
2002 Lecture Notes in Computer Science  
Polynomial basis inverter and multiplier The multiplication (X * Y ) mod p(x) is implemented by serial add-and-shift algorithm. Three m bits long registers (R w , R y R v ) are used.  ...  The register R x is m + 1 bits long, the remaining three m bits long. At the start, R x holds the field polynomial p(x), R y resp. R w contain the divisor y resp. the dividend X, and R v is cleared.  ... 
doi:10.1007/3-540-46117-5_110 fatcat:k4jooubihjcyvobrxj5gw3hwru

Towards Efficient FPGA Implementation of Elliptic Curve Crypto-Processor for Security in IoT and Embedded Devices

Shaimaa Abu Khadra, Salah Eldin S. E. Abdulrahman, Nabil A. Ismail
2020 Menoufia Journal of Electronic Engineering Research  
The aim is to obtain the optimal registers number for an area optimization of ECCP architecture.  ...  A bit serial multiplier is a good choice for area but a bit parallel is a good choice for time.  ...  occupied Slices Time in ns Bit parallel 163 - 4,123 4.69 Pipeline Digit serial 82 380.12 2,718 5.26 Pipeline Digit serial 42 408.69 1,497 9.78 Bit parallel 409 - 17,560 5 Pipeline Digit serial 205 276.095  ... 
doi:10.21608/mjeer.2020.103280 fatcat:geqy5bsvlvcinm6j2p4inkonoa

Page 458 of IEEE Transactions on Computers Vol. 52, Issue 4 [page]

2003 IEEE Transactions on Computers  
167-bit squarer and is highly optimized for the high-level architecture design and lower gate level design.  ...  But, its is not metic, while being too long for EC operations.  ... 

On Parallelization of High-Speed Processors for Elliptic Curve Cryptography

K. Jarvinen, J. Skytta
2008 IEEE Transactions on Very Large Scale Integration (vlsi) Systems  
A bit-serial multiplier computes one bit of the output per cycle with a single processing block resulting in latency of .  ...  Hence, a bit-serial implementation of the Massey-Omura multiplier requires three -bit shift registers and one -function block.  ... 
doi:10.1109/tvlsi.2008.2000728 fatcat:uzuotsbowvagnkinqhrpu6hlcq

Parabolic synthesis methodology implemented on the sine function

Erik Hertz, Peter Nilsson
2009 2009 IEEE International Symposium on Circuits and Systems  
When analyzing the squarer in Fig. 2 , it was found that the resemblance to a bit-serial squarer [6] [7] is large.  ...  By introducing registers in the design of the bit-serial squarer the partial results of x n 2 is easily extracted.  ... 
doi:10.1109/iscas.2009.5117733 dblp:conf/iscas/HertzN09 fatcat:gw54ldzwbrc5pdksjfprpivdlq
« Previous Showing results 1 — 15 out of 138 results