I need to calculate the speed difference between performing a Montgomery Multiplication page 602-603 with a word-size/register of size 32 vs. 64.
So far, this is what I understand:
- x and y are represented by multiple-word arrays of length n where n = m/w and w is the register size (either 32 or 64).
- The total number of single-digit multiplications in Montgomery multiplication is n*(2 + 2*n), where n represents the number length of the word-arrays.
- I will assume that the multiplication of two single-digit takes 1 clock cycle on each of the computers.
How can I put all this together to represent the number of clock cycles needed in Montgomery multiplication on a computer with a 32-bit register or 64-bit register?