Score:1

Crypto

Rabin-Miller Primality Test - Elaboration needed

Kevin Stefanov

6/11/24, 8:45 AM

In short, my question is:

What exactly do people mean when they say that "The more you apply the Rabin-Miller test to a number, the more certain you can be that the number you're testing is prime."?

To clarify what I'm asking, let's look at an example I was working through:

Testing if N = 78007 is prime or not (spoiler, it is).

Rabin-Miller procedure:

Find K and an odd M such that N - 1 = 2^K * M

In this case, 78007 - 1 = 2¹*39003, which means K = 1 and M = 39003

Pick an A, such that 2 <= A <= N-2. We pick A = 2.
Compute B₀ = A^M % N.

If B₀ = 1 or B₀ = (N-1), then N is prime (probably). If it's anything else, continue with the next step.

Else, begin computing terms of B_i = (B_{i-1})² % N for i = 1, ..., K-1, and: If B_i = 1, N is not prime. If B_i = N - 1, N is prime (probably). If no B_i up to i = K-1 equals N - 1, N is not prime.

Now, for our running example:

B₀ = 2³⁹⁰⁰³ % 78007 = 1.

Which means N is probably prime.
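
For reference, one round of the procedure above can be sketched in Python roughly as follows. This is a minimal sketch that assumes an odd N > 3; the function name `rabin_miller_round` is made up for illustration.

```python
def rabin_miller_round(n: int, a: int) -> bool:
    """Return True if odd n > 3 is 'probably prime' for base a, False if composite."""
    # Write n - 1 = 2^k * m with m odd.
    k, m = 0, n - 1
    while m % 2 == 0:
        k += 1
        m //= 2

    # B_0 = A^M mod N
    b = pow(a, m, n)
    if b == 1 or b == n - 1:
        return True

    # B_i = (B_{i-1})^2 mod N for i = 1 .. K-1
    for _ in range(k - 1):
        b = (b * b) % n
        if b == n - 1:
            return True
        if b == 1:
            return False  # reached 1 without passing through N - 1
    return False  # never hit N - 1, so N is composite for certain


# The running example: K = 1, M = 39003, A = 2.
print(rabin_miller_round(78007, 2))
```

A `False` result is a definite "composite"; a `True` result only says that this particular A found no evidence against N.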

HERE IS MY QUESTION: What would I proceed to do next if I want to be more certain that N is prime?

Would I try steps 2 and 3 with different A's?

Or would I keep the same A and proceed with the B_i terms until one of the B_i's turns out to be equal to 1? In which case the test says that N is not prime? If I reach such a B_i term, would that mean N is definitely not prime, or the more i's it took to get there, the more likely it is that N is actually prime?

I ask because I read a statement that said "If you do the Rabin-Miller test about 20 times on a big-ish number, you can be pretty certain it's prime." What exactly did they mean?

Thanks everyone in advance for any answers that clear it up for me.

Side note: I came across this primality test while looking for ways to generate a random 4000-bit prime number as part of implementing the Diffie-Hellman key exchange. I have written my own custom Big Integer library in C, in which I will be implementing Diffie-Hellman. I implemented the Sieve of Eratosthenes but it was so damn slow. People say this primality test is way faster and can lead to high enough certainty that a number is prime for said number to be used in encryption schemes. If anyone knows of a better way, please let me know. I am no expert in encryption, but I really love implementing stuff on my own; it teaches me a lot more than simply using existing libraries for everything.

I tried reading explanations on several websites and asked on programming discord servers, watched several youtube videos but nobody was able to clarify this particular question to me. It's still unclear to me what exactly it means to "do the test 20 times on a number to be more certain that said number is prime".

number-theory

prime-numbers

fgrieu

6/11/24, 8:59 AM

Hint: do the same experiment with N=2047 (which is not prime: 2047=23×89), or any N in OEIS's [A001262](https://oeis.org/A001262) or [A014233](https://oeis.org/A014233).
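
A quick way to run that experiment in Python (a sketch; since 2047 - 1 = 2 × 1023, K = 1 and only B₀ matters):

```python
N = 2047                 # 23 * 89, so composite
M = (N - 1) // 2         # N - 1 = 2^1 * 1023, hence K = 1

for A in (2, 3):
    B0 = pow(A, M, N)    # B_0 = A^M mod N
    verdict = "probably prime" if B0 in (1, N - 1) else "composite"
    print(f"A = {A}: B0 = {B0} -> {verdict}")
```

Base 2 is fooled here (2^11 = 2048 ≡ 1 mod 2047 and 1023 = 11 × 93, so B₀ = 1), while base 3 reports "composite".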

0

Richard Thiessen

6/11/24, 9:01 AM

First off, consider not implementing DH in a large prime field. Elliptic curve cryptography works, is well studied and has small code size. Secondly, re-using a prime isn't actually a bad thing. You can just hard-code a large prime as the DH parameter to use. Here's an RFC defining some good ones https://www.ietf.org/rfc/rfc3526.txt .

2

j.p.

6/12/24, 6:05 AM

The answer to "here is my question" is "more A's".

0

Kevin Stefanov

6/12/24, 9:33 AM

Thank you everyone! I did some more N's and kinda answered my own question after several hours of trying different N's, A's and B's. Now the thing that struck me is, if I want to apply Rabin-Miller to a 4000-bit number, I'd have to compute 2 to the power of a number that is itself nearly 4000 bits long (M = (N-1)/2^K is almost as large as N when K is small). Wondering whether that's gonna be practical or if it's gonna turn into a disaster like the Sieve of Eratosthenes did. Might do what you guys said and simply hard-code a large prime for DH, or if things get desperate enough, not implement DH at all and look at elliptic curve cryptography.

0

garfunkel

6/12/24, 11:45 AM

@KevinStefanov: When calculating $a^e \bmod m$ for a big exponent $e$, be sure to reduce modulo $m$ after each squaring or multiplication during the exponentiation. (It's a good exercise; if you use a "modern" programming language like Python, modular exponentiation is already built in.)
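
In Python, for instance, the built-in three-argument `pow` does this reduction internally. A small sketch; the 4000-bit modulus below is an arbitrary odd number chosen only to show the sizes involved, not a prime:

```python
# Small case from the question: the full power 2**39003 has 39004 bits,
# but pow(2, 39003, 78007) never builds it.
assert pow(2, 39003, 78007) == (2 ** 39003) % 78007

# Large case: computing 2**e outright is hopeless for an exponent this big,
# yet the modular version only needs a few thousand multiplications.
m = (1 << 3999) + 1          # arbitrary odd 4000-bit modulus (illustrative)
e = m // 3                   # an exponent with almost 4000 bits
r = pow(2, e, m)

# The result, like every intermediate value, stays below the modulus.
print(r.bit_length() <= m.bit_length())
```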

0

Kevin Stefanov

6/13/24, 12:17 PM

@garfunkel That was actually a worry I had. So what you're saying is, I can check the result after each multiplication and if it got bigger than M, subtract M from it?

0

garfunkel

6/15/24, 10:24 AM

@KevinStefanov: By multiplying two 1000-bit numbers you will get a 2000-bit number, so a single subtraction of the 1000-bit modulus M is not enough to get the intermediate result down to 1000 bits again; you would have to subtract M very many times, which is to say you have to reduce the result modulo M (take the remainder of the division by M).

0

Kevin Stefanov

7/1/24, 9:06 AM

@j.p. How many A's would be sufficient to check to be pretty sure a number is prime if the number has 5000 bits? For numbers of a few digits I noticed just a few A's are enough. Is that also the case for numbers with thousands of digits?

0

j.p.

7/2/24, 9:04 AM

The longer the primes you are looking for, the fewer MR-tests you have to perform. For primes of length 2048 (or longer), passing two MR-tests gives you an error probability of less than $2^{-100}$. For details look at Table B.1 or Appendix C.1 in [this standard](https://nvlpubs.nist.gov/nistpubs/FIPS/NIST.FIPS.186-5.pdf).

0

Kevin Stefanov

7/2/24, 6:14 PM

@j.p. Passing two different MR-tests means the algorithm said "it's probably prime" for two different A values? Also, side question, is my understanding correct that for Diffie-Hellman to be secure, the big prime you feed it has to have a large prime factor itself?

0

j.p.

7/6/24, 5:11 AM

Yes, repeating MR-tests with the same prime wouldn't make much sense ;-). About DH: A prime has itself as its only prime factor. (You phrased your question wrongly.) But if you subtract one from the big prime, then you get the order of the multiplicative group mod the prime, in which you are working, and this order has to have a big prime factor.

0

Kevin Stefanov

7/8/24, 7:54 PM

@j.p. wait, A has to be prime? Also, is the converse true as well, like for 3000-bit numbers, if it says "composite" for two different A's BEFORE it says "probably prime" for 2 different A's, then we can be really sure the number is composite?

0

j.p.

7/9/24, 7:48 AM

@KevinStefanov: Sorry! No! No clue, why I wrote prime. Just forget it!

0

Score:2

Crypto

poncho

6/12/24, 3:32 PM

I'll answer your questions about Rabin-Miller, as well as put in a few comments about Diffie-Hellman. You have other questions in your comments; you can ask those in other questions (or just search through crypto.stackexchange for the answers - you aren't the first one to ask them)

As for Rabin-Miller, it has this property:

If $N$ is prime, then the result of the algorithm is always "probably prime", no matter what value of $A$ you selected
If $N$ is composite, then the result of the algorithm is "composite" for at least 75% (3/4) of the possible $A$ values within the range.

That is, if your $N$ is composite, and you selected $A$ randomly, then with probability at least 0.75, the algorithm will say "composite".
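
For small $N$ this property can be checked by brute force. The following is only an illustrative sketch (the helper names are made up); it counts the fraction of $A$ values in the range $2 \le A \le N-2$ for which one round says "probably prime":

```python
def passes_round(n: int, a: int) -> bool:
    """One Rabin-Miller round for odd n > 3: True means 'probably prime'."""
    k, m = 0, n - 1
    while m % 2 == 0:
        k, m = k + 1, m // 2
    b = pow(a, m, n)
    if b in (1, n - 1):
        return True
    for _ in range(k - 1):
        b = b * b % n
        if b == n - 1:
            return True
    return False


def probably_prime_fraction(n: int) -> float:
    """Fraction of bases A in [2, n - 2] that report 'probably prime'."""
    bases = range(2, n - 1)
    return sum(passes_round(n, a) for a in bases) / len(bases)


for n in (101, 561, 2047):   # 101 is prime; 561 and 2047 are composite
    print(n, probably_prime_fraction(n))
```

For the prime 101 the fraction is exactly 1; for the two composites it stays at or below 1/4, as the property promises.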

And, if you run the algorithm $k$ times, and select a random $A$ each time, then with probability at least $1 - 1/4^k$, the algorithm will say "composite" at least once.

These bounds hold for any composite $N$. Note that they are guaranteed only if you pick the $A$ values at random; if you use a fixed value for $A$ (say, 2) the proofs do not apply.

That is what is meant by "running Rabin-Miller multiple times, you get a higher assuredness of primality".
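
As a rough sketch of what "running it $k$ times" looks like in code (names and the default of 20 rounds are illustrative assumptions, not a vetted implementation):

```python
import random


def passes_round(n: int, a: int) -> bool:
    """One Rabin-Miller round for odd n > 3: True means 'probably prime'."""
    k, m = 0, n - 1
    while m % 2 == 0:
        k, m = k + 1, m // 2
    b = pow(a, m, n)
    if b in (1, n - 1):
        return True
    for _ in range(k - 1):
        b = b * b % n
        if b == n - 1:
            return True
    return False


def is_probably_prime(n: int, rounds: int = 20) -> bool:
    if n < 5 or n % 2 == 0:
        return n in (2, 3)
    for _ in range(rounds):
        a = random.randrange(2, n - 1)   # fresh random A with 2 <= A <= N - 2
        if not passes_round(n, a):
            return False                 # a single "composite" verdict is final
    return True                          # every round said "probably prime"


print(is_probably_prime(78007))
print(4.0 ** -20)   # worst-case chance that a composite survives 20 rounds
```

Note that `random` is not a cryptographically secure generator; for the adversarial setting discussed below, Python's `secrets` module would be the more appropriate source of $A$ values.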

That said, it is sometimes overly conservative. If you were handed a value $N$ from an adversary, and you are asked to test it for primality, and he "wins" if he gets you to accept a composite value, then this is the best we can do with Rabin-Miller; there exist composite numbers for which the probability of a false acceptance is very close to 1/4. On the other hand, such values are actually fairly rare - if you are testing 'random' values for primality, the vast majority of composite values will hardly ever come up with 'probably prime', and so it's safe in that scenario to use a far smaller number of iterations.

Now, on to Diffie-Hellman: it is often not safe to just use a random prime. The reason is that the security of Diffie-Hellman depends on the factorization of $N-1$; if you just pick a random prime $N$, $N-1$ may have a number of small factors (in addition to the factor 2 which is always present), and (depending on how you use it; how you select $G$ and the size of the private exponents you use), this can subvert security.

If you don't know what you're doing, you're better off using the RFC 3526 primes (which are known not to have any small factors other than 2)
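
To make the concern concrete with toy numbers: one common way to rule out small factors is to use a "safe prime", i.e. a prime $N$ for which $(N-1)/2$ is also prime. That term and the helper names below are not from the text above; they are assumptions for this sketch, and trial division is only suitable for tiny numbers.

```python
def is_prime_small(n: int) -> bool:
    """Trial division; fine for toy sizes, useless for real DH parameters."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def is_safe_prime(n: int) -> bool:
    """True if n is prime and (n - 1) / 2 is prime as well."""
    return is_prime_small(n) and is_prime_small((n - 1) // 2)


print(is_safe_prime(23))      # 23 - 1 = 2 * 11
print(is_safe_prime(78007))   # 78007 - 1 = 2 * 3 * 13001
```

The prime 78007 from the question is an example of the problem: $N-1$ has the small factor 3 in addition to 2.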

Kevin Stefanov

6/13/24, 12:27 PM

Thank you for the detailed answer! One worry I have currently is, during Rabin-Miller, I have to compute (2^N mod K) where N is a 3000-bit number itself. So apparently there's a technique called modular exponentiation, which saves you the effort of having to compute 2 to the power of a 3000-bit number. Can you explain, please? I'm currently watching videos on that and asking around. Do I simply check if 2^N has gotten larger than K after each multiplication, and if it has, subtract K from it?

0

poncho

6/13/24, 12:49 PM

@KevinStefanov For an efficient way to do modexp, one place to start is https://en.wikipedia.org/wiki/Modular_exponentiation#Right-to-left_binary_method
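
A minimal Python sketch of that right-to-left binary method (the function name `mod_exp` is made up here; a C version on a custom big-integer type would follow the same loop):

```python
def mod_exp(base: int, exponent: int, modulus: int) -> int:
    """Compute base**exponent % modulus by square-and-multiply."""
    if modulus == 1:
        return 0
    result = 1
    base %= modulus
    while exponent > 0:
        if exponent & 1:                      # current lowest bit is set
            result = (result * base) % modulus
        base = (base * base) % modulus        # square for the next bit
        exponent >>= 1                        # move on to the next bit
    return result


print(mod_exp(2, 39003, 78007))   # B_0 from the question's running example
```

The `% modulus` after each multiplication is a full remainder operation, not a single conditional subtraction: the product of two values below the modulus can be almost as large as the modulus squared. The loop runs once per bit of the exponent, so a 3000-bit exponent costs at most about 6000 modular multiplications.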

0
