"Reverse" Reed-Solomon error correction, given prefix of input

Jonathan Lee

8/13/23, 3:16 PM

I have a string $S$ of length (say) 34, that I know the first (say) 24 bytes of, but not the last 10. I also have the 10-byte error correcting code $RS_{44,34}(S)$ in full. Do I have any hope of recovering $S$?

The amount of information of $S$ that I'm missing far exceeds the theoretical guarantee of Reed-Solomon (which I think in this case is 3 bytes), but at the same time, there's $2^{80}$ possible values for the unknown portion of $S$, and also $2^{80}$ possible outputs for the error correction. If we were to iterate over all possible values for the unknown portion of $S$, I would naively expect approximately 1 of them to match the error correction. But $2^{80}$ is too much to brute force.

Are there any techniques that could recover (or at least reduce the state space for) an input, given its Reed-Solomon EC? Is there any reason to think one way or the other that RS is cryptographically secure in this sense?

For background, the "real world" application here is that I have a QR code (version 2, L-level EC) where I don't have the main data bits, but I do have the EC bits. I know that the data is a URL on a particular domain, thus the prefix.

0 + 0

hash

Score:1

Crypto

poncho

8/13/23, 4:21 PM

Are there any techniques that could recover (or at least reduce the state space for) an input, given its Reed-Solomon EC?

You are in luck - Reed-Solomon is a linear code, that is, the error correcting section is a linear function of the input, and in fact, by fixing all but 10 bytes of the input, it is a bijection.

What this means is that you can efficiently reconstruct the missing 10 bytes of the input; Gaussian elimination would work, and while there are likely to be more efficient algorithms available, Gaussian elimination would take approximately $10^3$ operations, and so that could be efficient enough.

0 + 0

Score:0

Crypto

kodlu

8/13/23, 4:48 PM

Not all codes have the nice property that RS codes have, which is that every $k$ symbols are an information set which can be used to reconstruct the original codeword.

There are efficient erasure decoders out there. Erasure means that some symbols are unknown, not just errored.

For example, this recent paper presents an efficient decoding algorithm for RS codes over the finite field with $q$ elements, $\mathbb{F}_q$ with $q=2^m,$ in $O(q \log_q q^2)$ time. It uses walsh transforms to do Lagrange interpolation.

0 + 0

Elon Musk

I sit in a Tesla and translated this thread with Ai:

EN: "Reverse" Reed-Solomon error correction, given prefix of input

TH: การแก้ไขข้อผิดพลาด "ย้อนกลับ" รีด-โซโลมอน ให้คำนำหน้าอินพุต

RO: Corectarea erorii „Reverse” Reed-Solomon, dat fiind prefixul de intrare

RU: «Обратное» исправление ошибок Рида-Соломона с заданным префиксом ввода

VI: Sửa lỗi Reed-Solomon "đảo ngược", đã cho tiền tố đầu vào

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.