Merkle–Hellman knapsack cryptosystem

The Merkle–Hellman knapsack cryptosystem was one of the earliest public-key cryptosystems. It was published by Ralph Merkle and Martin Hellman in 1978. Adi Shamir published a polynomial-time attack in 1984, and the cryptosystem is now considered insecure.^[1]^: 465 ^[2]^: 190

History

The concept of public-key cryptography was introduced by Whitfield Diffie and Martin Hellman in 1976.^[3] At that time they proposed the general concept of a "trap-door one-way function", a function whose inverse is computationally infeasible to calculate without some secret "trap-door information", but they had not yet found a practical example of such a function. Other researchers proposed several specific public-key cryptosystems over the next few years, such as RSA in 1977 and Merkle–Hellman in 1978.^[4]

Description

Merkle–Hellman is a public-key cryptosystem, meaning that two keys are used: a public key for encryption and a private key for decryption. It is based on the subset sum problem (a special case of the knapsack problem).^[5] The problem is as follows: given a set of integers $A$ and an integer $c$, find a subset of $A$ which sums to $c$. In general, this problem is known to be NP-complete. However, if $A$ is superincreasing, meaning that each element of the set is greater than the sum of all the smaller elements of the set, the problem is "easy" and solvable in polynomial time with a simple greedy algorithm.
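The superincreasing condition can be checked with a single pass that keeps a running sum. The following is a minimal sketch; the function name `is_superincreasing` is an illustrative choice, and it assumes the sequence is given in increasing order.

```python
def is_superincreasing(seq):
    """Return True if every element exceeds the sum of all elements before it."""
    running_sum = 0
    for value in seq:
        if value <= running_sum:
            return False
        running_sum += value
    return True


print(is_superincreasing([1, 2, 4, 8, 16]))  # each term exceeds the sum so far
print(is_superincreasing([1, 2, 3, 8, 16]))  # 3 is not greater than 1 + 2
```

Because the running sum starts at zero, this check also requires every element to be positive.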

In Merkle–Hellman, decrypting a message requires solving an apparently "hard" knapsack problem. The private key contains a superincreasing list of numbers $W$, and the public key contains a non-superincreasing list of numbers $B$, which is actually a "disguised" version of $W$. The private key also contains some "trapdoor" information that can be used to transform a hard knapsack problem using $B$ into an easy knapsack problem using $W$.

Unlike some other public-key cryptosystems such as RSA, the two keys in Merkle–Hellman are not interchangeable; the private key cannot be used for encryption. Thus Merkle–Hellman is not directly usable for authentication by cryptographic signing, although Shamir published a variant that can be used for signing.^[6]

Key generation

1. Choose a block size $n$. Integers up to $n$ bits in length can be encrypted with this key.

2. Choose a random superincreasing sequence of $n$ positive integers $W=(w_{1},w_{2},\dots ,w_{n})$. The superincreasing requirement means that $w_{k}>\sum _{i=1}^{k-1}w_{i}$ for $1<k\leq n$.

3. Choose a random integer $q$ such that $q>\sum _{i=1}^{n}w_{i}$.

4. Choose a random integer $r$ such that $\gcd(r,q)=1$ (that is, $r$ and $q$ are coprime).

5. Calculate the sequence $B=(b_{1},b_{2},\dots ,b_{n})$, where $b_{i}=rw_{i}{\bmod {q}}$.

The public key is $B$ and the private key is $(W,q,r)$.
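The five steps can be sketched in Python as follows. This is an illustrative sketch rather than a reference implementation: the function name `generate_keypair`, the ranges from which the gaps, $q$ and $r$ are drawn, and the restriction to $n\geq 2$ are all assumptions made here, since the description only requires that the values be random and satisfy the stated conditions.

```python
import math
import secrets


def generate_keypair(n):
    """Return (public_key, private_key) for n-bit blocks.

    public_key is the list B; private_key is the tuple (W, q, r).
    """
    if n < 2:
        raise ValueError("this sketch assumes a block size of at least 2")

    # Step 2: build W so that each element exceeds the sum of all earlier ones.
    # The gap range [1, 2**n] is an arbitrary choice for illustration.
    w = []
    total = 0
    for _ in range(n):
        next_w = total + 1 + secrets.randbelow(2 ** n)
        w.append(next_w)
        total += next_w

    # Step 3: q must be greater than the sum of W (upper bound chosen arbitrarily).
    q = total + 1 + secrets.randbelow(total)

    # Step 4: draw r from [2, q - 1] until it is coprime to q.
    while True:
        r = 2 + secrets.randbelow(q - 2)
        if math.gcd(r, q) == 1:
            break

    # Step 5: disguise W as B.
    b = [(r * w_i) % q for w_i in w]
    return b, (w, q, r)


public_key, private_key = generate_keypair(8)
print(public_key)
```

The toy parameter sizes here are for readability only; the scheme is broken regardless of parameter size.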

Encryption

Let $m$ be an $n$-bit message consisting of bits $m_{1}m_{2}\dots m_{n}$, with $m_{1}$ the highest-order bit. Select each $b_{i}$ for which $m_{i}$ is nonzero, and add them together. Equivalently, calculate $c=\sum _{i=1}^{n}m_{i}b_{i}$.

The ciphertext is $c$.
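Encryption is a single weighted sum, as the following sketch shows. The function name `encrypt` and the choice to pass the message as a Python integer are assumptions for illustration; bit $m_{1}$ is taken as the highest-order bit, as in the text.

```python
def encrypt(message, public_key):
    """Encrypt an n-bit integer as the sum of the b_i selected by its bits."""
    n = len(public_key)
    if not 0 <= message < 2 ** n:
        raise ValueError("message must fit in n bits")
    c = 0
    for i, b_i in enumerate(public_key, start=1):
        m_i = (message >> (n - i)) & 1  # m_1 is the highest-order bit
        c += m_i * b_i
    return c
```

Note that the ciphertext is an ordinary integer sum, not reduced modulo anything, so it can be larger than any single element of the public key.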

Decryption

To decrypt a ciphertext $c$, we must find the subset of $B$ which sums to $c$. We do this by transforming the problem into one of finding a subset of $W$. That problem can be solved in polynomial time since $W$ is superincreasing.

1. Calculate the modular inverse of $r$ modulo $q$, $r':=r^{-1}{\bmod {q}}$, using the extended Euclidean algorithm. The inverse exists since $r$ is coprime to $q$. The computation of $r'$ is independent of the message, and can be done just once when the private key is generated.

2. Calculate $c':=cr'{\bmod {q}}$.

3. Solve the subset sum problem for $c'$ using the superincreasing sequence $W$, by the simple greedy algorithm described below. Let $X=(x_{1},x_{2},\dots ,x_{k})$ be the resulting list of indexes of the elements of $W$ which sum to $c'$. (That is, $c'=\sum _{i=1}^{k}w_{x_{i}}$.)

4. Construct the message $m$ with a 1 in each $x_{i}$ bit position and a 0 in all other bit positions: $m=\sum _{i=1}^{k}2^{n-x_{i}}$.
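The four decryption steps might be sketched as below. The function name `decrypt` is an illustrative choice. Two simplifications are assumed: the inverse is obtained with Python's built-in `pow(r, -1, q)` (available from Python 3.8) rather than a hand-written extended Euclidean algorithm, and steps 3 and 4 are merged by scanning $W$ from its largest element downward and setting the message bit as soon as an element is taken, which selects the same elements as the greedy algorithm for a superincreasing sequence.

```python
def decrypt(c, private_key):
    """Recover the n-bit message from ciphertext c using (W, q, r)."""
    w, q, r = private_key
    n = len(w)

    r_inv = pow(r, -1, q)        # step 1: modular inverse of r modulo q
    c_prime = (c * r_inv) % q    # step 2: move to the easy knapsack

    message = 0
    for i in range(n, 0, -1):    # steps 3 and 4: largest element first
        if w[i - 1] <= c_prime:
            c_prime -= w[i - 1]
            message |= 1 << (n - i)  # index i corresponds to bit weight 2^(n-i)

    if c_prime != 0:
        raise ValueError("ciphertext does not correspond to any subset of W")
    return message
```

In practice `r_inv` would be computed once and stored with the private key, as the text notes.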

Solving the subset sum problem

This simple greedy algorithm finds the subset of a superincreasing sequence $W$ which sums to $c'$, in polynomial time:

1. Initialize $X$ to an empty list.

2. Find the largest element in $W$ which is less than or equal to $c'$, say $w_{j}$.

3. Subtract: $c':=c'-w_{j}$.

4. Append $j$ to the list $X$.

5. Remove $w_{j}$ from the superincreasing sequence $W$.

6. If $c'$ is greater than zero, return to step 2.
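The steps above translate almost line for line into Python. In this sketch the function name `solve_superincreasing` is an assumption, indexes are 1-based to match the text, and an explicit error is raised when no element fits, a case the step list does not address.

```python
def solve_superincreasing(w, target):
    """Return the 1-based indexes X of the elements of w that sum to target."""
    remaining = target
    indexes = []                                  # step 1
    candidates = list(enumerate(w, start=1))      # (j, w_j) pairs still in W
    while remaining > 0:                          # step 6
        usable = [(j, w_j) for j, w_j in candidates if w_j <= remaining]
        if not usable:
            raise ValueError("no subset of w sums to target")
        j, w_j = max(usable, key=lambda pair: pair[1])  # step 2
        remaining -= w_j                          # step 3
        indexes.append(j)                         # step 4
        candidates.remove((j, w_j))               # step 5
    return indexes
```

Each pass removes one element, so the loop runs at most $n$ times.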

Example

Key generation

Create a key to encrypt 8-bit numbers by creating a random superincreasing sequence of 8 values: $W=(2,7,11,21,42,89,180,354)$.

The sum of these is 706, so select a larger value for $q$: $q=881$.

Choose $r$ to be coprime to $q$: $r=588$.

Construct the public key $B$ by multiplying each element in $W$ by $r$ modulo $q$:

$$\begin{aligned}&(2*588){\bmod {881}}=295\\&(7*588){\bmod {881}}=592\\&(11*588){\bmod {881}}=301\\&(21*588){\bmod {881}}=14\\&(42*588){\bmod {881}}=28\\&(89*588){\bmod {881}}=353\\&(180*588){\bmod {881}}=120\\&(354*588){\bmod {881}}=236\end{aligned}$$

Hence $B=(295,592,301,14,28,353,120,236)$.

Encryption

Let the 8-bit message be $m=97=01100001_{2}$. We multiply each bit by the corresponding number in $B$ and add the results:

$$c=0*295+1*592+1*301+0*14+0*28+0*353+0*120+1*236=1129$$

The ciphertext $c$ is 1129.

Decryption

To decrypt 1129, first use the extended Euclidean algorithm to find the modular inverse of $r$ mod $q$: $r'=r^{-1}{\bmod {q}}=588^{-1}{\bmod {881}}=442$.

Compute $c'=cr'{\bmod {q}}=1129*442{\bmod {881}}=372$.

Use the greedy algorithm to decompose 372 into a sum of $w_{i}$ values:

$$\begin{aligned}c'&=372\\&w_{8}=354\leq 372\\c'&=372-354=18\\&w_{3}=11\leq 18\\c'&=18-11=7\\&w_{2}=7\leq 7\\c'&=7-7=0\end{aligned}$$

Thus $372=354+11+7=w_{8}+w_{3}+w_{2}$, and the list of indexes is $X=(8,3,2)$. The message can now be computed as $m=\sum _{i=1}^{3}2^{n-x_{i}}=2^{8-8}+2^{8-3}+2^{8-2}=1+32+64=97$.
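The arithmetic of this worked example can be reproduced with a short script. This is a checking aid only; the variable names are illustrative, and `pow(r, -1, q)` (Python 3.8 or later) stands in for the extended Euclidean algorithm.

```python
W = [2, 7, 11, 21, 42, 89, 180, 354]
q, r = 881, 588
n = len(W)

# Key generation: disguise W as the public key B.
B = [(r * w) % q for w in W]

# Encryption of m = 97, highest-order bit first.
bits = [int(ch) for ch in format(97, f"0{n}b")]
c = sum(bit * b for bit, b in zip(bits, B))

# Decryption: undo the multiplication by r, then solve the easy knapsack.
r_inv = pow(r, -1, q)
c_prime = (c * r_inv) % q
X = []
remaining = c_prime
for j in range(n, 0, -1):          # largest element of W first
    if W[j - 1] <= remaining:
        remaining -= W[j - 1]
        X.append(j)
m = sum(2 ** (n - x) for x in X)

print(B)        # [295, 592, 301, 14, 28, 353, 120, 236]
print(c)        # 1129
print(r_inv)    # 442
print(c_prime)  # 372
print(X)        # [8, 3, 2]
print(m)        # 97
```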

Cryptanalysis

In 1984 Adi Shamir published an attack on the Merkle–Hellman cryptosystem which can decrypt encrypted messages in polynomial time without using the private key.^[7] The attack analyzes the public key $B=(b_{1},b_{2},\dots ,b_{n})$ and searches for a pair of numbers $u$ and $m$ (here $m$ denotes a modulus, not the message) such that $(ub_{i}{\bmod {m}})$ is a superincreasing sequence. The $(u,m)$ pair found by the attack may not be equal to $(r',q)$ in the private key, but like that pair it can be used to transform a hard knapsack problem using $B$ into an easy problem using a superincreasing sequence. The attack operates solely on the public key; no access to encrypted messages is necessary.

Shamir's attack on the Merkle–Hellman cryptosystem works in polynomial time even if the numbers in the public key are randomly shuffled, a step which is usually not included in the description of the cryptosystem, but can be helpful against some more primitive attacks.
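To make the goal of the attack concrete, the sketch below searches for usable $(u,m)$ pairs for the toy public key from the example by exhaustive search. This is emphatically not Shamir's algorithm, which runs in polynomial time; brute force is only feasible here because the numbers are tiny. The function names, the search bound, and the extra condition that the transformed sequence sums to less than the modulus (which guarantees the transformed ciphertext equals the easy subset sum exactly) are assumptions of this sketch.

```python
def is_superincreasing(seq):
    """True if every element exceeds the sum of all elements before it."""
    total = 0
    for value in seq:
        if value <= total:
            return False
        total += value
    return True


def find_trapdoor_pairs(public_key, max_modulus):
    """Brute-force every (u, modulus) that turns public_key into an easy knapsack."""
    pairs = []
    for modulus in range(2, max_modulus + 1):
        for u in range(1, modulus):
            easy = [(u * b) % modulus for b in public_key]
            if is_superincreasing(easy) and sum(easy) < modulus:
                pairs.append((u, modulus))
    return pairs


def decode_with_pair(c, u, modulus, public_key):
    """Decrypt c using only the public key and a found (u, modulus) pair."""
    n = len(public_key)
    easy = [(u * b) % modulus for b in public_key]
    remaining = (u * c) % modulus
    message = 0
    for i in range(n, 0, -1):      # largest element first
        if easy[i - 1] <= remaining:
            remaining -= easy[i - 1]
            message |= 1 << (n - i)
    return message


B = [295, 592, 301, 14, 28, 353, 120, 236]
pairs = find_trapdoor_pairs(B, 881)
print((442, 881) in pairs)  # the private pair (r', q) is among the results
print(all(decode_with_pair(1129, u, mod, B) == 97 for u, mod in pairs))
```

Any pair the search returns decrypts correctly, whether or not it matches the private key, which is the property the attack relies on.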

References

1. Schneier, Bruce (1996). Applied Cryptography. New York: John Wiley & Sons. ISBN 0-471-12845-7.
2. Stinson, Douglas R. (1995). Cryptography: Theory and Practice. Boca Raton: CRC Press. ISBN 0-8493-8521-0.
3. Diffie, Whitfield; Hellman, Martin (1976). "New directions in cryptography". IEEE Transactions on Information Theory. 22 (6): 644. CiteSeerX 10.1.1.37.9720. doi:10.1109/TIT.1976.1055638.
4. Merkle, Ralph; Hellman, Martin (1978). "Hiding information and signatures in trapdoor knapsacks". IEEE Transactions on Information Theory. 24 (5): 525–530. doi:10.1109/TIT.1978.1055927.
5. Cherowitzo, William (2002-03-02). "Merkle-Hellman Knapsack Cryptosystem". Math 5410 - Modern Cryptology. Retrieved 2019-08-18.
6. Shamir, Adi (July 1978). "A Fast Signature Scheme". MIT Laboratory for Computer Science Technical Memorandum. 79 (MIT/LCS/TM–107): 15240. Bibcode:1978STIN...7915240S.
7. Shamir, Adi (1984). "A polynomial-time algorithm for breaking the basic Merkle–Hellman cryptosystem". IEEE Transactions on Information Theory. 30 (5): 699–704. doi:10.1109/SFCS.1982.5.