A New Public Key System

A New Public Key System With Signature And Master Key Functions

T. Moh Lecture Notes at EE Department of Stanford University in Oct 1998

1 : Introduction

I thank Prof. D. Allison for inviting me to speak here.

In this talk we will introduce a new public key system, the "Tame Transformation Method" or TTM. One should not view "TTM" as an abbreviation of "Time To Market" or of "Time To Money". Many years ago, I wrote a textbook, "Algebra", for graduate students, in which I mentioned RSA as a cute example of number theory. In the summer of 1995, Dr. John M. Acken of Intel came to visit my family and raised the question of a fast public key system. I reviewed my book and realized that classically people glued a big chunk of characters together, say 128 characters of 8 bits each, to get 1024 bits. One then viewed this huge datum as a binary whole number and manipulated such 1024-bit numbers in modular arithmetic. It is naturally slow to manipulate those numbers. The situation is essentially the same for ECC and other group-theoretic systems.

Perhaps I should mention that the public key system is not a mathematical problem. Most mathematicians will dismiss any discussion of public key systems, since they can be solved in finitely many steps.

Mathematics or not, the public key system is interesting in itself. Let a₁,...,a₁₂₈ be 128 characters, each an 8-bit number. The natural way is not to glue them together. We shall treat them as a point a=(a₁,...,a₁₂₈) (a so-called "plaintext") in 128-dimensional space. To scramble it, we simply apply a map f to the 128-dimensional space and get a new point b=(b₁,...,b₁₂₈) (a so-called "ciphertext"). For the convenience of computation, we require that

(a) both the value f(a) and the inverse value f^-1(b) can be computed easily.
(b) a composition of a few of the maps should be hard to decompose, and its inverse hard to recover.
(c) it should be user-friendly.

There are ready candidates: "Tame Automorphisms" (see below). Furthermore, we require the coefficient field of the 128-dimensional space to be the finite field GF(2⁸) (see below). Thus all computations will be fast.

The beauty of Tame Automorphisms is that after composing a few of them, the resulting map loses all appearance of a Tame Automorphism (cf. the example below). For a technical reason, we shall select the space of plaintexts to be a subspace of this space (of ciphertexts). Then we have a fast public key system.

2 : Mathematical background

This is the first time the theory of Tame Automorphisms has been applied to provide a public key system. We shall explain every term used in this lecture.

(a): Finite Field.

We shall discuss the concept of finite fields. The finite field GF(2^m) of 2^m elements is the collection of m-bit numbers (a₁,a₂,...,a_m), where the a_i are zeroes or ones. The sum of two m-bit numbers is taken bitwise, while the product depends on the defining irreducible polynomial and can be carried out by an LSR (linear shift register) or by looking up a table.
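As a minimal sketch of the arithmetic just described, the following Python fragment adds and multiplies in GF(2⁸) by shift-and-reduce and precomputes a lookup table. The lecture does not say which irreducible polynomial TTM uses; the polynomial x⁸+x⁴+x³+x+1 (the one used by AES) is an assumption made only for illustration, as are the names gf_add, gf_mul and MUL_TABLE.

```python
# Sketch of GF(2^8) arithmetic. ASSUMPTION: the defining polynomial is
# x^8 + x^4 + x^3 + x + 1 (0x11B, the AES polynomial); the lecture does not
# state which irreducible polynomial TTM actually uses.
IRREDUCIBLE = 0x11B


def gf_add(a, b):
    """Addition of two 8-bit field elements is bitwise XOR."""
    return a ^ b


def gf_mul(a, b):
    """Shift-and-reduce product, the software analogue of a shift register."""
    result = 0
    while b:
        if b & 1:            # lowest bit of b set: add the current multiple of a
            result ^= a
        a <<= 1              # multiply a by x
        if a & 0x100:        # degree reached 8: reduce modulo the polynomial
            a ^= IRREDUCIBLE
        b >>= 1
    return result


# Table-lookup alternative: all 256 x 256 products computed once.
MUL_TABLE = [[gf_mul(a, b) for b in range(256)] for a in range(256)]

print(hex(gf_add(0x57, 0x83)), hex(gf_mul(0x57, 0x83)))
```

With the table in place, a product costs one memory access (MUL_TABLE[a][b]) rather than a loop, which is the trade-off between the two methods named above.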

(b): Affine Space.

Let K be a field, say GF(2^m). Let K^n+r be the affine space of dimension n+r over K. Note that an "affine space" K^n+r is a vector space without the algebraic structure and the origin, i.e., the "physical space". We prefer an affine space over a vector space because (1) we need to move the origin around, (2) we shall consider non-linear maps such as polynomial maps.

(c): Tame Automorphism.

A linear transformation f is a map of the following form,

f(x_i) = a_i1 x₁ + ... + a_i(n+r) x_{n+r} + b_i

where a_ij and b_i are elements in K. A linear transformation f is said to be invertible if the coefficient matrix (a_ij) is invertible.

Definition: We define a tame automorphism f_i=(f_i,1,..., f_i,n+r) as either an invertible linear transformation, or of the following form in any order of variables x₁,...,x_n+r with polynomials h_i,j,

(1): f_i,1(x₁,...,x_{n+r}) = x₁ + h_i,1(x₂,...,x_{n+r}) = y₁

(2): f_i,2(x₁,...,x_{n+r}) = x₂ + h_i,2(x₃,...,x_{n+r}) = y₂

..........................

(j): f_i,j(x₁,...,x_{n+r}) = x_j + h_i,j(x_{j+1},...,x_{n+r}) = y_j

..........................

(n+r): f_i,n+r(x₁,...,x_{n+r}) = x_{n+r} = y_{n+r}

Example: Let K=GF(2), f(x₁, x₂, x₃) =(x₁+x₂x₃, x₂ +x₃², x₃), g(x₁, x₂, x₃) =(x₁, x₂, x₃+x₁²) be two tame automorphisms. Then it is easy to see that f²(x₁, x₂, x₃)=(x₁+x₃³, x₂, x₃) and fg(x₁, x₂, x₃) =(x₁+x₂x₃ +x₁²x₂, x₂+x₃² +x₁⁴, x₃+x₁²) .
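The two compositions in this example can be checked mechanically. The sketch below is only an illustration: it represents a polynomial over GF(2) as a set of monomials and performs the substitution directly. The helper names (add, mul, power, substitute, compose) are hypothetical, and the product fg is read as "apply g first, then f", which is the reading that reproduces the formulas above.

```python
# Polynomials over GF(2) in x1, x2, x3, stored as frozensets of monomials.
# A monomial is a tuple of exponents (e1, e2, e3). Coefficients are 0 or 1,
# so adding two polynomials is the symmetric difference of their sets.
ONE = frozenset({(0, 0, 0)})
X1 = frozenset({(1, 0, 0)})
X2 = frozenset({(0, 1, 0)})
X3 = frozenset({(0, 0, 1)})


def add(p, q):
    return p ^ q


def mul(p, q):
    out = set()
    for a in p:
        for b in q:
            out ^= {tuple(i + j for i, j in zip(a, b))}  # equal terms cancel mod 2
    return frozenset(out)


def power(p, k):
    result = ONE
    for _ in range(k):
        result = mul(result, p)
    return result


def substitute(p, images):
    """Replace x1, x2, x3 in p by the three polynomials in `images`."""
    total = frozenset()
    for mono in p:
        term = ONE
        for image, exponent in zip(images, mono):
            term = mul(term, power(image, exponent))
        total = add(total, term)
    return total


def compose(outer, inner):
    """Group product outer*inner: apply `inner` first, then `outer`."""
    return tuple(substitute(component, inner) for component in outer)


f = (add(X1, mul(X2, X3)), add(X2, power(X3, 2)), X3)
g = (X1, X2, add(X3, power(X1, 2)))

f_squared = compose(f, f)
fg = compose(f, g)
print(sorted(f_squared[0]))  # monomials of the first component of f^2
print(sorted(fg[0]))         # monomials of the first component of fg
```

Even in this tiny case the triangular shape of f and g is no longer visible in fg, which is the effect the scheme relies on.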

The group generated by all tame automorphisms is called the tame automorphism group. Note that the group product is the composition of maps, i.e., substitution (cf the previous example), which is different from the product of polynomials. The following proposition and its corollaries will be given without proofs.

Proposition 1: Let a tame automorphism f_i be defined as in the preceding paragraph. We have the inverse f_i^-1 =(f_i,1^-1,...,f_i,n+r^-1) with x_n+r=f_i,n+r^-1(y₁,..., y_n+r)=y_n+r and x_j= f_i,j^-1(y₁,...,y_n+r)= y_j-h_i,j(f_i,j+1^-1(y₁, ..., y_n+r),...,f_i,n+r^-1(y₁,..., y_n+r)), for j=n+r-1,...,1.

For instance, in the case of four variables, we have the inverse polynomial map f_i^-1 in the following abstract general form in terms of variables,

f_i,4^-1(y₁,...,y₄) = y₄
f_i,3^-1(y₁,...,y₄) = y₃ - h_i,3(y₄)
f_i,2^-1(y₁,...,y₄) = y₂ - h_i,2(y₃ - h_i,3(y₄), y₄)
f_i,1^-1(y₁,...,y₄) = y₁ - h_i,1(y₂ - h_i,2(y₃ - h_i,3(y₄), y₄), y₃ - h_i,3(y₄), y₄)

In general, the total degree of f_i,j^-1(y₁,..., y_n+r) increases very fast and the number of terms can be quite large. As can be shown, the number of terms in g^-1 in our scheme is greater than 10²⁵⁴. Therefore it is impractical to actually write down the inverse map. However, if a point (y'₁,..., y'_n+r) is given, the value of the inverse map can be readily computed in the following special form in terms of numbers.

Corollary 2: Given a set of values (y'₁,..., y'_n+r) in K^n+r and a tame automorphism f_i as in the Definition of this section, the values (x'₁,..., x'_n+r) =(f_i,1^-1(y'₁,...,y'_n+r),..., f_i,n+r^-1(y'₁,...,y'_n+r)) can be found by induction: first, we have x'_n+r =f_i,n+r^-1(y'₁,...,y'_n+r) =y'_n+r; then, once x'_j+1,..., x'_n+r in K are known, we have x'_j =f_i,j^-1(y'₁,...,y'_n+r) =y'_j-h_i,j(x'_j+1,..., x'_n+r) for j=n+r-1,...,1.
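The induction in Corollary 2 is ordinary back-substitution. The following sketch, under stated assumptions, evaluates a small triangular map and its inverse at a point without ever writing down the inverse polynomials. The map (x₁+x₂x₃, x₂+x₃², x₃) is borrowed from the Example but taken here over GF(2⁸); the reduction polynomial 0x11B and all function names are assumptions for illustration only.

```python
def gf_mul(a, b, poly=0x11B):
    """GF(2^8) product; the polynomial 0x11B is an assumption for illustration."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= poly
        b >>= 1
    return result


# Hypothetical example map: f(x1, x2, x3) = (x1 + x2*x3, x2 + x3^2, x3).
# h[j] receives the tail (x_{j+1}, ..., x_n); the last h is identically zero.
h = [
    lambda tail: gf_mul(tail[0], tail[1]),
    lambda tail: gf_mul(tail[0], tail[0]),
    lambda tail: 0,
]


def apply_triangular(h, x):
    """Forward map: y_j = x_j + h_j(x_{j+1}, ..., x_n)."""
    return [x[j] ^ h[j](x[j + 1:]) for j in range(len(x))]


def invert_triangular(h, y):
    """Corollary 2: recover x_n first, then x_{n-1}, ..., x_1 in turn."""
    n = len(y)
    x = [0] * n
    for j in range(n - 1, -1, -1):
        # In characteristic 2, subtraction is the same as addition (XOR).
        x[j] = y[j] ^ h[j](x[j + 1:])
    return x


x = [0x12, 0x34, 0x56]
y = apply_triangular(h, x)
print(invert_triangular(h, y) == x)
```

Each coordinate of the inverse costs one evaluation of h_j, so inverting at a point is as cheap as applying the map, even though the symbolic inverse may have an enormous number of terms.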

Corollary 3: Given the decomposition g=f₁...f_s where the f_i are tame automorphisms, we have g^-1=f_s^-1... f₁^-1. Furthermore, if a set of values (y'₁,..., y'_n+r) is given, then we have g^-1(y'₁,..., y'_n+r) =f_s^-1...f₁^-1 (y'₁,..., y'_n+r).

3 : Theory of automorphism groups

There is a long history of studying `automorphism groups' for affine spaces K^n+r and `embedding theory' in mathematics, and there are thousands of papers on those subjects. The theory of automorphism groups for K² was established by W. Van der Kulk in 1953, who showed that the automorphism group of K² is the tame automorphism group, i.e., any automorphism of K² can be written as a canonical product of tame automorphisms. The most famous problem in this area is the fifty-eight-year-old Jacobian Conjecture for 2-dimensional space. For embedding theory, the simplest case, i.e., the (algebraic) embedding of an affine line into an affine plane in characteristic 0, had been an open problem for forty years. It was solved in a joint paper in 1972 by S. Abhyankar and T. Moh using difficult and long arguments. In the case of fields of characteristic p (which includes the case of finite fields), the embedding problem is open for n=1 and n+r=2. There are some conjectures formulated by T. Moh in 1988. They are beyond the scope of the present talk.

There is an abyss between our knowledge of the automorphism group of K² and that of K^n+r for n+r greater than or equal to 3. In these cases, every element g in the tame automorphism group has, by definition, a factorization into a product of tame automorphisms; however, there is no known way to find it. In 1972, a very good mathematician, M. Nagata, constructed an automorphism g for n+r=3. One cannot decide whether g is in the tame automorphism group, since there is no theorem for the above factorization. By contrast, one can show that the square root of 2 is not rational because we know the factorization theorem for integers.

4 : Principle or Algorithm

Let m, n, r, s be positive integers. Let K be a finite field of 2^m elements. Let f_s,...,f₂, f₁ be s tame (equivalently, triangular) automorphisms, which are elementary and easily computable, of the (n+r)-dimensional affine space K^n+r. Let the composition automorphism be g=f_s...f₂f₁. The automorphism g and some of the f_i's will be hidden.

Let the restriction of g to the n dimensional subspace be g'=(h₁,..., h_n+r): Kⁿ--> K^n+r. The field K and the polynomial map (h₁,..., h_n+r) will be announced as the public key.

Given a plaintext (x'₁,...,x'_n) in Kⁿ, let y'_i=h_i(x'₁,...,x'_n), then the ciphertext will be (y'₁,...,y'_n+r).

Given tame automorphisms f_i and a ciphertext (y'₁,..., y'_n+r), it is easy to find f_i^-1 (y'₁,..., y'_n+r). Therefore, the plaintext can be recovered by taking (x'₁,...,x'_n,0...0) =f₁^-1 f₂^-1...f_s^-1( y'₁,..., y'_n+r). The private key will be the set of maps {f₁^-1,..., f_s^-1}.
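To make the principle concrete, here is a toy sketch with n=2, r=1 and s=4 over GF(2⁸). It is not the TTM construction itself: the four maps, the constants and the reduction polynomial 0x11B are all hypothetical choices, and the sender here simply runs the hidden maps in order, whereas in the real scheme the sender evaluates only the published polynomials h_i, which give the same values.

```python
def gf_mul(a, b, poly=0x11B):
    """GF(2^8) product; the polynomial 0x11B is an assumption for illustration."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= poly
        b >>= 1
    return result


# Hypothetical maps on K^3. f1 and f4 are invertible (affine) linear maps,
# f2 and f3 are triangular in opposite orders of the variables.
def f1(x):
    x1, x2, x3 = x
    return (x2 ^ 0x53, x1 ^ x3 ^ 0xCA, x3 ^ 0x07)

def f1_inv(y):
    y1, y2, y3 = y
    x3 = y3 ^ 0x07
    return (y2 ^ 0xCA ^ x3, y1 ^ 0x53, x3)

def f2(x):
    x1, x2, x3 = x
    return (x1 ^ gf_mul(x2, x3), x2 ^ gf_mul(x3, x3), x3)

def f2_inv(y):
    y1, y2, y3 = y
    x2 = y2 ^ gf_mul(y3, y3)
    return (y1 ^ gf_mul(x2, y3), x2, y3)

def f3(x):
    x1, x2, x3 = x
    return (x1, x2 ^ gf_mul(x1, x1), x3 ^ gf_mul(x1, x2))

def f3_inv(y):
    y1, y2, y3 = y
    x2 = y2 ^ gf_mul(y1, y1)
    return (y1, x2, y3 ^ gf_mul(y1, x2))

def f4(x):
    x1, x2, x3 = x
    return (x1 ^ x2, x2 ^ x3, x3)

def f4_inv(y):
    y1, y2, y3 = y
    x2 = y2 ^ y3
    return (y1 ^ x2, x2, y3)


N = 2  # plaintext dimension n; the ciphertext lives in K^(n+r) with r = 1


def encrypt(plain):
    """Embed the plaintext as (x1, x2, 0) and apply g = f4 f3 f2 f1."""
    point = tuple(plain) + (0,)
    return f4(f3(f2(f1(point))))


def decrypt(cipher):
    """Apply f1^-1 f2^-1 f3^-1 f4^-1 and keep the first n coordinates."""
    point = f1_inv(f2_inv(f3_inv(f4_inv(cipher))))
    return point[:N]


cipher = encrypt((0x41, 0x42))
print(cipher, decrypt(cipher))
```

The inverses are applied in the opposite order to the maps themselves, and the padding coordinate comes back as 0 for any genuine ciphertext.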

5 : Implementation Scheme

We will give a report of an implementation (for complete details, see TTM 1.9) for the case n=64, n+r=100. In our implementation, let the field K be GF(2⁸), the finite field of 2⁸ elements. We will build four tame automorphisms f₄, f₃, f₂, f₁. The maps f₄, f₁ provided by the user are invertible linear transformations. The composition f₃f₂ =(q*₁,...,q*₁₀₀), which is provided by the software, will have the following properties,

(1) all components q*_i of it are polynomials in 64 variables of degree 2.
(2) the degree 2 homogeneous parts of the q*_i's are linearly independent.
(3) no polynomial in q*_i's of degree less than 8 will generate a power of any polynomial of degree 1.

Furthermore, we require the linear transformation f₁ to move the origin (0,...,0) to a point (b₁,...,b₆₄) where all the b_i are nonzero, and the linear transformation f₄ to make the composition f₄f₃f₂f₁ fix the origin. The reason is that the linear forms of the q*_i's will then not form a linear transformation of the vector space K⁶⁴. The purpose of this requirement is to safeguard the linear terms from an attack using linear algebra.

6 : Plaintexts, Users and Compactness

Let us count the possible number of plaintexts. Since the number of plaintexts is the number of choices for x'₁,...,x'₆₄, each ranging over the 2⁸ elements of K, we see that there are (2⁸)⁶⁴ = 2⁵¹² such plaintexts.

Of equal importance to having a large number of possible plaintexts is having many possible users. To allow for many such users, we first get an expression for this number in terms of 8 and 64. This amounts to counting the number of automorphisms g of the form g=f₄f₃f₂f₁. Assuming that a negligible proportion of these automorphisms g have more than one representation g=f₄f₃f₂f₁ =f'₄f'₃f'₂f'₁, the number of users is asymptotic to (choices for f₄)(choices for f₁). We may use the number of invertible linear transformations as an estimate for f₄ and f₁. The number of invertible linear transformations f₁ is greater than 2³³²⁷⁹. A similar count for f₄ shows that the total possible number of users is greater than 2¹¹⁴⁰⁷⁹.
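These bounds can be reproduced with exact integer arithmetic. The sketch below rests on an assumption about how the count was made: the quoted figures are consistent with counting invertible maps x → Ax+b (a matrix A together with a translation b) on K⁶⁴ for f₁ and on K¹⁰⁰ for f₄. The function name affine_count is hypothetical.

```python
from math import prod

q = 2 ** 8  # number of elements of K = GF(2^8)


def affine_count(dim):
    """Invertible maps x -> Ax + b on K^dim: q^dim translations times |GL(dim, q)|."""
    q_dim = q ** dim
    # |GL(dim, q)| = (q^dim - 1)(q^dim - q)...(q^dim - q^(dim-1))
    return q_dim * prod(q_dim - q ** i for i in range(dim))


plaintexts = q ** 64
choices_f1 = affine_count(64)    # ASSUMPTION: f1 counted as a map of K^64
choices_f4 = affine_count(100)   # ASSUMPTION: f4 counted as a map of K^100

print(plaintexts == 2 ** 512)
print(choices_f1 > 2 ** 33279)
print(choices_f1 * choices_f4 > 2 ** 114079)
```

Under this reading, the count for f₁ falls just below 2³³²⁸⁰ and the product just below 2¹¹⁴⁰⁸⁰, which agrees with the two lower bounds stated in the text.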

Now let us look at the compactness of the scheme. We have 100 quadratic polynomials in 64 variables. It is easy to see that the number of terms of polynomials of degree 2 is (67)(64)/2! and we have 100 polynomials, therefore the total number of terms is 214,400 (for another software implementation TTM 2.8, the number is 20,736). This is the size of the public key. We believe that the number can be further reduced. The expense to the sender is mainly in evaluating the 100 polynomials. Note that our scheme can also be computed in a parallel way. Thus the process can be sped up several hundred fold. On the receiver's side, the total number of terms for f₁, f₂, f₃, f₄ is 17,000 (the corresponding number for TTM 2.8 is 4,800). This is the size of the private key. The legitimate receiver needs to evaluate f₁^-1f₂^-1 f₃^-1f₄^-1 (according to Corollaries 2 & 3) which is not expensive.
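The public key size quoted above follows from a short count. The sketch below assumes, as one reading of the formula (67)(64)/2, that each polynomial has no constant term (the composition fixes the origin), so only linear and quadratic monomials are counted; the variable names are illustrative.

```python
n_vars, n_polys = 64, 100

linear_terms = n_vars                           # x_1, ..., x_64
quadratic_terms = n_vars * (n_vars + 1) // 2    # x_i * x_j with i <= j
terms_per_poly = linear_terms + quadratic_terms

# Same value as the closed form (67)(64)/2 used in the text.
print(terms_per_poly, terms_per_poly * n_polys)
```

Since each coefficient is one element of GF(2⁸), the number of terms is also the public key size in bytes.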

7 : Technical Report

Following the principle of this talk, there are several software implementations. For convenience of discussion, the method will be called the "tame transformation method" (TTM). There are versions TTM 1.9 (of this talk), TTM 2.1, TTM 2.3, TTM 2.5 and TTM 2.8, programmed in C. The rates of expansion of data are 1.4, 1.56, 1.63, 1.5 and 2.6 respectively. They have been used on the various machines listed below:

250 MHz PowerPC 750 (G3)
200 MHz PowerPC 604e w/1024K cache
225 MHz PowerPC 603e w/256K cache
167 MHz UltraSPARC w/512K cache
167 MHz Pentium w/512K cache

The software TTM 2.1, 2.3, and 2.5 ran on the 200 MHz PowerPC 604e (w/1024K cache, virtual memory off) and the software TTM 2.8 ran on the 250 MHz PowerPC 750 (G3). Their speeds are listed below:

TTM 1.9, 94,939 bit/sec
TTM 2.1, 106,224 bit/sec
TTM 2.3, 207,000 bit/sec
TTM 2.5, 300,000 bit/sec
TTM 2.8, 617,363 bit/sec

The implementation speed depends on the speed of computing a*b+c where a, b and c are in the 8-bit finite field. For TTM 2.8, every 108 repetitions of this computation process one bit of information, and each computation requires 3.75 cycles. It is interesting to note that the newly announced Motorola "AltiVec" chip will provide "table walks" and parallel computation, and will thus speed up the process.
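As a quick consistency check of these figures, the cycle counts can be turned into a throughput estimate. This is only back-of-the-envelope arithmetic on the numbers quoted in the text (250 MHz clock, 108 operations per bit, 3.75 cycles per operation); it ignores memory and I/O overhead.

```python
clock_hz = 250_000_000   # 250 MHz PowerPC 750 (G3), the TTM 2.8 test machine
ops_per_bit = 108        # a*b+c operations needed per bit processed
cycles_per_op = 3.75     # cycles per a*b+c operation

cycles_per_bit = ops_per_bit * cycles_per_op
bits_per_second = clock_hz / cycles_per_bit

print(cycles_per_bit, round(bits_per_second))
```

The estimate of roughly 617,000 bit/sec is in line with the measured TTM 2.8 speed of 617,363 bit/sec.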

The decrypting speed is in general 4 to 15 times faster than the encrypting speed. The PC software TTM 2.8 is faster than a possible hardware implementation of RSA 1024. In the opinion of a certain expert, a couple of added instructions for finite field multiplication in the chip architecture would increase the speed of software implementations at least 10 to 16 times. If this were done, our software implementations would reach a few million bits per second on a PC.

It is possible to encrypt voice communications (64,000 bit/sec) with this software on an ordinary PC. In comparison, the RSA toolkit BSAFE 3.0 for 1024 bits runs at 7 Kbit/sec. It is conceivable that a hardware implementation, using finite field multiplication and parallel computing, would approach the speed of the fastest hardware implementation of triple DES 128.

8 : Useful Properties of the Scheme

The scheme has three useful properties, whose complete details are given separately: an error-detection function, a master key function, and signatures.

9 : Security

The security of the system rests in part on the difficulty of finding the map g from the partial information provided by the map g', and there is no known way to recover the private key {f₄, f₃, f₂, f₁} from the public key g' and the field K. There are three other direct ways to attack the scheme: (1) use the `inverse formula' for power series, (2) determine the polynomial p_i(=x_i) of {y_i} with indeterminate coefficients for all i by experiments, or (3) use the `resultant'. At this moment, the above three direct methods are ineffective.

The other methods one may use are "search for polynomial relations" and "identify the highest homogeneous parts". It can be shown that they are ineffective.

Finally we may use the "brute force attack" as follows. The attacker has the 100 public polynomials {h₁,...,h₁₀₀}. The attacker then assigns random values from the field K to {x₁,...,x₆₄} and checks whether they reproduce the given ciphertext. It is easy to see that the chance of success is one in 10¹⁵³. Assuming it takes only one clock cycle to test whether a set of 64 random numbers is correct, the attacker still needs 3×10¹³⁹ MIPS-years (MIPS: one million instructions per second) to crack the scheme. In comparison, it requires 3×10²⁰ MIPS-years to crack RSA 2048.
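The brute-force estimate can be reproduced as follows. The sketch assumes one trial per instruction and a 365-day year; it suggests that the figure 10¹⁵³ is a round lower bound, since the exact number of candidate plaintexts, 256⁶⁴ = 2⁵¹², is somewhat larger.

```python
keyspace = 256 ** 64                      # candidate plaintexts (x_1, ..., x_64)
seconds_per_year = 365 * 24 * 3600
trials_per_mips_year = 10 ** 6 * seconds_per_year

decimal_exponent = len(str(keyspace)) - 1     # keyspace is about 1.3 * 10^154
mips_years = 10 ** 153 / trials_per_mips_year

print(decimal_exponent)
print(f"{mips_years:.1e}")
```

Starting from the rounded figure 10¹⁵³ gives about 3×10¹³⁹ MIPS-years, matching the text; starting from the exact 2⁵¹² would give a figure roughly thirteen times larger.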