Eric writes about math and life

Finite fields: an interesting construction

In the last post I introduced some facts about finite fields and we played around with the field with 28 elements. This post will assume a bit more background about finite fields, but as always the key facts are: for each prime number p and integer d1 there is a unique up to isomorphism field with pd elements, and that field can be constructed as 𝔽p[x]/(g(x)) where g(x) is a degree d irreducible polynomial.

Today's we'll look at a seemingly bewildering alternative construction of field of size 22n, and figure out how it relates to the more standard polynomial quotient construction.

Conway's construction

In his book On Numbers and Games Conway gave the following construction of finite fields Cn of size 22n.

Elements: the elements of Cn are the natural numbers 0,1,2,,22n1.

Addition: for xy, the sum of 2x and 2y is just the natural number 2x+2y. The sum of 2x with itself is 0. In other words, natural numbers are added by taking the XOR of their base 2 expansions.

Multiplication: is defined by determining the products of 2-powers of 2, i.e. 22x times 22y. If xy then the product is defined to be the natural number 22x+2y. Note that this is actually guaranteed to be less than 22n if the original elements are distinct and less than 22n! The product of 22x with itself is defined to be the natural number 3222x where we interpret the multiplication and division in that formula with normal rational number arithmetic, i.e. it is 22x+22x1. Assuming ring axioms (associativity, commutativity, distributivity) this defines multiplication for all elements of Cn by writing a natural number in its binary expansion, and also taking binary expansions of the powers appearing in the powers of 2.

As a particular example: the field C2 has 222=16 elements, the integers 0 through 15. Multiplication is determined using binary expansions and the rules 2×2=3 and 4×4=6!

If you've taken a more textbook route to learning about finite fields (as I had!) then this construction seems bonkers! It is not even obvious that these operations produce a field. How can this possibly work? The more targeted question, since every finite field can be realized through the quotient construction, how does this construction relate to the more usual one with quotients of polynomial rings?

Artin-Schreier polynomials

In a field F of characteristic p, the Artin-Schreier polynomial associated to an element a is xpxa. These polynomials turn out to be easy to understand compared to general polynomials and thus useful for many constructions.

Lemma: if b is a root of the Artin-Schreier polynomial xpxa, then b+1,b+2,,b+p1 are all roots of that polynomial.

Proof: the key here is to think about the function xpx as a linear map, since we're in a field of characteristic p. Note that this map is linear because taking p-th powers (Frobenius!) is linear in characteristic p. By Fermat's Little Theorem we know that xp=x for any x in 𝔽p (i.e. x=0,1,2,,p1). Therefore for any i in 𝔽p we have that

(b+i)p(b+i)=bp+ipbi=bpb=a.

In particular, this concrete understanding of the roots of Artin-Schreier polynomials allows us to prove their irreducibility in many cases.

Lemma: an Artin-Schreier polynomial xpxa is irreducible over 𝔽 if and only if it has no roots in 𝔽.

Proof: one direction is immediate, if xpxa has any root in 𝔽 then the above lemma gives us a complete factorization into linear terms.

Let's suppose that xpxa factors as f(x)g(x) over 𝔽, where each of f(x) and g(x) has degree <p. If f(x) has degree d, the coefficient of xd1 in f(x) is (up to a minus sign) a sum of the roots of f(x), which are a subset of the roots of xpxa. By the previous lemma we know those roots are of the form b+i for i in 𝔽p, so their sum is of the form db+j for some j in 𝔽p. Since f(x) has coefficients in 𝔽 by assumption and 1d<p we deduce that b itself is in 𝔽.

Polynomial realization of Conway's construction

Conway's construction has immediate inclusions CnCn+1 for all n. Since increasing n by one squares the size of the field, it is reasonable to want to directly construct Cn+1 as a quadratic extension of Cn, i.e. construct it by adjoining a root of an irreducible quadratic polynomial to Cn.

Let's do this with Artin-Schreier polynomials. We're looking for an irreducible quadratic polynomial x2+x+a where a is in the field Cn (note that + and are the same since we're in characteristic 2). We know from above that this polynomial is irreducible when a is not of the form x2+x, i.e. when this polynomial has no roots in Cn.

The map x2+x from Cn to itself is linear over 𝔽2, and the kernel of this linear map is the 1-dimensional subspace {0,1}. By rank-nullity that means the image has size 22n1, so in particular there are lots of elements which aren't in the image of this map and so could be used to make irreducible quadratic Artin-Schreier extensions.

Moreover, we can directly pin down which elements aren't in the image. The basic idea is to look at the highest degree terms in the binary expansion of x2+x. The square of 2n will be of the form 2n plus lower degree terms. We'll write 2n as a product of 22ni, square each term individually to get a product of (22ni+22ni1) and expand. Note that in squaring an x=xi2i we get xi(2i)2 since squaring is linear in characteristic 2 (Frobenius again!). So the highest degree term in the binary expansion of x2 is the same as that of x itself, and these cancel out in the sum x2+x. In particular if we take an element of Cn having the highest bit set, i.e. an element 22n1, we are guaranteed that it is not in the image of the map x2+x on Cn.

So we can build a quadratic extension Cn+1 of Cn by adjoining a root of the quadratic polynomial x2+x+a for any a in Cn which is 22n1. Conway's construction uses the choice of the "smallest" (in the normal integer sense) value of a. To create Cn+1 we're adding to Cn an element which satisfies x2+x+22n1=0. Looking back at our definition of the multiplication operation this is exactly the element 22n of Cn+1: it satisfies (22n)2=22n+22n1.