Finite fields: an interesting construction
In the last post I introduced some facts about finite fields and we played around with the field with elements. This post will assume a bit more background about finite fields, but as always the key facts are: for each prime number and integer there is a unique up to isomorphism field with elements, and that field can be constructed as where is a degree irreducible polynomial.
Today's we'll look at a seemingly bewildering alternative construction of field of size , and figure out how it relates to the more standard polynomial quotient construction.
Conway's construction
In his book On Numbers and Games Conway gave the following construction of finite fields of size .
Elements: the elements of are the natural numbers .
Addition: for , the sum of and is just the natural number . The sum of with itself is . In other words, natural numbers are added by taking the XOR of their base 2 expansions.
Multiplication: is defined by determining the products of -powers of , i.e. times . If then the product is defined to be the natural number . Note that this is actually guaranteed to be less than if the original elements are distinct and less than ! The product of with itself is defined to be the natural number where we interpret the multiplication and division in that formula with normal rational number arithmetic, i.e. it is . Assuming ring axioms (associativity, commutativity, distributivity) this defines multiplication for all elements of by writing a natural number in its binary expansion, and also taking binary expansions of the powers appearing in the powers of 2.
As a particular example: the field has elements, the integers through . Multiplication is determined using binary expansions and the rules and !
If you've taken a more textbook route to learning about finite fields (as I had!) then this construction seems bonkers! It is not even obvious that these operations produce a field. How can this possibly work? The more targeted question, since every finite field can be realized through the quotient construction, how does this construction relate to the more usual one with quotients of polynomial rings?
Artin-Schreier polynomials
In a field of characteristic , the Artin-Schreier polynomial associated to an element is . These polynomials turn out to be easy to understand compared to general polynomials and thus useful for many constructions.
Lemma: if is a root of the Artin-Schreier polynomial , then are all roots of that polynomial.
Proof: the key here is to think about the function as a linear map, since we're in a field of characteristic . Note that this map is linear because taking -th powers (Frobenius!) is linear in characteristic . By Fermat's Little Theorem we know that for any in (i.e. ). Therefore for any in we have that
In particular, this concrete understanding of the roots of Artin-Schreier polynomials allows us to prove their irreducibility in many cases.
Lemma: an Artin-Schreier polynomial is irreducible over if and only if it has no roots in .
Proof: one direction is immediate, if has any root in then the above lemma gives us a complete factorization into linear terms.
Let's suppose that factors as over , where each of and has degree . If has degree , the coefficient of in is (up to a minus sign) a sum of the roots of , which are a subset of the roots of . By the previous lemma we know those roots are of the form for in , so their sum is of the form for some in . Since has coefficients in by assumption and we deduce that itself is in .
Polynomial realization of Conway's construction
Conway's construction has immediate inclusions for all . Since increasing by one squares the size of the field, it is reasonable to want to directly construct as a quadratic extension of , i.e. construct it by adjoining a root of an irreducible quadratic polynomial to .
Let's do this with Artin-Schreier polynomials. We're looking for an irreducible quadratic polynomial where is in the field (note that and are the same since we're in characteristic ). We know from above that this polynomial is irreducible when is not of the form , i.e. when this polynomial has no roots in .
The map from to itself is linear over , and the kernel of this linear map is the -dimensional subspace . By rank-nullity that means the image has size , so in particular there are lots of elements which aren't in the image of this map and so could be used to make irreducible quadratic Artin-Schreier extensions.
Moreover, we can directly pin down which elements aren't in the image. The basic idea is to look at the highest degree terms in the binary expansion of . The square of will be of the form plus lower degree terms. We'll write as a product of , square each term individually to get a product of and expand. Note that in squaring an we get since squaring is linear in characteristic (Frobenius again!). So the highest degree term in the binary expansion of is the same as that of itself, and these cancel out in the sum . In particular if we take an element of having the highest bit set, i.e. an element , we are guaranteed that it is not in the image of the map on .
So we can build a quadratic extension of by adjoining a root of the quadratic polynomial for any in which is . Conway's construction uses the choice of the "smallest" (in the normal integer sense) value of . To create we're adding to an element which satisfies . Looking back at our definition of the multiplication operation this is exactly the element of : it satisfies .