Eric writes about math and life

Newton polygons

This post is on one of my favourite tools in number theory: Newton polygons. I think this topic is an excellent vehicle for introducing a lot of the ideas that are at the heart of algebraic number theory. To this day, I still find Newton polygons to be one of the most magical phenomena in mathematics: with just a simple picture, Newton polygons give you profound insights into what numbers are really doing. Without further ado, let's get started!

A polygonal Newton (not actually what this post is about) Figure 1: a polygonal Newton (not actually what this post is about).

Main question

The problem that Newton polygons help us understand is the following: if I give you a number x, can you tell me how divisible by 2 x is? Or more generally1, for any prime number p can you tell me how divisible by p x is?

For small examples we can do this by inspection: 40 is divisible by 2 exactly 3 times (40=23·5). For larger integer examples you can bust out the Euclidean algorithm. For integer values of x, the fundamental theorem of arithmetic is what guarantees for us that there is a well-defined answer, since x factors in a unique way as a product of powers of prime numbers.

The place that things start to get interesting is when we consider non-integer values of x. How divisible by 2 is 8? What about for a rational number like 316? A more complicated irrational number like 4+62? Complex numbers like 1+i? Real numbers like π? And moreover, what should this even mean when we start working with non-integer values of x since we don't have the fundamental theorem of arithmetic telling us that the answer is well-defined. For some of these examples you can provide a convincing answer.

Claim: I claim that 8 is be divisible by 2 exactly 32 times.

Proof 1: taking square roots is dividing exponents by 2, so we divide the exponent of 2 in 8=23 by 2 to get 32.

Proof 2: from a different angle, 8·8=8=23 is divisible by 2 a total of 3 times. Since each 8 should contribute the same amount to the divisibility by 2 of 8, we get 32.

You can employ similar thinking to convince yourself of the correct values for the next 3 examples I mentioned (give it a try!). But what about π? The same techniques don't work because the strategies for the other numbers revolve around reducing the problem to one about integers, which isn't something you can do with π.

This example shows pretty convincingly that we should allow fractional answers when we start working with non-integer values of x. To help guide our thinking it is useful at this point to come up with a more precise statement of what it should mean that a non-integer x is divisible by 2 by some amount. The language that is useful for this is valuations.

Valuations

Definition: the 2-adic valuation function val2 is the function from integers to integers which takes in an integer x and outputs the number of times k that x is divisible by 2. In other words, x=2ky for some integer y which isn't divisible by 2.

This function will be useful to us in the following ways:

Here are the important properties of val2, which I'll label as V1, V2, V3.2

With this language of valuations set up, we can provide a precise answer for what "how divisible is x by 2" mean for non-integer values of x: the answer should be a value ν2(x) for some function ν2 which

Returning to our examples from before, we can see that the relationships our non-integers have with integers let us establish conclusively what these values ν2(x) would have to be just based on the properties V1, V2(-strong), and V3, without ever actually needing to construct the function ν2.

(1+i)(1i)=2 and (1+i)+(1i)=2.

The 2-adic valuations of 1±i must add to 1 by V1, and by V2-strong rules out the case they are different since then the smaller of the two valuations would have to equal 1. So the 2-adic valuations of 1±i must be 12.

Theorem (hard non-constructive algebra theorem): there exists an extension ν2 of val2 to the full set of complex numbers which satisfies the properties V1, V2, V3.3

This theorem is both good news and bad news. Good news, in that it tells us that no matter how complicated our number looks, we can assign a sensible 2-adic valuation to it The bad news is actually sort of the same thing, and it is sort of hidden in the epithet I've given the theorem: sometimes there are too many choices! For any real number r that you pick, there's infinitely many different valuations with ν2(π)=r. So for π, there's no "true" 2-adic valuation; it could be anything, depending on which extension ν2 you pick.

But we saw before that for numbers like 4+62 and 1+i the properties V1, V2, V3 do determine the 2-adic valuation uniquely. The key difference here is that all of these numbers are algebraic, i.e. they are roots of polynomials with integer coefficients, whereas π is transcendental, it is not the root of any polynomial with integer coefficients.

This is where Newton polygons enter the picture: for algebraic numbers we've thus far been figuring out the valuations with an ad hoc process that relies on properties V1, V2, V3. Newton polygons give us a slick way to compute valuations of algebraic numbers just by drawing a simple picture based on the polynomial that they're a root of.

Newton polygons

Take a polynomial f(x) with rational coefficients. Let's write

f(x)=a0+a1x++anxn

and assume that a00 (f(x) isn't divisible by x) and an0 (f(x) has degree n).

For a prime number p, the p-adic Newton polygon is formed as follows:

Forming the 2-adic Newton polygon of 32+8x+8x2+2x3+4x4+x5 Figure 2: forming the 2-adic Newton polygon of 32+8x+8x2+2x3+4x4+x5.

The term "polygon" is a bit of a misnomer here, since we're really working work with just a set of line segments rather than a polygon in the traditional sense.

Why are we doing this? It is instructive to work through what is going on in the case of a general quadratic polynomial. Let's assume that f(x) is monic, and has two roots α and β. Thus we have that

f(x)=a0+a1x+x2=(xα)(xβ).

For some νp which extends valp let's say that νp(α)=λ and νp(β)=μ.

What can we say about the Newton polygon of f(x)? No matter what we'll have that

valp(a0)=νp(αβ)=λ+μ.

Since a1=α+β, in general we can only guarantee that

valp(a1)=νp(α+β)min(λ,μ)

using property V2 of νp. There's really only two cases to consider here: either λμ in which case the strong form of V2 applies and we know valp(a1) exactly, or λ=μ in which case we just have the inequality above.

In the λμ case we can read off the valuations λ and μ as the negatives of the slopes of the two-segment Newton polygon; in the λ=μ case the inequality implies that the point at x=1 lies on or above the line connecting the x=0 and x=2 points, giving a Newton polygon with a single segment of horizontal length 2 and slope the negative of λ=μ.

You could (and it is instructive!) to think through a similar analysis for cubic polynomials, to get a feel for what goes on in the general case. Here's the full theorem relating Newton polygons and valuations of roots of polynomials.

Theorem: the p-adic valuations of the roots of a rational polynomial f(x) are equal to the negatives of the slopes of the p-adic Newton polygon of f(x) (counted with multiplicity according to the horizontal length of the Newton polygon segments).

Proof: In principle we could prove this the same way that we handled the cubic case above, with a lot of careful bookkeeping.

Let's instead formulate the proof this way: there are two polygons we could draw given the polynomial f(x). One is the p-adic Newton polygon, based on the valuations of the coefficients of f(x). The second is the polygon with the property that we want: the slopes of this polygon will be the negatives of the p-adic valuations of the roots of f(x).

For simplicity let us assume that f(x) is monic (i.e. if it has degree n, the coefficient an of xn is equal to 1, we can also divide out by an to arrange this, which doesn't change the set of roots or the slopes of the Newton polygon). This will let us line up the valuation polygon vertically with the Newton polygon. Let's say that the roots of f(x) are α1,,αn, with If the roots of f(x) have valuations λi=νp(αi), we'll order the roots to ensure that

λ1λ2...λn

To form the valuation polygon, draw the lines segments connecting the points (i,j=i+1nλj). Notice that this makes the valuation polygon:

To prove that this valuation polygon is equal to the Newton polygon we'll prove two things: the points plotted to make the Newton polygon lie on or above the points plotted to make the valuation polygon, and these points agree wherever there's a change in slope of the valuation polygon. Since in both cases the slopes increase from left to right, this implies that they are equal.

To prove that the points making up the Newton polygon are above the corresponding points in the valuation polygon, we need to prove that valp(ak)λnk+1++λn for each k=0,,n. We know that ak is the sum over all possible products of k of the roots of f(x), i.e.

ak=1i1<<iknαi1αik

We can just compute the valuation of this using valuation axioms V1 and V2! We can guarantee that the term αnk+1αn has valuation smaller than or equal to all other terms by the ordering of the roots, so we get that

valp(ak)νp(αnk+1αn)=λnk+1++λn.

Moreover if this is an index where the slope of the valuation polygon changes, i.e. λnkλnk+1, then we are guaranteed that the term αnk+1αn has strictly smaller valuation than all other terms in the sum making up ak. Applying the strong form of V2 gives that

valp(ak)=λnk+1++λn

at these indices, i.e. the vertices of the Newton and valuation polygons agree here.

Wrap up

So there you have it! For any algebraic number we can figure out how divisible it is by a given prime p: find the polynomial f(x) it is a root of, draw the p-adic Newton polygon of f, and read off the (negatives of the) slopes of the polygon. No matter what valuation νp we've chosen to extend valp, the valuation of our root is (the negative of) one of those slopes, and the Newton polygon itself doesn't need to know anything about νp. This is magical. In the words of one of my Mathcamp students: Newton polygons are OP.

There's a subtlety still remaining: the Newton polygon tells us the set of valuations that the roots of a polynomial have, but it can't in general tell us which root has which valuation. In fact this is to be expected, since different extensions of valp will permute which of the possible valuations the roots get!

In the next post, I'll talk about this bigger picture of this world of valuations, and we'll give some really slick proofs of cool theorems using Newton polygons.

  1. I'm using p=2 to keep things concrete for much of this post, but everything works the same for a general prime number p.

  2. These properties break down when you try to use them for divisibility by n for non-primes n, which is why we restrict to primes when talking about valuations.

  3. Proving that valuations extend to the complex numbers is well beyond what I want to do in this post, but here's the brief idea if you have a bit of background. The p-adic valuation extends from in a unique to all algebraic extensions of the p-adic numbers p, and hence also to the algebraic closure p. By continuity, that valuation also extends to the complex p-adics p. Since the complex numbers are abstractly isomorphic to the complex p-adics p (they are algebraically closed fields of the same cardinality and characteristic), you can pick such an isomorphism and use it to "compute" valuations of complex numbers.