Newton polygons
This post is on one of my favourite tools in number theory: Newton polygons. I think this topic is an excellent vehicle for introducing a lot of the ideas that are at the heart of algebraic number theory. To this day, I still find Newton polygons to be one of the most magical phenomena in mathematics: with just a simple picture, Newton polygons give you profound insights into what numbers are really doing. Without further ado, let's get started!
Figure 1: a polygonal Newton (not actually what this post is about).
Main question
The problem that Newton polygons help us understand is the following: if I give you a number , can you tell me how divisible by is? Or more generally1, for any prime number can you tell me how divisible by is?
For small examples we can do this by inspection: is divisible by exactly times (). For larger integer examples you can bust out the Euclidean algorithm. For integer values of , the fundamental theorem of arithmetic is what guarantees for us that there is a well-defined answer, since factors in a unique way as a product of powers of prime numbers.
The place that things start to get interesting is when we consider non-integer values of . How divisible by is ? What about for a rational number like ? A more complicated irrational number like ? Complex numbers like ? Real numbers like ? And moreover, what should this even mean when we start working with non-integer values of since we don't have the fundamental theorem of arithmetic telling us that the answer is well-defined. For some of these examples you can provide a convincing answer.
Claim: I claim that is be divisible by exactly times.
Proof 1: taking square roots is dividing exponents by , so we divide the exponent of in by to get .
Proof 2: from a different angle, is divisible by a total of times. Since each should contribute the same amount to the divisibility by of , we get .
You can employ similar thinking to convince yourself of the correct values for the next 3 examples I mentioned (give it a try!). But what about ? The same techniques don't work because the strategies for the other numbers revolve around reducing the problem to one about integers, which isn't something you can do with .
This example shows pretty convincingly that we should allow fractional answers when we start working with non-integer values of . To help guide our thinking it is useful at this point to come up with a more precise statement of what it should mean that a non-integer is divisible by by some amount. The language that is useful for this is valuations.
Valuations
Definition: the -adic valuation function is the function from integers to integers which takes in an integer and outputs the number of times that is divisible by . In other words, for some integer which isn't divisible by .
This function will be useful to us in the following ways:
- The function has a bunch of properties that encode what it means for a number to be divisible by .
- We can re-frame our original question to the following: how can we compute ?
- To answer this question for non-integer values we want to extend the domain of to accept non-integer inputs, while preserving the properties that make meaningfully compute "divisibility by 2".
Here are the important properties of , which I'll label as V1, V2, V3.2
- V1: for integers and , .
- V2: for integers and , . Moreover, if , then (we'll call this second part V2-strong when we need to reference it specifically).
- V3: by convention , and is the only number with an infinite valuation.
With this language of valuations set up, we can provide a precise answer for what "how divisible is by " mean for non-integer values of : the answer should be a value for some function which
- agrees with the usual for integer inputs,
- has a domain containing the non-integer ,
- and satisfies properties V1, V2, V3.
Returning to our examples from before, we can see that the relationships our non-integers have with integers let us establish conclusively what these values would have to be just based on the properties V1, V2(-strong), and V3, without ever actually needing to construct the function .
- For , we can use the fact that agrees with on integers and property V1 to get and from here we divide by to get .
- Use that has valuation and has valuation , so .
- Use V2-strong: has a -adic valuation of , has a valuation of , and since they're distinct the overall valuation is the smaller of the two, .
- Figuring out the valuation of the complex number is a little bit more involved; V2 itself doesn't help right away since both and have valuation (since ). The trick insight is to involve the complex conjugate since that helps us move back to the world of integers. The key relations here are:
The -adic valuations of must add to by V1, and by V2-strong rules out the case they are different since then the smaller of the two valuations would have to equal . So the -adic valuations of must be .
Theorem (hard non-constructive algebra theorem): there exists an extension of to the full set of complex numbers which satisfies the properties V1, V2, V3.3
This theorem is both good news and bad news. Good news, in that it tells us that no matter how complicated our number looks, we can assign a sensible -adic valuation to it The bad news is actually sort of the same thing, and it is sort of hidden in the epithet I've given the theorem: sometimes there are too many choices! For any real number that you pick, there's infinitely many different valuations with . So for , there's no "true" -adic valuation; it could be anything, depending on which extension you pick.
But we saw before that for numbers like and the properties V1, V2, V3 do determine the -adic valuation uniquely. The key difference here is that all of these numbers are algebraic, i.e. they are roots of polynomials with integer coefficients, whereas is transcendental, it is not the root of any polynomial with integer coefficients.
This is where Newton polygons enter the picture: for algebraic numbers we've thus far been figuring out the valuations with an ad hoc process that relies on properties V1, V2, V3. Newton polygons give us a slick way to compute valuations of algebraic numbers just by drawing a simple picture based on the polynomial that they're a root of.
Newton polygons
Take a polynomial with rational coefficients. Let's write
and assume that ( isn't divisible by ) and ( has degree ).
For a prime number , the -adic Newton polygon is formed as follows:
- plot the points in the plane;
- form their "lower convex hull", i.e. imagine pulling a string taut against those points from below;
- the resulting set of line segments stretching from to are called the -adic Newton polygon of .
Figure 2: forming the -adic Newton polygon of
The term "polygon" is a bit of a misnomer here, since we're really working work with just a set of line segments rather than a polygon in the traditional sense.
Why are we doing this? It is instructive to work through what is going on in the case of a general quadratic polynomial. Let's assume that is monic, and has two roots and . Thus we have that
For some which extends let's say that and .
What can we say about the Newton polygon of ? No matter what we'll have that
Since , in general we can only guarantee that
using property V2 of . There's really only two cases to consider here: either in which case the strong form of V2 applies and we know exactly, or in which case we just have the inequality above.
In the case we can read off the valuations and as the negatives of the slopes of the two-segment Newton polygon; in the case the inequality implies that the point at lies on or above the line connecting the and points, giving a Newton polygon with a single segment of horizontal length 2 and slope the negative of .
You could (and it is instructive!) to think through a similar analysis for cubic polynomials, to get a feel for what goes on in the general case. Here's the full theorem relating Newton polygons and valuations of roots of polynomials.
Theorem: the -adic valuations of the roots of a rational polynomial are equal to the negatives of the slopes of the -adic Newton polygon of (counted with multiplicity according to the horizontal length of the Newton polygon segments).
Proof: In principle we could prove this the same way that we handled the cubic case above, with a lot of careful bookkeeping.
Let's instead formulate the proof this way: there are two polygons we could draw given the polynomial . One is the -adic Newton polygon, based on the valuations of the coefficients of . The second is the polygon with the property that we want: the slopes of this polygon will be the negatives of the -adic valuations of the roots of .
For simplicity let us assume that is monic (i.e. if it has degree , the coefficient of is equal to , we can also divide out by to arrange this, which doesn't change the set of roots or the slopes of the Newton polygon). This will let us line up the valuation polygon vertically with the Newton polygon. Let's say that the roots of are , with If the roots of have valuations , we'll order the roots to ensure that
To form the valuation polygon, draw the lines segments connecting the points . Notice that this makes the valuation polygon:
- start at and end at ,
- have slope between the values and ,
- and by our ordering of the the slopes increase from left to right.
To prove that this valuation polygon is equal to the Newton polygon we'll prove two things: the points plotted to make the Newton polygon lie on or above the points plotted to make the valuation polygon, and these points agree wherever there's a change in slope of the valuation polygon. Since in both cases the slopes increase from left to right, this implies that they are equal.
To prove that the points making up the Newton polygon are above the corresponding points in the valuation polygon, we need to prove that for each . We know that is the sum over all possible products of of the roots of , i.e.
We can just compute the valuation of this using valuation axioms V1 and V2! We can guarantee that the term has valuation smaller than or equal to all other terms by the ordering of the roots, so we get that
Moreover if this is an index where the slope of the valuation polygon changes, i.e. , then we are guaranteed that the term has strictly smaller valuation than all other terms in the sum making up . Applying the strong form of V2 gives that
at these indices, i.e. the vertices of the Newton and valuation polygons agree here.
Wrap up
So there you have it! For any algebraic number we can figure out how divisible it is by a given prime : find the polynomial it is a root of, draw the -adic Newton polygon of , and read off the (negatives of the) slopes of the polygon. No matter what valuation we've chosen to extend , the valuation of our root is (the negative of) one of those slopes, and the Newton polygon itself doesn't need to know anything about . This is magical. In the words of one of my Mathcamp students: Newton polygons are OP.
There's a subtlety still remaining: the Newton polygon tells us the set of valuations that the roots of a polynomial have, but it can't in general tell us which root has which valuation. In fact this is to be expected, since different extensions of will permute which of the possible valuations the roots get!
In the next post, I'll talk about this bigger picture of this world of valuations, and we'll give some really slick proofs of cool theorems using Newton polygons.
I'm using to keep things concrete for much of this post, but everything works the same for a general prime number .↩
These properties break down when you try to use them for divisibility by for non-primes , which is why we restrict to primes when talking about valuations.↩
Proving that valuations extend to the complex numbers is well beyond what I want to do in this post, but here's the brief idea if you have a bit of background. The -adic valuation extends from in a unique to all algebraic extensions of the -adic numbers , and hence also to the algebraic closure . By continuity, that valuation also extends to the complex -adics . Since the complex numbers are abstractly isomorphic to the complex -adics (they are algebraically closed fields of the same cardinality and characteristic), you can pick such an isomorphism and use it to "compute" valuations of complex numbers.↩