Posts

Versal torsors are awesome

I was just at Bar Ilan University for Louis Rowen’s great birthday conference. Included among my fellow speakers were Danny Krashen and Alexander Merkurjev.
They told me about something I should have seen long ago: versal torsors. Among other things, versal torsors give a way to produce universal bounds on symbol lengths for central division algebras with bounded index. This blew my mind. Let me show you how it works.

Look at what happened to my classifying space!

Let G be a linear algebraic group. Suppose I want to understand G-torsors. As a card-carrying algebraic geometer and stack aficionado, I should just “use”
the classifying stack \mathrm{B}G, right? WRONG! Here’s another thing you can do.

Choose a faithful representation G\hookrightarrow\mathrm{GL}_n (over whatever base we’re using) and form the quotient B=\mathrm{GL}_n/G. By the definition of quotients, there is an induced G-torsor T\to B (i.e., T=\mathrm{GL}_n!!). This is called a versal torsor. Why versal? For any field K over the base we get an exact sequence of pointed sets

G(K)\to\mathrm{GL}_n(K)\to B(K)\to\mathrm{H}^1(\mathrm{Spec}(K),G)\to\mathrm{H}^1(\mathrm{Spec}(K), \mathrm{GL}_n(K)).

Hilbert’s Theorem 90 tells us that the last set is a singleton. We conclude the following amazing fact: for any field K over the base and any G-torsor \Gamma over K, there is a K-point of B such that the fiber of T over that point is isomorphic to \Gamma.

I don’t know why this freaks me out so much; perhaps it has to do with my perception that there are a lot of G-torsors over arbitrary fields, whereas this tells us that they are all points of a single variety. (Corollary: the essential dimension of G is finite. When you put it that way, it doesn’t sound so outrageous.)

Now that they gave me this thing, what can I do with it?

Here’s an amazing application of versal torsors. Let G be the group \mathrm{PGL}_n over the ring A:=\mathbf{Z}\left[\frac{1}{n}\right][\mu_n]. Choosing a faithful linear representation (example: the adjoint) \mathrm{PGL}_n\to\mathrm{GL _N and forming the versal torsor \mathrm{GL}_N\to\mathrm{GL}_N/\mathrm{PGL}_n gives me a scheme X (the quotient) and a \mathrm{PGL}_n-torsor (let’s call it T\to X) with the amazing property that for any field K in which n is invertible and containing a primitive nth root of unity and any \mathrm{PGL}_n-torsor Y over K, there is a K-point of X such that the fiber of T over that point is isomorphic to Y. (I’m just restating the above.)

Here’s something we can do with this. Given a central division algebra D over such a field K with index n, the Merkurjev-Suslin theorem says that there is a sequence of symbol algebras (a_1,b_1),\ldots,(a_m,b_m) over K such that D is Brauer-equivalent to the tensor product E=\bigotimes_i (a_i,b_i) (that is, there are matrix algebras over D and E that are isomorphic). The question is: how many symbols do we need?

The surprising answer is: there is a universal bound on the number of symbols we need! Here’s how to prove this. Let \alpha be the Brauer class of the versal torsor over X. Merkurjev-Suslin gives us an expression for \alpha restricted to the generic point of X in terms of symbols. Standard spreading out arguments make this expression valid in an open neighborhood (i.e., the symbols extend to Azumaya algebras in a neighborhood). Take the complement of this open subset; this is some closed subset X'\subset X. Repeat this at a generic point of X'. Since any locally-closed stratification of X is finite (X being Noetherian), this process will terminate. Take the largest number of symbols you used and that’s your bound! Every algebra of degree n over any field factors through X, so it will factor through a piece of the stratification, and we can use the symbol decomposition on that stratum.

As Harry Caray used to say, “Holy Cow!”

Forsyth = Chisini + Moishezon

Our May AIM workhop on Algebraic Vision was an absolutely fascinating experience. I will eventually write more about things that I learned there, but what I will write about first is of the most interesting from a sociological perspective. It sheds some light on the culture gap between computer vision and pure mathematics.

Forsyth

In 1993, David Forsyth published a paper at the Fourth International Conference on Computer Vision entitled “Recognizing algebraic surfaces from their outlines”. The key result is the following. (I have copied the statement of the theorem verbatim.)

Theorem. The equation of its outline in a perspective image completely determines the projective geometry of an algebraic surface of degree 2 or greater, for a generic view of a generic algebraic surface.

What does the statement mean? Let me define various terms that come up.

  • Algebraic surface is easy: this means projective algebraic surface X\subset\mathbf{P}^3.
  • View means: linear projection \mathbf{P}^3\dashrightarrow\mathbf{P}^2.
  • Perspective image means: image in a view (as defined above).
  • Generic is undefined in the paper, but we know what it means: true on a Zariski-open locus.
  • Outline is the most interesting one. Here is how to define it: we assume our linear projection \mathbf{P}^3\dashrightarrow\mathbf{P}^2 has base locus at a point not on X. We thus get an induced finite morphism X\to\mathbf{P}^2. This morphism ramifies along a curve D\subset X, and the outline is the image of D in $latex\mathbf{P^2}.$
  • Projective geometry of X means “X up to automorphism of
    \mathbf{P}^3“.

We can restate the theorem like this.

Theorem (restated). Suppose X\subset\mathbf{P}^3 is a generic smooth projective surface of degree at least 2 and f:\mathbf{P}^3\dashrightarrow\mathbf{P}^2 is a generic linear projection that is regular along X. Let D\subset X be the ramification curve of the restriction f_X. Then f(D) uniquely determines X, up to automorphism of \mathbf{P}^3.

(This is close enough to a rigorous statement that it would pass muster to almost any algebraic geometer, even though some number theorists might not like the lax use of genericity. In fact, I don’t like it either this way. We should be careful and say that we are looking at a generic pair (X,f).)

Chisini

When stated this way, it looks awfully similar to the following, which I will state as a conjecture.

Conjecture. A generic finite morphism f:X\to\mathbf{P}^2 is uniquely determined by its branch divisor.

This conjecture is simpler — it removes linear projections from the picture — and harder — it removes linear projections from the picture! But now it has transformed into a standard problem in algebraic geometry, called Chisini‘s conjecture, first stated in 1944. In fact, Chisini also assumed for his conjecture that the degree of f is at least 5. And the word “generic” allows us to assume that all ramification of f is simple (i.e., is locally around source and target given analytically as x\mapsto x^2), that the ramification curve R in X is smooth, that its image B in \mathbf{P}^2 has only nodes and cusps, and that the induced map R\to B is birational.

Chisini himself stated a solution to some cases of this conjecture in [1], but his proof was incorrect. (This is briefly explained by Antonio Lanteri in his MathSciNet review of Catanese’s paper [2]. The error seems to be related to the degeneration types of the curves that appear.)

As an algebraic geometer, the immediately intuitive way to approach a problem like this is to study the fundamental group of the complement of the branch divisor in \mathbf{P}^2. It looks like one is trying to prove that if the monodromy action of cycles around the divisor are a given set of transpositions, then the monodromy representation is uniquely determined. This is pretty close to wanting to know a presentation of the fundamental group. For example, is it abelian with a specific set of generators?

Indeed, Zariski conjectured that if a plane curve has only nodes, then its complement has abelian fundamental group. This was proven by Fulton (in the sense of profinite fundamental groups, although this was improved by Deligne in the complex case) [3] in 1980. (Note: knowing that such curves generally degenerate to unions of lines plays an important role, echoing the issues with Chisini’s original incorrect proof.)

So we’re done, right? Unfortunately, no: the branch curve of a projection need not be nodal. In fact, a simple dimension count argument shows that it can’t be, in general.

 

Moishezon

As it turns out, a number of authors have studied Chisini’s conjecture over the years, including Catanese, Kulikov, Nemirovskii, and Moishezon. Moreover, Moishezon solved the precise case encountered by Forsyth in 1981. The hard part of Chisini’s problem is that it is stated very generally, with no assumptions on the structure of the morphism. Things are much easier for linear projections, although Kulikov still has to work hard for high dimensional projections, as opposed to projections from 3-space.

Moishezon’s proof is embedded in a larger work that studies problems related to fundamental groups of curve complements [4]. So my mathy gut instinct was “right” in a certain vague sense, but it is hard to implement.

What did Forsyth do?

Forsyth’s argument is much easier. It relies heavily on input from Debarre, who was at Iowa at the same time as Forsyth (the early 1990s), consisting of a few simple cohomology calculations and some clever observations about generators of ideals in various degrees (related to the degree of the surface). At some point, I intended to sketch it here, but that is stopping me from actually finishing this. Instead, I encourage you to read the paper, not only to see how the argument works, but also to get a taste of the comparison between the cultures of algebraic geometry and computer vision.

What happened?

The main point I want to make here is that Forsyth, a computer vision researcher, seems to have rediscovered the results of Chisini, etc., without being aware of the algebro-geometric literature. There’s one small wrinkle: Forsyth got help from Debarre, a well-known algebraic geometer. Chisini’s conjecture and the work of Catanese and Moishezon seem to have been obscure enough within the subject that Debarre didn’t immediately point Forsyth toward those results. Or perhaps Debarre realized that to answer Forsyth’s immediate question, much simpler techniques would suffice. Or maybe Forsyth went to Debarre with a specific bit of the question, and Debarre didn’t know about the bigger context. Enough speculating! Regardless of how this came to be, Forsyth and Debarre cooked up a very nice and simple proof in the case of linear projections.

There is a lot that computer vision researchers and algebraic geometers have to learn from one another. But that is the subject of another post.

References

[1] Chisini, “Sulla identita birazionale delle funzioni algebriche di due variabili
dotate di una medesima curva di diramazione.” Ist. Lombardo Sci. Lett. Cl. Sci. Mat. Nat. Rend. (3) 8(77), (1944). 339–356.

[2] Catanese, “On a problem of Chisini.” Duke Math. J. 53 (1986), no. 1, 33–42.

[3] Fulton, “On the fundamental group of the complement of a node curve.” Ann. of Math (2) 111 (1980), no. 2, 407–409.

[4] Moishezon, “Stable branch curves and braid monodromies.” Algebraic geometry (Chicago, Ill., 1980), pp. 107–192, Lecture Notes in Math., 862, Springer, Berlin-New York, 1981.