12.07.2015 Views

LANG-TROTTER REVISITED Dedicated to the memory ... - CiteSeerX

LANG-TROTTER REVISITED Dedicated to the memory ... - CiteSeerX

LANG-TROTTER REVISITED Dedicated to the memory ... - CiteSeerX

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 3has no integer solutions because it has no R solutions; <strong>the</strong> equationx 2 + y 2 = 691,and more generally any equation of <strong>the</strong> formx 2 + y 2 = 4n + 3,has no integer solutions because it has no solutions mod 4; and <strong>the</strong>equationy 2 + x 4 + 2 = 0has no integer solutions both because it has no R solutions and becauseit has no mod 5 solutions.Even in <strong>the</strong> possible presence of an archimedean obstruction, it canstill be interesting <strong>to</strong> ask, given f, modulo which primes p <strong>the</strong> equationf = 0 has an F p solution. For example, <strong>the</strong> study of <strong>the</strong> equation inone variablex 2 + 1 = 0,mod odd primes p, amounts <strong>to</strong> <strong>the</strong> determination of <strong>the</strong> “quadraticcharacter of −1 mod p”, and led Euler <strong>to</strong> <strong>the</strong> <strong>the</strong>orem, already stateda century earlier by Fermat, that all primes of <strong>the</strong> form 4n + 1, butnone of <strong>the</strong> form 4n − 1, are sums of two squares. In this example, <strong>the</strong>number N p of mod p solutions is ei<strong>the</strong>r 0 or 2; if we writeN p = 1 + a p<strong>the</strong>n a p = ±1, and <strong>the</strong> result is that a p = 1 if p is of <strong>the</strong> form 4n + 1,and a p = −1 if p is of <strong>the</strong> form 4n − 1.Still with this x 2 + 1 = 0 example, we might ask whe<strong>the</strong>r a p = 1(resp. a p = −1) holds for infinitely many primes. That it does, forboth choices of sign, amounts <strong>to</strong> <strong>the</strong> special case of Dirichlet’s <strong>the</strong>orem,that <strong>the</strong>re are infinitely many primes in each of <strong>the</strong> two arithmeticprogressions 4n ± 1.Now let us consider an equation in two variables. For simplicity, wetake it <strong>to</strong> be of <strong>the</strong> formy 2 = h(x)with h(x) ∈ Z[x] monic of some odd degree 2g+1, such that h has 2g+1distinct zeroes in C. The C solutions, <strong>to</strong>ge<strong>the</strong>r with a single “point at∞”, form a compact Riemann surface of genus g. The discriminant∆ ∈ Z of <strong>the</strong> polynomial h(x) is nonzero. For any “good” prime,i.e., any odd prime p which does not divide ∆, <strong>the</strong> F p -solutions of thisequation, <strong>to</strong>ge<strong>the</strong>r with a single “point at ∞”, form <strong>the</strong> F p points C(F p )


4 NICHOLAS M. KATZof a (projective, smooth, geometrically connected) curve C/F p of genusg over F p . In this case, for each good prime p we have#C(F p ) = 1 + #{F p solutions of y 2 = h(x)}and we define <strong>the</strong> integers a p by#C(F p ) = p + 1 − a p .In <strong>the</strong> x 2 +1 example with its a p , we knew a priori that a p was ei<strong>the</strong>r±1, and <strong>the</strong> two questions were a) how a p depended on p and b) were<strong>the</strong>re infinitely many p with a given choice of a p .In <strong>the</strong> curve case, we almost never know a “simple” rule for how a pdepends on p (short of literally computing it for each given p, more orless cleverly). We do have an archimedean bound, <strong>the</strong> celebrated Weilbound|a p | ≤ 2g √ p.And since a curve cannot have a negative number of points, we have<strong>the</strong> archimedean inequalitya p ≤ p + 1,which for large genus g and small prime p, say 2g > √ p, does not followfrom <strong>the</strong> Weil bound.What else do we know about <strong>the</strong> numbers a p for a given curve?Remarkably little (outside <strong>the</strong> trivial case of genus g = 0, where alla p vanish), but <strong>the</strong>re are a plethora of open problems and conjecturesabout <strong>the</strong>m, some of which have strikingly elementary formulations, orat least consequences which have strikingly elementary formulations.Here is one example of an easy-<strong>to</strong>-state open problem. Suppose weare given <strong>the</strong> numbers a p /p 1/2 for all good p, but are not <strong>to</strong>ld whatcurve <strong>the</strong>y came from, or even its genus. By <strong>the</strong> Weil bound, we havea p /p 1/2 ∈ [−2g, 2g].Is it true that we can recover 2g as <strong>the</strong> limsup of <strong>the</strong> numbers |a p |/p 1/2 ?Or weaker, is it true that <strong>the</strong> inequality|a p |/p 1/2 > 2g − 2holds for infinitely many p? Weaker yet, does it hold for at least onegood p? If this were <strong>the</strong> case, <strong>the</strong>n 2g would be <strong>the</strong> smallest eveninteger such that |a p |/p 1/2 ≤ 2g for all good p.The truth of <strong>the</strong> strong form, that 2g is <strong>the</strong> limsup of <strong>the</strong> numbers|a p |/p 1/2 , is implied by a general Sa<strong>to</strong>-Tate conjecture about <strong>the</strong> realnumbers a p /p 1/2 attached <strong>to</strong> a curve C of genus g ≥ 1. To formulateit, denote by USp(2g) ⊂ Sp(2g, C) a maximal compact subgroupof <strong>the</strong> complex symplectic group. [So USp(2) is just SU(2).] The


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 5conjecture 5 is that for a given curve C <strong>the</strong>re is a compact subgroupK ⊂ USp(2g) with <strong>the</strong> property that, roughly speaking, <strong>the</strong> numbersa p /p 1/2 are distributed like <strong>the</strong> traces of random elements of K. Moreprecisely, denote by dk <strong>the</strong> Haar measure on K of <strong>to</strong>tal mass one, anddenote byTrace : K → [−2g, 2g]<strong>the</strong> trace map, for <strong>the</strong> tau<strong>to</strong>logical 2g-dimensional representation of K.Any continuous functionF : [−2g, 2g] → Cgives rise <strong>to</strong> a continuous function on K by k ↦→ F (Trace(k)), so wecan form <strong>the</strong> integral∫F (Trace(k))dk.KThe conjecture is that for any such F , we can compute this integral byaveraging F over more and more of <strong>the</strong> a p /p 1/2 ; i.e., we have <strong>the</strong> limitformula∑good p≤Tlim F (a p/p 1/2 ∫)T →∞ = F (Trace(k))dk.#{ good p ≤ T}If Sa<strong>to</strong>-Tate holds for C, <strong>the</strong>n we will recover 2g as <strong>the</strong> limsup of <strong>the</strong>numbers |a p |/p 1/2 . Given a real ɛ > 0, take for F a continuous R-valuedfunction on [−2g, 2g] which is nonnegative, supported in [2g−ɛ, 2g] andidentically 1 on [2g − ɛ/2, 2g]. [For instance, take F piecewise linear.]Because <strong>the</strong> setU ɛ/2 := {k ∈ K|Trace(k) > 2g − ɛ/2}is an open neighborhood of <strong>the</strong> identity element, it has strictly positiveHaar measure, and <strong>the</strong>refore <strong>the</strong> integral ∫ K F (Trace(k))dk ≥∫U ɛ/2F (Trace(k))dk = ∫ U ɛ/2dk > 0. So if Sa<strong>to</strong>-Tate holds, <strong>the</strong>re mustbe infinitely many p for which |a p |/p 1/2 ≥ 2g − ɛ.The Sa<strong>to</strong>-Tate conjecture is now known for all elliptic curves over Qwhose j-invariant is not an integer, where <strong>the</strong> group K is SU(2) itself[H-SB-T, Thm. A], and is expected <strong>to</strong> hold, still with K = SU(2), solong as <strong>the</strong> curve does not have complex multiplication. It has beenknow for elliptic curves over Q with complex multiplication for over5 Strictly speaking, what we are formulating is “merely” <strong>the</strong> consequence fortraces of <strong>the</strong> actual Sa<strong>to</strong>-Tate conjecture, which asserts <strong>the</strong> equidistribution ofunitarized Frobenius conjugacy classes in <strong>the</strong> space K # of conjugacy classes ofK, with respect <strong>to</strong> Haar measure, cf. [Se-Mot, 13.5]. Only in genus 1 are <strong>the</strong>yequivalent.K


6 NICHOLAS M. KATZfifty years, thanks <strong>to</strong> work of Deuring [Deu-CM] and Hecke [He]. In<strong>the</strong> CM case, <strong>the</strong> K is <strong>the</strong> normalizer in SU(2) of its maximal <strong>to</strong>rus.In higher genus, Sa<strong>to</strong>-Tate is hardly ever known 6 . For certain hyperellipticcurves y 2 = h(x) as above, we can be more precise in itsformulation. Denote by G <strong>the</strong> galois group of (<strong>the</strong> splitting field L/Qof) <strong>the</strong> polynomial h(x). If g ≥ 2 and if G is ei<strong>the</strong>r <strong>the</strong> full symmetricgroup S 2g+1 or <strong>the</strong> alternating group A 2g+1 , <strong>the</strong>n Sa<strong>to</strong>-Tate should 7hold, with K = USp(2g).Now let us turn <strong>to</strong> considering, for a given curve C, <strong>the</strong> integers a p<strong>the</strong>mselves. Here we ask two questions. First, for which integers A willwe have A = a p for infinitely many p? Second, for an A which doesoccur as a p for infinitely many p, give an asymp<strong>to</strong>tic formula for <strong>the</strong>number of p up <strong>to</strong> X for which A = a p .Of course <strong>the</strong>se same questions make sense for o<strong>the</strong>r naturally occurringsequences of integers a p . For example, if we take, instead of acurve, a projective smooth hypersurface H ⊂ P n+1 of degree d, <strong>the</strong>nfor good primes p we define integers a p byn∑#H(F p ) = p i + a p .i=0Here <strong>the</strong> Weil bound is replaced by Deligne’s bound|a p | ≤ prim(n, d)p n/2 ,with prim(n,d) <strong>the</strong> constant ((d − 1)/d)((d − 1) n+1 − (−1) n+1 ).Or we might wish <strong>to</strong> consider <strong>the</strong> sequence a p = τ(p), where Ramanujan’sτ(n) are <strong>the</strong> coefficients inq ∏ (1 − q n ) 24 = ∑n≥1n≥1τ(n)q n .6 However, it is (trivially) known for a genus 2 curve whose Jacobian is isogenous<strong>to</strong> E × E, for an elliptic curve E for which Sa<strong>to</strong>-Tate is known. For example,take h(x) = x 3 + λ(x 2 + x) + 1 <strong>to</strong> be a palindromic cubic with all distinct roots,i.e., λ ≠ −1, 3. Then C:=(<strong>the</strong> complete nonsingular model of) y 2 = h(x 2 ) hasits Jacobian isogenous <strong>to</strong> E × E for E <strong>the</strong> elliptic curve of equation y 2 = h(x),by <strong>the</strong> two maps C → E given by (x, y) ↦→ (x 2 , y) and (x, y) ↦→ (1/x 2 , y/x 3 ). Inparticular, for each good p, <strong>the</strong> a p ’s of <strong>the</strong>se curves are related by a p,C = 2a p,E .This last identity has an elementary proof.7 There is a conjectural description of K in terms of <strong>the</strong> l-adic representationsattached <strong>to</strong> C, and having K = USp(2g) is conjecturally equivalent <strong>to</strong> <strong>the</strong>property that for every l, <strong>the</strong> l-adic representation has a Zariski-dense image inGSp(2g, Q l ).That this property holds for <strong>the</strong> curves y 2 = h(x) whose G is ei<strong>the</strong>rS 2g+1 or A 2g+1 is a striking result of Zarhin [Z]


Here we have Deligne’s bound<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 7|a p | ≤ 2p 11/2 .The Lang-Trotter approach <strong>to</strong> <strong>the</strong>se question is based in part on asimple probabilistic model. For each (good) prime p, we have an integera p in a finite setX p ⊂ Z.In <strong>the</strong> curve case, X p = Z ∩ [−2g √ p, 2g √ p]. In <strong>the</strong> hypersurface case,X p = Z ∩ [−prim(n, d)p n/2 , prim(n, d)p n/2 ]. In <strong>the</strong> Ramanujan τ case,X p = Z ∩ [−2p 11/2 , 2p 11/2 ].The sets X p are increasing, in <strong>the</strong> sense that X p1 ⊂ X p2 ⊂ Z ifp 1 ≤ p 2 , and <strong>the</strong>ir union, in this simple model, is all of Z. Our collectionof a p is an element in <strong>the</strong> product spaceX := ∏X p .good pWe endow each X p with counting measure, normalized <strong>to</strong> have <strong>to</strong>talmass one; i.e., each point x p in X p has mass 1/#X p .We <strong>the</strong>n endow X with <strong>the</strong> product measure. The basic idea is that,in <strong>the</strong> absence of any special information, <strong>the</strong> particular element (a p ) pof X should behave like a “random” element of X, in <strong>the</strong> sense that any“reasonable” property of elements of X which holds on a set of measureone should hold for <strong>the</strong> particular element (a p ) p . For example, fix aninteger A, and consider <strong>the</strong> set of points x = (x p ) p ∈ X which have<strong>the</strong> property that A = x p for infinitely many p. If this set has measureone, <strong>the</strong>n we will “expect” that A = a p for infinitely many p. And iffor some explicit function g : R >0 → R >0 , <strong>the</strong> set of x = (x p ) p ∈ X forwhich <strong>the</strong> asymp<strong>to</strong>tic formula#{p ≤ T |A = x p } ∼ g(T ) as T → ∞holds is a set of measure one, <strong>the</strong>n we “expect” that we have <strong>the</strong> asymp<strong>to</strong>ticformula#{p ≤ T |A = a p } ∼ g(T ) as T → ∞.Let us recall <strong>the</strong> basic results which address <strong>the</strong>se questions.Lemma 1.1. Fix A ∈ Z. The following properties are equivalent.(1) The set of points x = (x p ) p ∈ X which have <strong>the</strong> property thatA = x p for infinitely many p has measure one.(2) The series ∑ p 1/#X p diverges.


8 NICHOLAS M. KATZProof. Given A, consider <strong>the</strong> set Z A ⊂ X of those x = (x p ) p ∈ Xfor which A = x p holds for only finitely many p. So (1) for A is<strong>the</strong> statement that this set Z A has measure zero. This set Z A is <strong>the</strong>increasing union of <strong>the</strong> setsZ n,A := {x ∈ X|x p ≠ A ∀p ≥ p n }.So Z A has measure zero if and only if each Z n,A has measure zero. But<strong>the</strong> measure of Z n,A is <strong>the</strong> product ∏ p≥p n(1 − 1/#X p ), which is zero ifand only (2) holds.□As a special case of <strong>the</strong> strong law of large numbers, we get a quantitativeversion of <strong>the</strong> previous result.Lemma 1.2. Suppose <strong>the</strong> series ∑ p 1/#X p diverges. Fix an integerA, and an increasing sequence b p of positive real numbers with b p → ∞such that <strong>the</strong> series ∑ p 1/#X p(b p ) 2 converges. Then for x ∈ X in aset of measure one, we have#{p ≤ p n |x p = A} = ∑ p≤p n1/#X p + o(b pn ).Proof. This is <strong>the</strong> strong law of large numbers [I<strong>to</strong>, Thm. 4.5.1], applied<strong>to</strong> <strong>the</strong> independent sequence of L 2 functions {f p } p on X given byf p (x) := δ xp,A. The mean E(f p ) of f p is 1/#X p , and its variance V (f p )is bounded above by 1/#X p + 1/(#X p ) 2 ≤ 2/#X p . So by hypo<strong>the</strong>sis<strong>the</strong> series ∑ p V (f p)/b 2 p converges. Then <strong>the</strong> strong law of largenumbers tells us that on a set of measure one, we havelim n→∞ (1/b pn ) ∑ p≤p n(f p − E(f p )) = 0.Making explicit <strong>the</strong> f p , we recover <strong>the</strong> assertion of <strong>the</strong> lemma.□Let us see what this gives in <strong>the</strong> cases we have looked at above.In <strong>the</strong> case of a curve C, we have #X p ∼ 4g √ p. The series ∑ p 1/√ pdiverges, and one knows that∑1/ √ p ∼ √ T / log T.p≤THere we can take b p = p (1+ɛ)/4 for any fixed real ɛ > 0. So we get#{p ≤ T |x p = A} = ∑ p≤T1/#X p + o(T (1+ɛ)/4 ) ∼ √ T /4g log Ton a set of measure one.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 9In <strong>the</strong> case of a smooth hypersurface of dimension n, we have #X p ∼2prim(n, d)p n/2 . So for n ≥ 3, <strong>the</strong> series ∑ p 1/#X p converges. Similarlyfor <strong>the</strong> Ramanujan τ, we have #X p ∼ 2p 11/2 , and again <strong>the</strong> series∑p 1/#X p converges. So in both <strong>the</strong>se cases we don’t expect any A <strong>to</strong>occur as a p infinitely often.The remaining example case is that of a smooth surface in P 3 . Here#X p ∼ 2prim(2, d)p, so <strong>the</strong> series ∑ p 1/#X p diverges, but very slowly:one knows that ∑1/p ∼ log log T .p≤TSo while <strong>the</strong> probabilistic heuristic suggests that a given A might occurinfinitely often as an a p , it also suggests that no computer experimentcould ever convince us of this.Let us now return <strong>to</strong> <strong>the</strong> case of a (projective, smooth, geometricallyconnected) curve C/Q, and introduce <strong>the</strong> second heuristic on which<strong>the</strong> Lang-Trotter approach is based. This is <strong>the</strong> notion of a congruenceobstruction. If a given integer A occurs as a p for infinitely many p,<strong>the</strong>n whatever <strong>the</strong> modulus N ≥ 2, <strong>the</strong> congruence A ≡ a p mod N willhold for infinitely many p.Here is <strong>the</strong> simplest example of a congruence obstruction. Take ahyperelliptic curve C of equation y 2 = h(x) with h(x) ∈ Z[x] monicof degree 2g + 1 ≥ 3, with 2g + 1 distinct roots in C. Suppose inaddition that all <strong>the</strong>se 2g + 1 roots lie in Z. Then for any good (sonecessarily odd) p, a p will be even. [Here is <strong>the</strong> elementary proof, basedon <strong>the</strong> character sum formula for a p . Denote by χ quad,p <strong>the</strong> quadraticcharacter χ quad,p : F × p → ±1, (so χ quad,p takes <strong>the</strong> value 1 precisely onsquares) and extend it <strong>to</strong> all of F p by setting χ quad,p (0) := 0. Then forany b ∈ F p , 1 + χ quad,p (b) is <strong>the</strong> number of square roots of b in F p . So<strong>the</strong> number of F p points on C is1(<strong>the</strong> point at ∞) + ∑ x∈F p(1 + χ quad,p (h(x))) = p + 1 + ∑ x∈F pχ quad,p (h(x)).So we have <strong>the</strong> formulaa p = − ∑ x∈F pχ quad,p (h(x)).In this formula, <strong>the</strong> reductions mod p of <strong>the</strong> 2g + 1 roots of h are <strong>the</strong>2g + 1 distinct (because p is a good prime) elements of F p at which hmod p vanishes; at all o<strong>the</strong>r points of F p , h is nonzero. So a p is <strong>the</strong>sum of an even number p − (2g + 1) of nonzero terms, each ±1, so iseven.] So in this example, no odd integer A can ever be an a p for agood prime p.


10 NICHOLAS M. KATZIn <strong>the</strong> special case of an elliptic curve E/Q, say with good reductionoutside of some ∆, <strong>the</strong>re is ano<strong>the</strong>r visible source of congruence obstructions,namely <strong>to</strong>rsion points, based on <strong>the</strong> fact that <strong>the</strong> set E(Q)has <strong>the</strong> structure of an abelian group. Suppose that <strong>the</strong> group E(Q)contains a point P of finite order N ≥ 2. For every odd prime p notdividing ∆, it makes sense <strong>to</strong> reduce this point mod p, and we obtaina point of <strong>the</strong> same order N in E(F p ). Therefore N divides #E(F p ),so we have <strong>the</strong> congruencea p ≡ p + 1 mod N.From this congruence, we see that among odd primes p not dividing∆, A = 1 can never occur as a p unless N|p, i.e., unless N is itself anodd prime, in which case we might have a p = 1 for p = N, but for noo<strong>the</strong>r, cf. [Maz, pp. 186-188].Let us explain briefly <strong>the</strong> general mechanism by which congruenceobstructions arise. Taking for ∆ <strong>the</strong> product of <strong>the</strong> primes which arebad for our curve C, we get a proper smooth curve C/Z[1/∆]. Foreach integer N ≥ 2, we have <strong>the</strong> “mod N representation” attached <strong>to</strong>C/Q, or more precisely <strong>to</strong> its Jacobian Jac(C)/Q. This is <strong>the</strong> actionof Gal(Q/Q) on <strong>the</strong> group Jac(C)(Q)[N] of points of order dividingN. This group is noncanonically (Z/NZ) 2g , and it is endowed with aGalois-equivariant alternating au<strong>to</strong>duality <strong>to</strong>ward <strong>the</strong> group µ N (Q) ofN’th roots of unity. Because C is a proper smooth curve C/Z[1/∆], <strong>the</strong>mod N representation is unramified outside of N∆, so we may view itas a homomorphismρ N : π 1 (Spec(Z[1/N∆])) → GSp(2g, Z/NZ)<strong>to</strong>ward <strong>the</strong> group GSp(2g, Z/NZ) of mod N symplectic similitudes.The key compatibility is that for any prime p not dividing N∆, <strong>the</strong>arithmetic Frobenius conjugacy classhasF rob p ∈ π 1 (Spec(Z[1/N∆]))Trace(ρ N (F rob p )) ≡ a p mod N, det(ρ N (F rob p )) ≡ p mod N.Now consider <strong>the</strong> image group Im(ρ N ) ⊂ GSp(2g, Z/NZ). If this groupcontains at least one element whose trace is A mod N, <strong>the</strong>n by Chebotarev<strong>the</strong> set of primes p not dividing N∆ for which a p ≡ A mod Nhas a strictly positive Dirichlet density, so in particular is infinite. On<strong>the</strong> o<strong>the</strong>r hand, if <strong>the</strong> image group Im(ρ N ) ⊂ GSp(2g, Z/NZ) containsno element whose trace is A mod N, <strong>the</strong>n a p ≡ A mod N can hold atmost for one of <strong>the</strong> finitely many primes p dividing N. It is precisely in


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 11this second case that A has a congruence obstruction at N(<strong>to</strong> havinga p = A for infinitely many primes p).Lang-Trotter conjecture 8 that, for curves, it is only congruence obstructionswhich prevent an A from being a p infinitely often:Conjecture 1.3. (Weak Lang-Trotter)Let C/Q be a projective, smooth,geometrically connected curve, with good reduction outside of ∆. Givenan integer A, suppose that for every modulus N ≥ 2, A has no congruenceobstruction at N, i.e., <strong>the</strong> congruence A ≡ a p mod N holds forinfinitely many p. Then we have A = a p for infinitely many p.In <strong>the</strong> case of a non-CM elliptic curve E, Lang-Trotter also formulate,for any A which has no congruence obstructions, a precise conjecturalasymp<strong>to</strong>tic for how often A is an a p . Given such an A, <strong>the</strong>y define anonzero real constant c A,E and make <strong>the</strong> following precise conjecture.Conjecture 1.4. (Strong Lang-Trotter for elliptic curves) LetE/Q be a non-CM elliptic curve. Then as T → ∞,#{p ≤ T |a p = A} ∼ c A,E (2/π) √ T / log T.Here is <strong>the</strong>ir recipe for <strong>the</strong> constant c A,E . For each integer N ≥ 2,consider <strong>the</strong> finite groupG N := Im(ρ N ) ⊂ GL(2, Z/NZ).For each a ∈ Z/NZ, we have <strong>the</strong> subset G N,a ⊂ G N defined asG N,a := {elements γ ∈ G N with Trace(γ) = a},whose cardinality we denoteWe defineg N,avg := (1/N)g N,a := #G N,a .a∑mod Ng N,a = (1/N)#G N<strong>to</strong> be <strong>the</strong> average, over a, of g N,a . For an A with no congruence obstruction,Lang-Trotter show that as N grows multiplicatively, <strong>the</strong> ratiog N,A /g N,avg ,(which Lang-Trotter write as Ng N,A /#G N ) tends <strong>to</strong> a nonzero (archimedean)limit, which <strong>the</strong>y define <strong>to</strong> be c A,E . [If we apply this recipe <strong>to</strong> an Awhich has a congruence obstruction, <strong>the</strong>n for all sufficiently divisibleN, we have g N,A = 0, so <strong>the</strong> limit exists, but it is 0.]8 Lang-Trotter make this conjecture explicitly only for elliptic curves


12 NICHOLAS M. KATZIn this vein, we have <strong>the</strong> following “intermediate” conjecture, forany 9 curve C of any genus g ≥ 1 which is “strongly non-CM” in <strong>the</strong>sense that for every l, <strong>the</strong> l-adic representation has Zariski dense imagein GSp(2g, Q l ).Conjecture 1.5. (Intermediate Lang-Trotter) Let C/Q be a projective,smooth, geometrically connected curve, with good reduction outsideof ∆, such that that for every l, <strong>the</strong> l-representation has Zariskidense image in GSp(2g, Q l ).Suppose <strong>the</strong> integer A has no congruenceobstruction mod any N. Then for every real ɛ > 0, <strong>the</strong>re exists aconstant c(C, A, ɛ) such that for T ≥ c(C, A, ɛ), we have√ 1−ɛT ≤ #{p ≤ T |ap = A} ≤ √ T 1+ɛ .There are no cases whatever of a pair (C, A) for which this conjectureis known. In <strong>the</strong> case of elliptic curves, <strong>the</strong>re are some results on upperbounds with ɛ = 1/2, some under GRH [Se-Cheb, 8.2, Thm. 20], andsome on average, cf. [Da-Pa], [Ba], [Co-Shp].Are <strong>the</strong>re o<strong>the</strong>r situations where one should expect congruence obstructions<strong>to</strong> be <strong>the</strong> only thing preventing a given integer A from occurringas a p infinitely often? A natural context for this question is that ofa compatible system of l-adic representations of some π 1 (Spec(Z[1/∆])).Let us recall one version of this notion. We are given an integer n ≥ 1and, for each prime l, a homomorphismρ l ∞ : π 1 (Spec(Z[1/l∆])) → GL(n, Z l ).The compatibility condition is that for every prime p not dividing ∆,<strong>the</strong>re is an integer polynomial P p (T ) ∈ Z[T ] such that for every primel ≠ p, <strong>the</strong> reversed characteristic polynomialdet(1 − T ρ l ∞(F rob p )) ∈ Z l [T ]lies in Z[T ] and is equal <strong>to</strong> P p (T ). We are <strong>the</strong>n interested in <strong>the</strong> a p :=Trace(F rob p ) (trace in any l-adic representation with l ≠ p) for good(i.e., prime <strong>to</strong> ∆) primes p. Reducing mod powers l ν of l, we getrepresentationsρ l ν: π 1 (Spec(Z[1/l∆])) → GL(n, Z/l ν Z).Putting <strong>the</strong>se <strong>to</strong>ge<strong>the</strong>r, we get for each integer N ≥ 2 a mod N representationρ N : π 1 (Spec(Z[1/Nl∆])) → GL(n, Z/NZ).9 Without some sort of “non-CM” hypo<strong>the</strong>sis, we can have a p = 0 for a set ofprimes p of positive Dirichlet density, cf. <strong>the</strong> example following Conjecture 1.7.Perhaps for nonzero A <strong>the</strong> conjecture remains reasonable for any C/Q.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 13Exactly as in <strong>the</strong> case of curves, A has no congruence obstruction atN, i.e., A ≡ a p mod N holds for infinitely many p, if and only if <strong>the</strong>reis an element in <strong>the</strong> image group Im(ρ N ) ⊂ GL(n, Z/NZ) whose traceis A mod N. In this case <strong>the</strong> set of p for which A ≡ a p mod N haspositive Dirichlet density.In <strong>the</strong> case of curves, <strong>the</strong>se representations are “pure of weight 1” in<strong>the</strong> sense that for each good p, when we fac<strong>to</strong>r P p (T ) = ∏ i (1 − α iT )over C, each α i has |α i | = p 1/2 . This in turn implies <strong>the</strong> estimate|a p | ≤ np 1/2 .The Lang-Trotter idea is that for any compatible system which ispure of weight 1, it is only congruence obstructions which preventan integer A from being a p for infinitely many primes p. As Serrehas remarked [Se-Cheb, 8.2, Remarques (3)], all of <strong>the</strong> image groupsIm(ρ N ) ⊂ GL(n, Z/NZ) contain <strong>the</strong> identity, and hence its trace, <strong>the</strong>integer n, has no congruence obstruction. Specializing <strong>to</strong> <strong>the</strong> case ofcurves, we get <strong>the</strong> following conjecture, which in genus g ≥ 1 seems <strong>to</strong>be entirely open. [It is of course trivially correct in genus zero, whereevery a p vanishes.]Conjecture 1.6. Let C/Q be a projective smooth geometrically connectedcurve of genus g. Then <strong>the</strong>re are infinitely many good primes pwith a p = 2g.Already very special cases of this conjecture are extremely interesting.Consider <strong>the</strong> special g = 1 case when E/Q is <strong>the</strong> lemniscatecurve y 2 = x 3 − x, which has good reduction outside of 2. Here weknow <strong>the</strong> explicit “formula” for a p , cf. [Ir-Ros, Chpt.18, &4, Thm. 5].If p ≡ 3 mod 4, <strong>the</strong>n a p = 0. If p ≡ 1 mod 4, <strong>the</strong>n we can writep = α 2 + β 2 with integers α, β, α odd, β even, and α ≡ 1 + β mod 4.This specifies α uniquely, and it specifies ±β. [More conceptually, <strong>the</strong>two gaussian integers α ± βi are <strong>the</strong> unique gaussian primes in Z[i]which are 1 mod 2 + 2i and which lie over p.] Then a p = 2α. So wehave a p = 2 precisely when <strong>the</strong>re is a gaussian prime of <strong>the</strong> form 1 + βiwith 1 ≡ 1+β mod 4, i.e. with β = 4n for some integer n. Thus a p = 2precisely when <strong>the</strong>re exists an integer n withp = 1 + 16n 2 .So <strong>the</strong> conjecture for this particular curve is <strong>the</strong> statement that <strong>the</strong>reare infinitely many primes of <strong>the</strong> form 1 + 16n 2 .There is ano<strong>the</strong>r element common <strong>to</strong> all <strong>the</strong> mod N image groups.Embeddings of Q in<strong>to</strong> C determine “complex conjugation” elementsin Gal(Q/Q), all in <strong>the</strong> same conjugacy class, denoted F rob R . In <strong>the</strong>


14 NICHOLAS M. KATZcurve case, F rob R has g eigenvalues 1 and g eigenvalues −1 in everyl-adic representation. Therefore F rob R has trace zero in every l-adicrepresentation, and consequently in every mod N representation. So weare led <strong>to</strong> <strong>the</strong> following conjecture, which in genus g = 1 is a celebratedresult of Elkies, cf. [Elkies-Real] and [Elkies-SS].Conjecture 1.7. Let C/Q be a projective smooth geometrically connectedcurve of genus g. Then <strong>the</strong>re are infinitely many good primes pwith a p = 0.This conjecture is trivially true in some cases. For example, takean odd Q-polynomial h(x) = −h(−x) with all distinct roots, and <strong>the</strong>curve y 2 = h(x). Then <strong>the</strong> character sum formula for a p shows thata p = 0 for all good p ≡ 3 mod 4. But for an irreducible h of degreed ≥ 5 whose Galois group is ei<strong>the</strong>r S d or A d , and <strong>the</strong> curve y 2 = h(x),this conjecture seems <strong>to</strong> be entirely open.What should we expect for compatible systems which are pure ofweight 2, i.e., each |α i | = p? In this weight 2 case, <strong>the</strong> probabilisticmodel has sets X p = Z ∩ [−np, np] of size 2np + 1. So <strong>the</strong> series∑ p 1/#X p ∼ (1/2n) ∑ p1/p diverges slowly, and <strong>the</strong> model allowsA = a p <strong>to</strong> hold about (1/2n) log log T times for primes up <strong>to</strong> T . Butin weight 2 <strong>the</strong>re may be more than congruence obstructions <strong>to</strong> havinga given A being a p infinitely often. Here is <strong>the</strong> simplest example.Start with an elliptic curve E/Q, say with good reduction at primes pnot dividing some integer ∆, and its compatible system of weight onerepresentationsρ l ∞ : π 1 (Spec(Z[1/l∆])) → GL(2, Z l ).In each of <strong>the</strong>se, F rob R has eigenvalues 1 and −1. Now consider <strong>the</strong>compatible systemSym 2 (ρ l ∞) : π 1 (Spec(Z[1/l∆])) → GL(3, Z l ).In each of <strong>the</strong>se, F rob R has two eigenvalues 1 and one eigenvalue −1,so has trace 1, and hence has trace 1 in every mod N representationSym 2 (ρ N ). Thus A = 1 has no congruence obstruction for <strong>the</strong> compatiblesystem of Sym 2 (ρ l ∞)’s. Denote by A p <strong>the</strong> trace of F rob p in thisSym 2 system. Then A p is related <strong>to</strong> <strong>the</strong> original a p by <strong>the</strong> formulaA p = (a p ) 2 − p.So A p = 1 is equivalent <strong>to</strong> (a p ) 2 − p = 1, i.e. <strong>to</strong>p = (a p + 1)(a p − 1),which is trivially impossible for p ≥ 5.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 15It would be interesting <strong>to</strong> understand, even conjecturally, what “should”be true about compatible weight 2 systems, for instance for <strong>the</strong> a pof a weight 3 newform 10 with integer coefficients on some congruencesubgroup Γ 1 (N). Here we are dealing with a compatible system of 2dimensional representations, so in particular A = 2 has no congruenceobstruction. It may well be that no fixed nonzero integer A is a p forinfinitely many p, no computer experiment can convince us ei<strong>the</strong>r way.None<strong>the</strong>less, we report on some computer experiments below. Caveatemp<strong>to</strong>r.The simplest examples of weight 3 newforms with integer coefficientsare gotten by taking a (K-valued, type (1, 0)) weight one grossencharacterρ of a quadratic imaginary field K of class number one and inducingits square down <strong>to</strong> Q. The common feature <strong>the</strong>y exhibit is thatfor a certain integer D ≥ 1, we have a p = 2 if and only if <strong>the</strong> pair ofsimultaneous equationsx 2 + Dy 2 = p, x 2 − Dy 2 = 1has an integer solution. Here are some examples.(D=1) Here K = Q(i), and ρ attaches <strong>to</strong> an odd prime ideal P of Z[i]<strong>the</strong> unique genera<strong>to</strong>r π = α + βi ≡ 1 mod (2 + 2i). This ρ is<strong>the</strong> grossencharacter attached <strong>to</strong> <strong>the</strong> elliptic curve y 2 = x 3 − x,cf. [Ir-Ros, Chpt. 18, Thm. 5]. Inducing ρ 2 gives a weight3 newform on Γ 1 (16) whose nebentypus character is <strong>the</strong> mod4 character of order 2. [This is 16k3A[1,0]1 in Stein’s tables[St].] See [Ka-TLFM, 8.8.10-11] for ano<strong>the</strong>r occurrence, in <strong>the</strong>cohomology of a certain elliptic surface.] For this form, we havea p = 0 unless p ≡ 1 mod 4. When p ≡ 1 mod 4, choose a Plying over p, and write ρ(P) = π = α + βi. Thena p = Trace Q(i)/Q ((π) 2 ) = 2(α 2 − β 2 ) = 2(α − β)(α + β).So no odd A is ever a p . For a fixed nonzero even A, <strong>the</strong> pairof integers (α − β, α + β) is on <strong>the</strong> finite list of fac<strong>to</strong>rizationsin Z of A/2. Solving for (α, β), we see that (α, β) is itself on afinite list. So p = α 2 + β 2 is on a finite list, and hence a p = Aholds for at most finitely many primes p. In this particularexample, A = 2 is never an a p , since <strong>the</strong> only integer solutionsof α 2 − β 2 = 1 are (±1, 0). This D = 1 case is <strong>the</strong> only casewhere we can prove that for any fixed nonzero A, a p = A holdsfor at most finitely many primes p.10 The weight in <strong>the</strong> sense of modular forms is one more than <strong>the</strong> weight in <strong>the</strong>sense of compatible systems.


16 NICHOLAS M. KATZ(D=2) Here K = Q( √ −2), and ρ attaches <strong>to</strong> an odd prime ideal Pof Z[ √ −2] <strong>the</strong> unique genera<strong>to</strong>r π = α + β √ −2 with α ≡ 1mod 4. Inducing ρ 2 gives a weight 3 newform on Γ 1 (8) whosenebentypus character is <strong>the</strong> mod 8 character of order 2 whosekernel is {1, 3}. [This is 8k3A[1,1]1 in Stein’s tables [St].] Forodd p, a p vanishes unless p ≡ 1 or 3 mod 8. When p ≡ 1 or3 mod 8, choose ei<strong>the</strong>r P lying over p, and write ρ(P) = π =α + β √ −2. Then p = Norm Q(√ −2)/Q (π) = α 2 + 2β 2 , anda p = Trace Q(√ −2)/Q ((π) 2 ) = 2(α 2 − 2β 2 ).(D=3) Here K = Q(ζ 3 ), and ρ attaches <strong>to</strong> a prime-<strong>to</strong>-6 prime ideal Pof Z[ζ 3 ] <strong>the</strong> unique genera<strong>to</strong>r π = α + β √ −3 which lies in <strong>the</strong>order Z[ √ −3] and which has α ≡ 1 mod 3. Inducing ρ 2 gives aweight 3 newform on Γ 1 (12) whose nebentypus character is <strong>the</strong>mod 3 character of order 2. [This is 12k3A[0,1]1 in Stein’s tables[St].] For p prime <strong>to</strong> 6, a p vanishes p ≡ 1 mod 3. If p ≡ 1 mod3, choose a P lying over p, and write ρ(P) = π = α + β √ −3.Then Then p = Norm Q(ζ3 )/Q(π) = α 2 + 3β 2 , anda p = Trace Q(ζ3 )/Q((π) 2 ) = 2(α 2 − 3β 2 ).(D=27) Here K = Q(ζ 3 ), and ρ attaches <strong>to</strong> a prime-<strong>to</strong>-3 prime ideal Pof Z[ζ 3 ] <strong>the</strong> unique genera<strong>to</strong>r π = α + β(3ζ 3 ) which lies in <strong>the</strong>order Z[3ζ 3 ] and has α ≡ 1 mod 3.This ρ is <strong>the</strong> grossencharacterattached <strong>to</strong> <strong>the</strong> elliptic curve y 2 = x 3 + 16, cf. [Ir-Ros, Chpt.18, Thm. 4]. Inducing ρ 2 gives a weight 3 newform on Γ 1 (27)whose nebentypus character is <strong>the</strong> mod 3 character of order 2.[This is 27k3A[9]1 in Stein’s tables [St].] For p prime <strong>to</strong> 3, a pvanishes p ≡ 1 mod 3. If p ≡ 1 mod 3, choose a P lying overp, and write ρ(P) = π = α + 3βζ 3 . Then p = Norm Q(ζ3 )/Q(π) =α 2 − 3αβ + 9β 2 anda p = Trace Q(ζ3 )/Q((π) 2 ) = 2α 2 − 6αβ − 9β 2 .So if a p is even, <strong>the</strong>n β must be even, say β = 2B, and ourequations becomep = (α − 3B) 2 + 27B 2 , a p = 2((α − 3B) 2 − 27B 2 ).(D=7,11,19,43,67,163) Here K = Q( √ −D), and ρ attaches <strong>to</strong> a prime-<strong>to</strong>-D prime idealP of Z[(1 + √ −D)/2] <strong>the</strong> unique genera<strong>to</strong>r π = α 0 + β 0 (1 +√−D)/2 which mod√−D is a square mod D. Inducing ρ2gives a weight 3 newform on Γ 1 (D) whose nebentypus characteris <strong>the</strong> mod D character of order 2. [This is Dk3A[(D-1)/2]1in Stein’s tables [St].] For p ≠ D, a p vanishes unless p is a


18 NICHOLAS M. KATZD u D T X # for n =2 3 + 2 √ D 32768 50170 3 1, 2, 43 2 + √ D 32768 37482 3 1, 2, 827 26 + 5 √ D ∞ ∞ 0 none7 8 + 3 √ D 32768 78801 3 1, 2, 1611 10 + 3 √ D 16384 42596 2 1, 219 170 + 39 √ D 8192 41475 0 none43 3482 + 531 √ D 8192 62961 0 none67 48842 + 5967 √ D 8192 81753 2 4, 32163 64080026 + 5019135 √ D 8192 132837 0 noneThat <strong>the</strong>re are provably none for D = 27 results from <strong>the</strong> fact thatu 27 is <strong>the</strong> cube of u 3 . For <strong>the</strong> amusement of <strong>the</strong> reader, we give below,for D = 2, 3, 7, 11, <strong>the</strong> two or three primes p with a p = 2 in our searchrange.D p 1 p 2 p 32 17 577 6658573 7 19 7081589777 127 32257 15003817139490503043200328185433971097711 199 79201 no third one[For D = 67, <strong>the</strong> first of <strong>the</strong> two primes found in our search range witha p = 2 wasp = 4145314481238973783106627512888262311297.The second prime found with a p = 2 had 320 digits; it was <strong>to</strong>o big forMa<strong>the</strong>matica <strong>to</strong> certify its primality.]2. Lang-Trotter in <strong>the</strong> function field case: generalitiesand what we might hope forWe now turn <strong>to</strong> a discussion of <strong>the</strong> Lang-Trotter conjecture for ellipticcurves in <strong>the</strong> function field case, cf. [Pa] for an earlier discussion(but note that his Proposition 4.4 is incorrect). Thus we let k be afinite field F q of some characteristic p > 0, X/k a projective, smooth,geometrically connected curve, K <strong>the</strong> function field of X, and E/K anelliptic curve over K. Then E has good reduction at all but finitelymany closed points P ∈ X; more precisely, its Neron model E/X is,over some dense open set U ⊂ X, a one-dimensional abelian scheme.For each closed point P ∈ U, with residue field F P of cardinality N(P),


20 NICHOLAS M. KATZFor each finite extension field F Q /k, and each F Q -valued point u ∈U(F Q ), we have its arithmetic Frobenius conjugacy class F rob u,FQ ∈π 1 (U), whose image in Gal(k/k) is <strong>the</strong> Q’th power au<strong>to</strong>morphism of k.For a closed point P of U of some degree d ≥ 1, viewed as a Gal(k/k)-orbit of length d in U(k), we have <strong>the</strong> arithmetic Frobenius conjugacyclass F rob P ∈ π 1 (U), equal <strong>to</strong> <strong>the</strong> class of F rob u,FQ , for F Q <strong>the</strong> residuefield F q d of P and for u ∈ U(F Q ) any point in <strong>the</strong> orbit which “is” P.For any element F ∈ π 1 (U) of degree one, e.g., F rob u,k if <strong>the</strong>re existsa k-rational point of U, we have a semidirect product descriptionπ geom1 (U)⋊ < F > ∼−→ π 1 (U)where < F > −→ ∼Ẑ is <strong>the</strong> pro-cyclic group generated by F .The second difference from <strong>the</strong> number field case is that only forintegers N 0 ≥ 2 which are prime <strong>to</strong> p is <strong>the</strong> group scheme E[N 0 ] a finiteétale form of Z/N 0 Z × Z/N 0 Z. So it is only for integers N 0 ≥ 2 whichare prime <strong>to</strong> p that we get a mod N 0 representationρ N0 : π 1 (U) → (GL(2, Z/N 0 Z).For a finite extension field F Q /k, and an F Q -valued point u ∈ U(F Q ),we have an elliptic curve E u,FQ /F Q , <strong>the</strong> number of whose F Q -rationalpoints we writeE u,FQ (F Q ) = Q + 1 − A u,FQ .The fundamental compatibility is that for each N 0prime <strong>to</strong> p, we have≥ 2 which isTrace(ρ N0 (F rob u,FQ )) ≡ A u,FQ mod N 0 , det(ρ N0 (Frob u,FQ )) ≡ Q mod N 0 .In particular, for a closed point P of U, weTrace(ρ N0 (F rob P )) ≡ A P mod N 0 , det(ρ N0 (Frob P )) ≡ N(P) mod N 0 .The third difference from <strong>the</strong> number field case is that, because E/Uis fibre by fibre ordinary, <strong>the</strong> p-divisible group E[p ∞ ] sits in a shortexact sequence0 → E[p ∞ ] 0 → E[p ∞ ] → E[p ∞ ] et → 0,in which <strong>the</strong> quotient E[p ∞ ] et is a form of Q p /Z p , and <strong>the</strong> kernel E[p ∞ ] 0is <strong>the</strong> dual form of µ p ∞. So <strong>the</strong> quotient E[p ∞ ] et gives us a homomorphismρ p ∞ : π 1 (U) → Aut gp (Q p /Z p ) ∼ = GL(1, Z p ) ∼ = Z × p .On Frobenius elements, this p-adic character ρ p ∞ of π 1 (U) gives <strong>the</strong>p-adic unit eigenvalue of Frobenius: <strong>the</strong> fact that <strong>the</strong> integer A u,FQ ,


g N (avg, Q) := (1/N)<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 23∑A mod Ng N (A, Q) = (1/N)g N,det=Q .The relevance of <strong>the</strong> subsets G N (A, Q) ⊂ G N,det=Q ⊂ G N is this.Suppose we are given an integer A prime <strong>to</strong> p, and a power Q of #k. If<strong>the</strong>re is an F Q -valued point u ∈ U(F Q ) with A u,FQ = A, resp. a closedpoint P with norm Q and A P = A, <strong>the</strong>n for every N ≥ 2, ρ N (F rob u,FQ ),resp. ρ N (F rob P ), lies in G N (A, Q).We say that <strong>the</strong> data (A, Q), A an integer prime <strong>to</strong> p and Q a (strictlypositive) power of #k, has a congruence obstruction at N if <strong>the</strong> setG N (A, Q) is empty. And we say that (A, Q) has an archimedean obstructionif A 2 > 4Q.The most optimistic hope is that if (A, Q) has nei<strong>the</strong>r archimedeannor congruence obstruction (i.e., A is prime <strong>to</strong> p, |A| < 2 √ Q, and for allN ≥ 2 <strong>the</strong> set G N (A, Q) is nonempty), <strong>the</strong>n <strong>the</strong>re should be a closedpoint P with norm Q and A P = A. [And we might even speculateabout how many, at least if Q is suitable large.] Unfortunately, thishope is false for trivial reasons; we can remove from U all its closedpoints of any given degree and obtain now a new situation where <strong>the</strong>groups G N , being birational invariants, are unchanged, but where <strong>the</strong>reare no closed points whatever of <strong>the</strong> given degree. What is <strong>to</strong> bedone? One possibility is <strong>to</strong> make this sort of counterexample illegal:go back <strong>to</strong> <strong>the</strong> projective smooth geometrically connected curve X/kwith function field K in which U sits as a dense open set, and replace Uby <strong>the</strong> possibly larger open set U max ⊂ X we obtain by removing fromX only those points at which <strong>the</strong> Neron model of E K /K has ei<strong>the</strong>r badreduction or supersingular reduction. But even this alleged remedy isinsufficient, as we will see below. It is still conceivable that if (A, Q) hasnei<strong>the</strong>r archimedean nor congruence obstruction <strong>the</strong>re is an F Q -pointu ∈ U max such that F rob u,FQ gives rise <strong>to</strong> (A, Q); <strong>the</strong> counterexamplebelow does not rule out this possibility.Here is <strong>the</strong> simplest counterexample.Take any prime power q = p ν ≥4, take for U = U max <strong>the</strong> (ordinary part of <strong>the</strong>) Igusa curve Ig(q) ord /F q ,and take for E/U <strong>the</strong> corresponding universal elliptic curve. For afinite field (or indeed for any perfect field) L/k, an L-valued pointu ∈ Ig(q) ord (L) is an L-isomorphism class of pairs (E/L, P ∈ E[q](L))consisting of an elliptic curve E/L <strong>to</strong>ge<strong>the</strong>r with an L-rational poin<strong>to</strong>f order q. Now consider <strong>the</strong> data (A = 1 − 2q, Q = q 2 ). The keyfact is that any E 2 /F q 2 with trace A 2 = 1 − 2q is isomorphic <strong>to</strong> <strong>the</strong>extension of scalars of a unique E 1 /F q with trace A 1 = 1, as will beshown in Lemma 4.1. But any such E 1 /F q has q rational points, so<strong>the</strong> group E 1 (F q ) is cyclic of order q, and hence every point of order q


24 NICHOLAS M. KATZin E 1 (F q ), and a fortiori every point of order q in E 1 (F q 2), is alreadyF q -rational. So although <strong>the</strong> data (A = 1 − 2q, Q = q 2 ) occurs from anF q 2-point, and hence has no congruence obstruction, it does not occurfrom a closed point of degree 2.There are three plausible hopes one might entertain in <strong>the</strong> functionfield case. Let E/U be as above (fibrewise ordinary, nonconstant j-invariant). Here are <strong>the</strong> first two.HOPE (1) Given a prime-<strong>to</strong>-p integer A, <strong>the</strong>re exists a real constant C(A, E/U)with <strong>the</strong> following property. If Q is a power of #k with Q ≥C(A, E/U), and if (A, Q) has nei<strong>the</strong>r archimedean nor congruenceobstruction, <strong>the</strong>n <strong>the</strong>re exists a closed point P with normQ and A P = A.HOPE (2) Given a prime-<strong>to</strong>-p integer A, and a real number ɛ > 0, <strong>the</strong>reexists a real constant C(A, ɛ, E/U) with <strong>the</strong> following property.If Q is a power of #k with Q ≥ C(A, ɛ, E/U), and if (A, Q)has nei<strong>the</strong>r archimedean nor congruence obstruction, <strong>the</strong>n for<strong>the</strong> number π A,Q of closed points with norm Q and A P = Aand for <strong>the</strong> number n A,Q of F Q -valued points u ∈ U(F Q ) withA u,FQ = A we have <strong>the</strong> inequalitiesQ 1 2 −ɛ < π A,Q ≤ n A,Q < Q 1 2 +ɛ .To describe <strong>the</strong> final hope, we must discuss ano<strong>the</strong>r, weaker, notionof congruence obstruction. Given a prime-<strong>to</strong>-p integer A, suppose <strong>the</strong>reare infinitely many closed points P with A P = A. Then as <strong>the</strong>re areonly finitely many closed points of each degree, it follows that <strong>the</strong>re areinfinitely many powers Q i of #k for which (A, Q i ) has no congruenceobstruction (and of course no archimedean obstruction ei<strong>the</strong>r). For afixed N = N 0 p ν , if Q i is sufficiently large (Q i ≥ p ν being <strong>the</strong> precisecondition), <strong>the</strong>n G N contains an element whose trace is A mod N.So we are led <strong>to</strong> a weaker notion of congruence obstruction, whichis <strong>the</strong> literal analogue of <strong>the</strong> number field condition: we say that <strong>the</strong>prime-<strong>to</strong>-p integer A has a congruence obstruction at N if G N containsno element whose trace is A mod N, and we say that A has a congruenceobstruction if it has one at N for some N. This brings us <strong>to</strong> <strong>the</strong> thirdhope.HOPE (3) Suppose <strong>the</strong> prime-<strong>to</strong>-p integer A has no congruence obstruction.Then <strong>the</strong>re exist infinitely many closed points P withA P = A.Notice, however, that <strong>the</strong> assumption that A has no congruence obstructionis, at least on its face, much weaker than <strong>the</strong> assumption


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 25that <strong>the</strong>re are infinitely many powers Q i of #k for which (A, Q i ) hasno congruence obstruction.3. Lang-Trotter in <strong>the</strong> function field case: <strong>the</strong> case ofmodular curvesIn <strong>the</strong> number field case, <strong>the</strong>re is no elliptic curve where we knowLang-Trotter for even a single nonzero integer A. But over any finitefield k, we will show that <strong>the</strong>re are infinitely many examples of situationsE/U/k, nonconstant j-invariant and fibrewise ordinary, where allthree of our hopes are provably correct. These examples are providedby modular curves over finite fields, and <strong>the</strong> universal families of ellipticcurves <strong>the</strong>y carry.Let us first describe <strong>the</strong> sorts of level structures we propose <strong>to</strong> dealwith in a given characteristic p > 0. We specify three prime-<strong>to</strong>-ppositive integers (L, M, N 0 ) and a power p ν ≥ 1 of p. We assume that(L, M, N 0 ) are pairwise relatively prime.Given this data, we work over a finite extension k/F p given with aprimitive N 0 ’th root of unity ζ N0 ∈ k, and consider <strong>the</strong> moduli problem,on k-schemes S/k, of S-isomorphism classes of fibrewise ordinaryelliptic curves E/S endowed with all of <strong>the</strong> following data, which forbrevity we will call an M-structure on E/S.(1) A cyclic subgroup of order L, i.e., a Γ 0 (L)-structure on E/S.(2) A point P M of order M, i.e., a Γ 1 (M)-structure on E/S,(3) A basis (Q, R) of E[N 0 ] with e N0 (Q, R) = ζ N0 , i.e., an orientedΓ(N 0 )-structure on E/S.(4) A genera<strong>to</strong>r T of Ker(V ν : E (pν /S)→ E), i.e., an Ig(p ν )-structure on E/S.Having specified a finite extension k/F p given with a primitive N 0 ’throot of unity ζ N0 ∈ k and <strong>the</strong> data (L, M, N 0 , p ν ) above, we make <strong>the</strong>fur<strong>the</strong>r assumption that at least one of <strong>the</strong> following three conditionsholds:(1) M ≥ 4,(2) N 0 ≥ 3,(3) p ν ≥ 4.This assumption guarantees that <strong>the</strong> associated moduli problem is representableby a smooth, geometrically connected k-curve M ord overwhich we have <strong>the</strong> corresponding universal family E univ /M ord . Forthis situation, points of M ord have a completely explicit description.For any k-scheme S/k, <strong>the</strong> S-valued points of M ord are precisely<strong>the</strong> S-isomorphism classes of fibrewise ordinary elliptic curves E/S


26 NICHOLAS M. KATZendowed with an M-structure. In particular, for F Q /k a finite overfield,an F Q -valued point of M ord is an F Q -isomorphism class of pairs(an ordinary elliptic curve E/F Q ,an M−structure on it).What about closed points P of M ord with norm N(P) = Q? Theseare precisely <strong>the</strong> orbits of Gal(F Q /k) on <strong>the</strong> set M ord (F Q ) which containdeg(F Q /k) distinct F Q -valued points. In more down <strong>to</strong> earth terms,an F Q -valued point lies in <strong>the</strong> orbit of a closed point of norm N(P) = Qif and only it is not (<strong>the</strong> extension of scalars of) a point with values ina proper subfield k ⊂ F Q1 F Q . Let us denote byM ord (F Q ) prim ⊂ M ord (F Q )those F Q -valued points which lie in no proper subfield. So we have <strong>the</strong>tau<strong>to</strong>logical formula#{closed points with norm Q} = #Mord (F Q ) prim.deg(F Q /k)4. Counting ordinary points on modular curves by classnumber formulasIn this section, we recall <strong>the</strong> use of class number formulas in countingordinary points. In a later section, we will invoke <strong>the</strong> Brauer-Siegel<strong>the</strong>orem (but only for quadratic imaginary fields, so really Siegel’s <strong>the</strong>orem[Sie]) and its extension <strong>to</strong> quadratic imaginary orders, <strong>to</strong> convert<strong>the</strong>se class number formulas in<strong>to</strong> <strong>the</strong> explicit upper and lower boundsasserted in HOPE (2).] These class number formulas go back <strong>to</strong> Deuring[Deu], cf. also Waterhouse [Wat]. As Howe points out [Howe], <strong>the</strong>s<strong>to</strong>ry is considerably simplified if we make use of Deligne’s description[De-VA] of ordinary elliptic curves over a given finite field. Let F q bea finite field, and E/F q an ordinary elliptic curve. We have#E(F q ) = q + 1 − A,for some prime-<strong>to</strong>-p integer A satisfyingA 2 < 4q.Conversely, given any prime-<strong>to</strong>-p integer A satisfyingA 2 < 4q,one knows by Honda-Tate, cf. [Honda] and [Tate], that <strong>the</strong>re is at leas<strong>to</strong>ne one ordinary elliptic curve E/F q with#E(F q ) = q + 1 − A.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 27The first question, <strong>the</strong>n, is <strong>to</strong> describe, for fixed (A, q) as above (i.e.,A prime-<strong>to</strong>-p with A 2 < 4q) <strong>the</strong> category of all ordinary elliptic curvesE/F q with#E(F q ) = q + 1 − A,<strong>the</strong> morphisms being F q -homomorphisms. We denote by Z[F ] <strong>the</strong> ringZ[X]/(X 2 − AX + q). Since A 2 < 4q, this ring Z[F ] is an order ina quadratic imaginary field, whose ring of integers we will denote O.[In <strong>the</strong> general Deligne s<strong>to</strong>ry we would need <strong>to</strong> work with <strong>the</strong> ringZ[F, q/F ], but here q/F is already present, namely q/F = A − F .]Deligne provides an explicit equivalence of categories (by picking (!) anembedding of <strong>the</strong> ring of Witt vec<strong>to</strong>rs W (F q ) in<strong>to</strong> C and <strong>the</strong>n taking<strong>the</strong> first integer homology group of <strong>the</strong> Serre-Tate canonical lifting, cf.[Mes, V 2.3, V 3.3, and Appendix]) of this category with <strong>the</strong> categoryof Z[F ]-modules H which as Z-modules are free of rank 2 and suchthat <strong>the</strong> characteristic polynomial of F acting on H isX 2 − AX + q.In this equivalence of categories, suppose an ordinary E/F q givesrise <strong>to</strong> <strong>the</strong> Z[F ]-module H. For any prime-<strong>to</strong>-p integer N, <strong>the</strong> groupE[N](F q ) as Z[F ]-module, F acting as <strong>the</strong> arithmetic Frobenius F rob qin Gal(F q /F q ), is just <strong>the</strong> Z[F ]-module H/NH. For a power p ν of p,<strong>the</strong> group E[p ν ](F q ) as Z[F ]-module is obtained from H as follows. Wefirst write <strong>the</strong> Z p [F ]-decompositionH ⊗ Z Z p = H et ⊕ H conn ,H et := Ker(F − unit q (A)), H conn := Ker(F − q/unit q (A)),of H ⊗ Z Z p as <strong>the</strong> direct sum of two free Z p -modules of rank one, ofwhich <strong>the</strong> first is called <strong>the</strong>“unit root subspace”. Then for each powerp ν of p, we haveE[p ν ](F q ) ∼ = H et /p ν H et .An equivalent, but less illuminating, description of H et /p ν H et is as<strong>the</strong> <strong>the</strong> image of F ν in H/p ν H (because H/p ν H is <strong>the</strong> direct sumH et /p ν H et ⊕ H conn /p ν H conn , and F ν is an isomorphism on <strong>the</strong> firstfact but kills <strong>the</strong> second fac<strong>to</strong>r).Here is an application of Deligne’s description.Lemma 4.1. Suppose E 2 /F q 2 is an elliptic curve with trace A 2 = 1−2q.Then <strong>the</strong>re exists a unique elliptic curve E 1 /F q with trace A 1 = 1 whichgives rise <strong>to</strong> E 2 /F q 2 by extension of scalars.Proof. Denote by F 2 <strong>the</strong> Frobenius for E 2 /F q 2. Then F 2 satisfiesF 2 2 − (1 − 2q)F 2 + q 2 = 0, i.e. F 2 = (F 2 + q) 2 .


28 NICHOLAS M. KATZThus F 1 := F 2 + q is a square root of F 2 , and it satisfies <strong>the</strong> equationF 2 1 − F 1 + q = 0, i.e. F 2 = F 1 − q.This last equation shows that Z[F 2 ] = Z[F 1 ]. In terms of <strong>the</strong> Z[F 2 ]-module H 2 attached <strong>to</strong> E 2 /F q 2, E 1 /F q is <strong>the</strong> unique curve over F q corresponding<strong>to</strong> <strong>the</strong> same H 2 , now viewed as a Z[F 1 ]-module. □Class number formulas are based on <strong>the</strong> following “miracle” of complexmultiplication of elliptic curves. [We say “miracle” because <strong>the</strong>analogous statements can be false for higher dimensional abelian varieties.]Given a Z[F ]-module H as above, we can form a possibly largerorder R,Z[F ] ⊂ R ⊂ O,defined asR := End Z[F ] (H).Of course this R is just <strong>the</strong> F q -endomorphism ring of <strong>the</strong> correspondingE/F q , thanks <strong>to</strong> <strong>the</strong> equivalence. So tau<strong>to</strong>logically H is an R-module.The miracle is that H is an invertible R-module, cf. [Sh, 4.11, 5.4.2].Of course any order Z[F ] ⊂ R ⊂ O can occur as H varies, since onecould take R itself as an H. So if we separate <strong>the</strong> ordinary ellipticcurves E/F q with given data (A, q) by <strong>the</strong> orders which are <strong>the</strong>irF q -endomorphism rings, <strong>the</strong>n for a given order R <strong>the</strong> F q -isomorphismclasses with that particular R are <strong>the</strong> isomorphism classes of invertibleR-modules, i.e., <strong>the</strong> elements of <strong>the</strong> Picard group P ic(R), whose orderis called <strong>the</strong> class number h(R) of <strong>the</strong> order R.Suppose that we now fix not only (A, q) but also <strong>the</strong> endomorphismring R. Then for any ordinary elliptic curve E/F q with this data, <strong>the</strong>question of exactly how many M-structures E/F q admits is determinedentirely by <strong>the</strong> data consisting of (A, q) and R. Indeed, if E/F q givesrise <strong>to</strong> H, <strong>the</strong>n H is an invertible R module. Now for any invertibleR-module H 1 , and for any integer N 1 ≥ 1, <strong>the</strong> invertible R/N 1 R-module H 1 /N 1 H 1 is R-isomorphic <strong>to</strong> R/N 1 R (simply because R/N 1 R,being finite, is semi-local, so has trivial Picard group), and hence afortiori is Z[F ]-isomorphic <strong>to</strong> R/N 1 R. Taking H 1 <strong>to</strong> be H and N 1 <strong>to</strong>be LMN 0 p ν , we conclude that H/(LMN 0 p ν )H is Z[F ]-isomorphic <strong>to</strong>R/(LMN 0 p ν )R. Translating back through Deligne’s equivalence, wesee that E[LMN 0 p ν ]((F q ) is Z[F ]-isomorphic <strong>to</strong>R/LMN 0 R × F ν (R/p ν R).Thus we have <strong>the</strong> following dictionary:(1) Γ 0 (L)-structure: a cyclic subgroup of R/LR of order L whichis Z[F ]-stable.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 29(2) Γ 1 (M)-structure: a point P ∈ R/MR which has additive orderM and which is fixed by F .(3) unoriented Γ(N 0 )-structure: a Z/N 0 Z-basis of R/N 0 R consistingof points fixed by F . An unoriented Γ(N 0 )-structure exists ifand only if F acts as <strong>the</strong> identity on R/N 0 R. If an unorientedΓ(N 0 )-structure exists, <strong>the</strong>re are precisely #GL(2, Z/N 0 Z) of<strong>the</strong>m. Of <strong>the</strong>se, precisely #SL(2, Z/N 0 Z) are oriented (for achosen ζ N0 ).(4) Ig(p ν )-structure: a Z/p ν Z-basis of F ν (R/p ν R) ( ∼ = H et /p ν H et )which is fixed by F , or equivalently, an F -fixed point in R/p ν Rwhich has additive order p ν .Thus we see explicitly that how many M-structures E/F q admitsis determined entirely by <strong>the</strong> data consisting of (A, q) and R. Let usdenote this number by#M(A, q, R).Notice also that for for such an E/F q giving rise <strong>to</strong> (A, q) and R, <strong>the</strong>au<strong>to</strong>morphism group of E/F q is <strong>the</strong> group R × of units in <strong>the</strong> endomorphismring R. Recall that F q points on <strong>the</strong> modular curve M ord are F q -isomorphism classes of pairs (ordinary E/F q , M − structure on E/F q ).So <strong>the</strong> number of F q points on M ord whose underlying ordinary ellipticcurve gives rise <strong>to</strong> <strong>the</strong> data (A, q, R) is <strong>the</strong> product#M(A, q, R)h(R)/#R × .For given (ordinary) data (A, q), with Z[F ] := Z[X]/(X 2 − AX + q)and ring of integers O ⊂ Q[F ],let us denote byM ord (F q , A) ⊂ M ord (F q )<strong>the</strong> set of F q points on M ord whose underlying ordinary elliptic curvegives rise <strong>to</strong> <strong>the</strong> data (A, q). Then #M ord (F q , A) is a sum, over allorders R between Z[F ] and O:∑#M ord (F q , A) =#M(A, q, R)h(R)/#R × .orders Z[F ]⊂R⊂OBefore we try <strong>to</strong> count M-structures, let us record <strong>the</strong> congruencesand inequalities which necessarily hold when such structures exist.Lemma 4.2. Let k/F p be a finite extension, given with a primitiveN 0 ’th root of unity ζ N0 ∈ k, F q /k a finite extension, and E/F q anordinary elliptic curve which gives rise <strong>to</strong> <strong>the</strong> data (A, q, R). Supposethat E/F q admits an M-structure. Then q ≡ 1 mod N 0 , and we have<strong>the</strong> following additional congruences.


30 NICHOLAS M. KATZ(1) There exists a ∈ (Z/LZ) × satisfying a 2 − Aa + q ≡ 0 mod L,i.e., <strong>the</strong> polynomial X 2 − AX + q fac<strong>to</strong>rs completely mod L.Equivalently, <strong>the</strong>re exists a ∈ (Z/LZ) × such that A ≡ a + q/amod L.(2) q + 1 ≡ A mod MN 2 0 p ν .Moreover, we have q ≥ p ν if p is odd. When p = 2, we also have q ≥ p νexcept in <strong>the</strong> two exceptional cases (q, p ν ) = (2, 4) and (q, p ν ) = (4, 8);in those two cases we have A = −1 and A = −3 respectively.Proof. That q ≡ 1 mod N 0 results from <strong>the</strong> fact that F q contains aprimitive N 0 ’th root of unity. To prove (1), suppose we have an F -stable Z/LZ subgroup Γ 0 ⊂ R/LR. Then F , being an au<strong>to</strong>morphismof R/LR, acts on this subgroup by multiplication by some unit a ∈(Z/LZ) × . But F 2 − AF + q annihilates R, so it annihilates R/LR. AsΓ 0 ⊂ R/LR is F -stable, and F acts on Γ 0 by a, we get that a 2 −Aa+q ∈Z/LZ annihilates this cyclic group of order L, so a 2 − Aa + q = 0 inZ/LZ. The existence of such an a is equivalent <strong>to</strong> <strong>the</strong> polynomialX 2 −AX +q fac<strong>to</strong>ring mod L, and <strong>to</strong> <strong>the</strong> congruence A ≡ a+q/a modL (<strong>the</strong>n <strong>the</strong> fac<strong>to</strong>rization is (X − a)(X − q/a) mod L). The congruence(2) is just <strong>the</strong> point-count divisibility that follows from having an M-structure. To prove <strong>the</strong> “moreover” statement, we exploit <strong>the</strong> fact that,by (2), p ν divides q + 1 − A. We argue by contradiction. If p ν > q,<strong>the</strong>n p ν ≥ pq (since q is itself a power of p). So p ν is divisible by pq,and hence pq divides q + 1 − A. By <strong>the</strong> Weil bound and ordinarity,q + 1 − A is nonzero (indeed q + 1 − A > ( √ q − 1) 2 > 0), so from <strong>the</strong>divisibility we get <strong>the</strong> inequalityq + 1 − A ≥ pq.Again by <strong>the</strong> Weil bound, we have ( √ q + 1) 2 > q + 1 − A, so we getq + 1 + 2 √ q = ( √ q + 1) 2 > pq = (p − 2)q + q + q.Adding 1 − 2 √ q − q <strong>to</strong> both sides, we getThis is nonsense if p ≥ 3.indicated cases.2 > (p − 2)q + ( √ q − 1) 2 .If p = 2, this can hold, precisely in <strong>the</strong>□To say more about how this works explicitly, we need <strong>to</strong> keep track,for given ordinary data (A, q), of <strong>the</strong> orders between Z[F ] and <strong>the</strong>full ring of integers O. The orders R ⊂ O are <strong>the</strong> subrings of <strong>the</strong>form Z + fO, with f ≥ 1 an integer. The integer f ≥ 1 is called<strong>the</strong> conduc<strong>to</strong>r of <strong>the</strong> order; it is <strong>the</strong> order of <strong>the</strong> additive group O/R.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 31Because (A, q) is given, <strong>the</strong> particular order Z[F ] ⊂ O is given, and wewill denote by f A,q its conduc<strong>to</strong>r:f A,q := conduc<strong>to</strong>r of Z[F ].An order R ⊂ O contains Z[F ] if and only if its conduc<strong>to</strong>r f R dividesf A,q . For an intermediate order Z[F ] ⊂ R ⊂ O, we define its coconduc<strong>to</strong>rfR c <strong>to</strong> be <strong>the</strong> quotient:f c R := f A,q /f R = #(R/Z[F ]).Of course this notion of co-conduc<strong>to</strong>r only makes sense because we havespecified <strong>the</strong> particular order Z[F ]. Just as <strong>the</strong> conduc<strong>to</strong>r measureshow far “down” an intermediate order is from O, so its co-conduc<strong>to</strong>rmeasures how far “up” it is from Z[F ].Lemma 4.3. Let k/F p be a finite extension, given with a primitiveN 0 ’th root of unity ζ N0 ∈ k, F q /k a finite extension, and E/F q anordinary elliptic curve which gives rise <strong>to</strong> <strong>the</strong> data (A, q, R). Supposethat <strong>the</strong> following congruences hold.(1) There exists a ∈ (Z/LZ) × satisfying a 2 − Aa + q ≡ 0 mod L.(2) q + 1 ≡ A mod MN 0 p ν .Then we have <strong>the</strong> following conclusions.(1) Whatever <strong>the</strong> order R, E/F q admits precisely φ(p ν ) Ig(p ν ) structures.(2) If R has co-conduc<strong>to</strong>r prime <strong>to</strong> L, <strong>the</strong>n E/F q admits at leas<strong>to</strong>ne Γ 0 (L) structure.(3) If R has co-conduc<strong>to</strong>r prime <strong>to</strong> M, <strong>the</strong>n E/F q admits preciselyφ(M) Γ 1 (M) structures.(4) If R has co-conduc<strong>to</strong>r divisible by N 0 , <strong>the</strong>n E/F q admits precisely#SL(2, Z/N 0 Z) oriented Γ(N 0 ) structures. O<strong>the</strong>rwise,E/F q admits none.Proof. (1) Since E/F q is ordinary, <strong>the</strong> group E(F q )[p ∞ ] is noncanonicallyQ p /Z p . So <strong>the</strong> p-power <strong>to</strong>rsion subgroup of E(F q ) is cyclic, andits order is <strong>the</strong> highest power of p which divides #E(F q ) = q + 1 − A.Because this cardinality is divisible by p ν , E(F q )[p ν ] is cyclic of orderp ν , and its φ(p ν ) genera<strong>to</strong>rs are precisely <strong>the</strong> Ig(p ν ) structures on E/F q .(2) and (3) The existence of a Γ 0 (L) (resp. Γ 1 (M))-structure dependsonly upon R/LR (resp. R/MR) as a Z[F ]-module. If R hasco-conduc<strong>to</strong>r prime <strong>to</strong> L (resp. M), <strong>the</strong>n <strong>the</strong> inclusion Z[F ] ⊂ R inducesa Z[F ]-isomorphism Z[F ]/LZ[F ] ∼ = R/LR (resp. Z[F ]/MZ[F ] ∼ =R/MR). So it suffices <strong>to</strong> treat <strong>the</strong> single case when R = Z[F ]. We willnow show in Z[F ]/LZ[F ] (resp. Z[F ]/MZ[F ]), <strong>the</strong> kernel of F −a (resp.F − 1) is a cyclic subgroup of order L (resp. M). Once we show this,


32 NICHOLAS M. KATZ<strong>the</strong>n <strong>the</strong> kernel of F −a in Z[F ]/LZ[F ] is <strong>the</strong> asserted Γ 0 (L)-structure,and <strong>the</strong> φ(M) genera<strong>to</strong>rs of <strong>the</strong> kernel of F − 1 in Z[F ]/MZ[F ] areall <strong>the</strong> Γ 1 (M) structures. The assertion about <strong>the</strong> kernels results from<strong>the</strong> fact (elementary divisors) that for an endomorphism Λ of a finitefree Z/LZ-module (resp. of a finite free Z/MZ-module), Ker(Λ) andCoker(Λ) are isomorphic abelian groups. [In fact, as Bill Messing explained<strong>to</strong> me, <strong>the</strong> kernel and cokernel of an endomorphism of any finiteabelian group are isomorphic abelian groups, but we will not need thatfiner statement here.] Applying this <strong>to</strong> <strong>the</strong> endomorphisms F − a ofZ[F ]/LZ[F ] and F −1 of Z[F ]/MZ[F ], we find that <strong>the</strong> relevant kernelsare <strong>the</strong> cyclic groups underlying <strong>the</strong> quotient ringsandZ[F ]/(L, F − a) := Z[X]/(L, X 2 − aX + q, X − a)∼ = Z/(L, a 2 − aA + q) ∼ = Z/LZ,Z[F ]/(M, F − 1) := Z[X]/(M, X 2 − aX + q, X − 1)∼ = Z/(M, 1 − A + q) ∼ = Z/MZ,(4) We have q ≡ 1 mod N 0 because F q contains a primitive N 0 ’throot of unity; by assumption N02 divides q + 1 − A. We must showthat all <strong>the</strong> points of order dividing N 0 are F q -rational if and only ifR has co-conduc<strong>to</strong>r divisible by N 0 . All <strong>the</strong> points of order dividingN 0 are F q -rational if and only if F − 1 kills R/NR, i.e., if and only ifif (F − 1)/N, which a priori lies in <strong>the</strong> fraction field of O, lies in R.[Let us remark in passing that in order for (F − 1)/N <strong>to</strong> lie in O, itis necessary and sufficient that its norm and trace down <strong>to</strong> Q lie inZ. But its norm down <strong>to</strong> Q is (q + 1 − A)/N0 2 and its trace down <strong>to</strong>Q is (A − 2)/N 0 = (q − 1)/N 0 + (A − q − 1)/N 0 .] Thus <strong>the</strong>re existΓ(N 0 )-structures if and only if R contains <strong>the</strong> order Z[(F − 1)/N 0 ].This last order visibly has co-conduc<strong>to</strong>r N 0 , so <strong>the</strong> orders containingit are precisely those whose co-conduc<strong>to</strong>r is divisible by N 0 . Onceany (possibly unoriented) Γ(N 0 ) structure exists, <strong>the</strong>re are precisely#SL(2, Z/N 0 Z) oriented Γ(N 0 )-structures.□Remark 4.4. In <strong>the</strong> above lemma, we don’t specify how many Γ 0 (L)-structures <strong>the</strong>re are,“even” when R has co-conduc<strong>to</strong>r prime <strong>to</strong> L, andwe don’t say when any exist for o<strong>the</strong>r R. We also don’t say how manyΓ 1 (M)-structures <strong>the</strong>re are for o<strong>the</strong>r R. For <strong>the</strong>se R, we will be able<strong>to</strong> make do with <strong>the</strong> trivial inequalities, valid for any R,0 ≤ #{Γ 0 (L) − structures on R/LR} ≤ #P 1 (Z/LZ),0 ≤ #{Γ 1 (M) − structures on R/MR} ≤ φ(M)#P 1 (Z/MZ).


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 335. Interlude: Brauer-Siegel for quadratic imaginaryordersThe following minor variant of Siegel’s <strong>the</strong>orem for quadratic imaginaryfields is certainly well known <strong>to</strong> <strong>the</strong> specialists. We give a proofhere for lack of a suitable reference. For a quadratic imaginary order,i.e., an order R in an quadratic imaginary field, we denote by d R itsdiscriminant, by h(R) := #P ic(R) its class number, and byh ⋆ (R) := h(R)/#R ×its “normalized” class number. [We should warn <strong>the</strong> reader that inGekeler [Ge, 2.13, 2.14] his h ⋆ and his H ⋆ are twice ours.]Theorem 5.1. Given a real ɛ > 0, <strong>the</strong>re exists a real constant C ɛ > 0such that for any quadratic imaginary order R with |d R | ≥ C ɛ , we have<strong>the</strong> inequalities|d R | 1 2 −ɛ ≤ h ⋆ (R) ≤ |d R | 1 2 +ɛ .Proof. Given a quadratic imaginary order R, denote by f R its conduc<strong>to</strong>r,K its fraction field, and O K <strong>the</strong> ring of integers of K. Then <strong>the</strong>discriminant d R of R = Z + f R O K is related <strong>to</strong> <strong>the</strong> discriminant d OKby <strong>the</strong> simple formulad R = f 2 Rd OK .Their normalized class numbers are related as follows, cf. [Cox, 7.2.6and exc. 7.30(a)] or [Sh, p. 105, exc. 4.12]:h ⋆ (R)h ⋆ (O K ) = #(O K/f R O K ) ×#(Z/f R Z) × .We rewrite this as follows. Given <strong>the</strong> quadratic imaginary field K,denote by χ K <strong>the</strong> associated Dirichlet character: for a prime numberp, χ K (p) := 1 if p splits in K, χ K (p) := 0 if p ramifies in K, andχ K (p) := −1 if p is inert in K. We <strong>the</strong>n define <strong>the</strong> multiplicativefunction φ K on strictly positive integers byφ K (1) = 1, φ K (nm) = φ K (n)φ K (m) if gcd(n, m) = 1,φ K (p ν ) = p ν−1 (p − χ K (p)), if ν ≥ 1.In terms of this function, we can rewrite <strong>the</strong> relation of normalizedclass numbers ash ⋆ (R) = φ K (f R )h ⋆ (O K ).By Siegel’s <strong>the</strong>orem, applied with ɛ/2, <strong>the</strong>re exist real constants A ɛ > 0and B ɛ > 0 such that for all quadratic imaginary fields K we have(∗∗ ɛ/2 ) : A ɛ |d OK | 1 2 −ɛ/2 ≤ h ⋆ (O K ) ≤ B ɛ |d OK | 1 2 +ɛ/2 .


34 NICHOLAS M. KATZ[This is true without A and B for |d| large; A and B take care of <strong>the</strong>small |d|. Conversely, if we know (∗∗ ɛ/2 ) for all |d|, we get (∗∗ ɛ ) forlarge |d| with A = B = 1.]In view of <strong>the</strong> formulasandh ⋆ (R) = φ K (f R )h ⋆ (O K ),d R = f 2 Rd OK ,it suffices <strong>to</strong> show that <strong>the</strong>re exist real constants A ′ ɛ > 0 and B ′ ɛ > 0such that for every quadratic imaginary field K and every integer f ≥ 1,we haveA ′ ɛf 1−ɛ ≤ φ(f) ≤ B ′ ɛf 1+ɛ .In view of <strong>the</strong> definition of φ K , this is immediate from <strong>the</strong> two followingobservations. First, for large (how large depending on ɛ) primes p, wehavep 1−ɛ ≤ p − 1 ≤ φ K (p) ≤ p + 1 ≤ p 1+ɛ .Second, for <strong>the</strong> finitely many, say N, small primes p where this fails,we can find real constants A ′′ɛ > 0 and B ɛ ′′ > 0 such thatA ′′ɛ p 1−ɛ ≤ p − 1 ≤ φ K (p) ≤ p + 1 ≤ B ′′ɛ p 1+ɛholds for <strong>the</strong>se N primes. We defineA ′ ɛ := (A ′′ɛ ) N , B ′ ɛ := (B ′′ɛ ) N .Then we have <strong>the</strong> desired inequalityA ′ ɛf 1−ɛ ≤ φ K (f) ≤ B ′ ɛf 1+ɛ .Once we have this, we combine it with Siegel’s <strong>the</strong>orem for quadraticimaginary fields <strong>to</strong> conclude that for every quadratic imaginary orderR we haveA ɛ A ′ ɛ|d R | 1 2 −ɛ/2 ≤ h ⋆ (R) ≤ B ɛ B ′ ɛ|d R | 1 2 +ɛ/2 .Then as soon as |d R | is large enough thatand1 ≤ A ɛ A ′ ɛ|d R | ɛ/2B ɛ B ′ ɛ|d R | −ɛ/2 ≤ 1,we get <strong>the</strong> assertion of <strong>the</strong> <strong>the</strong>orem.It is also convenient <strong>to</strong> introduce <strong>the</strong> (normalized) Kronecker classnumber of a quadratic imaginary order R, H ⋆ (R), defined as <strong>the</strong> sum□


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 35of <strong>the</strong> normalized class numbers of all orders between R and <strong>the</strong> ringof integers O in its fraction field:∑H ⋆ (R) :=h ⋆ (R ′ ).orders R⊂R ′ ⊂OCorollary 5.2. Given a real ɛ > 0, <strong>the</strong>re exists a real constant C ɛ > 0such that for any quadratic imaginary order R with |d R | ≥ C ɛ , we have<strong>the</strong> inequalities|d R | 1 2 −ɛ ≤ H ⋆ (R) ≤ |d R | 1 2 +ɛ .Proof. We trivially have H ⋆ (R) ≥ h ⋆ (R), so we get <strong>the</strong> asserted lowerbound for H ⋆ (R). To get <strong>the</strong> lower bound, recall from <strong>the</strong> proof of <strong>the</strong>previous <strong>the</strong>orem that for any quadratic imaginary order R ′ , we haveSo we getH ⋆ (R) ≤h ⋆ (R ′ ) ≤ B ɛ B ′ ɛ|d R ′| 1 2 +ɛ/2 .∑orders R⊂R ′ ⊂OB ɛ B ′ ɛ|d R ′| 1 2 +ɛ/2 .The co-conduc<strong>to</strong>rs f c R ′ := f R/f R ′ of <strong>the</strong>se intermediate orders withrespect <strong>to</strong> R are precisely <strong>the</strong> divisors of f R , and we haved R ′ = d R /(f c R ′)2 .Thus we haveH ⋆ (R) ≤ ∑ n|f RB ɛ B ′ ɛ|d R /n 2 | 1 2 +ɛ/2 .But <strong>the</strong> sum∑n≥11/n 1+ɛconverges, <strong>to</strong> ζ(1 + ɛ), so we get <strong>the</strong> inequalityH ⋆ (R) ≤ B ɛ B ′ ɛζ(1 + ɛ)|d R | 1 2 +ɛ/2for all quadratic imaginary R, and we need only take |d R large enoughthatB ɛ B ′ ɛζ(1 + ɛ)|d R | −ɛ/2 ≤ 1<strong>to</strong> insure <strong>the</strong> asserted upper bound.□


36 NICHOLAS M. KATZ6. Point-count estimatesWe now return <strong>to</strong> <strong>the</strong> modular curve M ord /k Recall that we fix acharacteristic p > 0, three prime-<strong>to</strong>-p positive integers (L, M, N 0 ) anda power p ν ≥ 1 of p. We assume that (L, M, N 0 ) are pairwise relativelyprime. We assume that ei<strong>the</strong>r M ≥ 4 or N 0 ≥ 3 or p ν ≥ 4. Wework over a finite extension k/F p given with a primitive N 0 ’th root ofunity ζ N0 ∈ k. We have <strong>the</strong> smooth, geometrically connected modularcurve M ord /k, which parameterizes isomorphism classes of fibrewiseordinary elliptic curves over k-schemes endowed with a Γ 0 (L)-structure,a Γ 1 (M)-structure, a Γ(N 0 )-structure, and an Ig(p ν )-structure.For a finite extension F q /k, and a prime-<strong>to</strong>-p integer A with |A| A 2 /4 such that (A, q) satisfies <strong>the</strong> conditions of <strong>the</strong>previous lemma. If p = 2, suppose fur<strong>the</strong>r that q ≥ 8. Then <strong>the</strong>re exist□


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 37infinitely many powers Q of q such that (A, Q) satisfies <strong>the</strong>se sameconditions.Proof. We first observe that <strong>the</strong> “moreover” part of Lemma 4.2, and<strong>the</strong> assumption that q ≥ 8 if p = 2, insures that q ≥ p ν . So <strong>the</strong> p-par<strong>to</strong>f <strong>the</strong> second condition is simply that A ≡ 1 mod p ν , and this willhold whatever power Q we take. The o<strong>the</strong>r conditions depend only onq mod LMN0 2 . As q is invertible mod LMN0 2 , we have q e ≡ 1 modLMN0 2 for some divisor e of φ(LMN0 2 ). Then every power Q := q 1+ne ,n ≥ 1 has Q ≡ q mod LMN0 2 .□Theorem 6.3. Given a prime-<strong>to</strong>-p integer A, suppose <strong>the</strong>re exists anF q /k with q > A 2 /4 such that (A, q) satisfies <strong>the</strong> conditions of Lemma6.1. If p = 2, suppose fur<strong>the</strong>r that q ≥ 8. Given a real number ɛ > 0,<strong>the</strong>re exists a real constant C(A, ɛ, M ord /k) such that whenever F Q /k isa finite extension with Q ≥ C(A, ɛ, M ord /k) such that (A, Q) satisfies<strong>the</strong> conditions of Lemma 6.1, <strong>the</strong>n we have <strong>the</strong> inequalitiesQ 1 2 −ɛ ≤ #M ord (A, Q) < Q 1 2 +ɛ .Proof. This is immediate from Lemma 6.1 and <strong>the</strong> Brauer-Siegel inequalities:<strong>the</strong> discriminant of Z[(F − 1)/N 0 ], for F relative <strong>to</strong> F Q , is(A 2 − 4Q)/N 2 0 , and A and N 0 are fixed while Q grows. □We now explain how <strong>to</strong> pass from estimates for F Q -points <strong>to</strong> estimatesfor closed points of norm Q, with given A. Denote by M ordclosed (A, Q)<strong>the</strong> set of closed points of norm Q giving rise <strong>to</strong> (A, Q), and byM ord (A, Q) prim ⊂ M ord (A, Q)<strong>the</strong> subset of those F Q -points which, viewed simply as points in M ord (F Q ),come from no proper subfield k ⊂ F Q1 F Q . As noted at <strong>the</strong> end ofsection 3, we have#M ordclosed(A, Q) = #M ord (A, Q) prim / log #k (Q).So our basic task is <strong>to</strong> estimate #M ord (A, Q) prim .Lemma 6.4. Let A be a prime <strong>to</strong> p integer, Q a prime power, and F q ⊂F Q a subfield. There exists a list, depending on (A, Q, q), of at mostsix integers a such that if E 0 /F q is an elliptic curve with #E 0 (F Q ) =Q + 1 − A, <strong>the</strong>n #E 0 (F q ) = q + 1 − a for some a on <strong>the</strong> list.Proof. Since A is prime <strong>to</strong> p, any such E 0 /F q becomes ordinary overF Q , so is already ordinary. Denote by n := deg(F Q /F q ), by F <strong>the</strong>Frobenius of E 0 ⊗ Fq F Q //F Q , and by F 0 <strong>the</strong> Frobenius of E 0 /F q . Wehave an inclusion of ordersZ[F ] ⊂ Z[F 0 ].


38 NICHOLAS M. KATZThese orders have <strong>the</strong> same fraction field K, and in K we have (F 0 ) n =F . But K is quadratic imaginary, so it contains at most 6 roots ofunity. So if F , a root of X 2 − AX + q in K, has any n’th roots in K, ithas at most 6, since <strong>the</strong> ratio of any two is a root of unity in K. Thelist is <strong>the</strong>n <strong>the</strong> list of traces, down <strong>to</strong> Q, of all <strong>the</strong> n’th roots of F . □In fact, we will need only <strong>the</strong> following standard fact, whose proofwe leave <strong>to</strong> <strong>the</strong> reader.Lemma 6.5. Let A be an integer, q a prime power, and Q = q 2 . IfE 0 /F q is an elliptic curve with #E 0 (F q 2) = q 2 +1−A, <strong>the</strong>n #E 0 (F q ) =q + 1 − a with a one of <strong>the</strong> two roots of X 2 − 2q = A.Theorem 6.6. Given a prime-<strong>to</strong>-p integer A, suppose <strong>the</strong>re exists anF q /k with q > A 2 /4 such that (A, q) satisfies <strong>the</strong> conditions of Lemma6.1. If p = 2, suppose fur<strong>the</strong>r that q ≥ 8. Given a real number ɛ > 0,<strong>the</strong>re exists a real constant C ′ (A, ɛ, M ord /k) such that whenever F Q /k isa finite extension with Q ≥ C ′ (A, ɛ, M ord /k) such that (A, Q) satisfies<strong>the</strong> conditions of Lemma 6.1, <strong>the</strong>n we have <strong>the</strong> inequalitiesQ 1 2 −ɛ ≤ #M ord (A, Q) prim < Q 1 2 +ɛ .Proof. The statement only gets harder as ɛ shrinks, so it suffices <strong>to</strong>treat <strong>the</strong> case when 0 < ɛ < 1/10. If <strong>the</strong> degree of F Q over k is odd,we will use only <strong>the</strong> trivial inequality#M ord (A, Q) − #M ord (A, Q) prim ≤ ∑k⊂F qF Q#M ord (F q ).Whatever <strong>the</strong> value of q, we have a uniform upper upper bound of <strong>the</strong>form#M ord (F q ) ≤ σq,for σ <strong>the</strong> sum of <strong>the</strong> Betti numbers of M ord ⊗ k k. But if deg(F Q /k) isodd, each of <strong>the</strong> at most log #k (Q) terms is at most σQ 1 3 , so this erroris, for large Q, negligeable with respect <strong>to</strong> Q 1 2 −ɛ .If <strong>the</strong> degree of F Q over k is even, we can still use <strong>the</strong> above crudeargument <strong>to</strong> take care of imprimitive points which come from a subfieldk ⊂ F q F Q with deg(F Q /F q ) ≥ 3.But we must be more careful about imprimitive points in #M ord (A, Q)which come from <strong>the</strong> subfield F q ⊂ F Q over which F Q is quadratic. IfX 2 − 2q = A has no integer solutions, <strong>the</strong>re are no such imprimitivepoints. If X 2 − 2q = A has integer solutions, say ±a, <strong>the</strong>n <strong>the</strong> numberof such imprimitive points in #M ord (A, Q) is#M ord (a, q) + #M ord (−a, q).


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 39If we take Q so large that √ Q is large enough for Theorem 6.3 <strong>to</strong> apply<strong>to</strong> <strong>the</strong> sets M ord (±a, q), <strong>the</strong>n <strong>the</strong>se sets have size at most Q 1 4 + ɛ 2 , againnegligeable with respect <strong>to</strong> Q 1 2 −ɛ .□Combining this with <strong>the</strong> identity#M ordclosed(A, Q) = #M ord (A, Q) prim / log #k (Q),and noting that log #k (Q) is negligeable with respect <strong>to</strong> Q ɛ , we get <strong>the</strong>following corollary.Corollary 6.7. Given a prime-<strong>to</strong>-p integer A, suppose <strong>the</strong>re exists anF q /k with q > A 2 /4 such that (A, q) satisfies <strong>the</strong> conditions of Lemma6.1. If p = 2, suppose fur<strong>the</strong>r that q ≥ 8. Given a real number ɛ >0, <strong>the</strong>re exists a real constant C ′′ (A, ɛ, M ord /k) such that wheneverF Q /k is a finite extension with Q ≥ C ′′ (A, ɛ, M ord /k) such that (A, Q)satisfies <strong>the</strong> conditions of Lemma 6.1, <strong>the</strong>n we have <strong>the</strong> inequalitiesQ 1 2 −ɛ ≤ #M ordclosed(A, Q) < #M ord (A, Q) < Q 1 2 +ɛ .To end this section, we interpret its results in terms of <strong>the</strong> mod NGalois images G N := ρ N (π 1 (M ord )) and <strong>the</strong>ir subsets G N (A, Q) ⊂ G Nintroduced in section 2.Theorem 6.8. Given a prime-<strong>to</strong>-p integer A, suppose that for <strong>the</strong>single value N := LMN 2 0 p ν , A mod N is <strong>the</strong> trace of some elemen<strong>to</strong>f G N . Then <strong>the</strong>re exist infinitely many closed points P of M ord withA P = A.Proof. By Chebotarev, every conjugacy class in G N is <strong>the</strong> image ofF rob P for infinitely many closed points P. In particular, every conjugacyclass in G N is <strong>the</strong> image of some F rob P with N(P) := Q ≥Max(A 2 /4, 8). By Lemma 4.2, we have Q ≥ p ν , and (A P , Q) satisfies<strong>the</strong> two conditions of that lemma, namely(1) X 2 − A P X + Q fac<strong>to</strong>rs completely mod L,(2) A P ≡ Q + 1 mod MN0 2 p ν .But A ≡ A P mod N, and hence (A, Q) satisfies <strong>the</strong>se same two conditions.The result now follows from Lemma 6.2 and Corollary 6.7,applied <strong>to</strong> (A, Q).□Similarly, we have <strong>the</strong> following result.Theorem 6.9. Given a prime-<strong>to</strong>-p integer A and a power q of #k withq ≥ Max(A 2 /4, 8), suppose that for <strong>the</strong> single value N := LMN 2 0 p ν ,<strong>the</strong> subset G N (A, q) ⊂ G N is nonempty. Then <strong>the</strong>re exist infinitelymany closed points P of M ord with A P = A and with N(P) ≡ q modLMN 2 0 .


40 NICHOLAS M. KATZProof. Pick an element γ in G N (A, q); its conjugacy class in G N is <strong>the</strong>image of F rob P for infinitely many closed points P, so is <strong>the</strong> image ofsome F rob P with N(P) := Q ≥ Max(A 2 /4, 8). Exactly as in <strong>the</strong> proofof <strong>the</strong> <strong>the</strong>orem above, Q ≥ p ν and (A P , Q) satisfies <strong>the</strong> two conditionsof Lemma 4.2. We write <strong>the</strong>se now as three conditions, breaking <strong>the</strong>second one in<strong>to</strong> a prime-<strong>to</strong>-p part and a p-part.(1) X 2 − A P X + Q fac<strong>to</strong>rs completely mod L,(2a) A P ≡ Q + 1 mod MN 2 0 .(2b) A P ≡ Q + 1 mod p ν .As F rob P lands in G N (A, Q), we have <strong>the</strong> congruences A ≡ A P modN and Q ≡ q mod LMN0 2 . Both Q and q are 0 mod p ν . Hence (A, Q)satisfies <strong>the</strong>se same conditions, and we conclude as above. □7. Exact and approximate determination of Galois imagesIf we take <strong>the</strong> inverse limit of <strong>the</strong> mod N representations as N growsmultiplicatively, we get a representationρ = ρ not p × ρ p ∞ : π 1 (M ord ) → GL(2, Ẑnot p) det in (#k) Ẑ × Z × p .Here Ẑnot p := ∏ l≠p Z l, and (#k)Ẑ is <strong>the</strong> closed subgroup of (Ẑnot p) ×profinitely generated by #k. Under this representation, <strong>the</strong> geometricfundamental group lands in SL(2, Ẑnot p) × Z × p .The following <strong>the</strong>orem is certainly well known <strong>to</strong> <strong>the</strong> specialists. Wegive a proof for lack of a suitable reference.Theorem 7.1. In suitable bases, <strong>the</strong> image groupρ(π 1 (M ord )) ⊂ GL(2, Ẑnot p) det in (#k) Ẑ × Z × pconsists of those elements which mod L have <strong>the</strong> shape (⋆, 0, ⋆, ⋆), modM have <strong>the</strong> shape (1, 0, ⋆, ⋆), mod N 0 have <strong>the</strong> shape (1, 0, 0, 1), mod p νhave <strong>the</strong> shape (1) (i.e., <strong>the</strong> p-component is 1 mod p ν ). The image of<strong>the</strong> geometric fundamental group,ρ(π geom1 (M ord )) ⊂ SL(2, Ẑnot p) × Z × pis just <strong>the</strong> intersection of ρ(π 1 (M ord )) with SL(2, Ẑnot p) × Z × p ,i.e., itconsists of those elements of SL(2, Ẑnot p)×Z × p with <strong>the</strong> imposed shapes.Proof. In a basis adapted <strong>to</strong> <strong>the</strong> imposed level structures, every elemen<strong>to</strong>f <strong>the</strong> image ρ(π 1 (M ord )) has <strong>the</strong> asserted shapes.To see this, denoteby K <strong>the</strong> function field of M ord , and by K an algebraic closure of K.Viewing η := Spec(K) as a geometric generic point of M ord , π 1 (M ord )


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 41viewed as π 1 (M ord , η) is a quotient of Aut(K/K), and ρ on Aut(K/K)is <strong>the</strong> action of this group on <strong>the</strong> profinite group∏T l (E univ (K)) = T not p (E univ (K)) × T p (E univ (K)).all primes lHere T not p (E univ (K)) is a free Ẑnot p-module of rank 2, and T p (E univ (K))is a free Z p -module of rank one. Then take any Z p -basis of T p , andany Ẑnot p-basis of T not p adapted <strong>to</strong> <strong>the</strong> imposed Γ 0 (L), Γ 1 (M), andΓ(N 0 )-structures. Then Aut(K/K) acts through elements of <strong>the</strong> assertedshape.We next explain that it suffices <strong>to</strong> show that <strong>the</strong> imageρ(π geom1 (M ord )) ⊂ SL(2, Ẑnot p) × Z × pis as asserted. For if F ∈ π 1 (M ord )) is any element of determinant #k,<strong>the</strong>n ρ(π 1 (M ord )) is <strong>the</strong> semidirect product of ρ(π geom1 (M ord )) with <strong>the</strong>Ẑ generated by F . [Such elements F exist: if M ord has a k-point, takeits Frobenius, o<strong>the</strong>rwise take <strong>the</strong> ratio of Frobenii at two closed pointswhose large degrees differ by one.] The key point is that ρ(F ) is anelement of GL(2, Ẑnot p) det in (#k) Ẑ × Z × p of <strong>the</strong> asserted shape. By <strong>the</strong>explicit description of ρ(π geom1 (M ord )), this semidirect product itselfhas <strong>the</strong> asserted description.That <strong>the</strong> imageρ(π geom1 (M ord )) ⊂ SL(2, Ẑnot p) × Z × pis as asserted is a geometric statement, so we may extend scalars fromk <strong>to</strong> k. Suppose first that ei<strong>the</strong>r M ≥ 4 or that N 0 ≥ 3. In that casewe can consider <strong>the</strong> moduli problem M 0 /k where we require Γ 0 (L),Γ 1 (M), and (oriented) Γ(N 0 )-structures, but no longer impose ei<strong>the</strong>rordinarity or any fur<strong>the</strong>r condition on p-power <strong>to</strong>rsion. Suppose weknow that for M ord0 , <strong>the</strong> imageρ(π geom1 (M ord0 )) ⊂ SL(2, Ẑnot p) × Z × pis as asserted. Then we argue as follows. By a fundamental <strong>the</strong>orem ofIgusa, cf. [Ig] and [K-M, 12.6.2], at any supersingular point s ∈ M 0 (k),<strong>the</strong> p-adic character ρ p ∞ restricted <strong>to</strong> <strong>the</strong> inertia group I s at s haslargest possible image:ρ p ∞(I s ) = Z × p .Therefore <strong>the</strong> covering M ord → M ord0 is finite étale galois, with group(Z/p ν Z) × . So ρ(π geom1 (M ord )) ⊂ ρ(π geom1 (M ord0 )) is an open subgroupof index φ(p ν ). But ρ(π geom1 (M ord )) lies in <strong>the</strong> group it is asserted <strong>to</strong> be,


42 NICHOLAS M. KATZand that group has <strong>the</strong> same index φ(p ν ) in <strong>the</strong> known ρ(π geom1 (M ord0 )).So we get <strong>the</strong> asserted description of ρ(π geom1 (M ord )).We now show that for M ord0 , <strong>the</strong> imageρ(π geom1 (M ord0 )) ⊂ SL(2, Ẑnot p) × Z × pis as asserted. By Igusa’s <strong>the</strong>orem, at any supersingular point of M 0 ,ρ p ∞(I s ) = Z × p . But <strong>the</strong> representation ρ not p is everywhere unramifiedon M 0 , so ρ(I s ) = {1} × Z × p in <strong>the</strong> product SL(2, Ẑnot p) × Z × p . So weare reduced <strong>to</strong> showing that <strong>the</strong> imageρ not p (π geom1 (M ord0 )) ⊂ SL(2, Ẑnot p)is as asserted. This follows from <strong>the</strong> tame specialization <strong>the</strong>orem[Ka-ESDE, 8.17.14] and <strong>the</strong> corresponding result over C. The modulischeme M 0 is one geometric fibre of <strong>the</strong> corresponding modulischeme M 0 over /Z[ζ N0 ][1/LMN 0 ], which one knows has a propersmooth compactification M 0 over Z[ζ N0 ][1/LMN 0 ] with “infinity” adivisor which is finite étale over Z[ζ N0 ][1/LMN 0 ]. Extend scalars fromZ[ζ N0 ][1/LMN 0 ] <strong>to</strong> <strong>the</strong> Witt vec<strong>to</strong>rs W (k). On this scheme M 0 /W (k),<strong>the</strong> lisse Ẑnot p-sheaf which “is” ρ not p is (au<strong>to</strong>matically) tamely ramifiedalong “infinity”, so by <strong>the</strong> tame specialization <strong>the</strong>orem its geometricmonodromy is <strong>the</strong> same on <strong>the</strong> special fibre as on <strong>the</strong> geometricpoint over <strong>the</strong> generic fibre gotten by choosing (!) an embedding ofW (k) ⊂ C.It remains <strong>to</strong> show that <strong>the</strong> imageρ(π geom1 (M ord )) ⊂ SL(2, Ẑnot p) × Z × pis as asserted in <strong>the</strong> general case, i.e., without <strong>the</strong> assumption thatei<strong>the</strong>r M ≥ 4 or N 0 ≥ 3. To treat this case, pick two distinct primes l 1and l 2 , both of which are odd and prime <strong>to</strong> LMN 0 p. Consider <strong>the</strong> moduliproblems M ord1 , respectively M ord2 , over k where in addition <strong>to</strong> imposingan M structure we impose also an oriented Γ(l 1 )-structure, resp.an oriented Γ(l 2 )-structure. The two groups ρ(π geom1 (M ordi )), i = 1, 2,are <strong>the</strong>n known. Both are subgroups of ρ(π geom1 (M ordi )), and <strong>to</strong>ge<strong>the</strong>r<strong>the</strong>y visibly generate <strong>the</strong> asserted candidate forρ(π geom1 (M ordi )). □Using this result, one can say something, again well known <strong>to</strong> <strong>the</strong>specialists, in <strong>the</strong> case of general families.Theorem 7.2. Let E/U/k be a family of fibrewise ordinary ellipticcurves with nonconstant j-invariant over a base curve U/k, k a finitefield, which is smooth and geometrically connected.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 43(1) The image ρ(π geom1 (U)) is open in SL(2, Ẑnot p) × Z × p ; <strong>the</strong>reexists an integer D = D 0 p ν , D 0 prime <strong>to</strong> p and ν ≥ 0, suchthat ρ(π geom1 (U)) contains <strong>the</strong> subgroupKer(SL(2, Ẑnot p) × Z × p → SL(2, Z/D 0 Z) × (Z/p ν Z) × )which is <strong>the</strong> kernel of reduction mod (D 0 , p ν ).(2) Denote by G geomD⊂ SL(2, Z/D 0 Z)×(Z/p ν Z) × <strong>the</strong> mod D imageρ D (π geom1 (U)), and denote by G D ⊂ GL(2, Z/D 0 Z) × (Z/p ν Z) ×<strong>the</strong> mod D image ρ D (π 1 (U)). Then G geomDis a normal subgroupof G D , and <strong>the</strong> quotient is cyclic, generated by <strong>the</strong> image ρ D (F )of any element F ∈ π 1 (U)) such that ρ not p (F ) has determinant#k.(3) An element γ 0 ∈ SL(2, Ẑnot p) × Z × p lies in ρ(π geom1 (U)) if andonly if mod D it lies in G geomD .(4) Suppose given an element γ ∈ GL(2, Ẑnot p) det in (#k) Ẑ × Z × p ,with det(γ not p ) = (#k) n , n ∈ Ẑ. This element lies in ρ(π 1(U))if and only if γ mod D lies <strong>the</strong> coset ρ D (F n )G geomDof G D .Proof. It suffices <strong>to</strong> prove (1). For (2) is universally true, and (1)and (2) <strong>to</strong>ge<strong>the</strong>r imply (3). For (4), <strong>the</strong> condition given is obviouslynecessary; applying (3) <strong>to</strong> F −n γ we see that it is sufficient.To prove (1) we argue as follows. The assertion is geometric, so wemay extend scalars from k <strong>to</strong> k. It suffices <strong>to</strong> prove it for some finiteétale cover U 1 of U, since π 1 (U 1 ) is a subgroup of π 1 (U). So we mayassume choose an odd prime l ≠ p and reduce <strong>to</strong> <strong>the</strong> case when E/Uhas an oriented Γ(l)-structure. Then we have a classifying map U →M ord (l), for M(l) <strong>the</strong> Γ(l) modular curve. This map is nonconstant,because our family has nonconstant j-invariant. Therefore <strong>the</strong> imageof π 1 (U) in π 1 (M ord (l)) is a closed subgroup of finite index, hence anopen subgroup of finite index, in π 1 (M ord (l)), <strong>to</strong> which we apply <strong>the</strong><strong>the</strong>orem.□Given E/U/k as in <strong>the</strong> above <strong>the</strong>orem, fibrewise ordinary with nonconstantj-invariant, we say that any integer D for which part (1) of <strong>the</strong>corollary holds is a modulus for E/U/k. Of course if D is a modulus,so is any multiple of D.8. Gekeler’s product formulaTo motivate this section, we first explain <strong>the</strong> heuristic which underliesit. Given a family E/U/F q with nonconstant j-invariant, we know<strong>the</strong> Sa<strong>to</strong>-Tate conjecture for this family, cf. Deligne [De-WeilII, 3.5.7].Given a finite extension F Q /F q , attached <strong>to</strong> each point u ∈ U(F Q ) is an


44 NICHOLAS M. KATZelliptic curve E u,FQ /F Q , which gives rise <strong>to</strong> data (A u,FQ , Q), which inturn gives rise <strong>to</strong> <strong>the</strong> real number t u,FQ := A u,FQ /(2 √ Q) ∈ [−1, 1]. TheSa<strong>to</strong>-Tate <strong>the</strong>orem says that as Q grows, <strong>the</strong>se #U(F Q ) real numberst u,FQ ∈ [−1, 1] become equidistributed for <strong>the</strong> measure 2 (√ 1 − tπ 2 )d<strong>to</strong>n [−1, 1]. This means that∫for any continuous C-valued function f on[−1, 1], we can compute 2 1 f(t)(√ 1 − tπ −1 2 )dt as <strong>the</strong> large Q limit of(1/#U(F Q )) ∑ u∈U(F f(t q) u,F Q). For a fixed Q, we make <strong>the</strong> change ofvariable t = A/(2 √ Q), so <strong>the</strong> integral becomes2π∫ 2√ Q−2 √ Qf(A/(2 √ Q))( √ 1 − A 2 /4Q)dA/(2 √ Q) =∫ 2√ Q= 2 1f(A/(2 √ Q))( √ 4Q − Aπ 4Q2 )dA.−2 √ QAnd <strong>the</strong> approximating sum is∑(1/#U(F Q )) f(A u,FQ /(2 √ Q)).u∈U(F q)Since <strong>the</strong>re are “about” Q points in U(F Q ), it is “as though” a giveninteger A ∈ [−2 √ Q, 2 √ Q] occurs as an A u,FQ for “about”1 √4Q − A22πof <strong>the</strong> u ∈ U(F q ); this is <strong>the</strong> Sa<strong>to</strong>-Tate heuristic.We have already discussed how congruence obstructions can preventsome particular (A, Q) from occurring at all in a given family. TheGekeler product formula says that, at least in certain modular families,whenever (A, Q) is ordinary and has no archimedean or congruenceobstruction, we can use congruence considerations <strong>to</strong> compute <strong>the</strong> ratio#{u ∈ U(F Q )|A u,FQ = A}√ .4Q − A212πThe pro<strong>to</strong>typical example of computing such a ratio by “congruenceconsiderations” is Dirichlet’s class number formula for a quadatic imaginaryfield K:L(1, χ) =2πh K√w K |dK | .Here h K = h(O K ) is <strong>the</strong> class number, w K = #O × Kis <strong>the</strong> order of <strong>the</strong>group of roots of unity in K, and d K is <strong>the</strong> discriminant of O K . So interms of <strong>the</strong> normalized class numberh ⋆ K := h K /w K


<strong>the</strong> formula reads<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 45L(1, χ) =12πh ⋆ K√|dK | .In this example, <strong>the</strong> ratio is <strong>the</strong> value L(1, χ), and “congruence considerations”give <strong>the</strong> Euler fac<strong>to</strong>rs, whose conditionally convergent productis L(1, χ). Indeed, it is precisely this class number formula whichunderlies Gekeler’s, as we will see.We return now <strong>to</strong> a family E/U/F q with nonconstant j-invariant.Fix data (A, Q) with A prime <strong>to</strong> p, |A| < 2 √ Q, and no congruenceobstruction. We introduced, for every integer N ≥ 2 <strong>the</strong> finite groupsG N , <strong>the</strong>ir normal subgroups G geomN⊳ G N , <strong>the</strong> cosetsG N,det=Q ⊂ G N ,and <strong>the</strong>ir subsetsG N (A, Q) ⊂ G N,det=Q ,whose cardinalities were denoted g N (A, Q) and g N,det=Q . We also introduced<strong>the</strong> rational number∑g N (avg, Q) = (1/N)g N,det=Q = (1/N) g N (A, Q).Gekeler’s idea is <strong>to</strong> consider <strong>the</strong> ratiosA mod Ng N (A, Q)/g N (avg, Q) = Ng N (A, Q)/g N,det=Q ,<strong>to</strong> show <strong>the</strong>y have a “large N limit’, and <strong>the</strong>n <strong>to</strong> show that this limitis <strong>the</strong> ratio#{u ∈ U(F Q )|A u,FQ = A}12π√4Q − A2Let us be more precise. We have <strong>the</strong> following elementary lemma.Lemma 8.1. Let D be a modulus for E/U/F q . Suppose given (A, Q)with Q a power of #k and A an integer prime <strong>to</strong> p with A 2 < 4Q.Suppose that (A, Q) has no congruence obstruction. Suppose N ≥ 2and M ≥ 2 are relatively prime. Suppose fur<strong>the</strong>r that N is relativelyprime <strong>to</strong> D.(1) Under <strong>the</strong> “reduction mod NM” map we have an isomorphismof groupsG geomNM(2) We have a bijection of cosetsG NM,det=Q(3) We have a bijection of setsG NM (A, Q)∼−→ G geomN× G geomM .∼−→ G N,det=Q × G M,det=Q .∼−→ G N (A, Q) × G M (A, Q)..


46 NICHOLAS M. KATZ(4) For a prime number l prime <strong>to</strong> pD and an integer n ≥ 1, wehave= SL(2, Z/l n Z),G geoml nG l n ,det=Q = {γ ∈ GL(2, Z/l n Z)| det(γ) = Q},G l n(A, Q) = {γ ∈ GL(2, Z/l n Z)| det(γ) = Q, Trace(γ) = A}.(5) If <strong>the</strong> characteristic p is prime <strong>to</strong> D, <strong>the</strong>n for any n ≥ 1, wehaveG geomp = n (Z/pn Z) × ,G p n ,det=Q = (Z/p n Z) × ,G p n(A, Q) = {α ∈ (Z/p n Z) × | α ≡ unit Q (A) mod p n }.Proof. Assertions (1), (4) and (5) result from Theorem 7.2. Assertion(1) implies (2) by <strong>the</strong> definition of G N,det=Q as a coset, and (2) implies(3) trivially. □Remark 8.2. In <strong>the</strong> case of any of <strong>the</strong> moduli problems we have considered,Theorem 7.1 shows that <strong>the</strong> image groupρ(π geom1 (M ord )) ⊂ SL(2, Ẑnot p) × Z × p = ∏ l≠pSL(2, Z l ) × Z × pis <strong>the</strong> product over all primes l, including l = p, of <strong>the</strong> images of<strong>the</strong> separate l-adic representations. So for (<strong>the</strong> universal families over)<strong>the</strong>se modular curves, assertions (1), (2) and (3) of Lemma 8.1 hold forevery pair (N, M) of relatively prime integers.Gekeler proves that for a prime number l prime <strong>to</strong> pD, <strong>the</strong> sequenceof ratiosl n g l n(A, Q)/g l n ,det=Q, n ≥ 1,i.e., <strong>the</strong> sequence of ratiosl n #{γ ∈ GL(2, Z/l n Z)| Trace(γ) = A, det(γ) = Q}#SL(2, Z/l n Z)becomes constant for large n, and computes this constant explicitly[Ge, Thm. 4.4], calling it ν l (A, Q). He also shows that so long as l is,in addition, ei<strong>the</strong>r prime <strong>to</strong> A 2 − 4Q or split in K := Q( √ A 2 − 4Q),<strong>the</strong>nν l (A, Q) = 1/(1 − χ(l)l )is <strong>the</strong> Euler fac<strong>to</strong>r at l in L(1, χ) for χ <strong>the</strong> quadratic character attached<strong>to</strong> <strong>the</strong> field K. And if <strong>the</strong> characteristic p does not divide D, it isimmediate from <strong>the</strong> definitions that <strong>the</strong> sequencep n g p n(A, Q)/g p n ,det=Q, n ≥ 1,


is constant, with value<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 47(p n × 1)/φ(p n ) = 1/(1 − 1 p ),<strong>the</strong> Euler fac<strong>to</strong>r at p of <strong>the</strong> same L(1, χ).To go fur<strong>the</strong>r, we must define a reasonable fac<strong>to</strong>r “at D”. It should betrue that for any sequence of positive integers M n such that M n |M n+1and such that D n |M n for all n ≥ 1, <strong>the</strong> sequence of ratiosM n g Mn (A, Q)/g Mn,det=Q, n ≥ 1,becomes constant for large n, and that this constant is independent of<strong>the</strong> particular choice of <strong>the</strong> sequence of M n as above. If this is true, wewould call this constant ν D (A, Q). [For <strong>the</strong> moduli problems we havebeen considering, <strong>the</strong> problem of defining <strong>the</strong> fac<strong>to</strong>r “at D” is reduced<strong>to</strong> <strong>the</strong> problem of defining <strong>the</strong> fac<strong>to</strong>r separately at each prime dividingD, cf. Remark 8.2.] The most optimistic conjecture would <strong>the</strong>n be <strong>the</strong>following.Conjecture 8.3. Let E/U/F q be fibrewise ordinary with nonconstantj-invariant, D a modulus. Suppose that U = U max . If (A, Q) hasnei<strong>the</strong>r archimedean nor congruence obstruction, <strong>the</strong>n#{u ∈ U(F Q )|A u,FQ = A}√ = ν1D (A, Q) ∏ ν l (A, Q),2π 4Q − A2l∤Dwhere <strong>the</strong> conditionally convergent product is defined by∏∏ν l (A, Q) := lim ν l (A, Q).l∤DX→∞l


48 NICHOLAS M. KATZcurve V/F q and a finite flat F q -map, f : V → U, such that <strong>the</strong> inducedmaps of fundamental groupsf ⋆ : π geom1 (V ) → π geom1 (U), f ⋆ : π 1 (V ) → π 1 (U)are both surjective. Now consider <strong>the</strong> pullback E V /V/F q of E/U/F q by<strong>the</strong> map f. This pullback family also has V = V max (because supersingularpoints, resp. points of potentially multiplicative reduction downstairshave <strong>the</strong>ir inverse images upstairs supersingular, resp. pointsof potentially multiplicative reduction). Since <strong>the</strong> pullback family has<strong>the</strong> same galois image data, <strong>the</strong> notion of congruence obstruction is<strong>the</strong> same for <strong>the</strong> original family and for its pullback. So <strong>the</strong> conjecturepredicts that for each (A, Q) with no congruence obstruction, we have#{u ∈ U(F Q )|A u,FQ = A} = #{v ∈ V (F Q )|A v,FQ = A}.Fixing Q and summing over <strong>the</strong> allowed A, which are <strong>the</strong> same upstairsand down, we get <strong>the</strong> equality#U(F Q ) = #V (F Q ).As we will recall below, one condition that forces f ⋆ <strong>to</strong> be surjectiveon fundamental groups is that it be fully ramified over some ¯F q -point,say over u 0 ∈ U( ¯F q ), with unique point v 0 ∈ V ( ¯F q ) lying over u 0 . Givensuch an f, we are <strong>to</strong> have#U(F Q ) = #V (F Q ).But of course this is nonsense in general. Here is <strong>the</strong> simplest example.Work over a prime field F p with p ≡ 1 mod 3, pick a cuberoot of unity, and take for E/U/F p <strong>the</strong> universal family for <strong>the</strong> moduliproblem of oriented Γ(3) structures. Here <strong>the</strong> modular curve M Γ(3) isP 1 \ {µ 3 , ∞}, with parameter µ and universal familyX 3 + Y 3 + Z 3 = 3µXY Z.In this family, we have multiplicative reduction at each missing point.And while <strong>the</strong>re are p−1 supersingular points over ¯F p , no supersingularpoint is F p -rational (simply because over F p with p ≥ 5, supersingularpoints have A = 0, whereas our A’s satisfy A ≡ p + 1 mod 9, so arecertainly nonzereo). ThusM ordΓ(3)(F p ) = F p \ {µ 3 }has p − 3 points. Now consider <strong>the</strong> pullback family by <strong>the</strong> doublecovering “square root of µ” of U := P 1 \ {µ 3 , ∞} by V := P 1 \ {µ 6 , ∞}.Explicitly, this is <strong>the</strong> familyX 3 + Y 3 + Z 3 = 3µ 2 XY Z.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 49HereV (F p ) = F p \ {µ 6 }has p − 6 points.Here is <strong>the</strong> precise surjectivity statement used above, whose proofwe owe <strong>to</strong> Deligne. It is not as well known as it should be.Lemma 8.4. Let X and Y be connected, locally noe<strong>the</strong>rian schemes,and f : X → Y a morphism which is proper and flat. Suppose that forsome geometric point y of Y , say with values in <strong>the</strong> algebraically closedfield K, <strong>the</strong> fibre X y (K) := f −1 (y)(K) consists of a single K-valuedpoint x ∈ X(K). Then <strong>the</strong> map of fundamental groupsis surjective.f ⋆ : π 1 (X, x) → π 1 (Y, y)Proof. For any X-scheme h : Z → X, we can also view Z as a Y -scheme, by f ◦ h. So we have <strong>the</strong> fibres Z x (K) := h −1 (x)(K) andZ y (K) := (f ◦ h) −1 (y)(K). The hypo<strong>the</strong>sis that f −1 (y)(K) = x insuresthat Z x (K) = Z y (K).Let g : E → Y be finite étale of some degree d ≥ 1, with E connected,and denote by g X : E X → X its pullback <strong>to</strong> a finite étale cover of X,of <strong>the</strong> same degree d. We must show that E X remains connected. Letj : Z ⊂ E X be <strong>the</strong> inclusion of a connected component Z of E X , andW := f E (Z) ⊂ E its image in E. As f is proper and flat, so is f E . AsZ is both open and closed in E X , its image W := f E (Z) ⊂ E is bothclosed and open in E, and hence W = E.Z⏐↓jf E |Z−−−→ W⏐↓ =E X⏐↓ g Xf E−−−→ E⏐fX −−−→ YSo <strong>the</strong> fibre W y (K) of W over y consists of d points. But Z maps on<strong>to</strong>W , hence, viewing Z as a Y -scheme (by f ◦g X ◦j) and f E |Z : Z → W asa Y -morphism , Z y (K) maps on<strong>to</strong> W y (K). Therefore Z y (K) consists ofat least d points. But Z x (K) = Z y (K), as noted above, hence Z x (K)has at least d points. But <strong>the</strong> entire fibre (E X ) x (K) has d points.Therefore Z and E X have <strong>the</strong> same fibre over x; as both are finite étaleover X and Z ⊂ E X , we conclude that Z = E X . Thus E X is connected,as required.□↓ g


50 NICHOLAS M. KATZDespite <strong>the</strong> failure of <strong>the</strong> conjecture above in general, <strong>the</strong>re are certainmodular families of elliptic curves for which <strong>the</strong> conjecture holds.Theorem 8.5. (Gekeler) The conjecture is true for (<strong>the</strong> universal fibrewiseordinary families over) <strong>the</strong> Igusa curves Ig(p ν )/k/F p , any p ν ≥ 4.Proof. For this family, Theorem 7.1 tells us thatandρ(π geom1 (Ig(p ν ))) = SL(2, Ẑnot p) × (1 + p ν Z p ),ρ(π 1 (Ig(p ν )) = GL(2, Ẑnot p) det in (#k) Ẑ × (1 + p ν Z p ).So here we can take D = p ν . Take an (A, Q) with nei<strong>the</strong>r congruencenor archimedean obstruction. Then A is prime <strong>to</strong> p, unit Q (A) ≡ 1 modp ν , and A 2 < 4Q. For l ≠ p, <strong>the</strong> local fac<strong>to</strong>r is Gekeler’s ν l (A, Q).What about <strong>the</strong> fac<strong>to</strong>r ν p ν(A, Q) at D = p ν ? For any integer n ≥ ν,g p n(A, Q) is <strong>the</strong> ratiop n #{γ ∈ ((1 + p ν Z p )/(1 + p n Z p ))| γ ≡ unit Q (A) mod p n },#((1 + p ν Z p )/(1 + p n Z p ))which is just = p n × 1/p n−ν = p ν , which in turn isand henceφ(p ν ) × (1 − 1/p) −1 ,ν p ν(A, Q) = φ(p ν ) × (1 − 1/p) −1 .Gekeler proves [Ge, Cor. 5.4] (remember his H ⋆ is twice ours and hewrites H ⋆ (A 2 − 4Q) for H ⋆ (Z[F ])) that(1 − 1/p) −1 ∏ l≠pν l (A, Q) = 2πH⋆ (Z[F ])√4Q − A2 .Hence we haveν p ν(A, Q) ∏ l≠pν l (A, Q) = φ(pν )H ⋆ (Z[F ])√ .4Q − A212πBut <strong>the</strong> numera<strong>to</strong>r φ(p ν )H ⋆ (Z[F ]) is precisely#{u ∈ Ig(p v )(F Q )|A u,FQ = A},cf. Lemma 4.3, (1).□Here is a slight generalization of Gekeler’s result.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 51Theorem 8.6. Let N = N 0 p ν with N 0 prime <strong>to</strong> p. Supppose thatei<strong>the</strong>r N 0 ≥ 3 or that p ν ≥ 4. Let k/F p be a finite field containinga chosen primitive N 0 ’th root of unity. Let E univ /M ord /k be <strong>the</strong> universalfamily of ordinary elliptic curves endowed with both an orientedΓ(N 0 )-structure and an Ig(p ν )-structure. The conjecture is true forthis family.Proof. Suppose (A, Q) has nei<strong>the</strong>r archimedean nor congruence obstruction.Our first task is <strong>to</strong> compute, for this family, <strong>the</strong> local fac<strong>to</strong>rsν l (A, Q, M ord ) at <strong>the</strong> primes l dividing pN 0 .Suppose we have done this. Then we proceed as follows. We haveseen in Lemma 4.3, (1) and (4), that#{u ∈ M ord (F Q )|A u,FQ = A}= φ(p ν )#SL(2, Z/N 0 Z)H ⋆ (Z[(F − 1)/N 0 ]).So what we must show is thatφ(p ν )#SL(2, Z/N 0 Z)H ⋆ (Z[(F − 1)/N 0 ])√4Q − A2= ( ∏12πThe fac<strong>to</strong>r ν p ν(A, Q, M ord ) at p isl|pN 0ν l (A, Q, M ord ))( ∏ l∤N 0ν l (A, Q)).ν p ν(A, Q, M ord ) = φ(p ν ) × (1 − 1/p) −1 ,exactly as in <strong>the</strong> proof of <strong>the</strong> previous <strong>the</strong>orem.<strong>the</strong>orem, we have(1 − 1/p) −1 ∏ l≠pν l (A, Q) = H⋆ (Z[F ])√ .4Q − A212πAccording <strong>to</strong> thatComparing <strong>the</strong>se two formulas, what we must show is that∏ ν l (A, Q, M ord )= #SL(2, Z/N 0 Z) H⋆ (Z[(F − 1)/N 0 ]).ν l (A, Q)H ⋆ (Z[F ])l|N 0We next express <strong>the</strong> right hand side as a product over <strong>the</strong> primesdividing N 0 . Fac<strong>to</strong>ring N 0 = ∏ l n ii , we have#SL(2, Z/N 0 Z) = ∏#SL(2, Z/l n ii Z).l i |N 0We can fac<strong>to</strong>r <strong>the</strong> ratioH ⋆ (Z[(F − 1)/N 0 ])H ⋆ (Z[F ])


52 NICHOLAS M. KATZas follows. Denote by K <strong>the</strong> fraction field of Z[F ], O K its ring ofintegers, χ K <strong>the</strong> quadratic character corresponding <strong>to</strong> K/Q, φ K <strong>the</strong>multiplicative function introduced in <strong>the</strong> proof of Theorem 5.1, andWe have <strong>the</strong> formulasf := <strong>the</strong> conduc<strong>to</strong>r of <strong>the</strong> order Z[(F − 1)/N 0 ].H ⋆ (Z[(F − 1)/N 0 ]) = ∑ φ K (d)h ⋆ (O K ),d|fH ⋆ (Z[F ]) = ∑φ K (d)h ⋆ (O K ).d|fN 0Because φ K is multiplicative, we have <strong>the</strong> formulas∑φ K (d) = ∏ ∑φ K (l a ),d|fl|f a≥0, l a |f∑φ K (d) = ∏ ∑φ K (l a ).d|fN 0 l|fN o a≥0, l a |fN 0In <strong>the</strong>se two products, <strong>the</strong> primes l not dividing N 0 give rise <strong>to</strong> <strong>the</strong>same fac<strong>to</strong>r in each. So we getH ⋆ (Z[(F − 1)/N 0 ])H ⋆ (Z[F ])= ∏ ∑a≥0, l a |f φ K(l a )l|N 0∑a≥0, l a |fN 0φ K (l a ) .So we are reduced <strong>to</strong> showing that for each prime l i dividing N 0 = ∏ l n iiwe haveν li (A, Q, M ord )ν li (A, Q)∑= #SL(2, Z/l n a≥0, lii Z)a i |f φ K(l a i )∑a≥0, l a i |fN φ 0K(l a i ).Fix one such l i := l, and put n := ord l (N 0 ), δ := ord l (f). Thusn + δ = ord l (fN 0 ). We now compute explicitly everything in sight,using <strong>the</strong> results [Ge, Thm.4.4] of Gekeler. To state <strong>the</strong>m, we firstestablish a bit of notation. Given an arbitrary pair (A 1 , Q 1 ) of integerswith A 2 1 − 4Q 1 < 0, we wish <strong>to</strong> count <strong>the</strong> number α l k(A 1 , Q 1 ) of 2 × 2matrices X ∈ M 2 (Z/l k Z) with Trace(X) = A 1 and det(X) = Q 1 , atleast for k large. Attached <strong>to</strong> (A 1 , Q 1 ) we have <strong>the</strong> quadratic imaginaryorderZ[F 1 ] := Z[T ]/(T 2 − A 1 T + Q 1 ),


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 53its fraction field K 1 , ring of integers O K1 , and Dirichlet character χ K1 .We denote by f A1 ,Q 1<strong>the</strong> conduc<strong>to</strong>r of <strong>the</strong> order Z[F 1 ], and we defineδ 1 := ord l (f A1 ,Q 1).Gekeler shows that for k ≥ 2δ 1 + 2, α l k(A 1 , Q 1 ) is equal <strong>to</strong>l 2k + l 2k−1 , if χ K1 (l) = 1,l 2k + l 2k−1 − (l + 1)l 2k−δ 1−2 , if χ K1 (l) = 0,l 2k + l 2k−1 − 2l 2k−δ 1−1 , if χ K1 (l) = −1.We apply this first <strong>to</strong> (A, Q). Then K 1 is just K, f A,Q = N 0 f, andδ 1 = δ + n. So for large k, <strong>the</strong> fac<strong>to</strong>rν l k(A, Q) := l k α l k(A, Q)/#SL(2, Z/l k Z) = α l k(A, Q)/l 2k−2 (l 2 − 1)is easily calculated, as α l k(A, Q) is equal <strong>to</strong>l 2k + l 2k−1 , if χ K (l) = 1,l 2k + l 2k−1 − (l + 1)l 2k−δ−n−2 , if χ K (l) = 0,l 2k + l 2k−1 − 2l 2k−δ−n−1 , if χ K (l) = −1.We next calculate <strong>the</strong> fac<strong>to</strong>r ν l k+2n(A, Q, M ord ). Here <strong>the</strong> groupG l k+2n is <strong>the</strong> group of matrices of <strong>the</strong> shape 1+l n X, X ∈ M 2 (Z/l n+k Z),and G l ,det=Q, being a coset of G geom = {1+l n X}∩SL(2, Z/l 2n+k Z),k+2n l k+2nhas#G l k+2n ,det=Q = l 3(k+n) .How do we compute <strong>the</strong> number of X ∈ M 2 (Z/l n+k Z) such that 1+l n Xhas trace A and determinant Q mod l k+2n ? These conditions are2 + l n Trace(X) ≡ A mod l k+2n ,1 + l n Trace(X) + l 2n det(X) ≡ Q mod l k+2n .Because (A, Q) has no congruence obstruction, we know thatA ≡ Q + 1 mod l 2n ,Q ≡ 1 mod l n .Thus <strong>the</strong> conditions on X areTrace(X) ≡ (A − 2)/l n mod l k+n ,det(X) ≡ (Q + 1 − A)/l 2n mod l k .To count <strong>the</strong>se, we first consider X k mod l k , satisfying <strong>the</strong> above twoconditions mod l k . The number of such X k mod l k iswithα l k(A 1 , Q 1 ),A 1 := (A − 2)/l n , Q 1 := (Q + 1 − A)/l 2n .


54 NICHOLAS M. KATZOnce we have such an X k mod l k , we lift it arbitrarily <strong>to</strong> some X k+nmod l k+n ; <strong>the</strong>n can correct this lift by adding <strong>to</strong> it anything of <strong>the</strong> forml k Y , Y mod l n , so long as Y has <strong>the</strong> required trace mod l n , namelyTrace(Y ) ≡ (A 1 − Trace(X k+n ))/l k mod l n .So <strong>the</strong>re are l 3n possible Y , and hence <strong>the</strong> number of elements inG l k+2n ,det=Q with trace A and determinant Q isThus we getl 3n α l k(A 1 , Q 1 ).ν l k+2n(A, Q, M ord ) = l k+2n l 3n α l k(A 1 , Q 1 )/l 3(k+n) .For this (A 1 , Q 1 ), Z[T ]/(T 2 − A T + Q 1 ) is just Z[(F − 1)/l n ], and soits δ 1 is just δ, <strong>the</strong> ord l of <strong>the</strong> conduc<strong>to</strong>r of Z[(F − 1)/N 0 ]. Also its K 1is just K. So for k ≥ 2δ + 2, α l k(A 1 , Q 1 ) is given by Gekeler’s formulasl 2k + l 2k−1 , if χ K (l) = 1,l 2k + l 2k−1 − (l + 1)l 2k−δ−2 , if χ K (l) = 0,l 2k + l 2k−1 − 2l 2k−δ−1 , if χ K (l) = −1.With this data at hand, it is straightforward but unenlightening<strong>to</strong> verify, case by case depending on <strong>the</strong> value of χ K (l), <strong>the</strong> requiredidentityν l (A, Q, M ord )ν l (A, Q)∑ δ= #SL(2, Z/l n a=0Z)φ K(l a )∑ δ+na=0 φ K(l a ) . □Remark 8.7. Perhaps with a more conceptual approach, one couldalso verify <strong>the</strong> conjecture for <strong>the</strong> more general moduli problems weconsidered, where we allow also a Γ 0 (L) structure and a Γ 1 (M) structure.What is <strong>the</strong> relation of <strong>the</strong> conjectured formula <strong>to</strong> <strong>the</strong> formula, interms of orbital integrals, given by Kottwitz in [Ko1, &16, pp. 432-433]and [Ko2, p. 205], when that general formula is specialized <strong>to</strong> <strong>the</strong> caseof elliptic curves, cf. also [Cl, &3,&4]?Question 8.8. As explained above, <strong>the</strong> conjecture is false in general,because of its incompatibility with pullback by a map which is surjectiveon fundamental groups. None<strong>the</strong>less, one could ask if <strong>the</strong> followingconsequence of it is asymp<strong>to</strong>tically correct. Take one of <strong>the</strong>moduli problems M ord /k we have discussed above, and take a finiteflat f : V → M ord , with V/k smooth and geometrically connected,such that f ⋆ is surjective on fundamental groups (e.g., an f which isfully ramified over some point). Fix a prime-<strong>to</strong>-p integer A, and an


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 55extension F q /k with q ≥ 8 and A 2 < 4q. We know precisely what<strong>the</strong> congruence obstructions for (A, q) are, and that, if <strong>the</strong>re are none,<strong>the</strong>n <strong>the</strong>re are F q -valued points of M ord whose Frobenii have trace A,cf. Lemmas 4.2, 4.3. We fur<strong>the</strong>r know that if (A, q) has no congruenceobstruction, <strong>the</strong>n <strong>the</strong>re are infinitely many extensions F Q /k for which(A, Q) has no congruence obstruction, and for each of <strong>the</strong>se <strong>the</strong>re areF Q -valued points of M ord whose Frobenii have trace A, cf. Lemma 6.2.For each such Q, consider <strong>the</strong> ratio#{v ∈ V (F Q )|A v,FQ = A}#{u ∈ M ord (F Q )|A u,FQ = A} .Is it true that this ratio tends <strong>to</strong> 1 as Q tends archimedeanly <strong>to</strong> infinityover <strong>the</strong> Q’s for which <strong>the</strong> denomina<strong>to</strong>r is nonzero?References[Ba] Baier, Stephan The Lang-Trotter conjecture on average. J. Ramanujan Math.Soc. 22 (2007), no. 4, 299-314.[B-K] Bombieri, E., and Katz, N., A Note on Lower bounds for Frobenius traces,2008 preprint available at www.math.prince<strong>to</strong>n.edu/∼nmk.[Cl] Clozel, Laurent, Nombre de points des variétés de Shimura sur un corps fini(d’aprs R. Kottwitz). Séminaire Bourbaki, Vol. 1992/93. Astérisque No. 216(1993), Exp. No. 766, 4, 121-149.[Co-Shp] Cojocaru, Alina Carmen, and Shparlinski, Igor E., Distribution of Fareyfractions in residue classes and Lang-Trotter conjectures on average, Proc.A.M.S. 136 Number 6 (2008), 1977-1986.[Cox] Cox, David, Primes of <strong>the</strong> Form x 2 + ny 2 , John Wiley and Sons, New York,1989.[Da-Pa] David, Chantal, and Pappalardi, Francesco, Average Frobenius distributionsof elliptic curves, Internat. Math. Res. Notices 1999 (1999), no. 4, 165-183.[De-VA] Deligne, Pierre, Variétés abéliennes ordinaires sur un corps fini. (French)Invent. Math. 8 1969 238-243.[De-WeilII] Deligne, Pierre, La conjecture de Weil. II, Inst. Hautes Études Sci.Publ. Math. No. 52 (1980), 137-252.[Deu] Deuring,M., Die Typen der Multiplika<strong>to</strong>renringe elliptischer Funktionenkörper,Abh. Math. Sem. Hansischen Univ. 14 (1941), 197-272[Deu-CM] Deuring,M., Die Zetafunktion einer algebraischen Kurve vomGeschlechte Eins, Nachr. Akad.Wiss.Göttingen. Math.-Phys. Kl. Math.-Phys.-Chem. Abt. (1953) 8594.[Elkies-Real] Elkies, Noam D. Supersingular primes for elliptic curves over realnumber fields. Compositio Math. 72 (1989), no. 2, 165-172.


56 NICHOLAS M. KATZ[Elkies-SS] Elkies, Noam D. The existence of infinitely many supersingular primesfor every elliptic curve over Q. Invent. Math. 89 (1987), no. 3, 561-567.[Ge] Gekeler, Ernst-Ulrich, Frobenius distributions of elliptic curves over finiteprime fields, IMRN 2003, no. 37, 1999-2018.[H-SB-T] Harris, Michael, Shepherd-Barron, Nicholas, and Taylor, Richard, A familyof Calabi-Yau varieties and potential au<strong>to</strong>morphy, preprint available atwww.math.harvard.edu/∼rtaylor/cy3.pdf.[He] Hecke, Erich, ´’Uber eine neue Art von Zetafunktionen und ihre Beziehungenzur Verteilungen der Primzahlen. Math. Z. 1 (1918), 357-376;4 (1920), 11-21.Reprinted in Ma<strong>the</strong>matishe Werke, Vandenhoeck and Ruprecht, G´’ottingen,1983.[Honda] Honda, Taira, Isogeny classes of abelian varieties over finite fields, J. Math.Soc. Japan 20 (1968), 83-95.[Howe] Howe, Everett W. Principally polarized ordinary abelian varieties over finitefields. Trans. Amer. Math. Soc. 347 (1995), no. 7, 2361-2401.[Ig] Igusa, Jun-ichi, Class number of a definite quaternion with prime discriminant.Proc. Nat. Acad. Sci. U.S.A. 44 1958 312-314.[Ir-Ros] Ireland, Kenneth F., and Rosen, Michael I., A classical introduction <strong>to</strong>modern number <strong>the</strong>ory. Revised edition of Elements of number <strong>the</strong>ory. GraduateTexts in Ma<strong>the</strong>matics, 84. Springer-Verlag, New York-Berlin, 1982.[I<strong>to</strong>] I<strong>to</strong>, Kyosi, Introduction <strong>to</strong> Probability Theory, Cambridge University Press,New York, 1984.[Ka-ESDE] Katz, Nicholas M., Exponential sums and differential equations, Annalsof Ma<strong>the</strong>matics Studies, 124. Prince<strong>to</strong>n University Press, Prince<strong>to</strong>n, NJ, 1990.[Ka-TLFM] Katz, Nicholas M.,Twisted L-Functions and Monodromy, Annals ofMa<strong>the</strong>matics Studies, 108. Prince<strong>to</strong>n University Press, Prince<strong>to</strong>n, NJ, 1985.[K-M] Katz, Nicholas M., and Mazur, Barry, Arithmetic moduli of elliptic curves.Annals of Ma<strong>the</strong>matics Studies, 150. Prince<strong>to</strong>n University Press, Prince<strong>to</strong>n,NJ, 2002.[Ko1] Kottwitz, Robert E., Points on some Shimura varieties over finite fields. J.Amer. Math. Soc. 5 (1992), no. 2, 373-444.[Ko2] Kottwitz, Robert E., Shimura varieties and λ-adic representations. Au<strong>to</strong>morphicforms, Shimura varieties, and L-functions, Vol. I (Ann Arbor, MI, 1988),161-209, Perspect. Math., 10, Academic Press, Bos<strong>to</strong>n, MA, 1990.[L-T] Lang, Serge, and Trotter, Hale, Frobenius distributions in GL 2 -extensions,Springer Lecture Notes in Ma<strong>the</strong>matics 504, 1976.[Maz] Mazur, Barry, Rational points of abelian varieties with values in <strong>to</strong>wers ofnumber fields. Invent. Math. 18 (1972), 183-266.[Mes] Messing, William, The crystals associated <strong>to</strong> Barsotti-Tate groups: withapplications <strong>to</strong> abelian schemes. Lecture Notes in Ma<strong>the</strong>matics, Vol. 264.Springer-Verlag, Berlin-New York, 1972.


<strong>LANG</strong>-<strong>TROTTER</strong> <strong>REVISITED</strong> 57[Pa] Pacheco, Amícar, Distribution of <strong>the</strong> traces of Frobenius on elliptic curves overfunction fields. Acta Arith. 106 (2003), no. 3, 255–263.[Sch] Schoof, René Nonsingular plane cubic curves over finite fields. J. Combin.Theory Ser. A 46 (1987), no. 2, 183-211.[Se-Cheb] Serre, Jean-Pierre, Quelques applications du théorème de densité deChebotarev, Pub. Math. IHES 54(1981), 123-201.[Se-Mot] Serre, Jean-Pierre, Propriétés conjecturales des groupes de Galois motiviqueset des représentations l-adiques, Proceedings of Symposia in PureMa<strong>the</strong>matics 55 (1994) Part I, 377-400.[Sh] Shimura, Goro, Introduction <strong>to</strong> <strong>the</strong> arithmetic <strong>the</strong>ory of au<strong>to</strong>morphic functions.Kan Memorial Lectures, No. 1. Publications of <strong>the</strong> Ma<strong>the</strong>matical Societyof Japan, No. 11. Iwanami Shoten, Publishers, Tokyo; Prince<strong>to</strong>n UniversityPress, Prince<strong>to</strong>n, N.J., 1971[Sie] Siegel, Carl Ludwig, Über die Classenzahl quadratischer Zahlkörper, ActaArith. 1 (1935), 83-86.[St] Stein, William, The Modular Forms Explorer, database available athttp://modular.fas.harvard.edu.[Tate] Classes d’isogénie des variétés abéliennes sur un corps fini, Séminaire Bourbaki1968-69, exp. 352, 95-110.[Wat] Waterhouse, William C. Abelian varieties over finite fields. Ann. Sci. ÉcoleNorm. Sup. (4) 2 1969 521-560.[Z]Zarhin, Yuri G., Very simple 2-adic representations and hyperelliptic Jacobians,Mosc.Math. J. 2 (2002), no. 2, 403431.Prince<strong>to</strong>n University, Dept. Math., Fine Hall, Prince<strong>to</strong>n, NJ 08544-1000, USAE-mail address: nmk@math.prince<strong>to</strong>n.edu

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!