13.11.2014 Views

Margulis Lemma

Margulis Lemma

Margulis Lemma

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

STRUCTURE OF FUNDAMENTAL GROUPS OF MANIFOLDS<br />

WITH RICCI CURVATURE BOUNDED BELOW<br />

VITALI KAPOVITCH AND BURKHARD WILKING<br />

The main result of this paper is the following theorem which settles a conjecture<br />

of Gromov.<br />

Theorem 1 (Generalized <strong>Margulis</strong> <strong>Lemma</strong>). In each dimension n there are positive<br />

constants C(n), ε(n) such that the following holds for any complete n-dimensional<br />

Riemannian manifold (M, g) with Ric > −(n − 1) on a metric ball B 1 (p) ⊂ M.<br />

The image of the natural homomorphism<br />

π 1<br />

(<br />

Bε (p), p ) → π 1<br />

(<br />

B1 (p), p )<br />

contains a nilpotent subgroup N of index ≤ C(n). Moreover, N has a nilpotent basis<br />

of length at most n.<br />

We call a generator system b 1 , . . . , b n of a group N a nilpotent basis if the commutator<br />

[b i , b j ] is contained in the subgroup 〈b 1 , . . . , b i−1 〉 for 1 ≤ i < j ≤ n.<br />

Having a nilpotent basis of length n implies in particular rank(N) ≤ n. We will<br />

also show that equality in this inequality can only occur if M is homeomorphic to<br />

an infranilmanifold, see Corollary 7.1.<br />

In the case of a sectional curvature bound the theorem is due to Kapovitch,<br />

Petrunin and Tuschmann [KPT10], based on an earlier version which was proved<br />

by Fukaya and Yamaguchi [FY92].<br />

In the case of Ricci curvature a weaker form of the theorem was stated by<br />

Cheeger and Colding [CC96]. The main difference is that – similar to Fukaya and<br />

Yamaguchi’s theorem – no uniform bound on the index of the subgroup is provided.<br />

Cheeger and Colding never wrote up the details of the proof, and relied on some<br />

claims in [FY92] stating that their results would carry over from lower sectional<br />

curvature bounds to lower Ricci curvature bounds if certain structure results would<br />

be obtained. But compare also Remark 6.2 below.<br />

Corollary 2. Let (M, g) be a compact manifold with Ric > −(n−1) and diam(M) ≤<br />

ε(n) then π 1 (M) contains a nilpotent subgroup N of index ≤ C(n). Moreover, N<br />

has a nilpotent basis of length ≤ n.<br />

One of the tools used to prove these results is<br />

Theorem 3. Given n and D there exists C such that for any n-manifold with<br />

Ric ≥ −(n−1) and diam(M, g) ≤ D, the fundamental group π 1 (M) can be generated<br />

by at most C elements.<br />

2000 Mathematics Subject classification. Primary 53C20. Keywords: almost nonnegative Ricci<br />

curvature, <strong>Margulis</strong> <strong>Lemma</strong>.<br />

The first author was supported in part by a Discovery grant from NSERC.<br />

1


2 VITALI KAPOVITCH AND BURKHARD WILKING<br />

This estimate was previously proven by Gromov [Gro78] under the stronger<br />

assumption of a lower sectional curvature bound K ≥ −1. Recall that a conjecture<br />

of Milnor states that the fundamental group of an open manifold with nonnegative<br />

Ricci curvature is finitely generated. Although Theorem 3 is far from a solution to<br />

that problem, the <strong>Margulis</strong> <strong>Lemma</strong> immediately implies the following.<br />

Corollary 4. Let (M, g) be an open n-manifold with nonnegative Ricci curvature.<br />

Then π 1 (M) contains a nilpotent subgroup N of index ≤ C(n) such that any finitely<br />

generated subgroup of N has a nilpotent basis of length ≤ n.<br />

Of course, by work of Milnor [Mil68] and Gromov [Gro81], the corollary is well<br />

known (without uniform bound on the index) in the case of finitely generated<br />

fundamental groups. We should mention that by work of Wei [Wei88] (and an<br />

extension in [Wil00]) every finitely generated virtually nilpotent group appears<br />

as fundamental group of an open manifold with positive Ricci curvature in some<br />

dimension.<br />

The proofs of our results are based on the structure results of Cheeger and<br />

Colding for limit spaces of manifolds with lower Ricci curvature bounds [Col96,<br />

Col97, CC96, CC97, CC00a, CC00b]. This is also true for the proof of following<br />

new tool which can be considered as a Ricci curvature replacement of Yamaguchi’s<br />

fibration theorem as well as a replacement of the gradient flow of semi-concave<br />

functions used by Kapovitch, Petrunin and Tuschmann.<br />

Theorem 5 (Rescaling theorem). Suppose a sequence of Riemannian n-manifolds<br />

(M i , p i ) with Ric ≥ − 1 i<br />

converges in the pointed Gromov–Hausdorff topology to the<br />

Euclidean space (R k , 0) with k < n. Then after passing to a subsequence there is a<br />

subset G 1 (p i ) ⊂ B 1 (p i ), a rescaling sequence λ i → ∞ and a compact metric space<br />

K ≠ {pt} such that<br />

• vol(G 1 (p i )) ≥ (1 − 1 i ) vol(B 1(p i ))<br />

• For all x i ∈ G 1 (p i ) the isometry type of any limit (λ i M i , x i ) is given by<br />

K × R k .<br />

• For all x i , y i ∈ G 1 (p i ) we can find a diffeomorphism f i : M i → M i such<br />

that f i subconverges in the weakly measured sense to an isometry<br />

f ∞ :<br />

lim (λ iM i , x i ) → lim (λ iM i , y i ).<br />

G-H,i→∞ G-H,i→∞<br />

We will prove a technical generalization of this theorem in section 5. It will be<br />

important for the proof of the <strong>Margulis</strong> <strong>Lemma</strong> that the diffeomorphisms f i are<br />

in a suitable sense close to isometries on all scales. The precise concept is called<br />

zooming in property and is defined in section 3.<br />

The diffeomorphisms f i will be composed out of gradient flows of harmonic<br />

functions arising in the analysis of Cheeger and Colding. Their L 2 -estimates on the<br />

Hessian’s of these functions play a crucial role.<br />

Like in Fukaya and Yamaguchi’s paper the idea of the proof of the <strong>Margulis</strong><br />

<strong>Lemma</strong> is to consider a contradicting sequence (M i , g i ). By a fundamental observation<br />

of Gromov, the set of complete n-manifolds with Ric ≥ −(n − 1) is<br />

precompact in Gromov–Hausdorff topology. Therefore one can assume that the<br />

contradicting sequence converges to a (possibly very singular) space X. One then<br />

uses various rescalings and normal coverings of (M i , g i ) in order to find contradicting<br />

sequences converging to higher and higher dimensional spaces. One of the<br />

differences to Fukaya and Yamaguchi’s approach is that if we pass to a normal


STRUCTURE OF FUNDAMENTAL GROUPS 3<br />

covering we endow the cover with generators of the deck transformation group in<br />

order not to loose information. Since after rescaling the displacements of these<br />

deck transformations converge to infinity, this approach seems bound to failure.<br />

However, using the rescaling theorem we are able to alter the deck transformations<br />

by composing them with a sequence of diffeomorphisms which are isotopic to the<br />

identity and which have the zooming in property. With these alterations we are<br />

able to keep much more information on the action by conjugation of long homotopy<br />

classes on very short ones. Since the altered deck transformations still converge in<br />

a weakly measured sense to isometries, this allows one to eventually rule out the<br />

existence of a contradicting sequence.<br />

For the proof of the <strong>Margulis</strong> <strong>Lemma</strong> we do not need any sophisticated structure<br />

results on the isometry group of the limit space except for the relatively elementary<br />

Gap <strong>Lemma</strong> 2.4 guaranteeing that generic orbits of the limit group are locally path<br />

connected. However, for the following application of the <strong>Margulis</strong> <strong>Lemma</strong> a recent<br />

structure result of Colding and Naber [CN10] is key. They showed that the isometry<br />

group of a limit space with lower Ricci curvature bound has no small subgroups and<br />

thus, by the small subgroup theorem of Gleason, Montgomery and Zippin [MZ55],<br />

is a Lie group.<br />

Theorem 6 (Compact Version of the <strong>Margulis</strong> <strong>Lemma</strong>). Given n and D there are<br />

positive constants ε 0 and C such that the following holds: If (M, g) is a compact<br />

n-manifold M with Ric > −(n − 1) and diam(M) ≤ D, then there is ε ≥ ε 0 and a<br />

normal subgroup N ⊳ π 1 (M) such that for all p ∈ M:<br />

• the image of π 1 (B ε/1000 (p), p) → π 1 (M, p) contains N,<br />

• the index of N in the image of π 1 (B ε (p), p) → π 1 (M, p) is ≤ C and<br />

• N is a nilpotent group which has a nilpotent basis of length ≤ n.<br />

If D, n, ε 0 and C are given, there is an effective way to limit the number of<br />

possibilities for the quotient group π 1 (M, p)/N, see <strong>Lemma</strong> 9.2. In fact we have the<br />

following finiteness result. We recall that the torsion elements Tor(N) of a nilpotent<br />

group N form a subgroup.<br />

Theorem 7. a) For each D > 0 and each dimension n there are finitely<br />

many groups F 1 , . . . , F k such that the following holds: If M is a compact n-<br />

manifold with Ric > −(n − 1) and diam(M) ≤ D, then there is a nilpotent<br />

normal subgroup N ⊳ π 1 (M) with a nilpotent basis of length ≤ n − 1 and<br />

rank(N) ≤ n − 2 such that π 1 (M)/N ∼ = F i for suitable i.<br />

b) In addition to a) one can choose a finite collection of irreducible rational<br />

representations ρ j i : F i → GL(n j i , Q) (j = 1, . . . , µ i, i = 1, . . . , k) such that<br />

for a suitable choice of the isomorphism π 1 (M)/N ∼ = F i the following holds:<br />

There is a chain of subgroups Tor(N) = N 0 ⊳ · · · ⊳ N h0 = N which are all<br />

normal in π 1 (M) such that [N, N h ] ⊂ N h−1 and N h /N h−1 is free abelian.<br />

Moroever, the action of π 1 (M) on N by conjugation induces an action of F i<br />

on N h /N h−1 and the induced representation ρ: F i → GL ( (N h /N h−1 ) ⊗ Z Q )<br />

is isomorphic to ρ j i for a suitable j = j(h), h = 1, . . . , h 0.<br />

Part a) of Theorem 7 generalizes a result of Anderson [And90] who proved finiteness<br />

of fundamental groups under the additional assumption of a uniform lower<br />

bound on volume.<br />

In the preprint [Wil11] a partial converse of Theorem 7 is proved. In particular<br />

it is shown there that for a finite collection of finitely presented groups and any


4 VITALI KAPOVITCH AND BURKHARD WILKING<br />

finite collection of rational representations one can find D such that each group Γ<br />

satisfying the algebraic restrictions described in a) and b) of the above theorem<br />

with respect to this data contains a finite nilpotent normal subgroup H such that<br />

Γ/H can be realized as a fundamental group of a n + 2-dimensional manifold with<br />

diam(M) ≤ D and sectional curvature |K| ≤ 1.<br />

We can also extend the diameter ratio theorem of Fukaya and Yamaguchi [FY92].<br />

Theorem 8 (Diameter Ratio Theorem). For n and D there is a ˜D such that any<br />

compact manifold M with Ric ≥ −(n − 1) and diam(M) = D satisfies: If π 1 (M) is<br />

finite, then the diameter of the universal cover ˜M of M is bounded above by ˜D.<br />

In the case of nonnegative Ricci curvature the theorem says that the ratio<br />

diam( ˜M)/ diam(M) is bounded above. Fukaya and Yamaguchi’s theorem covers<br />

the case that M has almost nonnegative sectional curvature. The proof of Theorem<br />

8 has some similarities to parts of the proof of Gromov’s polynomial growth<br />

theorem [Gro81].<br />

Part of the paper was written up while the second named author was a Visiting<br />

Miller Professor at the University of California at Berkeley. He would like to thank<br />

the Miller institute for support and hospitality.<br />

Organization of the Paper<br />

1. Prerequisites 5<br />

2. Finite generation of fundamental groups 9<br />

3. Maps which are on all scales close to isometries 13<br />

4. A rough idea of the proof of the <strong>Margulis</strong> <strong>Lemma</strong> 20<br />

5. The Rescaling Theorem 21<br />

6. The Induction Theorem for C-Nilpotency 26<br />

7. <strong>Margulis</strong> <strong>Lemma</strong> 31<br />

8. Almost nonnegatively curved manifolds with b 1 (M, Z p ) = n 35<br />

9. Finiteness Results 37<br />

10. The Diameter Ratio Theorem 44<br />

We start in section 1 with prerequisites. We have included a subsection on<br />

notational conventions. Next, in section 2 we will prove Theorem 3.<br />

Section 3 is somewhat technical. We define the zooming in property, prove the<br />

needed properties, and provide two somewhat similar construction methods. This<br />

section serves mainly as a preparation for the proof of the rescaling theorem.<br />

We have added a short section 4 in which we sketch a rough idea of the proof of<br />

the <strong>Margulis</strong> <strong>Lemma</strong>.<br />

In section 5 the refined rescaling theorem (Theorem 5.1) is stated and proven.<br />

In Section 6 we put things together and provide a proof of the Induction Theorem,<br />

which has Corollary 2 as its immediate consequence.<br />

The <strong>Margulis</strong> <strong>Lemma</strong> follows from the Induction Theorem in two steps which are<br />

somewhat similar to Fukaya and Yamaguchi’s approach. Nevertheless, we included<br />

all details in section 7. We also show that the nilpotent group N in the <strong>Margulis</strong><br />

<strong>Lemma</strong> can only have rank n if the underlying manifold is homeomorphic to an<br />

infranilmanifold.<br />

Section 8 uses lower sectional curvature bounds. We give counterexamples to a<br />

theorem of Fukaya and Yamaguchi [FY92] stating that almost nonnegatively curved<br />

n-manifolds with first Z p -Betti number equal to n have to be tori, provided p is


STRUCTURE OF FUNDAMENTAL GROUPS 5<br />

sufficiently big. We show that instead these manifolds are nilmanifolds and that<br />

every nilmanifold covers another nilmanifold with maximal first Z p Betti number.<br />

Section 9 contains the proof of Theorem 6 and Theorem 7. Finally, Theorem 8<br />

is proved in section 10.<br />

1. Prerequisites<br />

1.1. Short basis. We will use the following construction due to Gromov.<br />

Given a manifold (M, p) and a group Γ acting properly discontinuously and<br />

isometrically on M one can define a short basis of the action of Γ at p as follows:<br />

For γ ∈ Γ we will refer to |γ| = d(p, γ(p)) as the norm or the length of γ. Choose<br />

γ 1 ∈ Γ with the minimal norm in Γ. Next choose γ 2 to have minimal norm in<br />

Γ\〈γ 1 〉. On the n-th step choose γ n to have minimal norm in Γ\〈γ 1 , γ 2 , ..., γ n−1 〉.<br />

The sequence {γ 1 , γ 2 , ...} is called a short basis of Γ at p. In general, the number of<br />

elements of a short basis can be finite or infinite. In the special case of the action<br />

of the fundamental group π 1 (M, p) on the universal cover ˜M of M one speaks of<br />

the short basis of π 1 (M, p). For any i > j we have |γ i | ≤ |γ −1<br />

j γ i |.<br />

While a short basis need not be unique the non-uniqueness will not matter in<br />

any of the proofs in this article and will largely be suppressed.<br />

If M/Γ is a closed manifold, then the short basis is finite and |γ i | ≤ 2 diam(M/Γ).<br />

1.2. Ricci curvature. Throughout this paper we will use the notation − ∫ to denote<br />

the average integral. We will use the following results of Cheeger and Colding.<br />

Theorem 1.1. [CC96] Let (M n i , p i) G−H<br />

−→<br />

i→∞ (X, p) with Ric(M i) ≥ − 1 i . Suppose X<br />

has a line. Then X splits isometrically as X ∼ = Y × R.<br />

We will also need the following corollary of the stability theorem.<br />

Theorem 1.2. [CC97] Let (Mi n, p i) → (R n , 0) with Ric Mi > −1. Then B R (p i ) is<br />

contractible in B R+ε (p i ) for all i ≥ i 0 (R, ε).<br />

Even more important for us is the following theorem which is closely linked with<br />

the proof of the splitting theorem for limit spaces.<br />

Theorem 1.3. [CC00a] Suppose (Mi n, p i) → (R k , 0) with Ric Mi ≥ −1/i. Then<br />

there exist harmonic functions b i 1, . . . , b i k : B 2(p i ) → R such that<br />

(1) |∇b i j| ≤ C(n) for all i and j and<br />

∫<br />

(2) −<br />

B 1(p i)<br />

∑<br />

∣ ∑<br />

∣ < ∇b<br />

i<br />

j , ∇b i l > −δ j,l + ‖Hess b i<br />

j<br />

‖ 2 dµ → 0 as i → ∞.<br />

j,l<br />

Moreover, the maps Φ i = (b i 1, . . . , b i k ): M i → R k provide ε i -Gromov–Hausdorff<br />

approximations between B 1 (p i ) and B 1 (0) with ε i → 0.<br />

The functions b i j in the above theorem are constructed as follows. Approximate<br />

Busemann functions f j in R k given by f j = d(·, N i e j )) − N i are lifted to M i using<br />

Hausdorff approximations to corresponding functions fj i. Here e j is the j-th coordinate<br />

vector in the standard basis of R k and N i → ∞ sufficiently slowly so that<br />

j


6 VITALI KAPOVITCH AND BURKHARD WILKING<br />

d G−H (B Ni (p i ), B Ni (0)) → 0 as i → ∞. The functions b i j are obtained by solving<br />

the Dirichlet problem on B 2 (p i ) with b i i | ∂B 2(p i ) = fi i| ∂B 2(p i )<br />

We will need a weak type 1-1 estimate for manifolds with lower Ricci curvature<br />

bounds which is a well-known consequence of the doubling inequality [Ste93, p. 12].<br />

<strong>Lemma</strong> 1.4 (Weak type 1-1 inequality). Suppose (M n , g) has Ric ≥ −1 and let<br />

f : M → R be a nonnegative function. Define Mx f(p) := sup r≤1 − ∫ B r(p)<br />

f. Then<br />

the following holds<br />

(a) If f ∈ L α (M) with α ≥ 1 then Mx f is finite almost everywhere.<br />

(b) If f ∈ L 1 (M) then vol { x | Mx f(x) > c } ≤ C(n) ∫<br />

c<br />

f for any c > 0.<br />

M<br />

(c) If f ∈ L α (M) with α > 1 then Mx f ∈ L α (M) and || Mx f|| α ≤ C(n, α)||f|| α .<br />

Note that as an immediate corollary we see that if f ∈ L α (M) with α > 1 then<br />

for any x ∈ M we have the following pointwise estimate<br />

(3) Mx((Mx f) α )(x) ≤ C(n, α) Mx(f α )(x).<br />

We will also need the following inequality (for any α > 1) which is an immediate<br />

consequence of the definition of Mx and Hölder inequality.<br />

(4) [Mx f(x)] α ≤ Mx(f α )(x).<br />

Indeed, for any r ≤ 1 we have that Mx(f α )(x) ≥ − ∫ B r(x) f α ≥ ( − ∫ B r(x) f) α<br />

and<br />

the inequality follows by taking the supremum over all r ≤ 1.<br />

Combining the last two inequalities we deduce<br />

(5) Mx[Mx(f)](x) ≤ Mx[(Mx(f)) α ] 1/α (x) ≤ C(n, α) 1/α Mx[f α ](x)<br />

for α > 1. Finally we note that for α > 1, β > 1 we have<br />

(6) Mx[Mx(f β/α )] α (x) ≤ C(n, α) Mx(f β )(x).<br />

This inequality immediately follows by applying (3) to g = f β α .<br />

We will also make use of the so-called segment inequality of Cheeger and Colding<br />

which says that in average the integral of a nonnegative function along all geodesics<br />

in ball can be estimated by the L 1 norm of the function.<br />

Theorem 1.5 (Segment inequality). [CC96] Given n and r 0 there exists τ =<br />

τ(n, r 0 ) such that the following holds. Let Ric(M n ) ≥ −(n−1) and let g : M → R +<br />

be a nonnegative function. Then for r ≤ r 0<br />

∫<br />

−<br />

B r(p)×B r(p)<br />

∫ d(z1,z 2)<br />

0<br />

∫<br />

g(γ z1,z 2<br />

(t))dµ(z 1 )dµ(z 2 ) ≤ τ · r · − g(q) dµ(q),<br />

B 2r(p)<br />

where γ z1,z 2<br />

denotes a minimal geodesic from z 1 to z 2 .<br />

Finally we will need the following observation<br />

<strong>Lemma</strong> 1.6 (Covering <strong>Lemma</strong>). There exists a constant C(n) such that the following<br />

holds. Suppose (M n , g) has Ric g ≥ −(n − 1). Let f : M → R be a nonnegative<br />

function, p ∈ M, π : ˜M → M be the universal cover of M, ˜p ∈ ˜M a lift of p, and<br />

let ˜f = f ◦ π. Then ∫<br />

∫<br />

− ˜f ≤ C(n)− f.<br />

B 1(˜p)<br />

B 1(p)


STRUCTURE OF FUNDAMENTAL GROUPS 7<br />

Proof. We choose a measurable section j : B 1 (p) → B 1 (˜p), i.e. π(j(x)) = x for any<br />

x ∈ B 1 (p). Let T = j(B 1 (p)). Then we obviously have that diam(T ) ≤ 2 and<br />

∫ ∫<br />

− f = − ˜f.<br />

B 1(p)<br />

Let S be the union of g(T ) over all g ∈ π 1 (M) such that g(T ) ∩ B 1 (˜p) ≠ ∅. It’s<br />

obvious from the triangle inequality that S ⊂ B 3 (˜p). It is also clear that<br />

∫ ∫<br />

− f = − ˜f<br />

B 1(p)<br />

and B 1 (˜p) ⊂ S. Lastly, notice that by Bishop–Gromov relative volume comparison<br />

vol B 3 (˜p) ≤ C(n) vol B 1 (˜p) for some universal constant C(n). Since B 1 (˜p) ⊂ S ⊂<br />

B 3 (˜p), we also have vol B 3 (˜p) ≤ C(n) vol S and hence<br />

∫<br />

∫ ∫<br />

− ˜f ≤ C(n)− ˜f = C(n)− f.<br />

B 1(˜p)<br />

S<br />

T<br />

S<br />

B 1(p)<br />

It is easy to see that the above proof generalizes to an arbitrary cover of M.<br />

1.3. Equivariant Gromov–Hausdorff convergence. Let Γ i be a sequence of<br />

closed subgroups of the isometry groups of metric spaces X i and ˜p i ∈ X i . If<br />

(X i , ˜p i ) converges in the pointed Gromov–Hausdorff topology to (Y, ˜p ∞ ), then after<br />

passing to a subsequence one can find a closed subgroup G ⊂ Iso(Y ) such that<br />

(X i , Γ i , ˜p i ) → (Y, G, ˜p ∞ ) in the equivariant Gromov–Hausdorff topology. For details<br />

we refer the reader to [FY92].<br />

Definition 1.7. A sequence of subgroups Υ i ⊂ Γ i is called uniformly open, if there<br />

is some ε > 0 such that Υ i contains the set<br />

{<br />

g ∈ Γi | d(gq, q) < ε for all q ∈ B 1/ε (˜p i ) } for all i.<br />

Γ i is called uniformly discrete if {e} ⊂ Γ i is uniformly open. We say the sequence<br />

Υ i is boundedly generated if there is some R such that Υ i is generated by<br />

{g ∈ Υ i | d(g˜p i , ˜p i ) < R}.<br />

<strong>Lemma</strong> 1.8. Let Υ j i ⊂ Γ i be uniformly open with Υ j i → Υj ∞ ⊂ G, j = 1, 2.<br />

a) Υ 1 i ∩ Υ2 i is uniformly open and converges to Υ1 ∞ ∩ Υ 2 ∞.<br />

b) If g i ∈ Γ i converges to g ∞ ∈ G then g i Υ 1 i g−1 i is uniformly open and converges<br />

to g ∞ Υ 1 ∞g∞ −1 .<br />

c) Suppose in addition that Υ j i is boundedly generated, j = 1, 2. If Υ1 ∞ ∩ Υ 2 ∞<br />

has finite index H in Υ 1 ∞, then Υ 1 i ∩ Υ2 i has index H in Υ1 i for all large i.<br />

The proof is an easy exercise.<br />

Example 1 (Z l actions converging to a Z 2 -action.). Consider a 2-torus T 2 k<br />

given by<br />

a Riemannian product S 1 ×S 1 where each of the factors has length k. Put l = k 2 +1<br />

and let Z/lZ act on T 2 k by ( e 2πi<br />

k 2 k2πi )<br />

+1 , e k 2 +1 . In the equivariant Gromov–Hausdorff<br />

topology the action will converge (for k → ∞) to the standard Z 2 action on R 2 .<br />

One can, of course, define a similar sequence of actions on S 3 × S 3 but in this<br />

case the actions will not be uniformly cocompact.<br />


8 VITALI KAPOVITCH AND BURKHARD WILKING<br />

Remark 1.9. a) The example shows that it is difficult to relate an open subgroup<br />

in the limit to corresponding subgroups in the sequence.<br />

b) If g1, i . . . , gh i i<br />

is a generator system of some subgroup Γ ′ i ⊆ Γ i, then one can<br />

consider – after passing to a subsequence – a different limit construction to<br />

get a subgroup G ′ ⊂ G: For some fixed R consider all words w in g1, i . . . , gh i i<br />

with the property w ⋆ p i ∈ B R (˜p i ). This defines a subset in the Cayley<br />

graph of Γ ′ i . Consider now all elements in Γ′ i represented by words in the<br />

identity component of this subset. Passing to the limit gives a closed subset<br />

S R ⊂ G of the limit group and one can now take the limit of S R for R → ∞.<br />

This sort of word limit group can be much smaller than the regular limit<br />

group as the above examples shows. Although it depends on the choice<br />

of the generator system, there are occasions where this limit behaves more<br />

natural than the usual. However, this idea is used only indirectly in the<br />

paper.<br />

1.4. Notations and conventions. Throughout the paper we will use the following<br />

conventions.<br />

• As already mentioned − ∫ S f dµ stands for ∫<br />

1<br />

vol(S) S f dµ.<br />

• Mx(f)(x) denotes the maximum function of f evaluated at x, see <strong>Lemma</strong> 1.4<br />

for the definition.<br />

• A map σ : X → Y between inner metric spaces is called a submetry if<br />

σ(B r (p)) = B r (σ(p)) for all r > 0 and p ∈ X.<br />

• If S ⊂ F is a subset of a group, then 〈S〉 denotes the subgroup generated<br />

by S.<br />

• If g 1 and g 2 are elements in a group, [g 1 , g 2 ] := g 1 g 2 g1 −1 g−1 2 denotes the<br />

commutator. For subgroups F 1 , F 2 we put [F 1 , F 2 ] := 〈{[f 1 , f 2 ] | f i ∈ F i }〉.<br />

• Generators b 1 , . . . , b n ∈ F of a group are called a nilpotent basis if [b i , b j ] ∈<br />

〈b 1 , . . . , b i−1 〉 for i < j.<br />

• N ⊳ F means: N is a normal subgroup of F.<br />

• For a Riemannian manifold (M, g) and λ > 0 we let λM denote the Riemannian<br />

manifold (M, λ 2 g).<br />

• For a limit space Y a tangent cone C p Y at p is some Gromov–Hausdorff<br />

limit of (λ i Y, p) for some λ i → ∞. Tangent cones are not always cones and<br />

are not necessarily unique.<br />

• For a limit space Y of manifolds with lower Ricci curvature bound a regular<br />

point p ∈ Y is a point all of whose tangent cones at p are given by R kp . By<br />

a result of Cheeger and Colding [CC97] these points have full measure and<br />

are thus dense.<br />

• For two sequences of pointed metric spaces (X i , p i ), (Y i , q i ) and maps<br />

f i : X i → Y i we use the notation f i : [X i , p i ] → [Y i , q i ] to indicate that<br />

f i converges in some sense to a map from the pointed Gromov–Hausdorff<br />

limit of (X i , p i ) to the pointed Gromov–Hausdorff limit of (Y i , q i ). It usually<br />

does not mean that f i (p i ) = q i . However, if this is the case we write<br />

f i : (X i , p i ) → (Y i , q i ).<br />

• If a group G acts on a metric space we say g ∈ G displaces p ∈ X by r if<br />

d(p, gp) = r. We will denote the orbit of p by G ⋆ p.<br />

• Gromov’s short generator system (or short basis for short) of a fundamental<br />

group is defined at the beginning of section 1.


STRUCTURE OF FUNDAMENTAL GROUPS 9<br />

2. Finite generation of fundamental groups<br />

<strong>Lemma</strong> 2.1 (Product <strong>Lemma</strong>). Let M i be a sequence of manifolds with Ric Mi ><br />

−ε i → 0 satisfying<br />

• B ri (p i ) is compact for all i with r i → ∞ and p i ∈ M i ,<br />

• for every i and j = 1, . . . , k there are harmonic functions b i j : B r i<br />

(p i ) → R<br />

which are L-Lipschitz and fulfill<br />

∫ k∑<br />

k∑<br />

− | < ∇b i j, ∇b i l > −δ jl | + ‖Hess b i<br />

j<br />

‖ 2 dµ → 0 for all R > 0.<br />

B R (p i)<br />

j,l=1<br />

j=1<br />

Then (B ri (p i ), p i ) subconverges in the pointed Gromov–Hausdorff topology to a metric<br />

product (R k ×X, p ∞ ) for some metric space X. Moreover, (b i 1, . . . , b i k ) converges<br />

to the projection onto the Euclidean factor.<br />

The lemma remains true if one just has a uniform lower Ricci curvature bound<br />

and one can also prove a local version of the lemma if r i = R is a fixed number.<br />

However, the above version suffices for our purposes.<br />

Proof. The main problem is to prove this in the case of k = 1. Put b i = b i 1.<br />

After passing to a subsequence we may assume that (B ri (p i ), p i ) converges to some<br />

limit space (Y, p ∞ ). We also may assume that b i converges to an L-Lipschitz map<br />

b ∞ : Y → R.<br />

Step 1. b ∞ is 1-Lipschitz.<br />

This is essentially immediate from the segment inequality. Let x, y ∈ Y be<br />

arbitrary. Choose R so large that x, y ∈ B R/4 (p ∞ ) and let x i , y i ∈ B R/2 (p i ) be<br />

sequences converging to x and y.<br />

For a fixed δ


10 VITALI KAPOVITCH AND BURKHARD WILKING<br />

<strong>Lemma</strong> 2.2. Suppose a sequence of complete inner metric spaces (Y i , p i ) converges<br />

to a locally compact metric space (Y ∞ , p ∞ ). Suppose G i acts isometrically on Y i and<br />

that the action of G i converges to an action of a closed subgroup G ∞ ⊂ Iso(Y ∞ ).<br />

Let G i (ε) denote the subgroup generated by those elements that displace p i by at<br />

most ε, i ∈ N ∪ {∞}. Suppose G ∞ (ε) = G ∞ ( a+b<br />

2<br />

Then there is some sequence ε i<br />

(a + ε i , b − ε i ).<br />

) for all ε ∈ (a, b) and 0 ≤ a < b.<br />

) for all ε ∈<br />

→ 0 such that G i (ε) = G i ( a+b<br />

2<br />

Proof. Suppose on the contrary we can find g i ∈ G i (ε 2 ) \ G i (ε 1 ) for fixed ε 1 < ε 2 ∈<br />

(a, b). Without loss of generality d(p i , g i p i ) ≤ ε 2 .<br />

Because of g i ∉ G i (ε 1 ) it follows that for any finite sequence of points p i =<br />

x 1 , . . . , x h = g i p i ∈ G i ⋆ p i there is one j with d(x j , x j+1 ) ≥ ε 1 . Clearly this<br />

property carries over to the limit and implies that g ∞ ∈ G ∞ (ε 2 ) is not contained<br />

in G((ε 1 + a)/2) – a contradiction.<br />

□<br />

<strong>Lemma</strong> 2.3. Suppose (Mi n, q i) converges to (R k × K, q ∞ ) where Ric Mi ≥ −1/i<br />

and K is compact. Assume the action of π 1 (M i ) on the universal cover ( ˜M i , ˜q i )<br />

converges to a limit action of a group G on some limit space (Y, ˜q ∞ ).<br />

Then G(r) = G(r ′ ) for all r, r ′ > 2 diam(K).<br />

Proof. Since Y/G is isometric to R k ×K, it follows that there is a submetry σ : Y →<br />

R k . It is immediate from the splitting theorem that this submetry has to be linear,<br />

that is, for any geodesic c in Y the curve σ◦c is affine linear. Hence we get a splitting<br />

Y = R k × Z such that G acts trivially on R k and on Z with compact quotient K.<br />

We may think of ˜q ∞ as a point in Z. For g ∈ G consider a mid point x ∈ Z of<br />

˜q ∞ and g˜q ∞ . Because Z/G = K we can find g 2 ∈ G with d(g 2˜q ∞ , x) ≤ diam(K).<br />

Clearly<br />

d(˜q ∞ , g 2˜q ∞ ) ≤ 1 2 d(˜q ∞, g˜q ∞ ) + diam(K)<br />

d(˜q ∞ , g −1<br />

2 g˜q ∞) = d(g 2˜q ∞ , g˜q ∞ ) ≤ 1 2 d(˜q ∞, g˜q ∞ ) + diam(K).<br />

This proves G(r) ⊂ G ( r/2 + diam(K) ) and the lemma follows.<br />

<strong>Lemma</strong> 2.4 (Gap <strong>Lemma</strong>). Suppose we have a sequence of manifolds (M i , p i ) with<br />

a lower Ricci curvature bound converging to some limit space (X, p ∞ ) and suppose<br />

that the limit point p ∞ is regular. Then there is a sequence ε i → 0 and a number<br />

δ > 0 such that the following holds. If γ 1 , . . . , γ li is a short basis of π 1 (M i , p i ) then<br />

either |γ j | ≥ δ or |γ j | < ε i .<br />

Moreover, if the action of π 1 (M i ) on the universal cover ( ˜M i , ˜p i ) converges to<br />

an action of the limit group G on (Y, ˜p ∞ ), then the orbit G ⋆ ˜p ∞ is locally path<br />

connected.<br />

Proof. That the orbit is locally path connected is a consequence of the following<br />

Claim. There is a δ > 0 such that for all points x ∈ G ⋆ ˜p ∞ with d(˜p ∞ , p) < δ we<br />

can find y ∈ G ⋆ ˜p ∞ with max{d(˜p ∞ , y), d(y, x)} ≤ 0.51 · d(˜p ∞ , x).<br />

To prove the claim we argue by contradiction and assume that for some δ h > 0<br />

converging to 0 we can find g h ∈ G with d(g h ˜p ∞ , ˜p ∞ ) = δ h such that for all a ∈ G<br />

with d(a˜p ∞ , ˜p ∞ ) ≤ 0.51δ h we have d(a˜p ∞ , g˜p ∞ ) > 0.51δ h .<br />

After passing to a subsequence we can assume that ( 1<br />

δ h<br />

Y, ˜p ∞ ) converges to a<br />

tangent cone (C p∞ Y, o) and that the action of G on 1<br />

δ h<br />

Y converges to an action of<br />


STRUCTURE OF FUNDAMENTAL GROUPS 11<br />

some group K on C p∞ Y . Let g ∞ ∈ K be the limit of g h . Clearly d(o, g ∞ o) = 1 and<br />

for all a ∈ K with d(o, ao) ≤ 0.51 we have d(go, ao) ≥ 0.51. In particular the orbit<br />

K ⋆ o is not convex.<br />

Because X = Y/G the quotient C˜p∞ Y/K is isometric to lim h→∞ ( 1<br />

ε h<br />

X, p ∞ ) which<br />

by assumption is isometric to some Euclidean space R k .<br />

As in the proof of <strong>Lemma</strong> 2.3 it follows that C˜p∞ Y is isometric to R k × Z and<br />

the orbits of the action of K are given by v × Z where v ∈ R k . In particular, the K<br />

orbits are convex – a contradiction.<br />

Clearly, the claim implies that G ⋆ ˜p ∞ ∩ B r (˜p ∞ ) is path connected for r < δ. In<br />

fact, it is now easy to construct a Hölder continuous path from ˜p ∞ to any point in<br />

G ⋆ ˜p ∞ ∩ B r (˜p ∞ ).<br />

Of course, the claim also implies G(ε) = G(δ) for all ε ∈ (0, δ]. Therefore the<br />

first part of <strong>Lemma</strong> 2.4 follows from <strong>Lemma</strong> 2.2.<br />

□<br />

Theorem 3 will follow from the following slightly more general result.<br />

Theorem 2.5. Given n and R there is a constant C such that the following holds.<br />

Suppose (M, g) is a manifold, p ∈ M, B 2R (p) is compact and Ric > −(n − 1) on<br />

B 2R (p). Furthermore, we assume that π 1 (M, g) is generated by loops of length ≤ R.<br />

Then π 1 (M, p) can be generated by C loops of length ≤ R.<br />

Moreover, there is a point q ∈ B R/2 (p) such that any Gromov short generator<br />

system of π 1 (M, q) has at most C elements.<br />

Proof. It is clear that it suffices to show that for a point q ∈ B R/2 (p) the number<br />

of elements in a Gromov short generator system of π 1 (M, q) is bounded above by<br />

some a priori constant.<br />

We consider a Gromov short generator system γ 1 , . . . , γ k of π 1 (M, q). With<br />

|γ i | ≤ |γ i+1 | for some q ∈ B R/2 (p).<br />

Clearly |γ i | ≤ 2R and it easily follows from Bishop–Gromov relative volume<br />

comparison that there are effective a priori bounds (depending only on n, R and r)<br />

for the number of short generators with length ≥ r.<br />

We will argue by contradiction. It will be convenient to modify the assumption<br />

on π 1 being boundedly generated. We call (M i , g i ) a contradicting sequence if the<br />

following holds<br />

• B 3 (p i ) is compact and Ric > −(n − 1) on B 3 (p i ).<br />

• For all q i ∈ B 1 (p i ) the number of short generators of π 1 (M i , q i ) of length<br />

≤ 4 is larger than 2 i .<br />

Clearly it suffices to rule out the existence of a contradicting sequence. We may<br />

assume that (B 3 (p i ), p i ) converges to some limit space (X, p ∞ ). We put<br />

dim(X) = max { k | there is a regular p ∈ B 1/4 (p ∞ ) with C x X ∼ = R k} .<br />

We argue by reverse induction on dim(X). We we start our induction at dim(X) ≥<br />

n + 1. It is well known that this can not happen so there is nothing to prove. The<br />

induction step is subdivided in two substeps.<br />

Step 1. For any contradicting sequence (M i , p i ) converging to (X, p ∞ ) there is a<br />

new contradicting sequence converging to (R dim(X) , 0).<br />

Choose q i ∈ B 1/4 (p i ) converging to some point q ∞ ∈ B 1/4 (p ∞ ) with C q∞ X =


12 VITALI KAPOVITCH AND BURKHARD WILKING<br />

R dim(X) . After passing to a subsequence we can assume that for all x i ∈ B 1/4 (q i )<br />

the number of short generators of π 1 (M i , x i ) of length ≤ 1 is at least 3 i .<br />

Since the number of short generators of length ∈ [ε, 4] is bounded by some a priori<br />

constant, we find a sequence λ i → ∞ very slowly such that for every x ∈ B 1/λi (q i )<br />

the number of short generators of π 1 (M i , x) of length ≤ 4/λ i is at least 2 i and<br />

(λ i M i , q i ) converges to C q∞ X = R dim(X) . Replacing M i by λ i M i and p i and q i<br />

gives a new contradicting sequence, as claimed.<br />

Step 2. If there is a contradicting sequence converging to R k , then we can find a<br />

contradicting sequence converging to a space whose dimension is larger than k.<br />

Let (M i , p i ) be a contradicting sequence converging to (R k , 0). We may assume<br />

without loss of generality that for some r i → ∞ and ε i → 0 the Ricci curvature on<br />

B ri (p i ) is bounded below by −ε i and that B ri (p i ) is compact. In fact, one can run<br />

through the arguments of the first step, to see that, after passing to a subsequence,<br />

a rescaling by λ i → ∞ (very slowly) is always possible.<br />

By Theorem 1.3 we can find a harmonic map (b i 1, . . . , b i k ): B 1(q i ) → R k with<br />

and<br />

∫<br />

−<br />

B 1(q i)<br />

j,l=1<br />

k∑<br />

|〈∇b i l, ∇b i j〉 − δ lj | + ‖Hess(b i l)‖ 2 = ε i → 0<br />

|∇b i j| ≤ C(n).<br />

By the weak (1,1) inequality (<strong>Lemma</strong> 1.4) we can find z i ∈ B 1/2 (q i ) with<br />

∫<br />

−<br />

B r(z i)<br />

j,l=1<br />

k∑<br />

|〈∇b i l, ∇b i j〉 − δ lj | + ‖Hess(b i l)‖ 2 ≤ Cε i → 0<br />

for all r ≤ 1/4. By the Product <strong>Lemma</strong> (2.1), for any sequence µ i → ∞ the spaces<br />

(µ i B r (z i ), z i ) subconverge to a metric product (R k × Z, z ∞ ) for some Z depending<br />

on the rescaling.<br />

From <strong>Lemma</strong>s 2.2 and 2.3 we deduce that there is a sequence δ i → 0 such that<br />

for all z i ∈ B 2 (p i ) the short generator system of π 1 (M i , z i ) does not contain any<br />

elements with length in [δ i , 4].<br />

Choose r i ≤ 1 maximal with the property that there is y i ∈ B ri (z i ) such that<br />

the short generator system of π 1 (M i , y i ) contains one generator of length r i . We<br />

have seen above that r i ≤ δ i → 0<br />

Put N i = 1 r i<br />

M i . By construction, π 1 (N i , y i ) still has 2 i short generators of<br />

length ≤ 1 for all y i ∈ B 1 (z i ) ⊂ N i , and there is one with length 1 for a suitable<br />

y i . By the Product <strong>Lemma</strong> (2.1), (N i , z i ) subconverges to a product (R k × Z, z ∞ ).<br />

<strong>Lemma</strong>s 2.2 and 2.3 imply that Z can not be a point and the claim is proved. □<br />

Finally, let us mention that there is a measured version of Theorem 2.5.<br />

Theorem 2.6. For any n > 1 for any 1 > ε > 0 there exists C(n, ε) such that<br />

if Ric(M n ) ≥ −(n − 1) on B 3 (p) with B 3 (p) compact. Then there is a subset<br />

B 1 (p) ′ ⊂ B 1 (p) with vol B 1 (p) ′ ≥ (1 − ε) vol B 1 (p) and for any q ∈ B 1 (p) ′ the short<br />

basis of π 1 (M, q) has at most C(n, ε) elements of length ≤ 1.<br />

Since we do not have any applications of this theorem, we omit its proof.


STRUCTURE OF FUNDAMENTAL GROUPS 13<br />

3. Maps which are on all scales close to isometries.<br />

For a map f : X → Y between metric spaces we define the distance distortion<br />

on scale r by<br />

(7) dt f r (p, q) = min{r, |d(p, q) − d(f(p), f(q))|}.<br />

Definition 3.1. Let (M i , p 1 i ) and (N i, p 2 i ) be two sequences of Riemannian manifolds<br />

with B i (p j i ) being compact and Ric B i(p j i )<br />

> −C for some C, all i and j = 1, 2.<br />

We say that a sequence of diffeomorphisms f i : M i → N i has the zooming in property<br />

if the following holds:<br />

There exist R 0 > 0, sequences r i → ∞, ε i → 0 and subsets B 2ri (p j i )′ ⊂ B 2ri (p j i )<br />

(j = 1, 2) satisfying<br />

a) vol ( B 1 (q) ∩ B 2ri (p j i )′) ≥ (1 − ε i ) vol(B 1 (q)) for all q ∈ B ri (p j i ).<br />

b) For all p ∈ B ri (p 1 i )′ , all q ∈ B ri (p 2 i )′ and all r ∈ (0, 1] we have<br />

∫<br />

− dt fi<br />

r (x, y)dµ(x)dµ(y) ≤ rε i and<br />

B<br />

∫ r(p)×B r(p)<br />

− dt f −1<br />

i<br />

r (x, y)dµ(x)dµ(y) ≤ rε i .<br />

B r(q)×B r(q)<br />

c) There are subsets S j i ⊂ B 1(p j i ) with vol(Sj i ) ≥ 1 2 vol( B 1 (p j i )) (j = 1, 2) and<br />

f(S 1 i ) ⊂ B R 0<br />

(p 2 i ) and f −1 (S 2 i ) ⊂ B R 0<br />

(p 1 i ).<br />

We will call elements of B 2ri (p 1 i )′ good points and sometimes use the convention<br />

B r (q) ′ := B 2ri (p j i )′ ∩ B r (q) for all B r (q) ⊂ B 2ri (p j i ).<br />

In the applications we will always have N i = M i . However, in some instances<br />

d(p 1 i , p2 i ) → ∞. That is why it might be helpful to also think about maps between<br />

two unrelated pointed manifolds. If the choice of the base points is not clear we<br />

will say that f i : [M i , p 1 i ] → [N i, p 2 i ] has the zooming in property. Notice that we<br />

do not require f i (p 1 i ) = p2 i . However, property c) ensures that the base points are<br />

respected in a weaker sense.<br />

<strong>Lemma</strong> 3.2. Let (M i , p 1 i ), (N i, p 2 i ) and f i be as above. Then, after passing to a<br />

subsequence, (M i , p 1 i ) G−H<br />

−→ (X 1, p 1 ∞), (N i , p 2 i ) G−H<br />

−→ (X 2, p 2 ∞) and f i converges in<br />

i→∞ i→∞<br />

the weakly measured sense to a measure preserving isometry f ∞ : X 1 → X 2 , that is<br />

for each r > 0 there is a sequence δ i → 0 and subsets S i ⊂ B r (p 1 i ) satisfying<br />

• vol(S i ) ≥ (1 − δ i ) vol(B r (p 1 i )) and<br />

• f i|Si is pointwise close to f ∞|Br(p ∞).<br />

Moreover, f −1<br />

i<br />

converges in this sense to f −1<br />

∞ .<br />

First we claim that the following holds.<br />

Sublemma 3.3. There exists C 1 (n) such that for any good point x ∈ B ri (p 1 i )′ and<br />

any r ≤ 1 there is a subset B r (x) ′′ ⊂ B r (x) with<br />

vol B r (x) ′′ ≥ (1 − C 1 ε i ) vol B r (x) and f i (B r (x) ′′ ) ⊂ B 2r (f i (x)).<br />

Proof. The proof proceeds by induction on the size of r as follows. It is clear that<br />

at good points x ∈ B ri (p 1 i )′ the differential of f i has a bilipschitz constant e Cεi<br />

with some universal C provided ε i < 1/2. Therefore it is clear that the statement


14 VITALI KAPOVITCH AND BURKHARD WILKING<br />

holds for all small r. Hence it suffices to prove that if it holds for r ≤ 1/10, then it<br />

holds for 10r. By assumption<br />

∫<br />

−<br />

dt fi<br />

10r (p, q)dµ(p)dµ(q) ≤ 10rε i.<br />

B 10r(x)×B 10r(x)<br />

Furthermore, as long as C 1 ε i ≤ 1/2 our induction assumption implies that there is<br />

a subset S ⊂ B r (x) with vol(S) ≥ 1 2 vol(B r(x)) and f i (S) ⊂ B 2r (f i (x)).<br />

By Bishop–Gromov<br />

∫<br />

− dt fi<br />

10r (p, q)dµ(p)dµ(q) ≤ C 2(n)rε i<br />

S×B 10r(x)<br />

with some universal constant C 2 (n). Therefore there is a subset B 10r (x) ′′ ⊂ B 10r (x)<br />

with vol(B 10r (x)) ′′ ≥ (1 − C 2 (n)ε i /2) vol(B 10r (x)) and<br />

∫<br />

− dt fi<br />

10r (p, q)dµ(p) ≤ 2r for all q ∈ B 10r(x) ′′ .<br />

S<br />

Using that f i (S) ⊂ B 2r (f i (x)) this clearly implies that f i (B 10r (x) ′′ ) ⊂ B 20r (f i (x)).<br />

Thus the sublemma is valid if we put C 1 (n) = C 2 (n)/2 provided we know in addition<br />

that C 1 (n)ε i ≤ 1/2. We can remove the upper bound on ε i by just putting C 1 (n) =<br />

C 2 (n).<br />

□<br />

Of course, a similar inequality holds for f −1<br />

i .<br />

Proof of <strong>Lemma</strong> 3.2. Clearly, we can finish the proof of the lemma by establishing<br />

Claim. Given δ ∈ (0, 1/10) there exists i 0 such that for all x i , y i ∈ B ri (p 1 i )′ with<br />

d(x i , y i ) < 1/2 we have |d(f i (x i ), f i (x i )) − d(x i , y i )| ≤ 10δ for all i ≥ i 0 (δ).<br />

Consider subsets B δ (x i ) ′′ and B δ (y i ) ′′ as in the sublemma. Then vol(B δ (x i ) ′′ ) ≥<br />

1<br />

2 vol(B δ(x i )) for large i and combining with Bishop–Gromov gives<br />

vol(B 1 (x i )) 2 ≤ C 2 (n, δ) vol(B δ (x i ) ′′ ) vol(B δ (y i ) ′′ ).<br />

Thus, ∫<br />

∫<br />

−<br />

dt fi<br />

1 (p, q) ≤ C 2− dt fi<br />

1 (p, q) ≤ C 2ε i .<br />

B δ (x i) ′′ ×B δ (y i) ′′ B 1(x i) 2<br />

Choose i 0 so large that C 2 ε i ≤ δ for i ≥ i 0 . Then for such i we can find x ′ i ∈ B δ(x i ) ′′<br />

and y i ′ ∈ B δ(y i ) ′′ with dt fi<br />

1 (x′ i , y′ i ) ≤ δ. Combining with d(f i(x ′ i ), f i(x i )) ≤ 2δ and<br />

d(f i (y i ′), f i(y i )) ≤ 2δ we deduce dt fi<br />

1 (x i, y i ) ≤ 7δ as claimed. □<br />

<strong>Lemma</strong> 3.4. Consider (M i , p 1 i ), (N i, p 2 i ), (P i, p 3 i ) like above and two sequences of<br />

diffeomorphisms f i : M i → N i and g i : N i → P i with the zooming in property. Then<br />

f i ◦ g i also has the zooming in property.<br />

Proof. By Sublemma 3.3 there is an i 0 such that for all q i ∈ B ri (p) and r ≤ 1/2<br />

there is a subset B r (q i ) ′′ ⊂ B r (q i ) ′ with vol(B r (q i ) ′′ ) ≥ (1−C 1 ε i ) and f i (B r (q i ) ′′ ) ⊂<br />

B 2r (q i ). Thus,<br />

∫<br />

∫<br />

− dt fi◦gi<br />

r (a, b) ≤ C 2 ε i r + e 2nCεi − dt gi<br />

r (f i (a), f i (b))<br />

B r(q) 2 (B r(q) ′′ )<br />

∫<br />

2<br />

≤ C 2 ε i r + C 3 −<br />

r (x, y) ≤ C 4 ε i ,<br />

B 2r(f i(q)) 2 dt gi


STRUCTURE OF FUNDAMENTAL GROUPS 15<br />

where we used that the differential of f i at q ∈ B r (q i ) ′ has a bilipschitz constant<br />

e Cεi and that the ratio of vol(B 2r (f i (q))) and vol(B r (q)) is for large i bounded by<br />

a universal constant.<br />

We are left with the case of r ∈ [1/2, 1]. By <strong>Lemma</strong> 3.2, f i and g i converge in the<br />

weakly measured sense to isometries. Clearly this implies that f i ◦ g i also converges<br />

in the weakly measured sense to an isometry. And this implies that the distortion<br />

function dt fi◦gi<br />

1 has averaged L 1 norm ≤ δ i on every unit ball in B ρi (p i ) for some<br />

ρ i → ∞ and δ i → 0.<br />

□<br />

The next lemma explains the notion zooming in property.<br />

<strong>Lemma</strong> 3.5. Let (M i , p 1 i ), (N i, p 2 i ) and f i be as above. Then there is a ρ i → ∞,<br />

δ i → 0 and Ti<br />

1 ⊂ B ρi (p 1 i ) such that the following holds<br />

• vol(B 1 (q) ∩ Ti 1) ≥ (1 − δ i) vol(B 1 (q)) for all q ∈ B ρi/2(p 1 i ).<br />

• For any sequence of real numbers λ i → ∞ and any sequence q i ∈ Ti<br />

1<br />

f i : (λ i M i , q i ) → (λ i N i , f i (q i ))<br />

has the zooming in property. We say that f i is good on all scales at q i .<br />

Proof. Let G j i = B 2r i<br />

(p j i )′ and B j i = B 2r i<br />

(p i ) \ B 2ri (p j i )′ . After adjusting r i → ∞<br />

and ε i → 0 we may assume vol(B j i ) ≤ ε i vol(B 1 (q)) for all q ∈ B 2ri (p j i ). Let χ ij<br />

be the characteristic function of B j i . By the weak 1-1 inequality there exists a<br />

universal C such that the set<br />

H j i := { x ∈ B ri/2(p j i ) | Mx(χ ij) ≥ √ }<br />

ε i<br />

satisfies<br />

vol(H j i ) ≤ C√ ε i vol(B 1 (q))<br />

for all q ∈ B 2ri (p j i ). We put T i 1 := ( ) ( )<br />

B ri/2(p 1 i )\H1 i ∩ f<br />

−1<br />

i Bri/2(p 2 i )\H2 i and<br />

Ti<br />

2 := f i (Ti 1). Using <strong>Lemma</strong> 3.2 we can find ρ i → ∞ and δ i → 0 such that<br />

vol ( B ρi (p j i ) \ T j )<br />

i ≤ δi vol(B 1 (q)) for all q ∈ B ri (p j i ), j = 1, 2.<br />

By definition of T j i<br />

vol(B r (q) ∩ G j i )<br />

vol(B r (q))<br />

≥ 1 − √ ε i for all q ∈ T j i and all r ≤ 1, j = 1, 2.<br />

Let dt λif<br />

r<br />

denote the distortion on scale r of f i : λ i M i → λ i N i . Clearly<br />

dt λif<br />

r (p, q) = λ i dt fi<br />

r/λ i<br />

(p, q).<br />

Thus for all λ i → ∞ and all q i ∈ T j i<br />

the zooming in property.<br />

the map f i : (λ i M i , q i ) → (λ i N i , f i (q i )) has<br />

□<br />

Proposition 3.6 (First main example). Let α > 1. Consider a sequence of manifolds<br />

(M i , p i ) with a fixed lower Ricci curvature bound and a sequence of time<br />

dependent vector fields Xi<br />

t (piecewise constant in time) with compact support. Let<br />

c i : [0, 1] → B ri (p i ) be an integral curve of Xi t with c i(0) = p i


16 VITALI KAPOVITCH AND BURKHARD WILKING<br />

Assume that Xi<br />

t is divergence free on B r i+100(p i ) and that B ri+100(p i ) is compact.<br />

Put<br />

u s,i (x) := ( Mx ‖∇·Xi s ‖ α) 1/α<br />

and suppose<br />

∫ 1<br />

0<br />

u t,i (c i (t)) = ε i → 0.<br />

Let f i = φ i1 be the flow of X t i evaluated at time 1. Then for all λ i → ∞<br />

f i : (λ i M i , c(0)) → (λ i M i , c(1))<br />

has the zooming in property. Moreover, for any lift ˜fi : ˜Mi → ˜M i of f i to the<br />

universal cover ˜Mi of M i and for any lift ˜p i ∈ ˜M of c(0) = p i the sequence<br />

˜f i : (λ i ˜Mi , ˜p i ) → (λ i ˜Mi , ˜f i (˜p i )) has the zooming in property as well.<br />

The proposition remains valid if the assumption on Xi<br />

t being divergence free<br />

is removed. However, the proof is easier in this case and we do not have any<br />

applications of the more general case. We will need the following<br />

<strong>Lemma</strong> 3.7. There exists (explicit) C = C(n) such that the following holds. Suppose<br />

(M n , g) has Ric ≥ −1 and X t is a vector field with compact support, which<br />

depends on time (piecewise constant). Let c(t) be the integral curve of X t with<br />

c(0) = p 0 ∈ M and assume that X t is divergence free on B 10 (c(t)) for all t ∈ [0, 1].<br />

Let φ t be the flow of X t . Define the distortion function dt r (t)(p, q) of the flow<br />

on scale r by the formula<br />

{<br />

}<br />

dt r (t)(p, q) := min r, max ∣ d(p, q) − d(φ τ (p), φ τ (q))| .<br />

0≤τ≤t<br />

Put ε := ∫ 1<br />

0 Mx(‖∇·X t ‖)(c(t)) dt. Then for any r ≤ 1/10 we have<br />

∫<br />

−<br />

dt r (1)(p, q) dµ(p)dµ(q) ≤ Cr · ε<br />

B r(p 0)×B r(p 0)<br />

and there exists B r (p 0 ) ′ ⊂ B r (p 0 ) such that<br />

vol(B r(p 0) ′ )<br />

vol(B r(p 0))<br />

≥ (1 − Cε) and φ t (B r (p 0 ) ′ ) ⊂ B 2r (c(t)) for all t ∈ [0, 1].<br />

Proof. We prove the statement for a constant in time vector field X t . The general<br />

case is completely analogous except for additional notational problems.<br />

Notice that all estimates are trivial if ε ≥ 2 C<br />

. Therefore it suffices to prove the<br />

statement with a universal constant C(n) for all ε ≤ ε 0 . We put ε 0 = 1/2C and<br />

determine C in the process. We again proceed by induction on the size of r.<br />

Notice that the differential of φ s at c(0) is bilipschitz with bilipschitz constant<br />

e R s<br />

0 ‖∇·X‖(c(t))dt ≤ 1 + 2ε. Thus the <strong>Lemma</strong> holds for very small r.<br />

Suppose the result holds for some r/10 ≤ 1/100. It suffices to prove that it<br />

then holds for r. By induction assumption we know that for any t there exists<br />

B r/10 (c(t)) ′ ⊂ B r/10 (c(t)) such that for any s ∈ [−t, 1 − t] we have<br />

and<br />

vol(B r/10 (c(t)) ′ ) ≥ (1 − Cε) vol(B r/10 (c(t))) ≥ 1 2 vol(B r/10(c(t)))<br />

φ s (B r/10 (c(t)) ′ ) ⊂ B r/5 (c(t + s)),


STRUCTURE OF FUNDAMENTAL GROUPS 17<br />

where we used ε ≤ 1<br />

2C<br />

in the inequality. This easily implies that vol(B r/10(c(t)))<br />

are comparable for all t. More precisely, for any t 1 , t 2 ∈ [0, 1] we have that<br />

(8)<br />

1<br />

C 0<br />

vol B r/10 (c(t 1 )) ≤ vol B r/10 (c(t 2 )) ≤ C 0 vol B r/10 (c(t 1 ))<br />

with a computable universal C 0 = C 0 (n). Put<br />

∫<br />

h(s) = −<br />

dt r (s)(p, q) dµ(p) dµ(q),<br />

B r/10 (p 0) ′ ×B r(c(0))<br />

U := {(p, q) ∈ B r/10 (p 0 ) ′ × B r (c(0)) | dt r (s)(p, q) < r},<br />

φ s (U) := {(φ s (p), φ s (q)) | (p, q) ∈ U},<br />

and dt ′ dt<br />

r(s)(p, q) := lim sup r(s+h)(p,q)<br />

h↘0 h<br />

. Since dt r is monotonously increasing<br />

and bounded by r, we deduce that if dt r (s)(p, q) = r, then dt ′ r(s)(p, q) = 0.<br />

Since φ s is measure preserving, it follows<br />

∫<br />

h ′ d<br />

(s) ≤ −<br />

dt |t=0 dt′ r(0)(p, q)<br />

φ s(U)<br />

∫<br />

≤<br />

4 vol(B3r(c(s))2<br />

vol(B r/10 (c(0)))<br />

− dt ′ r(0)(p, q),<br />

2<br />

B 3r(c(s)) 2<br />

where we used that φ s (B r/10 (p 0 ) ′ ) 2 ⊂ φ s (U) ⊂ B 3r (c(s)) 2 . If p is not in the cut<br />

locus of q and γ pq : [0, 1] → M is a minimal geodesic between p and q then<br />

dt ′ r(0)(p, q) ≤ d(p, q)<br />

∫ 1<br />

0<br />

‖∇·X‖(γ pq (t)) dt.<br />

Combining the last two inequalities with the segment inequality we deduce<br />

h ′ (s) ≤<br />

∫<br />

C 1 (n)r− ‖∇·X‖<br />

B 6r(c(s))<br />

≤ C 1 (n)r Mx ‖∇·X‖(c(s))<br />

and thus h(1) ≤ C 1 (n)rε. Note that the choice of the constant C 1 (n) can be made<br />

explicit and independent of the induction assumption.<br />

This in turn implies that that the subset<br />

{<br />

∫<br />

}<br />

B r (p 0 ) ′ := p ∈ B r (p 0 ) ∣ − dt r (1)(p, q) dµ(q) ≤ r/2<br />

B r/10 (p 0) ′<br />

satisfies<br />

(9)<br />

It is elementary to check that<br />

vol(B r (p 0 ) ′ ) ≥ (1 − 2C 1 (n)ε) vol(B r (p 0 )).<br />

φ t (B r (p 0 ) ′ ) ⊂ B 2r (c(t)) for all t ∈ [0, 1].<br />

Then arguing as before we estimate that<br />

∫<br />

−<br />

dt r (s)(p, q)dµ(p)dµ(q) ≤ C 2 (n) · r · ε.<br />

B r(c(t)) ′ ×B r(c(t))


18 VITALI KAPOVITCH AND BURKHARD WILKING<br />

Using the definition of dt r (s) and the volume estimate (9) this gives<br />

∫<br />

− dt r (s)(p, q) dµ(p) dµ(q) ≤ C 2 (n) · r · ε + 2rC 1 ε =: C 3 rε.<br />

B r(c(t)) 2<br />

This completes the induction step with C(n) = C 3 and ε 0 = 1<br />

2C 3<br />

. In order to<br />

remove the restriction ε ≤ ε 0 one can just increase C 3 by the factor 4, as indicated<br />

at the beginning.<br />

□<br />

Proof of Proposition 3.6. Put g t,i (x) = ‖∇·Xi t ‖(x). First notice that by Hölder<br />

∫ 1<br />

0<br />

Mx(g t,i (x))(c i (t)) dt ≤<br />

∫ 1<br />

0<br />

u t,i (c i (t)) dt → 0.<br />

Let λ i → ∞ and put r i = R λ i<br />

, where R > 1 is arbitrary. By <strong>Lemma</strong> 3.7 there is<br />

S i ⊂ B ri (c i (0)) with<br />

• vol(S i ) ≥ (1 − δ i ) vol(B ri (c i (0))) with δ i → 0 and<br />

• φ t (S i ) ⊂ B 2ri (c(t)) for all t.<br />

In the following we assume that i is so large that δ i ≤ 1/2, and r i ≤ 1/100. As<br />

in the proof of <strong>Lemma</strong> 3.7 this easily implies that there is a universal constant<br />

C = C(n) with<br />

and<br />

Using that φ it|Bri (c(0)) ′<br />

vol(B 2ri (c i(t)))<br />

vol(B ri (c i(0))) ≤ C 2<br />

for all t and all i.<br />

is measure preserving, we deduce<br />

∫ 1<br />

− Mx(g t,i )(φ it (p)) dt dµ(p) ≤ C<br />

∫S i 0<br />

≤<br />

by (5)<br />

C<br />

∫ 1<br />

0<br />

∫ 1<br />

0<br />

∫<br />

−<br />

B 2ri (c i(t))<br />

Mx(g t,i )(q) dt dµ(q)<br />

Mx(Mx(g t,i ))(c i (t)) dt<br />

∫ 1<br />

≤ C · C 2 Mx(gt,i) α 1/α (c i (t)) dt<br />

0<br />

∫ 1<br />

= C · C 2 u t,i (c i (t)) dt → 0.<br />

Thus we can find ˜δ i → 0 and a subset B ri (c i (0)) ′′ ⊂ S i with<br />

vol(B ri (c i (0))) − vol(B ri (c i (0)) ′′ ) ≤ ˜δ i min vol(B r<br />

q∈B ri (c i/R(q))<br />

i(0))<br />

∫ 1<br />

0<br />

Mx(g t,i )(φ it (q)) ≤ ˜δ i for all q ∈ B ri (c(0)) ′′ .<br />

Recall that r i = R λ i<br />

with an arbitrary R. By a diagonal sequence argument it is<br />

easy to deduce that after replacing ˜δ i by a another sequence converging slowly to<br />

0 we can keep the above estimates for r i = Ri<br />

λ i<br />

with R i → ∞ sufficiently slowly.<br />

Combining this with <strong>Lemma</strong> 3.7 shows that f i = φ i1 : (λ i M i , c(0)) → (λ i M i , c(1))<br />

has the zooming in property.<br />

Let ˜X i t be a lift of Xt i to the universal covering ˜M i of M i . Consider the integral<br />

curve ˜c i : [0, 1] → ˜M i of ˜Xt i with ˜c(0) = ˜p i . Clearly ˜c i is a lift of c i and by the<br />

0


STRUCTURE OF FUNDAMENTAL GROUPS 19<br />

Covering <strong>Lemma</strong> (1.6) we have<br />

∫ 1<br />

0<br />

Mx(‖∇· ˜Xt i ‖ α ) 1/α (˜c i (t)) dt ≤ C<br />

∫ 1<br />

0<br />

Mx(‖∇·X t i ‖ α ) 1/α (c i (t)) dt<br />

with some universal constant C. Thus ˜φ i1 : (λ i ˜Mi , ˜p i ) → (λ i ˜Mi , φ i1 (˜p i )) has the<br />

zooming in property as well. Any other lift ˜f i of f i is obtained by composing ˜φ i1<br />

with a deck transformation and thus the result carries over to any lift of f i .<br />

□<br />

Proposition 3.8 (Second main example). Let (M i , g i ) be a sequence of manifolds<br />

with Ric > −1/i on B i (p i ) and B i (p i ) compact. Suppose that (M i , p i ) converges to<br />

(R k × Y, p ∞ ).<br />

Then for each v ∈ R k there is a sequence of diffeomorphisms f i : [M i , p i ] →<br />

[M i , p i ] with the zooming in property which converges in the weakly measured sense<br />

to an isometry f ∞ of R k × Y that acts trivially on Y and by w ↦→ w + v on R k .<br />

Moreover, f i is isotopic to the identity and there is a lift ˜f i : [ ˜M i , ˜p i ] → [ ˜M i , ˜p i ] of<br />

f i to the universal cover which has the zooming in property as well.<br />

Proof. Using the splitting R k = Rv ⊕(v) ⊥ and replacing Y by Y ×(v) ⊥ we see that<br />

it suffices to prove the statement for k = 1.<br />

By the work of Cheeger and Colding [CC96] we can find sequences ρ i → ∞ and<br />

ε i → 0 and harmonic functions b i : B 4ρi (p i ) → R such that<br />

|∇b i | ≤ L(n) for all i and<br />

∫<br />

(∣<br />

− ∣|∇bi | − 1 ∣ + ‖Hessbi ‖ ) 2<br />

≤ ε i for any R ∈ [1/4, 4ρ i ].<br />

B 4R (p i)<br />

Let X i be a vector field with compact support with X i = ∇b i on B 3ρi (p i ), and<br />

let φ it denote the flow of X i . Clearly for any t we can find r i → ∞ such that<br />

φ it|Bri (p i) is measure preserving.<br />

Put ψ i := ∣ ∣|∇b i | − 1 ∣ + ‖Hess(b i )‖. We deduce from <strong>Lemma</strong> 1.4 that<br />

∫<br />

− Mx(ψ i ) 2 ≤ C(n, R)ε i<br />

and Cauchy Schwarz gives<br />

∫<br />

−<br />

B 2R (p i)<br />

B R (p i)<br />

Mx(ψ i ) ≤ √ C(n, R)ε i .<br />

Suppose now that t 0 ≤ R<br />

4L(n) . Then φ t(q) ∈ B 3R (p i ) for all q ∈ B<br />

4<br />

R/2 (p i )<br />

and all t ∈ [−t 0 , t 0 ]. Combining that φ t is measure preserving and vol(B R (p i )) ≤<br />

C 3 vol(B R/2 (p i )), we get that<br />

∫ ∫ t0<br />

∫<br />

−<br />

Mx(ψ i )(φ t (p)) ≤ C 3 t 0 − Mx(ψ)<br />

B R/2 (p i)<br />

0<br />

It is now easy to find R i → ∞ and δ i → 0 with<br />

B R (p i)<br />

≤ C 4 (t 0 , R, n) √ ε i .<br />

∫ ∫ t0<br />

− Mx(ψ i )(φ t (p)) ≤ δ i for all R ∈ [1, R i ].<br />

B R (p i) 0


20 VITALI KAPOVITCH AND BURKHARD WILKING<br />

After an adjustment of the sequences δ i → 0 and R i → ∞ we can find B Ri (p i ) ′ ⊂<br />

B Ri (p i ) with<br />

vol(B Ri (p i )) − vol(B Ri (p i ) ′ ) ≤ δ i vol(B 1 (q)) for all q ∈ B Ri (p i ) and<br />

∫ t0<br />

0<br />

Mx(ψ i )(φ t (q)) ≤ δ i for all q ∈ B Ri (p i ) ′ .<br />

Moreover, the displacement of φ it0 is globally bounded by L(n)t 0 and thus,<br />

<strong>Lemma</strong> 3.7 implies that φ it0 has the zooming in property.<br />

As in the proof of the Product <strong>Lemma</strong> (2.1) we see that b i converges to the<br />

projection b ∞ : R × Y → R. In particular b ∞ is a submetry.<br />

For all q i ∈ B Ri (p i ) ′ the length of the integral curve c i (t) = φ it (q) (t ∈ [0, t 0 ]) is<br />

bounded above by t 0 +δ i . Moreover, b i (c(t 0 ))−b i (c i (0)) is bounded below by t 0 −δ i .<br />

This implies that φ ∞t0 satisfies d(p, φ ∞t0 (p)) ≤ t 0 and b ∞ (φ ∞t0 (q)) − b ∞ (p) ≥ t 0 .<br />

Clearly, equality must hold and φ ∞t (q) is a horizontal geodesic with respect to the<br />

submetry b ∞ . This in turn implies that φ ∞t0 respects the decomposition Y × R, it<br />

acts by the identity on the first factor and by translation on the second as claimed.<br />

Let σ : ˜Mi → M i denote the universal cover, ˜p i ∈ ˜M i a lift of p i , and let ˜X i be a<br />

lift of X i . By <strong>Lemma</strong> 1.6 all estimates for X i on B R (p i ) give similar estimates for<br />

˜X i on B R (˜p i ). Thus the flow ˜φ it of ˜Xi has the zooming in property as well. □<br />

4. A rough idea of the proof of the <strong>Margulis</strong> <strong>Lemma</strong>.<br />

The proof of the <strong>Margulis</strong> <strong>Lemma</strong> is somewhat indirect and it might not be easy<br />

for everyone to grasp immediately how things interplay. In this section we want to<br />

give a rough idea of how the arguments would unwrap in a simple but nontrivial<br />

case.<br />

Let p i → ∞ be a sequence of odd primes and let Γ i := Z ⋉ Z pi be the semidirect<br />

product where the homomorphism Z → Aut(Z pi ) maps 1 ∈ Z to the automorphism<br />

ϕ i given by ϕ i (z + p i Z) = 2z + p i Z.<br />

Suppose, contrary to the <strong>Margulis</strong> <strong>Lemma</strong>, we have a sequence of compact n-<br />

manifolds M i with Ric > −1/i and diam(M i ) = 1 and fundamental group Γ i .<br />

A typical problem situation would be that M i converges to a circle and its<br />

universal cover ˜M i converges to R.<br />

We then replace M i by B i = ˜M i /Z pi and in order not to loose information we<br />

endow B i with the deck transformation f i : B i → B i representing a generator of<br />

Γ i /Z pi . Then B i will converge to R as well.<br />

It is part of the rescaling theorem that one can find λ i → ∞ such that the<br />

rescaled sequence λ i B i converges to R × K with K being compact but not equal to<br />

a point. Suppose for illustration that λ i B i converges to R × S 1 and λ i ˜Mi converges<br />

to R 2 and that the action of Z pi converges to a discrete action of Z on R 2 .<br />

The maps f i : λ i B i → λ i B i do not converge, because typically f i would map a<br />

base point x i to some point y i = f i (x i ) with d(x i , y i ) = λ i → ∞ with respect to<br />

the rescaled distance.<br />

But a second statement in the rescaling theorem guarantees that we can find a<br />

sequence of diffeomorphisms g i : [λ i B i , y i ] → [λ i B i , x i ] with the zooming in property.<br />

The composition f new,i := f i ◦ g i : [λ i B i , x i ] → [λ i B i , x i ] also has the zooming<br />

in property and thus converges to an isometry of the limit.


STRUCTURE OF FUNDAMENTAL GROUPS 21<br />

Moreover, a lift ˜f new,i : λ i ˜Mi → λ i ˜Mi of f new,i has the zooming in property, too.<br />

Since g i can be chosen isotopic to the identity, the action of ˜fnew,i on the deck<br />

transformation group Z pi = π 1 (B i ) by conjugation remains unchanged.<br />

On the other hand, the Z pi -action on ˜M i converges to a discrete Z-action on<br />

R 2 and ˜f new,i converges to an isometry ˜f new,∞ of R 2 normalizing the Z-action.<br />

This implies that ˜f new,∞ 2 commutes with the Z-action and it is then easy to get a<br />

contradiction.<br />

Problem. We suspect that for any given n and D only finitely many of the groups<br />

in the above family of groups should occur as fundamental groups of compact n-<br />

manifolds with Ric > −(n − 1) and diameter ≤ D. Note that this is not even<br />

known under the stronger assumption of K > −1 and diam ≤ D.<br />

5. The Rescaling Theorem.<br />

Theorem 5.1 (Rescaling Theorem). Let (M, g i , p i ) be a sequence of n-manifolds<br />

satisfying Ric Bri (p i) > −µ i and B ri (p i ) is compact for some r i → ∞, µ i → 0.<br />

Suppose that (M, g i , p i ) → (R k , 0) for some k < n. Then after passing to a subsequence<br />

we can find a compact metric space K with diam(K) = 10 −n2 , a sequence<br />

of subsets G 1 (p i ) ⊂ B 1 (p i ) with vol(G1(pi))<br />

vol(B → 1 and a sequence λ 1(p i)<br />

i → ∞ such that<br />

the following holds<br />

• For all q i ∈ G 1 (p i ) the isometry type of the limit of any convergent subsequence<br />

of (λ i M i , q i ) is given by the metric product R k × K.<br />

• For all a i , b i ∈ G 1 (p i ) we can find a sequence of diffeomorphisms<br />

f i : [λ i M i , a i ] → [λ i M i , b i ]<br />

with the zooming in property such that f i is isotopic to the identity. Moreover,<br />

for any lift ã i , ˜b i ∈ ˜M i of a i and b i to the universal cover ˜M i we can<br />

find a lift ˜f i of f i such that<br />

˜f i : [λ i ˜Mi , ã i ] → [λ i ˜Mi , ˜b i ]<br />

has the zooming in property as well.<br />

Finally, if π 1 (M i , p i ) is generated by loops of length ≤ R for all i, then we can<br />

find ε i → 0 such that π 1 (M i , q i ) is generated by loops of length ≤ 1+εi<br />

λ i<br />

for all<br />

q i ∈ G 1 (p i ).<br />

Proof. By Theorem 1.3, after passing to a subsequence we can find a harmonic map<br />

b i : B i (p i ) → B i+1 (0) ⊂ R k that gives a 1 2<br />

Gromov–Hausdorff approximation from<br />

i<br />

B i (p i ) to B i (0) ⊂ R k . Put<br />

h i =<br />

k∑<br />

| < ∇b i j, ∇b i l > −δ j,l | + ∑ j<br />

j,l=1<br />

‖Hess b i<br />

j<br />

‖ 2 .<br />

After passing to a subsequence we also have by Theorem 1.3 that<br />

∫<br />

− h i ≤ ε 4 i for all R ∈ [1, i] with ε i → 0.<br />

B R (p i)<br />

Furthermore, we may assume Ric > −ε i on B i (0).<br />

(10) G 4 (p i ) := { x ∈ B 4 (p i ) | Mx h i (x) ≤ ε 2 i<br />

}<br />

.


22 VITALI KAPOVITCH AND BURKHARD WILKING<br />

We will call elements of G 4 (p i ) ”good points” in B 4 (p i ). We put G r (q) = B r (q) ∩<br />

G 4 (p i ) for q ∈ B 2 (p i ) and r ≤ 2. It is immediate from the weak 1-1 inequality<br />

(<strong>Lemma</strong> 1.4) that<br />

vol(G 1(p i))<br />

vol(B 1(p i)) → 1.<br />

By the Product <strong>Lemma</strong> (2.1), for any q i ∈ G 4 (p i ) and any h ≥ 1 we have that<br />

hB 1/h (q i ) is ˜ε i close to a ball in a product R k × Y where ˜ε i → 0 can be chosen<br />

independently of h and q i . Note that the space Y may depend on h and q i .<br />

For any point q i ∈ G 1 (p i ) let ρ i (q i ) be the largest number ρ < 1 such that the<br />

map b i : B ρ (q i ) → R k has distance distortion exactly equal to ρ · 10 −n2 . For any<br />

choice q i ∈ G 1 (p i ) we know that ρ i (q i ) → 0 and by the Product <strong>Lemma</strong> (2.1)<br />

(<br />

1<br />

ρ M )<br />

i(q i) i, q i subconverges to R k × K where K is compact with diam(K) = 10 −n2 .<br />

Again note, that a priori, the constants ρ i (q i ) and the space K depend on the<br />

choice of q i ∈ B 1 (p i ) ′ . We will show that, in fact, both are essentially independent<br />

of the choice of good points in B 1 (p i ).<br />

For each i we define the number ρ i as the supremum of ρ i (q i ) with q i ∈ G 1 (p i )<br />

and put<br />

λ i = 1 ρ i<br />

.<br />

The statement of the theorem will follow easily from the following sublemma.<br />

Sublemma 5.2. There exist constants ¯C 1 , ¯C 2 independent of i such that the following<br />

holds. For any good point p ∈ G 1 (p i ) and any r ≤ 2 there exists B r (p) ′ ⊂ B r (p)<br />

such that<br />

(11) vol B r (p) ′ ≥ (1 − ¯C 1 rε i ) vol B r (p)<br />

and such that for any q ∈ B r (p) ′ there exists p ′ ∈ B Lρi (p) (with L = 9 n ) and a time<br />

dependent (piecewise constant in time) divergence free vector field X t (dependence<br />

on q is suppressed) and its integral curve c q (t): [0, 1] → B 2 (p i ) such that c(0) =<br />

p ′ , c(1) = q and<br />

1<br />

vol(B r(p))<br />

∫B r(p) ′ ∫ 1<br />

0<br />

(<br />

Mx(‖∇·X t ‖ 3/2 ) ) 2/3<br />

(cq (t)) dt dµ(q) ≤ ¯C 2 rε i .<br />

Proof. Despite the fact that L can be chosen as L = 9 n we treat it for now as a<br />

large constant L ≥ 10 which needs to be determined. Throughout the proof we will<br />

denote by C j various constants depending on n. Sometimes these constants will<br />

also depend on L in which case that dependence will always be explicitly stated.<br />

Although we consider a fixed i for the major part of the proof, we take the<br />

liberty of assuming that i is large. This way we can ensure that for any r ≥ ρ i<br />

1<br />

and any good point p ∈ G 1 (p i ) the ball<br />

Lr B 2Lr(p) is by the Product <strong>Lemma</strong> in the<br />

measured Gromov–Hausdorff sense arbitrarily close to a ball in a product R k × K<br />

where by our choice of ρ i , K has diameter ≤ 10 −n2 . This implies, for example, that<br />

we can assume that<br />

(<br />

1<br />

(12)<br />

r1<br />

) k vol(B<br />

2 r 2<br />

≤<br />

r1 (q 1))<br />

vol(B r2 (q ≤ 2( )<br />

r 1 k<br />

2)) r 2<br />

for all q1 , q 2 ∈ B Lr (p), r 1 , r 2 ∈ [r/20, 2Lr].<br />

The sublemma is trivially true for r ≤ Lρ i with X ≡ 0. By induction it suffices<br />

to show that if the sublemma holds for some r ∈ (ρ i , 2 L<br />

] then it holds for Lr.


STRUCTURE OF FUNDAMENTAL GROUPS 23<br />

Let q 1 , . . . q l be a maximal r/2-separated net in B Lr (p). Note that<br />

l⋃<br />

B Lr (p) ⊂ B r/2 (q m ).<br />

m=1<br />

By a standard volume argument we can deduce from (12) that<br />

(13)<br />

∑ l<br />

m=1 vol B r/2(q m )<br />

≤ 3 k+1 ≤ 3 n .<br />

vol B Lr (p)<br />

Fix q m and consider the following vector field<br />

k∑<br />

X(x) = (b i α(p) − b i α(q m ))∇b i α(x).<br />

α=1<br />

Since b i α are Lipschitz with a universal Lipschitz constant by (1), X satisfies<br />

(14) |X(x)| ≤ C(n) · Lr.<br />

Also, by construction, X satisfies<br />

(15) Mx ‖∇·X‖ 2 (p) ≤ C 4 ε 2 i L 2 r 2 .<br />

Note that we get an extra L 2 r 2 factor as compared to (10) because |b i (p)−b i (q m )| ≤<br />

C(n)Lr. By applying (6) we get<br />

(<br />

Mx [Mx(‖∇·X‖ 3/2 )] 4/3) (p) ≤ C(n) Mx(‖∇·X‖ 2 )(p) ≤ (C 5 Lε i r) 2 .<br />

In particular, for R 1 = 2C(n)Lr<br />

∫<br />

− [Mx(‖∇·X‖ 3/2 )] 4/3 (p) ≤ (C 5 Lε i r) 2<br />

B R1 (p)<br />

provided R 1 ≤ 1. However, in the case of R 1 ∈ [1, 4C(n)] the same follows more<br />

directly from our initial assumptions combined with <strong>Lemma</strong> 1.4 for all large i. By<br />

Cauchy inequality the last estimate gives<br />

∫<br />

(<br />

(16) − Mx(‖∇·X‖ 3/2 ) ) 2/3<br />

(p) ≤ C5 Lε i r.<br />

B R1 (p)<br />

Consider the measure preserving flow φ of X on [0, 1]. Because of (14) the flow<br />

lines φ t (q) with q ∈ B r (p) stay in the ball B R1 (p) for t ∈ [0, 1]. We choose a<br />

universal constant C 6 (L) with C 5 L · vol(B 2C(n)Lr(p))<br />

vol(B r/10 (q m)<br />

≤ C 6 (L)<br />

Combining that the flow is measure preserving, inequality (16) and our choice<br />

of C 6 we see that<br />

(17)<br />

∫<br />

1<br />

vol(B r (q 10 m))<br />

B r(q m)<br />

∫ 1<br />

0<br />

(<br />

Mx(‖∇·X‖ 3/2 ) ) 2/3<br />

(φt (x)) dt dµ(x) ≤ C 6 (L)rε i .<br />

By the Product <strong>Lemma</strong> (2.1) applied to the rescaled balls<br />

1<br />

Lr B Lr(p) we know<br />

that 1 Lr B Lr(p) is measured Gromov–Hausdorff close to a unit ball in R k ×K 3 where<br />

diam K 3 ≤ 10 −n2 . Moreover, φ 1 is measured close to a translation by b i (p)−b i (q m )<br />

in R k , see proof of Proposition 3.8. Since we only need to argue for large i, we may


24 VITALI KAPOVITCH AND BURKHARD WILKING<br />

assume that by volume 3/4 of the points in B r/10 (q m ) are mapped by φ 1 to points<br />

in B r/9 (p).<br />

We choose such a point q ∈ B r/10 (q m ). In view of (17) we may assume in<br />

addition that<br />

∫ 1<br />

By <strong>Lemma</strong> 3.7 this implies<br />

0<br />

Mx(‖∇·X‖ 3/2 ) 2/3 (φ t (q)) dt ≤ 4 3 C 6(L)rε i .<br />

∫<br />

(18) − dt r (1)(x, y) dµ(x) dµ(y) ≤ C 7 (L)r · ε i .<br />

B r(q)×B r(q)<br />

Note that by definition, dt r (1)(x, y) ≥ min{r, |d(φ 1 (x), φ 1 (y)−d(x, y)|}. Combining<br />

this inequality with the knowledge that 3/4 of the points in B r/10 (q m ) end up in<br />

B 1/9r (p), implies that we can find a subset B r/2 (q m ) ′ ⊂ B r/2 (q m ) with<br />

(19)<br />

(20)<br />

vol(B r/2 (q m ) ′ ) ≥ (1 − C 8 (L)ε i r) vol(B r/2 (q m )) and<br />

φ 1 (B r/2 (q m ) ′ ) ⊂ B r (p).<br />

Set B r/2 (q m ) ′′ := B r/2 (q m ) ′ ∩ φ −1<br />

1 (B r(p) ′ ). Then we get<br />

φ 1 (B r/2 (q m ) ′′ ) ⊂ B r (p) ′<br />

and<br />

vol(B r/2 (q m ) ′′ ) ≥ vol(B r/2 (q m ) ′ ) − (vol B r (p) − vol B r (p) ′ )<br />

(21)<br />

by (19) and (11)<br />

≥ (1 − C 8 (L)rε i ) vol B r/2 (q m ) − ¯C 1 rε i vol B r (p)<br />

by (12)<br />

≥ (1 − C 8 (L)rε i ) vol B r/2 (q m ) − 2 k+1 ¯C1 rε i vol B r/2 (q m )<br />

≥ (1 − (C 8 (L) + 2 n ¯C1 )rε i ) vol B r/2 (q m ),<br />

where we used that φ 1 is volume-preserving in the first inequality.<br />

Using the induction assumption, we can prolong each integral curve from any<br />

q ∈ B r/2 (q m ) ′′ to a point x ∈ B r (p) ′ by extending it by a previously constructed<br />

integral curve from x to p ′ ∈ B Lρi (p) of a vector field Xold t (which depends on p′ ).<br />

We set Xnew t = 2Xold 2t for 0 ≤ t ≤ 1/2 and Xt new = −2X for 1/2 ≤ t ≤ 1. Let c q (t)<br />

be the integral curve of Xnew t with c q (1) = q and c q (0) = p ′ .<br />

By the induction assumption and (17), we get<br />

(22)<br />

∫B r/2 (q m) ′′ ∫ 1<br />

0<br />

(Mx ‖∇·X t new‖ 3/2 ) 2/3 (c q (t)) dt dµ(q) ≤<br />

≤ C 6 (L)(rε i ) vol B r/2 (q m ) + ¯C 2 (rε i ) vol B r (p)<br />

(12)<br />

≤ C 6 (L)(rε i ) vol B r/2 (q m ) + 2 n ¯C2 (rε i ) vol B r/2 (q m )<br />

= ( C 9 (L) + 2 n ¯C2<br />

)<br />

(rεi ) vol B r/2 (q m ).<br />

Recall that the balls B r/2 (q m ) cover B Lr (p). We put<br />

B Lr (p) ′ := B Lr (p) ∩ ⋃ m<br />

B r/2 (q m ) ′′ .


STRUCTURE OF FUNDAMENTAL GROUPS 25<br />

By construction, for every point in B Lr (p) ′ there exists a vector field X t whose<br />

integral curve connects it to a point in B Lρi (p) satisfying (22). For points covered<br />

by several sets B r/2 (q m ) ′′ we pick any one. Then we have<br />

∫B Lr (p) ′ ∫ 1<br />

0<br />

Mx ( ‖∇·Xnew‖ t ) 3 2<br />

2 3<br />

(c q (t)) dt dµ(q) ≤<br />

≤<br />

(13)<br />

≤<br />

≤<br />

l∑<br />

(C 9 (L) + 2 n ¯C2 )(rε i ) vol B r/2 (q m )<br />

m=1<br />

3 n (C 9 (L) + 2 n ¯C2 )(rε i ) vol B rL (p)<br />

¯C 2 · Lrε i · vol B rL (p).<br />

The last inequality holds if L = 9 n ≥ 2 · 2 n · 3 n and ¯C 2 ≥ 3 n · C 9 (L).<br />

Recall that by (21)<br />

vol B r/2 (q m ) − vol B r/2 (q m ) ′′ ≥ (C 8 (L) + 2 n ¯C1 )rε i vol B r/2 (q m )<br />

for any m and thus<br />

vol B Lr (p) − vol B Lr (p) ′<br />

≤<br />

(13)<br />

≤<br />

≤<br />

l∑<br />

(C 8 (L) + 2 n ¯C1 )rε i vol B r/2 (q m )<br />

m=1<br />

provided that L = 9 n ≥ 2 · 6 n and ¯C 1 ≥ 3 n C 8 (L).<br />

This finishes the proof of Sublemma 5.2.<br />

3 n (C 8 (L) + 2 n ¯C1 )rε i vol(B Lr (p))<br />

¯C 1 Lrε i vol B Lr (p)<br />

Observe that for any two good points x i , y i ∈ G 1 (p i ) we can deduce from Sublemma<br />

5.2 that vol(B 2 (x i ) ′ ∩ B 2 (y i ) ′ ) ≥ vol(B 1/2 (p i )) for all large i.<br />

Then it follows, also by Sublemma 5.2, that there is q ∈ B 2 (x i ) ′ ∩ B 2 (y i ) ′ ,<br />

x ′ i ∈ B Lρ i<br />

(x i ) and y i ′ ∈ B Lρ i<br />

(y i ) such that for the integral curve c connecting x ′ i<br />

with q on the first half of the interval and q with y i ′ on the second half we have for<br />

the corresponding time dependent vector field Xi<br />

t<br />

∫ 1<br />

(<br />

Mx(‖∇·X t i ‖ 3/2 ) ) 2/3<br />

(c(t)) dt ≤ ¯C3 ε i<br />

□<br />

0<br />

with a constant ¯C 3 independent of i.<br />

Let f i := φ i1 be the flow of X t i evaluated at time 1. Since λ i = 1 ρ i<br />

, we can<br />

employ Proposition 3.6 to see that<br />

f i : [λ i M i , x i ] → [λ i M i , y i ]<br />

has the zooming in property. Thus, the Gromov–Hausdorff limits of the two sequences<br />

are isometric.<br />

For any lifts ˜x i and ỹ i of x i and y i to the universal cover we can lifts ˜x ′ i and ỹ′ i<br />

of x ′ i and y′ i in the Lρ i neighbourhoods of ˜x i and ỹ i . Let ˜f i : ˜Mi → ˜M i be the lift<br />

f i with ˜f i (˜x ′ i ) = ỹ′ i . Proposition 3.6 ensures that ˜f i : [λ i ˜Mi , ˜x i ] → [λ i ˜Mi , ỹ i ] has the<br />

zooming in property as well.<br />

It remains to check the last part of the Rescaling Theorem concerning the fundamental<br />

group. Assume that π 1 (M i , p i ) is generated by loops of length ≤ R.


26 VITALI KAPOVITCH AND BURKHARD WILKING<br />

For every good point q ∈ B 1 (p i ) let r i (q) denote the minimal number such that<br />

π 1 (M i , q) can be generated by loops of length ≤ r i (q). Let r i denote the supremum<br />

of r i (q) over all good points q and choose a good point q i with r i (q i ) ≥ 1 2 r i. It<br />

suffices to check that lim sup i→∞ λ i r i ≤ 1. Suppose we can find a subsequence<br />

with λ i r i ≥ 1/4 for all i. By the Product <strong>Lemma</strong> (2.1), the sequence ( 1 r i<br />

M i , q i )<br />

subconverges to (R k × K ′ , q ∞ ) with diam(K ′ ) ≤ 4 · 10 −n2 . Combining <strong>Lemma</strong> 2.2<br />

with <strong>Lemma</strong> 2.3 we see that π 1 ( 1 r i<br />

M i , q i ) can be generated by loops of length<br />

≤ 2 diam(K ′ ) + c i with c i → 0 – a contradiction.<br />

□<br />

6. The Induction Theorem for C-Nilpotency<br />

C-nilpotency of fundamental groups of manifolds with almost nonnegative Ricci<br />

curvature (Corollary 2) will follow from the following technical result.<br />

Theorem 6.1 (Induction Theorem). Suppose (M n i , p i) is a sequence of pointed<br />

n-dimensional Riemannian manifolds (not necessarily complete) satisfying<br />

(1) Ric Mi ≥ −1/i;<br />

(2) B i (p i ) is compact for any i;<br />

(3) There is some R > 0 such that π 1 (B R (p i )) → π 1 (M i ) is surjective for all i;<br />

(4) (M i , p i ) G−H<br />

−→<br />

i→∞ (Rk × K, (0, p ∞ )) where K is compact.<br />

Suppose in addition that we have k sequences f j i : [ ˜M i , ˜p i ] → [ ˜M i , ˜p i ] of diffeomorphisms<br />

of the universal covers ˜M i of M i which have the zooming in property,<br />

where ˜p i is a lift of p i , and which normalize the deck transformation group acting<br />

on ˜M i , j = 1, . . . k.<br />

Then there exists a positive integer C such that for all sufficiently large i, π 1 (M i )<br />

contains a nilpotent subgroup N ⊳ π 1 (M i ) of index at most C such that N has an<br />

(f j i )C! -invariant (j = 1, . . . , k) cyclic nilpotent chain of length ≤ n − k, that is:<br />

We can find {e} = N 0 ⊳ · · · ⊳ N n−k = N such that [N, N h ] ⊂ N h−1 and each<br />

factor group N h+1 /N h is cyclic. Furthermore, each N h is invariant under the action<br />

of (f j i )C! by conjugation and the induced automorphism of N h /N h+1 is the identity.<br />

Although we will have f j i = id in the applications, it is crucial for the proof of<br />

the theorem by induction to establish it in the stated generality.<br />

Proof. The proof proceeds by reverse induction on k. For k = n the result follows<br />

immediately from Theorem 1.2. We assume that the statement holds for all k ′ > k<br />

and we plan to prove it for k.<br />

We argue by contradiction. After passing to a subsequence we can assume that<br />

any subgroup N ⊂ π 1 (M i ) of index ≤ i does not have nilpotent cyclic chain of<br />

length ≤ n − k which is invariant under (f j i )i! (j = 1, . . . , k).<br />

After passing to a subsequence, ( ˜M i , ˜p i ) converges to (R¯k × ˜K, (0, ˜p ∞ )), where<br />

˜K contains no lines. Moreover, we can assume that the action of π 1 (M i ) converges<br />

with respect to the pointed equivariant Gromov–Hausdorff topology to an isometric<br />

action of a closed subgroup G ⊂ Iso(R¯k × ˜K).<br />

It is clear that the metric quotient (R¯k × ˜K)/G is isometric to R k ×K. Using that<br />

lines in R k × K can be lifted to lines in R¯k × ˜K, we deduce that R¯k = R k × R l and<br />

the action of G on the first Euclidean factor is trivial, (cf. proof of <strong>Lemma</strong> 2.3).<br />

Since the action of G on R l × ˜K and hence on ˜K is cocompact, we can use an


STRUCTURE OF FUNDAMENTAL GROUPS 27<br />

observation of Cheeger and Gromoll [CG72] to deduce that ˜K is compact for there<br />

are no lines in ˜K.<br />

By passing once more to a subsequence, we can assume that f j i converges in the<br />

weakly measured sense to an isometry f∞ j on the limit space, j = 1, . . . , k. The<br />

overall most difficult step is essentially a consequence of the Rescaling Theorem:<br />

Step 1. Without loss of generality we can assume that K is not a point.<br />

We assume K = pt. The strategy is to find a new contradicting sequence converging<br />

to R k × K ′ where K ′ ≠ pt.<br />

After rescaling every manifold down by a fixed factor we may assume that the<br />

limit isometry f∞ j displaces (0, p ∞ ) by less than 1/100, j = 1, . . . , k. We choose<br />

the set of ”good” points G 1 (p i ) as in Rescaling Theorem 5.1. We let G 1 (˜p i ) denote<br />

those points in B 1 (˜p i ) projecting to G 1 (p i ). By <strong>Lemma</strong> 1.6, vol(G1(˜pi))<br />

vol(B → 1.<br />

1(˜p i))<br />

By <strong>Lemma</strong> 3.5, we can remove small subsets H i ⊂ G 1 (p i ) (and the corresponding<br />

subsets from G 1 (˜p i )) such that ≤ δ i → 0 and for any choice of points<br />

vol H i<br />

vol G 1(p i)<br />

˜q i ∈ G 1 (˜p i )\H i the sequence f j i and (f j i )−1 (i ∈ N) is good on all scales at ˜q i ,<br />

j = 1, . . . , k. To simplify notations we will assume that it is already true for all<br />

choices of ˜q i ∈ G 1 (˜p i ).<br />

For all large i we can choose a point ˜q i ∈ G 1/2 (˜p i ) with f j i (˜q i) ∈ G 1 (˜p i ), j =<br />

1, . . . , k.<br />

Let q i be the image of ˜q i in M i and ¯f j i : M i → M i be the induced diffeomorphism.<br />

By Rescaling Theorem 5.1, there is a sequence g j i of isotopic to identity diffeomorphisms<br />

g j i : [λ iM i , ¯f j i (q i)] → [λ i M i , q i ]<br />

with the zooming in property. Moreover, by the same theorem, we can find lifts ˜g j i<br />

of g j i such that ˜g j i : [λ i ˜M i , f j i (˜q i)] → [λ i ˜Mi , ˜q i ]<br />

has the zooming in property. Using <strong>Lemma</strong> 3.4 we see that<br />

f j i,new := f j i ◦ ˜gj i : [λ i ˜M i , ˜q i ] → [λ i ˜Mi , ˜q i ]<br />

has the zooming in property as well for j = 1, . . . , k. Since g j i is isotopic to the<br />

identity, it follows that conjugation by ˜g j i induces an inner automorphism of π 1(M i ).<br />

Therefore f j i,new produces the same element αj i ∈ Out(π 1(M i )) as f j i .<br />

The Rescaling Theorem also ensures that π 1 (λ i M i , q i ) remains boundedly generated.<br />

Finally, it states that (λ i M i , q i ) converges to (R k × K, (0, q ∞ )) with K being<br />

a compact space with diam(K) = 10 −n2 .<br />

Thus, (λ i M i , q i ) and the maps f j i,new on the universal covers give a new contradicting<br />

sequence with the limit satisfying K ≠ pt.<br />

From now on we will assume that it’s true for the original contradicting sequence.<br />

Step 2. Without loss of generality we can assume that f j i converges in the measured<br />

sense to the identity map of the limit space R k × R l × ˜K, j = 1, . . . , k.<br />

We prove this by finite induction on j. Suppose we already found a contradicting<br />

sequence where fi 1, . . . , f j−1<br />

i converge to the identity. We have to construct one<br />

where in addition f i := f j i converges to the identity.


28 VITALI KAPOVITCH AND BURKHARD WILKING<br />

We first consider the induced diffeomorphisms ¯f i : M i → M i which converge to an<br />

isometry ¯f ∞ of R k × K. Thus, there is A ∈ O(k) and v ∈ R k such that the induced<br />

isometry of the Euclidean factor pr( ¯f ∞ ): R k → R k is given by (w ↦→ Aw + v).<br />

We claim that we can assume without loss of generality that v = 0. In fact,<br />

by Proposition 3.8, we can find a sequence of diffeomorphisms g i : M i → M i with<br />

the zooming in property such that g i converges to an isometry g ∞ which induces<br />

translation by −v on the Euclidean factor. Moreover, there is a lift ˜g i : ˜Mi → ˜M i<br />

which also has the zooming in property if we endow ˜M i with the base point ˜p i . By<br />

<strong>Lemma</strong> 3.4, we are free to replace f i by ˜g i ◦ f i and hence v = 0 without loss of<br />

generality.<br />

Since ¯f ∞ fixes the origin of the Euclidean factor, we obtain that for the limit f ∞<br />

of f i the following property holds. Given any m > 0 we can find g m ∈ G such that<br />

d(f∞(g m m (0, ˜p ∞ )), (0, ˜p ∞ )) ≤ diam(K) in R k × R l × ˜K.<br />

It is now an easy exercise to find a sequence ν b of natural numbers and a sequence<br />

of g b ∈ G for which f ν b<br />

∞ ◦ g b converges to the identity:<br />

We consider a finite ε-dense set {a 1 , . . . , a N } in the ball of radius 1 ε<br />

around<br />

(0, ˜p ∞ ) ∈ R k+l × ˜K. For each integer m choose a g m ∈ G such that<br />

d ( f m ∞(<br />

gm (0, ˜p ∞ ) ) , (0, p ∞ ) ) ≤ diam(K).<br />

The elements (f∞(g m m a 1 ), · · · , f∞(g m m a N )) are contained in B 1/ε+diam(K ′ )(0, ˜p ∞ ).<br />

Thus, there are m 1 ≠ m 2 such that d(f∞ m1 (g m1 a j ), f∞ m2 (g m2 a j )) < ε for j =<br />

1, . . . , N. Consequently, d(f∞<br />

m1−m2 (ga j ), a j ) < ε for j = 1, . . . , N with<br />

g = f∞<br />

m2−m1 gm −1<br />

1<br />

f∞ m1−m2 g m2 ∈ G.<br />

In summary, f∞ m1−m2 ◦ g displaces any point in the ball of radius 1 ε around (0, p ∞)<br />

by at most 4ε. Since ε was arbitrary, this clearly proves our claim.<br />

Let (ν b , g b ) be as above. For each b we choose a large i = i(b) ≥ 2ν b and<br />

g b ∈ π 1 (M i ) such that f ν b<br />

i ◦ g b is in the measured sense close to f ν b<br />

∞ ◦ g b . We also<br />

choose i so large that (f ν b<br />

i(b) ◦ g b) b∈N still has the zooming in property and converges<br />

to the identity. This finishes the proof of Step 2.<br />

Further Notations. We may replace p i with any other point p ∈ B 1/2 (p i ). Thus<br />

we may assume that p ∞ is a regular point in K. Let π 1 (M i , p i , r) = π 1 (M i , p i )(r)<br />

denote the subgroup of π 1 (M, p i ) generated by loops of length ≤ r. Similarly, recall<br />

that G(r) is the group generated by elements which displace (0, ˜p ∞ ) by at most r.<br />

By the Gap <strong>Lemma</strong> (2.4) there is an ε > 0 and ε i → 0 such that π 1 (M i , p i , ε) =<br />

π 1 (M i , p i , ε i ). Moreover, G(r) = G(ε) for all r ∈ (0, 2ε]. We put Γ i := π 1 (M i , p i )<br />

and Γ iε = π 1 (M i , p i , ε).<br />

Step 3. The desired contradiction arises if the index [Γ i : Γ iε ] is bounded by some<br />

constant C for all large i.<br />

If C denotes a bound on the index and d = C!, then (f j i )d leaves the subgroup Γ iε<br />

invariant and we can find a new contradicting sequence for the manifolds ˜M i /Γ iε .<br />

Thus we may assume that Γ i = Γ iε .<br />

As we have observed above, π 1 (M i , p i ) = π 1 (M i , p i , ε i ) for some ε i → 0. We now<br />

choose a sequence λ i → ∞ very slowly such that<br />

• (λ i M i , p i ) → (R k × C p∞ K, o) = (R k′ , 0) with k ′ > k.<br />

• π 1 (λ i M i , p i ) is generated by loops of length ≤ 1.


STRUCTURE OF FUNDAMENTAL GROUPS 29<br />

• f j i : λ ˜M i i → λ i ˜Mi still has the zooming in property and still converges to<br />

the identity of the limit space.<br />

We put f k+1<br />

i = · · · = fi<br />

k′ = id and obtain a new sequence contradicting our<br />

induction assumption.<br />

Step 4. There is H > 0 and a sequence of uniformly open subgroups (see Definition<br />

1.7) Υ i ⊂ Γ iε such that the index [Γ iε : Υ i ] ≤ H and such that Υ i is normalized<br />

by a subgroup of Γ i with index ≤ H for all large i.<br />

After passing to a subsequence we may assume that Γ iε converges to the limit<br />

action of a closed subgroup G ε ⊂ G. Clearly G ε contains the open subgroup G(ε) ⊂<br />

G. Since the kernel of pr is compact (as a closed subgroup of Iso( ˜K)), the images<br />

pr(G) and pr(G ε ) are closed subgroups. Moreover, pr(G ε ) is an open subgroup of<br />

pr(G) since it contains pr(G(ε)) ⊃ pr(G) 0 . Using Fukaya and Yamaguchi [FY92,<br />

Theorem 4.1] or that the component group of pr(G) is the fundamental group of the<br />

nonnegatively curved manifold pr(G)\ Iso(R l ), we see that pr(G)/ pr(G) 0 is virtually<br />

abelian.<br />

Choose a subgroup G ′ ⊳ G of finite index such that pr(G ′ )/ pr(G) 0 is free abelian.<br />

Let G ′ ε = G ′ ∩ G ε . Then pr(G) 0 ⊂ pr(G ′ ε) ⊂ pr(G ′ ). Moreover, pr(G ′ ε) is normal in<br />

pr(G ′ ) and pr(G ′ )/ pr(G ′ ε) is a finitely generated abelian group.<br />

The inverse image Ĝε := pr −1 (pr(G ε )) ∩ G ′ is a normal subgroup of G ′ with the<br />

same quotient, i.e. G ′ /Ĝε = pr(G ′ )/ pr(G ′ ε). Since the kernel of pr is compact, the<br />

open subgroup G ′ ε is cocompact in Ĝε and thus its index [Ĝε : G ′ ε] is finite.<br />

In particular, we can say that there is some integer H 1 > 0 and a subgroup G ′<br />

with [G : G ′ ] ≤ H 1 such that [G ε : (gG ε g −1 ∩ G ε )] < H 1 for all g ∈ G ′ .<br />

We want to check that there are only finitely many possibilities for gG ε g −1 ∩ G ε<br />

where g ∈ G ′ .<br />

Let g ∈ G ′ . Notice that Γ iε is a uniformly open sequence of subgroups. Choose<br />

g i ∈ Γ i with g i → g. By <strong>Lemma</strong> 1.8, Υ i := g i Γ iε g −1<br />

i ∩ Γ iε converges to gG ε g −1 ∩ G ε<br />

and the index of Υ i in Γ iε is ≤ H 1 for all large i. By Theorem 2.5, the number of<br />

generators of Γ iε is bounded by some a priori constant. Therefore, Γ iε contains at<br />

most h 0 = h 0 (n, H 1 ) subgroups of index ≤ H 1 .<br />

Combining these statements we see that<br />

{gG ε g −1 ∩ G ε | g ∈ G ′ } = {g h G ε g −1<br />

h ∩ G ε | h = 1, . . . , h 0 }<br />

for suitably chosen elements g 1 , . . . , g h0 ∈ G ′ . We choose g hi ∈ Γ i converging to<br />

g h ∈ G. By <strong>Lemma</strong> 1.8, the sequence of subgroups<br />

Υ i :=<br />

is uniformly open and converges to<br />

Υ ∞ :=<br />

⋂h 0<br />

i=1<br />

⋂h 0<br />

h=1<br />

g h G ε g −1<br />

h<br />

g hi Γ iε g −1<br />

hi<br />

=<br />

⋂<br />

g∈G ′ gG ε g −1 .<br />

Clearly, Υ ∞ ⊳G ′ . It is not hard to see that we can find elements c 1 i , . . . , cτ i ∈ Γ i that<br />

generate a subgroup of index ≤ [G : G ′ ] such that each c j i converges to an element<br />

c j ∞ ∈ G ′ . Since c j i Υ i(c j i )−1 is uniformly open and converges to c j ∞Υ ∞ (c j ∞) −1 = Υ ∞ ,<br />

one can now apply <strong>Lemma</strong> 1.8 c) to see that c j i normalizes Υ i for large i.


30 VITALI KAPOVITCH AND BURKHARD WILKING<br />

Thus, the normalizer of Υ i has finite index ≤ [G : G ′ ] in Γ i for all large i and<br />

Step 4 is established.<br />

Step 5. The desired contradiction arises if, after passing to a subsequence, the<br />

indices [Γ i : Γ iε ] ∈ N ∪ {∞} converge to ∞.<br />

By Step 4 there is a subgroup Υ i ⊂ Γ iε of index ≤ H which is normalized by a<br />

subgroup of Γ i of index ≤ H. Similarly to the beginning of the proof of Step 3, one<br />

can reduce the situation to the case of Υ i ⊳ Γ i .<br />

Moreover, Υ i ⊳ Γ i is uniformly open and hence the action of Γ i /Υ i on M i /Υ i is<br />

uniformly discrete and converges to a properly discontinuous action of the virtually<br />

abelian group G/Υ ∞ on the space R k × (R l × ˜K)/Υ ∞ . It is now easy to see that<br />

Γ iε /Υ i contains an abelian subgroup of controlled finite index for large i.<br />

After replacing Γ i once more by a subgroup of controlled finite index we may<br />

assume that Γ i /Υ i is abelian. This also shows that without loss of generality<br />

Υ i = Γ iε .<br />

Recall that by Step 2 we have assumed that f j i converges to the identity in the<br />

weakly measured sense for every j = 1, . . . , k. Therefore, the same is true for the<br />

commutator [f j i , γ i] ∈ Γ i if γ i ∈ Γ i has bounded displacement. This in turn implies<br />

that [f j i , γ i] ∈ Γ iε for all large i and j = 1, . . . , k.<br />

By Theorem 2.5, we can find for some large σ elements d i 1, . . . , d i σ ∈ Γ i with<br />

bounded displacement generating Γ i . We also assume that the first τ elements<br />

generate Γ iε for some τ < σ.<br />

Let ˆΓ i := 〈d i 1, . . . , d i σ−1〉. Note that ˆΓ i ⊳ Γ i since ˆΓ i contains Γ iε = Υ i ⊳ Γ i and<br />

Γ i /Γ iε is abelian.<br />

If for some subsequence and some integer H the group ˆΓ i has index ≤ H in Γ i<br />

then we may replace Γ i by ˆΓ i . Thus, without loss of generality, the order of the<br />

cyclic group Γ i /ˆΓ i tends to infinity. The element f k+1<br />

i := d i σ ∈ Γ i represents a<br />

generator of this factor group which has bounded displacement.<br />

We have seen above that f j i normalizes ˆΓ i and [f k+1<br />

i , f j i ] ∈ Γ iε ⊂ ˆΓ i , j = 1, . . . , k.<br />

Clearly f k+1<br />

i , being an isometry with bounded displacement, has the zooming in<br />

property.<br />

We replace M i by B i := ˜M i /ˆΓ i . Notice that (B i , ˆp i ) converges to (R k × (R l ×<br />

˜K)/Ĝ, ˆp ∞) where Ĝ is the limit group of ˆΓi . Since G/Ĝ is a noncompact group<br />

acting cocompactly on (R l × ˜K)/Ĝ, we see that (Rl × ˜K)/Ĝ splits as Rp × K ′ with<br />

K ′ compact and p > 0. If p > 1, we put f j i = id for j = k + 2, . . . , k + p.<br />

Thus the induction assumption applies: There is a positive integer C such that<br />

the following holds for all large i:<br />

We can find a subgroup N i of π 1 (B i ) of index ≤ C and a nilpotent chain of<br />

normal subgroups {e} = N 0 ⊳ . . . ⊳ N n−k−1 = N i such that the quotients are cyclic.<br />

Moreover, the groups are invariant under the automorphism induced by conjugation<br />

of (f j i )C! and the induced automorphism of N h+1 /N h is the identity, j = 1, . . . , k+1.<br />

We put d = C!, and consider the group ¯N i ⊂ π 1 (M i ) generated by (f k+1<br />

i ) d and<br />

N i . Clearly, the index of ¯N i in π 1 (M i ) is bounded by C · d.<br />

Moreover, the chain {e} = N 0 ⊳ . . . ⊳ N n−k−1 ⊳ N n−k := ¯N i is normalized by the<br />

elements (f j i )d (j = 1, . . . , k) and the action on the cyclic quotients is trivial.<br />

In other words, the sequence fulfills the conclusion of our theorem – a contradiction.<br />


STRUCTURE OF FUNDAMENTAL GROUPS 31<br />

Remark 6.2. If one was only interested in proving that the fundamental group<br />

contains a polycylic subgroup of controlled index, the proof would simplify considerably.<br />

First of all, one would not need any diffeomorphisms f j i to make the<br />

induction work. In the proof of Step 1 the use of the rescaling theorem could be<br />

replaced with the more elementary Product <strong>Lemma</strong> 2.1. Step 2 would become unnecessary.<br />

The core of the remaining arguments would be the same although they<br />

would simplify somewhat.<br />

7. <strong>Margulis</strong> <strong>Lemma</strong><br />

Proof of Theorem 1. Let B 1 (p) be a metric ball in complete n-manifold with Ric ><br />

−(n − 1) on B 1 (p), Ñ the universal cover of B 1 (p) and let ˜p ∈ Ñ be a lift of p.<br />

Step 1. There are universal positive constants ε 1 (n) and C 1 (n) such that the group<br />

〈 {g<br />

Γ := ∈ π1 (B 1 (p), p) ∣ d(q, gq) ≤ ε1 (n) for all q ∈ B 1/2 (˜p) }〉<br />

has a subgroup of index ≤ C 1 (n) which has a nilpotent basis of length at most n.<br />

Put N = Ñ/Γ and let ˆp denote the image of ˜p. By the definition of Γ, for each<br />

point q ∈ B 1/2 (ˆp), the fundamental group π 1 (N, q) is generated by loops of length<br />

≤ ε 1 (n). Moreover, B 3/4 (ˆp) is compact.<br />

Assume, on the contrary, that the statement is false. Then we can find a sequence<br />

of pointed n-dimensional manifolds (N i , p i ) satisfying<br />

• Ric Ni ≥ −(n − 1),<br />

• B 3/4 (p i ) is compact,<br />

• for each point q ∈ B 1/2 (p i ) the fundamental group π 1 (N i , q) is generated<br />

by loops of length ≤ 2 −i , and<br />

• π 1 (N i , p i ) does not contain a subgroup of index ≤ 2 i which has a nilpotent<br />

basis of length ≤ n.<br />

After passing to a subsequence we can assume that (N i , p i ) converges to (X, p ∞ ).<br />

We choose q i ∈ B 1/4 (p i ) such that q i converges to a regular point q ∈ X.<br />

Choose λ i → ∞ very slowly such that π 1 (λ i N i , q i ) is still generated by loops of<br />

length ≤ 1 and such that (λ i N i , q i ) converges to C q X ∼ = R k for some k ≥ 0.<br />

Notice that the rescaled sequence M i = λ i N i satisfies Ric Mi ≥ − n−1 → 0<br />

λ 2 i<br />

and B ri (q i ) is compact with r i = λi<br />

2<br />

→ ∞. Thus the existence of this sequence<br />

contradicts the Induction Theorem 6.1 with fi 1 = · · · = f i k = id.<br />

We can now finish the proof of the theorem by establishing.<br />

Step 2. Consider ε 1 (n) > 0 and Γ from Step 1. Then there are ε 2 (n), C 2 (n) > 0<br />

such that the group<br />

〈 {g<br />

H := ∈ π1 (B 1 (p), p) | d(˜p, g˜p) ≤ ε 2 (n) }〉<br />

satisfies: Γ ∩ H has index at most C 2 (n) in H.<br />

We will provide (in principle) effective bounds on ε 2 and C 2 depending on n and<br />

the (ineffective) bound ε 1 (n). Put<br />

Γ ′ :=<br />

〈 {g<br />

∈ π1 (B 1 (p), p) | d(q, gq) ≤ ε 1 (n) for all q ∈ B 3/4 (˜p) }〉 ⊂ Γ.


32 VITALI KAPOVITCH AND BURKHARD WILKING<br />

By Theorem 2.5, there exists h = h(n) such that H can be generated by some<br />

b 1 , . . . , b h ∈ H satisfying d(˜p, b i ˜p) ≤ 4ε 2 (n) for any i = 1, . . . h. We can obviously<br />

assume that the generating set {b 1 , . . . , b h } contains inverses of all its elements. We<br />

proceed in three substeps.<br />

Claim 1. There is a positive integer L(n, ε 1 (n)) such that for all ε 2 (n) < 1<br />

100L and<br />

any choice of g i ∈ {b 1 , . . . , b h }, i = 1, . . . , L we can find l, k ∈ {1, . . . , L} with l < k<br />

and g l · g l+1 · · · g k ∈ Γ ′ .<br />

We assume ε 2 (n) < 1<br />

100L<br />

and we will show that there is an a priori estimate for<br />

L. Notice that d(˜p, g 1 · · · g l ˜p) < 4ε 2 (n)l ≤ 1 25 holds for l = 1, . . . , L. Thus g 1 · · · g l<br />

maps the ball of radius B 2/3 (p) into the ball B 3/4 (p).<br />

We choose a maximal collection of points {a 1 , . . . , a m } in B 2/3 (p) with pairwise<br />

distances ≥ ε 1 (n)/4. It is immediate from Bishop–Gromov that m can be estimated<br />

just in terms of ε 1 and n.<br />

Assume we can choose g i ∈ {b 1 , . . . , b h }, i = 1, . . . , L, such that g l · · · g k ∉ Γ ′<br />

for all 1 ≤ l < k ≤ L. This implies that d(g l · · · g k a u , a u ) ≥ ε 1 (n)/4 for some<br />

u = u(l, k) ∈ {1, . . . , m}. Hence,<br />

d(g 1 · · · g k a u(l,k) , g 1 · · · g l a u(l,k) ) ≥ ε 1 (n)/4, for 1 < k < l < L.<br />

We consider the diagonal action of Γ on the m-fold product Ñ i<br />

m and the point<br />

a = (a 1 , . . . , a m ). We just showed d(g 1 · · · , g k a, g 1 · · · g l a) ≥ ε 1 (n)/4 for all 1 ≤<br />

k < l ≤ L. Thus there L points in the m-fold product B 3/4 (˜p) m which are ε 1 (n)/4<br />

separated. Now the Bishop–Gromov inequality provides an a priori bound for L.<br />

Claim 2. Choose L as in Claim 1 and assume ε 2 (n) ≤ 1<br />

w = b ν1 . . . b νl of length l ≤ L we have wΓ ′ w −1 ⊂ Γ.<br />

100L<br />

. Then for every word<br />

Let γ ′ ∈ π 1 (B 1 (p)) be an element that displaces every point in B 2/3 (˜p) by at most<br />

ε 1 (n). By the definition of Γ ′ it suffices to show that wγ ′ w −1 ∈ Γ. Let q ∈ B 1/2 (˜p).<br />

Since ε 2 (n) < 1<br />

100L , it follows that w−1 ˜p ∈ B 1/25 (˜p) and hence w −1 q ∈ B 2/3 (˜p).<br />

Therefore<br />

d(q, wγ ′ w −1 q) = d(w −1 q, γ ′ w −1 q) < ε 1 (n) for all q ∈ B 1/2 (˜p).<br />

This proves Claim 2. Step 2 now follows from<br />

Claim 3. Every element h ∈ H is of the form h = wγ with γ ∈ Γ and w = b ν1 · · · b νl<br />

with l ≤ L.<br />

We write h = w · γ, where w is a word of length l. We may assume l is chosen<br />

minimal. It suffices to prove that l ≤ L. Suppose l > L. Then we can apply<br />

Claim 1 to the tail of w and obtain w = w 1 γ ′ w 2 with γ ′ ∈ Γ ′ , w 2 a word of length<br />

< L and the word-length of w 1 · w 2 is smaller than the length of w.<br />

By Claim 2, w2 −1 γ′ w 2 ∈ Γ. Putting γ 2 = w2 −1 γ′ w 2 γ gives h = w 1 · w 2 γ 2 – a<br />

contradiction, as the length of w 1 w 2 is smaller.<br />

□<br />

Corollary 7.1. In each dimension n there exists ε > 0 such that the following<br />

holds for any complete n-manifold M and p ∈ M with Ric > −(n − 1) on B 1 (p).<br />

If the image of π 1 (B ε (p)) → π 1 (B 1 (p)) contains a nilpotent group of rank n, then<br />

M is homeomorphic to a compact infranilmanifold.


STRUCTURE OF FUNDAMENTAL GROUPS 33<br />

Actually, we will only show that M is homotopically equivalent to an infranilmanifold.<br />

By work of Farrell and Hsiang [FH83] this determines the homeomorphism<br />

type in dimensions above 4. The 4 -dimensional case follows from work of<br />

Freedman–Quinn [FQ90]. Lastly, the 3-dimensional case follows from Perelman’s<br />

solution of the geometrization conjecture.<br />

Proof of Corollary 7.1. Notice that it suffices to prove that M i is aspherical for<br />

large i: In fact, since the cohomological dimension of a rank n nilpotent group<br />

is n, the group π 1 (M i ) must then be a torsion free virtually nilpotent group of<br />

rank n and thus, by a result of Lee and Raymond [LR85], it is isomorphic to the<br />

fundamental group of an infranilmanifold. Therfore M i is homotopically equivalent<br />

to an infranilmanifold and as explained above this then gives the result.<br />

We argue by contradiction and assume that we can find ε i → 0 and a sequence<br />

of complete manifolds M i with Ric > −1 on B 1 (p i ) such that the image of<br />

π 1 (B εi (p i ), p i ) → π 1 (B 1 (p i ), p i ) contains a nilpotent group of rank n and M i is not<br />

aspherical.<br />

By the <strong>Margulis</strong> <strong>Lemma</strong> (Theorem 1), the nilpotent group can be chosen to<br />

have index ≤ C(n) in the image. Arguing on the universal cover of B 1 (p i ) it is not<br />

hard to deduce that there is δ i → 0 such that for all q i ∈ B 1/10 (p i ) the image of<br />

π 1 (B δi (q i ), q i ) → π 1 (B 1 (p i ), p i ) also contains a nilpotent subgroup of rank n.<br />

Next we observe that diam(M i , p i ) → 0. In fact, otherwise we could, after passing<br />

to a subsequence, assume that (M i , p i ) converges in the Gromov–Hausdorff sense<br />

to (Y, p ∞ ) with Y ≠ pt. Choose q i ∈ B 1/10 (p i ) converging to a regular point q ∞<br />

in the limit Y . Similarly, choose λ i ≤ 1 √ δi<br />

slowly converging to infinity such that<br />

(λ i M i , q i ) → (C q∞ Y, o) = (R k , 0) for some k > 0. Let Ñi denote the universal cover<br />

of B 1 (p i ) and ˜q i a lift of q i . Let Γ i be the subgroup of the deck transformation group<br />

generated by those elements which displace q i by at most δ i . By construction, Γ i<br />

contains a nilpotent group of rank n. Put N i = Ñi/Γ i . Since (λ i N i , q i ) → (R k , 0)<br />

with k > 0, we get a contradiction to the Induction Theorem (6.1).<br />

Thus, diam(M i ) → 0 and in particular, M i = B 1 (p i ) is a closed manifold for all<br />

large i. After a slow rescaling we may assume in addition that Ric Mi ≥ −h i → 0.<br />

Next we plan to show that the universal cover of M i converges to R n . This will<br />

follow from<br />

Claim 1. Suppose a sequence of complete n-manifolds (N i , p i ) with lower Ricci<br />

curvature bound > −h i → 0 converges to (R k × K, p ∞ ) with K compact. Also<br />

assume that π 1 (N i ) is generated by loops of length ≤ R and contains a nilpotent<br />

subgroup of rank n − k. Then the universal cover of (Ñi, ˜p i ) converges to (R n , 0).<br />

We argue by reverse induction on k. The base of induction k = n is obvious.<br />

By the Induction Theorem, after passing to a bounded cover, we may assume<br />

that π 1 (N i ) itself is a torsion free nilpotent group of rank n − k.<br />

Choose N i ⊳ π 1 (N i ) with π 1 (N i )/N i<br />

∼ = Z. Then ( Ñ i /N i , ˆp i ) converges to (R l ×<br />

K ′ , (0, p ∞ )) for some l > k and K ′ compact and the claim follows from the induction<br />

assumption.<br />

Therefore ( ˜M i , ˜p i ) converges to (R n , 0). From the Cheeger–Colding Stability Theorem<br />

(see Theorem 1.2) it follows that for any R > 0 the ball B R (p) is contractible<br />

in B R+1 (p) for all p ∈ B R (˜p i ) and i ≥ i 0 (R). Since we have a cocompact deck


34 VITALI KAPOVITCH AND BURKHARD WILKING<br />

transformation group with nearly dense orbits, the result actually holds for all<br />

p ∈ ˜M i .<br />

In order to show that M i is aspherical we may replace M i by a bounded cover and<br />

thus, by Theorem 1 without loss of generality, π 1 (M i ) has a nilpotent basis of length<br />

≤ n. Because rank(π 1 (M i )) ≥ n it follows that π 1 (M i ) is torsion free. Therefore<br />

we can choose subgroups {e} = N i 0 ⊳ · · · ⊳ N i n = π 1 (M i ) with N i j /Ni j−1 ∼ = Z.<br />

In the rest of the proof we will slightly abuse notations and sometimes drop<br />

basepoints when talking about pointed Gromov–Hausdorff convergence when the<br />

base points are clear.<br />

Note that the above claim easily implies that ˜M i /N i j → Rn−j for all j = 0, . . . n.<br />

Claim 2. N i j ⋆ B r(˜p i ) is contractible in N i j ⋆ B 4 4j r (˜p i) for j = 1, · · · , n and r ∈<br />

[<br />

1, 4<br />

4 (2n−j)2 ] and all large i.<br />

We want to prove the statement by induction on j. For j = 0 it holds as was<br />

pointed out above. Suppose it holds for j < n and we need to prove it for j + 1.<br />

Choose g ∈ N i j+1 representing a generator of Ni j+1 /Ni j .<br />

Notice that N i j+1 ⋆ B R(˜p i )/N i j converges to R × B R(0) ⊂ R n−j as i → ∞ where<br />

B R (0) is the ball in R n−j−1 . Moreover, the action of N i j+1 /Ni j on the set converges<br />

to the R action on R × B R (0) given by translations.<br />

We can also find finite index subgroups ¯N i j+1 ⊂ Ni j+1 with Ni j ⊂ ¯N i j+1 such that<br />

N i j+1 ⋆ B R(˜p i )/¯N i j+1 converges to S1 × B R (0) ⊂ S 1 × R n−j where S 1 has diameter<br />

10 −n . It is easy to construct a smooth map ¯σ : N i j+1 ⋆ B R(˜p i )/¯N i j+1 → S1 that is<br />

arbitrary close to the projection map S 1 × B R (0) → S 1 in the Gromov–Hausdorff<br />

sense.<br />

We can lift ¯σ to a map σ : N i j+1 ⋆ B R(˜p i ) → R. Notice that σ commutes with the<br />

action of ¯N i j+1 where ¯N i j+1 /Ni j can be thought of as the deck transformation group<br />

of the covering R → S 1 .<br />

It suffices to show that N i j+1 ⋆ B r(˜p i ) → N i j+1 ⋆ B 4 4j+1 r (˜p i) induces the trivial<br />

map on the level of homotopy groups. Thus, we have to show that for any k > 0<br />

any map ι: S k → N i j+1 ⋆ B r(˜p i ) is null homotopic in N i j+1 ⋆ B 4 4j+1 r (˜p i).<br />

We can assume that ι is smooth. The image of σ ◦ ι is given by an interval<br />

[a, b] in R. We choose a 10 −n -fine finite subdivision a < t 1 < · · · < t h < b of the<br />

interval such that the t α are regular values. Thus ι −1 (σ −1 (t α )) = H α is a smooth<br />

hypersurface in S k for every α.<br />

Notice that by construction, the image ι(H α ) is contained in gN j ⋆ B 2r (˜p i ) for<br />

some g ∈ ¯N i j+1 . By induction assumption, ι |H α<br />

is homotopic to a point map in<br />

gN i j ⋆ B 4 4j 2r (˜p i). We homotope ι into ˜ι such that ˜ι(H α ) is a point for all α and ˜ι is<br />

4 4j 4r close to ι.<br />

Consider now all components of H α for all α. They divide the sphere into<br />

connected regions such that each boundary component of a region is mapped by ˜ι<br />

to a point and the whole region is mapped to a set gN i j ⋆ B 4 4j 8r (˜p i) for some g.<br />

Thus the map ˜ι restricted to a region with crushed boundary components is null<br />

homotopic in g ⋆B 4 2·4 j 8r (˜p i) by the induction assumption. Clearly, this implies that<br />

ι is null homotopic in N i j+1 ⋆ B 4 4j+1 r (˜p i).<br />

This finishes the proof of Claim 2. Notice that N i n ⋆ B 1 (˜p i ) = ˜M i for large i since<br />

diam(M i ) → 0. Thus ˜M i is contractible by Claim 2.<br />


STRUCTURE OF FUNDAMENTAL GROUPS 35<br />

8. Almost nonnegatively curved manifolds with maximal first Betti<br />

number<br />

This is the only section where we assume lower sectional curvature bounds. The<br />

main purpose is to prove<br />

Corollary 8.1. In each dimension there are positive constants ε(n) and p 0 (n) such<br />

that for all primes p > p 0 (n) the following holds.<br />

Any manifold with diam(M, g) 2 K sec ≥ −ε(n) and b 1 (M, Z p ) ≥ n is diffeomorphic<br />

to a nilmanifold. Conversely, for every p, every compact n-dimensional nilmanifold<br />

covers another (almost flat) nilmanifold M with b 1 (M, Z p ) = n.<br />

The second part of the Corollary is fairly elementary but it provides counterexamples<br />

to a theorem of Fukaya and Yamaguchi [FY92, Corollary 0.9] which asserted<br />

that only tori should show up. The second part follows from the following <strong>Lemma</strong><br />

and the fact that every nilmanifold is almost flat, i.e. for any i > 0 it admits a<br />

metric with diam(M, g) 2 |K sec | ≤ 1 i .<br />

<strong>Lemma</strong> 8.2. Let Γ be a torsion free nilpotent group of rank n, and let p be a prime<br />

number.<br />

a) There is a subgroup of finite index Γ ′ ⊂ Γ for which we can find a surjective<br />

homomorphism Γ ′ → (Z/pZ) n .<br />

b) We can find a torsion free nilpotent group ˆΓ containing Γ as finite index<br />

subgroup such that there a surjective homomorphism ˆΓ → (Z/pZ) n .<br />

Proof. a) We argue by induction on n. The statement is trivial for n = 1. Assume<br />

it holds for n − 1. Choose a subgroup Λ ⊳ Γ with Γ/Λ ∼ = Z. By the induction<br />

assumption, we can find a subgroup Λ ′ ⊂ Λ of finite index and Λ ′′ ⊳ Λ ′ with<br />

Λ ′ /Λ ′′ ∼ = (Z/pZ) n−1 . Let g ∈ Γ represent a generator of Γ/Λ. Since Λ contains<br />

only finitely many subgroups with of index ≤ [Λ : Λ ′′ ], it follows that g l is in the<br />

normalizer of Λ ′ and Λ ′′ for a suitable l > 0. After increasing l further we may<br />

assume that the automorphism of Λ ′ /Λ ′′ ∼ = (Z/pZ) n−1 induced by the conjugation<br />

by g l is the identity.<br />

Define Γ ′ as the group generated by Λ ′ and g l . It is now easy to see that Γ ′ /Λ ′′<br />

is isomorphic to (Z/pZ) n−1 × Z and thus Γ ′ has a surjective homomorphism to<br />

(Z/pZ) n .<br />

b). An analysis of the proof of a) shows that in a) the index of Γ ′ in Γ can<br />

be bounded by a constant C only depending on n and p. Let L be the Malcev<br />

completion of Γ, i.e. L is the unique n-dimensional nilpotent Lie group containing<br />

Γ as a lattice. Let exp: l → L denote the exponential map of the group. It is<br />

well known that the group ¯Γ generated by exp( 1 C! exp−1 (Γ)) contains Γ as a finite<br />

index subgroup. By part a) we can assume now that ¯Γ contains a subgroup ˆΓ of<br />

index ≤ C such that there is a surjective homomorphism to ˆΓ → (Z/pZ) n . By<br />

construction, any subgroup of ¯Γ of index ≤ C must contain Γ and hence we are<br />

done.<br />

□<br />

<strong>Lemma</strong> 8.3. For any n there exists p(n) such that if Γ is a torsion free virtually<br />

nilpotent of rank n which admits an epimorphism onto (Z/pZ) n for some p ≥ p(n)<br />

then Γ is nilpotent.<br />

Proof. By [LR85], Γ is an almost crystallographic group, that is, there is an n-<br />

dimensional nilpotent Lie group L, a compact subgroup K ⊂ Aut(L) such that Γ


36 VITALI KAPOVITCH AND BURKHARD WILKING<br />

is isomorphic to a lattice in K ⋉ L. By a result of Auslander (see [LR85] for a<br />

proof), the projection of Γ to K is finite and we may assume that the projection<br />

is surjective. Thus K is a finite group and it is easy to see that the action of K<br />

on L/[L, L] is effective. It is known that N ′ := Γ ∩ [L, L] is a lattice in [L, L]. Thus,<br />

the image ¯Γ := Γ/N ′ of Γ in K ⋉ L/[L, L] is discrete and cocompact. Since K acts<br />

effectively on L/[L, L], we can view K ⋉ L/[L, L] as a cocompact subgroup of Iso(R k )<br />

where k = dim(L/[L, L]).<br />

In particular, ¯Γ is a crystallographic group. As the projection of Γ to K is<br />

surjective, ¯Γ is abelian if and only if K is trivial. If it is not abelian, then<br />

rank(¯Γ/[¯Γ, ¯Γ]) < rank(¯Γ) = k.<br />

By the third Bieberbach theorem there are only finitely many crystallographic<br />

groups of any given rank, and consequently there are only finitely many possibilities<br />

for the isomorphism type of ¯Γ/[¯Γ, ¯Γ]. Obviously, any homomorphism ¯Γ → (Z/pZ) k<br />

factors through ¯Γ → ¯Γ/[¯Γ, ¯Γ] → (Z/pZ) k ; this easily implies that there is p 0 such<br />

that that there is no surjective homomorphism ¯Γ → (Z/pZ) k for p ≥ p 0 .<br />

Therefore, there is no surjective homomorphism Γ → (Z/pZ) n for p > p 0 . If we<br />

put p(n) = p 0 , then ¯Γ is abelian, K = {e} and hence Γ is nilpotent.<br />

□<br />

Corollary 8.1 follows from <strong>Lemma</strong> 8.2, <strong>Lemma</strong> 8.3 and the following result<br />

Corollary 8.4. In each dimension there are positive constants C(n) and ε(n) such<br />

that for any complete manifold with K sec > −1 on B 1 (0) one of the following holds<br />

a) The image π 1 (B ε(n) (p)) → B 1 (p) contains a subgroup N of index less than<br />

C(n) such that N has a nilpotent basis of length ≤ n − 1 or<br />

b) M is compact and homeomorphic to an infranilmanifold. Moreover, any<br />

finite cover with a nilpotent fundamental group is diffeomorphic to a nilmanifold.<br />

Proof. Consider a sequence of manifolds (M i , p i ) with K sec > −1 such that M i<br />

does not satisfy a) with C = i and ε(n) = 1 i<br />

. As in proof of Corollary 7.1, it is clear<br />

that diam(M i ) → 0.<br />

By the <strong>Margulis</strong> <strong>Lemma</strong> or by work of [KPT10], we may assume that π 1 (M i ) is<br />

nilpotent. We plan to show that π 1 (M i ) has rank n for large i.<br />

The fact that π 1 (M i ) does not contain a subgroup of bounded index with a<br />

nilpotent basis of length ≤ n − 1, guarantees a certain extremal behavior if we run<br />

through the proof of the Induction Theorem. It will be clear that the blow up<br />

limits R k × K, that occur if we go through the procedure provided in the proof of<br />

the Induction Theorem, are flat – more precisely each K has to be a torus. This<br />

follows from the fact that we can never run into a situation where Step 3 of the<br />

proof of the Induction Theorem applies.<br />

In particular, if we rescale the manifolds to have diameter 1 they must converge<br />

to a torus K = T h . We can now use Yamaguchi’s fibration theorem [Yam91] to get<br />

a fibration F i → M i → T h .<br />

Let Γ i denote the kernel of π 1 (M i ) → π 1 (T h ). In the next h steps of the procedure<br />

in the proof of the Induction Theorem we replace M i by ˜M i /Γ i endowed with<br />

h deck transformations fi 1, . . . , f i h projecting to generators of π 1 (T h ).<br />

( ˆM i := ˜M i /Γ i , ˆp i ) converges to (R h , 0). The next step in the Induction Theorem<br />

would be to rescale ˆM i so that (λ i ˆMi , ˆp i ) → (R h × K ′ , ˆp ∞ ) with K ′ not a point<br />

and it will be clear again that K ′ ∼ = T h′ is a torus.


STRUCTURE OF FUNDAMENTAL GROUPS 37<br />

It is easy to see that 1 λ i<br />

is necessarily comparable to the size of the fiber F i of<br />

the Yamaguchi fibration.<br />

From the exact homotopy sequence we can deduce that Γ i = π 1 ( ˆM i ) ∼ = π 1 (F i ).<br />

The fibration theorem ensures that F i fibers over a new torus T h′ . Let Γ ′ i denote<br />

the kernel of π 1 (F i ) → π 1 (T h′ ). Clearly π 1 (M i ) has rank n if and only if Γ ′ i has<br />

rank n − h − h ′ . After finitely many similar steps this shows that π 1 (M i ) has rank<br />

n. By Corollary 7.1 the first part of b) follows. To get the second part of part b)<br />

observe that if π 1 (M i ) is nilpotent and torsion free then by the proof of [KPT10,<br />

Theorem 5.1.1], for all large i we have that M i smoothly fibers over a nilmanifold<br />

with simply connected factors. The fibers obviously have to be trivial as M i is<br />

aspherical which means that M i is diffeomorphic to a nilmanifold.<br />

□<br />

Problem. a) We suspect that in the case of almost nonnegatively curved manifolds<br />

Corollary 8.4 remains valid if one replaces the n − 1 in a) by n − 2.<br />

b) The result of this section remain valid if one just has a uniform lower bound<br />

on sectional curvature and lower Ricci curvature bounds close enough to 0.<br />

However, it would be nice to know whether or whether not they remain valid<br />

without sectional curvature assumptions.<br />

9. Finiteness results<br />

In the proof of the following theorem the work of Colding and Naber [CN10] is<br />

key.<br />

Theorem 9.1 (Normal Subgroup Theorem). Given n, D, ε 1 > 0 there are positive<br />

constants ε 0 < ε 1 and C such that the following holds:<br />

If (M, g) is a compact n-manifold M with Ric > −(n − 1) and diam(M) ≤ D,<br />

then there is ε ∈ [ε 0 , ε 1 ] and a normal subgroup N ⊳ π 1 (M) such that for all p ∈ M<br />

we have:<br />

• the image of π 1 (B ε/1000 (p), p) → π 1 (M, p) contains N and<br />

• the index of N in the image of π 1 (B ε (p), p) → π 1 (M, p) is ≤ C.<br />

To avoid confusion, we should recall that a normal subgroup N ⊳ π 1 (M, p) naturally<br />

induces a normal subgroup N ⊳ π 1 (M, q) for all p, q ∈ M.<br />

If we put ε 1 = ε Marg (n), the constant in the <strong>Margulis</strong> <strong>Lemma</strong>, then by Theorem<br />

1, the group N in the above theorem contains a nilpotent subgroup of index<br />

≤ C Marg which has a nilpotent basis of length ≤ n. After replacing N by a characteristic<br />

subgroup of controlled index Theorem 6 follows.<br />

Proof of Theorem 9.1. We will argue by contradiction. It suffices to rule out the<br />

existence of a sequence (M i , g i ) with the following properties.<br />

(1) Ric Mi > −(n − 1), diam(M i ) ≤ D;<br />

(2) for all ε ∈ (2 −i , ε 1 ) and any normal subgroup N⊳π 1 (M i ) one of the following<br />

holds<br />

(i) N is not contained in the image of π 1 (B ε/1000 (p)) for some p ∈ M i or<br />

(ii) the index of N in the image of π 1 (B ε (q i ), q i ) → π 1 (M i , q i ) is ≥ 2 i for<br />

some q i ∈ M i .<br />

Since any subsequence of (M i , g i ) also satisfies this condition, we can assume that<br />

the universal covers ( ˜M i , π 1 (M i ), ˜p i ) converge to (Y, Γ ∞ , ˜p ∞ ). Put Γ i := π 1 (M i ).


38 VITALI KAPOVITCH AND BURKHARD WILKING<br />

For δ > 0 define<br />

S i (δ) := { g ∈ Γ i | d(p, gp) ≤ δ for all p ∈ B 1/δ (˜p i ) } and Γ iδ := 〈S i (δ)〉 ⊂ Γ i<br />

for i ∈ N ∪ {∞}.<br />

By Colding and Naber [CN10], Γ ∞ is a Lie group. In particular, the component<br />

1<br />

group Γ ∞ /Γ ∞0 is discrete and hence there is some positive δ 0 ≤ min{ε 1 ,<br />

2D } such<br />

that Γ ∞δ is given by the identity component of Γ ∞ for all δ ∈ (0, 2δ 0 ).<br />

It is easy to see that this property carries over to the sequence in the following<br />

sense. We fix a large constant R ≥ 10, to be determined later, and put ε 0 := δ 0 /R.<br />

Then Γ iδ = Γ iε0 for all δ ∈ [ε 0 /1000, Rε 0 ] and all i ≥ i 0 (R).<br />

We claim that Γ iε0 is normal in Γ i . In fact, Γ i can be generated by elements<br />

which displace ˜p i by at most 2D ≤ 1 ε 0<br />

. It is straightforward to check that for any<br />

g i ∈ Γ i satisfying d(g i ˜p i , ˜p i ) ≤ 1 ε 0<br />

we have g i S(ε 0 /2)g −1<br />

i ⊂ S(ε 0 ). Thus,<br />

g i Γ iε0/2g −1<br />

i ⊂ Γ iε0 = Γ iε0/2<br />

and clearly the claim follows.<br />

In summary, Γ iε0/1000 = Γ iRε0 ⊳ Γ i with ε 0 ≤ ε 1 for all i ≥ i 0 (R). By the choice<br />

of our sequence this implies that for some q i ∈ M the index of Γ iε0 in the image of<br />

π 1 (B ε0 (q i ), q i )) → π 1 (M i ) is larger than 2 i .<br />

Similarly to Step 2 in the proof of the <strong>Margulis</strong> <strong>Lemma</strong> in section 7, this gives a<br />

contradiction provided we choose R sufficiently large. In fact, the present situation<br />

is quite a bit easier as Γ iRε0 is normal in π 1 (M i ). □<br />

<strong>Lemma</strong> 9.2. a) Let ε 0 , D, H, n > 0. Then there is a finite collection of groups<br />

such that the following holds:<br />

Let (M, g) be a compact n-dimensional manifold with diam(M) < D,<br />

Ric(M) > −(n−1) and a normal subgroup N⊳π 1 (M) and ε ≥ ε 0 . Suppose<br />

that<br />

• The image I of π 1 (B ε (q)) → π 1 (M, q) satisfies that the index [I : I ∩N]<br />

is bounded by H, for all q ∈ M.<br />

• The group N is in the image of π 1 (B ε/1000 (q 0 )) → π 1 (M, q 0 ) for some<br />

q 0 ∈ M.<br />

Then π 1 (M)/N is isomorphic to a group in the a priori chosen finite collection.<br />

b) (Bounded presentation) Given a manifold as in a) one can find finitely<br />

many loops in M based at p ∈ M whose length and number is bounded by<br />

an a priori constant such that the following holds: the loops project to a<br />

generator system of π 1 (M, p)/N and there are finitely many relations (with<br />

an a priori bound on the length of the words involved) such that these words<br />

provide a finite presentation of the group π 1 (M, p)/N.<br />

Instead of compact manifolds one could also consider manifolds satisfying the<br />

condition that the image of π 1 (B r0 (p 1 )) → π 1 (B R0 (p 1 )) is via the natural homomorphism<br />

isomorphic to π 1 (M i ), where r 0 and R 0 are given constants. In that<br />

case one would only need the Ricci curvature bound in the ball B R0+1(p 1 ). In fact,<br />

the only purpose of the Ricci curvature bound is to guarantee the existence of an<br />

ε/100–dense finite set in M with an a priori bound on the order of the set.<br />

Of course, b) implies a). However, we prove a) first and then see that b) follows<br />

from the proof. In the context of equivariant Gromov–Hausdorff convergence a


STRUCTURE OF FUNDAMENTAL GROUPS 39<br />

bounded presentation can often be carried over to the limit. That is why part b)<br />

has some advantages over a).<br />

Proof. a). The idea is to define for each such manifold a 2-dimensional finite CWcomplex<br />

C whose combinatorics is bounded by some a priori constants and which<br />

has the fundamental group π 1 (M)/N.<br />

Choose a maximal collection of points p 1 , . . . , p k in M with pairwise distances<br />

≥ ε/100. Clearly, there is an a priori bound on k depending only on n, D and ε 0 .<br />

We also may assume that p 1 = q 0 .<br />

The points p 1 , . . . , p k will represent the 0-skeleton C 0 of our CW -complex.<br />

For each point p i define F i as the finite group given by im ε/5 (p i )/(N ∩ im ε/5 (p i ))<br />

where im δ (p i ) denotes the image of π 1 (B δ (p i ), p i ) → π 1 (M, p i ). For each of the at<br />

most H elements in F i we choose a loop in B ε/5 (p i ) representing the class modulo<br />

N. We attach to p i an oriented 1-cell for each loop and call the loop the model path<br />

of the cell.<br />

For any two points p i , p j with d(p i , p j ) < ε/20 we choose in the manifold a<br />

shortest path from p i to p j . In the CW -complex we attach an edge (1-cell) between<br />

these two points. We call the chosen path the model path of the edge.<br />

This finishes the definition of the 1-skeleton C 1 of our CW -complex.<br />

For each point p i ∈ C 0 we consider the part A i of the 1-skeleton C 1 containing<br />

all points p j with d(p i , p j ) < ε/2 and also all 1-cells connecting them including the<br />

cells attached initially. If A i has more than one connected component we replace<br />

A i by the connected component containing p i .<br />

We choose loops in A i based at p i representing free generators of the free group<br />

π 1 (A i , p i ). We now consider the homomorphism π 1 (A i , p i ) → π 1 (M, p i ) induced<br />

by mapping each loop to its model path in M. The induced homomorphism<br />

α i : π 1 (A i ) → π 1 (M, p i )/N has finite image of order ≤ H since the model paths<br />

are contained in B ε (p i ).<br />

The kernel of α i is a normal subgroup of finite index ≤ H in π 1 (A i , p i ). Thus,<br />

it is finitely generated and there is an a priori bound on the number of possibilities<br />

for the kernel. We choose a free finite generator system of the kernel and attach<br />

the corresponding 2-cells to the CW -complex for each generator of the kernel.<br />

This finishes the definition of the CW -complex C.<br />

Note that by construction, the total number of cells in our complex is bounded<br />

by a constant depending only on n, D, H and ε 0 and that the attaching maps of the<br />

two cells are under control as well. Hence there are only finitely many possibilities<br />

for the homotopy type of C. Thus, we can finish the proof by establishing that<br />

π 1 (M)/N is isomorphic to π 1 (C).<br />

First notice that there is a natural surjective map π 1 (C) → π 1 (M)/N: Consider<br />

the 1-skeleton C 1 of C. Its fundamental group is free and there is a homomorphism<br />

π 1 (C 1 ) → π 1 (M) → π 1 (M)/N induced by mapping a path onto its model<br />

path. The two cells that where attached to the 1-skeleton only added relations to<br />

the fundamental group contained in the kernel of this homomorphism. Thus, the<br />

homomorphism induces a homomorphism π 1 (C, p 1 ) → π 1 (M, p 1 )/N. Moreover, it<br />

is easy to see that the homomorphism is surjective.<br />

In order to show that the homomorphism is injective we need to show that<br />

homotopies in M can be lifted in some sense to C.<br />

Suppose we have a closed curve γ in the 1-skeleton C 1 which is based at p 1 and<br />

whose model curve c 0 in M is homotopic to a curve in N ⊂ π 1 (M). We parametrize


40 VITALI KAPOVITCH AND BURKHARD WILKING<br />

γ on [0, j 0 ] for some large integer j 0 . We will assume that γ(t) = p j for some j<br />

implies t ∈ [0, j 0 ] ∩ Z – we do not assume that the converse holds.<br />

Using that p 1 = q 0 we can find a homotopy H : [0, 1] × [0, j 0 ] → M, (s, t) ↦→<br />

H(s, t) = c s (t) such that each c s is a closed loop at p 1 , c 0 is the original curve and<br />

c 1 is a curve in the ball B ε/1000 (p 1 ). If we choose j 0 large enough, we can arrange<br />

for L(c s[i,i+1] ) < ε/1000 for all i = 0, . . . , j 0 − 1, s ∈ [0, 1].<br />

After an arbitrary small change of the homotopy we can assume that c s (i) intersects<br />

the cut locus of the 0-dimensional submanifold {p 1 , . . . , p k } only finitely<br />

many times for every i = 1, . . . , j 0 − 1. We can furthermore assume that at any<br />

given parameter value s there is at most one i such that c s (i) is in the cut locus of<br />

{p 1 , . . . , p k }.<br />

For each c s (i) we choose a point ˜c s (i) := p li (s) with minimal distance to c s (i).<br />

By construction ˜c s (i) is piecewise constant in s and at each parameter value s 0<br />

at most one of the chosen points is changed. Moreover, ˜c s (0) = ˜c s (1) = p 1 and<br />

˜c 1 (i) = p 1 .<br />

We now define ˜c s|[i,i+1] in three steps. In the middle third of the interval we run<br />

through c s|[i,i+1] with triple speed. In the last third of the interval we run through<br />

a shortest path from c s (i + 1) to p li+1 (s). In the first third of the interval we run<br />

through a shortest path from p li (s) to c s (i) and if there is a choice we choose the<br />

same path that was chosen in the last third of the previous interval.<br />

Each curve ˜c s is homotopic to c s . Moreover, ˜c s (i) is piecewise constant equal to<br />

a vertex. Clearly, s ↦→ ˜c s is only piecewise continuous.<br />

The ’jumps’ for ˜c occur exactly at those parameter values at which c s (i) is the<br />

cut locus of the finite set {p 1 , . . . , p k } for some i.<br />

We introduce the following notation: for two continuous curves c 1 , c 2 in M from<br />

p to q ∈ M we say that c 1 and c 2 are homotopic modulo N if the homotopy class of<br />

the loop obtained by prolonging c 1 with the inverse parametrized curve c 2 is in N.<br />

We define a corresponding curve γ s lifting ˜c s to our CW -complex as follows:<br />

γ s|[i,i+1] should run in the one skeleton of our CW complex from ˜c s (i) to ˜c s (i+1) ∈<br />

C 0 and the model path of γ s|[i,i+1] should be modulo N homotopic to ˜c s|[i,i+1] . We<br />

choose γ s[i,i+1] in such a way that on the first half of the interval it runs through<br />

the edge from ˜c s (i) to ˜c s (i + 1) and in order to get the right homotopy type for the<br />

model path, in the second half of the interval it runs through one of the 1-cells that<br />

were attached initially to the vertex ˜c s (i + 1) ∈ C 0 .<br />

Thus, the model curve of γ s|[i,i+1] is homotopic to ˜c s|[i,i+1] modulo N. In particular,<br />

the model curve of γ s and the loop c s are homotopic modulo N.<br />

The map s ↦→ γ s is piecewise constant in s. We have to check that at the<br />

parameter where γ s jumps the homotopy type of γ s does not change.<br />

However, first we want to check that the original curve γ is homotopic to γ 0 in C.<br />

By our initial assumption, there are integers 0 = k 0 < . . . < k h = j 0 such that γ(t)<br />

is in the 0-skeleton if and only if t ∈ {k 0 , . . . , k h }. Going through the definitions<br />

above it is easy to deduce that γ(k i ) = c 0 (k i ) = ˜c 0 (k i ) = γ 0 (k i ). Thus, it suffices to<br />

check that γ |[ki,k i+1] is homotopic to γ 0|[ki,k i+1]. We have just seen that the model<br />

curve of γ 0|[ki,k i+1] is homotopic to ˜c 0|[ki,k i+1] modulo N. This curve is homotopic<br />

to c 0|[ki,k i+1], which is the model curve of γ |[ki,k i+1].<br />

Moreover, the model curve of γ |[ki,k i+1] is in an ε/5-neighbourhood of γ(k i ). By<br />

the definition of ˜c 0 this implies that ˜c 0|[ki,k i+1] is in an ε/4-neighbourhood of γ(k i ).<br />

Therefore, γ |[ki,k i+1] as well as γ 0|[ki,k i+1] only meet points of the zero skeleton


STRUCTURE OF FUNDAMENTAL GROUPS 41<br />

with distance < ε/4 to γ(k i ). Thus, they are contained in A j , where j is defined<br />

by p j = γ(k i ). Since their model curves are homotopic modulo N, our rules for<br />

attaching 2-cells imply that γ 0|[ki,k i+1] is homotopic to γ [ki,k i+1] in C.<br />

It remains to check that the homotopy type of γ s does not change at a parameter<br />

s 0 at which s ↦→ γ s is not continuous. By construction there is an i such that<br />

γ s|[0,i−1] and γ s|[i+1,j0] are independent of s ∈ [s 0 − δ, s 0 + δ] for some δ > 0.<br />

The model curve of γ s0−δ|[i−1,i+1] is modulo N homotopic to ˜c s0−δ|[i−1,i+1] which<br />

in turn is homotopic to ˜c s0+δ|[i−1,i+1] and, finally, this curve is modulo N homotopic<br />

to the model curve of γ s0+δ|[i−1,i+1].<br />

Furthermore, the model curves of γ s0±δ|[i−1,i+1] are in an ε/2-neighbourhood of<br />

˜c s0 (i − 1). Since they are homotopic modulo N, this implies by definition of our<br />

complex that γ s0−δ|[i−1,i+1] is homotopic to γ s0+δ|[i−1,i+1] in C .<br />

Thus, the curve γ is homotopic to γ 1 , which is the point curve by construction.<br />

b) Since the number of CW -complexes constructed in a) is finite, we can think<br />

of C as fixed. We choose loops in C 1 representing a free generator system of the<br />

free group π 1 (C 1 , p 1 ). The model curves of these loops represent generators of<br />

π 1 (M, p 1 )/N and the lengths of the loops are bounded.<br />

For each of the attached 2-cells we consider the loop based at some vertex p i in<br />

the one skeleton given by the attaching map. We choose a path in C 1 from p 1 to<br />

p i and conjugate the loop back to π 1 (C 1 , p 1 ).<br />

We can express this loop as word in our free generators and if collect all these<br />

words for all 2-cells, we get a finite presentation of π 1 (C, p 1 ) ∼ = π 1 (M)/N. □<br />

It will be convenient for the proof of Theorem 7 to restate it in a slightly different<br />

form.<br />

Theorem 9.3. a) Given D and n there are finitely many groups F 1 , . . . , F k<br />

such that the following holds: For any compact n-manifold M with Ric ><br />

−(n − 1) and diam(M) ≤ D we can find a nilpotent normal subgroup N ⊳<br />

π 1 (M) which has a nilpotent basis of length ≤ n − 1 such that π 1 (M)/N is<br />

isomorphic to one of the groups in our collection.<br />

b) In addition to a) one can choose a finite collection of irreducible rational<br />

representations ρ j i : F i → GL(n j i , Q) (j = 1, . . . , µ i, i = 1, . . . , k) such that<br />

for a suitable choice of the isomorphism π 1 (M)/N ∼ = F i the following holds:<br />

There is a chain of subgroups Tor(N) = N 0 ⊳ · · · ⊳ N h0 = N which are all<br />

normal in π 1 (M) such that [N, N h ] ⊂ N h−1 and N h /N h−1 is free abelian.<br />

Moroever, the action of π 1 (M) on N by conjugation induces an action of F i<br />

on N h /N h−1 and the induced representation ρ: F i → GL ( (N h /N h−1 ) ⊗ Z Q )<br />

is isomorphic to ρ j i for a suitable j = j(h), h = 1, . . . , h 0.<br />

In addition one can assume in a) that rank(N) ≤ n − 2.<br />

Proof of Theorem 9.3. We consider a contradicting sequence (M i , g i ), that is, we<br />

have diam(M i , g i ) ≤ D and Ric Mi ≥ −(n − 1) but the theorem is not true for any<br />

subsequence of (M i , g i ).<br />

We may assume that diam(M i , g i ) = D and (M i , g i ) converges to some space X.<br />

Choose p i ∈ M i converging to a regular point p ∈ X. By the Gap <strong>Lemma</strong> (2.4)<br />

there is some δ > 0 and ε i → 0 with π 1 (M i , p i , ε i ) = π 1 (M i , p i , δ).<br />

Claim. There is C and i 0 such that π 1 (M i , p i , δ) contains a subgroup of index ≤ C<br />

which has a nilpotent basis of length ≤ n − dim(X) for all large i ≥ i 0 .


42 VITALI KAPOVITCH AND BURKHARD WILKING<br />

We choose λ i < 1 ε i<br />

with λ i → ∞ such that (λ i M i , p i ) converges to (R u , 0) with<br />

u = dim(X). It is easy to see that (λ i ˜Mi /π 1 (M i , p i , ε i ), ˜p i ) converges to (R u , 0) as<br />

well. The claim now follows from the Induction Theorem (6.1) with f j i = id.<br />

We now apply the Normal Subgroup Theorem (9.1) with ε 1 = δ. There is<br />

a normal subgroup N i ⊳ π 1 (M i , g i ), some positive constants ε and C such that<br />

N i is contained in π 1 (B εi (p i )) → π 1 (M i , g i ) and N i is contained in the image of<br />

π 1 (B ε (q i ), q i ) → π 1 (M i , q i ) with index ≤ C for all q i ∈ M.<br />

By the Claim above, N i has a subgroup of bounded index with a nilpotent basis<br />

of length ≤ n − dim(X) ≤ n − 1.<br />

We are free to replace N i by a characteristic subgroup of bounded index and<br />

thus, we may assume that N i itself has a nilpotent basis of length ≤ n − dim(X).<br />

By <strong>Lemma</strong> 9.2 a), the number of possibilities for π 1 (M i )/N i is finite. In particular,<br />

part a) of Theorem 9.3 holds for the sequence and thus either b) or the<br />

addendum is the problem.<br />

By <strong>Lemma</strong> 9.2 b), we can assume that π 1 (M i , p i )/N i is boundedly presented. By<br />

passing to a subsequence we can assume that we have the same presentation for all i.<br />

Let g i1 , . . . , g iβ ∈ π 1 (M i ) denote elements with bounded displacement projecting to<br />

our chosen generator system of π 1 (M i )/N i . Moreover, there are finitely many words<br />

in g i1 , . . . , g il (independent of i) such that these words give a finite presentation of<br />

the group π 1 (M i )/N i .<br />

We can assume that g ij converges to g ∞j ∈ G ⊂ Iso(Y ) where (Y, ˜p ∞ ) is the<br />

limit of ( ˜M i , ˜p i ). Let N ∞ ⊳G be the limit of N i . By construction, G/N ∞ is discrete.<br />

Since π 1 (M i )/N i is boundedly presented, it follows that there is an epimorphism<br />

π 1 (M i )/N i → G/N ∞ induced by mapping g ij to g ∞j , for all large i.<br />

b). We plan to show that a subsequence satisfies b). We may assume that G⋆ ˜p ∞<br />

is not connected and by the Gap <strong>Lemma</strong> (2.4)<br />

(23)<br />

2ρ 0 := min{d(˜p ∞ , g˜p ∞ ) | g˜p ∞ /∈ G 0 ⋆ ˜p ∞ } > 0.<br />

Let r be the maximal nonnegative integer such that the following holds. After<br />

passing to a subsequence there is a subgroup H i ⊆ N i of rank r satisfying<br />

• H i ⊳ π 1 (M i ), N i /H i is torsion free and<br />

• there is a chain Tor(N i ) = N i0 ⊳ · · · ⊳ N ih0 = H i such that each group<br />

N ih is normal in π 1 (M i ), [N i , N ih ] ⊂ N ih−1 , N ih /N ih−1 is free abelian and<br />

for each h = 1, . . . , h 0 the induced representations of F = π 1 (M i )/N i in<br />

(N ih /N ih−1 ) ⊗ Z Q and in (N jh /N jh−1 ) ⊗ Z Q are equivalent for all i, j.<br />

Here we used implicitly that we have a natural isomorphism between π 1 (M i )/N i<br />

and π 1 (M j )/N j in order to talk about equivalent representations.<br />

Notice that these statements hold for H i = Tor(N i ). We need to prove that H i =<br />

N i . Suppose on the contrary that rank(H i ) < rank(N i ). Consider ˆM i := ˜M i /H i<br />

endowed with the action of Γ i := π 1 (M i )/H i , and let ˆp i be a lift of p i to ˆM i . By<br />

construction ˆN i := N i /H i is a torsion free normal subgroup of Γ i .<br />

We claim that there is a central subgroup A i ⊂ ˆN i of positive rank which is<br />

normal in Γ i and is generated by {a ∈ A i | d(ˆp i , aˆp i ) ≤ d i } for a sequence d i → 0:<br />

Recall that ˆN i is generated by {a ∈ ˆN i | d(ˆp i , aˆp i ) ≤ ε i }. If ˆN i is not abelian this<br />

implies that [ˆN i , ˆN i ] is generated by {a ∈ [ˆN i , ˆN i ] | d(ˆp i , aˆp i ) ≤ 4nε i }. If [ˆN i , ˆN i ] is


STRUCTURE OF FUNDAMENTAL GROUPS 43<br />

not central in ˆN i one replaces it by [ˆN i , [ˆN i , ˆN i ]]. After finitely many similar steps<br />

this proves the claim.<br />

For each positive integer l put l · A i = {g l | g ∈ A i } ⊳ Γ i . We define l i = 2 ui as<br />

the maximal power of 2 such that there is an element in l i · A i which displaces ˆp i<br />

by at most ρ 0 . Thus, any element in L i := l i · A i displaces ˆp i by at least ρ 0 /2.<br />

After passing to a subsequence we may assume that ( ˆM i , Γ i , ˆp i ) converges to<br />

(Ŷ , Ĝ, ˆp ∞) and the action of L i converges to an action of some discrete abelian<br />

subgroup L ∞ ⊳ Ĝ. Finally we let ˆN ∞ ⊳ Ĝ denote the limit group of ˆN i . Let g i ∈ L i<br />

be an element which displaces ˆp i by at most ρ 0 . Combining d(gi k ˆp i, g k+1<br />

i ˆp i ) ≤ ρ 0<br />

with our choice of ρ 0 , see (23), gives that the sets {gi k ˆp i | k ∈ Z} converge to a<br />

discrete subset in the identity component of the limit orbit Ĝ0 ⋆ ˆp ∞ . Therefore,<br />

{g ∈ L ∞ | g ⋆ ˆp ∞ ∈ Ĝ0 ⋆ ˆp ∞ }<br />

is discrete and infinite. Let K ⊂ Ĝ denote the isotropy group of ˆp ∞. By Colding and<br />

Naber K is a Lie group and thus it only has finitely many connected components.<br />

Hence L ′ ∞ := L ∞ ∩Ĝ0 ⊳Ĝ is infinite as well. Since the abelian group L′ ∞ is a discrete<br />

subgroup of a connected Lie group, it is finitely generated.<br />

We choose a free abelian subgroup ˆL ⊂ L ′ ∞ of positive rank which is normalized<br />

by Ĝ such that the induced representation of Ĝ in ˆL ⊗ Z Q is irreducible. Notice that<br />

ˆL ⊂ L ∞ commutes with ˆN ∞ . Hence we can view this as a representation of Ĝ/ˆN ∞ .<br />

Recall that π 1 (M i )/N i<br />

∼ = Γi /ˆN i is boundedly represented. Let ĝ i1 , . . . , ĝ iβ ∈ Γ i<br />

denote the images of g i1 , . . . , g iβ . There is an epimorphism Γ i /ˆN i → Ĝ/ˆN ∞ induced<br />

by sending ĝ im to its limit element ĝ ∞m ∈ Ĝ for all large i. Thus, ˆL is also naturally<br />

endowed with a representation of Γ i /ˆN i .<br />

Let b 1 , . . . , b l ∈ ˆL ∼ = Z l be a basis. For large i there are unique elements h i (b j ) ∈<br />

L i which are close to b j , j = 1, . . . , k. We extend h i to a Z-linear map h i : ˆL → L i .<br />

We plan to prove that h i : ˆL → L i is equivariant for large i. For any given<br />

linear combination ∑ l<br />

α=1 z αb α (z j ∈ Z) we know that ∑ l<br />

α=1 z αh i (b α ) is the unique<br />

element in L i which is close to ∑ lα=1 z αb α for all large i.<br />

For each ĝ ∞m and each b j we have ĝ ∞m b j ĝ∞m −1 = ∑ lα=1 z αb α for z α ∈ Z (we<br />

(∑ l<br />

suppress the dependence on m and j). We have just seen that h i α=1 z )<br />

αb α is<br />

close to ĝ ∞m b j ĝ∞m −1 for all large i. On the other hand, ĝ im h i (b j )ĝ −1<br />

im ∈ L i is the<br />

unique element in L i close to ĝ ∞m b j ĝ∞m −1 for all large i.<br />

In summary, h i (ĝ ∞m b j ĝ∞m) −1 = ĝ im h i (b j )ĝ −1<br />

im for m = 1, . . . , β, j = 1, . . . , k and<br />

all large i. This shows that h i is equivariant with respect to the representation.<br />

Thus, h i (ˆL) is a normal subgroup of Γ i and the induced representation of F =<br />

π 1 (M i )/N i = Γ/ˆN i in h i (ˆL) ⊗ Z Q and is isomorphic to the one in h j (ˆL) ⊗ Z Q for all<br />

large i, j.<br />

There is a unique subgroup A ′ i of ˆN i such that h i (ˆL) has finite index in A ′ i and<br />

ˆN i /A ′ i is torsion free. Let N ih 0+1 ⊳ N i denote the inverse image of A ′ i ⊂ ˆN i = N i /H i .<br />

Clearly, the representation of π 1 (M i )/N i in (N ih0+1/H i ) ⊗ Z Q is isomorphic to<br />

the one in (N jh0+1/H j ) ⊗ Z Q for all large i, j – a contradiction to our choice of H i .<br />

Proof of the addendum. Thus, the sequence satisfies a) and b) but not the<br />

addendum. Recall that N i has a nilpotent basis of length ≤ n − dim(X). Therefore<br />

dim(X) = 1 and N i is a torsion free group of rank n−1. Since X is one-dimensional<br />

we deduce that G/N ∞ is virtually cyclic, that is, it contains a cyclic subgroup of


44 VITALI KAPOVITCH AND BURKHARD WILKING<br />

finite index. This in turn implies that π 1 (M i )/N i is virtually cyclic and by a) we<br />

can assume that it is a fixed group F.<br />

The result (contradiction) will now follow algebraically from b): Let ρ j : F →<br />

GL(n j , Q) be a finite collection of irreducible rational representations j = 1, . . . , j 0 .<br />

Consider, for all nilpotent torsion free groups N of rank n − 1, all short exact<br />

sequences<br />

N → Γ → F<br />

with the property that there is a chain {e} = N 0 ⊳ · · · ⊳ N h0 = N such that<br />

[N, N h ] ⊂ N h−1 , each N h is normal in Γ, N h /N h−1 is free abelian group of positive<br />

rank and the induced representation of F in (N h /N h−1 )⊗ Z Q is in the finite collection.<br />

Using that F is virtually cyclic, it is now easy to see that this leaves only finitely<br />

many possibilities for the isomorphism type of Γ/N h0−1. This shows that if we<br />

replace N by N h0−1 we are still left with finitely many possibilities in a). – a<br />

contradiction.<br />

□<br />

Corollary 9.4. Given n, D there is a constant C such that any finite fundamental<br />

group of an n-manifold (M, g) with Ric > −(n − 1), diam(M) ≤ D contains a<br />

nilpotent subgroup of index ≤ C which has a nilpotent basis of length ≤ (n − 1).<br />

Example 2. Our theorems rule out some rather innocuous families of groups as<br />

fundamental groups of manifolds with lower Ricci curvature bounds.<br />

a) Consider a homomorphism h: Z → GL(2, Z) whose image does not contain<br />

a unipotent subgroup of finite index. Put h d (x) = h(dx) and consider the<br />

group Z ⋉ hd Z 2 . By part b) of Theorem 9.3 the following holds:<br />

In a given dimension n and for a fixed diameter bound D only finitely<br />

many of these groups can be realized as fundamental groups of manifold<br />

with Ric > −(n − 1) and diam(M) ≤ D.<br />

b) Consider the action of Z p k−1 on Z p k induced by 1 ↦→ 1 + p. In a given<br />

dimension n we have that for k ≥ n and p > C Marg (constant in the<br />

<strong>Margulis</strong> <strong>Lemma</strong>) the group Z p k−1 ⋉Z p k can not be a subgroup of a compact<br />

manifold with almost nonnegative Ricci curvature, since it does not have a<br />

nilpotent basis of length ≤ n.<br />

Similarly, by Corollary 9.4, for a given D there are only finitely many<br />

primes p such that Z p n−1 ⋉ Z p n is isomorphic to the fundamental group of<br />

an n-manifold with Ric > −(n − 1) and diam(M) ≤ D.<br />

Problem. Let D > 0. Can one find a finite collection of 3-manifolds such that for<br />

any 3-manifold M with Ric > −1 and diam(M) ≤ D, there is a manifold T in the<br />

finite collection and a finite normal covering T → M for which the covering group<br />

contains a cyclic subgroup of index ≤ 2?<br />

10. The Diameter Ratio Theorem<br />

The aim of this section is to prove Theorem 8.<br />

<strong>Lemma</strong> 10.1. Let X i be a sequence of compact inner metric spaces converging to<br />

a torus T. Then π 1 (X i ) is infinite for large i.<br />

The proof of the lemma is an easy exercise.


STRUCTURE OF FUNDAMENTAL GROUPS 45<br />

Proof of Theorem 8. Suppose, on the contrary, that (M i , g i ) is a sequence of compact<br />

manifolds with diam(M i ) = D, Ric Mi > −(n − 1), #π 1 (M i ) < ∞ and the<br />

diameter of the universal cover ˜M i tends to infinity.<br />

By Corollary 9.4, we know that π 1 (M i ) contains a subgroup of index ≤ C(n, D)<br />

which has a nilpotent basis of length ≤ (n − 1).<br />

Thus, we may assume that π 1 (M i ) itself has a nilpotent basis of length u < n.<br />

We also may assume that u is minimal with the property that a contradicting<br />

sequence exist. Put<br />

ˆM i := ˜M i /[π 1 (M i ), π 1 (M i )].<br />

Clearly [π 1 (M i ), π 1 (M i )] has a nilpotent basis of length ≤ u − 1. By construction<br />

ˆM i can not be a contradicting sequence and therefore diam( ˆM i ) → ∞.<br />

Let A i := π 1 (M i )/[π 1 (M i ), π 1 (M i )] denote the deck transformation group of the<br />

normal covering ˆM i → M i .<br />

1<br />

Claim. The rescaled sequence ˆM<br />

diam( ˆM i is precompact in the Gromov–Hausdorff<br />

i)<br />

topology and any limit space has finite Hausdorff dimension.<br />

The problem is, of course, that no lower curvature bound is available after rescaling.<br />

The claim will follow from a similar precompactness result for certain Cayley graphs<br />

of A i .<br />

Recall that diam(M i ) = D. Choose a base point ˆp i ∈ ˆM i . Let f 1 , . . . , f ki ∈ A i<br />

be an enumeration of all the elements a ∈ A i with d(ˆp i , aˆp i ) < 10D.<br />

There is no bound for k i , but clearly f 1 , . . . , f ki is a generator system of A i . We<br />

define a weighted metric on the abelian group A i as follows:<br />

⎧<br />

⎨∑k i<br />

d(e, a) := min |ν j | · d(ˆp, f j ˆp)<br />

⎩<br />

j=1<br />

∣<br />

∏k i<br />

j=1<br />

f νj<br />

j<br />

⎫<br />

⎬<br />

= a<br />

⎭<br />

for all a ∈ A i<br />

and d(a, b) = d(ab −1 , e) for all a, b ∈ A i . Note that this metric coincides with the<br />

restriction to A i of the inner metric on its Cayley graph where each edge corresponding<br />

to f j is given the length d(ˆp, f j ˆp). It is easy to see that the map<br />

ι i : A i → ˆM i , a i ↦→ a i ˆp i<br />

is a quasi isometry with uniform control on the constants involved. In fact, there<br />

is some L independent of i such that<br />

1<br />

L d(a, b) ≤ d(ι i(a), ι i (b)) ≤ Ld(a, b) for all a, b ∈ A i .<br />

and the image is of ι i is D-dense.<br />

Therefore, it suffices to show that<br />

1<br />

diam(A A i) i is precompact in the Gromov–<br />

Hausdorff topology and all limit spaces of convergent subsequences are finite dimensional.<br />

For the proof we need<br />

Subclaim. There is a R 0 > D (independent of i) such that the homomorphism<br />

h: A i → A i , x ↦→ x 2 satisfies that the R 0 neighborhood of h(B R (e)) contains<br />

(e), for all R and all i.<br />

B 3R<br />

2<br />

Using that B 10D (e) ⊂ A i is isometric to a subset of B 10D (ˆp i ), we can employ the<br />

Bishop–Gromov inequality in order to find a universal constant k such that B 10D (e)<br />

does not contain k points with pairwise distance ≥ D. Put R 0 := 10D · k.


46 VITALI KAPOVITCH AND BURKHARD WILKING<br />

There is nothing to prove if R ≤ 2R 0 /3. Suppose the statement holds for R ′ ≤<br />

R − D. We claim it holds for R.<br />

Let a ∈ B 3R/2 (e) \ B R0 (e). By the definition of the metric on A i , there are<br />

g 1 , . . . , g l ∈ A i with d(e, g j ) ≤ 10D, a = ∏ l<br />

j=1 g j and d(e, a) = ∑ l<br />

j=1 d(ˆp, g j ˆp). If<br />

there is any choice we assume in addition that l is minimal with these properties. By<br />

assumption l ≥ R0<br />

10D<br />

= k. By the choice of k, after a renumbering, we may assume<br />

that d(g 1 , g 2 ) ≤ D. Our assumption on l being minimal implies that d(e, g 1 g 2 ) ><br />

10D and we may assume 4D ≤ d(e, g 1 ) ≤ d(e, g 2 ). Thus,<br />

d ( e, g −2<br />

1 a) ≤ d ( e, (g 1 g 2 ) −1 a ) + D = d(e, a) − d(e, g 1 ) − d(e, g 2 ) + D<br />

≤ d(e, a) − 2d(e, g 1 ) + D ≤ d(e, a) − 7D.<br />

By assumption, this implies that g1 −2 a has distance ≤ R 0 to some b 2 ∈ A i with<br />

d(e, b) ≤ 3( 2 d(e, a)−2d(e, g1 )+D ) . Consequently, a has distance ≤ R 0 to g1b 2 2 ∈ A i<br />

with d(e, g 1 b) < 2 3d(e, a). This finishes the proof of the subclaim.<br />

Since the Ricci curvature of ˆMi is bounded below and the L-bilipschitz embedding<br />

ι i maps the ball B 50R0 (e) ⊂ A i to a subset of B 50R0 (ˆp i ), we can use Bishop–<br />

Gromov once more to see that there is a number Q > 0 (independent of i) such<br />

that the ball B 50R0 (e) ⊂ A i can be covered by Q balls of radius R 0 for all i.<br />

We now claim that the ball B 2R (e) ⊂ A i can be covered by Q balls of radius R for<br />

1<br />

all R ≥ 20R 0 and all i. This will clearly imply that<br />

diam(A A i) i is precompact in the<br />

Gromov–Hausdorff topology and that the limits have finite Hausdorff dimension.<br />

Consider the homomorphism<br />

h 8 : A i → A i , x ↦→ x 16 .<br />

It is obviously 16-Lipschitz since A i is abelian. Choose a maximal collection of<br />

points p 1 , . . . , p l ∈ B 2R (e) ⊂ A i with pairwise distances ≥ 2R 0 .<br />

5<br />

From the subclaim it easily follows that B 3R (e) ⊂ ⋃ l<br />

i=1 B 50R 0<br />

(h(p i )). Hence we<br />

can cover B 3R (e) by l · Q balls of radius R 0 .<br />

Consider now a maximal collection of points q 1 , . . . , q h ∈ B 2R (e) with pairwise<br />

distances ≥ R. In each of the balls B 2R (q i ) we can choose l points with pairwise<br />

5<br />

distances ≥ 2R 0 . Thus, B 3R (e) contains h · l points with pairwise distances ≥ 2R 0 .<br />

Since we have seen before that B 3R (e) can be covered by l · Q balls of radius R 0 ,<br />

this implies h ≤ Q as claimed.<br />

Thus,<br />

1<br />

diam( ˆM i)<br />

ˆM i is precompact in the Gromov–Hausdorff topology. After pass-<br />

1<br />

ing to a subsequence we may assume that ˆM<br />

diam( ˆM i → T. Notice that T comes<br />

i)<br />

with a transitive action of an abelian group. Therefore T itself has a natural group<br />

structure. Moreover, T is an inner metric space and the Hausdorff dimension of T is<br />

finite. Like Gromov in [Gro81] we can now deduce from a theorem of Montgomery<br />

Zippin [MZ55, Section 6.3] that T is a Lie group and thus, a torus.<br />

By <strong>Lemma</strong> 10.1, this shows π 1 ( ˆM i ) is infinite for large i – a contradiction.<br />

□<br />

Final Remarks<br />

We would like to mention that in [Wil11] the following partial converse of the<br />

<strong>Margulis</strong> <strong>Lemma</strong> (Theorem 1) is proved.


STRUCTURE OF FUNDAMENTAL GROUPS 47<br />

Theorem. Given C and n there exists m such that the following holds: Let ε > 0,<br />

and let Γ be a group containing a nilpotent subgroup N of index ≤ C which has a<br />

nilpotent basis of length ≤ n. Then there is a compact m-dimensional manifold M<br />

with sectional curvature K > −1 and a point p ∈ M such that Γ is isomorphic to<br />

the image of the homomorphism<br />

π 1<br />

(<br />

Bε (p), p ) → π 1 (M, p).<br />

Apart from the issue of finding the optimal dimension another difference to<br />

Theorem 1 is that this theorem uses the homomorphism to π 1 (M) rather than to<br />

π 1 (B 1 (p), p). This actually allows for more flexibility (by adding relations to the<br />

fundamental group at large distances to p). This is the reason why the following<br />

problem remains open.<br />

Problem. The most important problem in the context of the <strong>Margulis</strong> <strong>Lemma</strong><br />

for manifolds with lower Ricci curvature bound that remains open is whether or<br />

whether not one can arrange in Theorem 1 for the torsion of N to be abelian. We<br />

refer the reader to [KPT10] for some related conjectures for manifolds with almost<br />

nonnegative sectional curvature.<br />

References<br />

[And90] M. T. Anderson. Short geodesics and gravitational instantons. Journal of Differential<br />

geometry, 31(1):265–275, 1990.<br />

[CC96] J. Cheeger and T. H. Colding. Lower bounds on Ricci curvature and the almost rigidity<br />

of warped products. Ann. of Math., 144(1):189–237, 1996.<br />

[CC97] J. Cheeger and T. H. Colding. On the structure of spaces with Ricci curvature bounded<br />

below. I. J. Differential Geom., 46(3):406–480, 1997.<br />

[CC00a] J. Cheeger and T. H. Colding. On the structure of spaces with Ricci curvature bounded<br />

below. II. J. Differential Geom., 54(1):13–35, 2000.<br />

[CC00b] J. Cheeger and T. H. Colding. On the structure of spaces with Ricci curvature bounded<br />

below. III. J. Differential Geom., 54(1):37–74, 2000.<br />

[CG72] J. Cheeger and D. Gromoll. The splitting theorem for manifolds of nonnegative Ricci<br />

curvature. J. Differential Geom, 6:119–128, 1971/72.<br />

[CN10] T. Colding and A. Naber. Sharp hölder continuity of tangent cones for spaces with a<br />

lower ricci curvature bound and applications. http://front.math.ucdavis.edu/1102.5003,<br />

2010.<br />

[Col96] T. H. Colding. Large manifolds with positive Ricci curvature. Invent. Math., 124(1-<br />

3):193–214, 1996.<br />

[Col97] T. H. Colding. Ricci curvature and volume convergence. Ann. of Math. (2), 145(3):477–<br />

501, 1997.<br />

[FH83] F.T. Farrell and W.C. Hsiang. Topological characterization of flat and almost flat riemannian<br />

manifolds m n (n ≠ 3, 4). Am. J. Math., 105:641–672, 1983.<br />

[FQ90] M. H. Freedman and F. S. Quinn. Topology of 4-manifolds. Princeton Mathematical<br />

Series, 39. Princeton, NJ: Princeton University Press. viii, 259 p., 1990.<br />

[FY92] K. Fukaya and T. Yamaguchi. The fundamental groups of almost nonnegatively curved<br />

manifolds. Ann. of Math. (2), 136(2):253–333, 1992.<br />

[Gro78] M. Gromov. Almost flat manifolds. J. Differ. Geom., 13:231–241, 1978.<br />

[Gro81] M. Gromov. Groups of polynomial growth and expanding maps. Publications mathematiques<br />

I.H.É.S., 53:53–73, 1981.<br />

[Gro82] M. Gromov. Volume and bounded cohomology. Inst. Hautes Études Sci. Publ. Math.,<br />

56:5–99 (1983), 1982.<br />

[KPT10] V. Kapovitch, A. Petrunin, and W. Tuschmann. Nilpotency, almost nonnegative curvature<br />

and the gradient push. Annals of Mathematics, 171(1):343–373, 2010.<br />

[LR85] K. B. Lee and F. Raymond. Rigidity of almost crystallographic groups. In Combinatorial<br />

methods in topology and algebraic geometry (Rochester, N.Y., 1982), volume 44 of<br />

Contemp. Math., pages 73–78. Amer. Math. Soc., Providence, RI, 1985.


48 VITALI KAPOVITCH AND BURKHARD WILKING<br />

[Mil68] J. Milnor. A note on curvature and the fundamental group. Journal of Differential<br />

geometry, 2:1–7, 1968.<br />

[MZ55] D. Montgomery and L. Zippin. Topological transformation groups. Interscience Publishers,<br />

1955.<br />

[Ste93] E. Stein. Harmonic Analysis. Princeton University Press, 1993.<br />

[Wei88] G. Wei. Examples of complete manifolds of positive ricci curvature with nilpotent isometry<br />

groups. Bull. Amer. Math. Soci., 19(1):311–313, 1988.<br />

[Wil00] B. Wilking. On fundamental groups of manifolds of nonnegative curvature. Differential<br />

Geom. Appl., 13(2):129–165, 2000.<br />

[Wil11] B. Wilking. Fundamental groups of manifolds with bounded curvature. in preparation,<br />

2011.<br />

[Yam91] T. Yamaguchi. Collapsing and pinching under a lower curvature bound. Ann. of Math.,<br />

133:317–357, 1991.<br />

Department of Mathematics, University of Toronto, Toronto, Ontario, Canada<br />

E-mail address: vtk@math.toronto.edu<br />

University of Münster, Einsteinstrasse 62, 48149 Münster, Germany<br />

E-mail address: wilking@math.uni-muenster.de

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!