12.07.2015 Views

Initial sequencing and analysis of the human genome - Vitagenes

Initial sequencing and analysis of the human genome - Vitagenes

Initial sequencing and analysis of the human genome - Vitagenes

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

articlesThe most extreme mechanism is whole-<strong>genome</strong> duplication(WGD), through a polyploidization event in which a diploidorganism becomes tetraploid. Such events are classi®ed as autopolyploidyor allopolyploidy, depending on whe<strong>the</strong>r <strong>the</strong>y involvehybridization between members <strong>of</strong> <strong>the</strong> same species or differentspecies. Polyploidization is common in <strong>the</strong> plant kingdom, withmany known examples among wild <strong>and</strong> domesticated crop species.Alfalfa (Medicago sativa) is a naturally occurring autotetraploid 364 ,<strong>and</strong> Nicotiana tabacum, some species <strong>of</strong> cotton (Gossypium) <strong>and</strong>several <strong>of</strong> <strong>the</strong> common brassicas are allotetraploids containing pairs<strong>of</strong> `homeologous' chromosome pairs.In principle, WGD provides <strong>the</strong> raw material for great bursts <strong>of</strong>innovation by allowing <strong>the</strong> duplication <strong>and</strong> divergence <strong>of</strong> entirepathways. Ohno 365 suggested that WGD has played a key role inevolution. There is evidence for an ancient WGD event in <strong>the</strong>ancestry <strong>of</strong> yeast <strong>and</strong> several independent such events in <strong>the</strong> ancestry<strong>of</strong> mustard weed 366±369 . Such ancient WGD events can be hard todetect because only a minority <strong>of</strong> <strong>the</strong> duplicated loci may beretained, with <strong>the</strong> result that <strong>the</strong> genes in duplicated segmentscannot be aligned in a one-to-one correspondence but ra<strong>the</strong>rrequire many gaps. In addition, duplicated segments may besubsequently rearranged. For example, <strong>the</strong> ancient duplication in<strong>the</strong> yeast <strong>genome</strong> appears to have been followed by loss <strong>of</strong> more than90% <strong>of</strong> <strong>the</strong> newly duplicated genes 366 .One <strong>of</strong> <strong>the</strong> most controversial hypo<strong>the</strong>ses about vertebrateevolution is <strong>the</strong> proposal that two WGD events occurred early in<strong>the</strong> vertebrate lineage, around <strong>the</strong> time <strong>of</strong> jawed ®shes some 500 Myrago. Some authors 370±373 have seen support for this <strong>the</strong>ory in <strong>the</strong> factthat many <strong>human</strong> genes occur in sets <strong>of</strong> four homologuesÐmostnotably <strong>the</strong> four extensive HOX gene clusters on chromosomes 2, 7,12 <strong>and</strong> 17, whose duplication dates to around <strong>the</strong> correct time.However, o<strong>the</strong>r authors have disputed this interpretation 374 ,suggesting that <strong>the</strong>se cases may re¯ect unrelated duplications <strong>of</strong>speci®c regions ra<strong>the</strong>r than successive WGD.We analysed <strong>the</strong> draft <strong>genome</strong> sequence for evidence that mightbear on this question. The <strong>analysis</strong> provides many interestingobservations, but no convincing evidence <strong>of</strong> ancient WGD. Welooked for evidence <strong>of</strong> pairs <strong>of</strong> chromosomal regions containingmany homologous genes. Although we found many pairs containinga few homologous genes, <strong>the</strong> <strong>human</strong> <strong>genome</strong> does not appear tocontain any pairs <strong>of</strong> regions where <strong>the</strong> density <strong>of</strong> duplicated genesapproaches <strong>the</strong> densities seen in yeast or mustard weed 366±369 .We also examined <strong>human</strong> proteins in <strong>the</strong> IPI for which <strong>the</strong>orthologues among ¯y or worm proteins occur in <strong>the</strong> ratios 2:1:1,3:1:1, 4:1:1 <strong>and</strong> so on (Fig. 49). The number <strong>of</strong> such families fallssmoothly, with no peak at four <strong>and</strong> some instances <strong>of</strong> ®ve or morehomologues. Although this does not rule out two rounds <strong>of</strong> WGDfollowed by extensive gene loss <strong>and</strong> some unrelated gene duplication,it provides no support for <strong>the</strong> <strong>the</strong>ory. More probatively, if twosuccessive rounds <strong>of</strong> <strong>genome</strong> duplication occurred, phylogenetic<strong>analysis</strong> <strong>of</strong> <strong>the</strong> proteins having 4:1:1 ratios between <strong>human</strong>, ¯y <strong>and</strong>worm would be expected to show more trees with <strong>the</strong> topology(A,B)(C,D) for <strong>the</strong> <strong>human</strong> sequences than (A,(B,(C,D))) 375 .However,<strong>of</strong> 57 sets studied carefully, only 24% <strong>of</strong> <strong>the</strong> trees constructedfrom <strong>the</strong> 4:1:1 set have <strong>the</strong> former topology; this is not signi®cantlydifferent from what would be expected under <strong>the</strong> hypo<strong>the</strong>sis <strong>of</strong>r<strong>and</strong>om sequential duplication <strong>of</strong> individual loci.100Occurrences101 2 3 4 5 6 7 8 9 10 11 12100 10 20 30 40 50 60 70 80 90 100Genes per conserved segmentFigure 47 Distribution <strong>of</strong> number <strong>of</strong> genes per conserved segment between <strong>human</strong> <strong>and</strong>mouse <strong>genome</strong>s.10013 14 15 16 17 18 19 20 21 22 X Y1 2 3 4 5 6 7 8 9 10Occurrences1011 12 13 14 15 16 17 18 19 X YFigure 46 Conserved segments in <strong>the</strong> <strong>human</strong> <strong>and</strong> mouse <strong>genome</strong>. Humanchromosomes, with segments containing at least two genes whose order is conserved in<strong>the</strong> mouse <strong>genome</strong> as colour blocks. Each colour corresponds to a particular mousechromosome. Centromeres, subcentromeric heterochromatin <strong>of</strong> chromosomes 1, 9 <strong>and</strong>16, <strong>and</strong> <strong>the</strong> repetitive short arms <strong>of</strong> 13, 14, 15, 21 <strong>and</strong> 22 are in black.105 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90Conserved segment length (Mb)Figure 48 Distribution <strong>of</strong> lengths (in 5-Mb bins) <strong>of</strong> conserved segments between <strong>human</strong><strong>and</strong> mouse <strong>genome</strong>s, omitting singletons.910 © 2001 Macmillan Magazines Ltd NATURE | VOL 409 | 15 FEBRUARY 2001 | www.nature.com

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!