13.07.2015 Views

(' § ди)0 132ев ¡ '¢ )5 4 © ев6¡ ¢87 ' ¤ 9 - School of Mathematics

(' § ди)0 132ев ¡ '¢ )5 4 © ев6¡ ¢87 ' ¤ 9 - School of Mathematics

(' § ди)0 132ев ¡ '¢ )5 4 © ев6¡ ¢87 ' ¤ 9 - School of Mathematics

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

¦MATH2740 Environmental StatisticsBackground 2 (quadrat analysis)University <strong>of</strong> Leeds<strong>School</strong> <strong>of</strong> <strong>Mathematics</strong>Semester 2Poisson index <strong>of</strong> dispersion testConsider the following frequency data from a regular 10 10 grid <strong>of</strong> quadrats:No. <strong>of</strong> indivs. per quadrat (<strong>¡</strong>£¢) 0 1 2 3 4 5 6 7 8 9+ Total (<strong>¤</strong>)Observed no. <strong>of</strong> quadrats (¥¢) 21 27 22 14 8 2 3 1 2 0 100First find the sample mean and variance:and<strong>©</strong> ¥¢<strong>¡</strong>¢ <strong>¤</strong> <strong>§</strong><strong>§</strong> <strong>¡</strong>¨<strong>§</strong>Then, the test statistic:<strong>§</strong> <strong>¤</strong>¨ ! ¥¢<strong>¡</strong> ¢#"<strong>©</strong> ¥¢$<strong>¡</strong>¢&%<strong>¤</strong> ' <strong>§</strong> !()*#"% +' <strong>§</strong>-,., which follows 57698distribution.a4<strong>§</strong>0"<strong>¤</strong>¨% ¦ /<strong>§</strong>,123The critical values can be obtained by noting<strong>¡</strong>that<strong>§</strong>H.2). =A:CB%D<strong>§</strong>AEF with:G;4?@?Since/:CB P"!:=andlies outside this range we can reject the null hypothesis, that is there is strongevidence that the pattern is non-random.(,) <strong>§</strong>Note that since the variance is bigger than the mean the data is overdispersed, and so we might considera contagious distribution to be a better model.4goodness-<strong>of</strong>-fit testConsider the above data again:(KNo. <strong>of</strong> indivs. (<strong>¡</strong>I¢) 0 1 2 3 4 5 6 7 8 9+ (<strong>¤</strong>)Obs. frequencies (J) 21 27 22 14 8 2 3 1 2 0 100Exp. frequencies 14.1 27.6 27.05 17.68 8.66 3.39 1.11 0.31 0.08 0.03 100To calculate the expected frequencies recall that for the Poisson distribution parameterL the is equal tothe mean and hence we can estimate it by the mean,MLN<strong>§</strong> sample <strong>¡</strong>¨<strong>§</strong>(from above).¦)The expected frequencies are then simply the probabilities, evaluated for each <strong>of</strong><strong>¡</strong> value multiplied bythe total frequency, is:<strong>¤</strong> that <strong>§</strong>P<strong>¡</strong>9%for<strong>¡</strong>¨<strong>§</strong>>E33.P"O1


,MATH2740 Environmental StatisticsBackground 2 (quadrat analysis)University <strong>of</strong> Leeds<strong>School</strong> <strong>of</strong> <strong>Mathematics</strong>Semester 2It is a requirement for a reliable test that none <strong>of</strong> the expected frequencies are too small, and this isusually taken to mean that none should be less than about 5. So we should combine some <strong>of</strong> the smallfrequencies into a single group, say<strong>¡</strong><strong>¡</strong> PFwith observed frequency2and expected frequency)..The4test statistics is then:O <strong>§</strong> "J> K %<strong>§</strong>where ’s are the observed values and(1’s the expected frequencies. Comparing this with the criticalvalue )7from tablesKtheK theJ<strong>of</strong>4¢6"EF%C<strong>§</strong>number <strong>of</strong> frequency classes after pooling)we conclude that we cannot reject the null hypothesis that the data are from a Poisson distribution.with £ <strong>§</strong> (theBut the index <strong>of</strong> dispersion test rejected the same null hypothesis quite strongly!Note that the degrees <strong>of</strong> freedom in the4test is .because there are two restrictions, that the sum <strong>of</strong>the observed and expected frequencies are equal and that the parameterL is equal to the sample mean.£Index <strong>of</strong> dispersion test vs. goodness-<strong>of</strong>-fit testConsider the following artificial example:(KNo. <strong>of</strong> indivs. (<strong>¡</strong>I¢) 0 1 2 3 4 5 6+ (<strong>¤</strong>)Obs. frequencies (J) 20 76 0 0 0 5 0 101Exp. frequencies 37.16 27.16 18.58 6.19 1.55 0.31 0.05 101For the index test,¦<strong>of</strong> <strong>§</strong>#anddispersion that is the mean and variance are exactly equal, andthe Poisson hypothesis is accepted; but the observed and expected frequencies are clearly different.<strong>§</strong> <strong>¡</strong>)The4statistic is,.)with4"EF%G<strong>§</strong>-F. Thepooling the four smallest frequency classes) which should be comparedtest statistic is far larger and so we clearly reject the Poisson assumption.(afterSo either test may detect non-randomness even when the other fails to do so. Note, however, that datalike this, which is generally regular with occasional clusters, is rather unlikely to arise in practice.Note that the index <strong>of</strong> dispersion test can be used for quite small sample sizes, while the4test requireslarger samples to get sufficient classes with moderate expected frequencies.Fitting a negative binomial distributionConsider the distribution <strong>of</strong> the number <strong>of</strong> Plantago Major plants in 100 sq.cm. quadrats 1 :1 J.G. Skellam. (1952). Studies in statistical ecology: I spatial pattern. Biometrika, 39, 346-362.2


andM<strong>¡</strong> <strong>§</strong><strong>§</strong> <strong>¤</strong>¦ <strong>¡</strong> MMATH2740 Environmental StatisticsBackground 2 (quadrat analysis)University <strong>of</strong> Leeds<strong>School</strong> <strong>of</strong> <strong>Mathematics</strong>Semester 2give:<strong>¤</strong> <strong>§</strong>P),¦ <strong>¡</strong> <strong>§</strong>PE2F.F and"<strong>¤</strong>¨% <strong>§</strong>No. <strong>of</strong> plants (<strong>¡</strong>I¢) 0 1 2 3 4 5 6 7 8 9 (<strong>¤</strong>)Obs. Frequencies (¥¢) 235 81 43 18 9 6 4 3 0 1 400(E.(F, These leading to:"<strong>¤</strong>¨% <strong>§</strong>>E)7,E)72,<strong>¡</strong> M% <strong>§</strong>>E)(Fitting a negative binomial distribution, the expected frequencies are givenM¦E33with"C<strong>§</strong><strong>¡</strong> and replaced byM<strong>¡</strong> andM. If calculated by hand, it is easiest to use the recurrence relationP"Oby<strong>¤</strong>(see Background 1).No. <strong>of</strong> plants (<strong>¡</strong>I¢) 0 1 2 3 4 5 6 7 8 9 (<strong>¤</strong>)Obs. frequencies (J) 235 81 43 18 9 6 4 3 0 1 400Exp. frequencies 232.2 85.4 40.0 20.1 10.4 5.5 2.9 3.5 400<strong>§</strong> <strong>¡</strong>9%for<strong>¡</strong>)To determine whether or not this is a good fit, the4test is performed pooling for<strong>¡</strong> values(so as toavoid very small expected frequencies (we might pooling<strong>¡</strong><strong>¡</strong> consider P).The isOtest statistics based £ categories and degrees <strong>of</strong> freedom (becausewe have three restrictions, estimation <strong>of</strong> <strong>¡</strong> and and equality <strong>of</strong> the sum <strong>of</strong> the observed and expectedon2frequencies). do not reject the null hypothesis, and so conclude that thenegative binomial distribution is a good<strong>§</strong>0Ffit.>, ,E) <strong>§</strong>Since4<strong>¤</strong>"EF%<strong>§</strong> (weTwo-term quadrat variance methodSuppose that we have<strong>¤</strong> <strong>§</strong>-2contiguous quadrats with counts:The average variance is calculated for a range <strong>of</strong> block sizes, using quadrats in pairs:"3E3,3F3.33@)37%Block size 1 (1,0) (0,3) (3,5) (5,2) (2,1) (1,4) (4,6) Average varianceVariance <strong>of</strong> pairs 0.5 4.5 2 4.5 0.5 2.5 2 2.64Block size 2 (1+0,3+5) (0+3,5+2) (3+5,2+1) (5+2,1+4) (2+1,4+6) Average varianceVariance <strong>of</strong> pairs 24.5 8 12.5 2 24.5 14.33


"<strong>¡</strong>8<strong>¡</strong> <strong>§</strong> <strong>¡</strong>% "<strong>¡</strong>8¢ . ' <strong>¡</strong> <strong>¡</strong>% . ' ¦ <strong>¡</strong>9% <strong>¡</strong>8 <strong>¡</strong> <strong>¡</strong>8 <strong>¡</strong> <strong>¡</strong>8 <strong>§</strong>" 3.3E3,3@)33@)3E33E333,E3E3(3F3.3E3333,3(3)3.333 E3.3@)33F3.33E3E33.3@)3(3,E3E3E33E333,E33F3.3E333X<strong>§</strong>E33,323@)33E3E333E3FE3E3,E33E3E3E3333@)3(3)3.333MATH2740 Environmental StatisticsBackground 2 (quadrat analysis)University <strong>of</strong> Leeds<strong>School</strong> <strong>of</strong> <strong>Mathematics</strong>Semester 2Block size 3 (1+0+3,5+2+1) (0+3+5,2+1+4) (3+5+2,1+4+6) Average varianceVariance <strong>of</strong> pairs 8 0.5 0.5 3Block size 4 (1+0+3+5,2+1+4+6) Average varianceVariance <strong>of</strong> pairs 8 8The average variances per sampling unit (i.e. per quadrat) are then: . (1F3,,The peak <strong>of</strong> the average variance per quadrat is at block size 2, this is suggestive <strong>of</strong> clustering with meancluster radius <strong>of</strong> 2 quadrats (or, equivalently, a gap between cluster centers <strong>of</strong> 4 quadrats).<strong>§</strong>-. ) <strong>§</strong>32 <strong>§</strong> <strong>§</strong>-.)3), .)Notes:1. It is useful to recall that, for any pair <strong>of</strong> numbers"<strong>¡</strong>83@<strong>¡</strong>%the sample variance <strong>of</strong> the pair can becalculated as:"<strong>¡</strong>¢£2. Whilst the method can provide variances for any block size to8uprestrict maximum block to8sizelarger block sizes.<strong>¤</strong><strong>¤</strong> (<strong>¤</strong> = F) or8' <strong>§</strong> ."<strong>¡</strong>8 <strong>¡</strong>% .8<strong>¤</strong>£<strong>¤</strong> (<strong>¤</strong>¦¥ ),. 'as(as above), it is usual toestimates lack precision at<strong>¤</strong>A less trivial example! An insect ecologist studying the spatial pattern <strong>of</strong> fire ant hills in a pasturein Florida samples the pasture with a transect 10m wide and 1280m long, in 10m sections giving 128sampling units. The vectors <strong>of</strong> observations <strong>of</strong> the number <strong>of</strong> ant hills observed in each quadrat was:4% E33.3@)3(3F3,33E3E3E3.E33,E323) E3E33E3F3(3.33E3E3E333@)33,3E3E3333E3E3.3F32E3,E3.


8<strong>¤</strong>£ .2<strong>¡</strong> 8MATH2740 Environmental StatisticsBackground 2 (quadrat analysis)University <strong>of</strong> Leeds<strong>School</strong> <strong>of</strong> <strong>Mathematics</strong>Semester 2The average sample variance per sampling unit is calculated for blockusing quadrats in pairs. i.e.:sizes3.33Block size 1:Block size 2:Block size 3:etc." ,,In each case, the sample variance <strong>of</strong> the pair is calculated, hence giving the average variance for eachblock size. This average is divided by the block size, and then plotted against the block size. A peak inthe graph indicates the block size at which clustering occurs..3 ,%3". E3, )%3" ,3@) %3 "3.%3".37%3"E3,7%3",3)%3E3, ) %3". ,3@) )%3" , )3 ) 7%3 .Variance per sampling unit2 6 10 142 4 6 8 10 12Block sizeFig. 1: Plot <strong>of</strong> variance per sampling unit against block size for fire ant hill dataFig. 1 is suggestive <strong>of</strong> clustering <strong>of</strong> fire ant hills, with mean cluster radius <strong>of</strong> around 4 quadrats (or gapbetween clusters <strong>of</strong> 80m).5

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!