Statistical Power - People.stat.sfu.ca
STATISTICAL POWER ANALYSIS
IN WILDLIFE RESEARCH
November 19, 2011
Back to the basics:
α: probability of a false positive
(detecting an effect that doesn't exist).
β: probability of a false negative
(failing to detect an effect when it actually exists).
Power = 1 – β:
probability of correctly rejecting a false null hypothesis.
Practical definition: the PROBABILITY of detecting
an effect when the effect actually exists.
Background
Interrelated components: target power (1 – β),
α, sample size, and effect size:
• Probability of correctly detecting an effect.
• Probability of incorrectly detecting an effect.
• Sample size.
• Minimum response size that is considered biologically
significant.

Examples of the mutual relationship:
Target power = 0.8 with α = 0.05
Target power = 0.9 with α = 0.10
Background
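The mutual relationship among these components can be sketched numerically. A minimal illustration, using the normal approximation for a two-sided two-sample test (this helper is my own, not from the slides):

```python
from scipy.stats import norm

def approx_power(effect_size, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-sample test
    (normal approximation; the exact t-based answer differs slightly)."""
    z_crit = norm.ppf(1 - alpha / 2)
    # Noncentrality: standardized effect scaled by sqrt(n/2)
    shift = effect_size * (n_per_group / 2) ** 0.5
    return norm.cdf(shift - z_crit) + norm.cdf(-shift - z_crit)

# Raising alpha raises power for the same effect size and n,
# mirroring the 0.8/0.05 vs 0.9/0.10 trade-off above:
print(round(approx_power(0.4, 50, alpha=0.05), 2))
print(round(approx_power(0.4, 50, alpha=0.10), 2))
```

Holding effect size and n fixed, a looser α buys power; the slide's paired targets show the same trade-off.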
• Effect: "minimum response size that is considered
biologically significant."
• Statistical and biological significance are different.
• Biologically trivial differences may be statistically
significant with large sample sizes and high power.
• Biologically important differences may not be
statistically significant if power is low.
Background
Effect: magnitude of response, in original units
(e.g., an increase in fish concentration of 20 fish/m²).

Effect size: standardized effect, as a proportion or percentage
(if sd = 50 fish/m², effect size = 20/50 = 0.4, i.e., 40%).

Effect size (alternate form): percent difference from the mean value
(if mean = 60 fish/m², 72 fish/m² = 20% increase;
if mean = 60 fish/m², 45 fish/m² = 25% decrease).

Power to detect large effects is always greater
than power to detect small effects.
Background
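Both effect-size conventions above can be computed directly. A small sketch using the slide's fish-density numbers (the helper names are mine):

```python
def standardized_effect(delta, sd):
    """Standardized effect size: raw effect divided by the standard deviation."""
    return delta / sd

def percent_change(mean, new_value):
    """Effect expressed as a percent difference from the mean."""
    return 100.0 * (new_value - mean) / mean

print(standardized_effect(20, 50))   # 0.4, i.e. 40%
print(percent_change(60, 72))        # 20.0 (% increase)
print(percent_change(60, 45))        # -25.0 (% decrease)
```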
PPA: conducted before the experiment is carried out.
Goal: to improve the research design to increase the
probability of detecting biologically significant effects.

• Determine the probability that an effect size of
interest will be detected with a given sample size.
• Determine the sample size necessary to achieve
acceptably high power.
Prospective Power Analysis (PPA)
• Set a meaningful effect size, α, and sample size.
• Compute the range of power values produced by combinations of these parameters.
Prospective Power Analysis
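The second kind of prospective calculation, sample size for a target power, can be sketched with the usual normal-approximation formula (illustrative helper, not from the slides):

```python
import math
from scipy.stats import norm

def n_per_group(effect_size, power=0.8, alpha=0.05):
    """Approximate per-group n for a two-sided two-sample test
    at a given standardized effect size (normal approximation)."""
    z_alpha = norm.ppf(1 - alpha / 2)
    z_power = norm.ppf(power)
    return math.ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)

# Smaller effects demand much larger samples:
print(n_per_group(0.4))  # roughly 99 per group
print(n_per_group(0.2))  # roughly four times as many
```

Halving the effect size roughly quadruples the required n, which is why the minimum biologically significant effect must be chosen before the experiment.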
Objective in research design: minimize experimental
error and maximize the precision of parameter estimates.

Error reduction = increase in statistical power.

Choices that influence the power of the experiment:
• Range of treatment levels selected.
• Number and type of experimental units.
• Assignment of treatments to experimental units.
Power and Research Design
Typical constraints:
• Maximum number of replicates.
• Range of treatment levels.

Power can be increased cheaply by:
• Blocking.
• Measuring related information (covariates).
• Efficient experimental design.
Power and Research Design
Example: effect of people camping near nests on the time
eagles spend with their nestlings.
Treatments: 100 m and 500 m; effect size: 20%;
α = 0.1; power = 0.2.
Test: two-tailed t-test for independent samples.
Result: null hypothesis not rejected (t = 0.54, df = 52,
p = 0.59, observed effect = 4.5%, se = 4.1).

Problem: eagle nesting behaviour changes rapidly as
nestlings mature (not accounted for).
Power and Research Design
• Change to a crossover (paired) design:
treatment and control are both applied to the same
experimental unit (nest).
• Eliminates variability due to nestling age.
• Null hypothesis rejected (t = 2.19, df = 26, p = 0.038).
• Eagle behaviour changes when people camp near
their nests.
• Pooled sd in the CRD: 29.8; sd in the paired design: 10.7,
even though the sample size is half.
Power and Research Design
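The variance reduction from pairing can be seen in a quick simulation; the numbers here are invented for illustration and only echo the structure of the eagle example:

```python
import numpy as np

rng = np.random.default_rng(42)
n = 28  # nests

# Each nest has its own baseline (e.g., nestling-age effects)...
nest_baseline = rng.normal(60, 25, size=n)
# ...plus smaller within-nest noise around each measurement.
control = nest_baseline + rng.normal(0, 10, size=n)
treated = nest_baseline + 5 + rng.normal(0, 10, size=n)  # true effect = 5

pooled_sd = np.sqrt((control.var(ddof=1) + treated.var(ddof=1)) / 2)
paired_sd = (treated - control).std(ddof=1)

# Differencing within nests removes the shared baseline variability,
# so the paired sd is far smaller than the pooled sd.
print(pooled_sd > paired_sd)
```

The within-nest difference cancels the shared nest baseline, which is exactly why the paired eagle analysis had a much smaller sd despite half the sample size.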
RPA: conducted after the experiment has taken place.

If a null hypothesis is not rejected, there are two
possible reasons:
• No real effect existed.
• There is an effect, but it was not detected.

Type II error?
Retrospective Power Analysis (RPA)
Power is calculated using sample size, α, and the
observed effect size... but so is p!

Observed ("retrospective") power is determined by the
p value of the test, so it says nothing about the true
power of a test that failed to reject.
Retrospective Power Analysis (RPA)
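The circularity can be made explicit: for a z-test, "observed power" is a function of the p value alone, so it adds nothing the p value doesn't already say. A sketch (the helper is my own):

```python
from scipy.stats import norm

def observed_power(p, alpha=0.05):
    """'Retrospective' power computed by plugging the observed effect
    back in (z-test form). Note the only data input is p itself."""
    z_obs = norm.ppf(1 - p / 2)      # test statistic implied by p
    z_crit = norm.ppf(1 - alpha / 2)
    return norm.cdf(z_obs - z_crit) + norm.cdf(-z_obs - z_crit)

# A p exactly at the threshold always gives observed power near 0.5,
# and a larger p always gives lower observed power:
print(round(observed_power(0.05), 2))
print(observed_power(0.59) < observed_power(0.05))
```

Since p and observed power move in lockstep, computing the latter after a non-significant test cannot distinguish "no effect" from "effect missed".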
• Alternative to retrospective power analysis:
provide a range of plausible effect sizes.
• Confidence intervals provide information about
the true size of an effect instead of just
"statistically different from 0".
• The same factors that reduce power (low α,
small sample size, high sample variability) also
increase the width of confidence intervals.
Confidence Intervals and Power
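The link between interval width and the power-reducing factors can be sketched with a t-interval for a mean (illustrative helper, not from the slides):

```python
import math
from scipy.stats import t

def ci_half_width(sd, n, conf=0.95):
    """Half-width of a two-sided t confidence interval for a mean."""
    return t.ppf((1 + conf) / 2, df=n - 1) * sd / math.sqrt(n)

# Smaller samples and higher variability widen the interval,
# just as they lower power:
print(ci_half_width(50, 10) > ci_half_width(50, 50))
print(ci_half_width(50, 30) > ci_half_width(20, 30))
```

A design precise enough to give a narrow interval around the effect is also a design with high power, so reporting the interval conveys both the estimate and its reliability.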
• If the cost of an environmental effect could be great,
the consequences of a false negative (Type II) error
may outweigh those of a false positive (Type I) error.
• Example: can we harvest timber without adversely
affecting songbird populations?
• Typical null hypothesis: timber harvesting has no effect.
Conduct a low-power test, fail to reject the null.
Conclusion: no effect!
• (Wrong) assumptions:
cost of Type I error > cost of Type II error;
failure to reject = acceptance.
Consequences of Type I and Type II errors
Points to remember:
• Hypothesis testing has been overused.
• Practical (biological) importance is preferable
to statistical significance.
• Confidence intervals are better suited to conveying
practical importance.
• "Is there a significant effect?" should become: "What is
the magnitude of the effect?"
Points To Remember
THANK YOU!

(Now go and design
good experiments...)