Behavioural Surveillance Surveys - The Wisdom of Whores
Behavioural Surveillance Surveys - The Wisdom of Whores
Behavioural Surveillance Surveys - The Wisdom of Whores
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Should one- or two-tailed z-score values be used?<br />
In the examples in Figure 4, we are<br />
interested in detecting changes in one direction,<br />
an increase in the proportion <strong>of</strong> sex workers<br />
who always use a condom, and thus used a<br />
value <strong>of</strong> Z 1-α<br />
for a one-tailed test with a 95%<br />
confidence (1.645). This will result in a<br />
smaller sample size than if the corresponding<br />
value for a two-tailed test had been used.<br />
If we were interested in being able simultaneously<br />
to detect a change <strong>of</strong> the same<br />
magnitude in either direction, i.e. either an<br />
increase or a decrease, we would use a<br />
two-tailed test with a 90% confidence<br />
level, with a value <strong>of</strong> 1.96 instead <strong>of</strong> 1.645.<br />
In the latter case, the resulting required<br />
sample size for each survey round would be<br />
somewhat larger.<br />
As a general rule, the prudent course <strong>of</strong><br />
action is to use two-tailed values <strong>of</strong> Z 1-α<br />
/2.<br />
However, BSS is <strong>of</strong>ten undertaken in the<br />
context <strong>of</strong> prevention efforts which are<br />
deliberately aiming to produce a change in<br />
a given direction. In this case, it is reasonable<br />
to use one-tailed tests.<br />
<strong>The</strong> power <strong>of</strong> a study<br />
In the context <strong>of</strong> BSS, power is shorthand<br />
for the probability that a study will detect a<br />
change in behavior <strong>of</strong> a specified magnitude,<br />
if such a change did in fact occur. <strong>The</strong>re is<br />
no point carrying out a survey that does not<br />
have the power to detect the changes you<br />
aim to measure. To illustrate, suppose we<br />
wanted to be able to measure a change <strong>of</strong><br />
10 percentage points in the proportion <strong>of</strong> sex<br />
workers who always use a condom with their<br />
clients. We compare two pairs <strong>of</strong> hypothetical<br />
surveys taken 2 years apart: one with a sample<br />
size <strong>of</strong> n=500 in each survey round and the<br />
other with a sample size <strong>of</strong> n=200 per survey<br />
round. While both surveys might indicate the<br />
expected increase <strong>of</strong> 10 percentage points,<br />
the increase may well not be statistically<br />
significant for the survey with 200 respondents<br />
per round. Thus, we would be forced to<br />
conclude that there was no significant change<br />
in this behavior over the study period , when<br />
in fact there was a real increase that was not<br />
detectable with a sample size <strong>of</strong> n=200 per<br />
survey round. To ensure sufficient power, a<br />
minimum value <strong>of</strong> Z 1-β<br />
<strong>of</strong> .80 should be used.<br />
This means you can be 80 percent sure that if<br />
a change has occurred between survey<br />
rounds, your study will pick it up. Where<br />
resources permit, .90 is better yet.<br />
<strong>The</strong> level <strong>of</strong> significance<br />
In describing the results <strong>of</strong> a study, and<br />
particularly the measurement <strong>of</strong> changes over<br />
time, the phrase “statistically significant”<br />
is frequently used. If a measured increase<br />
in condom use over time is deemed to be<br />
statistically significant, it means that surveillance<br />
<strong>of</strong>ficials are confident that the observed<br />
change could not have occurred by chance,<br />
because <strong>of</strong> random differences in the characteristics<br />
<strong>of</strong> respondents selected for the survey.<br />
Survey designers must choose a level at which<br />
they wish to be confident that observed<br />
differences are not due to chance. Traditionally,<br />
this level is set at 95 percent. In other<br />
words, a statistically significant result is one<br />
where investigators are 95 percent sure that<br />
the observed change in behavior would<br />
not have happened by chance. Measures<br />
<strong>of</strong> significance are sometimes expressed as<br />
p-values. A p-value is the inverse <strong>of</strong> the<br />
level <strong>of</strong> significance. It indicates the probability<br />
that the observed could have happened by<br />
chance. A p-value <strong>of</strong> 0.05 means there is a<br />
five percent chance that the observed change<br />
could have occurred by chance. In other<br />
words, it corresponds to a 95 percent level<br />
<strong>of</strong> significance.<br />
54<br />
C H A PTER 4 B EHAV I OR A L S U R V EI L L A NC E S U R V EY S