Behavioural Surveillance Surveys - The Wisdom of Whores

Behavioural Surveillance Surveys - The Wisdom of Whores Behavioural Surveillance Surveys - The Wisdom of Whores

wisdomofwhores.com
from wisdomofwhores.com More from this publisher
14.03.2015 Views

Under the heading of non-probability sampling methods are a variety of approaches that are not based upon the statistical principles which govern probability samples. There are various reasons for using non-probability methods. Some methods (e.g. snowball or network sampling) are designed for use when probability sampling is not feasible. In snowball sampling, key informants in a sub-population identify other members of their community. These people are contacted, and they in turn identify further contacts. The process goes on until an adequate sample is achieved. Other methods (e.g. purposive sampling) are designed to provide the maximum amount of information possible for key groups of study subjects in order to develop and/or test social theories. Yet others (convenience sampling) are designed to obtain a sample of subjects at the least possible cost. In general, non-probability sampling methods are not intended to produce “representative” data for larger populations, although they are sometimes (incorrectly) used to try to do so. Probability sampling has two major advantages. Firstly, it is less prone to bias than non-probability methods and secondly, it permits the application of statistical theory to estimate sampling error from the survey data themselves. Consistent use of probability sampling methods in the context of BSS has the critical advantage of producing data which are comparable from one survey to the next, and which can therefore be used to measure statistically significant changes in risk behavior over time. Therefore probability sampling methods are the preferred choice for BSS whenever feasible. The major disadvantage of probability sampling is that a list or sampling frame is needed, and this can take time and resources to produce. While there are ways to make the task of developing sampling frames less costly and time consuming, the use of probability sampling methods will nevertheless involve greater time and expense than sampling approaches that do not require a list or sampling frame. While they are generally cheaper and easier to use, non-probability sampling methods have several important drawbacks. The first is the risk of sampling bias resulting from the subjectivity that often enters into the sample selection process. Where a list of sampling units is not available from which to select a sample following fixed rules, there is the danger that certain types of subjects will be disproportionately included in and others disproportionately excluded from the sample. Secondly, there is the issue of replicability, which is of key importance for surveys intended to monitor behavioral trends over time. Where sample selection criteria are not defined in operationally precise terms so that they can be replicated in subsequent survey rounds, there is a danger that observed changes will be due to changes in sampling rather than real changes in behavior. Finally, non-probability methods provide no statistical basis for assessing the precision or reliability of survey estimates. In fact, conventional statistical tests cannot reliably be used with non-probability samples, although in practice this limitation is often overlooked. B EHAV I OR A L S U R V EI L L A NC E SURV EY S CHAPTER 4 31

In the end, the issue boils down to one of credibility. A survey based upon nonprobability sampling methods may produce the same results as a probability survey, but the results will be harder to defend against skeptics who suspect that the findings may reflect poor sampling rather than actual behavior. Probability methods produce data that can be interpreted with much greater confidence. This should in turn translate into a firmer basis for decision-making in designing HIV prevention programs and in allocating resources. Early in the HIV epidemic, much research was conducted on an ad hoc basis — a response to the need for ANY information, as quickly as possible. More recently, however, the demand has grown for the systematic collection of high-quality data that can be interpreted and acted upon with greater confidence. This demand has spurred the development of methods to extend probability sampling as much as possible to surveys of population sub-groups that are difficult to enumerate. It is acknowledged, however, that the use of probability sampling methods will not be feasible for some populations; notably, those whose members do not congregate in fixed locations and for whom it is thus not feasible to develop a list or sampling frame. When a sampling frame cannot be constructed, the use of non-probability sampling methods is the only alternative. Guidance on how to make data collected under these circumstances as objective and replicable as possible is provided later in this chapter. Multi-stage cluster sampling methods In probability sampling, the key element in the sample selection process is randomization. This means that units and/or respondents are randomly selected from all those included in the sampling frame. This reduces potential bias. Randomization can occur at various levels; the scheme chosen will depend on the level of error surveillance managers are prepared to accept in their results, as well as what is most feasible. There is almost always a trade-off between these two elements. In general, easier types of sampling such as the multi-stage cluster sampling described below carry wider margins of error than simple random sampling from the entire population. This means that a larger sample size will be needed to achieve the same levels of precision. Where a complete list of all individuals in the group to be sampled is available, it is possible to select individuals randomly from that list. However this is rarely the case. More commonly, a list of larger units where the individuals gather is more likely to be available, or easy to construct. These units are known as primary sampling units (PSUs) or clusters. Examples might include schools, brothels, or gay bars. If a list of all PSUs can be compiled, a set number of PSUs can be selected at random. Then lists of individuals need only be compiled for the PSUs selected, and individuals can be chosen, preferably at random, from within the selected PSUs. As discussed below, some variants of multi-stage cluster sampling do not even require that a list of elements within sample clusters be created. 32 C H A PTER 4 B EHAV I OR A L S U R V EI L L A NC E S U R V EY S

In the end, the issue boils down to one<br />

<strong>of</strong> credibility. A survey based upon nonprobability<br />

sampling methods may produce<br />

the same results as a probability survey,<br />

but the results will be harder to defend against<br />

skeptics who suspect that the findings may<br />

reflect poor sampling rather than actual<br />

behavior. Probability methods produce data<br />

that can be interpreted with much greater<br />

confidence. This should in turn translate into<br />

a firmer basis for decision-making in designing<br />

HIV prevention programs and in allocating<br />

resources.<br />

Early in the HIV epidemic, much research<br />

was conducted on an ad hoc basis — a response<br />

to the need for ANY information, as quickly<br />

as possible. More recently, however, the<br />

demand has grown for the systematic<br />

collection <strong>of</strong> high-quality data that can be<br />

interpreted and acted upon with greater<br />

confidence. This demand has spurred the<br />

development <strong>of</strong> methods to extend probability<br />

sampling as much as possible to surveys <strong>of</strong><br />

population sub-groups that are difficult to<br />

enumerate. It is acknowledged, however,<br />

that the use <strong>of</strong> probability sampling methods<br />

will not be feasible for some populations;<br />

notably, those whose members do not<br />

congregate in fixed locations and for whom<br />

it is thus not feasible to develop a list or<br />

sampling frame. When a sampling frame<br />

cannot be constructed, the use <strong>of</strong> non-probability<br />

sampling methods is the only alternative.<br />

Guidance on how to make data collected<br />

under these circumstances as objective and<br />

replicable as possible is provided later in this<br />

chapter.<br />

Multi-stage cluster sampling<br />

methods<br />

In probability sampling, the key element in<br />

the sample selection process is randomization.<br />

This means that units and/or respondents are<br />

randomly selected from all those included in<br />

the sampling frame. This reduces potential<br />

bias. Randomization can occur at various<br />

levels; the scheme chosen will depend on<br />

the level <strong>of</strong> error surveillance managers are<br />

prepared to accept in their results, as well as<br />

what is most feasible. <strong>The</strong>re is almost always<br />

a trade-<strong>of</strong>f between these two elements.<br />

In general, easier types <strong>of</strong> sampling such as<br />

the multi-stage cluster sampling described<br />

below carry wider margins <strong>of</strong> error than<br />

simple random sampling from the entire<br />

population. This means that a larger sample<br />

size will be needed to achieve the same levels<br />

<strong>of</strong> precision.<br />

Where a complete list <strong>of</strong> all individuals<br />

in the group to be sampled is available, it is<br />

possible to select individuals randomly from<br />

that list. However this is rarely the case.<br />

More commonly, a list <strong>of</strong> larger units where<br />

the individuals gather is more likely to be<br />

available, or easy to construct. <strong>The</strong>se units<br />

are known as primary sampling units<br />

(PSUs) or clusters. Examples might include<br />

schools, brothels, or gay bars. If a list <strong>of</strong> all<br />

PSUs can be compiled, a set number <strong>of</strong> PSUs<br />

can be selected at random. <strong>The</strong>n lists <strong>of</strong><br />

individuals need only be compiled for the<br />

PSUs selected, and individuals can be chosen,<br />

preferably at random, from within the selected<br />

PSUs. As discussed below, some variants <strong>of</strong><br />

multi-stage cluster sampling do not even<br />

require that a list <strong>of</strong> elements within sample<br />

clusters be created.<br />

32<br />

C H A PTER 4 B EHAV I OR A L S U R V EI L L A NC E S U R V EY S

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!