Environmental Health Criteria 214
Environmental Health Criteria 214
Environmental Health Criteria 214
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
HUMAN EXPOSURE ASSESSMENT<br />
PM 2.5 concentration (Y) was regressed against outdoor air<br />
concentrations ( X 1 ), smoking rates ( X 2 ), cooking durations<br />
( X 3 ), air exchange rates ( X 4 ) and house volumes ( X 5 ) to<br />
determine the major factors affecting indoor PM 2.5 concentrations<br />
(Ozkaynak et al., 1996).<br />
Further information on ANOVA and linear regression may be found<br />
in Ott (1995), Kleinbaum et al. (1988) and most introductory<br />
statistics textbooks.<br />
4.4.4.2 Logistic regression<br />
An approach which is different from the linear or additive<br />
relationship described above is to consider a categorical outcome for<br />
exposures, e.g., exposure measurements grouped into ordinal levels<br />
such as low, medium and high. When the response is binary, that is, if<br />
an exposure is either present or absent (i.e., a threshold effect),<br />
then a linear relationship is not appropriate. In this case, we must<br />
use an alternative model, for example:<br />
logit P(Y = 1) = alpha + ß 1 X 1 + ß 2 X 2 + ... + ß k X k<br />
where the logit function is logit (x) = ln ( x/(1- x)), the<br />
function P(Y = 1) denotes the probability that the response variable<br />
Y will take on the value 1 (denoting "success"), and the role of<br />
epsilon from the previous model is taken by modelling the parameter<br />
representing the probability that Y = 1, as opposed to Y itself.<br />
This model is known as logistic regression. In this model, alpha<br />
denotes the baseline odds for exposure given that the associated<br />
factors, X ..., X k , are zero, and ß i denotes the change in the<br />
log-odds that the response is Y = 1 given a 1-unit increase in<br />
X i . This approach can be adjusted to allow for the analysis of<br />
other types of categorical outcomes (McCullagh & Nelder, 1990). One<br />
common parameter which describes logistic regression results is the<br />
odds ratio. For a particular set of covariates, X i, the odds of<br />
the event occurring ( Y = 1), is exp(ß X i) . To compare the odds<br />
ratio for two situations, compute the first set of odds and take its<br />
ratio over the second odds. Usually, the situations will be identical,<br />
except that the covariate of interest will be zero for one of the<br />
sets. For example, if the model for the linear combination of<br />
covariates is 1.3 +2.5 X, then the odds ratio for Y = 1, comparing<br />
X = 1 versus X = 0, is e (1.3+2.5) /e 1.3 . A similar computation can be<br />
done when X is a continuous random variable, for two different<br />
values of X. In exposure assessment, a logistic regression model<br />
could be used to evaluate the importance of demographic or temporal<br />
factors on the likelihood that an individual will engage in an<br />
activity such as applying pesticides.<br />
4.4.5 Sample size determination<br />
Hypothesis testing attempts to determine if the data reject or<br />
fail to reject a particular (null) hypothesis. The test is based upon<br />
statistical considerations, and hence just reports how likely or<br />
unlikely the null hypothesis is. If there is minimal information, it<br />
will be difficult to statistically reject the null hypothesis; hence,<br />
sample size calculations are done in order to ensure that there is<br />
sufficient information from which to make a decision. The decision is<br />
http://www.inchem.org/documents/ehc/ehc/ehc<strong>214</strong>.htm<br />
Page 75 of 284<br />
6/1/2007