12.02.2013 Views

Environmental Health Criteria 214

Environmental Health Criteria 214

Environmental Health Criteria 214

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

HUMAN EXPOSURE ASSESSMENT<br />

PM 2.5 concentration (Y) was regressed against outdoor air<br />

concentrations ( X 1 ), smoking rates ( X 2 ), cooking durations<br />

( X 3 ), air exchange rates ( X 4 ) and house volumes ( X 5 ) to<br />

determine the major factors affecting indoor PM 2.5 concentrations<br />

(Ozkaynak et al., 1996).<br />

Further information on ANOVA and linear regression may be found<br />

in Ott (1995), Kleinbaum et al. (1988) and most introductory<br />

statistics textbooks.<br />

4.4.4.2 Logistic regression<br />

An approach which is different from the linear or additive<br />

relationship described above is to consider a categorical outcome for<br />

exposures, e.g., exposure measurements grouped into ordinal levels<br />

such as low, medium and high. When the response is binary, that is, if<br />

an exposure is either present or absent (i.e., a threshold effect),<br />

then a linear relationship is not appropriate. In this case, we must<br />

use an alternative model, for example:<br />

logit P(Y = 1) = alpha + ß 1 X 1 + ß 2 X 2 + ... + ß k X k<br />

where the logit function is logit (x) = ln ( x/(1- x)), the<br />

function P(Y = 1) denotes the probability that the response variable<br />

Y will take on the value 1 (denoting "success"), and the role of<br />

epsilon from the previous model is taken by modelling the parameter<br />

representing the probability that Y = 1, as opposed to Y itself.<br />

This model is known as logistic regression. In this model, alpha<br />

denotes the baseline odds for exposure given that the associated<br />

factors, X ..., X k , are zero, and ß i denotes the change in the<br />

log-odds that the response is Y = 1 given a 1-unit increase in<br />

X i . This approach can be adjusted to allow for the analysis of<br />

other types of categorical outcomes (McCullagh & Nelder, 1990). One<br />

common parameter which describes logistic regression results is the<br />

odds ratio. For a particular set of covariates, X i, the odds of<br />

the event occurring ( Y = 1), is exp(ß X i) . To compare the odds<br />

ratio for two situations, compute the first set of odds and take its<br />

ratio over the second odds. Usually, the situations will be identical,<br />

except that the covariate of interest will be zero for one of the<br />

sets. For example, if the model for the linear combination of<br />

covariates is 1.3 +2.5 X, then the odds ratio for Y = 1, comparing<br />

X = 1 versus X = 0, is e (1.3+2.5) /e 1.3 . A similar computation can be<br />

done when X is a continuous random variable, for two different<br />

values of X. In exposure assessment, a logistic regression model<br />

could be used to evaluate the importance of demographic or temporal<br />

factors on the likelihood that an individual will engage in an<br />

activity such as applying pesticides.<br />

4.4.5 Sample size determination<br />

Hypothesis testing attempts to determine if the data reject or<br />

fail to reject a particular (null) hypothesis. The test is based upon<br />

statistical considerations, and hence just reports how likely or<br />

unlikely the null hypothesis is. If there is minimal information, it<br />

will be difficult to statistically reject the null hypothesis; hence,<br />

sample size calculations are done in order to ensure that there is<br />

sufficient information from which to make a decision. The decision is<br />

http://www.inchem.org/documents/ehc/ehc/ehc<strong>214</strong>.htm<br />

Page 75 of 284<br />

6/1/2007

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!