12.02.2013 Views

Environmental Health Criteria 214

Environmental Health Criteria 214

Environmental Health Criteria 214

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

HUMAN EXPOSURE ASSESSMENT<br />

be the time of day when the levels are measured. For simplicity,<br />

divide time into three categories -- morning, afternoon, or evening --<br />

so k = 3. If there is no known or hypothesized functional form for<br />

the relationship, the resulting abstract model for exposure, Y,<br />

during a particular time period, i, should be the sum of the mean<br />

(average) during the time period i, denoted by gamma i, and an<br />

error, epsilon, which will represent the natural variation of the<br />

measurement. It is common to assume that the variation of the outcome<br />

is the same among all levels of the factor; this assumption is known<br />

as homoscedasticity.<br />

This model is referred to as the one-way ANOVA. The resulting<br />

model for the observed data, with Y ij denoting the jth PM 2.5<br />

measurement collected during the ith time period with i ranging<br />

from 1 to 3, is Y ij = gamma i + epsilon ij where gamma i represents<br />

the average outcome due to the ith factor level (in this example<br />

i ranges from 1 to 3), and epsilon ij (the error term) represents<br />

independent random variation. One common assumption is that the error<br />

terms follow a normal distribution with variance rho 2 . The parameters<br />

which need to be estimated in this model from the data are the means<br />

of the subsamples, gamma i , and the variance of the outcome, rho 2 .<br />

This type of model, which compares the means of distinct groups, is<br />

the basis for ANOVA.<br />

Increasing the level of complexity leads to multiway or<br />

multifactor ANOVA as well as the multiple linear regression model,<br />

which is a more specific model for the effects of independent<br />

variables on the dependent variable. Let Y denote the exposure level<br />

for a particular person or location; this is the dependent variable.<br />

Let X, ..., X n denote n independent variables (known as<br />

covariates) which potentially influence the exposure level Y. If the<br />

assumption of the existence of a linear relationship between the<br />

independent and dependent variables is reasonable, then a model for<br />

the outcome, Y, based on the covariates X i , can be written as<br />

where the information not conveyed by the covariates results in the<br />

error (epsilon), which is assumed to be normally distributed. theta 0<br />

denotes the average exposure when all the X values are zero, and<br />

theta i denotes the change in exposure for a unit change in the ith<br />

variable. An example would be 24-h personal exposures to nitrogen<br />

dioxide. In this case, the factors may be distinct times and locations<br />

(or microenvironments) where nitrogen dioxide exposure may occur; for<br />

example, outdoors, indoors while home cooking on a gas range, and in<br />

an automobile.<br />

A regression model is used to evaluate the relationship of one or<br />

more independent variables X 1 , X 2 , ..., X k to a single,<br />

continuous dependent variable Y. It is often used in exposure<br />

assessment to characterize the relationship between the dependent and<br />

independent variables (continuous and discrete) by determining the<br />

extent, direction and strength of the association. For example, in the<br />

particle total exposure assessment methodology (PTEAM) study, indoor<br />

http://www.inchem.org/documents/ehc/ehc/ehc<strong>214</strong>.htm<br />

Page 74 of 284<br />

6/1/2007

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!