12.11.2014 Views

SAE Manual Sections 1 to 4_1 (May 06).pdf - National Statistical ...

SAE Manual Sections 1 to 4_1 (May 06).pdf - National Statistical ...

SAE Manual Sections 1 to 4_1 (May 06).pdf - National Statistical ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

A Guide <strong>to</strong> Small Area Estimation - Version 1.1 05/05/20<strong>06</strong><br />

Figure 4.2: Key Questions for Small Area Modelling<br />

If NO<br />

Q1. Do you have good quality auxiliary Data?<br />

If Yes<br />

Use Linear or Generalised<br />

linear models, depending<br />

on your data.<br />

Q2. Is the variable of interest of continuous, binary or<br />

count data?<br />

If<br />

Continuous<br />

data: Linear<br />

model<br />

If Binary<br />

data:<br />

Logistic<br />

Model<br />

If Count<br />

Data:<br />

Poisson<br />

model<br />

Simple<br />

Direct or<br />

Broad Area<br />

Ratio<br />

Estima<strong>to</strong>rs<br />

are the<br />

likely<br />

candidates.<br />

Q3. Shall I use an area-level or unit-level model or both?<br />

Q4. At what level is my auxiliary data available and of<br />

good quality?<br />

Good Area Level<br />

or unit level<br />

continuous data<br />

Good<br />

Unit Level binary<br />

data<br />

Good Area Level<br />

count data<br />

Q5. Are there likely <strong>to</strong> be major differences between<br />

small areas that are not taken in<strong>to</strong> account by the<br />

auxiliary data?<br />

If Yes<br />

Use random effects model<br />

Consult methodology<br />

staff for technical advice<br />

The next key question (as indicated by questions 5 of Figure 4.2) is when and why do we<br />

use the random effects models as compared <strong>to</strong> the synthetic models. To start with, the<br />

preceding discussion on the choice of models (linear versus generalised linear) also<br />

applies <strong>to</strong> the random effects models as well. However, the random effects models are<br />

different in that they include an additional error component <strong>to</strong> account for differences<br />

between units that aren’t explained by the auxiliary variables. In other words, synthetic<br />

models assume that the variable of interest can be determined from the same functional<br />

relationship with the auxiliary variables, and that this relationship applies across all small<br />

areas.<br />

This assumption, however, could be restrictive for a number of reasons. For example, in<br />

the disability data some small areas are located in remote areas with limited support<br />

facilities and services while others are in big cities with better infrastructure and services<br />

where people with disability could move there <strong>to</strong> take advantage of the improved<br />

services. Some areas are may have larger population of indigenous people relative <strong>to</strong><br />

others which again may affect disability rates in different areas. Yet, others are located in<br />

coastal areas that attract people of retirement age and the elderly. These fac<strong>to</strong>rs are not<br />

fully accounted for in the auxiliary data. Thus, unless these and other fac<strong>to</strong>rs are taken<br />

in<strong>to</strong> account in the model, they could limit the predictive abilities of synthetic models<br />

Australian Bureau of Statistics 31

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!