12.01.2015 Views

RESEARCH METHOD COHEN ok

RESEARCH METHOD COHEN ok

RESEARCH METHOD COHEN ok

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

NORM-REFERENCED, CRITERION-REFERENCED AND DOMAIN-REFERENCED TESTS 415<br />

parameters of abilities are known. They assume the<br />

following (Morrison 1993):<br />

<br />

<br />

There is a normal curve of distribution of scores<br />

in the population: the bell-shaped symmetry<br />

of the Gaussian curve of distribution seen,<br />

for example, in standardized scores of IQ or<br />

the measurement of people’s height or the<br />

distribution of achievement on reading tests in<br />

the population as a whole.<br />

There are continuous and equal intervals<br />

between the test scores and, with tests<br />

that have a true zero (see Chapter 24), the<br />

opportunity for a score of, say, 80 per cent to be<br />

double that of 40 per cent; this differs from the<br />

ordinal scaling of rating scales discussed earlier<br />

in connection with questionnaire design where<br />

equal intervals between each score could not<br />

be assumed.<br />

Parametric tests will usually be published tests<br />

which are commercially available and which have<br />

been piloted and standardized on a large and<br />

representative sample of the whole population.<br />

They usually arrive complete with the backup<br />

data on sampling, reliability and validity statistics<br />

which have been computed in the devising of<br />

the tests. Working with these tests enables the<br />

researcher to use statistics applicable to interval<br />

and ratio levels of data.<br />

Non-parametric tests make few or no assumptions<br />

about the distribution of the population (the<br />

parameters of the scores) or the characteristics<br />

of that population. The tests do not assume a<br />

regular bell-shaped curve of distribution in the<br />

wider population; indeed the wider population<br />

is perhaps irrelevant as these tests are designed<br />

for a given specific population – a class in school,<br />

achemistrygroup,aprimaryschoolyeargroup.<br />

Because they make no assumptions about the wider<br />

population, the researcher must work with nonparametric<br />

statistics appropriate to nominal and<br />

ordinal levels of data. Parametric tests, with a<br />

true zero and marks awarded, are the stock-intrade<br />

of classroom teachers – the spelling test, the<br />

mathematics test, the end-of-year examination,<br />

the mock-examination.<br />

The attraction of non-parametric statistics is<br />

their utility for small samples because they do<br />

not make any assumptions about how normal,<br />

even and regular the distributions of scores<br />

will be. Furthermore, computation of statistics<br />

for non-parametric tests is less complicated<br />

than that for parametric tests. Non-parametric<br />

tests have the advantage of being tailored<br />

to particular institutional, departmental and<br />

individual circumstances. They offer teachers<br />

a valuable opportunity for quick, relevant and<br />

focused feedback on student performance.<br />

Parametric tests are more powerful than nonparametric<br />

tests because they not only derive<br />

from standardized scores but also enable the<br />

researcher to compare sub-populations with a<br />

whole population (e.g. to compare the results of<br />

one school or local education authority with the<br />

whole country, for instance in comparing students’<br />

performance in norm-referenced or criterionreferenced<br />

tests against a national average score<br />

in that same test). They enable the researcher<br />

to use powerful statistics in data processing (see<br />

Chapters 24–26), and to make inferences about<br />

the results. Because non-parametric tests make no<br />

assumptions about the wider population a different<br />

set of statistics is available to the researcher (see<br />

Chapter 24). These can be used in very specific<br />

situations – one class of students, one year group,<br />

one style of teaching, one curriculum area – and<br />

hence are valuable to teachers.<br />

Norm-referenced, criterion-referenced<br />

and domain-referenced tests<br />

A norm-referenced test compares students’ achievements<br />

relative to other students’ achievements,<br />

for example a national test of mathematical performance<br />

or a test of intelligence which has been<br />

standardized on a large and representative sample<br />

of students between the ages of 6 and 16. A<br />

criterion-referenced test does not compare student<br />

with student but, rather, requires the student to<br />

fulfil a given set of criteria, a predefined and absolute<br />

standard or outcome (Cunningham 1998).<br />

For example, a driving test is usually criterionreferenced<br />

since to pass it requires the ability to<br />

Chapter 19

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!