21.06.2015 Views

The mnemonic keyword method: The effects of bidirectional retrieval ...

The mnemonic keyword method: The effects of bidirectional retrieval ...

The mnemonic keyword method: The effects of bidirectional retrieval ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Learning and Instruction 17 (2007) 360e371<br />

www.elsevier.com/locate/learninstruc<br />

<strong>The</strong> <strong>mnemonic</strong> <strong>keyword</strong> <strong>method</strong>: <strong>The</strong> <strong>effects</strong> <strong>of</strong> <strong>bidirectional</strong><br />

<strong>retrieval</strong> training and <strong>of</strong> ability to image on foreign language<br />

vocabulary recall<br />

Mirella Wyra*, Michael J. Lawson, Njora Hungi<br />

School <strong>of</strong> Education, Flinders University, Adelaide, Australia<br />

Abstract<br />

<strong>The</strong> <strong>mnemonic</strong> <strong>keyword</strong> <strong>method</strong> is an effective technique for vocabulary acquisition. This study examines the <strong>effects</strong> on recall <strong>of</strong><br />

word-meaning pairs <strong>of</strong> (a) training in use <strong>of</strong> the <strong>keyword</strong> procedure at the time <strong>of</strong> <strong>retrieval</strong>; and (b) the influence <strong>of</strong> the self-rated<br />

ability to image. <strong>The</strong> performance <strong>of</strong> students trained in <strong>bidirectional</strong> <strong>retrieval</strong> using the <strong>keyword</strong> <strong>method</strong> to learn new Spanish<br />

words and their English definitions was compared to that <strong>of</strong> control group students who used the standard <strong>keyword</strong> procedure.<br />

Data on recall performance were gathered on five occasions and analysed using multilevel analysis procedures (HLM). <strong>The</strong> <strong>retrieval</strong><br />

training was a significant predictor <strong>of</strong> both backward and forward recall performance, as was the ability to make images.<br />

Ó 2007 Elsevier Ltd. All rights reserved.<br />

Keywords: Keyword <strong>method</strong>; Vocabulary learning; Retrieval; Foreign language learning; Second language learning; Imagery<br />

1. Introduction<br />

<strong>The</strong> <strong>keyword</strong> <strong>method</strong> has been shown to be an effective procedure for the acquisition <strong>of</strong> vocabulary in foreignlanguage<br />

(FL) learning (Atkinson & Raugh, 1975; Levin & Pressley, 1985). This <strong>method</strong> is one <strong>of</strong> a number <strong>of</strong> procedures<br />

that have proved useful for the task <strong>of</strong> acquiring definitions <strong>of</strong> new foreign-language words (Ellis & Beaton,<br />

1993; McDaniel & Pressley, 1984), particularly for immediate recall. For longer term retention findings related to the<br />

effect <strong>of</strong> use <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> are more mixed, with some research demonstrating growth in recall after an immediate<br />

decline (Lawson & Hogben, 1998) and other research showing decline in levels <strong>of</strong> recall (e.g., Avila & Sadowski,<br />

1996; Wang & Thomas, 1995). <strong>The</strong> mixed results seem to be associated with use <strong>of</strong> different experimental<br />

procedures and testing protocols. Although the beneficial <strong>effects</strong> <strong>of</strong> use <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> have been demonstrated<br />

for a range <strong>of</strong> materials and for learners <strong>of</strong> different ages, it is only recently that researchers have begun to<br />

investigate the operation <strong>of</strong> the internal components <strong>of</strong> the <strong>method</strong> (e.g., Crutcher & Ericsson, 2000). In this paper<br />

we continue the examination <strong>of</strong> the details <strong>of</strong> the <strong>method</strong>, focusing attention on two issues: (1) use <strong>of</strong> the <strong>method</strong><br />

at the time <strong>of</strong> <strong>retrieval</strong>; and (2) the influence on recall performance <strong>of</strong> individual differences in students’ self-reports<br />

<strong>of</strong> ability to form images.<br />

* Corresponding author. Fax: þ61 8 8201 3184.<br />

E-mail address: mirella.wyra@flinders.edu.au (M. Wyra).<br />

0959-4752/$ - see front matter Ó 2007 Elsevier Ltd. All rights reserved.<br />

doi:10.1016/j.learninstruc.2007.02.008


M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

361<br />

For those whose first language is English, the standard <strong>keyword</strong> <strong>method</strong> for acquiring new foreign-language vocabulary<br />

involves a two-stage process: generation <strong>of</strong> an English word, the <strong>keyword</strong>, that is similar in sound or appearance<br />

to the new FL word; and development <strong>of</strong> an interactive image involving that <strong>keyword</strong> and the definition <strong>of</strong> the<br />

new word. For example, to learn that apio (Spanish) means celery (English), a learner could choose the English word<br />

‘‘ape’’ as a <strong>keyword</strong>. <strong>The</strong>n the learner might create an interactive image <strong>of</strong> an ape carrying a huge bunch <strong>of</strong> celery. For<br />

<strong>retrieval</strong> <strong>of</strong> the definition when given the FL word, it is commonly suggested that the processes involved in the encoding<br />

procedure be adapted along the following lines: the FL word is examined for presence <strong>of</strong> a <strong>keyword</strong>; a search is<br />

made for images involving the <strong>keyword</strong>; and any image generated is scanned to identify the most prominent feature<br />

(Carney & Levin, 1998; Ellis & Beaton, 1993). Such <strong>retrieval</strong> is <strong>of</strong>ten referred to as receptive or forward recall. If the<br />

direction <strong>of</strong> recall was to be from the definition to the FL word (productive or backward recall), the <strong>retrieval</strong> procedure<br />

would need to be further adapted to access an image <strong>of</strong> the definition and then to identify a component <strong>of</strong> the image<br />

that would suggest a <strong>keyword</strong> that might then be used to cue <strong>retrieval</strong> <strong>of</strong> the FL word.<br />

1.1. <strong>The</strong> <strong>keyword</strong> <strong>method</strong> at time <strong>of</strong> <strong>retrieval</strong><br />

It is difficult to find evidence in reports <strong>of</strong> previous <strong>keyword</strong> <strong>method</strong> studies indicating that explicit <strong>retrieval</strong> training<br />

has been given to learners. In most reports on the <strong>keyword</strong> <strong>method</strong> it is the steps involved in encoding that are given<br />

most attention.<br />

Recently, Crutcher and Ericsson (2000, 2003) identified two models <strong>of</strong> <strong>retrieval</strong> that could be operative during<br />

recall <strong>of</strong> the definition <strong>of</strong> FL words that had been learned using the <strong>keyword</strong> <strong>method</strong>. In the direct access model <strong>of</strong><br />

<strong>retrieval</strong> it is proposed that <strong>retrieval</strong> occurs through accessing <strong>of</strong> a simple associative link between the FL word<br />

and the definition, without the involvement <strong>of</strong> the mediational <strong>keyword</strong>. In contrast, in the mediational model <strong>retrieval</strong><br />

results from accessing <strong>of</strong> the <strong>keyword</strong> mediator that links the FL word and its definition. Using SpanisheEnglish vocabulary<br />

items Crutcher and Ericsson (2000) found that the evidence available from accuracy, latency and verbal report<br />

data supported the mediational model <strong>of</strong> <strong>retrieval</strong> when moderate amounts <strong>of</strong> practice used in most <strong>keyword</strong><br />

research is provided. When the amount <strong>of</strong> practice given to the vocabulary pairs was substantially increased, the pattern<br />

<strong>of</strong> recall performance was more compatible with ‘‘a process in which the English equivalent is directly accessed<br />

from the Spanish word with no intermediate working memory steps’’ (p. 1312).<br />

<strong>The</strong> results <strong>of</strong> the Crutcher and Ericsson research provide support for the description <strong>of</strong> <strong>retrieval</strong> processes involving<br />

<strong>keyword</strong>s and images suggested by Carney and Levin (1998) and Ellis and Beaton (1993). Under typical <strong>keyword</strong><br />

training conditions, which do not involve large numbers <strong>of</strong> practice trials, the <strong>keyword</strong> mediators can be predicted to<br />

act as cues for <strong>retrieval</strong> <strong>of</strong> the definitions. <strong>The</strong> strength <strong>of</strong> this evidence suggested that it would be <strong>of</strong> interest to see if<br />

the impact <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> training could be enhanced by providing more attention to the use <strong>of</strong> the <strong>keyword</strong><br />

procedure at time <strong>of</strong> <strong>retrieval</strong>. To this end we set out to compare the <strong>effects</strong> <strong>of</strong> a standard <strong>keyword</strong> training condition<br />

with one in which students were given additional training in use <strong>of</strong> the <strong>keyword</strong> procedure at time <strong>of</strong> <strong>retrieval</strong>. On the<br />

basis <strong>of</strong> the evidence available from Crutcher and Ericsson (2000) we predicted that further <strong>retrieval</strong> training would<br />

increase the level <strong>of</strong> recall beyond that associated with the typical <strong>keyword</strong> <strong>method</strong> training in which the major<br />

emphasis was on the encoding procedure. Compared with the typical encoding training we expected that practice<br />

in adaptation <strong>of</strong> the <strong>keyword</strong> elements for <strong>retrieval</strong> would increase the influence <strong>of</strong> <strong>keyword</strong> mediators and that<br />

this would impact positively on recall performance.<br />

1.2. Ability to make images<br />

A second issue <strong>of</strong> concern in this paper was the influence <strong>of</strong> individual differences in students’ ability to make<br />

images in the <strong>keyword</strong> <strong>method</strong>. Imaging is a process in which individual differences are predicted to be influential.<br />

Kosslyn, Van Kleeck, and Kirby’s (1990) theory <strong>of</strong> processing subsystems represents visual imagery as involving<br />

high-level processes that use ‘‘stored information to reconstruct the appearance <strong>of</strong>, reinterpret, and possibly anticipate<br />

the consequences <strong>of</strong> transforming an object.’’ (p. 42). In this theory four imagery abilities are proposed: those involved<br />

in generation, transformation, inspection and maintenance <strong>of</strong> the image. Individual differences are predicted in each <strong>of</strong><br />

these abilities, arising from individual differences in the underlying processing subsystems. Kosslyn et al. (1990)<br />

argued that their experimental results provide strong support for the predictions generated from this model. Individual


362 M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

differences in visual imagery are also predicted within dual coding theory in which people are seen to vary in their<br />

‘‘tendency and capacity to use imagery’’ (Clark & Paivio, 1991, p. 156).<br />

In most research involving the <strong>keyword</strong> <strong>method</strong> consideration <strong>of</strong> individual differences in imagery has focussed on<br />

vividness <strong>of</strong> imagery (e.g., Baddeley & Andrade, 2000; Campos, 1995a,b, 1998; Campos & Perez, 1997; Marks,<br />

1973). Participants have typically been asked to rate the vividness <strong>of</strong> the images they generate in common imaging<br />

situation using the Vividness <strong>of</strong> Visual Imagery Questionnaire (VVIQ) developed by Marks (1973). <strong>The</strong> findings<br />

related to the influence on recall <strong>of</strong> individual differences in these ratings have been mixed, with positive, negative<br />

and close to zero correlations being reported between vividness ratings and recall (see Baddeley & Andrade,<br />

2000). <strong>The</strong> findings <strong>of</strong> Baddeley and Andrade (2000) suggest that this mixed pattern <strong>of</strong> associations between recall<br />

and vividness ratings does not arise because <strong>of</strong> the use <strong>of</strong> a self-rating procedure. Those researchers were able to<br />

get stable and reliable data from use <strong>of</strong> self-ratings. Of more concern to us is the narrowness <strong>of</strong> the processes being<br />

considered when vividness is the only dimension <strong>of</strong> imagery being rated. Because Kosslyn et al.’s (1990) model <strong>of</strong><br />

imagery involves more than just a generation process that might be impacted by vividness, it is important to consider<br />

more <strong>of</strong> the complexity <strong>of</strong> the imagery process when self-ratings are being made. In the <strong>keyword</strong> <strong>method</strong> the influence<br />

<strong>of</strong> individual differences in imagery processes should be operative in the generation <strong>of</strong> a vivid image, but should also<br />

affect the way that the <strong>keyword</strong> and definitional components <strong>of</strong> the image are related, or transformed. For instance, in<br />

<strong>keyword</strong> <strong>method</strong> instructions about imaging it is commonly suggested that the definitional component should be the<br />

most prominent feature in the image. If such instruction is influential we might predict that all four <strong>of</strong> Kosslyn et al.’s<br />

(1990) subsystems would be involved across a period <strong>of</strong> acquisition and recall. In this project we were interested to see<br />

whether a more general self-rating <strong>of</strong> ability to image would show a stronger relationship with recall performance than<br />

the ratings provided by use <strong>of</strong> the VVIQ.<br />

Thus, in this project we sought answers to two major questions:<br />

1. Does explicit training in use <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> for <strong>bidirectional</strong> <strong>retrieval</strong> show benefits for recall performance<br />

when compared to the standard <strong>keyword</strong> training?<br />

2. Does students’ self-rated ability to image show a significant relationship with <strong>bidirectional</strong> recall performance<br />

that is stronger than that associated with ratings on the VVIQ?<br />

Answers to these questions were sought for both directions <strong>of</strong> recall, when students were provided with the new<br />

Spanish word and were required to produce its English definition (forward), and when they were provided with the<br />

English definition and required to produce the Spanish word (backward). Although much research on the <strong>keyword</strong><br />

<strong>method</strong> focuses on forward recall, in practical language use both directions <strong>of</strong> recall are essential.<br />

A final feature <strong>of</strong> this study is that it was carried out in regular classroom situations, with students who were<br />

engaged in their normal Spanish lessons. Data were gathered over multiple occasions so that the trajectories <strong>of</strong> students’<br />

recall performance could be analysed using a multilevel modelling procedure that could accommodate the<br />

nested structure <strong>of</strong> the design.<br />

2. Method<br />

2.1. Participants<br />

Participants were 36 female and 41 male Year 6/7 students (age 11e12 years) in three public primary schools<br />

located in close proximity in the same suburb in Adelaide, South Australia. <strong>The</strong> students, who came from one class<br />

in each school, had been foreign-language learners since Year 2 and most had studied Spanish for five years. All<br />

students were taught by specialist Spanish teachers who were not familiar with the <strong>keyword</strong> <strong>method</strong>. By the end <strong>of</strong><br />

the study data were available from 71 students, the remaining six students having missed a key training session.<br />

2.2. Materials and procedure<br />

Before training all students completed two imaging questionnaires, the VVIQ (Marks, 1973) and the Ability to<br />

Make Images (AMI) questionnaire designed for this study. <strong>The</strong> AMI questionnaire consisted <strong>of</strong> three sections,<br />

with items concerned with quality <strong>of</strong> mental imagery, frequency <strong>of</strong> mental imagery, and participants’ perceptions


M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

363<br />

<strong>of</strong> themselves as imagers. <strong>The</strong> complete set <strong>of</strong> AMI items is shown in Appendix 1. Reliability analysis showed a Cronbach’s<br />

alpha <strong>of</strong> 0.89 with all items having high item-total correlations. To prepare students for the imaging questionnaires<br />

they listened to a fragment <strong>of</strong> Dahl’s (1982) book ‘‘<strong>The</strong> Twits’’ and were asked to think about what they ‘‘saw in<br />

their mind’’ when listening to the story. Students were not aware that VVIQ and AMI would follow.<br />

Based on paired rank classroom achievement scores students were randomly assigned to two groups, the standard<br />

<strong>keyword</strong> group and the <strong>keyword</strong> <strong>retrieval</strong> group. Students participated on six occasions (see Table 1). Occasion 0 provided<br />

training and practice in use <strong>of</strong> the <strong>keyword</strong> <strong>method</strong>. <strong>The</strong> next two sessions, Occasions 1 and 2, involved revision <strong>of</strong><br />

use <strong>of</strong> the <strong>method</strong>, then learning <strong>of</strong> the 22 SpanisheEnglish wordedefinition pairs at 20-s pace and then an immediate<br />

<strong>bidirectional</strong> recall test (Tests 1 and 2 in Table 1). This was designed to make the acquisition <strong>of</strong> word-meaning pairs as<br />

powerful as possible, within the limits <strong>of</strong> the time available. Occasions 0, 1 and 2 were conducted in small groups <strong>of</strong><br />

between five and eight students. <strong>The</strong> three delayed testing sessions (Occasions 3, 4 and 5) occurred once each week<br />

for the next 3 weeks. All sessions were <strong>of</strong> the same duration for both the standard group and the <strong>retrieval</strong> group.<br />

Twenty eight concrete two- and three-syllable nouns were chosen for this study. Six words were used as instruction/<br />

practice words, with the remaining 22 words being used as target words in the learning/testing part <strong>of</strong> this study. Care<br />

was taken to ensure that all the words were new for the students, first through consultation with teachers and then<br />

through pilot testing with students in a parallel Spanish program in a nearby school.<br />

<strong>The</strong> training instructions for students were prepared as booklets that were studied by students under the guidance <strong>of</strong><br />

the researcher. <strong>The</strong> scripts in the booklets were adapted from those described by Lawson and Hogben (1998). Using<br />

Lawson and Hogben’s (1998) elaborated version <strong>of</strong> encoding, three main conditions were imposed on the process <strong>of</strong><br />

linking the image <strong>of</strong> definition and the image <strong>of</strong> <strong>keyword</strong> into the interactive image. <strong>The</strong>se conditions were (1) there<br />

are only two images in the interactive pictures; (2) the image <strong>of</strong> the definition has to be bigger than the image <strong>of</strong> the<br />

<strong>keyword</strong> image; and (3) if possible, the interactive image should be funny or unusual.<br />

<strong>The</strong> scripts were supplemented with three posters that contained graphical illustration <strong>of</strong> the key steps in the<br />

encoding (IN) and in the two directions <strong>of</strong> <strong>retrieval</strong> (OUTf for forward recall training, OUTb for backward recall training)<br />

uses <strong>of</strong> the <strong>keyword</strong> procedure. <strong>The</strong> standard <strong>keyword</strong> group used only the IN poster and the <strong>keyword</strong> <strong>retrieval</strong><br />

group was shown all three posters. For both the standard and <strong>retrieval</strong> groups, the <strong>keyword</strong> generating and image<br />

generating steps were explained using the Spanish word camarera (waitress) and possible <strong>keyword</strong>s and interactive<br />

images, and desirable qualities <strong>of</strong> the images were discussed. Students in both groups practised immediately on two<br />

new words (jacal, grossella) and then standard group practised encoding with another three new words (tomada, mancha,<br />

cazador), with students in the standard group having time to discuss the possible <strong>keyword</strong>s and images throughout<br />

the training session. Students in the <strong>retrieval</strong> group were given additional training in how to use the <strong>keyword</strong><br />

procedure to retrieve bi-directionally (forward and backward). With the assistance <strong>of</strong> the ‘‘OUT’’ posters (OUTf<br />

and OUTb), they practised the <strong>retrieval</strong> <strong>of</strong> all the training words jacal and grossella) and practised encoding and <strong>retrieval</strong><br />

<strong>of</strong> three new words (tomada, mancha, cazador). For forward <strong>retrieval</strong> the OUTf poster identified a set <strong>of</strong> steps,<br />

starting with examination <strong>of</strong> the FL word for presence <strong>of</strong> a <strong>keyword</strong>, making a search for images involving the <strong>keyword</strong><br />

that had the characteristics emphasised as being desirable during the <strong>keyword</strong> <strong>method</strong> training, and scanning the<br />

image generated to identify the most prominent component <strong>of</strong> the image, which is expected to be the definition. In the<br />

case <strong>of</strong> backward <strong>retrieval</strong> the OUTb poster steps began with a search for any images <strong>of</strong> the definition. <strong>The</strong> next steps<br />

Table 1<br />

Training, learning and testing procedures<br />

OCCASION STANDARD RETRIEVAL<br />

0 Keyword <strong>method</strong> training (encoding) Keyword <strong>method</strong> training (encoding)<br />

Bidirectional <strong>retrieval</strong> training<br />

1 Recap <strong>of</strong> first session (occasion 0) Recap <strong>of</strong> first session (occasion 0)<br />

Learning 22 SpanisheEnglish wordedefinition pairs<br />

Learning 22 SpanisheEnglish wordedefinition pairs<br />

Test 1 Test 1<br />

2 Learning words again Learning words again<br />

Test 2 Test 2<br />

3 Test 3 Test 3<br />

4 Test 4 Test 4<br />

5 Test 5 Test 5


364 M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

involved looking for interactive images that had the characteristics emphasised as being desirable during the <strong>keyword</strong><br />

<strong>method</strong> training, selection <strong>of</strong> the less prominent component <strong>of</strong> the image that represented the <strong>keyword</strong>, and generation<br />

and selection <strong>of</strong> the FL word. All training materials were retrieved by the researcher at the completion <strong>of</strong> each training<br />

session.<br />

In the next session, immediately after a discussion and a revision <strong>of</strong> the procedures used in the first training session,<br />

students in both groups learned 22 new pairs <strong>of</strong> Spanish word e English definitions using the provided <strong>keyword</strong>s. <strong>The</strong><br />

target word content was presented in booklets, with one Spanish word e English definition and the English <strong>keyword</strong><br />

per page, e.g. camarera, waitress, <strong>keyword</strong> e camera. This learning session was paced by the researcher so that<br />

students had 20 s to study each page. At the end <strong>of</strong> the period <strong>of</strong> learning students were given the written immediate<br />

recall test, following a 5-min period <strong>of</strong> free talk that was not related to the <strong>keyword</strong> <strong>method</strong>. <strong>The</strong> order <strong>of</strong> words on the<br />

immediate recall test was randomized, as was the selection <strong>of</strong> words for forward (FWR) and backward recall (BWR),<br />

with the restriction that equal number <strong>of</strong> two- and three-syllable words were in both FWR and BWR sections <strong>of</strong> the<br />

test. <strong>The</strong> recall tests on Testing Occasions 1 and 2 were conducted in the small groups that were used during training<br />

and learning. For Testing Occasions 3, 4 and 5 testing was done in the students’ classrooms as whole classes, due to the<br />

need to minimize disruption <strong>of</strong> school days and to simulate the regular classroom environment. <strong>The</strong> order <strong>of</strong> words<br />

was the same on all tests. <strong>The</strong> results <strong>of</strong> testing were not made available to students until after the final testing occasion.<br />

<strong>The</strong> students’ responses on both forward and backward recall were scored on a 0e1 scale. A zero was scored for<br />

missing or incorrect responses, a score <strong>of</strong> 1 was given for conservatively judged correct Spanish words (in backward<br />

recall) and definitions (in forward recall).<br />

2.3. Multilevel analysis<br />

When dealing with multilevel data such as the data in this study, the appropriate procedure is to formulate multilevel<br />

models, ‘‘which enable the testing <strong>of</strong> hypotheses about <strong>effects</strong> occurring within each level and the interrelations<br />

among them’’ (Raudenbush & Bryk, 1994, p. 2590). Consequently, in this study, two-level models were hypothesized<br />

to enable the testing <strong>of</strong> hypotheses about the factors influencing forward recall (FWR) and backward recall (BWR).<br />

<strong>The</strong> hierarchical linear modelling program (HLM) (Raudenbush, Bryk, Cheong, & Congdon, 2000) was used for this<br />

analysis. As a <strong>method</strong> for analysis <strong>of</strong> growth in nested designs, HLM is argued by Bryk and Raudenbush (1992, p. 133)<br />

to provide significant advantages over a technique such as repeated measures analysis. HLM provides explicit modelling<br />

<strong>of</strong> student growth that can describe the pattern <strong>of</strong> both within-student change and the <strong>effects</strong> on that <strong>of</strong> betweenstudent<br />

characteristics such as teacher or school level variables. In addition, HLM can consider the interactive <strong>effects</strong><br />

across these levels (Bryk & Raudenbush, 1992). <strong>The</strong> multilevel analysis is argued by Bryk and Raudenbush to overcome<br />

the problems associated with conflation <strong>of</strong> individual and group <strong>effects</strong>. Multilevel models are also less restrictive<br />

in their assumptions than the more conventional repeated measures analysis <strong>of</strong> variance procedures. Of particular<br />

moment for longitudinal classroom research, where student attendance is <strong>of</strong>ten variable, is the flexibility <strong>of</strong> HLM in its<br />

treatment <strong>of</strong> the spacing <strong>of</strong> observations and <strong>of</strong> missing data.<br />

Two separate models were specified, one for FWR and the other for BWR. It was initially hypothesized that student<br />

recall was influenced by within-student factors (occasion <strong>of</strong> testing and learning) and between-student factors such as<br />

gender, ability to make images and treatment (see Table 2). <strong>The</strong> models proposed can be denoted in equation form as<br />

follows.<br />

Level-1 model<br />

½YŠ ij<br />

¼ b 0j þ b 1j ðOCCASIONÞ 1ij<br />

þ b 2j ðLEARNINGÞ 2ij<br />

þ r ij<br />

Level-2 model<br />

b 0j ¼ g 00 þ g 0gj W 0gj þ u 0j<br />

b 1j ¼ g 10 þ g 1gj W 1gj þ u 1j<br />

b 2j ¼ g 20 þ g 2gj W 2gj þ u 2j<br />

ð1Þ<br />

where Y ij is the FWR (or BWR) score pupil i at occasion j.


M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

365<br />

Table 2<br />

Variables tested on each level <strong>of</strong> the hierarchy<br />

Level Variable code Variable description<br />

Within-student OCCASION Testing occasions (0 ¼ occasion 1,., 4¼ occasion 5)<br />

LEARNING Dummy variable for learning occasion (1 ¼ occasion 2; 0 ¼ occasions 1, 3, 4 and 5)<br />

Between-student GENDER Sex <strong>of</strong> the student (0 ¼ male; 1 ¼ female)<br />

YRLEVEL Year level (0 ¼ Year 6; 1 ¼ Year 7)<br />

ACHIEV<br />

Teachers ranking <strong>of</strong> students’ academic achievement in class<br />

(1 ¼ highest achiever,., 29¼ lowest achiever)<br />

TREATMNT Treatment (0 ¼ control group; 1 ¼ experimental group)<br />

VVIQ<br />

Total score for the VVIQ questionnaire i.e. vividness <strong>of</strong> images (29 ¼ lowest,., 80¼ highest)<br />

AMI<br />

Total score for AMI questionnaire i.e. ability to make images (21 ¼ lowest,., 60¼ highest)<br />

SCHOOL1 Dummy variable for School 1 (1 ¼ School 1; 0 ¼ Schools 2 & 3)<br />

SCHOOL2 Dummy variable for School 2 (1 ¼ School 2; 0 ¼ Schools 1 & 3)<br />

SCHOOL3 Dummy variable for School 3 (1 ¼ School 3; 0 ¼ Schools 1 & 2)<br />

For parsimony, W gj in Eq. (1) represents the control for several relevant level-2 variables (W 1j þ W 2j þ . þ W gj )<br />

that describe the student characteristics, treatment and school attended by the student. Thus, W gj represents a combination<br />

<strong>of</strong> any <strong>of</strong> the nine level-2 variables listed in Table 2. <strong>The</strong> ‘u’ is the error term<br />

<strong>The</strong> first step in the multilevel analyses was to run the so-called ‘null model’ in order to estimate the amounts <strong>of</strong><br />

variance available to be explained at each level <strong>of</strong> the hierarchy (Raudenbush & Bryk, 2002). A null model contained<br />

only the dependent variable (FWR or BWR) and no predictor variables were specified at any level. <strong>The</strong> second step<br />

was to build up the Level-1 model, that is, the within-student model. This involved adding level-1 predictors to the<br />

model, but without entering predictors at the second level. An approach referred to as a ‘step-up’ approach was followed<br />

to examine which <strong>of</strong> the level-1 variables (listed in Table 2 above) had a significant influence on FWR or BWR<br />

in each <strong>of</strong> the hypothesized models. Bryk and Raudenbush (1992) have recommended the step-up approach for inclusion<br />

<strong>of</strong> variables into the model to the alternative approach referred as ‘working-backward’ where all the possible predictors<br />

are included in the model and then the non-significant variables are progressively eliminated from the model.<br />

<strong>The</strong> final step in the multilevel analyses involved building up the model to the second level through adding the<br />

significant level-2 predictor variables into the model using the step-up strategy. <strong>The</strong> final model for FWR at levels<br />

1 and 2 was<br />

Level-1 model<br />

½FWRŠ ij<br />

¼ b 0j þ b 1j ðOCCASIONÞ 1ij<br />

þ b 2j ðLEARNINGÞ 2ij<br />

þ r ij<br />

Level-2 model<br />

b 0j ¼ g 00 þ g 01 ðYRLEVELÞ 01j<br />

þ g 02 ðTREATMNTÞ 02j<br />

þ g 03 ðAMIÞ 03j<br />

þ u 0j<br />

b 1j ¼ g 10 þ g 11 ðSCHOOL2Þ 11j<br />

þ u 1j<br />

b 2j ¼ g 20 þ u 2j<br />

<strong>The</strong> final model for BWR at Levels 1 and 2 was<br />

Level-1 model<br />

ð2Þ<br />

½BWRŠ ij<br />

¼ b 0j þ b 1j ðOCCASIONÞ 1ij<br />

þ b 2j ðLEARNINGÞ 2ij<br />

þ r ij<br />

Level-2 model<br />

b 0j ¼ g 00 þ g 01 ðYRLEVELÞ 01j<br />

þ g 02 ðTREATMNTÞ 02j<br />

þ g 03 ðAMIÞ 03j<br />

þ u 0j<br />

b 1j ¼ g 10 þ g 11 ðTREATMNTÞ 11j<br />

þ g 12 ðSCHOOL2Þ 12j<br />

þ u 1j<br />

b 2j ¼ g 20 þ u 2j<br />

ð3Þ


366 M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

<strong>The</strong>re are two issues worth noting regarding the analyses described above. First, the level-1 predictor variables<br />

(OCCASION and LEARNING) were grand-mean-centred in the HLM analyses, so that the intercept term would represent<br />

the average score for the students. Second, for both FWR and BWR, the regression coefficient <strong>of</strong> level-1 variables<br />

were left to vary across occasions because their reliability estimates were above the 0.05 value recommended by<br />

Raudenbush and Bryk (2002, p. 125) based on their extensive experience.<br />

3. Results<br />

Descriptive statistics for each group across the five testing occasions are shown in Table 3. <strong>The</strong> pattern <strong>of</strong> recall performance<br />

is similar for both groups in both recall directions. After Test 2 there is a small decline followed by a leveling<br />

<strong>of</strong>f or recovery. <strong>The</strong>re is, however, a noticeable difference in the level <strong>of</strong> recall between the two groups. At Test 5 for<br />

forward recall the <strong>retrieval</strong> group recalled about 70% <strong>of</strong> the meanings <strong>of</strong> the 11 target words, while the level for the<br />

standard group was about 50%. For backward recall the corresponding figures were about 40% and 19%.<br />

Estimates <strong>of</strong> fixed <strong>effects</strong> from the two-level models for FWR and BWR have been given in Table 4 for null, Level-<br />

1 and final models. <strong>The</strong> descriptive statistics <strong>of</strong> the variables included in the final models have been given at the bottom<br />

<strong>of</strong> Table 4. <strong>The</strong> results <strong>of</strong> the final estimation <strong>of</strong> variance components for the final FWR and BWR models and the<br />

results <strong>of</strong> the analyses <strong>of</strong> the variance components obtained from the corresponding null models are presented in Table 4,<br />

in rows ‘a’ and ‘b’, respectively. From the information in Table 4 rows ‘a’ and ‘b’, the information presented in<br />

rows ‘c’ to ‘f’ were calculated. A discussion <strong>of</strong> the calculations involved here is to be found in Raudenbush and<br />

Bryk (2002, pp. 69e95). <strong>The</strong> results in Tables 4 and 5 are discussed next in two sub-sections.<br />

3.1. Fixed <strong>effects</strong><br />

For the null model, the results in Tables 4 and 5 show that the predicted mean FWR score for the students is 6.16<br />

with a variance <strong>of</strong> 10.47. In forward recall, most students were expected to remember between about three and nine<br />

words. <strong>The</strong> corresponding expectation for backward recall was between one and five words. Thus, students were much<br />

better in forward recall than in backward recall, which is consistent with expectation and findings from other studies<br />

(Ellis & Beaton, 1993).<br />

For the level-1 and final model, the results in Table 4 show that both forward and backward recall were positively<br />

influenced by occasion <strong>of</strong> testing (OCCASION) and learning (LEARNING). For example, based on the level-1 model,<br />

the results in Table 4 show that, on average, recall increased at a rate <strong>of</strong> 0.56 and 0.49 words per occasion for FWR and<br />

BWR, respectively. In addition, the results indicate that recall was significantly better immediately after learning<br />

regardless <strong>of</strong> the direction <strong>of</strong> recall. On average, for the occasion immediately after learning, recall was better by<br />

1.11 and 1.65 words for FWR and BWR, respectively, compared to recall at the non-learning occasions.<br />

In addition, for both FWR and BWR final models, the results in Table 4 show that recall was significantly influenced<br />

by Year level (YRLEVEL), treatment (TREATMNT) and ability to make images (AMI). On average, Year 7 students<br />

outperformed Year 6 students by 1.56 and 0.89 words for forward and backward recall, respectively, while the experimental<br />

group outperformed the control group by 1.68 and 1.83 words for forward and backward recall, respectively.<br />

<strong>The</strong> significant effect <strong>of</strong> treatment indicated that there was a substantial benefit associated with the <strong>retrieval</strong> training.<br />

<strong>The</strong> impact <strong>of</strong> the treatment on forward recall was to move the mean for the group from 40.7% to 56%. <strong>The</strong><br />

Table 3<br />

Descriptive statistics<br />

Test 1 Test 2 Test 3 Test 4 Test 5<br />

Mean SD Mean SD Mean SD Mean SD Mean SD<br />

Forward recall score (FWR)<br />

Standard group 3.24 2.92 5.73 3.40 5.12 3.39 5.12 3.46 5.54 3.43<br />

Retrieval group 5.80 3.17 7.31 3.61 7.69 3.36 8.06 3.59 7.92 3.64<br />

Backward recall score (BWR)<br />

Standard group 0.76 1.23 3.12 2.25 1.79 1.90 2.00 2.11 2.11 2.03<br />

Retrieval group 2.23 1.99 5.00 2.61 4.39 3.15 5.09 3.11 4.39 2.43


M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

367<br />

Table 4<br />

Final estimation <strong>of</strong> fixed <strong>effects</strong> for the FWR and BWR models<br />

Variable Null model Level-1 model Final model<br />

Coefficient SE P-value Coefficient SE P-value Coefficient SE P-value<br />

FWR<br />

Level-2 Intercept, g 00 6.16 0.39 0.00 6.16 0.39 0.00 4.48 0.46 0.00<br />

(Between-student) YRLEVEL, g 01 1.56 0.66 0.02<br />

TREATMNT, g 02 1.68 0.63 0.01<br />

AMI, g 03 0.12 0.03 0.00<br />

Level-1 OCCASION, g 10 0.56 0.08 0.00 0.43 0.08 0.00<br />

(Within student) Interaction with SCHOOL2, g 11 0.36 0.15 0.02<br />

LEARNING, g 20 1.11 0.22 0.00 1.11 0.22 0.00<br />

BWR<br />

Level-2 Intercept, g 00 3.07 0.28 0.00 3.07 0.28 0.00 1.67 0.25 0.00<br />

(Between-student) YRLEVEL, g 01 0.89 0.39 0.02<br />

TREATMNT, g 02 1.83 0.43 0.00<br />

AMI, g 03 0.06 0.02 0.01<br />

Level-1 OCCASION, g 10 0.49 0.06 0.00 0.25 0.06 0.00<br />

(Within-student) Interaction with TREATMNT, g 11 0.26 0.10 0.01<br />

Interaction with SCHOOL2, g 12 0.30 0.08 0.00<br />

LEARNING, g 20 1.65 0.17 0.00 1.65 0.17 0.00<br />

Variable Level-1 descriptive statistics Level-2 descriptive statistics<br />

N Mean SD Min. Max. Variable J Mean SD Min. Max.<br />

FWR 344 6.18 3.67 0 11 YRLEVEL 71 0.54 0.50 0 1<br />

BWR 344 3.11 2.74 0 11 TREATMNT 71 0.51 0.50 0 1<br />

OCCASION 355 2.00 1.42 0 4 AMI 71 44.82 8.99 21 60<br />

LEARNING 355 0.20 0.40 0 1 SCHOOL2 71 0.37 0.49 0 1<br />

corresponding figures for backward recall were 15.2% and 31.8%. <strong>The</strong> effect for backward recall is <strong>of</strong> particular note<br />

given the very low level <strong>of</strong> recall for non-treatment students in this study and the levels <strong>of</strong> backward recall that has<br />

been found in other research (Ellis & Beaton, 1993). In addition, students with higher AMI scores outperformed their<br />

counterparts in both forward and backward recall. <strong>The</strong> ability to make images was associated with slightly improved<br />

recall, the effect being stronger for forward than backward recall.<br />

<strong>The</strong> results in Table 4 also show significant cross-level interaction <strong>effects</strong> (a) between OCCASION and SCHOOL2<br />

for both FWR and BWR, and (b) between OCCASION and TREATMNT for BWR. Fig. 1 is a graphical representation<br />

<strong>of</strong> the interaction effect between OCCASION and TREATMNT for backward recall. <strong>The</strong> co-ordinates for this graph<br />

were calculated using the procedure described by Aiken and West (1996) and Lietz (1996). Fig. 1 shows that, when<br />

other factors are equal, time had more impact on backward recall <strong>of</strong> the experimental group than on backward recall <strong>of</strong><br />

the control group. Thus, the experimental group was on average better than the control group in backward recall and<br />

also improved at a greater rate in backward recall than the control group.<br />

Table 5<br />

Variance explained in the final FWR and BWR models<br />

FWR<br />

BWR<br />

Within Between Total Within Between Total<br />

(a) Null model 2.95 10.47 13.43 2.39 5.01 7.40<br />

(b) Final model 1.27 7.22 1.43 3.35<br />

(c) Variance available 22.0% 78.0% 32.3% 67.7%<br />

(d) Variance explained 57.0% 31.1% 40.2% 33.1%<br />

(e) Total variance explained 12.5% 24.3% 36.8% 13.0% 22.5% 35.4%<br />

(f) Variance left unexplained 9.5% 53.7% 63.2% 19.3% 45.3% 64.6%


368 M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

13<br />

12<br />

Experimental Group<br />

Backward recall (BWR) score<br />

11<br />

10<br />

9<br />

8<br />

7<br />

6<br />

5<br />

Control Group<br />

4<br />

3<br />

Early Occasion<br />

Occasion<br />

Later Occasion<br />

Fig. 1. Impact <strong>of</strong> the interaction effect <strong>of</strong> occasion with treatment on backward recall.<br />

3.2. Variance partitioning and variance explained<br />

<strong>The</strong> results in Table 5 show that the percentages <strong>of</strong> variances available at levels 1 and 2 were 22.0 and 78.0,<br />

respectively, for FWR, and 32.3 and 67.7, respectively, for BWR. <strong>The</strong>se percentages <strong>of</strong> variance <strong>of</strong> scores at the various<br />

levels <strong>of</strong> the hierarchy are the maximum amounts <strong>of</strong> variance available at those levels that could be explained in<br />

subsequent analyses. Thus, there was markedly more variance to be explained at the between-student level (the treatment<br />

level) than at the within-student level.<br />

As it can be seen from the results in Table 5, the predictors included in the final FWR model explained 57.0% <strong>of</strong><br />

22.0% variance available at the within-student level and that is equal to 12.5% <strong>of</strong> the total variance explained at the<br />

within-student level. Similarly, the predictors included in the final FWR model explained 24.3% (that is, 31.1% <strong>of</strong><br />

78.0%) at the between-student level. Thus, the total variance explained by the predictors included in the final FWR<br />

model was 12.5 þ 24.3 ¼ 36.8%, which left 63.2% <strong>of</strong> the total variance unexplained in this model. Likewise, the predictors<br />

included in the final BWR model explained 35.4% <strong>of</strong> the variance, which left 64.6% unexplained. <strong>The</strong> large<br />

amounts <strong>of</strong> variance left unexplained indicate that there are other important within student, between-student and probably<br />

class or school level factors influencing recall that were not included in the models developed in this study.<br />

4. Discussion<br />

In this study we set out to examine <strong>effects</strong> associated with two internal components <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> as it has<br />

been used for FL vocabulary acquisition: explicit training in use <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> for <strong>retrieval</strong>, and students’<br />

self-rated ability to image. <strong>The</strong> <strong>effects</strong> <strong>of</strong> these variables were examined on forward and backward recall.<br />

<strong>The</strong> results indicate that training in the use <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> at the time <strong>of</strong> <strong>retrieval</strong> did <strong>of</strong>fer an educationally<br />

meaningful advantage for recall performance. This finding suggests that future training involving the <strong>keyword</strong> <strong>method</strong><br />

should include explicit attention to the use <strong>of</strong> the procedure for both encoding and <strong>retrieval</strong>.<br />

In addition, the rating for ability to make images made a small but significant contribution to recall performance.<br />

<strong>The</strong>re is, therefore, a basis for further investigation <strong>of</strong> the influence <strong>of</strong> training in making images on recall in the <strong>keyword</strong><br />

<strong>method</strong> studies.<br />

<strong>The</strong> results show evidence <strong>of</strong> growth in students’ vocabulary. Recall scores for both forward and backward recall<br />

increased across the recall testing occasions. <strong>The</strong>re was a small positive impact across occasion for both directions <strong>of</strong><br />

recall. This finding parallels the results observed by Lawson and Hogben (1998) in their study with high school


M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

369<br />

students learning Italian. As was the case in that study the positive impact <strong>of</strong> learning is strongest for recall following<br />

the initial learning occasions. <strong>The</strong> positive slope across the multiple occasions used in the current study is noteworthy<br />

given other research described earlier that has shown decline in recall when a smaller number <strong>of</strong> recall occasions has<br />

been used.<br />

<strong>The</strong>re was a significant impact <strong>of</strong> the <strong>retrieval</strong> training on recall score compared to that associated with the standard<br />

<strong>keyword</strong> training. Again the size <strong>of</strong> the benefit from <strong>retrieval</strong> training was as <strong>of</strong> practical significance. When compared<br />

with the traditional <strong>keyword</strong> training, reports <strong>of</strong> which give most emphasis to encoding, the relatively simple explicit<br />

training in use <strong>of</strong> the <strong>keyword</strong> procedure for <strong>retrieval</strong> was associated with successful recall <strong>of</strong> not only definitions<br />

(FWR) but also with successful recall <strong>of</strong> Spanish words (BWR). <strong>The</strong> fact that the impact <strong>of</strong> the <strong>retrieval</strong> training<br />

occurred for both forward and backward recall is also important because it indicates that students gained benefit<br />

from this additional training for both receptive and productive vocabulary use.<br />

<strong>The</strong> difference in level <strong>of</strong> recall in forward and backward directions mirrors that found in previous research (Ellis &<br />

Beaton, 1993). Ellis and Beaton (1993) suggest that backward recall is more complex because there are ‘‘many more<br />

competing active paths in production [backward recall] than in reception [forward recall].’’ (p. 549). It is not clear<br />

from their description what these authors meant by ‘active paths’. Examination <strong>of</strong> the descriptions <strong>of</strong> <strong>retrieval</strong> processes<br />

involved in forward and backward directions suggests some ways in which the demands on the learner could<br />

be different in the two directions. We base this argument on the analysis <strong>of</strong> the forward and backward <strong>retrieval</strong> processes<br />

shown in Fig. 2. We think that the number <strong>of</strong> associations is relatively similar in both directions <strong>of</strong> recall. <strong>The</strong><br />

learner has to retrieve an image <strong>of</strong> the definition (in backward recall, see Fig. 2 step 1B) or an image <strong>of</strong> the <strong>keyword</strong> (in<br />

forward recall, see Fig. 2 step 2F). Each task involving native language words would potentially stimulate comparable<br />

number <strong>of</strong> activations. Alternatively, Ellis and Beaton suggest that the strictness <strong>of</strong> scoring <strong>of</strong> acceptable responses is<br />

likely to act to disadvantage recall performance in the backward direction.<br />

In Fig. 2 the central row <strong>of</strong> boxes identifies phases <strong>of</strong> representation and the arrows in both directions indicate broad<br />

processes that bring about changes in representation. Thus, as described in the forward path legend at the top <strong>of</strong> the<br />

figure, process 1F involves scanning <strong>of</strong> the foreign word to search for and select the <strong>keyword</strong> that was used during<br />

encoding. Other process descriptions in the forward direction and those in the backward direction are given in the<br />

respective legends in the Fig. 2.<br />

<strong>The</strong> model set out in Fig. 2 suggests that the number <strong>of</strong> competing active paths would be the same for the forward<br />

and backward recall. In FWR demand on the student in terms <strong>of</strong> the number <strong>of</strong> possible paths that could be activated<br />

would seem to be greatest in steps 2F, 3F and 5F. For BWR, steps 1B, 2B and 5B would also involve high demand<br />

FORWARD PATH<br />

1F – scanning FW for <strong>keyword</strong> (KW), generating and/or searching, selecting<br />

2F – searching for pictorial image/s <strong>of</strong> KW<br />

3F – searching for interaction (as per encoding and <strong>retrieval</strong> training conditions), generating and/or searching, selecting<br />

4F – selecting the pictorial image <strong>of</strong> definition from the (as per encoding and <strong>retrieval</strong> training conditions)<br />

5F – interpretation <strong>of</strong> image to recall the definition<br />

1F 2F 3F 4F 5F<br />

FOREIGN WORD<br />

KW<br />

IMAGE<br />

OF KW<br />

IMAGE<br />

OF<br />

INTERA<br />

CTION<br />

IMAGE<br />

OF<br />

DEFINITION<br />

DEFINITION<br />

apio<br />

5B<br />

ape<br />

4B<br />

3B 2B 1B<br />

celery<br />

BACKWARD PATH<br />

1B – searching for pictorial image <strong>of</strong> definition, generating and/or searching, selecting<br />

2B – searching for image <strong>of</strong> interaction (as per encoding and <strong>retrieval</strong> training conditions)<br />

3B – selecting the pictorial image <strong>of</strong> <strong>keyword</strong> (KW) (as per encoding and <strong>retrieval</strong> training conditions)<br />

4B – interpretation <strong>of</strong> the image <strong>of</strong> KW to recall the image <strong>of</strong> the spelling <strong>of</strong> KW<br />

5B – searching for or generating, selecting the foreign word<br />

Fig. 2. Retrieval phases and broad processes involved in forward and backward directions.


370 M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

associated with making a selection from a potentially large number <strong>of</strong> alternatives. However, in the typical <strong>keyword</strong><br />

situation, where recall is required across relatively short periods <strong>of</strong> time, the number <strong>of</strong> possibilities that need to be<br />

scanned in steps 2F, 3F, 5F, 1B and 2B would seem to be reduced. This reduction is argued to arise from the fact that<br />

the interactive images have been set up recently in training and so may be in a relatively high state <strong>of</strong> activation. <strong>The</strong><br />

same argument does not appear to be applicable to step 5B. When the students get to this point they must use the<br />

<strong>keyword</strong> to scan their FL lexicon. Likely <strong>of</strong> most importance at this step for novice learners <strong>of</strong> the FL is that there<br />

are relatively few words that are strongly established in the FL lexicon, so that the <strong>keyword</strong> does not act as a strong<br />

cue for identification <strong>of</strong> the FL word. This analysis provides an alternative to that <strong>of</strong> Ellis and Beaton (1993) who<br />

suggested that there is a greater number <strong>of</strong> associations activated by the native language definition relative to the<br />

number activated by presentation <strong>of</strong> a foreign-language word so that a sort <strong>of</strong> fan effect (Anderson, 2000) weakens<br />

the backward link. Of course both analyses are speculative at the moment and need to be examined in further<br />

research.<br />

<strong>The</strong> second major point <strong>of</strong> interest involved the impact <strong>of</strong> learners’ ability to form images on recall in the <strong>keyword</strong><br />

<strong>method</strong>. <strong>The</strong> Ability to Make Images questionnaire designed for this study has been used to measure the ability to<br />

make images in a broader sense than the frequently used VVIQ and it has been found that students who have high<br />

rated ability to make images outperformed the students with low rated ability to make images in both forward and<br />

backward recalls. Although the size <strong>of</strong> the effect associated with AMI rating is relatively small, it is <strong>of</strong> interest<br />

here that the more general rating <strong>of</strong> ability to image emerged as a stronger predictor than did the more commonly<br />

used VVIQ.<br />

<strong>The</strong> results observed here add to the research findings that show that the <strong>keyword</strong> <strong>method</strong> is a useful procedure for<br />

acquisition <strong>of</strong> vocabulary in FL learning. In particular the findings associated with the treatment effect observed here<br />

suggest that if <strong>keyword</strong> <strong>method</strong> training does not give emphasis to use <strong>of</strong> the procedure at time <strong>of</strong> <strong>retrieval</strong> it will not<br />

maximise its impact on both forward and backward recall.<br />

Acknowledgements<br />

This research was supported by a grant from the Australian Research Council. We acknowledge the cooperation <strong>of</strong><br />

the principals, teachers and students <strong>of</strong> the schools who participated in the project.<br />

Appendix 1. <strong>The</strong> ability to make images questionnaire (AMI)<br />

Image quality Not at all Vaguely Some <strong>of</strong> it Most <strong>of</strong> it Everything as<br />

in a movie<br />

1 2 3 4 5<br />

When someone tells you or reads to you a story do<br />

you see in your mind what happens in the story?<br />

When you read a story do you see in your mind<br />

what happens in the story?<br />

Imaging frequency Never Rarely Sometimes Often Always<br />

1 2 3 4 5<br />

When someone tells you or reads to you a story<br />

do you see in your mind what happens<br />

in the story?<br />

When you read a story do you see in your<br />

mind what happens in the story?<br />

Do you like to imagine when reading?<br />

Do you like to imagine when listening?<br />

Are you a dreamer?<br />

Do you like to imagine when watching?<br />

Do you like to imagine when doing<br />

nothing/daydreaming?


M. Wyra et al. / Learning and Instruction 17 (2007) 360e371<br />

371<br />

Appendix 1 (continued)<br />

Imaging performance Not at all Not so good Reasonably good Good Very good<br />

1 2 3 4 5<br />

Are you good at imagining things/objects/animals?<br />

Are you good at imagining people?<br />

Are you good at imagining actions/what<br />

happens/what people do?<br />

References<br />

Anderson, J. R. (2000). Cognitive psychology and its implication (5th ed.). New York: Worth.<br />

Aiken, L. S., & West, S. G. (1996). Multiple regression: Testing and interpreting interactions. Newbury Park, CA: Sage Publications.<br />

Atkinson, R. C., & Raugh, M. R. (1975). An application <strong>of</strong> the <strong>mnemonic</strong> <strong>keyword</strong> <strong>method</strong> to the acquisition <strong>of</strong> a Russian vocabulary. Journal <strong>of</strong><br />

Experimental Psychology, 104, 126e133.<br />

Avila, E., & Sadowski, M. (1996). Exploring new applications <strong>of</strong> the <strong>keyword</strong> <strong>method</strong> to acquire English vocabulary. Language Learning, 46(3),<br />

379e395.<br />

Baddeley, A. D., & Andrade, J. (2000). Working memory and the vividness <strong>of</strong> imagery. Journal <strong>of</strong> Experimental Psychology: General, 129(1),<br />

126e145.<br />

Bryk, A. S., & Raudenbush, S. W. (1992). Hierarchical linear models: Applications and data analysis <strong>method</strong>s. Newbury Park: Sage Publications.<br />

Campos, A. (1995a). Imagery, concreteness, emotionality, meaningfulness, and pleasantness <strong>of</strong> words. Perceptual & Motor Skills, 80(3, Pt 1),<br />

867e880.<br />

Campos, A. (1995b). Twenty-two years <strong>of</strong> the VVIQ. Journal <strong>of</strong> Mental Imagery, 19(3e4), 129e131.<br />

Campos, A. (1998). A measure <strong>of</strong> visual imaging capacity: a preliminary study. Perceptual & Motor Skills, 87(3, Pt 1), 1012e1014.<br />

Campos, A., & Perez, M. J. (1997). Mnemonic images and associated pair recall. Journal <strong>of</strong> Mental Imagery, 21(3e4), 73e82.<br />

Carney, R. N., & Levin, J. R. (1998). Do <strong>mnemonic</strong> memories fade as time goes by? Here’s looking anew! Contemporary Educational Psychology,<br />

23, 276e297.<br />

Clark, J. M., & Paivio, A. (1991). Dual coding theory and education. Educational Psychology Review, 3(3), 149e210.<br />

Crutcher, R. J., & Ericsson, K. A. (2000). <strong>The</strong> role <strong>of</strong> mediators in memory <strong>retrieval</strong> as a function <strong>of</strong> practice: controlled mediation to direct<br />

access. Journal <strong>of</strong> Experimental Psychology: Learning, Memory, & Cognition, 26(5), 1297e1317.<br />

Crutcher, R. J., & Ericsson, K. A. (2003). <strong>The</strong> <strong>effects</strong> <strong>of</strong> practice on <strong>mnemonic</strong> encodings involving prior knowledge and semantic memory: reply<br />

to Rickard and Bajic (2003). Journal <strong>of</strong> Experimental Psychology: Learning, Memory, and Cognition, 29(6), 1387e1389.<br />

Dahl, R. (1982). <strong>The</strong> twits. Harmondsworth: Puffin.<br />

Ellis, N., & Beaton, A. (1993). Factors affecting the learning <strong>of</strong> foreign language vocabulary: imagery <strong>keyword</strong> mediators and phonological shortterm<br />

memory. <strong>The</strong> Quarterly Journal <strong>of</strong> Experimental Psychology, 46A, 533e558.<br />

Kosslyn, S. M., Van Kleeck, M. H., & Kirby, K. N. (1990). A neurologically plausible model <strong>of</strong> individual differences in visual mental imagery. In<br />

P. J. Hampson, D. F. Marks, & J. T. E. Richardson (Eds.), Imagery: Current developments (pp. 39e77). London: Routledge.<br />

Lawson, M. J., & Hogben, D. (1998). Learning and recall <strong>of</strong> foreign-language vocabulary: <strong>effects</strong> <strong>of</strong> a <strong>keyword</strong> strategy for immediate and delayed<br />

recall. Learning and Instruction, 8(2), 179e194.<br />

Levin, J. R., & Pressley, M. (1985). Mnemonic vocabulary instruction: What’s fact, what’s fiction. In R. F. Dillon (Ed.), Individual differences in<br />

cognition, Vol. 2 (pp. 145e172). Orlando, FL: Academic Press.<br />

Lietz, P. (1996). Changes in reading comprehension across culture and over time. Minister: Waxman.<br />

Marks, D. F. (1973). Visual imagery differences in the recall <strong>of</strong> pictures. British Journal <strong>of</strong> Psychology, 64, 17e24.<br />

McDaniel, M. A., & Pressley, M. (1984). Putting the <strong>keyword</strong> <strong>method</strong> in context. Journal <strong>of</strong> Educational Psychology, 76, 598e609.<br />

Raudenbush, S. W., & Bryk, A. S. (1994). Hierarchical linear models. In T. Husén, & T. N. Postlethwaite (Eds.), International encyclopedia <strong>of</strong><br />

education: Research and studies (2nd ed.) (pp. 2590e2596). Oxford: Pergamon Press.<br />

Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis <strong>method</strong>s (2nd ed.). Thousand Oaks: Sage<br />

Publications.<br />

Raudenbush, S. W., Bryk, A. S., Cheong, Y. F., & Congdon, R. T. (2000). HLM 5: Hierarchical linear and nonlinear modeling. Lincolnwood, IL:<br />

Scientific S<strong>of</strong>tware International. (Version 5.01.2067.1).<br />

Wang, A. Y., & Thomas, M. H. (1995). Effect <strong>of</strong> <strong>keyword</strong>s on long-term retention: help or hindrance? Journal <strong>of</strong> Educational Psychology, 87,<br />

468e475.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!