17.01.2013 Views

Reasoning and Problem Solving - matrics

Reasoning and Problem Solving - matrics

Reasoning and Problem Solving - matrics

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong><br />

General Instructions: Please review each c<strong>and</strong>idate test within the domain of <strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>. A test description is<br />

provided for each test, followed by data relevant to the five test selection criteria: a) test-retest reliability, b) utility as a repeated measure, c)<br />

relationship to functional outcome, d) potential changeability in response to pharmacological agents, <strong>and</strong> e) practicality <strong>and</strong> tolerability. The<br />

list of tests being considered in this domain appears below. The order in which the tests are presented was r<strong>and</strong>omized within each domain.<br />

At the end of the <strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong> section, there is a rating form to be completed for the selection criteria for the c<strong>and</strong>idate<br />

tests in this cognitive domain. Please complete the ratings on the c<strong>and</strong>idate tests one selection criterion at a time before beginning the next<br />

section (e.g., complete ratings for test-retest reliability on all of the tests before going on to utility as a repeated measure). In making your<br />

ratings, please give due consideration to the data available. Avoid use of global impressions across the selection criteria for any given test.<br />

We prefer that you enter your ratings directly onto the web-based data entry system, <strong>and</strong> you will receive instructions <strong>and</strong> your password for<br />

this system shortly. Alternatively, you can complete the forms at the end of each section <strong>and</strong> fax them to us at 310-268-4056, or send them<br />

via overnight mail to Robert S. Kern, Ph.D. at the address listed on the cover letter.<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong> C<strong>and</strong>idate Tests<br />

1. WAIS-III Block Design<br />

2. Brief Assessment of Cognition in Schizophrenia (BACS) - Tower of London<br />

3. Neuropsychology Assessment Battery (NAB) – Mazes<br />

4. Penn Conditional Exclusion Test (PCET)<br />

5. Neuropsychology Assessment Battery (NAB) – Categories<br />

6. Delis-Kaplan Executive Functioning Scale (D-KEFS) - Sorting<br />

Test Description<br />

A brief description of each c<strong>and</strong>idate test is provided below. The information includes: (a) a description of test procedures, (b) sample test<br />

stimuli (if available), (c) scoring procedures, (d) key dependent variables, <strong>and</strong> (e) approximate administration time for clinical <strong>and</strong> control<br />

samples.<br />

1. WAIS-III Block Design<br />

As stated in the WAIS-III Manual, “The examinee is asked to replicate models or pictures of two-color designs with blocks. The<br />

designs progress in difficulty from simple two-block designs to more complex, nine-block designs. Each block has two white sides, two red<br />

sides, <strong>and</strong> two half-red <strong>and</strong> half-white sides.”<br />

- 1 -


The test stimuli materials include the Stimulus Booklet <strong>and</strong> Block Design Blocks (9 total).<br />

As stated in the WAIS-III Manual, “On the Record Form <strong>and</strong> in [the] Manual, the designs are shown from [the examiner’s]<br />

perspective. For incorrect constructions, [the examiner has] the option of recording the examinee’s final design in the blank grids in the<br />

Incorrect Design column. For each item, record in the Completion Time column the number of seconds the examinee takes to construct the<br />

design. For correct constructions, circle Y in the Correct Design column to indicate that the examinee constructed the full design correctly.<br />

Record the score by circling the number of points.”<br />

Block Design raw scores range from 0 to 64 points. Raw scores may be converted to scaled scores using the normative tables in the<br />

WAIS-3 Manual. Scaled scores range from 1 to 19.<br />

Total administration time for Block Design is between 10 <strong>and</strong> 15 minutes for both clinical <strong>and</strong> control subjects.<br />

2. Brief Assessment of Cognition in Schizophrenia (BACS) - Tower of London<br />

Patients are shown two pictures simultaneously. Each picture shows 3 balls of different colors arranged on 3 pegs, with the balls in a<br />

unique arrangement in each picture. Patients are instructed to determine the fewest possible moves of the balls in one picture to make the<br />

arrangement of balls identical to that of the other, opposing picture. There are twenty test items. The items are of variable difficulty, with a<br />

general tendency for later items to be more difficult. The test is discontinued if patients make five consecutive incorrect responses. If<br />

patients respond correctly to all twenty trials, two additional trials of greater difficulty are administered. There are two alternate forms. The<br />

key dependent variable is the number of correct responses (range: 0-22). Test administration requires approximately 8 minutes.<br />

3. Neuropsychology Assessment Battery (NAB) – Mazes<br />

The NAB Mazes test is based on a maze-tracing paradigm <strong>and</strong> was designed to avoid both floor <strong>and</strong> ceiling effects found in existing<br />

maze tests. The NAB Mazes test involves single-trial attempts for each of 7 mazes, with time to completion being the primary variable;<br />

mazes are not discontinued at wrong turns (i.e., “dead ends”). Seven mazes were initially created, with increasing complexity from very<br />

simple to very difficult. All mazes were 9 in. wide <strong>and</strong>, with the exception of the first <strong>and</strong> easiest maze (which was 3 in. high), were 6 in.<br />

high. The alley width for the first two mazes was kept constant at 1 in. The third maze has 3/4 in. alleys. The fourth maze has 1/2 in. alleys.<br />

The last three mazes have 1/4 in. alleys. Care was taken so that the paths traversed all four quadrants of the maze. The “start” <strong>and</strong> “end”<br />

points for the mazes were divided between right <strong>and</strong> left <strong>and</strong> between center <strong>and</strong> perimeter. Initial designs were pilot tested <strong>and</strong><br />

subsequently modified to ensure increasing completion times across the seven mazes. Once the seven mazes were finalized, two additional<br />

alternate forms of each maze were created in the following manner. The first alternate form was created by rotating the original maze 180<br />

degrees along its vertical axis (keeping the “start” <strong>and</strong> “end” points in the original, unrotated position). The second alternate form was<br />

created by rotating the original maze 180 degrees along its horizontal axis (again, keeping the “start” <strong>and</strong> “end” points in the original,<br />

unrotated position). In this manner, a total of 21 mazes was created. The 21 mazes (three alternate forms of 7 mazes each) were rated by the<br />

NAB Advisory Council members for difficulty level <strong>and</strong> overall task satisfaction. The two mazes from each three-maze group with the best<br />

combination of highest satisfaction <strong>and</strong> similar difficulty levels were retained. That is, seven mazes were eliminated, <strong>and</strong> two forms with<br />

seven mazes each were retained. The results of pilot testing were used to empirically determine the difficulty level of each item, to order the<br />

items in ascending difficulty, <strong>and</strong> to equate items across the two equivalent forms.<br />

- 2 -


As with all NAB tests, Mazes raw scores are quickly <strong>and</strong> easily calculated on the record form itself, with no need to refer to a<br />

manual. St<strong>and</strong>ard scores can be derived either by h<strong>and</strong>, using the NAB Normative Manuals, or with the NAB Software Portfolio, an easy-to<br />

use computerized scoring program that calculates st<strong>and</strong>ardized scores from raw scores using either of the two sets of norms<br />

(Demographically Corrected or U.S. Census-Matched).<br />

There is one key Mazes test variable, based on the total scores for all of the mazes administered; each maze is scored based on<br />

whether it is completed or not, with bonus points provided for speed.<br />

The total maximum duration for administration is 15 minutes, though most examinees, both clinical <strong>and</strong> control subjects, require<br />

much less time to complete (with discontinuation rules substantially decreasing administration time).<br />

4. Penn Conditional Exclusion Test (PCET)<br />

The PCET is a computerized test of executive function with four alternate forms in which the subject is visually presented with four<br />

figures oriented horizontally <strong>and</strong> must use a mouse to click on the figure that is different than the other three. The task begins by instructing<br />

the subject, “In this task you will be shown four objects. You will need to determine which object does not belong with the other three.<br />

When you correctly choose the object that does not belong you will be told ‘correct’. If you choose an incorrect item you will be told<br />

‘incorrect’”. For each presentation of four stimuli there is text at the bottom of the page that states, “click on the object that does not<br />

belong”. The task progresses through 3 correctness criteria <strong>and</strong> the subject is presented with immediate visual feedback stating “correct” or<br />

“incorrect” for 500 msec. These sorting criteria are non-overlapping for the four test forms. For form 1, the first category is line thickness,<br />

the second is shape <strong>and</strong> the third is size. For form 2, the first category is location, the second is color <strong>and</strong> the third is type of shape. For form<br />

3, the first category is line angle, the second is solid versus dashed lines <strong>and</strong> the third is presence or absence of a border. For form 4, the first<br />

category is sharp versus rounded edges, the second is angle of the stimulus box <strong>and</strong> the third is whether the box is filled or empty. Number<br />

of categories, number of errors, number of perseverative errors, trials to first category, <strong>and</strong> total number of trials are primary dependent<br />

measures. Because the task is self-paced, the length is variable, but in the validation study it took both patients <strong>and</strong> controls approximately<br />

10 minutes to complete. If a participant is unable to achieve a single category, the test ends after 144 trials.<br />

5. Neuropsychology Assessment Battery (NAB) – Categories<br />

In the design of the NAB Categories test, the strengths of existing sorting <strong>and</strong> classification tasks were incorporated, along with<br />

additional features that allow for an alternate form with little practice effects, no additional manipulatives or cards, <strong>and</strong> face-valid <strong>and</strong><br />

visually attractive stimuli. Three complete forms of the Categories test were created initially. Each form consisted of two separate panels;<br />

each panel contained photographs of six adults along with associated identifying information. To perform the task, the examinee is<br />

instructed to come up with as many different two-group categories as they can, based on any of the information on the page; each group<br />

must contain at least two people, <strong>and</strong> all six people must fit into one group or the other. To create the original three forms, hundreds of<br />

photographs were initially culled from catalogs of digital stock photography. The selection of the final 36 photographs used in the initial<br />

three forms was made through an iterative process involving extensive pilot testing <strong>and</strong> regrouping <strong>and</strong> reselecting photographs in order to<br />

yield forms with similar possible categorization solutions based on the visual information included in the photographs (e.g., eye glasses vs.<br />

no eye glasses, round shirt collar vs. button-down shirt collar). Once photographs were selected, the background of each photograph was<br />

digitally modified to be either blue-gray or yellow (according to categorization decision rules). For each of the six panels (i.e., two panels<br />

- 3 -


per form), additional categorization rules were used to create distinct outlines (e.g., three punch holes vs. spiral-binding holes) <strong>and</strong><br />

backgrounds (thin-lined border vs. thick-lined border) for each photograph, in addition to the identification information printed below each<br />

photograph. For each form, the first panel included six identifying information categories (e.g., name, occupation, place of birth), <strong>and</strong> the<br />

second panel included five identifying information categories. Each category (except marital status) was designed to have several possible<br />

solutions for 1- <strong>and</strong> 2-point scores; the possible correct responses for the first panel were not possible correct responses for the second panel.<br />

Once again, pilot testing results guided modifications to the category information such that each panel had a similar number of possible<br />

solutions across the three forms. The three forms (6 panels) were rated by the NAB Advisory Council members for difficulty level,<br />

ethnic/racial/cultural bias, educational bias, U.S. regional bias, sex bias, <strong>and</strong> overall task satisfaction. The two forms (two panels each) with<br />

the lowest bias ratings, highest task satisfaction ratings, <strong>and</strong> most similar difficulty ratings were retained. The results of pilot testing were<br />

used to empirically determine the difficulty level of each panel <strong>and</strong> to equate panels across the two equivalent forms.<br />

As with all NAB tests, Categories raw scores are quickly <strong>and</strong> easily calculated on the record form itself, with no need to refer to a<br />

manual. St<strong>and</strong>ard scores can be derived either by h<strong>and</strong>, using the NAB Normative Manuals, or with the NAB Software Portfolio, an easy-to<br />

use computerized scoring program that calculates st<strong>and</strong>ardized scores from raw scores using either of the two sets of norms<br />

(Demographically Corrected or U.S. Census-Matched).<br />

There is one key Categories test variable based on the total number of points received (0-1-2) for all categories produced by the<br />

examinee over the two panels.<br />

The total maximum duration for administration is 8 minutes, though most examinees, both clinical <strong>and</strong> control subjects, require less<br />

time to complete.<br />

6. Delis-Kaplan Executive Functioning Scale (D-KEFS) - Sorting<br />

The D-KEFS Sorting Test consists of two testing conditions: Free Sorting <strong>and</strong> Sort Recognition. For the MATRICS battery, the<br />

Free Sorting condition is being considered. For the Free Sorting condition, the examinee is presented with six mixed-up cards that display<br />

both stimulus words <strong>and</strong> various perceptual features. The examinee is asked to sort the cards into two groups, three cards per group,<br />

according to as many different categorization rules or concepts as possible, <strong>and</strong> to describe the concepts he or she used to generate each sort.<br />

Each of the two card sets can be grouped into a maximum of eight target sorts: three sorts are based on verbal-semantic information from the<br />

stimulus words, <strong>and</strong> five are based on visual-spatial features or patterns on the cards. For the Sort Recognition condition, the same sets of<br />

cards are each sorted by the examiner into two groups, three cards per group, according to the eight target sorts. After each sort make by the<br />

examiner, the examinee attempts to describe the correct categorization rule or concept used to generate the sort. Because no right or wrong<br />

feedback is provided by the test, the possible adverse effect of repetitive negative feedback is minimized. This test affords an assessment of<br />

several executive-function components, including (a) initiating of problem-solving behavior; (b) concept-formation skills; (c) modalityspecific<br />

problem-solving skills (verbal versus nonverbal); (d) the ability to explain the sorting concepts abstractly; (e) the ability to transfer<br />

sorting concepts into action; (f) the ability to inhibit previous sorting responses; <strong>and</strong> (g) the ability to inhibit previous descriptions of sorting<br />

rules. The test is scored using a scoring-assisted software program. The Free Sorting takes about 10-15 minutes to administer <strong>and</strong> it can be<br />

administered by itself. The test has an alternate form.<br />

- 4 -


Test-Retest Reliability<br />

- 5 -<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Test-Retest Reliability<br />

The tables below provide information about test-retest reliability for each c<strong>and</strong>idate test. Data are provided for normal samples <strong>and</strong>, when<br />

available, for schizophrenia <strong>and</strong> other clinical samples. In making your ratings, please try to take into account several factors, including the<br />

type of statistic used to measure reliability, the time interval between testings, <strong>and</strong> whether the data are from normal or clinical samples. As<br />

a general guide for immediate test-retest reliability, consider r values of 0.60 = modest, 0.70 = good, <strong>and</strong> 0.80 = very good. For the<br />

generalizability statistic, G, consider 0.50 = modest, 0.60 = good, <strong>and</strong> 0.70 = very good. Intraclass correlations (ICCs) are identical to r<br />

when no mean differences are present. However, unlike r, ICCs will be somewhat lower if mean differences are present, so should be<br />

viewed as potentially more similar to G when considering comparisons that may include practice effects. Remember that longer test-retest<br />

intervals would be expected to yield lower values than would immediate test-retest evaluations.<br />

1. WAIS-III Block Design<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

1) Wechsler (1997)<br />

Healthy adults: 16-29 y.o. 100 2-12 weeks r = 0.83<br />

Healthy adults: 30-54 y.o. 102 2-12 weeks r = 0.88<br />

Healthy elderly: 55-74 y.o. 104 2-12 weeks r = 0.80<br />

Healthy elderly: 75-89 y.o. 88 2-12 weeks r = 0.82<br />

2. Brief Assessment of Cognition in Schizophrenia (BACS) - Tower of London<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

1) Keefe et al. (submitted) Healthy adults Version A repeated: 18 1 – 3 days Version A repeated:<br />

Version B repeated: 16<br />

ICC=0.83; Version B<br />

repeated: ICC=0.73<br />

Schizophrenia patients Version A repeated: 50 1 – 3 days Version A repeated:<br />

Version B repeated: 51<br />

ICC=0.66; Version B<br />

repeated: ICC=0.77<br />

3. Neuropsychology Assessment Battery (NAB) – Mazes<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

1) White & Stern, 2003 Healthy adults 95 193 (+ 20) days (same form<br />

on both occasions)<br />

r = 0.55<br />

Healthy adults 100 25 (+ 6) days (different form<br />

on each occasion)<br />

G coefficient= 0.95


4. Penn Conditional Exclusion Test (PCET)<br />

- 6 -<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Test-Retest Reliability<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

1) Gur RE et al., in press Healthy adults 24 6 weeks r = 0.69<br />

Schizophrenia patients 16 6 weeks r = 0.66<br />

5. Neuropsychology Assessment Battery (NAB) – Categories<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

1) White & Stern, 2003 Healthy adults 95 193 (+ 20) days (same form<br />

on both occasions)<br />

r = 0.60<br />

Healthy adults 100 25 (+ 6) days (different form<br />

on each occasion)<br />

G coefficient=0.89<br />

6. Delis-Kaplan Executive Functioning Scale - Sorting<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

1) Delis et al., 2001 Healthy adults 286 3 weeks Correct Confirmed Sorts:<br />

r = 0.60<br />

Correct Description: r = 0.61


Utility as a Repeated Measure<br />

- 7 -<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Utility as a Repeated Measure<br />

The tables below provide information regarding practice effects with repeated administration. Data are provided for normal samples <strong>and</strong>,<br />

when available, for schizophrenia <strong>and</strong> other clinical samples. Data from schizophrenia samples are most relevant. The primary<br />

consideration is whether the magnitude of practice effects compromises the test’s utility in a clinical trials design. For example, a test’s<br />

utility may be compromised by practice effects if they are of a size that they would lead to reduced range, ceiling effects, or reduced<br />

variance (i.e., smaller SDs over succeeding trials). If low scores on a measure reflect good performance, this is noted in the table. The first<br />

table under each test contains data on practice effects. The second table contains data on alternate form reliability, if the test is available in<br />

alternate forms. Alternate forms are generally not needed for tests with minimal practice effects, or ones that do not approach ceiling. If a<br />

test has practice effects of sufficient magnitude to compromise its utility in a clinical trials design with repeated measurements, then<br />

consideration of alternate form reliability is appropriate.<br />

1. WAIS-III Block Design<br />

Practice Effects<br />

Author(s) & year Sample type Sample<br />

1) Wechsler<br />

(1997)<br />

Healthy adults:<br />

16-29 y.o.<br />

Healthy adults:<br />

30-54 y.o.<br />

Healthy elderly:<br />

55-74 y.o.<br />

Healthy elderly:<br />

75-89 y.o.<br />

Alternate Form Reliability<br />

No alternate forms<br />

size<br />

Range of possible<br />

scores<br />

Mean (SD) at<br />

T1<br />

Interval between T1 <strong>and</strong> T2<br />

(weeks)<br />

Mean (SD) at T2<br />

100 1-19 (scaled scores) 10.3 (2.6) 2-12 weeks (mean = 4) 11.3 (2.7)<br />

102 1-19 (scaled scores) 10.2 (2.7) 2-12 weeks (mean = 4) 10.9 (3.0)<br />

104 1-19 (scaled scores) 9.9 (2.8) 2-12 weeks (mean = 4) 10.1 (2.8)<br />

88 1-19 (scaled scores) 9.7 (2.9) 2-12 weeks (mean = 4) 10.0 (3.2)


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Utility as a Repeated Measure<br />

2. Brief Assessment of Cognition in Schizophrenia (BACS) - Tower of London<br />

Practice Effects<br />

Author(s) & year Sample type Sample size Range of<br />

possible<br />

1) Keefe et al.<br />

(submitted)<br />

Healthy adults Total=50<br />

-Version A repeated = 18<br />

Schizophrenia<br />

patients<br />

-Version B repeated = 16<br />

Total = 150<br />

-Version A repeated = 50<br />

-Version B repeated = 51<br />

scores<br />

0-22<br />

(raw<br />

scores)<br />

0-22<br />

(raw<br />

scores)<br />

- 8 -<br />

Mean (SD) at T1 Interval between<br />

15.39 (3.18)<br />

17.19 (2.93)<br />

12.46 (4.72)<br />

10.90 (5.52)<br />

T1 <strong>and</strong> T2<br />

(weeks)<br />

1-3 days<br />

1-3 days<br />

16.83 (2.75)<br />

18.69 (2.5)<br />

13.75 (4.80)<br />

11.78 (5.72)<br />

Mean (SD) at T2<br />

Alternate Form Reliability<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

(specify coefficient type, e.g.<br />

r (ICC) = .89)<br />

1) Keefe, et al.<br />

Healthy adults Version A, then B: 8 1 – 3 days Version A, then B: ICC=0.97<br />

Version B, then A: 8<br />

Schizophrenia patients Version A, then B: 23<br />

Version B, then A: 25<br />

3. Neuropsychology Assessment Battery (NAB) - Mazes<br />

Practice Effects<br />

Author(s) &<br />

year<br />

1) White &<br />

Stern, 2003<br />

Version B, then A: ICC =0.93<br />

1 – 3 days Version A, then B: ICC=0.81<br />

Version B, then A: ICC=0.89<br />

Sample type Sample Range of Mean (SD) Interval between Mean (SD) Interval between Mean (SD) at<br />

size possible scores at T1 T1 <strong>and</strong> T2<br />

(weeks)<br />

at T2 T2 <strong>and</strong> T3<br />

(weeks)<br />

T3<br />

Healthy adults 95 19-81 (T-scores) 49.0 (11.5) 28 50.5 (8.7) N/A N/A<br />

Alternate Form Reliability<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

(specify coefficient type, e.g.<br />

r (ICC) = .89)<br />

1) White & Stern, 2003 Healthy adults 100 25 (+ 6) days G coefficient= 0.95


4. Penn Conditional Exclusion Test (PCET)<br />

Practice Effects<br />

Author(s) & year Sample type Sample<br />

size<br />

1) Gur RE, et al.,<br />

in press<br />

Range of<br />

possible<br />

scores<br />

Healthy adults 24 (z-scores)<br />

-3 to +3<br />

Schizophrenia 16 (z-scores)<br />

patients<br />

-3 to +3<br />

Mean (SD) at<br />

Alternate Form Reliability<br />

Author(s) & year Sample type Sample<br />

size<br />

T1<br />

- 9 -<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Utility as a Repeated Measure<br />

Interval between<br />

T1 <strong>and</strong> T2<br />

Mean (SD) at<br />

(weeks)<br />

(weeks)<br />

0.07 (0.40) 6 weeks 0.13 (0.41) 24 weeks 0.24 (0.40)<br />

T2<br />

Interval between<br />

T2 <strong>and</strong> T3<br />

Mean (SD) at<br />

-0.50 (1.68) 6 weeks -0.75 (1.57) 24 weeks -0.19 (0.93)<br />

Interval between testings Reliability coefficient<br />

(specify coefficient type, e.g.<br />

r (ICC) = .89)<br />

1) Kurtz et al, in press Healthy adults 80 Same test session (1-2 minutes) ICC=0.39; p


6. Delis-Kaplan Executive Functioning Scale - Sorting<br />

Practice Effects<br />

Author(s) &<br />

year<br />

1) D-KEFS Test<br />

Manual<br />

Sample type Sample<br />

size<br />

Range of possible<br />

scores<br />

Healthy adults 101 Confirmed sorts<br />

Corr. description<br />

Range = 1-19<br />

Mean<br />

(SD) at T1<br />

10.2 (2.8)<br />

10.1 (2.7)<br />

- 10 -<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Utility as a Repeated Measure<br />

Interval between<br />

T1 <strong>and</strong> T2<br />

(weeks)<br />

Mean (SD)<br />

at T2<br />

25 (+12.8) days 11.3 (2.4)<br />

11.3 (2.5)<br />

Interval between<br />

T2 <strong>and</strong> T3<br />

(weeks)<br />

N/A N/A<br />

Mean (SD) at<br />

Alternate Form Reliability<br />

Author(s) & year Sample type Sample size Interval between testings Reliability coefficient<br />

(specify coefficient type, e.g. r<br />

(ICC) = .89)<br />

1) D-KEFS Test Manual Healthy adults 101 25 (+12.8) days r (Correct confirmed sorts) = .57<br />

r (Correct description) = .39<br />

T3


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Relationship to Functional Outcome<br />

Relationship to Functional Outcome<br />

The tables below provide information on the relationship between performance on c<strong>and</strong>idate tests <strong>and</strong> functional outcome. Functional<br />

outcome measures include measures of community functioning, social problem-solving ability, or psychosocial rehabilitation. Data from<br />

schizophrenia samples are most important <strong>and</strong> are listed first in each table. If available, data from other groups, such as other clinical<br />

samples, are also listed. Demonstrations of prospective relationships to functional outcome are especially powerful <strong>and</strong> should be given<br />

greater weight than cross-sectional findings, even though the strength of the relationship between cognition <strong>and</strong> functional outcome might<br />

be lower in prospective studies than cross-sectional ones. As a general guide to effect sizes, correlation coefficients of 0.1 are considered<br />

small, 0.3 are considered medium, <strong>and</strong> 0.5 large.<br />

1. WAIS-III Block Design<br />

Studies Using This Specific Test<br />

Author(s) & year Sample type Sample size Functional outcome<br />

measure(s)<br />

1) Dickerson et al.,<br />

1999<br />

2) Gold, J.<br />

(INS, 2003)<br />

3) Brekke et al.,<br />

1997<br />

4) Buchanan et al.,<br />

1994<br />

Schizophrenia<br />

patients<br />

Stable<br />

Schizophrenia<br />

outpatients<br />

Schizophrenia <strong>and</strong><br />

Schizoaffective<br />

Disorder patients<br />

Schizophrenia<br />

patients<br />

- 11 -<br />

Findings (r or other magnitude of<br />

relationship statistic)<br />

Cross-sectional vs.<br />

prospective study (if<br />

prospective, provide<br />

interval)<br />

72 Social functioningoccupational<br />

activity<br />

r = 0.24, p=0.045 Prospective, 2 year follow-up<br />

77 Total Vocational outcome r (with employment status ) = 0.262, Cross-sectional<br />

(39 competitively<br />

employed; 38<br />

unemployed)<br />

p=0.021<br />

40 Vocational, Social,<br />

Independent Living<br />

r = 0.35, p


8) Lysaker et al.,<br />

2001<br />

Schizophrenia <strong>and</strong><br />

Schizoaffective<br />

Disorder patients<br />

49<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Relationship to Functional Outcome<br />

Self-efficacy r = ?, ns<br />

Cross-sectional<br />

Well-being r = -0.30, p


4. Penn Conditional Exclusion Test (PCET)<br />

Studies Using This Specific Test<br />

Author(s) & year Sample type Sample<br />

size<br />

1) Kurtz et al. (in<br />

press)<br />

Schizophrenia<br />

patients<br />

Functional outcome<br />

measure(s)<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Relationship to Functional Outcome<br />

- 13 -<br />

Findings (r or other magnitude of<br />

relationship statistic)<br />

28 Work competence PCET categories <strong>and</strong> errors related to<br />

“cooperativeness” on the job (r=0.47 <strong>and</strong><br />

r=-0.49 respectively). PCET errors related<br />

to “work quality” (r=-0.44) <strong>and</strong> “general<br />

impression” (r=-0.42).<br />

Cross-sectional vs.<br />

prospective study (if<br />

prospective, provide interval)<br />

Cross-sectional<br />

Meta-Analysis of Similar Tests<br />

Author(s) & year Type of tests Sample type Sample size Pooled estimated r p value<br />

1) Green et al., 2000 Card sorting tests Schizophrenia <strong>and</strong> Schizoaffective<br />

Disorder patients<br />

1002 r = 0.23 p


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Potential Changeability in Response to Pharmacological Agents<br />

Potential Changeability in Response to Pharmacological Agents<br />

The following tables provide information about changeability in response to pharmacological agents. The data in this section are<br />

illustrative, <strong>and</strong> not comprehensive. The findings listed below were provided by test developers, or come from representative reviews <strong>and</strong><br />

meta-analyses. Data for schizophrenia samples are listed first in the tables below. Data from other groups, such as other clinical samples,<br />

are also listed. The studies listed include those that used novel cognitive-enhancing agents, ones that examined newer versus conventional<br />

antipsychotic medications, or ones that compared two or more newer antipsychotic medications. In examining the data, give greater weight<br />

to between-group effects except in those cases in which all treatment arms would be expected to yield cognitive enhancing effects. In<br />

evaluating the tests for this particular criterion, consideration should be given to the fact that this area of investigation is relatively new.<br />

1. WAIS-III Block Design<br />

Studies Using This Specific Test<br />

Author(s) & year Sample type Sample<br />

size<br />

Drug comparison Findings (specify within- <strong>and</strong> between-group effects)<br />

1) Buchanan et al., 1994 Schizophrenia 38 Clozapine vs. haloperidol Significant group difference on block design (p < 0.05)<br />

2) Meltzer <strong>and</strong> McGurk<br />

1999<br />

patients<br />

Schizophrenia<br />

patients<br />

Review of<br />

literature<br />

Studies Using Similar Tests (similar in format <strong>and</strong> assessing same construct)<br />

No data available<br />

2. Brief Assessment of Cognition in Schizophrenia (BACS) - Tower of London<br />

Studies Using This Specific Test<br />

Author(s) & year Sample type Sample<br />

1) Allen et al., 2003 Schizophrenia<br />

patients<br />

size<br />

Clozapine 2/3 studies showed improvement with clozapine (withingroup)<br />

Drug comparison Findings (specify within- <strong>and</strong> between-group effects)<br />

20 Galantamine (procholinergic<br />

agent <strong>and</strong><br />

modulator at the nicotinic<br />

receptor) vs. placebo<br />

Studies Using Similar Tests (similar in format <strong>and</strong> assessing same construct)<br />

No data available<br />

- 14 -<br />

Tower of London change score, galantamine vs placebo at<br />

trend level (p=0.073)


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Potential Changeability in Response to Pharmacological Agents<br />

3. Neuropsychology Assessment Battery (NAB) – Mazes<br />

Studies Using This Specific Test<br />

The NAB is a newly released test <strong>and</strong> has no data available.<br />

Studies Using Similar Tests (similar in format <strong>and</strong> assessing same construct)<br />

Author(s) & year Sample type Sample<br />

size<br />

Drug comparison Findings (specify within- <strong>and</strong> between-group effects)<br />

1) Gallhofer et al., 1996 Schizophrenia 64 Clozapine vs. risperidone Clozapine <strong>and</strong> risperidone treated patients performed<br />

patients<br />

vs. conventional<br />

significantly better than unmedicated patients, or those taking<br />

medications vs. no meds conventional meds (contrasts, p > 0.05)<br />

2) Meltzer <strong>and</strong> McGurk Schizophrenia Review of Clozapine <strong>and</strong> risperidone 1/2 studies showed improvement with clozapine; 0/1 showed<br />

1999<br />

patients<br />

literature<br />

improvement risperidone<br />

4. Penn Conditional Exclusion Test (PCET)<br />

Studies Using This Specific Test<br />

Author(s) & year Sample type Sample<br />

size<br />

1) Gur RE, et al., in press Schizophrenia<br />

patients<br />

16<br />

Healthy adults 24<br />

Drug comparison Findings (specify within- <strong>and</strong> between-group effects)<br />

Patients off vs. on<br />

Olanzapine, vs. controls<br />

- 15 -<br />

Between-group, diagnosis by time interaction: F(5,66) = 2.72,<br />

p = 0.0268; patients improved more than controls<br />

Meta-Analyses <strong>and</strong> Reviews of Similar Tests<br />

Author(s) & year Type of tests Sample type No. of studies Effect size (within-group<br />

1) Harvey & Keefe, 2001 Tests of executive functioning Schizophrenia<br />

patients<br />

2) Keefe et al. 1999 Tests of executive functioning Schizophrenia<br />

patients<br />

3) Meltzer & McGurk, Wisconsin Card Sorting Test Schizophrenia<br />

1999<br />

patients<br />

effect), or box score<br />

N = 20 for all d = 0.18<br />

tests, no info on<br />

number of<br />

executive tests<br />

12 8/12 showed improvement<br />

9 2/7 studies showed improvement<br />

with clozapine; 1/2 showed<br />

improvement with risperidone


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Potential Changeability in Response to Pharmacological Agents<br />

5. Neuropsychology Assessment Battery (NAB) – Categories<br />

Studies Using This Specific Test<br />

The NAB is a newly released test <strong>and</strong> has no data available.<br />

Meta-Analyses <strong>and</strong> Reviews of Similar Tests<br />

Author(s) & year Type of tests Sample type No. of studies Effect size (within-group<br />

1) Harvey & Keefe, 2001 Tests of executive functioning Schizophrenia<br />

patients<br />

2) Keefe et al., 1999 Tests of executive functioning Schizophrenia<br />

patients<br />

3) Meltzer & McGurk, Wisconsin Card Sorting Test Schizophrenia<br />

1999<br />

patients<br />

6. Delis-Kaplan Executive Functioning Scale - Sorting<br />

Studies Using This Specific Test<br />

No data available<br />

- 16 -<br />

effect), or box score<br />

N = 20 for all d = 0.18<br />

tests, no info on<br />

number of<br />

executive tests<br />

12 8/12 showed improvement<br />

9 2/7 studies showed improvement<br />

with clozapine; 1/2 showed<br />

improvement with risperidone<br />

Meta-Analyses <strong>and</strong> Reviews of Similar Tests<br />

Author(s) & year Type of tests Sample type No. of studies Effect size (within-group<br />

1) Harvey & Keefe, 2001 Tests of executive functioning Schizophrenia<br />

patients<br />

2) Keefe et al. 1999 Tests of executive functioning Schizophrenia<br />

patients<br />

3) Meltzer & McGurk, Wisconsin Card Sorting Test Schizophrenia<br />

1999<br />

patients<br />

effect), or box score<br />

N = 20 for all d = 0.18<br />

tests, no info on<br />

number of<br />

executive tests<br />

12 8/12 showed improvement<br />

9 2/7 studies showed improvement<br />

with clozapine; 1/2 showed<br />

improvement with risperidone


Practicality <strong>and</strong> Tolerability<br />

- 17 -<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Practicality <strong>and</strong> Tolerability<br />

The following narratives provide information about the steps that are needed to set up each test at a new site. In evaluating each test, take<br />

into consideration: (a) ease of set-up including use of any specialized equipment (e.g., particular computer requirements), (b) duration of<br />

test, (c) the length of time to train testers to administer <strong>and</strong> score the test, (d) the length of time to score an individual test after testers have<br />

been trained, (e) subjective reactions of the individuals who are taking the test (e.g., do they find it boring, frustrating, intrinsically<br />

interesting)? It may be helpful to refer back to the Test Description section (pages 1-4) to review specific information on a given test.<br />

1. WAIS-III Block Design<br />

The materials needed to administer the test are: WAIS-III Manual, WAIS-III Stimulus Booklet, Block Design Blocks (9), stopwatch,<br />

<strong>and</strong> a Record Form.<br />

Administration <strong>and</strong> scoring are readily learned by testers, although testers must practice manipulating multiple materials at once.<br />

Total administration time for Block Design is between 10 <strong>and</strong> 15 minutes for both clinical <strong>and</strong> control subjects. Scoring time is minimal<br />

requiring the tester to simply add the total correct <strong>and</strong> look up a scaled score in the manual. Additional scoring time (minimal) is required if<br />

the option to draw incorrect designs is implemented. Both clinical <strong>and</strong> controls groups find the task mildly to moderately challenging.<br />

Clinical subjects tend to be more easily frustrated <strong>and</strong> tolerate the task less well than controls.<br />

2. Brief Assessment of Cognition in Schizophrenia (BACS) - Tower of London<br />

The procedures for setting up this test at new sites have been established through its use in nine multi-site clinical trials. Each site<br />

receives the test materials, along with an instruction <strong>and</strong> scoring manual <strong>and</strong> an instructional CD. It is very simple to set up this test at a<br />

new site. The only materials needed are the Tower of London stimulus book, the one page response sheet, a stopwatch <strong>and</strong> a pen or pencil.<br />

It usually requires less than 15 minutes to train the tester on the administration <strong>and</strong> scoring of the test. The test is scored during<br />

administration. Patients may become frustrated as the items become more complex but frustration is limited due to the discontinuation<br />

criteria. Most controls find the test to be interesting <strong>and</strong> increasingly more challenging as complexity increases.<br />

3. Neuropsychology Assessment Battery (NAB) – Mazes<br />

The NAB Mazes test is quite easy to set up at a new site. The only materials required are: 1) the NAB Executive Functions Module<br />

Record Forms (one for each participant; separate forms for each of the two equivalent forms); 2) the NAB Executive Functions Module<br />

Response Booklets (one for each participant; separate forms for each of the two equivalent forms); <strong>and</strong> 3) a stop watch. Testers can be<br />

trained to administer <strong>and</strong> score the Mazes test in less than an hour. It is recommended that they practice administering the test with at least<br />

two examinees prior to using it for actual data collection. Calculation of raw scores takes 1-2 minutes. Calculation of the st<strong>and</strong>ardized Tscore<br />

using the NAB Software Portfolio computerized scoring package takes 1 minute. In the st<strong>and</strong>ardization sample (N = 1448) <strong>and</strong> in the


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong>: Practicality <strong>and</strong> Tolerability<br />

numerous clinical samples examined for validity studies (including patients with MS, TBI, ADHD, HIV/AIDS), respondents reacted<br />

favorably to the test. It was viewed as neither too easy nor too challenging/frustrating.<br />

4. Penn Conditional Exclusion Test (PCET)<br />

This is a computerized task originally developed on the PowerLaboratory platform. It runs on older Macintosh computers using OS9<br />

system software, or on newer Macintosh computers running MAC OSX using the Classic environment. Portability has been a concern so the<br />

task has also been implemented with a Web-based interface using FLASH MacroMedia. The Web-based version of the task can be run on or<br />

off-line with any computer. Either version of the task outputs performance data as a text file that gets scored using a Python script running<br />

on Linux.<br />

Training on task administration <strong>and</strong> scoring requires about 1 hour. The majority of the training involves learning how to set-up <strong>and</strong><br />

launch the task. Once the task is launched, the administrator reads the on-screen instructions along with the subject <strong>and</strong> answers any<br />

questions to insure that the subject underst<strong>and</strong>s the task. However, once the task starts, it runs <strong>and</strong> saves data automatically with no<br />

additional input from the administrator. Scoring of the data requires a simple one-line comm<strong>and</strong> to launch the scoring program that scores<br />

<strong>and</strong> databases the results automatically. Scoring requires less than a minute.<br />

Subjects have reacted very favorably to taking this task. For patients, the computer interface is particularly welcome as it requires<br />

less social interaction than st<strong>and</strong>ard paper <strong>and</strong> pencil administration. In some cases, older control subjects who are not familiar with<br />

computers are initially intimidated. For this reason, we have developed a separate “Mouse Practice” task that familiarizes subjects with the<br />

computer at the beginning of our computerized test battery.<br />

5. Neuropsychology Assessment Battery (NAB) – Categories<br />

The NAB Categories test is quite easy to set up at a new site. The only materials required are: 1) the NAB Executive Functions<br />

Module Stimulus Book (one for each of the two equivalent forms); 2) NAB Executive Functions Module Record Forms (one for each<br />

participant; separate forms for each of the two equivalent forms); <strong>and</strong> 3) a stop watch. Testers can be trained to administer <strong>and</strong> score the<br />

Categories test in less than an hour. It is recommended that they practice administering the test with at least two examinees prior to using it<br />

for actual data collection. Calculation of raw scores takes 3-5 minutes. Calculation of the st<strong>and</strong>ardized T-score using the NAB Software<br />

Portfolio computerized scoring package takes 1 minute. In the st<strong>and</strong>ardization sample (N = 1448) <strong>and</strong> in the numerous clinical samples<br />

examined for validity studies (including patients with MS, TBI, ADHD, HIV/AIDS), respondents reacted favorably to the test. It was<br />

viewed as neither too easy nor too challenging/frustrating.<br />

6. Delis-Kaplan Executive Functioning Scale - Sorting<br />

The sorting test is a paper-<strong>and</strong>-pencil test, though computer scoring is available. The free sorting condition, which is being<br />

considered for the MATRICS battery, takes considerably less time than administering both the free sort <strong>and</strong> recognition conditions. Free<br />

sorting alone takes 10-15 minutes to administer. No information on tolerability.<br />

- 18 -


<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong><br />

Name: ______________________________ Date: _________________<br />

General Instructions: Please indicate your rating for each test on the test criteria by placing an “X” in the box below the<br />

desired numeric rating. Descriptors are provided to help anchor the ratings. Use the “5” anchor point to indicate, “There<br />

are just enough data on this criterion to consider the test useful for clinical trials.” Ratings greater than “5” indicate that<br />

there is more than enough support for the test on this criterion; ratings less than “5” indicate that there is less than<br />

adequate support.<br />

Test-Retest Reliability<br />

1. WAIS-III Block Design<br />

2. BACS - Tower of London<br />

3. NAB – Mazes<br />

4. Penn Conditional<br />

Exclusion Test (PCET)<br />

5. NAB – Categories<br />

6. DKEFS – Sorting<br />

Comments:<br />

Utility as a Repeated<br />

Measure<br />

1. WAIS-III Block Design<br />

2. BACS - Tower of London<br />

3. NAB – Mazes<br />

4. Penn Conditional<br />

Exclusion Test (PCET)<br />

5. NAB – Categories<br />

6. DKEFS – Sorting<br />

Comments:<br />

Poor<br />

1<br />

Poor<br />

1<br />

2<br />

2<br />

Fair<br />

3<br />

Fair<br />

3<br />

- 19 -<br />

4<br />

4<br />

Rating<br />

Good<br />

5<br />

Rating<br />

Good<br />

5<br />

6<br />

6<br />

Very<br />

Good<br />

7<br />

Very<br />

Good<br />

7<br />

8<br />

8<br />

Superb<br />

9<br />

Superb<br />

9


Relationship to<br />

Functional Outcome<br />

1. WAIS-III Block Design<br />

2. BACS - Tower of<br />

London<br />

3. NAB – Mazes<br />

4. Penn Conditional<br />

Exclusion Test (PCET)<br />

5. NAB – Categories<br />

6. DKEFS – Sorting<br />

Comments:<br />

Potential Changeability<br />

in Response to<br />

Pharmacological<br />

Agents<br />

1. WAIS-III Block Design<br />

2. BACS - Tower of London<br />

3. NAB – Mazes<br />

4. Penn Conditional<br />

Exclusion Test (PCET)<br />

5. NAB – Categories<br />

6. DKEFS – Sorting<br />

Comments:<br />

None<br />

1<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong><br />

2<br />

Weak<br />

3<br />

4<br />

- 20 -<br />

Rating<br />

Moderate<br />

5<br />

Rating<br />

6<br />

Strong<br />

7<br />

Unlikely to be Sensitive Difficult to Judge Likely to be Sensitive<br />

1<br />

2<br />

3<br />

4<br />

5<br />

6<br />

7<br />

8<br />

8<br />

Very<br />

Strong<br />

9<br />

9


Practicality <strong>and</strong><br />

Tolerability<br />

1. WAIS-III Block Design<br />

2. BACS - Tower of London<br />

3. NAB – Mazes<br />

4. Penn Conditional<br />

Exclusion Test (PCET)<br />

5. NAB – Categories<br />

6. DKEFS – Sorting<br />

Comments:<br />

Poor<br />

1<br />

<strong>Reasoning</strong> <strong>and</strong> <strong>Problem</strong> <strong>Solving</strong><br />

2<br />

Fair<br />

3<br />

- 21 -<br />

4<br />

Rating<br />

Good<br />

5<br />

6<br />

Very<br />

Good<br />

7<br />

8<br />

Superb<br />

9

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!