06.09.2021 Views

Mind, Body, World- Foundations of Cognitive Science, 2013a

Mind, Body, World- Foundations of Cognitive Science, 2013a

Mind, Body, World- Foundations of Cognitive Science, 2013a

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

and then determine the relationship between the third note <strong>of</strong> the major chord<br />

component and the fourth “added” note. This is particularly difficult because <strong>of</strong> the<br />

pitch class representation, which throws away note-order information that might<br />

be useful in identifying chord type.<br />

It was decided that the network that would be trained on the chord classification<br />

task would be a network <strong>of</strong> value units (Dawson & Schopflocher, 1992b). The<br />

hidden units and output units in a network <strong>of</strong> value units use a Gaussian activation<br />

function, which means that they behave as if they carve two parallel planes<br />

through a pattern space. Such networks can be trained with a variation <strong>of</strong> the generalized<br />

delta rule. This type <strong>of</strong> network was chosen for this problem for two reasons.<br />

First, networks <strong>of</strong> value units have emergent properties that make them easier to<br />

interpret than other types <strong>of</strong> networks trained on similar problems (Dawson, 2004;<br />

Dawson et al., 1994). One reason for this is because value units behave as if they<br />

are “tuned” to respond to very particular input signals. Second, previous research<br />

on different versions <strong>of</strong> chord classification problems had produced networks that<br />

revealed elegant internal structure (Yaremchuk & Dawson, 2005, 2008).<br />

The simplest network <strong>of</strong> value units that could learn to solve the chord classification<br />

problem required three hidden units. At the start <strong>of</strong> training, the value<br />

<strong>of</strong> m for each unit was initialized as 0. (The value <strong>of</strong> m for a value unit is analogous<br />

to a threshold in other types <strong>of</strong> units [Dawson, Kremer, & Gannon, 1994;<br />

Dawson & Schopflocher, 1992b]; if a value unit’s net input is equal to m then the unit<br />

generates a maximum activity <strong>of</strong> 1.00.) All connection weights were set to values randomly<br />

selected from the range between –0.1 and 0.1. The network was trained with<br />

a learning rate <strong>of</strong> 0.01 until it produced a “hit” for every output unit on every pattern.<br />

Because <strong>of</strong> the continuous nature <strong>of</strong> the activation function, a hit was defined as follows:<br />

a value <strong>of</strong> 0.9 or higher when the desired output was 1, and a value <strong>of</strong> 0.1 or lower<br />

when the desired output was 0. The network that is interpreted below learned the<br />

chord classification task after 299 presentations <strong>of</strong> the training set.<br />

What is the role <strong>of</strong> a layer <strong>of</strong> hidden units? In a perceptron, which has no<br />

hidden units, input patterns can only be represented in a pattern space. Recall<br />

from the discussion <strong>of</strong> Figure 4-2 that a pattern space represents each pattern as<br />

a point in space. The dimensionality <strong>of</strong> this space is equal to the number <strong>of</strong> input<br />

units. The coordinates <strong>of</strong> each pattern’s point in this space are given by the activities<br />

<strong>of</strong> the input units. For some networks, the positioning <strong>of</strong> the points in the<br />

pattern space prevents some patterns from being correctly classified, because the<br />

output units are unable to adequately carve the pattern space into the appropriate<br />

decision regions.<br />

In a multilayer perceptron, the hidden units serve to solve this problem. They<br />

do so by transforming the pattern space into a hidden unit space (Dawson, 2004).<br />

The dimensionality <strong>of</strong> a hidden unit space is equal to the number <strong>of</strong> hidden units<br />

Elements <strong>of</strong> Connectionist <strong>Cognitive</strong> <strong>Science</strong> 165

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!