06.09.2021 Views

Mind, Body, World- Foundations of Cognitive Science, 2013a

Mind, Body, World- Foundations of Cognitive Science, 2013a

Mind, Body, World- Foundations of Cognitive Science, 2013a

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

The circle <strong>of</strong> fifths includes all 12 notes in a chromatic scale, but arranges them<br />

so that adjacent notes in the circle are a musical interval <strong>of</strong> a perfect fifth (i.e., 7<br />

semitones) apart. The circle <strong>of</strong> fifths is a standard topic for music students, and it<br />

is foundational to many concepts in music theory. It is provided here, though, to be<br />

contrasted later with “strange circles” that are revealed in the internal structure <strong>of</strong> a<br />

network trained to identify musical chords.<br />

Any one <strong>of</strong> the notes in the circle <strong>of</strong> fifths can be used to define a musical key<br />

and therefore can serve as the root note <strong>of</strong> a major scale. Similarly, any one <strong>of</strong> these<br />

notes can be the root <strong>of</strong> a major triad created using the pattern <strong>of</strong> root + 4 semitones<br />

+ 3 semitones that was described earlier for the key <strong>of</strong> C major (Baker, 1982).<br />

Furthermore, the rules described earlier can also be applied to produce added note<br />

chords for any <strong>of</strong> the 12 major key signatures. These possible major triads and added<br />

note chords were used as inputs for training a network to correctly classify different<br />

types <strong>of</strong> chords, ignoring musical key.<br />

A training set <strong>of</strong> 48 chords was created by building the major triad, as well as<br />

the major 7 th , dominant 7 th , and 6 th chord for each <strong>of</strong> the 12 possible major key signatures<br />

(i.e., using each <strong>of</strong> the notes in Figure 4-9 as a root). When presented with<br />

a chord, the network was trained to classify it into one <strong>of</strong> the four types <strong>of</strong> interest:<br />

major triad, major 7 th , dominant 7 th , or 6 th . To do so, the network had 4 output units,<br />

one for each type <strong>of</strong> chord. For any input, the network learned to turn the correct<br />

output unit on and to turn the other three output units <strong>of</strong>f.<br />

The input chords were encoded with a pitch class representation (Laden<br />

& Keefe, 1989; Yaremchuk & Dawson, 2008). In a pitch class representation, only<br />

12 input units are employed, one for each <strong>of</strong> the 12 different notes that can appear<br />

in a scale. Different versions <strong>of</strong> the same note (i.e., the same note played at different<br />

octaves) are all mapped onto the same input representation. For instance, notes<br />

1, 13, 25, and 37 in Figure 4-7 all correspond to different pitches but belong to the<br />

same pitch class—they are all C notes, played at different octaves <strong>of</strong> the keyboard. In<br />

a pitch class representation, the playing <strong>of</strong> any <strong>of</strong> these input notes would be encoded<br />

by turning on a single input unit—the one unit used to represent the pitch class <strong>of</strong> C.<br />

A pitch class representation <strong>of</strong> chords was used for two reasons. First, it requires<br />

a very small number <strong>of</strong> input units to represent all <strong>of</strong> the possible stimuli. Second, it<br />

is a fairly abstract representation that makes the chord classification task difficult,<br />

which in turn requires using hidden units in a network faced with this task.<br />

Why chord classification might be difficult for a network when pitch class<br />

encoding is employed becomes evident by thinking about how we might approach<br />

the problem if faced with it ourselves. Classifying the major chords is simple: they<br />

are the only input stimuli that activate three input units instead <strong>of</strong> four. However,<br />

classifying the other chord types is very challenging. One first has to determine what<br />

key the stimulus is in, identify which three notes define its major chord component,<br />

164 Chapter 4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!