06.09.2021 Views

Mind, Body, World- Foundations of Cognitive Science, 2013a

Mind, Body, World- Foundations of Cognitive Science, 2013a

Mind, Body, World- Foundations of Cognitive Science, 2013a

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

was a low-level interpretation that was guided by knowledge, and they argued that<br />

the more knowledge the better.<br />

Barrow and Tenenbaum’s (1975) review described a New Look within computer<br />

vision:<br />

Higher levels <strong>of</strong> perception could involve partitioning the picture into ‘meaningful’<br />

regions, based on models <strong>of</strong> particular objects, classes <strong>of</strong> objects, likely<br />

events in the world, likely configurations, and even on nonvisual events. Vision<br />

might be viewed as a vast, multi-level optimization problem, involving a<br />

search for the best interpretation simultaneously over all levels <strong>of</strong> knowledge.<br />

(Barrow & Tenenbaum, 1975, p. 2)<br />

However, around the same time a very different data-driven alternative to computer<br />

vision emerged (Waltz, 1975).<br />

Waltz’s (1975) computer vision system was designed to assign labels to regions<br />

and line segments in a scene produced by drawing lines and shadows. “These labels<br />

describe the edge geometry, the connection or lack <strong>of</strong> connection between adjacent<br />

regions, the orientation <strong>of</strong> each region in three dimensions, and the nature <strong>of</strong> the<br />

illumination for each region” (p. 21). The goal <strong>of</strong> the program was to assign one and<br />

only one label to each part <strong>of</strong> a scene that could be labelled, except in cases where a<br />

human observer would find ambiguity.<br />

Waltz (1975) found that extensive, general knowledge <strong>of</strong> the world was not<br />

required to assign labels. Instead, all that was required was a propagation <strong>of</strong> local<br />

constraints between neighbouring labels. That is, if two to-be-labelled segments<br />

were connected by a line, then the segments had to be assigned consistent labels.<br />

Two ends <strong>of</strong> a line segment could not be labelled in such a way that one end <strong>of</strong> the<br />

line would be given one interpretation and the other end a different interpretation<br />

that was incompatible with the first. Waltz found that this approach was very<br />

powerful and could be easily applied to novel scenes, because it did not depend on<br />

specialized, scene-specific knowledge. Instead, all that was required was a method<br />

to determine what labels were possible for any scene location, followed by a method<br />

for comparisons between possible labels, in order to choose unique and compatible<br />

labels for neighbouring locations.<br />

The use <strong>of</strong> constraints to filter out incompatible labels is called relaxation<br />

labelling (Rosenfeld, Hummel, & Zucker, 1976); as constraints propagate through<br />

neighbouring locations in a representation, the representation moves into a stable,<br />

lower-energy state by removing unnecessary labels. The discussion <strong>of</strong> solving<br />

Sudoku problems in Chapter 7 illustrates an application <strong>of</strong> relaxation labelling.<br />

Relaxation labelling proved to be a viable data-driven approach to dealing with<br />

visual underdetermination.<br />

Towards a <strong>Cognitive</strong> Dialectic 413

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!