01.11.2014 Views

MACHINE LEARNING TECHNIQUES - LASA

MACHINE LEARNING TECHNIQUES - LASA

MACHINE LEARNING TECHNIQUES - LASA

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

176<br />

reinforcement learning we are very much concerned with cases in which optimal solutions cannot<br />

be found but must be approximated in some way.<br />

Policy<br />

Rewar<br />

d<br />

Value<br />

Figure 7-7: Reinforcement learning is characterized by:<br />

❐ Policy: what to do<br />

❐ Reward: what is good<br />

❐ Value function: what is good because it predicts reward<br />

❐ Model: what follows what<br />

Model<br />

environmen of<br />

t<br />

© A.G.Billard 2004 – Last Update March 2011

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!