01.05.2015 Views

Q-Learning

Q-Learning

Q-Learning

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Q-<strong>Learning</strong> Basics<br />

● At each step s, choose the action a which<br />

maximizes the function Q(s, a)<br />

– Q is the estimated utility function – it tells us how<br />

good an action is given a certain state<br />

● Q(s, a) = immediate reward for making an action +<br />

best utility (Q) for the resulting state<br />

● Note: recursive definition<br />

● More formally ...

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!