User contributions for 128.148.193.121

5 November 2013

  • 19:29, 5 November 2013 +10,202 N Q-learning: The Q-learning equation was previously *wrong*. There are two ways to represent it, and what was present was an incorrect hybrid of the two. I have corrected it. See http://webdocs.cs.ualberta.ca/~sutton/book/ebook/node65.html if you wish to validate.