{"id":589,"date":"2025-03-18T17:48:01","date_gmt":"2025-03-18T21:48:01","guid":{"rendered":"https:\/\/www.notexponential.com\/notes\/?page_id=589"},"modified":"2025-03-18T17:49:02","modified_gmt":"2025-03-18T21:49:02","slug":"i-project-4-gridworld-exploration-using-reinforcement-learning","status":"publish","type":"page","link":"https:\/\/www.notexponential.com\/notes\/artificial-intelligence\/projects\/i-project-4-gridworld-exploration-using-reinforcement-learning\/","title":{"rendered":"AI Project 4 &#8211; GridWorld Exploration using Reinforcement Learning"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Objective<\/h2>\n\n\n\n<p>In this project P4, you are essentially exploring a gridworld.&nbsp; (Go North, Go East, Go South, Go West.&nbsp; Go wherever you want.&nbsp; Go crazy.)&nbsp; Just try to maximize your rewards.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">T &amp; R<\/h2>\n\n\n\n<p>Yeah, those would be nice to know, but unfortunately, we don\u2019t know the T&amp;R values.&nbsp; We will have to learn those along the way.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">How many grid worlds?<\/h2>\n\n\n\n<p>There are many environments.&nbsp; Each environment can be explored up to m times.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What\u2019s the score?<\/h2>\n\n\n\n<p>Your score is always being averaged.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">What to Submit<\/h1>\n\n\n\n<p>Nothing.&nbsp; Present in class.<\/p>\n\n\n\n<p>Your score is updated via the API.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">Use of API<\/h1>\n\n\n\n<p>We will play and record the games interactively with each other.&nbsp; Details of the API will be shared via Slack and discussed in class.<\/p>\n\n\n\n<h1 class=\"wp-block-heading\">Grading Rubrik<\/h1>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>5 points:<\/strong>\u00a0 Based on your in class presentation.<\/li>\n\n\n\n<li><strong>5 points<\/strong>: Based on your average score.<\/li>\n<\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Objective In this project P4, you are essentially exploring a gridworld.&nbsp; (Go North, Go East, Go South, Go West.&nbsp; Go wherever you want.&nbsp; Go crazy.)&nbsp; Just try to maximize your rewards. T &amp; R Yeah, those would be nice to know, but unfortunately, we don\u2019t know the T&amp;R values.&nbsp; We will have to learn those [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":464,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-589","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/pages\/589","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/comments?post=589"}],"version-history":[{"count":2,"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/pages\/589\/revisions"}],"predecessor-version":[{"id":592,"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/pages\/589\/revisions\/592"}],"up":[{"embeddable":true,"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/pages\/464"}],"wp:attachment":[{"href":"https:\/\/www.notexponential.com\/notes\/wp-json\/wp\/v2\/media?parent=589"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}