{"id":19382,"date":"2021-01-28T23:54:05","date_gmt":"2021-01-29T04:54:05","guid":{"rendered":"https:\/\/bangla.salearningschool.com\/recent-posts\/resources-reinforcement-learning-and-deep-reinforcement-learning\/"},"modified":"2021-01-28T23:54:05","modified_gmt":"2021-01-29T04:54:05","slug":"resources-reinforcement-learning-and-deep-reinforcement-learning","status":"publish","type":"post","link":"http:\/\/bangla.sitestree.com\/?p=19382","title":{"rendered":"Resources: Reinforcement Learning and Deep Reinforcement Learning"},"content":{"rendered":"<p dir=\"ltr\">Platform:<\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/gym.openai.com\/\">https:\/\/gym.openai.com\/<\/a><\/p>\n<p dir=\"ltr\">Code Examples:<\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/towardsdatascience.com\/using-deep-q-learning-in-fifa-18-to-perfect-the-art-of-free-kicks-f2e4e979ee66?gi=b96ce845729c\">https:\/\/towardsdatascience.com\/using-deep-q-learning-in-fifa-18-to-perfect-the-art-of-free-kicks-f2e4e979ee66?gi=b96ce845729c<\/a><\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/becominghuman.ai\/reinforcement-learning-with-fifa-and-keras-85ec792e25b2\">https:\/\/becominghuman.ai\/reinforcement-learning-with-fifa-and-keras-85ec792e25b2<\/a><\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/towardsdatascience.com\/reinforcement-learning-demystified-solving-mdps-with-dynamic-programming-b52c8093c919\">https:\/\/towardsdatascience.com\/reinforcement-learning-demystified-solving-mdps-with-dynamic-programming-b52c8093c919<\/a><\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/github.com\/openai\/gym\/blob\/master\/gym\/envs\/toy_text\/nchain.py\">https:\/\/github.com\/openai\/gym\/blob\/master\/gym\/envs\/toy_text\/nchain.py<\/a><\/p>\n<p dir=\"ltr\">Theory<\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/towardsdatascience.com\/introduction-to-various-reinforcement-learning-algorithms-i-q-learning-sarsa-dqn-ddpg-72a5e0cb6287\">https:\/\/towardsdatascience.com\/introduction-to-various-reinforcement-learning-algorithms-i-q-learning-sarsa-dqn-ddpg-72a5e0cb6287<\/a><\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/cecas.clemson.edu\/ayalew\/Papers\/Vehicle%20Systems%20Dynamics%20and%20Control\/Papers\/A%20Saturation%20Balancing%20Control%20Method%20for%20Enhancing%20Dynamic%20Vehicle%20Stability\/IJVD%2061_1-4_Paper%203.pdf\">https:\/\/cecas.clemson.edu\/ayalew\/Papers\/Vehicle%20Systems%20Dynamics%20and%20Control\/Papers\/A%20Saturation%20Balancing%20Control%20Method%20for%20Enhancing%20Dynamic%20Vehicle%20Stability\/IJVD%2061_1-4_Paper%203.pdf<\/a><\/p>\n<p dir=\"ltr\"><a href=\"https:\/\/arxiv.org\/pdf\/1712.01815.pdf\">https:\/\/arxiv.org\/pdf\/1712.01815.pdf<\/a><\/p>\n<p>*** . *** *** . *** . *** . ***<\/p>\n<p><em><strong><em><strong>Courses: <\/strong><a href=\"http:\/\/training.sitestree.com\/\">http:\/\/Training.SitesTree.com<\/a> (Big Data, Cloud, Security, Machine Learning)<\/em><br \/>\nBlog<\/strong>: <a href=\"http:\/\/bangla.salearningschool.com\/\">http:\/\/Bangla.SaLearningSchool.com<\/a>, <a href=\"http:\/\/sitestree.com\">http:\/\/SitesTree.com<\/a><\/em><br \/>\n<em><strong>8112223 Canada Inc.\/JustEtc<\/strong>: <a href=\"http:\/\/JustEtc.net\">http:\/\/JustEtc.net<\/a><\/em><\/p>\n<p><em><strong>Shop Online: <\/strong><\/em><a href=\"http:\/\/www.shopforsoul.com\/\">https:\/\/www.ShopForSoul.com\/<\/a><br \/>\n<em><strong>Linkedin<\/strong>: <a href=\"https:\/\/ca.linkedin.com\/in\/sayedjustetc\">https:\/\/ca.linkedin.com\/in\/sayedjustetc<\/a><\/em><\/p>\n<p><strong>Medium<\/strong>: <a href=\"https:\/\/medium.com\/@SayedAhmedCanada\">https:\/\/medium.com\/@SayedAhmedCanada<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Platform: https:\/\/gym.openai.com\/ Code Examples: https:\/\/towardsdatascience.com\/using-deep-q-learning-in-fifa-18-to-perfect-the-art-of-free-kicks-f2e4e979ee66?gi=b96ce845729c https:\/\/becominghuman.ai\/reinforcement-learning-with-fifa-and-keras-85ec792e25b2 https:\/\/towardsdatascience.com\/reinforcement-learning-demystified-solving-mdps-with-dynamic-programming-b52c8093c919 https:\/\/github.com\/openai\/gym\/blob\/master\/gym\/envs\/toy_text\/nchain.py Theory https:\/\/towardsdatascience.com\/introduction-to-various-reinforcement-learning-algorithms-i-q-learning-sarsa-dqn-ddpg-72a5e0cb6287 https:\/\/cecas.clemson.edu\/ayalew\/Papers\/Vehicle%20Systems%20Dynamics%20and%20Control\/Papers\/A%20Saturation%20Balancing%20Control%20Method%20for%20Enhancing%20Dynamic%20Vehicle%20Stability\/IJVD%2061_1-4_Paper%203.pdf https:\/\/arxiv.org\/pdf\/1712.01815.pdf *** . *** *** . *** . *** . *** Courses: http:\/\/Training.SitesTree.com (Big Data, Cloud, Security, Machine Learning) Blog: http:\/\/Bangla.SaLearningSchool.com, http:\/\/SitesTree.com 8112223 Canada Inc.\/JustEtc: http:\/\/JustEtc.net Shop Online: https:\/\/www.ShopForSoul.com\/ Linkedin: https:\/\/ca.linkedin.com\/in\/sayedjustetc Medium: https:\/\/medium.com\/@SayedAhmedCanada<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[182],"tags":[],"class_list":["post-19382","post","type-post","status-publish","format-standard","hentry","category---blog","item-wrap"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":19381,"url":"http:\/\/bangla.sitestree.com\/?p=19381","url_meta":{"origin":19382,"position":0},"title":"Reinforcement Learning and Deep Learning","author":"Sayed","date":"January 28, 2021","format":false,"excerpt":"An interactive notebook training Keras to play Catch https:\/\/github.com\/JannesKlaas\/sometimes_deep_sometimes_learning\/blob\/master\/reinforcement.ipynb Key Papers in Deep RL https:\/\/spinningup.openai.com\/en\/latest\/spinningup\/keypapers.html#key-papers-in-deep-rl DEEP REINFORCEMENT LEARNING https:\/\/arxiv.org\/pdf\/1810.06339v1.pdf Tesla: Deep Learning https:\/\/quantdare.com\/deep-reinforcement-trading\/ Playing Atari with Deep Reinforcement Learning https:\/\/arxiv.org\/pdf\/1312.5602.pdf AlphaGo is the first computer program to defeat a professional human Go player, the first to defeat a Go world\u2026","rel":"","context":"In &quot;\u09ac\u09cd\u09b2\u0997 \u0964 Blog&quot;","block_context":{"text":"\u09ac\u09cd\u09b2\u0997 \u0964 Blog","link":"http:\/\/bangla.sitestree.com\/?cat=182"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":14773,"url":"http:\/\/bangla.sitestree.com\/?p=14773","url_meta":{"origin":19382,"position":1},"title":"Reinforcement Learning Concepts Explained in a Simple Way.","author":"Sayed","date":"May 17, 2019","format":false,"excerpt":"Reinforcement Learning Concepts Explained in a Simple (or not) Way. This is intended for the beginners who want to know the concepts used in Reinforcement Learning i.e. Interactive Learning. Reinforcement Learning is also one aspect of Machine Learning, Data Science, and AI Summary of Tabular Methods in Reinforcement Learning Comparison\u2026","rel":"","context":"In &quot;AI ML DS RL DL NN NLP Data Mining Optimization&quot;","block_context":{"text":"AI ML DS RL DL NN NLP Data Mining Optimization","link":"http:\/\/bangla.sitestree.com\/?cat=1910"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":26802,"url":"http:\/\/bangla.sitestree.com\/?p=26802","url_meta":{"origin":19382,"position":2},"title":"Bandits and #Reinforcement Learning Course","author":"Sayed","date":"May 2, 2021","format":false,"excerpt":"Best Multi-Armed Bandit Strategy? (feat: UCB Method) https:\/\/www.youtube.com\/watch?v=FgmMK6RPU1c Reinforcement Learning: Complete Course: https:\/\/www.youtube.com\/watch?v=4SLGEq_HZxk&list=PLnn6VZp3hqNvRrdnMOVtgV64F_O-61C1D From the Book by Sutton: http:\/\/incompleteideas.net\/book\/RLbook2018.pdf https:\/\/web.stanford.edu\/class\/psych209\/Readings\/SuttonBartoIPRLBook2ndEd.pdf Just did a google search, and I see that the algorithms from the book are implemented and provided at: https:\/\/github.com\/LyWangPX\/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions Article: https:\/\/towardsdatascience.com\/a-comparison-of-bandit-algorithms-24b4adfcabb https:\/\/towardsdatascience.com\/a-comparison-of-bandit-algorithms-24b4adfcabb","rel":"","context":"In &quot;\u09ac\u09cd\u09b2\u0997 \u0964 Blog&quot;","block_context":{"text":"\u09ac\u09cd\u09b2\u0997 \u0964 Blog","link":"http:\/\/bangla.sitestree.com\/?cat=182"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":14733,"url":"http:\/\/bangla.sitestree.com\/?p=14733","url_meta":{"origin":19382,"position":3},"title":"On Reinforcement Learning:","author":"Sayed","date":"April 17, 2019","format":false,"excerpt":"On Reinforcement Learning: Questions and Answers https:\/\/www.inf.ed.ac.uk\/teaching\/courses\/rl\/tutorials.html Monte Carlo: https:\/\/medium.com\/@zsalloum\/monte-carlo-in-reinforcement-learning-the-easy-way-564c53010511 TD in Reinforcement Learning, the Easy Way: Temporal Difference https:\/\/towardsdatascience.com\/td-in-reinforcement-learning-the-easy-way-f92ecfa9f3ce Implementations of TD Algorithms: https:\/\/github.com\/dennybritz\/reinforcement-learning\/tree\/master\/TD Learning and Planning: https:\/\/courses.cs.washington.edu\/courses\/csep573\/12au\/lectures\/18-rl.pdf Sayed Ahmed sayedum Linkedin: https:\/\/ca.linkedin.com\/in\/sayedjustetc Blog: http:\/\/sitestree.com, http:\/\/bangla.salearningschool.com","rel":"","context":"In &quot;\u09ac\u09cd\u09b2\u0997 \u0964 Blog&quot;","block_context":{"text":"\u09ac\u09cd\u09b2\u0997 \u0964 Blog","link":"http:\/\/bangla.sitestree.com\/?cat=182"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":14750,"url":"http:\/\/bangla.sitestree.com\/?p=14750","url_meta":{"origin":19382,"position":4},"title":"Reinforcement Learning: Tutorials: Code: Questions and Answers","author":"Sayed","date":"May 3, 2019","format":false,"excerpt":"Reinforcement Learning: Tutorials: Code: Questions and Answers Must Check: Questions and Answers: https:\/\/www.inf.ed.ac.uk\/teaching\/courses\/rl\/tutorials.html Check if you can find an equation here: Check the grid example https:\/\/medium.com\/@zsalloum\/monte-carlo-in-reinforcement-learning-the-easy-way-564c53010511 TD: Temporal Difference https:\/\/towardsdatascience.com\/td-in-reinforcement-learning-the-easy-way-f92ecfa9f3ce Not that good: https:\/\/courses.cs.washington.edu\/courses\/csep573\/12au\/lectures\/18-rl.pdf http:\/\/incompleteideas.net\/609%20dropbox\/slides%20(pdf%20and%20keynote)\/9-10-MC.pdf","rel":"","context":"In &quot;\u09ac\u09cd\u09b2\u0997 \u0964 Blog&quot;","block_context":{"text":"\u09ac\u09cd\u09b2\u0997 \u0964 Blog","link":"http:\/\/bangla.sitestree.com\/?cat=182"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":24925,"url":"http:\/\/bangla.sitestree.com\/?p=24925","url_meta":{"origin":19382,"position":5},"title":"On Reinforcement Learning: #Root","author":"Author-Check- Article-or-Video","date":"April 13, 2021","format":false,"excerpt":"On Reinforcement Learning: Questions and Answers https:\/\/www.inf.ed.ac.uk\/teaching\/courses\/rl\/tutorials.html Monte Carlo: https:\/\/medium.com\/@zsalloum\/monte-carlo-in-reinforcement-learning-the-easy-way-564c53010511 TD in Reinforcement Learning, the Easy Way: Temporal Difference https:\/\/towardsdatascience.com\/td-in-reinforcement-learning-the-easy-way-f92ecfa9f3ce Implementations of TD Algorithms: https:\/\/github.com\/dennybritz\/reinforcement-learning\/tree\/master\/TD Learning and Planning: https:\/\/courses.cs.washington.edu\/courses\/csep573\/12au\/lectures\/18-rl.pdf Sayed Ahmed sayedum Linkedin: https:\/\/ca.linkedin.com\/in\/sayedjustetc Blog: http:\/\/sitestree.com, http:\/\/bangla.salearningschool.com From: http:\/\/sitestree.com\/on-reinforcement-learning\/ Categories:RootTags: Post Data:2019-04-17 12:45:47 Shop Online: https:\/\/www.ShopForSoul.com\/ (Big Data, Cloud, Security,\u2026","rel":"","context":"In &quot;FromSitesTree.com&quot;","block_context":{"text":"FromSitesTree.com","link":"http:\/\/bangla.sitestree.com\/?cat=1917"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/19382","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=19382"}],"version-history":[{"count":0,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=\/wp\/v2\/posts\/19382\/revisions"}],"wp:attachment":[{"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=19382"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=19382"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/bangla.sitestree.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=19382"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}