Optimally Facing the uncertainty : A brief survey on Reinforcement Learning
Keywords:
multiarmed Bandit problem, Reinforcement Learning, Greedy algorithmAbstract
Reinforcement Learning is a combination of supervised learning and unsupervised learning, the two main streams of Machine Learning .It has many applications in Artificial Intelligence arena. Multi Armed Bandits problem, a classical Reinforcement Learning task employs exploration and exploitation tradeoff. Efficient Bandit Algorithms for solving Bandit problem proides solutions for various problems from Dynamic pricing to online multi class prediction. This research article analyses the elements of Reinforcement learning, mathematical formulation of multi armed Bandits problem and attempts to present a naive RL algorithm for N-Queens problem for an instance of N=4 and concludes with applications of Reinforcement Learning
References
Alpaydm, Introduction to Machine Learning, MIT Press, 2010.
Dai Shi, Exploring Bandit algorithms for Automatic content selection, Barcelona, 2014.
Ellis Horrowitz, Fundamentals of Computer Algorithms, Galgottia Publictions,1996
Kakade et al, Efficient Bandit Algorithms for online multiclass prediction, Conference Proceedings, University of Pennysylvnia,2008.
Nahum Shimkin, Reinforcement Learning –Basic algorithms, Learning in Complex Systems Lecture notes, Spring 2011.
Prathamesh, Multi armed bandit approach for Dynamic pricing, M.Tech Dissertation, Iit, Mumbai, 2015
Sutton and Barto, Reinforcement Learning – an introduction, The MIT Press, England, 2017.
Szepesvari, Algorithms for Reinforcement Learning, Synthesis lectures on Artificial Intelligence and machine learning, 2009.
http://cs.mcgill.ca/~dprecup/courses /AI/Lectures/ai-lecture13.pdf
www.medium.com/machinelearning for humans .html
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
