Optimally Facing the uncertainty : A brief survey on Reinforcement Learning

Authors

Rajeswari RR Department of Computer Science, Mother Teresa Women’s University, Kodaikanal
Pethalakshmi A Department of Computer Science, Mother Teresa Women’s University, Kodaikanal

Keywords:

multiarmed Bandit problem, Reinforcement Learning, Greedy algorithm

Abstract

Reinforcement Learning is a combination of supervised learning and unsupervised learning, the two main streams of Machine Learning .It has many applications in Artificial Intelligence arena. Multi Armed Bandits problem, a classical Reinforcement Learning task employs exploration and exploitation tradeoff. Efficient Bandit Algorithms for solving Bandit problem proides solutions for various problems from Dynamic pricing to online multi class prediction. This research article analyses the elements of Reinforcement learning, mathematical formulation of multi armed Bandits problem and attempts to present a naive RL algorithm for N-Queens problem for an instance of N=4 and concludes with applications of Reinforcement Learning

References

Alpaydm, Introduction to Machine Learning, MIT Press, 2010.

Dai Shi, Exploring Bandit algorithms for Automatic content selection, Barcelona, 2014.

Ellis Horrowitz, Fundamentals of Computer Algorithms, Galgottia Publictions,1996

Kakade et al, Efficient Bandit Algorithms for online multiclass prediction, Conference Proceedings, University of Pennysylvnia,2008.

Nahum Shimkin, Reinforcement Learning –Basic algorithms, Learning in Complex Systems Lecture notes, Spring 2011.

Prathamesh, Multi armed bandit approach for Dynamic pricing, M.Tech Dissertation, Iit, Mumbai, 2015

Sutton and Barto, Reinforcement Learning – an introduction, The MIT Press, England, 2017.

Szepesvari, Algorithms for Reinforcement Learning, Synthesis lectures on Artificial Intelligence and machine learning, 2009.

http://cs.mcgill.ca/~dprecup/courses /AI/Lectures/ai-lecture13.pdf

www.medium.com/machinelearning for humans .html

Downloads

PDF ⁰

Published

2025-11-13

How to Cite

[1]

R. R. Rajeswari and A. Pethalakshmi, “Optimally Facing the uncertainty : A brief survey on Reinforcement Learning”, Int. J. Comp. Sci. Eng., vol. 6, no. 4, pp. 45–48, Nov. 2025.

Download Citation

Issue

Vol. 6 No. 4 (2018): IJCSE Special Issue May Edition

Section

Survey Article

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.

Optimally Facing the uncertainty : A brief survey on Reinforcement Learning

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

UGC Gazette Regulation

Join Editorial Board

Information

Current Issue

Keywords