Reinforcement_Learning

Lecture Module: Department of Computer Science, Reinforcement Learning, University College London Supervisor: Prof. Hado Van Hasselt, Matteo Hessel, and Diana Borsa

All data was provided by UCL

Agent Implementation: Random, UCB, Epsilon-greedy and REINFORCE Agent. Implemented four different kinds of experiment and analysed them.
Learning Algorithms for Sequential Decision Problems: Tabular Reinforcement Learning, TD Learning, Policy Iteration, Q-learning agents [General Q-learning, Sarsa, Expected Sarsa, Double Q-learning], and analysed each result.
Analysed Q-learning, Double Q-learning, and Target Q-learning
Off-policy Bellman Operators with Fuction Approximation and Analysed Each Results

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Convergence_of_Q-learning_Mathematically.ipynb		Convergence_of_Q-learning_Mathematically.ipynb
General_Q-learning_agents.ipynb		General_Q-learning_agents.ipynb
Multi-armed_bandit_framework.ipynb		Multi-armed_bandit_framework.ipynb
Off-policy_Bellman_Operators_with_FA.ipynb		Off-policy_Bellman_Operators_with_FA.ipynb
README.md		README.md
TD_learning&Policy_Iteration.ipynb		TD_learning&Policy_Iteration.ipynb
Tabular_RL.ipynb		Tabular_RL.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement_Learning

About

Releases

Packages

Languages

MSathishkumar1990/Reinforcement_Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement_Learning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages