Skip to content

MSathishkumar1990/Reinforcement_Learning

Repository files navigation

Reinforcement_Learning

Lecture Module: Department of Computer Science, Reinforcement Learning, University College London Supervisor: Prof. Hado Van Hasselt, Matteo Hessel, and Diana Borsa

All data was provided by UCL

  1. Agent Implementation: Random, UCB, Epsilon-greedy and REINFORCE Agent. Implemented four different kinds of experiment and analysed them.

  2. Learning Algorithms for Sequential Decision Problems: Tabular Reinforcement Learning, TD Learning, Policy Iteration, Q-learning agents [General Q-learning, Sarsa, Expected Sarsa, Double Q-learning], and analysed each result.

  3. Analysed Q-learning, Double Q-learning, and Target Q-learning

  4. Off-policy Bellman Operators with Fuction Approximation and Analysed Each Results

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published