Skip to content

tombewley/one-hour-rl

Repository files navigation

One-hour Reinforcement Learning

Modules:

  • 1 | Markov Decision Processes: A Model of Sequential Decision Making
  • 2 | Policy Evaluation: The Temporal Difference Method
  • 3 | Policy Improvement: The Q-learning Algorithm
  • 4 | Going Deep
  • 5 | What Did We Miss Out?

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published