One-hour Reinforcement Learning Modules: 1 | Markov Decision Processes: A Model of Sequential Decision Making 2 | Policy Evaluation: The Temporal Difference Method 3 | Policy Improvement: The Q-learning Algorithm 4 | Going Deep 5 | What Did We Miss Out?