Chapter 1 : Introduction
Chapter 2 : Markov Decision Process
Chapter 3 : Bellman Equation
Chapter 4 : Dynamic Programming
CHAPTER 5: Monte-Carlo Methods
CHAPTER 6 : Temporal Difference Methods
CHAPTER 7 : Off-Policy Control
CHAPER 8: Value Function Approximatiobn
CHAPTER 9 : DQN(Deep Q-Networks)
CHAPTER 10 : Policy Gradient