Reinforcement Learning
Reinforcement Learning
2021
- [RL] 12. Monte Carlo Method
- [RL] 11. Dynamic Programming
- [RL] 10. Bellman Equation
- [RL] 9. Markov Decision Process & "p" function
- [RL] 8. Upper Confidence Bound(UCB)
- [RL] 7. Optimistic Initial Value
- [RL] 6. Epsilon Greedy
- [RL] 5. Exploitation, Exploration
- [RL] 4. Task 종류(Episodic, Continuous)
- [RL] 3. Policy
- [RL] 2. Value Function
- [RL] 1. 강화학습