We finally kicked off our MLT Reinforcement Learning sessions with an “Intro to RL” study group at the Tokyo Metropolitan Library. Anugraha Sinha walked us through the following:
Theory
- Introduction to RL
- Important elements of an RL problem
- Description of Markov Decision Process (MDP) and and Markov Assumption.
- Importance of parametrization of State, Action, Reward and Environment.
- Model Based and Model Free Methods
- Meaning of Control Problem and Evaluation Problem.
- Algorithm of Policy Evaluation and Value iteration methods
Code examples
- Finding the best route through a maze/obstruction avoidance using policy iteration algorithm.
- Above problem statement with value iterations algorithm.
- Code exercise
You can find the study material (slides and notebooks) for this session on GitHub. Join us for our next sessions on Meetup. We also provide a zoom link for participants to join remotely.