Machine Learning School

TLA details

Name
Lecture content
Description
Content: – Reinforcement learning; – Motivational examples; – MDPs: the elements of an MDP, the Markov condition; – Policies; – Long-term rewards; – The goal of RL; – The types of RL: – Value-based; – Policy-based; – Actor-critic; – Value functions; – Recursiveness, Bellman equations; – Exploration vs. exploitation; – Greedy, ε-greedy, softmax; – Tabular methods: – Dynamic programming; – Monte Carlo learning; – Temporal difference learning; – SARSA and Q-learning: the difference between on-policy and off-policy methods; – Experience replay;
Learning type
Acquisition
Description Learning through acquisition is what learners are doing when they are listening to a lecture or podcast, reading from books or websites, and watching demos or videos.
Example usage Reading books, papers, listening to teacher presentations face-to-face, lectures, watching animations, videos,…
Workload
120
Activity delivery
Online
On-site
Hybrid
Synchronous
Asynchronous
Teacher-present
Teacher not present
Collaboration
Work in groups
Feedback

Assessment