TLA details

Name

Lecture content

Description

Content: – Reinforcement learning; – Motivational examples; – MDPs: the elements of an MDP, the Markov condition; – Policies; – Long-term rewards; – The goal of RL; – The types of RL: – Value-based; – Policy-based; – Actor-critic; – Value functions; – Recursiveness, Bellman equations; – Exploration vs. exploitation; – Greedy, ε-greedy, softmax; – Tabular methods: – Dynamic programming; – Monte Carlo learning; – Temporal difference learning; – SARSA and Q-learning: the difference between on-policy and off-policy methods; – Experience replay;

Learning type

Acquisition

Description Learning through acquisition is what learners are doing when they are listening to a lecture or podcast, reading from books or websites, and watching demos or videos.

Example usage Reading books, papers, listening to teacher presentations face-to-face, lectures, watching animations, videos,…

Workload

120

Activity delivery

Online

On-site

Hybrid

Synchronous

Asynchronous

Teacher-present

Teacher not present

Collaboration

Work in groups

Feedback

Assessment

Back

Machine Learning School

TLA details