MSc Machine Learning / Course details

Year of entry: 2025

View tabs
View full page

Course unit details:
Reinforcement Learning

Course unit fact file
Unit code	COMP64202
Credit rating	15
Unit level	FHEQ level 7 – master's degree or fourth year of an integrated master's degree
Teaching period(s)	Semester 2
Available as a free choice unit?	Yes

Overview

Reinforcement Learning (COMP64202) is an advanced artificial intelligence course that explores how machines can learn to make sequential decisions through trial and error interaction with their environment, similar to how humans learn. The course covers both theoretical foundations and cutting-edge applications in areas such as game playing, robotics, and resource management.

Aims

The unit aims to provide an explanation of the key ideas and algorithms of reinforcement learning (RL). This course is meant to provide both an introductory and advanced treatment of reinforcement learning, emphasizing foundations and ideas as well as the latest developments and mathematical proofs. This course aims to make the work accessible to the broadest possible audience in artificial intelligence, control engineering, operations research, psychology and neuroscience.

Learning outcomes

Knowledge and understanding:

1. List the strengths and limitations of modern deep RL approaches.

2. Explain the underlying concepts of RL methods and how they differ from each other.

3. Derive the objectives and constraints of selected algorithms.

Intellectual skills: 4. Analyze new tasks to decide which algorithms/architectures to apply. 5. Compare and evaluate different RL approaches for specific problem contexts.

Practical skills: 6. Implement selected RL algorithms and architectures. 7. Apply RL methods to solve practical problems.

Transferable skills and personal qualities: 8. Break down complex problems into manageable components. 9. Evaluate technical approaches against practical constraints. 10. Communicate technical concepts and results effectively.

Syllabus

The course covers the following topics:

Fundamentals of Markov Decision Processes (MDPs)
Dynamic Programming methods
Monte Carlo methods
Temporal-Difference learning
Function approximation and Deep Reinforcement Learning
Policy gradient methods
Actor-critic algorithms
Model-based Reinforcement Learning
Partial observability and multi-agent systems
Real-world applications and case studies

Teaching and learning methods

The learning and teaching in this course combines three complementary components. Lectures provide the theoretical foundations and key concepts of reinforcement learning. Tutorial sessions focus on solving theoretical exercises to deepen understanding and develop problem-solving skills. Computer lab sessions offer hands-on experience implementing and experimenting with reinforcement learning algorithms, allowing students to see the practical challenges and opportunities of these methods.

Employability skills

Analytical skills
Group/team working
Innovation/creativity
Project management
Problem solving
Research
Written communication

Assessment methods

Method	Weight
Written exam	80%
Practical skills assessment	20%

Feedback methods

Feedback is provided through:

Cohort and individual feedback for the practical assignment when marks are returned
Discussion during tutorial sessions
Peer feedback during group activities
Office hours with teaching staff

Study hours

Scheduled activity hours
Lectures	20
Practical classes & workshops	25
Tutorials	5

Independent study hours
Independent study	100

Teaching staff

Staff member	Role
Wei Pan	Unit coordinator
Mingfei Sun	Unit coordinator

Additional notes

This course unit is at Level 6 with 15 credits (7.5 ECTS). The unit is taught by Dr. Wei Pan from the School of Engineering.

Return to course details