.jpg)
Course unit details:
Reinforcement Learning
Unit code | COMP64202 |
---|---|
Credit rating | 15 |
Unit level | FHEQ level 7 – master's degree or fourth year of an integrated master's degree |
Teaching period(s) | Semester 2 |
Available as a free choice unit? | Yes |
Overview
Reinforcement Learning (COMP64202) is an advanced artificial intelligence course that explores how machines can learn to make sequential decisions through trial and error interaction with their environment, similar to how humans learn. The course covers both theoretical foundations and cutting-edge applications in areas such as game playing, robotics, and resource management.
Aims
The unit aims to provide an explanation of the key ideas and algorithms of reinforcement learning (RL). This course is meant to provide both an introductory and advanced treatment of reinforcement learning, emphasizing foundations and ideas as well as the latest developments and mathematical proofs. This course aims to make the work accessible to the broadest possible audience in artificial intelligence, control engineering, operations research, psychology and neuroscience.
Learning outcomes
Knowledge and understanding:
1. List the strengths and limitations of modern deep RL approaches.
2. Explain the underlying concepts of RL methods and how they differ from each other.
3. Derive the objectives and constraints of selected algorithms.
Intellectual skills: 4. Analyze new tasks to decide which algorithms/architectures to apply. 5. Compare and evaluate different RL approaches for specific problem contexts.
Practical skills: 6. Implement selected RL algorithms and architectures. 7. Apply RL methods to solve practical problems.
Transferable skills and personal qualities: 8. Break down complex problems into manageable components. 9. Evaluate technical approaches against practical constraints. 10. Communicate technical concepts and results effectively.
Syllabus
The course covers the following topics:
- Fundamentals of Markov Decision Processes (MDPs)
- Dynamic Programming methods
- Monte Carlo methods
- Temporal-Difference learning
- Function approximation and Deep Reinforcement Learning
- Policy gradient methods
- Actor-critic algorithms
- Model-based Reinforcement Learning
- Partial observability and multi-agent systems
- Real-world applications and case studies
Teaching and learning methods
The learning and teaching in this course combines three complementary components. Lectures provide the theoretical foundations and key concepts of reinforcement learning. Tutorial sessions focus on solving theoretical exercises to deepen understanding and develop problem-solving skills. Computer lab sessions offer hands-on experience implementing and experimenting with reinforcement learning algorithms, allowing students to see the practical challenges and opportunities of these methods.
Employability skills
- Analytical skills
- Group/team working
- Innovation/creativity
- Project management
- Problem solving
- Research
- Written communication
Assessment methods
Method | Weight |
---|---|
Written exam | 80% |
Practical skills assessment | 20% |
Feedback methods
Feedback is provided through:
- Cohort and individual feedback for the practical assignment when marks are returned
- Discussion during tutorial sessions
- Peer feedback during group activities
- Office hours with teaching staff
Recommended reading
An Introduction to Reinforcement Learning, R.S. Sutton and A.G. Barto. MIT Press, 2018.
Algorithms for Reinforcement Learning, C. Szepesvari, Morgan and Claypool, 2010.
Study hours
Scheduled activity hours | |
---|---|
Lectures | 20 |
Practical classes & workshops | 25 |
Tutorials | 5 |
Independent study hours | |
---|---|
Independent study | 100 |
Teaching staff
Staff member | Role |
---|---|
Wei Pan | Unit coordinator |
Mingfei Sun | Unit coordinator |
Additional notes
This course unit is at Level 6 with 15 credits (7.5 ECTS). The unit is taught by Dr. Wei Pan from the School of Engineering.