- UCAS course code
- G104
- UCAS institution code
- M20
Master of Mathematics (MMath)
MMath Mathematics
- Typical A-level offer: A*AA including specific subjects
- Typical contextual A-level offer: A*AB including specific subjects
- Refugee/care-experienced offer: A*BB including specific subjects
- Typical International Baccalaureate offer: 37 points overall with 7,6,6 at HL, including specific requirements
Course unit details:
Statistical Machine Learning
Unit code | MATH48292 |
---|---|
Credit rating | 15 |
Unit level | Level 4 |
Teaching period(s) | Semester 2 |
Available as a free choice unit? | No |
Overview
An introduction to machine learning models and algorithms, both for unsupervised and supervised learning, with a focus on the underlying mathematical and statistical principles, while including some aspects of implementation and practice. Studying the relation of machine learning approaches with traditional statistical methods.
Pre/co-requisites
Unit title | Unit code | Requirement type | Description |
---|---|---|---|
Multivariate Statistics and Machine Learning | MATH38161 | Pre-Requisite | Compulsory |
Multivariate Statistics | MATH48061 | Pre-Requisite | Compulsory |
Probability and Statistics 2 | MATH27720 | Pre-Requisite | Compulsory |
Linear Regression Models | MATH27711 | Pre-Requisite | Compulsory |
One of either MATH38161 Multivariate Statistics and Machine Learning or MATH48061 Multivariate Statistics required as a pre-requisite.
Aims
The unit aims to:
Provide an introduction to machine learning models and algorithms, both for unsupervised and supervised learning, with a focus on the underlying mathematical and statistical principles, while including some aspects of implementation and practice. A further aim is to study the relation of machine learning approaches with traditional statistical methods.
Learning outcomes
On the successful completion of the course, students will be able to:
- Explain the principles underlying both probabilistic and algorithmic machine learning techniques
- Select and identify appropriate methodology for data analysis at the interface of statistics and machine learning
- Apply modern machine learning methods to supervised and unsupervised learning problems
- Use R and/or Python to analyse data with the methods discussed in the course
Syllabus
Part A - Nonlinear (non-neural network) machine learning approaches
- Limitations of traditional linear statistical models
- Supervised Machine Learning:
- K-nearest neighbours (including Bayesian KNN)
- Decision trees and random forests
- Unsupervised Machine Learning:
- Nonparametric clustering (e.g. Dirichlet process mixture models)
- Topic models (e.g. Latent Dirichlet Allocation)
- Dimensionality reduction including manifold learning (SNE, t-SNE)
Part B - Nonlinear (neural network) machine learning approaches
- Single-layer neural networks and deep neural networks
- Optimising neural networks (gradient descent, backpropagation)
- Statistical properties (overparametrisation, regularisation, universal function approximation)
- Standard models (e.g. multi-layer perceptron, convolutional networks, auto-encoders) and diverse data types (e.g. image data, time series, text data)
Teaching and learning methods
Teaching is composed of two hours of lectures per week and one tutorial class per week. Teaching materials will be made available online for reference and review. One week is reserved for coursework.
Assessment methods
Method | Weight |
---|---|
Other | 30% |
Written exam | 70% |
Other = coursework data analysis project, with a 30% weighting
Feedback methods
Generic feedback will be provided after marks are released.
Recommended reading
C. M. Bishop and H. Bishop. 2024. Deep learning: Foundations and concepts. Springer.
https://link.springer.com/book/10.1007/978-3-031-45468-4
G. James et al. 2021. An introduction to statistical learning with applications in R (2nd edition). Springer.
https://link.springer.com/book/10.1007/978-1-0716-1418-1
G. James et al. 2023. An introduction to statistical learning with applications in Python. Springer.
https://link.springer.com/book/10.1007/978-3-031-38747-0
https://www.statlearning.com/
K. P. Murphy. 2022. Probabilistic machine learning: An introduction. MIT Press.
https://mitpress.mit.edu/9780262046824/probabilistic-machine-learning/
https://probml.github.io/pml-book/book1.html
A. Zhang et al. 2024. Dive into deep learning. Cambridge University Press.
https://d2l.ai/
Study hours
Scheduled activity hours | |
---|---|
Lectures | 22 |
Practical classes & workshops | 12 |
Independent study hours | |
---|---|
Independent study | 116 |
Teaching staff
Staff member | Role |
---|---|
Rendani Mbuvha | Unit coordinator |
Thomas House | Unit coordinator |