Clinical Data Science / Course details

Year of entry: 2025

Course unit details:
Maths, Stats, and Machine Learning

Course unit fact file
Unit code IIDS69021
Credit rating 15
Unit level FHEQ level 7 – master's degree or fourth year of an integrated master's degree
Teaching period(s) Semester 1
Available as a free choice unit? No

Overview

Clinical Data Scientists are required to generate insights from data, create or contribute to data products and interpret/translate clinical research. In order to achieve this they require a working knowledge and experience of data analysis methods including statistical learning (statistics and machine learning methods) supported by knowledge and understanding of the mathematical principles underpinning these methods and in which situations to apply them, including how to select the most appropriate method(s) based on the nature of the problem, data available and clinical/organisational priorities. 

Aims

The unit aims to:

  • Provide the underpinning applied mathematical concepts to common data science and machine learning methods
  • Provide students experience and practice with running analysis scripts (using R and Python) to analyse datasets using statistical and machine learning methods and interpreting output
  •  Expose students to a variety of analytic methods for tackling a range real world clinical problems

Syllabus

This unit will cover the following indicative content:

  • Fundamental applied mathematics for data science
  • Examples of using R and Python for data science
  • Introduction to inferential statistics and hypothesis testing
  • Introduction to machine learning algorithms for classification and regression problems
  • Selecting the appropriate analysis method and evaluating model performance
  • Issues around bias in data and wider ethical and legal issues related to automation of tasks with machine learning and their impact on patient outcomes

Teaching and learning methods

The unit will be delivered online making use of workshops, lectures, self-directed learning material delivered through interactive digital (Jupyter) notebooks and synchronous labs helping students to work through analysis of various data sets.

Knowledge and understanding

LO1: Explain the underpinning mathematical concepts behind commonly used statistical and machine learning methods (e.g. probability theory, linear algebra)

LO2: Critique different approaches to analysis based on the data available and clinical/organisational goals

LO3: Identify the main differences, requirements and caveats in analytical approaches depending on intended goals and properties of the data (e.g. hypothesis testing vs data-driven approaches) 

LO4: State the stages involved in a typical machine learning analysis pipeline

Intellectual skills

LO5: Interpret and annotate model output providing context to convey findings to a variety stakeholders

LO6: Evaluate different statistical and machine learning/NLP approaches and select appropriate methods depending on the properties of the available data and task requirements

Practical skills

LO7: Evaluate the outputs of statistical tests and machine learning models and report these results using standard metrics

Transferable skills and personal qualities

LO8: Work through the problem-solving cycle

LO9: Develop an analytical problem solving mind-set

Assessment methods

Data analysis task 1000 words  100%

 

Feedback methods

Formative assessment and feedback to students is a key feature of the on-line learning materials for this unit and is provided through self-directed learning activities in the interactive notebooks. 

Recommended reading

  • Field, A., Miles, J., Field, Z (2012) Discovering Statistics Using R. Los Angeles: SAGE
  • James, G., Witten, D., Hastie, T., Tibshirani, R (2015) An Introduction to Statistical Learning: With applications in R. New York: Springer
  • Géron, A (2017) Hands-on Machine Learning with Scikit-Learn & TensorFlow. Beijing: O'Reilly
  • Lane, H., Howard, C., Hapke, H (2019) Natural Language Processing in action. Shelter Island: Manning Publications Co.

Study hours

Independent study hours
Independent study 150

Teaching staff

Staff member Role
David Jenkins Unit coordinator
Jon Parkinson Unit coordinator

Return to course details