MSc ACS: Computer Security

Year of entry: 2020

Course unit details:
Data Engineering

Unit code COMP60711
Credit rating 15
Unit level FHEQ level 7 – master's degree or fourth year of an integrated master's degree
Teaching period(s) Semester 1
Offered by Department of Computer Science
Available as a free choice unit? Yes

Overview

The Harvard Business Review in October 2012 described the role of data scientist as 'the sexiest job of the 21st century'. The 'big data' phenomenon has become part of the vernacular, with the digital universe expected to grow by a factor of 44 from 2009-2020 to a trillion Gigabytes [IDC Digital Universe Study, 2010]. This has led to recognition of a data lifecycle and the need for its systematic management, including both technical and societal issues. Particular focus here is on issues such as data standardisation and data quality, and data analytics (description and prediction) across all application domains.

Aims

This module will examine the entire data life cycle, including data creation, modelling, acquisition, representation, use, maintenance, preservation and disposal. As the majority of data is stored in databases, the module will examine various database engineering approaches to support data management, including database design, data warehousing, maintenance and analytics. Data standards and data quality will be examined and the challenge of "big datasets" will be considered.

Learning outcomes

  • Explain and apply the constituent steps of the data life cycle

  • Describe data engineering techniques; be able to apply and document large-scale data engineering for a given task, comprising various multimodal data types.

  • Describe and apply technical, ethical and societal issues related to data engineering, storage, access and maintenance.

  • Explain and apply the main principles of data analytics/ algorithms, and explain their application to various domains.

  • Describe relevant standards and best practice in data engineering, analyse shortcomings and identify possible strategies and approaches to overcome them.

Syllabus

  • An overview of the data life cycle
  • Data engineering, modelling and design techniques
  • Data storage and warehousing
  • Data access and maintenance
  • Data analytics application and algorithms
  • Engineering non-traditional data types
  • Data standards and data quality

Employability skills

Analytical skills
Problem solving
Research
Written communication

Assessment methods

Method Weight
Written exam 50%
Written assignment (inc essay) 50%

Feedback methods

Regular coursework, returned marked with feedback

Recommended reading

COMP60711 reading list can be found on the School of Computer Science website for current students.

Study hours

Independent study hours
Independent study 150

Teaching staff

Staff member Role
John Keane Unit coordinator

Additional notes

Course unit materials

Links to course unit teaching materials can be found on the School of Computer Science website for current students.

Return to course details