This course is unavailable through clearing

This course is now full for our 2020 entry, but have a look at our clearing vacancies to see if a similar course has space.

Search all clearing vacancies

MEng Software Engineering

Year of entry: 2020

Coronavirus information for applicants and offer-holders

For the latest updates on how coronavirus will affect applicants and offer-holders, you can visit our FAQs.

Read our latest coronavirus information

Holding an offer for 2020 entry? Visit our dedicated offer-holders page.

Information for offer-holders

Course unit details:
Natural Language Systems

Unit code COMP34412
Credit rating 10
Unit level Level 3
Teaching period(s) Semester 2
Offered by Department of Computer Science
Available as a free choice unit? Yes

Overview

The course unit will cover key linguistic and algorithmic foundations of natural language processing. The topics will include lexical processing, word sense disambiguation, information extraction, and speech recognition and speech synthesis. It will consider both rule-based and machine learning methods, and key applications such as document summarisation or sentiment analysis.

Pre/co-requisites

Unit title Unit code Requirement type Description
Documents and Data on the Web COMP38211 Pre-Requisite Compulsory
Students who are not from the School of Computer Science must have permission from both Computer Science and their home School to enrol.

Aims

Enabling computers to use 'natural language' (the kind of language that people use to communicate with one another) is becoming more and more important. It allows both people to communicate with computers, and the computers to access the enormous amount of material that is stored as natural language text on the web or in document repositories. This course unit provides an introduction to the area of natural language processing (NLP) as one of the key areas of artificial intelligence. It aims to introduce essential components of NLP and explain the major challenges in processing large-scale, real-world natural language both in its written and spoken forms.

Learning outcomes

  • Understand and explain the major challenges in processing large-scale, real-world natural language data.
  • Demonstrate how the essential components of NLP systems are built and modified.
  • Understand the opportunities and challenges of knowledge- and data-driven NLP methods.
  • Explain the principal applications of NLP, including information extraction, question-answering, document summarisation, spoken language access to software services,
  • Understand the issues involved in evaluating NLP systems.
 

Syllabus

·         Introduction, motivation, review of NLP principles

·         Essential steps for NLP algorithms

o   Part-of-speech tagging: probabilistic tagging, transformation-based learning

o   Parsing: chunking, shallow parsing, statistical parsing

o   Lexical semantics: lexical resources, word sense disambiguation algorithms

·         Evaluation of natural language systems

o   Metrics

o   Crowdsourcing and inter-annotator agreement

·         Information retrieval and extraction

o   Document matching

o   Named-entity recognition

o   Template-filling, free text question answering systems

o   Summarisation algorithms

·         Natural language generation

o   Surface realisation

o   Discourse planning

·         Machine translation

o   Transfer-based approaches: the MT pyramid, transfer rules

o   Statistical MT, memory-based MT

·         Introduction to spoken language systems

o   The nature of speech

o   Speech synthesis

o   Speech recognition

Teaching and learning methods

Lectures, practicals, surgeries, coursework

Employability skills

Analytical skills
Problem solving

Assessment methods

Method Weight
Written exam 30%
Written assignment (inc essay) 70%

Feedback methods

There is unassessed formative homework given to the students weekly; each lecture then starts with a 10 min feedback session where the students may review the homework from the previous week.  Assessed coursework is due in weeks 6 and 11, with feedback in weeks 7 and 12 respectively.

Recommended reading

COMP34412 reading list can be found on the Department of Computer Science website for current students.

Study hours

Scheduled activity hours
Lectures 22
Independent study hours
Independent study 78

Teaching staff

Staff member Role
Goran Nenadic Unit coordinator

Additional notes

Course unit materials

Links to course unit teaching materials can be found on the School of Computer Science website for current students.

Return to course details