THE UNIVERSITY of EDINBURGH

DEGREE REGULATIONS & PROGRAMMES OF STUDY 2016/2017

University Homepage
DRPS Homepage
DRPS Search
DRPS Contact
DRPS : Course Catalogue : School of Informatics : Informatics

Postgraduate Course: Machine Translation (Level 11) (INFR11062)

Course Outline
SchoolSchool of Informatics CollegeCollege of Science and Engineering
Credit level (Normal year taken)SCQF Level 11 (Year 4 Undergraduate) AvailabilityAvailable to all students
SCQF Credits10 ECTS Credits5
Summary***PLEASE NOTE - this course has been replaced by Machine Translation INFR11133 (20 credit course) from 2016/17.***

Machine Translation deals with computers translating human languages (for example, from Arabic to English). The field is now sufficiently mature that Google use it to allow millions of people to translate Web Documents each day. This course deals with all aspects of designing, building and evaluating a range of state-of-the-art translation systems. The systems covered are largely statistical and include: word-based, phrase-based, syntax-based and discriminative models. As well as exploring these systems, the course will cover practical aspects such as using very large training sets, evaluation and the open problem of whether linguistics can be useful for translation.
Course description History of MT
* rule-based systems, ALPAC report, IBM models, phrase-based systems

Models
* word-based
* phrase-based
* syntax-based
* discriminative
* Factored Models

Reordering
* Lexicalised reordering
* Distortion
* Changing the source

Language Modelling
* Ngram models
* Scaling LMs (cluster-based LMs, Bloom Filter LMs)

Decoding
* Knight on complexity, problem statement
* Stack decoding

Evaluation
* Human evaluation
* Automatic methods
* NIST competitions

Adding linguistics
* Reranking
* As factors

Parallel corpora etc (data)
*What they are, where they come from
* Comparable corpora
* Multi-parallel corpora

Relevant QAA Computing Curriculum Sections: Artificial Intelligence, Human-Computer Interaction (HCI), Natural Language Computing
Entry Requirements (not applicable to Visiting Students)
Pre-requisites It is RECOMMENDED that students have passed Advanced Natural Language Processing (INFR11059) OR Accelerated Natural Language Processing (INFR11125) OR Foundations of Natural Language Processing (INFR09028)
Co-requisites
Prohibited Combinations Other requirements This course is open to all Informatics students including those on joint degrees. For external students where this course is not listed in your DPT, please seek special permission from the course organiser.

Advanced Natural Language Processing or equivalent.


Students are expected to understand the following topics, or be prepared to learn them independently.

- Discrete mathematics: analysis of algorithms, dynamic programming, basic graph algorithms, finite and pushdown automata.

- Other essential maths: basic probability theory; basic calculus and linear algebra; ability to read and manipulate mathematical notation including sums, products, log, and exp.

- Programming: ability to read and modify python programs; ability to design and implement a function based on high-level description such as pseudocode or a precise mathematical statement of what the function computes.

- Linguistics: willingness to learn basic elements of linguistic description; no formal linguistics background required.
Information for Visiting Students
Pre-requisitesNone
High Demand Course? Yes
Course Delivery Information
Not being delivered
Learning Outcomes
On completion of this course, the student will be able to:
  1. Provide a written description of the main algorithms used in the system.
  2. Design and justify an approach to the evaluation of the system using state of the art tools and metrics
  3. Analyse the data collected by such an evaluation.
  4. Where the system is designed to deal with large volumes of data the student should also be able to describe how the system handles large data volumes and critically compare the system=s solution with other common solutions to the problem.
  5. Identify where linguistics knowledge is relevant in the design of the system and what influence of linguistic knowledge has on the translation quality and performance of the system.
Reading List
* Statistical Machine Translation, P. Koehn, Cambridge University Press, 2010
* Primary literature
Additional Information
Course URL http://course.inf.ed.ac.uk/mt
Graduate Attributes and Skills Not entered
KeywordsNot entered
Contacts
Course organiserDr Adam Lopez
Tel: (0131 6)50 4430
Email: alopez@inf.ed.ac.uk
Course secretaryMiss Claire Edminson
Tel: (0131 6)51 4164
Email: C.Edminson@ed.ac.uk
Navigation
Help & Information
Home
Introduction
Glossary
Search DPTs and Courses
Regulations
Regulations
Degree Programmes
Introduction
Browse DPTs
Courses
Introduction
Humanities and Social Science
Science and Engineering
Medicine and Veterinary Medicine
Other Information
Combined Course Timetable
Prospectuses
Important Information
 
© Copyright 2016 The University of Edinburgh - 3 February 2017 4:26 am