THE UNIVERSITY of EDINBURGH

DEGREE REGULATIONS & PROGRAMMES OF STUDY 2018/2019

Postgraduate Course: Reinforcement Learning (INFR11010)

Course Outline
School	School of Informatics	College	College of Science and Engineering
Credit level (Normal year taken)	SCQF Level 11 (Postgraduate)	Availability	Available to all students
SCQF Credits	10	ECTS Credits	5
Summary	Reinforcement learning (RL) refers to a collection of machine learning techniques which solve sequential decision making problems using a process of trial-and-error. It is a core area of research in artificial intelligence and machine learning, and today provides one of the most powerful approaches to solving decision problems. This course covers foundational models and algorithms used in RL, as well as advanced topics such as scalable function approximation using neural network representations and concurrent interactive learning of multiple RL agents.
Course description	Main topics to be covered include the following (see course website for more details):
* Reinforcement learning framework
* Bandit problems and action selection
* Dynamic programming
* Monte Carlo methods
* Temporal difference learning
* Planning in RL
* Temporal abstraction
* Function approximation for generalisation
* Actor-critic and gradient-based optimisation
* Multi-agent reinforcement learning
* Environments with partial observability
* Training agents and evaluating performance
Relevant QAA Computing Curriculum Sections: Artificial Intelligence, Data Structures and Algorithms, Intelligent Information Systems Technologies, Simulation and Modelling

Entry Requirements (not applicable to Visiting Students)
Pre-requisites		Co-requisites
Prohibited Combinations		Other requirements	This course is open to all Informatics students, including those on joint degrees. External students for whom this course is not listed in their DPT should seek special permission from the course organiser (lecturer). Students need a mathematical background at the level of undergraduate informatics, particularly in linear algebra, multivariate calculus, probability theory and statistics. The coursework involves substantial programming work.

Information for Visiting Students
Pre-requisites	None
High Demand Course?	Yes

Course Delivery Information

Academic year 2018/19, Available to all students (SV1)		Quota: None
Course Start	Semester 2
Learning and Teaching activities	Total Hours: 100 (Lecture Hours 20, Seminar/Tutorial Hours 8, Summative Assessment Hours 2, Programme Level Learning and Teaching Hours 2, Directed Learning and Independent Learning Hours 68)
Assessment	Written Exam 75%, Coursework 25%, Practical Exam 0%
Additional Information (Assessment)	One assignment worth 25%, one exam worth 75%. The assignment will consist of a large programming exercise in which several of the discussed RL algorithms will be implemented and evaluated. The exam will test factual knowledge and understanding of modelling/algorithmic concepts.
Feedback	Not entered
Exam Information
Exam Diet	Paper Name		Hours & Minutes
Main Exam Diet S2 (April/May)			2:00

Learning Outcomes
On completion of this course, the student will be able to:
* Demonstrate knowledge of basic and advanced reinforcement learning techniques.
* Identify suitable learning tasks to which these techniques can be applied.
* Appreciate some of the current limitations of reinforcement learning techniques.
* Formulate decision problems, set up and run computational experiments, and evaluate the results of those experiments.

Reading List
Reinforcement Learning: An Introduction (second edition). R. Sutton and A. Barto. MIT Press, 2018.
Algorithms for Reinforcement Learning. C. Szepesvari. Morgan and Claypool Publishers, 2010.
Reinforcement Learning: State-of-the-Art. M. Wiering and M. van Otterlo. Springer, 2012.

Additional Information
Course URL	http://course.inf.ed.ac.uk/rl
Graduate Attributes and Skills	Not entered
Keywords	Artificial Intelligence, Machine Learning, Reinforcement Learning

Contacts
Course organiser	Dr Stefano Albrecht Tel: (0131 6)51 3218 Email: s.albrecht@ed.ac.uk
Course secretary	Mrs Sam Stewart Tel: (0131 6)51 3266 Email: Sam.Stewart@ed.ac.uk
