THE UNIVERSITY of EDINBURGH

DEGREE REGULATIONS & PROGRAMMES OF STUDY 2013/2014
Archive for reference only
THIS PAGE IS OUT OF DATE

University Homepage

DRPS : Course Catalogue : School of Informatics : Informatics

Postgraduate Course: Reinforcement Learning (INFR11010)

Course Outline
School	School of Informatics	College	College of Science and Engineering
Course type	Standard	Availability	Available to all students
Credit level (Normal year taken)	SCQF Level 11 (Postgraduate)	Credits	10
Home subject area	Informatics	Other subject area	None
Course website	http://course.inf.ed.ac.uk/rl	Taught in Gaelic?	No
Course description	This module covers a range of adaptive learning systems, in particular reinforcement learning and unsupervised methods, particularly as used in RL systems. By the end of the module the student should have a grasp of modern learning techniques and the issues involved in dealing with real-world data. The main techniques covered in the course are basic reinforcement learning, dynamic programming, Monte Carlo methods, Q-learning, function approximation, unsupervised and constructive methods, radial basis and other local functions, classifier systems as compared to RL systems.

Entry Requirements (not applicable to Visiting Students)
Pre-requisites		Co-requisites	It is RECOMMENDED that students also take Natural Computing (INFR09038)
Prohibited Combinations		Other requirements	This course is open to all Informatics students including those on joint degrees. For external students where this course is not listed in your DPT, please seek special permission from the course organiser. Mathematical background, at the level of undergraduate informatics, particularly linear algebra, multivariate calculus and statistics. Some programming (e.g. in Matlab) will be required.
Additional Costs	None

Information for Visiting Students
Pre-requisites	None
Displayed in Visiting Students Prospectus?	Yes

Course Delivery Information

Delivery period: 2013/14 Semester 2, Available to all students (SV1)						Learn enabled: No		Quota: None
Web Timetable	Web Timetable
Course Start Date	13/01/2014
Breakdown of Learning and Teaching activities (Further Info)	Total Hours: 100 ( Lecture Hours 20, Summative Assessment Hours 2, Programme Level Learning and Teaching Hours 2, Directed Learning and Independent Learning Hours 76 )
Additional Notes
Breakdown of Assessment Methods (Further Info)	Written Exam 80 %, Coursework 20 %, Practical Exam 0 %
Exam Information
Exam Diet	Paper Name		Hours & Minutes
Main Exam Diet S2 (April/May)			2:00

Summary of Intended Learning Outcomes
1 - Knowledge of basic and advanced reinforcement learning techniques. 2 - Insight into the problems involved in applying these techniques to deal with real world data, and how to overcome those problems. 3 - Appreciation and identification of suitable learning tasks to which these learning techniques can be applied 4 - Ability to evaluate how effective a particular learning procedure has been -- internal indicators of learning success vs. actual behaviour of the learner. 5 - Use and writing of Matlab programs, ability to set up and run computational experiments to produce statistically sound results 6 - Formulation of problems, evaluation of results from the student's own experiments and those presented in some cases in the research literature.

Assessment Information
Written Examination 80 Assessed Assignments 20 Oral Presentations 0 Assessment Two assignments are set, each accounting for 10% of the overall mark. Typically they require programming a learning system or experimenting with an existing system, using MATLAB. If delivered in semester 1, this course will have an option for semester 1 only visiting undergraduate students, providing assessment prior to the end of the calendar year.

Special Arrangements
None

Additional Information
Academic description	Not entered
Syllabus	The main topics to be covered are some or all of the following (there are some changes from year to year) * Reinforcement learning framework * Bandit problems and action selection * Dynamic programming methods * Monte-Carlo methods * Temporal difference methods * Q-learning and eligibility traces * Environment modelling * Function approximation for generalisation * Actor-critic, applications * Planning in the RL context * Unsupervised, self-organising networks and RL * Constructive methods - nets that grow * Evaluating performance Relevant QAA Computing Curriculum Sections: Artificial Intelligence, Data Structures and Algorithms, Intelligent Information Systems Technologies, Simulation and Modelling
Transferable skills	Not entered
Reading list	# Reinforcement Learning. An Introduction. Richard S. Sutton and Andrew G. Barto. MIT Press, Cambridge MA, 1998. # Other material as handouts or on web page
Study Abroad	Not entered
Study Pattern	Lectures 20 Tutorials 0 Timetabled Laboratories 0 Non-timetabled assessed assignments 25 Private Study/Other 55 Total 100
Keywords	Not entered

Contacts
Course organiser	Dr Iain Murray Tel: (0131 6)51 9078 Email: I.Murray@ed.ac.uk	Course secretary	Ms Katey Lee Tel: (0131 6)50 2701 Email: Katey.Lee@ed.ac.uk

Navigation

Help & Information

Search DPTs and Courses

Regulations

Degree Programmes

Courses

Humanities and Social Science

Science and Engineering

Medicine and Veterinary Medicine

Other Information

Combined Course Timetable

Important Information

© Copyright 2013 The University of Edinburgh - 13 January 2014 4:27 am