Undergraduate Course: Natural Language Understanding (Level 10) (INFR10035)
|School||School of Informatics
||College||College of Science and Engineering
||Availability||Available to all students
|Credit level (Normal year taken)||SCQF Level 10 (Postgraduate)
|Home subject area||Informatics
||Other subject area||None
||Taught in Gaelic?||No
|Course description||This course explores current research into interpreting natural language. Motivations for this study range from foundational attempts to understand how people interpret communication to entirely practical efforts to engineer systems for performing a variety of language tasks, such as information extraction, question answering, natural language front ends to databases, human-robot interaction and customer relationship management, to name a few.
This course is an introduction to the theory and practice of computational approaches to natural language understanding. The course will cover common parsing methods for sentences, discourse and dialogue, and it will also address lexical processing tasks such as word sense disambiguation and clustering. We will study state-of-the-art symbolic techniques in deep and shallow language processing, as well as statistical models acquired by both unsupervised and supervised machine learning from online linguistic resources. Students will have the opportunity to apply what they have learned in written and practical assignments. These assignments will be designed to help students understand the pervasiveness of language ambiguity at all levels, the problems this poses for automated language understanding, and the relative strengths and weaknesses of the various theories and engineering approaches to these problems.
Information for Visiting Students
|Displayed in Visiting Students Prospectus?||No
Course Delivery Information
|Not being delivered|
Summary of Intended Learning Outcomes
|1 - Given a parsing problem, students should be able to use state-of-the-art symbolic parsing techniques, including lexicalised parsing, to solve the problem and provide a written explanation of the parsing techniques used in the course.
2 - Given a labelled corpus, students should be able to select and use state-of-the-art statistical parsing techniques (generative and discriminative) by training parsers on the labelled corpus using existing software packages.
3 - Given an NLU system, students should be able to choose appropriate evaluation metrics for the system, and use error analysis to propose improvements to the language processing models.
4 - Given an example of a problem in coreference resolution, discourse segmentation, or discourse parsing, students should be able to provide a written description of how current symbolic and statistical techniques help solve the problem.
5 - Given a description of an NLU system, students should be able to relate it to features of human models of language interpretation at various levels of processing (words, sentences, discourse and dialogue).
6 - Given a model and a labelled corpus, students should be able to employ existing ML software packages to train the model on the corpus in order to perform a lexical semantic task.
|Written Examination 70|
Assessed Assignments 30
Oral Presentations 0
Practical exercises, addressing semantic tasks such as word sense disambiguation.
If delivered in semester 1, this course will offer an option for visiting undergraduate students who attend in semester 1 only, with assessment completed before the end of the calendar year.
* Advanced parsing models; e.g., headed PCFGs
* Grammar Induction
* Discriminative Parsing
* Shallow parsing
* Human models of sentential parsing (e.g., incrementality)
* Semantic Construction in wide-coverage online grammars
* Word sense disambiguation
* Clustering, similarity distributions
* Lexical subcategorisation acquisition and semantic role labelling
* Human models of lexical processing (e.g., semantic priming)
* Anaphora resolution
* Discourse segmentation
* Dialogue act recognition
* Discourse parsing (including learning discourse structure)
* Human models of discourse and dialogue (e.g., the alignment model)
* Advanced topics
Relevant QAA Computing Curriculum Sections: Artificial Intelligence, Human-Computer Interaction (HCI), Natural Language Computing
||* Jurafsky and Martin (2007) Speech and Language Processing, Prentice Hall (2nd Edition)
* Bird, Klein and Loper (2007) Natural Language Processing in Python
* Various research papers
Timetabled Laboratories 0
Non-timetabled assessed assignments 35
Private Study/Other 45
|Course organiser||Dr Amos Storkey
Tel: (0131 6)51 1208
|Course secretary||Miss Kate Weston
Tel: (0131 6)50 2692