Postgraduate Course: Data Integration and Exchange (Level 11) (INFR11058)
Course Outline
School | School of Informatics |
College | College of Science and Engineering |
Credit level (Normal year taken) | SCQF Level 11 (Year 4 Undergraduate) |
Availability | Available to all students |
SCQF Credits | 10 |
ECTS Credits | 5 |
Summary | With the proliferation of data on the web, data in different formats is scattered across many sources. Data sources may only provide partial access to data and the structure of the data may differ drastically from the assumptions used in conventional queries.
Querying such data requires data integration and exchange between different sources. The notion of querying and the meaning of an answer to a query are different from their classical counterparts: since only a partial view of the data is available, answers to queries are at best approximate. Furthermore, integrated data may not be consistent with its specification, and hence techniques for querying inconsistent data become more important.
This course presents the basics of data integration and exchange, explains techniques for querying incomplete and inconsistent information, for specifying correspondences between database schemas and schema mappings, and for querying databases using views. Both relational and XML data formats are considered. |
Course description |
Topics include:
* review of relational and XML data models
* examples of data integration and exchange scenarios
* dealing with incomplete information in databases
* local- and global-as-view scenarios of data integration
* query rewriting
* answering queries using views
* relational data exchange
* XML data exchange
* schema mappings; composing and inverting schema mappings
* querying inconsistent databases
* aggregation over multiple sources
Relevant QAA Computing Curriculum Sections: Not yet available
Reading List
|
Entry Requirements (not applicable to Visiting Students)
Pre-requisites |
It is RECOMMENDED that students have passed
Database Systems (INFR09011) OR
Database Systems (INFR10055)
|
Co-requisites | |
Prohibited Combinations | |
Other requirements | This course is open to all Informatics students including those on joint degrees. For external students where this course is not listed in your DPT, please seek special permission from the course organiser.
Maths background: discrete maths, basics of logic (propositional and predicate calculus). Programming: SQL |
Information for Visiting Students
Pre-requisites | None |
Course Delivery Information
|
Academic year 2014/15, Available to all students (SV1)
|
Quota: None |
Course Start |
Semester 2 |
Timetable |
Timetable |
Learning and Teaching activities (Further Info) |
Total Hours:
100
(
Lecture Hours 20,
Summative Assessment Hours 2,
Programme Level Learning and Teaching Hours 2,
Directed Learning and Independent Learning Hours
76 )
|
Assessment (Further Info) |
Written Exam
70 %,
Coursework
30 %,
Practical Exam
0 %
|
Additional Information (Assessment) |
Three assignments, worth 10% each.
You should expect to spend approximately 40 hours on the coursework for this course.
If delivered in semester 1, this course will have an option for semester 1 only visiting undergraduate students, providing assessment prior to the end of the calendar year. |
Feedback |
Not entered |
No Exam Information |
Learning Outcomes
This course introduces students to new techniques for analysing, integrating and querying data that may come from a variety of sources. The learning objectives are:
1 - Given a description of a situation requiring data integration students will be able to classify it as belonging to one of a number of standard scenarios.
2 - Given a description of data integration and exchange scenario and a commercial database management solution to the problem students will be able to predict erroneous or counterintuitive answers that will arise from the solution. Students will be able to compare & evaluate different solutions & justify the choice of one solution over another.
3 - Given a description of a data integration & exchange scenario where access to data is restricted, students will be able to design a scheme to provide approximate results to queries taking account of restricted access. Students will be able to generate a range of design options & will know techniques and their limitations for comparison between options.
4 - Given a database query & a data integration & exchange scenario, students will know how to rewrite the query to take account of the scenario, providing approximate solutions where these are necessary. Students will be able to develop rewritings of the query & will be able to reason about basic properties of such rewritings.
5 - Given two database schemas students will be able to specify mappings between schemas & how to compose such mappings.
6 - Given a collection of databases, their schemas & mappings between schemas students will be able to describe how data is exchanged based on the schema mappings.
7 - Given a collection of databases, their schemas & schema mappings, students will be able to identify instances of inconsistent data.
8 - Given a collection of databases with inconsistent data, their schemas and schema mappings, students will be able describe the results of queries on inconsistent databases.
|
Contacts
Course organiser | Prof Leonid Libkin
Tel: (0131 6)51 3816
Email: libkin@ed.ac.uk |
Course secretary | Miss Claire Edminson
Tel: (0131 6)51 4164
Email: C.Edminson@ed.ac.uk |
|
© Copyright 2014 The University of Edinburgh - 12 January 2015 4:11 am
|