Postgraduate Course: Topics in Distributed Databases (INFR11025)
|School||School of Informatics
||College||College of Science and Engineering
||Availability||Available to all students
|Credit level (Normal year taken)||SCQF Level 11 (Postgraduate)
|Home subject area||Informatics
||Other subject area||None
||Taught in Gaelic?||No
|Course description||This course covers not only the basic technology required for distributed databases, but also some of the emerging technology of database integration, data cleaning, schema matching/mapping and peer-to-peer technology for highly distributed databases.
Entry Requirements (not applicable to Visiting Students)
|| Students MUST have passed:
Database Systems (INFR09011) OR
Informatics Research Review (INFR11034)
||Other requirements|| For Informatics PG students and final year MInf students only, or by special permission of the School.
|Additional Costs|| None
Information for Visiting Students
|Displayed in Visiting Students Prospectus?||Yes
Course Delivery Information
|Delivery period: 2011/12 Semester 2, Available to all students (SV1)
||WebCT enabled: No
|Central||Lecture||1-11|| 09:00 - 10:50|
||Week 1, Thursday, 09:00 - 10:50, Zone: Central. Robson LT |
|No Exam Information
Summary of Intended Learning Outcomes
|1 - Describe emerging issues in distributed databases: data integration, schema matching, schema mapping, data cleaning, distributed query evaluation and optimisation.
2 - Describe the problems faced in data integration, as well as model solutions to these problems.
3 - Translate data between example XML schemas without loss of information.
4 - Describe the need for data cleaning in data integration, and approaches to improving the quality of integrated data.
5 - Detect inconsistencies in data using integrity constraints.
6 - Repair dirty databases based on integrity constraints.
7 - Propagate data quality rules via data transformation/integration
8 - Demonstrate the issues involved in data integration for distributed query processing.
9 - Describe the issues involved in distributed query optimisation regarding cost modeling and algorithms for query evaluation.
|Written Examination 0|
Assessed Assignments 100
Oral Presentations 0
This is a research seminar module. Each student is required to read research papers, complete a practical project, and write and present a final report for the project.
The project (70%) deals in more depth with a topic covered in the class. It should consist of algorithm design, prototype implementation, and experimental study for developing a practical tool. Example projects include: SQL techniques for detection of dara inconcistencies based on integrity constraints, a tool for schema matching/mapping, or a tool for repairing dirty databases.
The presentation (30%) should report and demonstrate the tool developed in the project.
If delivered in semester 1, this course will have an option for semester 1 only visiting undergraduate students, providing assessment prior to the end of the calendar year.
||Topics to be covered:
* Parallel and Distributed Databases
* Distributed Query Optimisation and Evaluation
* Integrating data from distributed sources
* Schema matching and mapping
* Cleaning integrated data
* Propagation analysis of data quality rules via views.
Relevant QAA Computing Curriculum Sections: Computer Networks, Databases, Distributed Computer Systems, Information Systems, Web-based Computing
||* An introduction to Database Management Systems by Raghu Ramakrishnan (Chapters 16-18, 22).
* Research papers
|Course organiser||Dr Michael Rovatsos
Tel: (0131 6)51 3263
|Course secretary||Miss Kate Weston
Tel: (0131 6)50 2701
© Copyright 2011 The University of Edinburgh - 16 January 2012 6:17 am