Postgraduate Course: Topics in Distributed Databases (INFR11025)
|School||School of Informatics
||College||College of Science and Engineering
|Credit level (Normal year taken)||SCQF Level 11 (Postgraduate)
||Availability||Available to all students
|Summary||***PLEASE NOTE: This course is no longer being offered, you may wish to look at Advanced Topics in Foundations of Databases INFR11122 instead***
This course covers not only the basic technology required for distributed databases, but also some of the emerging technology of database integration, data cleaning, schema matching/mapping and peer-to-peer technology for highly distributed databases.
Topics to be covered:
* Parallel and Distributed Databases
* Distributed Query Optimisation and Evaluation
* Integrating data from distributed sources
* Schema matching and mapping
* Cleaning integrated data
* Propagation analysis of data quality rules via views.
Relevant QAA Computing Curriculum Sections: Computer Networks, Databases, Distributed Computer Systems, Information Systems, Web-based Computing
Entry Requirements (not applicable to Visiting Students)
||Other requirements|| This course assumes knowledge of database systems comparable to that covered in Database Systems. Students who have not taken Database Systems must obtain permission to take the course from the course organiser.
Information for Visiting Students
|High Demand Course?
Course Delivery Information
|Not being delivered|
| 1 - Describe emerging issues in distributed databases: data integration, schema matching, schema mapping, data cleaning, distributed query evaluation and optimisation.
2 - Describe the problems faced in data integration, as well as model solutions to these problems.
3 - Translate data between example XML schemas without loss of information.
4 - Describe the need for data cleaning in data integration, and approaches to improving the quality of integrated data.
5 - Detect inconsistencies in data using integrity constraints.
6 - Repair dirty databases based on integrity constraints.
7 - Propagate data quality rules via data transformation/integration
8 - Demonstrate the issues involved in data integration for distributed query processing.
9 - Describe the issues involved in distributed query optimisation regarding cost modeling and algorithms for query evaluation.
|* An introduction to Database Management Systems by Raghu Ramakrishnan (Chapters 16-18, 22).|
* Research papers
|Course organiser||Prof Wenfei Fan
Tel: (0131 6)51 3818
|Course secretary||Ms Katey Lee
Tel: (0131 6)50 2701