Postgraduate Course: Data Mining and Exploration (INFR11007)
|School||School of Informatics
|College||College of Science and Engineering
|Credit level (Normal year taken)||SCQF Level 11 (Postgraduate)
|Availability||Available to all students
|Summary||The aim of this course is to discuss modern techniques for analyzing, interpreting, visualizing and exploiting the data that is captured in scientific and commercial environments. The course will develop the ideas taught in various machine learning courses and discuss the issues that arise in applying them to real-world datasets. It will also cover other techniques and data-visualization methods. The course will feature case-study presentations, and each student will undertake a mini-project on a real-world dataset.
The course will consist of two parts. The first part will be a series of lectures on the topics outlined below. It is anticipated that there will also be one or two guest lectures from data mining practitioners.
The second part will consist of student presentations of papers relating to relevant topics. Students will also carry out a practical mini-project on a real-world dataset. For both paper presentations and mini-projects, lists of suggestions will be available, but students may also propose their own, subject to approval from the instructor.
* Introduction, overview
* Data preprocessing and cleaning, dealing with missing data
* Data visualization, exploratory data analysis
* Data mining techniques
* Predictive modelling techniques (e.g. SVMs)
* Performance evaluation (e.g. ROC curves)
* Issues relating to large data sets
* Application areas, e.g. text mining, collaborative filtering, retrieval-by-content, web mining, bioinformatics data, astronomy data
Relevant QAA Computing Curriculum Sections: Artificial Intelligence
Course Delivery Information
|Academic year 2020/21, Available to all students (SV1)
|Learning and Teaching activities
Lecture Hours 20,
Supervised Practical/Workshop/Studio Hours 4,
Programme Level Learning and Teaching Hours 2,
Directed Learning and Independent Learning Hours
|Assessment
|Additional Information (Assessment)
1) time-limited class test
2) engagement with the course material
3) presentation of a research paper
4) mini-project on one dataset chosen from a list of datasets selected by the instructor
|No Exam Information
On completion of this course, the student will be able to:
- Describe the data mining/analysis process in overview, and assess the challenges of a given data mining project.
- Describe methods used for exploratory data analysis, predictive modelling and performance evaluation.
- Critically evaluate the papers presented in the second part of the course.
- In the mini-project, demonstrate the ability to conduct experimental investigations and draw valid conclusions from them.
- Demonstrate use of data mining packages/computational environments in the mini-project phase.
|Provided on course homepage.|
|Course organiser||Dr Michael Urs Gutmann
Tel: (0131 6)50 5190
|Course secretary||Ms Lindsay Seal
Tel: (0131 6)50 2701