Note: This is an archived Handbook entry from 2009. Search for this in the current handbook
|Dates & Locations:|| |
This subject has the following teaching availabilities in 2009:Semester 2, - Taught on campus.
Timetable can be viewed here. For information about these dates, click here.
|Time Commitment:||Contact Hours: 36 hours comprising one two-hour lecture per week and one one-hour practical class per week. |
Total Time Commitment: Not available
|Recommended Background Knowledge:||
It is recommended students complete a second year statistics subject (such as 620-202  Statistics or its equivalent) and have had some exposure to computer packages.
|Non Allowed Subjects:||None|
|Core Participation Requirements:||
It is University policy to take all reasonable steps to minimise the impact of disability upon academic study and reasonable steps will be made to enhance a student's participation in the University's programs. Students who feel their disability may impact upon their participation are encouraged to discuss this with the subject coordinator and the Disability Liaison Unit.
CoordinatorDr Owen Dafydd Jones
Data Mining refers to the management and analysis of large data sets.
Data Mining became possible with the advent of large-scale data collection and the computing power necessary to process it. It involves all of the following steps
1. Data Warehousing
2. Data Cleaning
3. Data Description and Visualisation
4. Data Analysis and Interpretation
This course deals only with step 4 of the Data Mining process: data analysis and interpretation. It considers techniques for Rule Finding, Classification, Regression and Clustering. The themes that run through the course are:
1. Model fitting and selection and how to avoid overfitting
2. Scalable algorithms that can be used with very large data sets
3. How to acommodate high-dimensional data
4. Actionability and interpretability of models
|Objectives:||After completing this subject, students should: |
Up to 40 pages of written assignments (20%: two assignments worth 10% each, due mid and late in semester), a three-hour written examination (80%, in the examination period).
|Breadth Options:|| |
This subject is not available as a breadth subject.
|Fees Information:||Subject EFTSL, Level, Discipline & Census Date|
Upon completion of this subject, students should develop:
R05 RM Master of Science - Mathematics and Statistics |
Download PDF version.