Computational Statistics and Data Mining
Subject MAST90083 (2016)
Note: This is an archived Handbook entry from 2016.
Credit Points: | 12.5 |
---|---|
Level: | 9 (Graduate/Postgraduate) |
Dates & Locations: | This subject is not offered in 2016. |
Time Commitment: | Contact Hours: Contact Hours: 36 hours comprising 2 one-hour lectures per week and 1 one-hour practice class per week. Total Time Commitment: Estimated Total Time Commitment - 170 hours |
Prerequisites: | One of Subject Study Period Commencement: Credit Points: |
Corequisites: | None |
Recommended Background Knowledge: | Subject Study Period Commencement: Credit Points: |
Non Allowed Subjects: | None |
Core Participation Requirements: |
For the purposes of considering request for Reasonable Adjustments under the Disability Standards for Education (Cwth 2005), and Student Support and Engagement Policy, academic requirements for this subject are articulated in the Subject Overview, Learning Outcomes, Assessment and Generic Skills sections of this entry. It is University policy to take all reasonable steps to minimise the impact of disability upon academic study, and reasonable adjustments will be made to enhance a student's participation in the University's programs. Students who feel their disability may impact on meeting the requirements of this subject are encouraged to discuss this matter with a Faculty Student Adviser and Student Equity and Disability Support: http://services.unimelb.edu.au/disability |
Contact
Email: qguoqi@unimelb.edu.au
Subject Overview: |
Computing techniques and data mining methods are indispensible in modern statistical research and applications, where “Big Data” problems are often involved. This subject will introduce a number of recently developed statistical data mining methods that are scalable to large datasets and high-performance computing. These include regularized regression such as the Lasso; tree based methods such as bagging, boosting and random forests; and support vector machines. Important statistical computing algorithms and techniques used in data mining will be explained in detail. These include the bootstrap, cross-validation, the EM algorithm, and Markov chain Monte Carlo methods including the Gibbs sampler and Metropolis-Hastings algorithm. |
---|---|
Learning Outcomes: |
After completing this subject students should gain:
|
Assessment: |
|
Prescribed Texts: | None |
Breadth Options: | This subject is not available as a breadth subject. |
Fees Information: | Subject EFTSL, Level, Discipline & Census Date |
Generic Skills: |
In addition to learning specific skills that will assist students in their future careers in science, they will have the opportunity to develop generic skills that will assist them in any future career path. These include:
|
Related Course(s): |
Doctor of Philosophy - Engineering Graduate Diploma in Biostatistics Master of Biostatistics Master of Philosophy - Engineering Master of Science (Mathematics and Statistics) |
Related Majors/Minors/Specialisations: |
Mathematics and Statistics |
Download PDF version.