[an error occurred while processing this directive] [an error occurred while processing this directive]
[an error occurred while processing this directive]
[an error occurred while processing this directive]
Monash University

FIT5197 Modelling for data analysis - Monash Online Teaching Period 6, 2015

This unit explores the statistical modelling foundations that underlie the analytic aspects of Data Science. Motivated by case studies and working through real examples, this unit covers the mathematical and statistical basis with an emphasis on using the techniques in practice. It introduces data collection, sampling and quality. It considers analytic tasks such as statistical hypothesis testing and exploratory versus confirmatory analysis. It presents basic probability distributions, random number generation and simulation as well as estimation methods and effects such as maximum likelihood estimators, Monte Carlo estimators, Bayes theorem, bias versus variance and cross validation. Basic information theory and dependence models such as Bayesian networks and log-linear models are also presented, as well as the role of general modelling such as inference and decision making, predictive models, experts and assessing probabilities.

Mode of Delivery

Monash Online (Online)

Workload Requirements

Minimum total expected workload equals 144 hours per semester comprising:

  1. Contact hours for on-campus students:
    • Two hours/week lectures
    • Two hours/week laboratories
  2. Contact hours for Monash Online students:
    • Two hours/week online group sessions.
    • Online students generally do not attend lecture, tutorial and laboratory sessions, however should plan to spend equivalent time working through resources and participating in discussions.
  3. Additional requirements (all students):
    • A minimum of 8 hours per week of personal study (22 hours per week for Monash online students) for completing lab/tutorial activities, assignments, private study and revision, and for online students, participating in discussions.

See also Unit timetable information

Unit Relationships

Prerequisites

Students need to have the equivalent of first year undergraduate university mathematics as taught in an analytics degree such as Engineering, Finance, Physics and some Computer Science degrees.

Chief Examiner

Campus Lecturer

Clayton

Muhammad Amar

Kevin Korb

Your feedback to Us

Monash is committed to excellence in education and regularly seeks feedback from students, employers and staff. One of the key formal ways students have to provide feedback is through the Student Evaluation of Teaching and Units (SETU) survey. The University’s student evaluation policy requires that every unit is evaluated each year. Students are strongly encouraged to complete the surveys. The feedback is anonymous and provides the Faculty with evidence of aspects that students are satisfied and areas for improvement.

For more information on Monash’s educational strategy, see:

www.monash.edu.au/about/monash-directions/ and on student evaluations, see: www.policy.monash.edu/policy-bank/academic/education/quality/student-evaluation-policy.html

Academic Overview

Learning Outcomes

Upon successful completion of this unit, it is expected that students will be able to:
  1. perform the general roles of exploratory, confirmatory and decision analysis applied to data;
  2. explain how the source of data affects analysis;
  3. summarise the role of domain experts in supporting analysis and the difficulties they may have;
  4. implement a computational model for statistical analysis of simple problems and evaluate the results;
  5. conduct statistical analysis using the concepts of entropy, likelihood, correlation, and independence;
  6. interpret the challenges involved in estimation from data;
  7. describe basic methods of random sampling, simulation, and hypothesis testing;
  8. write basic programs for analysing data.

Unit Schedule

Week Activities Assessment
0   No formal assessment or activities are undertaken in week 0
1 Module one - Introduction to Modelling for Data Science Programming Assignment 1
2 Module two - Data analysis Programming Assignment 2
3 Module three - Dependence, regression and clustering Programming Assignment 3; Quiz
4 Module four - Statistical inference and evaluation Programming Assignment 4
5 Module five - Simulation Programming Assignment 5
6 Module six - Modelling and validation Programming Assignment 6; Quiz; Report
7    
8    
9    
10    
11    
12    
  SWOT VAC No formal assessment is undertaken during SWOT VAC
  Examination period LINK to Assessment Policy: http://policy.monash.edu.au/policy-bank/
academic/education/assessment/
assessment-in-coursework-policy.html

*Unit Schedule details will be maintained and communicated to you via your learning system.

Teaching Approach

Other
This subject is learned online, using Moodle and Alexandria as the primary methods of delivering information and guidance, drawing upon text readings, videos, exercises and activities, as well as online contact with instructors.

Assessment Summary

In-semester assessment: 100%

Assessment Task Value Due Date
Programming Assignment 1 5% 1/11/15
Programming Assignment 2 5% 8/11/15
Programming Assignment 3 5% 15/11/15
Quiz I 25% 15/11/15
Programming Assignment 4 5% 22/11/15
Programming Assignment 5 5% 29/11/15
Programming Assignment 6 5% 6/12/15
Quiz II 25% 6/12/15
Report 20% 6/12/15

Assessment Requirements

Assessment Policy

Assessment Tasks

Participation

  • Assessment task 1
    Title:
    Programming Assignment 1
    Description:
    Students are required to write and submit a program to sample from two binomial distributions, with and without a dependency between them.

    Learning outcomes 1, 2, 5, 8
    Weighting:
    5%
    Criteria for assessment:

    Correctness of result; program commentary/explanation.

    Due date:
    1/11/15
  • Assessment task 2
    Title:
    Programming Assignment 2
    Description:
    Students are required to write and submit a program that visualizes sample data and their relation to probability models.

    Learning outcomes 1, 2, 8
    Weighting:
    5%
    Criteria for assessment:

    Correctness of result; program commentary/explanation.

    Due date:
    8/11/15
  • Assessment task 3
    Title:
    Programming Assignment 3
    Description:
    Students are required to write and submit a programming exercise that produces a multiple regression model for a multivariate data set and computes the coefficient of determination.

    Learning outcomes 1, 4, 5, 8
    Weighting:
    5%
    Criteria for assessment:

    Correctness of result; program commentary/explanation.

    Due date:
    15/11/15
  • Assessment task 4
    Title:
    Quiz I
    Description:
    Students are required to demonstrate their understanding of theoretical, practical and algorithmic methods of data analysis by completing a quiz made up multiple choice and short-answer questions.

    Learning outcomes 1, 4, 5
    Weighting:
    25%
    Criteria for assessment:

    Correctness

    Due date:
    15/11/15
  • Assessment task 5
    Title:
    Programming Assignment 4
    Description:
    Students are required to write and submit a programming exercise that tests a null hypothesis and an alternative on a data set and performs a confidence interval estimation of effect size.

    Learning outcomes 1, 4, 6, 7, 8
    Weighting:
    5%
    Criteria for assessment:

    Correctness of result; program commentary/explanation.

    Due date:
    22/11/15
  • Assessment task 6
    Title:
    Programming Assignment 5
    Description:
     Students are required to write and submit a programming exercise that tests the claim that a particular statistical test has the power needed to decide that an alternative model is true and the null model false.

    Learning outcomes 1, 2, 4, 7, 8
    Weighting:
    5%
    Criteria for assessment:

    Correctness of result; program commentary/explanation.

    Due date:
    29/11/15
  • Assessment task 7
    Title:
    Programming Assignment 6
    Description:
    Students are required to write and submit a programming exercise that selects variables relevant to predicting a target variable in a data set.

    Learning outcomes 1, 3, 4, 5, 7, 8
    Weighting:
    5%
    Criteria for assessment:

    Correctness of result; program commentary/explanation.

    Due date:
    6/12/15
  • Assessment task 8
    Title:
    Quiz II
    Description:
    Students are required to demonstrate their understanding of theoretical, practical and algorithmic methods of data analysis by completing a quiz made up multiple choice and short-answer questions.

    Learning outcomes 1, 2, 4, 6, 7
    Weighting:
    25%
    Criteria for assessment:

    Correctness

    Due date:
    6/12/15
  • Assessment task 9
    Title:
    Report
    Description:
    Students are required to produce a report analysing a given data set and solving a scientific problem about it.

    Learning outcomes 1, 2, 3, 4, 5, 6, 7
    Weighting:
    20%
    Criteria for assessment:

    Students will be assessed according to: the clarity & organization of the report; the appropriateness of tools used; the explanation of analytic choices made; the quality of the interpretation of results; the correctness of conclusions drawn and support provided for those conclusions; the individual contribution made to the overall report for group work.

    Due date:
    6/12/15

Learning resources

Reading list

Ross, S.M. (2014) Introduction to Probability and Statistics for Engineers and Scientists, 5th ed. Academic Press. (Available from Monash Library)

Monash Library Unit Reading List (if applicable to the unit)
http://readinglists.lib.monash.edu/index.html

Feedback to you

Types of feedback you can expect to receive in this unit are:

  • Informal feedback on progress in labs/tutes
  • Graded assignments with comments
  • Graded assignments without comments
  • Quiz results
  • Solutions to tutes, labs and assignments

Extensions and penalties

Returning assignments

Referencing requirements

Students should use APA reference style, as explained at http://intranet.monash.edu.au/infotech/resources/students/style-guide/referencing.html

Assignment submission

It is a University requirement (http://www.policy.monash.edu/policy-bank/academic/education/conduct/student-academic-integrity-managing-plagiarism-collusion-procedures.html) for students to submit an assignment coversheet for each assessment item. Faculty Assignment coversheets can be found at http://www.infotech.monash.edu.au/resources/student/forms/. Please check with your Lecturer on the submission method for your assignment coversheet (e.g. attach a file to the online assignment submission, hand-in a hard copy, or use an electronic submission). Please note that it is your responsibility to retain copies of your assessments.

Online submission

Assessment work should be submitted via Moodle.

Required Resources

Please check with your lecturer before purchasing any Required Resources. Limited copies of prescribed texts are available for you to borrow in the library, and prescribed software is available in student labs.

R and RStudio

Prescribed text(s)

Limited copies of prescribed texts are available for you to borrow in the library.

Ross. (2014). Introduction to Probability and Statistics for Engineers and Scientists, 5th ed. (5th) Academic.

Technological Requirements

Students may use Windows, Linux or Mac environments for this subject. R and/or RStudio must be used for programming assignments. Other tools will be useful or necessary, but the choice of a particular tool (e.g., Weka, MATLAB, or a spreadsheet) is not mandated.

Recommended Resources

Weka. See http://www.cs.waikato.ac.nz/ml/weka/

Other Information

Policies

Monash has educational policies, procedures and guidelines, which are designed to ensure that staff and students are aware of the University’s academic standards, and to provide advice on how they might uphold them. You can find Monash’s Education Policies at: www.policy.monash.edu.au/policy-bank/academic/education/index.html

Faculty resources and policies

Important student resources including Faculty policies are located at http://intranet.monash.edu.au/infotech/resources/students/

Graduate Attributes Policy

Student Charter

Student services

Monash University Library

Disability Liaison Unit

Students who have a disability or medical condition are welcome to contact the Disability Liaison Unit to discuss academic support services. Disability Liaison Officers (DLOs) visit all Victorian campuses on a regular basis.

[an error occurred while processing this directive]