Unit information: Multivariate Analysis in 2011/12

Please note: you are viewing unit and programme information for a past academic year. Please see the current academic year for up to date information.

Unit name: Multivariate Analysis
Unit code: MATH30510
Credit points: 10
Level of study: H/6
Teaching block(s): Teaching Block 2C (weeks 13 - 18)
Unit director: Dr. Didelez
Open unit status: Not open
Pre-requisites: None
Co-requisites: None
School/department: School of Mathematics
Faculty: Faculty of Science

Description including Unit Aims

Aims

Multivariate analysis is the branch of statistics concerned with data in which the values of several variables are observed on each object. Multivariate techniques are used in medicine; the physical, environmental and biological sciences; economics and social science; and many industrial and commercial applications.

A wide range of methods is used for the analysis of multivariate data, both unstructured and structured, and this course will review some of the more common and useful methods, with emphasis on implementation and interpretation.

Syllabus

  1. General introduction to multivariate data and revision of relevant matrix algebra.
  2. Principal components analysis for dimensional reduction and data visualisation.
  3. Factor analysis for dimensional reduction and interpretation.
  4. Discriminant analysis for classification.
  5. Cluster analysis for unsupervised learning.
  6. Multidimensional scaling for visualisation based on similarity/dissimilarity.
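
As an illustration of item 2, the following is a minimal sketch of principal components analysis for two variables, using the closed-form eigendecomposition of the 2 x 2 sample covariance matrix. It is written in Python purely for illustration (the unit itself uses R). The function name `pca_2d`, the choice of divisor n - 1, and the data are assumptions made for this example and are not taken from the course materials.

```python
import math


def pca_2d(points):
    """Principal components of two-variable data (hypothetical helper).

    Returns the two eigenvalues of the sample covariance matrix (largest
    first), the corresponding unit eigenvectors, and the scores of each
    observation on the first principal component.
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n

    # Sample covariance matrix S = [[sxx, sxy], [sxy, syy]].
    # Assumption: divisor n - 1; some texts use divisor n instead.
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)

    # Eigenvalues of a symmetric 2 x 2 matrix in closed form.
    half_trace = (sxx + syy) / 2
    radius = math.hypot((sxx - syy) / 2, sxy)
    eigenvalues = (half_trace + radius, half_trace - radius)

    # The first principal axis makes angle theta with the x-axis,
    # where tan(2 * theta) = 2 * sxy / (sxx - syy).
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    pc1 = (math.cos(theta), math.sin(theta))
    pc2 = (-math.sin(theta), math.cos(theta))

    # Scores: centred observations projected onto the first component.
    scores = [(x - mean_x) * pc1[0] + (y - mean_y) * pc1[1] for x, y in points]
    return eigenvalues, (pc1, pc2), scores


if __name__ == "__main__":
    # Made-up data, for illustration only.
    data = [(2.5, 2.4), (0.5, 0.7), (2.2, 2.9), (1.9, 2.2), (3.1, 3.0)]
    (lam1, lam2), (pc1, pc2), scores = pca_2d(data)
    print("proportion of variance on PC1:", lam1 / (lam1 + lam2))
    print("first principal axis:", pc1)
```

Keeping only the scores on the first component reduces each observation from two values to one, and the proportion `lam1 / (lam1 + lam2)` indicates how much of the total variance that single value retains.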

Relation to Other Units

As with the units Linear Models, Generalized Linear Models, and Time Series Analysis, this course is concerned with developing statistical methodology for a particular class of problems.

Applications will be implemented and presented using the statistical computing environment R (used in Probability 1 and Statistics 1).

Intended Learning Outcomes

To gain an understanding of:

  • Dimensional reduction and visualisation of high-dimensional datasets;
  • Structured and unstructured learning approaches, including classification and clustering;
  • Approaches based on notions of similarity/dissimilarity;
  • Implementation in the statistical computing environment R.

Transferable Skills:

Self-assessment by working through examples sheets and checking answers against the solutions provided.

Teaching Information

Lectures (covering both theory and illustrative applications), with exercises to be done by students.

Assessment Information

The assessment mark for Multivariate Analysis is calculated from a 1½-hour written examination in May/June consisting of THREE questions. A candidate's TWO best answers will be used for assessment. Calculators of the approved type (non-programmable, no text facility) may be used. Statistical Tables will be provided.

Reading and References

There is no single set text. Any of the following will be useful, particularly the first (from which the notation for the course is taken):

  • K V Mardia, J T Kent and J Bibby, Multivariate Analysis, Academic Press, 1979.
  • W J Krzanowski, Principles of Multivariate Analysis: A User's Perspective, Clarendon Press, 1988.
  • C Chatfield and A J Collins, Introduction to Multivariate Analysis, Chapman and Hall, 1986.
  • W J Krzanowski and F H C Marriott, Multivariate Analysis, Parts I and II, Edward Arnold, 1994.