FDSC50060 Advanced Statistics

Academic Year 2022/2023

This module is designed for graduates who want to develop their expertise in RStudio. The online lectures and virtual classroom tutorials will give the necessary theoretical foundation and applied practical skills to manage and manipulate data using RStudio. It will also give researchers the knowledge and ability to apply Statistical Inference, Correlation and Multiple Linear Regression through this data software.

Show/hide contentOpenClose All

Curricular information is subject to change

Learning Outcomes:

Data Analysis with RStudio has 5 learning outcomes:

1. Use statistics to reduce complex data situations to manageable formats in order to describe, explain and model them;

2. Derive descriptive statistics for various data types and manipulate data in RStudio;

3. Perform and critique statistical tests (parametric and non-parametric) on two and more sample data;

4. Use multiple regression with RStudio and other advanced statistical techniques to allow prediction of a score on one variable on the basis of the scores on several other variables;

5. Communicate effectively research findings in a clear concise manner using correct terminology based on output from RStudio.

Indicative Module Content:

Introduction to Data Analysis in RStudio

This part of the training provides an introduction to RStudio. The courses addresses LO1, LO2, and L06 by covering:
• A review of statistical terminology;
• Demonstration on how to organise data, create a dataframe, import data, export results, etc.;
• Conclude with compiling descriptive statistics (both numerical and graphical) for various data types.

Data Manipulation with RStudio

An important skill to using RStudio is being comfortable with manipulating your data. Often when it comes to statistical tests, data needs to be in a certain format. This part of the training will focus on common data manipulation techniques encountered in RStudio – i.e., Indexing; Sorting; Ordering a dataframe; Renaming variables; Data types; Grouping a variable; Merging Rows/Columns; Handling missing values; Locating outliers; Filtering. This part of the training addresses LO1 and LO3.
Statistical Inference with RStudio
The final segment to Part I assumes that the delegate will have an understanding of hypothesis testing. The focus of this part of the training is to demonstrate how to perform a proportion of inferential statistics in RStudio, with a focus on how to line up and analyse various data sets properly in both a parametric and non-parametric way. The course focusses on LO3, LO4 and L06 by covering:

• Worked examples of testing for normality/differences between two measurements;
• In the case of non-normal data, the workshop will discuss suitable transformations;
• One-way Analysis of Variance (ANOVA) with suitable posthoc testing;
• Within-Subjects Designs (Repeated Measures);
• Between- and Within-Subjects Designs (Mixed Factorial Experiments);
• Between-Subjects Designs (Two-way ANOVA).

Part II
Correlation and Multiple Linear Regression in RStudio
Correlation analysis is used to determine if there is a statistically significant relationship between two variables. Linear regression analysis is used to make predictions based on the relationship that exists between two variables. However, most things are too complicated to “model” them with just two variables - Multiple Regression is needed. Multiple regression is a statistical technique that allows us to predict a score on one variable based on the scores on several other variables. The course focuses on LO2, LO3, LO5 and L06 by covering:
• Scatterplots and partial regression plots;
• Correlation and regression analysis;
• How detect for multi-collinearity [using Variance Inflation Factor (VIF)] and outliers (using leveraging);
• How to check that the residuals (errors) are approximately normally distributed and random;
• How to interpret regression equations and how to use them to make predictions.

Student Effort Hours: 
Student Effort Type Hours
Lectures

12

Practical

12

Autonomous Student Learning

50

Online Learning

26

Total

100

Approaches to Teaching and Learning:
Approaches to teaching will include:
active/task based learning
lectures
lab/studio work
online learning
problem based learning 
Requirements, Exclusions and Recommendations

Not applicable to this module.


Module Requisites and Incompatibles
Not applicable to this module.
 
Assessment Strategy  
Description Timing Open Book Exam Component Scale Must Pass Component % of Final Grade
Assignment: Short Question Based RStudio analysis Assessment(40%)
Practical/Skills Evaluation Assessment (60%)
Using prescribed dataset to complete challenging tasks and compile a report on their findings
Throughout the Trimester n/a Pass/Fail Grade Scale Yes

100


Carry forward of passed components
No
 
Resit In Terminal Exam
Autumn No
Please see Student Jargon Buster for more information about remediation types and timing. 
Feedback Strategy/Strategies

• Feedback individually to students, on an activity or draft prior to summative assessment
• Feedback individually to students, post-assessment

How will my Feedback be Delivered?

Not yet recorded.

Name Role
Mr SEAN LACEY Tutor