1Using and Managing Data and InformationApril cohort (2019 –2020)BA3020QA -Assignment 3

Task 1

A supermarket manager wanted to investigate the profile of shoppers resistant to the use of self-checkout counters available at the store he manages and to find out the reasons behind their resistance. He asks three of his staff to conduct a survey among customers using the store in Sheffield Hallam area during a quiet working day in October of 2019 as they exited the store. The short questionnaire used for the survey as well as the data collected can be found on an Excel file named “SCC”. There are two worksheet, one for the questionnaire used and the other worksheet for the raw data collected from the customers. The data worksheet has 18 columns. Column A “Customer” is simply a column to identify the customer surveyed –Customer 1, Customer 2 etc… Each of the other 17 columns refer to aquestion –Column B refers to the gender question, Column C refers to the age question etc… The manager has employed you as data analyst to conduct this investigation. Your analysis should use the statistical facilities on Excel as required below.

Required:

1.Label the codes for the categories of the following variables: • Gender• Age • Education • Are you a regular user of self-checkout user Note to students: There are FOUR tasks and you are expected to complete all of them. You are required to transfer the work of all tasks on to a single word-processed MS Word file and upload it on Turnitin, which is the submission of this assignment. The deadline for this assignment is (Monday 14thDecember 2020 before 3pm).

22.Produce frequency and percentage frequency tables for each of the following variables: • Gender • Education • Age • SCC user • Spending (£) For this last quantitative variable, the categories should be as follows:<100100-109110-119120-129130-140>140

3.Draw the graphical representation for each of the following variables: • Education (Pie chart) • Age (Bar chart) • Spending (Histogram) using the following categories:<100100-109110-119120-129130-140>140• Spending (Box and whisker) graph

4.Consider now the variable “Spending (£)”. • Calculate the minimum, maximum, median, quartiles and the mean average statistics. • Calculate the mean and standard deviation” split by users (and non-users) of SCC.

5.Produce a cross-table between each of the following two variables and draw an appropriate multiple bar chart: • Are you a regular user of self-checkout user and Gender • Are you a regular user of self-checkout user and Age

6.Draw a scatter diagram and calculate the correlation coefficient of Spending against the reliability total score.𝑅𝑒𝑙𝑖𝑎𝑏𝑖𝑙𝑖𝑡𝑦𝑡𝑜𝑡𝑎𝑙𝑠𝑐𝑜𝑟𝑒= 𝑅𝑒𝑙1 + 𝑅𝑒𝑙2 + 𝑅𝑒𝑙3 + 𝑅𝑒𝑙4

3Note: The tasksand all their requirements should be clearly separated (preferably each task starts on a separate page). All variables should be clearly named and their correct categories clearly labelled. Graphs should be appropriate and correctly labelled and each of them on the same page. Any calculations should be clearly presented and explained